5,675 Matching Annotations
  1. Last 7 days
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors first use a Bio-ID approach to search for interactors of the basket proteins TPR and NUP153, identifying proteins involved in various nuclear process, including many splicing components, and confirm some of these interactions using IP and PLA assays. PLA experiments further suggest that these interactions occur primary at or close to the nuclear periphery. Moreover, inhibiting splicing, but not transcription, reduced these interactions. The authors then investigated the role of NUP153 in loading of the splicing machinery and found a lower association of the NUP98/SF3A1 but not AQR interaction (measured through PLA). Furthermore, DamID experiments identified NUP153 bound genes proximal to LAD domains that are actively transcribed, contain overall longer introns with low GC-content, and fall within a group of genes located at the outermost shell of the nucleus (when compared to previously published LaminI ID /PGseq data). Lastly, they interrogate whether depletion of NUP153 results in a splicing defect for NUP153 bound genes.

      The authors identify many proteins in their BioID interaction screen, however, only a single nucleoporin (Nup35, an inner ring protein). Previous BioID studies have identified NUP153 in BioID experiments including proteins of the Y-complex (PMID: 24927568 and others).

      The BioID list summarises interactions of proteins present across five datasets and among two cell lines, HEK293T and Jurkat T cells. As the reviewer pointed out, these stringent criteria excluded proteins originally present in the BioID datasets. Indeed, the original datasets across the cell lines include a wide range of nucleoporins Nup37, Nup43, Nup53, Nup85, Nup107, Nup188, and Nup205. Apart from this, several other proteins were consistently found on the BioID of the basket nucleopore that had been previously found in the literature, namely transcription-related and export-related proteins, as the reviewer can depict on Fig 1 C.

      To ensure that the BioID experiments indeed probe for interactions of NUP152 and TPR at the NPC, the authors should include control experiments that show that their NUP153 and TPR-BirA fusions primarily localize to the NPC. If a significant fraction is not NPC bound, this has to be taken into account when interpreting/discussing their data.

      Before conducting the DamID experiment, we validated the peripheral localization of the constructs used here. We provide here the images showing the distribution of the two nucleoporins

      Figure R1: Immunofluorescent images obtained for Nup153 and TPR in Jurkat cells prior to BioID experiment.

      The authors describe DDX39b/UAP56 as an early splicing factor; DDX39b/UAP56 main role however seems to be in mRNA export and mRNP compaction. The authors might want to include this in the interpretation of their data.

      The reviewer is right; the role of DDX39b/UAP56 goes beyond pre-mRNA splicing. This is indeed involved in posttranscriptional maturation and export from the nucleus to the cytoplasm of cellular RNAs. Originally identified as a helicase involved in pre-mRNA splicing, UAP56 has been shown to facilitate the formation of the A complex during spliceosome assembly. This is the reason why we included it in our study. Additionally, DDX39b/UAP56 has been found to be critical for interactions between components of the exon junction and transcription and export complexes to promote the loading of export receptors, while more recently has also been identified as a DNA:RNA helicase involved in the resolution of R-loops (PMID:32439635). At present, we indeed cannot distinguish between multiple functions of this protein at the NPC, and this is now acknowledged in our manuscript.

      • Concerning PLA experiment controls, the authors perform TPR-PML as a negative control, however, no negative control for NUP153 is shown (Figure 2). Such a control should be added to allow evaluating the specificity of NUP153 PLA interactions, and/or discussed why this was not done. We would like to clarify better how we selected the antibodies for the control PLA experiments. In PLA, antibodies from different species have to be used. Nup153 is an anti-mouse antibody, and the control we used, PML, is also an anti-mouse; therefore, the two antibodies cannot be used in the PLA experiment. The controls used (mouse) were conjugated with rabbit antibodies, either TPR (PML) or AQR (B23). Both TPR:PML and AQR:B23 showed insignificant PLA signals. Therefore, we can confidently conclude that the PLA spots seen for the NPC/splicing proteins are of measurable quantities. Moreover, the conclusion that there’s a pool of splicing machinery associated with the NPC is sustained on several pieces of evidence accumulated through other experiments, not only PLA. We have supporting evidence from super-resolution microscopy, coIPs as well as the referred PLA.

      Figure R2: Control PLA assay with anti-AQR (rabbit) and Nucleophosmin (B23)(mouse) antibodies.

      • Quantification of the distance of PLA-NUP153/TPR interactions show interactions mainly close to the nuclear periphery. The imaging data shown in Figure 2b indeed shows that TPR/NUP153 interactions are exclusively at the nuclear periphery, whereas NUP153/splicing factor interactions are sometimes at the edge of the DAPI signal, but mostly somehow internalized (Figure 2B, S2b). Quantification (Figure 2d) shows these distributions to be very similar, likely due to the way the quantification was performed / the bin size of plotting the relative distance of a spot to the nuclear periphery was chosen. Looking at the scale bar/nuclear size and the position of the PLA spots for the NUP153/splicing factors, it appears that spots are often hundreds of nanometers away from the periphery. As the nuclear basket is thought to reach only about 100nm onto the nuclear interior, the conclusion by the authors that these interactions occur at the NPC would not be consistent with the data. The authors should better incorporate this in their interpretation of the data. Nup153/TPR are peripherally located at the most outer shells (0, 40% of signal and 1, 60% of signal). Consistently, based on figure 2d, Nup153:DDX39b is similarly distributed (0, 40% of signal and 1, 60% of signal) and the vast majority of the Nup153:AQR and Nup153:SF31A1 signal is also present in these two shells (20% and 40%). Although our super-resolution microscopy excludes the presence of internal Nup153 staining we cannot exclude that PLA potentially increases the signal of a possible internal Nup153/splicing interaction due to the rolling circle amplification reaction. However, as referred above this is not where primarily our interaction is occurring.

      • The conclusion ' Nup153 aids the loading of splicing machinery' is not sufficiently supported by the data. The authors observed a reduction in PLA signal for the NUP98-AQR interaction, but not the NUP98-SF3A1 (Figure 3g). Their conclusion has to reflect this discrepancy in their data. Moreover, the studies focus is to determine the role of NUP153/TPR in recruiting the splicing machinery to the NPC. As in the experiments the authors interrogate the interaction of only NUP98, who has to a large extend splicing factor interactions within the nuclear interior and not at the periphery, the relevance of the experiments in Figure 3 towards the main focus of the paper is unclear. The reviewer is right to point out that the study focuses on determining the role of Nup153/TPR in recruiting the splicing machinery to the NPC. As TPR is docked to the NPC through Nup153 (PMID: 12802065; 39127037) we investigated Nup98 and performed internal controls to show that 1) Nup98 wasn’t disrupted by shNup153 (Fig below) and technically 2) Nup98 was used in the shNup153 studies because of the availability of a reliable mouse antibody that could be coupled with the various rabbit antibodies used previously for SF3A1, DDX39, XAB2, and AQR in PLA experiments' for which mouse counterparts do not exist and would therefore hinder the continuation of the study as explained above. For TPR only a reliable rabbit antibody exists that works in our hands and therefore we wouldn’t have been able to perform the PLA experiments shown here

      Figure R3: Nup98 staining in wild type and shNup153 depleted Jurkat cells. In (a) co-immunofluorescence between AQR and Nup98 showing predominant positioning of Nup98 in the nuclear periphery (at the NPC). (b) Nup98 staining at the periphery persists also in shNup153 depleted cells, indicating that this Nup can be used as an NPC marker in Nup153 depleted conditions.

      When investigating the effect of NUP153 depletion on splicing, the authors observe a splicing phenotype for multiple NUP153 genes (Figure 5). The authors however show only a single negative control gene (CBX5). It would significantly strengthen their argument if the authors would investigate splicing defects of periphery located noneNUP153 bound genes as well as for genes located in the nuclear interior to better understand whether this splicing phenotype is indeed specific for NUP153 genes (at the nuclear periphery/NPC).

      We agree with the reviewer that expanding our observations to other peripherally located genes would be interesting; however, most of the other known peripheral genes are LAD-associated and mostly not expressed. While it was not our intention to claim that splicing at the periphery is specific only for Nup153-bound genes, we had obviously focused on Nup153-bound genes to understand the dynamics between Nup153/splicing machinery interactions. As stated above, other peripherally located genes within LADs are repressed.

      It Is out of the scope of this study to understand other relevant splicing hubs, as the reviewer knows splicing can occur throughout the nucleus at different sites (outside speckles). However, we do understand the reviewer's point of view, and to include more controls as requested by this and other reviewers, we have designed primers for additional non-Nup153 bound genes, and these additional experiments will be included in the manuscript.

      Figure R4: Preliminary data showing splicing of other Non-Nup153 Bound genes upon shNup153

      The authors state in the text describing the SABER-FISH experiments in Figure 5f that 'were able to visualize the presence of a site of transcription where accumulation of these probes was close to the periphery for all except for GSTK1, which showed a wider nuclear distribution, similar to CBX5 control region not bound by Nup153'. However, their statement is not supported by the images shown in Fig 5f, which show TS in control cells in the nuclear interior. Also, a single cell but no quantification is shown. Moreover, what distance from the periphery is considered as close to the periphery is not defined (see also earlier comment on the question what should be considered a periphery and/or NPC association).

      Measurement of the distance of FISH signals to the nuclear periphery for each probe (ie transript) performed in n>100 cells were represented graphically in Fig. 5f. We then compared total number of signals for each probe, obtained in shCtrl and shNup153 cells, and represented them graphically in Fig 5g; representative images shown in Fig 5g are those that were measured in Fig5g and represent the accumulation of the signal in cells upon shNup153 (not necessarily all at the nuclear periphery).

      We hope that this clarifies better what is represented.

      Limitation of the study does not discuss the limitations of the study but rather reads like the extension of the discussion. This section should be rewritten.

      We will take into consideration this comment and will expand the section in the revised manuscript

      Minor comments:

      Western in Figure 3c does not represent well the quantification in 3e.

      Figure S3 is mislabelled (pannel h is panel g).

      __Reviewer #1 (Significance (Required)): __

      Reviewer #1 (Significance (Required)):

      The manuscript interrogates an important question related to the role of the NPC in gene regulation, in particular how interaction of genes/pre-mRNAs with the NPC might stimulate expression of specific genes/mRNAs. Stimulating splicing would be one way that could contribute to efficient gene expression, and this is the question the authors address in this manuscript. This study is therefore important and relevant to a wide audience. However, as outlined in the section above, the conclusions drawn by the authors do not always reflect the experimental data, and it is therefore unclear whether the overall conclusion as stated in the title of the manuscript is valid. Moreover, conceptually, if intron containing genes are transcribed at or near nuclear pores, and splicing often occurs co-transcriptionally, it is to be expected to find splicing components close to nuclear pores. While it is relevant to show that this actually happens, and this is, at least in part, done by the authors. However, the experiments presented do not show that the splicing machinery actually actively docks to the NPC and is not just passively recruited close to NPCs because nascent pre-mRNAs are spliced where they are transcribed (the authors state in their title that the NUP153 docks the splicing machinery at the NPC). Showing this require identifying direct interactions between spliceosome components with NUP153/nuclear basket components to stimulate splicing at the NPC. If this would indeed be the case, these findings would describe a novel mechanistic step to stimulate efficient splicing and subsequently export of a selected set of NPC-associated genes. This would open other questions such as how to achieve specificity for only some pre-mRNAs/introns. While addressing this question is likely beyond the scope of this manuscript, the question whether the process described here is an active or passive process should be incorporated in the interpretation of the data.

      We are grateful to the reviewer for highlighting the importance and relevance of our work for a broad audience. We now provide additional experimental evidence that will hopefully aid in substantiating our overall conclusions, as suggested by the reviewer.

      __ Reviewer 2:__

      Summary:

      In this manuscript, using a combination of proximity labelling, immunoprecipitations and imaging, the authors report a physical interaction between splicing factors (SFs) and the nuclear basket of nuclear pore complexes (i.e. NUP153). Using DamID, they further identify a set of NUP153-bound genes characterized by long, GC-poor introns. Finally, based on molecular analyses for a set of candidate loci, they report that inactivation of NUP153 triggers a (modest) reduction of intron splicing, which may specifically affect NUP153-bound genes.

      Major comments:

      • The BioID experiments (Fig. 1) lack proper controls. Proteins biotinylated by NUP-BirA fusions need to be compared with those modified upon expression of a control BirA protein, as has been done previously, especially when other NUPs were used as baits in BioID experiments (PMID: 24927568, to be cited). This control fusion should ideally be targeted to the same compartment (i.e. the nucleus or the nuclear side of the nuclear envelope). In our experimental setting, we have opted to use an unrelated protein tagged with BirA (Lck-BirA) rather than BirA-only control. The peripheral membrane proteins of the Src family kinase (SFK) Lck and its GPF tagged version (LckN18.GFP) localize predominantly at the plasma membrane (PMID: 29588370), whereas the GFP only (non-biothinylated) shows a broader nuclear distribution. All MS-detected proteins from the Lckn18-BirA and GFP negative control experiments were excluded. Moreover, we have analysed carefully the published data of nuclear transporter receptors binding to the NPC and the respective controls (BirA alone or the shuttling NLS_NES_Dendra with C or N terminal tags) (PMID: 29254951), and we did not find that any of these protein controls interact with the proteins of the splicing machinery.

      • Here, the chosen controls are inappropriate as the authors are probing interactions between NPC proteins (NUP153/TPR) and proteins restricted to a different nuclear compartment (e.g., nucleophosmin in the nucleolus). We have used two different controls in our PLA experiments - initially, we used B23 for nucleolus stain and then PML protein, major component of PML NBs, as PML can be found scattered throughout the nucleus and sometimes even resides at the nuclear periphery. Both of these controls showed negligible amounts of PLA spots in all our experiments. We take the opportunity to clarify that in PLA, antibodies from different species have to be used. Nup153 is an anti-mouse antibody, and the control we used, PML is also an anti- mouse; therefore the two antibodies cannot be used in the PLA experiment. The controls used (mouse) were conjugated with rabbit antibodies, either TPR (PML) or AQR (B23). Both TPR:PML and AQR:B23 showed insignificant PLA signals. Therefore we can confidently conclude that the PLA spots seen for the NPC/splicing proteins are of measurable quantities.

      • Would a control soluble, diffusible nucleoplasmic protein be detected in the vicinity of the NPC and sometimes colocalized with Nups? Precisely for this purpose we have used PML protein, that can be found both disperse in nucleoplasm as well as in PML NBs.

      • In order to assess these possibilities, the authors should perform their immunoprecipitation on extracts treated with benzonase, thus abrogating DNA- and RNA-dependent interactions. We have performed this experiment assessing the binding of Nup153 with the components of the IBC and observed that Nup153 interaction with these splicing factors is DNA or RNA independent, with some factors being more affected than others (probably passively recruited by protein-protein interactions with their splicing counterparts).

      __Figure R5: __RNA or DNA do not affect the interaction between Nup153 and splicing proteins. Lysates from HEK293T cells transfected with eGFP-Nup153 or eGFP were treated with RNase or DNase or left untreated prior to Co-IP for GFP-Nup153. The membrane was probed for GFP (confirmed successful transfection), AQR, SF3A1, XAB2, and DDX39B. The graph shows quantified bands of the bound fractions, normalized to the input and untreated control from one experiment.

      NUP153 inactivation appears to have a modest effect on splicing (Fig. 5; S6), which is poorly characterized here. It is also unclear whether this effect is direct or caused by side consequences of the depletion of this nucleoporin (e.g., changes in nucleocytoplasmic exchanges or gene expression).

      We have indeed asked if the presence of splicing components at the periphery could be a consequence of protein trafficking. To address whether nucleocytoplasmic exchange has a role in these associations, we have pharmacologically inhibited nuclear import by ivermectin (IVM). ​​IVM has been shown to block the importin-α/β-mediated nuclear import by directly interacting with karyopherin importin-α.(PMID: 30826604). HEK293T cells transfected with eGFPNup153 or eGFP alone were treated with IVM for 2hr (24hrs post transfection). As biochemical fractionation demonstrated (data not shown) there were slightly decreased protein levels of AQR, DDX39B and SF3A1 in the nuclear insoluble fraction. However, we barely observed any decrease in interactions between Nup153 and the splicing components we tested, indicating that the interaction with the spliceosomal components is not a consequence of nucleocytoplasmic exchange.

      Figure R6: Association of splicing components with Nup153 is not only due to nuclear import. HEK293T cells transfected with eGFP-Nup153 or eGFP and treated with import inhibitor IVM or DMSO were analyzed with Co-IP for GFP-Nup153. The membrane was probed for GFP (confirmed successful transfection), AQR, SF3A1, XAB2 and DDX39 B. Quantified bands of the bound fractions were normalized to the input fraction and DMSO control and the results of two experiments were shown in the graph.

      Related to the gene expression levels, we have not observed any significant changes in total expression levels of tested genes, probed with designed exonic primers (as indicated with blue arrows in Figure 5a). Additional control genes will be added as suggested by this and other reviewers.

      • To confirm the specificity of the effects of NUP153 depletion on the splicing of NUP153-bound genes, the authors need to provide additional splicing measurements for several genes that are bound by NUP153 "in the nucleoplasm" (e.g. excluded from their analysis by the cutoff of proximity to LAD borders, line 192) and for other "non-NUP153" genes (beyond the unique control shown in Fig. S6a).

      We acknowledge the comment of the reviewer that our work will benefit from additional controls. We are currently designing primers and probes to amplify additional regions, Nup153 bound and non-LAD proximal or non-Nup153 bound; Please also see the comment below.

      - From the few examples provided, it is difficult to evaluate the type of splicing events affected by NUP153 inactivation. Are they uniquely intron retention events? The authors should analyze available RNA-seq data obtained from NUP153-depleted cells (PMID:32917881) to characterize the types of alternative splicing events that are impacted by NUP153.

      In Aksenova et al, only 28 differentially expressed genes were detected during rapid degron Nup153 depletion (2h). With this small number of genes, it is highly unlikely we would be able to perform a statistically significant and detailed analysis. Importantly, the depletion was performed in colorectal adenocarcinoma cell line (DLD-1), whereas here we are reporting on T lymphocytes. Based on the analysis which we performed and explained below, there seem to be significant cell type specific differences in Nup153 associations, as already reported by others (PMID: 27807035; 32451376; 28919367).

      Splicing dynamics and speckle localization propensity have been proposed to depend on the overall GC content and the overall average intron size by several studies (PMID: 39413186; 38720076; 35182478; 22832277; 35182477). Prompted by our observation that Nup153 genes have longer than average introns and lower than average GC content (Figure 4f and 4g), we analyzed the data from HeLa cells, where genes were classified into groups A, B and C based on their speckle localization and dynamics (PMID 39413186). We intersected our Nup153 genes with the list of ABC genes from HeLa cells and found that 43 out of our 461 protein-coding genes were represented among non-speckle enriched group C genes, with the lowest GC content and longest average intron length.

      Figure R7: Nup153 genes comparison to the A_B_C genes from Wu J et al study. GC content and number of introns, used to classify the identified genes from HeLa cells are plotted on X and Y axis. A genes in Red, B in green or C in turquoise from HeLa cells were compared with Nup153 genes from Jurkat cells in the graph on the left. Nup153 genes are represented as triangles. A subgroup of Nup153 genes, classified as C group genes (long introns with high GC content and spliced away from speckles) are shown as turquoise triangles. Graph on the right shows total pool of Nup153 genes in violet (not identified in HeLa cells) and a subgroup of C Nup153 genes as turquoise. The list of Nup153 C genes is shown below.

      query

      entrezgene

      name

      symbol

      ENSG00000168615

      8754

      ADAM metallopeptidase domain 9

      ADAM9

      ENSG00000139154

      121536

      AE binding protein 2

      AEBP2

      ENSG00000112249

      10973

      activating signal cointegrator 1 complex subunit 3

      ASCC3

      ENSG00000176788

      10409

      brain abundant membrane attached signal protein 1

      BASP1

      ENSG00000153956

      781

      calcium voltage-gated channel auxiliary subunit alpha2delta 1

      CACNA2D1

      ENSG00000153113

      831

      calpastatin

      CAST

      ENSG00000134371

      79577

      cell division cycle 73

      CDC73

      ENSG00000188517

      84570

      collagen type XXV alpha 1 chain

      COL25A1

      ENSG00000182158

      64764

      cAMP responsive element binding protein 3 like 2

      CREB3L2

      ENSG00000109861

      1075

      cathepsin C

      CTSC

      ENSG00000153904

      23576

      dimethylarginine dimethylaminohydrolase 1

      DDAH1

      ENSG00000139734

      81624

      diaphanous related formin 3

      DIAPH3

      ENSG00000102580

      5611

      DnaJ heat shock protein family (Hsp40) member C3

      DNAJC3

      ENSG00000173852

      23333

      dpy-19 like C-mannosyltransferase 1

      DPY19L1

      ENSG00000151914

      667

      dystonin

      DST

      ENSG00000165891

      144455

      E2F transcription factor 7

      E2F7

      ENSG00000138829

      2201

      fibrillin 2

      FBN2

      ENSG00000115414

      2335

      fibronectin 1

      FN1

      ENSG00000075420

      64778

      fibronectin type III domain containing 3B

      FNDC3B

      ENSG00000114861

      27086

      forkhead box P1

      FOXP1

      ENSG00000090615

      2802

      golgin A3

      GOLGA3

      ENSG00000196591

      3066

      histone deacetylase 2

      HDAC2

      ENSG00000071794

      6596

      helicase like transcription factor

      HLTF

      ENSG00000145012

      4026

      LIM domain containing preferred translocation partner in lipoma

      LPP

      ENSG00000065833

      4199

      malic enzyme 1

      ME1

      ENSG00000087053

      8898

      myotubularin related protein 2

      MTMR2

      ENSG00000145555

      4651

      myosin X

      MYO10

      ENSG00000061676

      10787

      NCK associated protein 1

      NCKAP1

      ENSG00000185630

      5087

      PBX homeobox 1

      PBX1

      ENSG00000113448

      5144

      phosphodiesterase 4D

      PDE4D

      ENSG00000163110

      10611

      PDZ and LIM domain 5

      PDLIM5

      ENSG00000070087

      5217

      profilin 2

      PFN2

      ENSG00000152952

      5352

      procollagen-lysine,2-oxoglutarate 5-dioxygenase 2

      PLOD2

      ENSG00000106772

      158471

      prune homolog 2 with BCH domain

      PRUNE2

      ENSG00000173482

      5797

      protein tyrosine phosphatase receptor type M

      PTPRM

      ENSG00000164292

      22836

      Rho related BTB domain containing 3

      RHOBTB3

      ENSG00000067900

      6093

      Rho associated coiled-coil containing protein kinase 1

      ROCK1

      ENSG00000112701

      26054

      SUMO specific peptidase 6

      SENP6

      ENSG00000154447

      57630

      SH3 domain containing ring finger 1

      SH3RF1

      ENSG00000187164

      57698

      shootin 1

      SHTN1

      ENSG00000198887

      23137

      structural maintenance of chromosomes 5

      SMC5

      ENSG00000116754

      9295

      serine and arginine rich splicing factor 11

      SRSF11

      ENSG00000152818

      7402

      utrophin

      UTRN

      Table R1: List of Nup153 genes that are characterized as C group of genes.

      Only two Nup153 gene were found among A and B genes (Serine and arginine rich splicing factor 11 SRSF11 among A, and Phosphodiesterase 4D PDE4D among B genes). Despite the cell type specific expression and splicing patterns it is worth noting that we find Nup153 genes enriched among C group genes that are spliced out of speckles. We are currently probing the splicing of some of these genes, and these data will be added to the list of control genes.

      Considering all these new observations related to the Nup153 splicing events and the general interest and relevance of our initial observations, a new dedicated study will have to be designed to tackle all these important questions that go beyond these current findings


      Minor comments:

      • Several studies have shown that the nuclear basket contributes to a splicing quality control process preventing the nuclear export of improperly spliced transcripts, both in yeast and mammalian cells (PMID:14718167, 19127978, 24452287, 25845599, 22253824, 22661231). These studies have to be mentioned and discussed here.

      • Line 31: "movement of active genes towards the NPC would be favorable for their transcription and export ". Please rephrase: "...transcription and mRNA export".

      • Line 163: "NUP153 plays a role in harboring splicing machinery". Please rephrase.

      • Line 200-202: Fig. 4d and 4e (instead of S4d and S4e)

      • Line 186 and beyond: All conclusions about NUP153-bound genes (e.g., "Majority of NUP153 bound genes are proximal to LADs and expressed") are not accurately phrased since the authors selected NUP153-bound genes with a cutoff of proximity to LAD borders. The conclusions are thus only valid for a subpopulation of NUP153-bound regions located in the vicinity of LADs.

      • Line 292: "transport Nups less likely interact with splicing machinery". The term "transport Nup" is not correct. Does this mean "nuclear transport receptors"? Or "FG-Nups" (which interact with NTRs)?

      All the comments will be addressed in the revised version of the manuscript

      Reviewer #2 (Significance (Required):

      It is increasingly recognized that NPCs are involved in a number of cellular processes beyond nucleo-cytoplasmic transport and, in particular, contribute to several genomic functions. In this context, the identification of a physical and functional interaction between NPCs and the splicing machinery could be of conceptual interest in the NPC field, and more generally, in cell and genome biology, although it needs to be (i) carefully controlled and validated in view of the strong limitations mentioned above, and (ii) discussed in line with the known links between the nuclear basket and splicing quality control (see minor comments). This coupling would be particularly relevant for genes that have been shown to be positioned at NPCs during transcriptional activation, in line with the "gene gating" model mentioned by the authors.

      We greatly appreciate these insightful comments and suggestions, and the time and effort that this reviewer invested in critical reading of our manuscript. We will certainly take the points into account as we revise the manuscript. Specifically, we will carefully address the concerns related to the NPC-splicing interaction, ensuring that the experimental validation is robust and well-controlled, and we will further discuss the connection between the nuclear basket and splicing quality control in the context of our findings.

      Once again, thank you for your thoughtful and constructive feedback.

      Reviewer #3

      The authors discovered that the splicing machinery and nuclear baskets are sometimes in close proximity using Nup153 as a representative for the nuclear basket. They characterize this interaction using several different methods and propose that NUP153 is required to assemble the splicing machinery on genes that are transcribed in the nuclear periphery, which would supporting the gene gating model.

      The manuscript is well written and structured and the experiments are carefully conducted and analyzed.

      We thank the reviewer for the appreciation of our work. We have addressed all the major points here and will amend the manuscript text according to the suggestions.

      Reviewer #3 (Significance (Required)):

      The impression that I get from this manuscript is that we are looking at rather rare events with a small effect size. A definitive proof that the splicing machinery really assembles in the vicinity of NPCs docked via NUP153 is lacking. To assist in the revision process I will raise some questions to discuss but also propose some additional experiments to substantiate the claims.

      1. It is not clear what NUP153 really binds to and which domain is important. The experiments shown suggest proximity and indirect interactions (Co-IPs), but it is not clear whether NUP153 binds to DNA, RNA or a specific splicing factor. The bioID experiment might label splicing factors because they are cargos that pass through the pores during import, or it labels splicing factors that remain bound to spliced mRNPs during mRNP export. For example DDX39b, also called UAP56 is an important subunit of the TREX complex and involved the final packaging of mRNPs at the NPC. In my opinion, this protein is not a good choice. Also, the negative control that was used in the experiments is a potassium channel in the plasmamembrane, which can exclude that signal occurs by chance. But it would have been better to use a nuclear protein as control to exclude these possibilities.

      In line with a rare event, the Co-IP signals are very weak and barely higher than the GFP control. They should be repeated in the presence of RNase to confirm that this interaction occurs on nascent RNA during splicing and not e.g. to recycle or reroute splicing factors or during import.

      We acknowledge all the points, some of them already brought to our attention by other reviewers, that we tried to address throughout our response here, and we will also incorporate our answers in the revised manuscript. In particular, we have already provided some evidence related to the role of DNA, RNA, as shown above in Figure R5 (Response to R2). We have also addressed the effect of nuclear-cytosolic transport by using IVM and described these results in Figure R6, showing that splicing factors interact with Nup153 even when cytosolic transport is blocked with IVM. We have also commented on the use of controls and on the additional control analysis that we performed (also mentioned in more detail in response to R2)

      Moreover, we are also further trying to understand the binding of Nup153 to the splicing components. Intron Binding Complex, recently shown to be crucial for the activation of the spliceosome due to the activity of its helicase AQR (PMID: 37165190) is one of the protein complexes that we found bound to the NPC basket. We are interested in different functions of this helicase and have created the previously described mutant that has been shown to be defective in splicing. We have probed the interaction of Nup153 to this mutant, which we also characterized for its splicing inefficiency, and observed that Nup153 interacts with the splicing competent AQR, whereas the interaction with the splicing mutant seems to be less efficientThis set of additional data strengthens the bulk of data present here, details of which remain still to be further elucidated in an additional follow-up study.

      __Figure R8 __Lysates from HEK 293T cells, transiently transfected with eGFP-NUP153 and AQR-His-FLAG or AQRK829A-His-FLAG were probed in co-IP experiments. Western blot of the co-IP performed with Dynabeads for His-tagged proteins; input (IN), unbound (U) and bound (B) fractions are shown. Densitometric analysis of the co-IP where eGFP-NUP153 intensity was quantified, bound fractions were normalized to input samples, and the results are expressed relative to the AQR-His-FLAG control (n = 2). pENTER is a control empty plasmid.

      I do not understand why a NUP would be required to recruit or tether splicing factors to peripheral genes. Usually, splicing factors hitchhike on transcribing polymerase II or they are delivered by nuclear speckles which could happen also at the periphery. The authors should co-stain with a nuclear speckle marker to exclude this possibility.

      __Figure R9 __STED microscopy resolves the position of sc-35, marker of splicing speckles with respect to AQR. Jurkat T cells were stained with anti AQR AB (rabbit) and anti - sc35 antibody (mouse) to probe the positioning of splicing speckles with respect to the splicing helicase AQR.

      This is a very interesting remark, which we have addressed through co-staining experiments. Since the Nup153 antibody we used is anti-mouse, like SC-35, we instead co-stained SC-35 with AQR, a representative splicing factor. Our results show that a substantial number of AQR spots are detected away from nuclear speckles and near the nuclear rim, suggesting that a subset of splicing factors localize independently of speckles. While we have not directly stained the nuclear periphery, this pattern is consistent with the idea that splicing factors can be recruited outside of speckle-mediated delivery.

      To further confirm that NPC-associated splicing does not rely on nuclear speckles, we are open to performing additional co-staining between Nup153 and SON, another speckle marker in a followup study. Furthermore, emerging evidence supports off-speckle splicing, particularly for genes with long introns and low GC content (PMID: 39413186, 38720076, 35182478, 22832277, 35182477). Our additional analysis (Figure R7) demonstrates that genes associated with Nup153 share characteristics with known off-speckle spliced genes, suggesting that these genes might be processed outside of speckles due to their transcription and splicing kinetics

      What would be the advantage to splice in the vicinity of the pore? Given that genes with long introns take a long time to be transcribed, splicing would block the pores for hours and would prevent other activities. It would also be possible that splicing does occur at the periphery but NUP153 picks the mRNPs up at a later stage.

      We thank the reviewer for these insightful comments. We would like to add here that there are numerous pores, according to our own estimations around 800 pores in Jurkat cells, which implies that there is also huge heterogeneity of the NPC which is at present largely unexplored. In yeast, some pores are basketless and the assembly of the basket is transcription dependent (PMID: 36220102) - suggesting that there’s more than one pore population. Similarly, looking into the statuses of genes that have been associated with pore, both polycomb-repressed and transcriptionally active genes have been found at pores- again pointing to an heterogeneity. This is a very interesting (and large) question that we pose ourselves and to the NPC field but not something we can address straight away.

      One major drawback of the story is that the authors use very long-term depletion of NUP153 via shRNAs that will definitely screw up the import of many nuclear proteins; and a splicing inhibitor that has broad effects on nuclear architecture. Degron lines of Nup153 exist and should be used to substantiate at least some of the conclusions. Alternatively, a NUP153 mutant without zinc finger or IDR could be used to prevent DNA binding or basket association.

      The reviewer is right, we have for over 2 years tried tirelessly to use the degron Nup153 system established in DLD-1 cells by Dasso’s Lab. This has been unsuccessful. Jurkats are difficult to transfect and more sensitive than other cell lines and they died with our trials. We have therefore used the shNup153 which has been used in X and Y and shown to not interfere with nucleocytoplasmic trafficking.

      However, we understand the reviewers point and since then have tried to use a Nup153 mutant construct, containaing Nup153 N-terminus with and without a zinc finger (McKay et al 2009.), kindly sent by the Ulman Lab. Unfortunately, in our hands, the construct has low levels of GFP:Nup153 expression, not comparable to the ones we used in our coIP experiments and that would make conclusions hard.

      We are now planning the cloning of our GFP:Nup153 construct to produce such plasmids,

      N-terminus with/without the Zinc finger and use it in coIP experiments to understand the importance of the Zn finger domain in the splicing interactions.

      Specific comments: Abstract: suggesting that a fraction of splicing occurs at the NPC. speckle-distant splicing events, it should be nuclear speckles, term not explained Line 20: Super enhancers not introduced

      Line 39: Splicing and the different spliceosomal subcomplexes needs more explanation and introduction to understand the selection of proteins that were used in the study. Line 65: Choice of controls. LckN18 The genes should be written once in full and their choice should be explained better.

      Line 123: The authors state: 'However, we detected a lower number of interacting spots between Nup98/DDX39b than with its Nup153 counterpart; a similar trend is followed between Nup98 and SF3A1 (Fig. S2g), suggesting that the interaction between splicing proteins and Nup98 might be further apart within the NPC structure.

      A more likely explanation is that both proteins are shed from the mRNP at the basket as they are not shuttling with the mRNA and should not enter the pore.

      Line 131: Mention right at the beginning of the sentence which splicing proteins were imaged here.

      Line 141: Nuclear speckles are still not properly introduced. Why is SF3A1 not expected to be in nuclear speckles? It Should be co-stained for nuclear speckle markers. Lane 149: PlaB drastically changes the nuclear architecture. The reduced interactions can have also different reasons. The authors should image the distribution of the investigated factors in the presence of PlaB. Again the Co-IPs should be performed in the presence of RNase to confirm that the observed interactions depend on RNA. Line 181: The requirement of Nup153 to tether the splicing machinery to the NPC is not convincing from the presented data. The knockdown is way too long and the changes are tiny. Wouldn't it be better to use a Nup153 mutant without Zinc knuckle or IDR to show that now the splicing factors interactions are lost? Alternatively degron lines should be used.

      Line 186: I do not understand the logic why NUP153 needs to bind to chromatin to fullfil its function in splicing. It could also bind to RNA with its zinc knuckle or IDRs. The authors should perform iCLIP or RIP to exclude this possibility? I also do not understand the logic to look to look for proximity to repressive LADs as a criterion, while investigating a function of NUP153 in splicing which requests actively transcribing genes. This has to be better motivated. Excluding the nucleoplasmic pool of NUP153 removes important data points that might be functionally relevant. Line 187: The entire paragraph on Dam-ID and all subsequent genome-wide analyses is way too densely written and hard to understand for non-experts. Analysis tools or thresholds are rarely given and it is unclear how the different data sets have been made and by who, what they mean and how they have been integrated. Chromatin patterns and expression profiles are very unspecific terms. Many of the used terms have not been introduced properly. The metaplots show very small differences. In the end I am not sure what we have learned from all the data integration. Is NUP153 bound to DNA, to nucleosomes, to nascent RNA or to splicing factors?

      Lane 232: Unclear what NUP153 introns are. Is the entire gene where NUP153 binds to considered or only the intron with a NUP153 peak.

      Lane 242: Again, the shRNA knockdown performed in this manuscript is way to long to observe a direct effect on splicing at the pore, which occurs at the level of minutes. Degrons should be used here to confirm this observation.

      Line 249: More negative controls are needed for genes not bound by NUP153 for the splicing analysis. RNA-Seq analyzed for intron retention could be helpful.

      All the text and specific comments will be addressed as suggested by the reviewer.

      The experimental part to be included in the revised manuscript is explained in more detail above.

      Reviewer expertise (keywords): nuclear pore complexes, nuclear organization, gene expression, mRNA biology

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors discovered that the splicing machinery and nuclear baskets are sometimes in close proximity using Nup153 as a representative for the nuclear basket. They characterize this interaction using several different methods and propose that NUP153 is required to assemble the splicing machinery on genes that are transcribed in the nuclear periphery, which would supporting the gene gating model.

      The manuscript is well written and structured and the experiments are carefully conducted and analyzed.

      Significance

      The impression that I get from this manuscript is that we are looking at rather rare events with a small effect size. A definitive proof that the splicing machinery really assembles in the vicinity of NPCs docked via NUP153 is lacking. To assist in the revision process I will raise some questions to discuss but also propose some additional experiments to substantiate the claims.

      1. It is not clear what NUP153 really binds to and which domain is important. The experiments shown suggest proximity and indirect interactions (Co-IPs), but it is not clear whether NUP153 binds to DNA, RNA or a specific splicing factor. The bioID experiment might label splicing factors because they are cargos that pass through the pores during import, or it labels splicing factors that remain bound to spliced mRNPs during mRNP export. For example DDX39b, also called UAP56 is an important subunit of the TREX complex and involved the final packaging of mRNPs at the NPC. In my opinion, this protein is not a good choice. Also, the negative control that was used in the experiments is a potassium channel in the plasmamembrane, which can exclude that signal occurs by chance. But it would have been better to use a nuclear protein as control to exclude these possibilities. In line with a rare event, the Co-IP signals are very weak and barely higher than the GFP control. They should be repeated in the presence of RNase to confirm that this interaction occurs on nascent RNA during splicing and not e.g. to recycle or reroute splicing factors or during import.
      2. I do not understand why a NUP would be required to recruit or tether splicing factors to peripheral genes. Usually, splicing factors hitchhike on transcribing polymerase II or they are delivered by nuclear speckles which could happen also at the periphery. The authors should co-stain with a nuclear speckle marker to exlcude this possibility.
      3. What would be the advantage to splice in the vicinity of the pore? Given that genes with long introns take a long time to be transcribed, splicing would block the pores for hours and would prevent other activities. It would also be possible that splicing does occur at the periphery but NUP153 picks the mRNPs up at a later stage.
      4. One major drawback of the story is that the authors use very long-term depletion of NUP153 via shRNAs that will definitely screw up the import of many nuclear proteins; and a splicing inhibitor that has broad effects on nuclear architecture. Degron lines of Nup153 exist and should be used to substantiate at least some of the conclusions. Alternatively, a NUP153 mutant without zinc finger or IDR could be used to prevent DNA binding or basket association.

      Specific comments:

      Abstract: suggesting that a fraction of splicing occurs at the NPC. speckle-distant splicing events, it should be nuclear speckles, term not explained

      Line 20: Super enhancers not introduced

      Line 39: Splicing and the different spliceosomal subcomplexes needs more explanation and introduction to understand the selection of proteins that were used in the study.

      Line 65: Choice of controls. LckN18 The genes should be written once in full and their choice should be explained better.

      Line 123: The authors state: 'However, we detected a lower number of interacting spots between Nup98/DDX39b than with its Nup153 counterpart; a similar trend is followed between Nup98 and SF3A1 (Fig. S2g), suggesting that the interaction between splicing proteins and Nup98 might be further apart within the NPC structure. A more likely explanation is that both proteins are shed from the mRNP at the basket as they are not shuttling with the mRNA and should not enter the pore.

      Line 131: Mention right at the beginning of the sentence which splicing proteins were imaged here.

      Line 141: Nuclear speckles are still not properly introduced. Why is SF3A1 not expected to be in nuclear speckles? It Should be co-stained for nuclear speckle markers.

      Lane 149: PlaB drastically changes the nuclear architecture. The reduced interactions can have also different reasons. The authors should image the distribution of the investigated factors in the presence of PlaB. Again the Co-IPs should be performed in the presence of RNase to confirm that the observed interactions depend on RNA.

      Line 181: The requirement of Nup153 to tether the splicing machinery to the NPC is not convincing from the presented data. The knockdown is way too long and the changes are tiny. Wouldn't it be better to use a Nup153 mutant without Zinc knuckle or IDR to show that now the splicing factors interactions are lost? Alternatively degron lines should be used.

      Line 186: I do not understand the logic why NUP153 needs to bind to chromatin to fullfil its function in splicing. It could also bind to RNA with its zinc knuckle or IDRs. The authors should perform iCLIP or RIP to exclude this possibility? I also do not understand the logic to look to look for proximity to repressive LADs as a criterion, while investigating a function of NUP153 in splicing which requests actively transcribing genes. This has to be better motivated. Excluding the nucleoplasmic pool of NUP153 removes important data points that might be functionally relevant.

      Line 187: The entire paragraph on Dam-ID and all subsequent genome-wide analyses is way too densely written and hard to understand for non-experts. Analysis tools or thresholds are rarely given and it is unclear how the different data sets have been made and by who, what they mean and how they have been integrated. Chromatin patterns and expression profiles are very unspecific terms. Many of the used terms have not been introduced properly. The metaplots show very small differences. In the end I am not sure what we have learned from all the data integration. Is NUP153 bound to DNA, to nucleosomes, to nascent RNA or to splicing factors?

      Lane 232: Unclear what NUP153 introns are. Is the entire gene where NUP153 binds to considered or only the intron with a NUP153 peak.

      Lane 242: Again, the shRNA knockdown performed in this manuscript is way to long to observe a direct effect on splicing at the pore, which occurs at the level of minutes. Degrons should be used here to confirm this observation.

      Line 249: More negative controls are needed for genes not bound by NUP153 for the splicing analysis. RNA-Seq analyzed for intron retention could be helpful.

      There is quite some typos and missing words in the text.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, using a combination of proximity labelling, immunoprecipitations and imaging, the authors report a physical interaction between splicing factors (SFs) and the nuclear basket of nuclear pore complexes (i.e. NUP153). Using DamID, they further identify a set of NUP153-bound genes characterized by long, GC-poor introns. Finally, based on molecular analyses for a set of candidate loci, they report that inactivation of NUP153 triggers a (modest) reduction of intron splicing, which may specifically affect NUP153-bound genes.

      Major comments:

      1. The data presented do not convincingly demonstrate a specific interaction between the NPC basket and the splicing machinery, mainly due to the lack of appropriate controls.
        • The BioID experiments (Fig. 1) lack proper controls. Proteins biotinylated by NUP-BirA fusions need to be compared with those modified upon expression of a control BirA protein, as has been done previously, especially when other NUPs were used as baits in BioID experiments (PMID: 24927568, to be cited). This control fusion should ideally be targeted to the same compartment (i.e. the nucleus or the nuclear side of the nuclear envelope).
        • NUP153 immunoprecipitates only very low levels of splicing factors, which are almost indistinguishable from those detected in control pull-downs (see for example AQR or XAB2 signals in blot images and error bars in the quantifications, Fig. 2a). In addition, in these analyses, the high/saturated signals do not allow comparison of the abundance of the proteins of interest in the input samples under different conditions. These limitations also apply to the interpretation of the changes in NUP153-SF association scored upon splicing inhibition (Fig. 3a), which also seems to affect SF abundance in inputs (e.g. AQR, SF3A1).
        • PLA experiments (e.g. Fig. 2b-d) also lack proper control. PLA has been shown to reveal artefactual signals for abundant proteins present in the same compartment (doi.org/10.1101/411355). Here, the chosen controls are inappropriate as the authors are probing interactions between NPC proteins (NUP153/TPR) and proteins restricted to a different nuclear compartment (e.g., nucleophosmin in the nucleolus). An abundant nucleoplasmic protein/epitope should be used as a control in these experiments. In addition, the authors need to show direct immunofluorescence images for each antibody used in PLA assays, in order to verify that the expression levels or localization of their targets are unchanged between conditions (e.g. upon PladB treatment, Fig. S3g, or NUP153 depletion, Fig. 3g).
        • The proximal localization of NUP153 and splicing factors in super resolution microscopy (Fig. 2e-f) is also not properly controlled. Would a control soluble, diffusible nucleoplasmic protein be detected in the vicinity of the NPC and sometimes colocalized with Nups?
        • Since NUP153 is located in the vicinity of peripheral genes (as also shown here through DamID), some of which contain introns, its association with the spliceosome could be indirect, i.e., mediated by DNA. Of note, the association of Mlp1 (the yeast ortholog of TPR) with the splicing factor SF1 has been shown to be mediated by RNAs (PMID:14718167). In order to assess these possibilities, the authors should perform their immunoprecipitation on extracts treated with benzonase, thus abrogating DNA- and RNA-dependent interactions.
      2. NUP153 inactivation appears to have a modest effect on splicing (Fig. 5; S6), which is poorly characterized here. It is also unclear whether this effect is direct or caused by side consequences of the depletion of this nucleoporin (e.g., changes in nucleocytoplasmic exchanges or gene expression).
        • To confirm the specificity of the effects of NUP153 depletion on the splicing of NUP153-bound genes, the authors need to provide additional splicing measurements for several genes that are bound by NUP153 "in the nucleoplasm" (e.g. excluded from their analysis by the cutoff of proximity to LAD borders, line 192) and for other "non-NUP153" genes (beyond the unique control shown in Fig. S6a).
        • From the few examples provided, it is difficult to evaluate the type of splicing events affected by NUP153 inactivation. Are they uniquely intron retention events? The authors should analyze available RNA-seq data obtained from NUP153-depleted cells (PMID:32917881) to characterize the types of alternative splicing events that are impacted by NUP153.

      Minor comments:

      • Several studies have shown that the nuclear basket contributes to a splicing quality control process preventing the nuclear export of improperly spliced transcripts, both in yeast and mammalian cells (PMID:14718167, 19127978, 24452287, 25845599, 22253824, 22661231). These studies have to be mentioned and discussed here.
      • Line 31: "movement of active genes towards the NPC would be favorable for their transcription and export ". Please rephrase: "...transcription and mRNA export".
      • Line 163: "NUP153 plays a role in harboring splicing machinery". Please rephrase.
      • Line 200-202: Fig. 4d and 4e (instead of S4d and S4e)
      • Line 186 and beyond: All conclusions about NUP153-bound genes (e.g., "Majority of NUP153 bound genes are proximal to LADs and expressed") are not accurately phrased since the authors selected NUP153-bound genes with a cutoff of proximity to LAD borders. The conclusions are thus only valid for a subpopulation of NUP153-bound regions located in the vicinity of LADs.
      • Line 292: "transport Nups less likely interact with splicing machinery". The term "transport Nup" is not correct. Does this mean "nuclear transport receptors"? Or "FG-Nups" (which interact with NTRs)?

      Significance

      It is increasingly recognized that NPCs are involved in a number of cellular processes beyond nucleo-cytoplasmic transport and, in particular, contribute to several genomic functions. In this context, the identification of a physical and functional interaction between NPCs and the splicing machinery could be of conceptual interest in the NPC field, and more generally, in cell and genome biology, although it needs to be (i) carefully controlled and validated in view of the strong limitations mentioned above, and (ii) discussed in line with the known links between the nuclear basket and splicing quality control (see minor comments). This coupling would be particularly relevant for genes that have been shown to be positioned at NPCs during transcriptional activation, in line with the "gene gating" model mentioned by the authors.

      Reviewer expertise (keywords): nuclear pore complexes, nuclear organization, gene expression, mRNA biology

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors first use a Bio-ID approach to search for interactors of the basket proteins TPR and NUP153, identifying proteins involved in various nuclear process, including many splicing components, and confirm some of these interactions using IP and PLA assays. PLA experiments further suggest that these interactions occur primary at or close to the nuclear periphery. Moreover, inhibiting splicing, but not transcription, reduced these interactions. The authors then investigated the role of NUP153 in loading of the splicing machinery and found a lower association of the NUP98/SF3A1 but not AQR interaction (measured through PLA). Furthermore, DamID experiments identified NUP153 bound genes proximal to LAD domains that are actively transcribed, contain overall longer introns with low GC-content, and fall within a group of genes located at the outermost shell of the nucleus (when compared to previously published LaminI ID /PGseq data). Lastly, they interrogate whether depletion of NUP153 results in a splicing defect for NUP153 bound genes.

      The authors identify many proteins in their BioID interaction screen, however, only a single nucleoporin (Nup35, an inner ring protein). Previous BioID studies have identified NUP153 in BioID experiments including proteins of the Y-complex (PMID: 24927568 and others). To ensure that the BioID experiments indeed probe for interactions of NUP152 and TPR at the NPC, the authors should include control experiments that show that their NUP153 and TPR-BirA fusions primarily localize to the NPC. If a significant fraction is not NPC bound, this has to be taken into account interpreting/discussing their data.<br /> The authors should be more precise when describing the role of the different splicing factor identified in the BioID screen and their function in specific steps of splicing, as this is important when claiming that they identify factors acting at all steps of splicing. For example, the authors describe DDX39b/UAP56 as an early splicing factor; DDX39b/UAP56 main role however seems to be in mRNA export and mRNP compaction. The authors might want to include this in the interpretation of their data.

      Concerning PLA experiment controls, the authors perform TPR-PML as a negative control, however, no negative control for NUP153 is shown (Figure 2). Such a control should be added to allow evaluating the specificity of NUP153 PLA interactions, and/or discussed why this was not done.

      Quantification of the distance of PLA-NUP153/TPR interactions show interactions mainly close to the nuclear periphery. The imaging data shown in Figure 2b indeed shows that TPR/NUP153 interactions are exclusively at the nuclear periphery, whereas NUP153/splicing factor interactions are sometimes at the edge of the DAPI signal, but mostly somehow internalized (Figure 2B, S2b). Quantification (Figure 2d) shows these distributions to be very similar, likely due to the way the quantification was performed / the bin size of plotting the relative distance of a spot to the nuclear periphery was chosen. Looking at the scale bar/nuclear size and the position of the PLA spots for the NUP153/splicing factors, it appears that spots are often hundreds of nanometers away from the periphery. As the nuclear basket is thought to reach only about 100nm onto the nuclear interior, the conclusion by the authors that these interactions occur at the NPC would not be consistent with the data. The authors should better incorporate this in their interpretation of the data. The conclusion ' Nup153 aids the loading of splicing machinery' is not sufficiently supported by the data. The authors observed a reduction in PLA signal for the NUP98-AQR interaction, but not the NUP98-SF3A1 (Figure 3g). Their conclusion has to reflect this discrepancy in their data. Moreover, the studies focus is to determine the role of NUP153/TPR in recruiting the splicing machinery to the NPC. As in the experiments the authors interrogate the interaction of only NUP98, who has to a large extend splicing factor interactions within the nuclear interior and not at the periphery, the relevance of the experiments in Figure 3 towards the main focus of the paper is unclear. When investigating the effect of NUP153 depletion on splicing, the authors observe a splicing phenotype for multiple NUP153 genes (Figure 5). The authors however show only a single negative control gene (CBX5). It would significantly strengthen their argument if the authors would investigate splicing defects of periphery located noneNUP153 bound genes as well as for genes located in the nuclear interior to better understand whether this splicing phenotype is indeed specific for NUP153 genes (at the nuclear periphery/NPC). The authors state in the text describing the SABER-FISH experiments in Figure 5f that 'were able to visualize the presence of a site of transcription where accumulation of these probes was close to the periphery for all except for GSTK1, which showed a wider nuclear distribution, similar to CBX5 control region not bound by Nup153'. However, their statement is not supported by the images shown in Fig 5f, which show TS in control cells in the nuclear interior. Also, a single cell but no quantification is shown. Moreover, what distance from the periphery is considered as close to the periphery is not defined (see also earlier comment on the question what should be considered a periphery and/or NPC association).

      Limitation of the study does not discuss the limitations of the study but rather reads like the extension of the discussion. This section should be rewritten.

      Minor comments:

      Western in Figure 3c does not represent well the quantification in 3e.

      Figure S3 is mislabelled (pannel h is panel g).

      Significance

      The manuscript interrogates an important question related to the role of the NPC in gene regulation, in particular how interaction of genes/pre-mRNAs with the NPC might stimulate expression of specific genes/mRNAs. Stimulating splicing would be one way that could contribute to efficient gene expression, and this is the question the authors address in this manuscript. This study is therefore important and relevant to a wide audience. However, as outlined in the section above, the conclusions drawn by the authors do not always reflect the experimental data, and it is therefore unclear whether the overall conclusion as stated in the title of the manuscript is valid. Moreover, conceptually, if intron containing genes are transcribed at or near nuclear pores, and splicing often occurs co-transcriptionally, it is to be expected to find splicing components close to nuclear pores. While it is relevant to show that this actually happens, and this is, at least in part, done by the authors. However, the experiments presented do not show that the splicing machinery actually actively docks to the NPC and is not just passively recruited close to NPCs because nascent pre-mRNAs are spliced where they are transcribed (the authors state in their title that the NUP153 docks the splicing machinery at the NPC). Showing this require identifying direct interactions between spliceosome components with NUP153/nuclear basket components to stimulate splicing at the NPC. If this would indeed be the case, these findings would describe a novel mechanistic step to stimulate efficient splicing and subsequently export of a selected set of NPC-associated genes. This would open other questions such as how to achieve specificity for only some pre-mRNAs/introns. While addressing this question is likely beyond the scope of this manuscript, the question whether the process described here is an active or passive process should be incorporated in the interpretation of the data.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      I would like to thank the reviewers for their comments and interest in the manuscript and the study.

      Reviewer #1

      1. I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning.

      The directional positioning of CTCF-binding sites at chromatin interaction sites was analyzed by CRISPR experiment (Guo Y et al. Cell 2015). We found that the machine learning and statistical analysis showed the same directional bias of CTCF-binding motif sequence and RAD21-binding motif sequence at chromatin interaction sites as the experimental analysis of Guo Y et al. (lines 229-253, Figure 3b, c, d and Table 1). Since CTCF is involved in different biological functions (Braccioli L et al. Essays Biochem. 2019 ResearchGate webpage), the directional bias of binding sites may be reduced in all binding sites including those at chromatin interaction sites (lines 68-73). In our study, we investigated the DNA-binding sites of proteins using the ChIP-seq data of DNA-binding proteins and DNase-seq data. We also confirmed that the DNA-binding sites of SMC3 and RAD21, which tend to be found in chromatin loops with CTCF, also showed the same directional bias as CTCF by the computational analysis.

      __2. Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. __

      Following the reviewer's advice, I performed the same analysis with the DNA-binding sites that do no overlap with the DNA-binding sites of CTCF and cohesin (RAD21 and SMC3) (Fig. 6 and Supplementary Fig. 4). The result showed the same tendency in the distribution of DNA-binding sites. The height of a peak on the graph became lower for some DNA-binding proteins after removing the DNA-binding sites that overlapped with those of CTCF and cohesin. I have added the following sentence on lines 435 and 829: For the insulator-associated DBPs other than CTCF, RAD21, and SMC3, the DNA-binding sites that do not overlap with those of CTCF, RND21, and SMC3 were used to examine their distribution around interaction sites.

      3. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As suggested by the reviewer, I have added the insulator scores and boundary sites from the 4D nucleome data portal as tracks in the UCSC genome browser. The insulator scores seem to correspond to some extent to the H3K27me3 histone marks from ChIP-seq (Fig. 4a and Supplementary Fig. 3). We found that the DNA-binding sites of the insulator-associated DBPs were statistically overrepresented in the 5 kb boundary sites more than other DBPs (Fig. 4d). The direction of DNA-binding sites on the genome can be shown with different colors (e.g. red and green), but the directionality of insulator-associated DNA-binding sites is their overall tendency, and it may be difficult to notice the directionality from each binding site because the directionality may be weaker than that of CTCF, RAD21, and SMC3 as shown in Table 1 and Supplementary Table 2. We also observed the directional biases of CTCF, RAD21, and SMC3 by using Micro-C chromatin interaction data as we estimated, but the directionality was more apparent to distinguish the differences between the four directions of FR, RF, FF, and RR using CTCF-mediated ChIA-pet chromatin interaction data (lines 287 and 288).

       I found that the CTCF binding sites examined by a wet experiment in the previous study may not always overlap with the boundary sites of chromatin interactions from Micro-C assay (Guo Y et al. *Cell* 2015). The chromatin interaction data do not include all interactions due to the high sequencing cost of the assay, and include less long-range interactions due to distance bias. The number of the boundary sites may be smaller than that of CTCF binding sites acting as insulators and/or some of the CTCF binding sites may not be locate in the boundary sites. It may be difficult for the boundary location algorithm to identify a short boundary location. Due to the limitations of the chromatin interaction data, I planned to search for insulator-associated DNA-binding proteins without using chromatin interaction data in this study.
      
       I discussed other causes in lines 614-622: Another reason for the difference may be that boundary sites are more closely associated with topologically associated domains (TADs) of chromosome than are insulator sites. Boundary sites are regions identified based on the separation of numerous chromatin interactions. On the other hand, we found that the multiple DNA-binding sites of insulator-associated DNA-binding proteins were located close to each other at insulator sites and were associated with distinct nested and focal chromatin interactions, as reported by Micro-C assay. These interactions may be transient and relatively weak, such as tissue/cell type, conditional or lineage-specific interactions.
      
       Furthermore, I have added the statistical summary of the analysis in lines 372-395 as follows: Overall, among 20,837 DNA-binding sites of the 97 insulator-associated proteins found at insulator sites identified by H3K27me3 histone modification marks (type 1 insulator sites), 1,315 (6%) overlapped with 264 of 17,126 5kb long boundary sites, and 6,137 (29%) overlapped with 784 of 17,126 25kb long boundary sites in HFF cells. Among 5,205 DNA-binding sites of the 97 insulator-associated DNA-binding proteins found at insulator sites identified by H3K27me3 histone modification marks and transcribed regions (type 2 insulator sites), 383 (7%) overlapped with 74 of 17,126 5-kb long boundary sites, 1,901 (37%) overlapped with 306 of 17,126 25-kb long boundary sites. Although CTCF-binding sites separate active and repressive domains, the limited number of DNA-binding sites of insulator-associated proteins found at type 1 and 2 insulator sites overlapped boundary sites identified by chromatin interaction data. Furthermore, by analyzing the regulatory regions of genes, the DNA-binding sites of the 97 insulator-associated DNA-binding proteins were found (1) at the type 1 insulator sites (based on H3K27me3 marks) in the regulatory regions of 3,170 genes, (2) at the type 2 insulator sites (based on H3K27me3 marks and gene expression levels) in the regulatory regions of 1,044 genes, and (3) at insulator sites as boundary sites identified by chromatin interaction data in the regulatory regions of 6,275 genes. The boundary sites showed the highest number of overlaps with the DNA-binding sites. Comparing the insulator sites identified by (1) and (3), 1,212 (38%) genes have both types of insulator sites. Comparing the insulator sites between (2) and (3), 389 (37%) genes have both types of insulator sites. From the comparison of insulator and boundary sites, we found that (1) or (2) types of insulator sites overlapped or were close to boundary sites identified by chromatin interaction data.
      

      4. The suggested alternative transcripts function, also highlighted in the manuscripts abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.

      According to the reviewer's comment, I performed the genome-wide analysis of alternative transcripts where the DNA-binding sites of insulator-associated proteins are located near splicing sites. The DNA-binding sites of insulator-associated DNA-binding proteins were found within 200 bp centered on splice sites more significantly than the other DNA-binding proteins (Fig. 4e and Table 2). I have added the following sentences on lines 405 - 412: We performed the statistical test to estimate the enrichment of insulator-associated DNA-binding sites compared to the other DNA-binding proteins, and found that the insulator-associated DNA-binding sites were significantly more abundant at splice sites than the DNA-binding sites of the other proteins (Fig 4e and Table 2; Mann‒Whitney U test, p value 5. Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.

      I believe that the Figure 1 would help researchers in other fields who are not familiar with biological phenomena and functions to understand the study. More explanation has been included in the Figures and legends of Figs. 4 and 5 to help readers outside the immediate research field understand the figures.

      6. Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Reviewer #2's comments would be related to this comment. I have introduced a more detailed explanation of the method in the Results section, as shown in the responses to Reviewer #2's comments.

      Reviewer #2

      1. Introduction, line 95: CTCF appears two times, it seems redundant.

      On lines 91-93, I deleted the latter CTCF from the sentence "We examine the directional bias of DNA-binding sites of CTCF and insulator-associated DBPs, including those of known DBPs such as RAD21 and SMC3".

      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?

      Although CTCF is known to be the main insulator protein in vertebrates, we found that 97 DNA-binding proteins including CTCF and cohesin are associated with insulator sites by modifying and developing a machine learning method to search for insulator-associated DNA-binding proteins. Most of the insulator-associated DNA-binding proteins showed the directional bias of DNA-binding motifs, suggesting that the directional bias is associated with the insulator.

       I have added the sentence in lines 96-99 as follows: Furthermore, statistical testing the contribution scores between the directional and non-directional DNA-binding sites of insulator-associated DBPs revealed that the directional sites contributed more significantly to the prediction of gene expression levels than the non-directional sites. I have revised the statement in lines 101-110 as follows: To validate these findings, we demonstrate that the DNA-binding sites of the identified insulator-associated DBPs are located within potential insulator sites, and some of the DNA-binding sites in the insulator site are found without the nearby DNA-binding sites of CTCF and cohesin. Homologous and heterologous insulator-insulator pairing interactions are orientation-dependent, as suggested by the insulator-pairing model based on experimental analysis in flies. Our method and analyses contribute to the identification of insulator- and chromatin-associated DNA-binding sites that influence EPIs and reveal novel functional roles and molecular mechanisms of DBPs associated with transcriptional condensation, phase separation and transcriptional regulation.
      

      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS.

      On lines 121-124, to explain the procedure for the SNP of an eQTL, I have added the sentence in the Methods: "If a DNA-binding site was located within a 100-bp region around a single-nucleotide polymorphism (SNP) of an eQTL, we assumed that the DNA-binding proteins regulated the expression of the transcript corresponding to the eQTL".

      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details.

      On line 119, I have included the explanation of the eQTL dataset of GTEx v8 as follows: " The eQTL data were derived from the GTEx v8 dataset, after quality control, consisting of 838 donors and 17,382 samples from 52 tissues and two cell lines". On lines 681 and 865, I have added the filename of the eQTL data "(GTEx_Analysis_v8_eQTL.tar)".

      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.

      The reviewer would mention Figure 2, not Figure 1. If so, the matrices in panels a and b in Figure 2 are equivalent. I have shown it in the figure: The same figure in panel a is rotated 90 degrees to the right. The green boxes in the matrix show the regions with the ChIP-seq peak of a DNA-binding protein overlapping with a SNP of an eQTL. I used eQTL data to associate a gene with a ChIP-seq peak that was more than 2 kb upstream and 1 kb downstream of a transcriptional start site of a gene. For each gene, the matrix was produced and the gene expression levels in cells were learned and predicted using the deep learning method. I have added the following sentences to explain the method in lines 133 - 139: Through the training, the tool learned to select the binding sites of DNA-binding proteins from ChIP-seq assays that were suitable for predicting gene expression levels in the cell types. The binding sites of a DNA-binding protein tend to be observed in common across multiple cell and tissue types. Therefore, ChIP-seq data and eQTL data in different cell and tissue types were used as input data for learning, and then the tool selected the data suitable for predicting gene expression levels in the cell types, even if the data were not obtained from the same cell types.

      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?

      As suggested by the reviewer, to help readers understand the observation, I have added Supplementary Fig. S4c to show the distribution of DNA-binding sites of "CTCF, RAD21, and SMC3" and "BACH2, FOS, ATF3, NFE2, and MAFK" around chromatin interaction sites. I have modified the following sentence to indicate the figure on line 501: Although a DNA-binding-site distribution pattern around chromatin interaction sites similar to those of CTCF, RAD21, and SMC3 was observed for DBPs such as BACH2, FOS, ATF3, NFE2, and MAFK, less than 1% of the DNA-binding sites of the latter set of DBPs colocalized with CTCF, RAD21, or SMC3 in a single bin (Fig. S4c).

       In Aljahani A et al. *Nature Communications* 2022, we find that depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Together, our data show that loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression. Goel VY et al. *Nature Genetics* 2023 mentioned in the abstract: Microcompartments frequently connect enhancers and promoters and though loss of loop extrusion and inhibition of transcription disrupts some microcompartments, most are largely unaffected. These results suggested that chromatin loops can be driven by other DBPs independent of the known CTCF/Cohesin.
      
      I added the following sentence on lines 569-577: The depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. Furthermore, the loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression.
      
       FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates (Ji D et al. *Molecular Cell* 2024). CTCF have also found to form transcriptional condensate and phase separation (Lee R et al. *Nucleic acids research* 2022). FOS was found to be an insulator-associated DNA-binding protein in this study and is potentially involved in chromatin remodeling, transcription condensation, and phase separation with the other factors such as BACH2, ATF3, NFE2 and MAFK. I have added the following sentence on line 556: FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates.
      

      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?

      Goel VY et al. Nature Genetics 2023 identified highly nested and focal interactions through region capture Micro-C, which resemble fine-scale compartmental interactions and are termed microcompartments. In the section titled "Most microcompartments are robust to loss of loop extrusion," the researchers noted that a small proportion of interactions between CTCF and cohesin-bound sites exhibited significant reductions in strength when cohesin was depleted. In contrast, the majority of microcompartmental interactions remained largely unchanged under cohesin depletion. Our findings indicate that most P-P and E-P interactions, aside from a few CTCF and cohesin-bound enhancers and promoters, are likely facilitated by a compartmentalization mechanism that differs from loop extrusion. We suggest that nested, multiway, and focal microcompartments correspond to small, discrete A-compartments that arise through a compartmentalization process, potentially influenced by factors upstream of RNA Pol II initiation, such as transcription factors, co-factors, or active chromatin states. It follows that if active chromatin regions at microcompartment anchors exhibit selective "stickiness" with one another, they will tend to co-segregate, leading to the development of nested, focal interactions. This microphase separation, driven by preferential interactions among active loci within a block copolymer, may account for the striking interaction patterns we observe.

       The authors of the paper proposed several mechanisms potentially involved in microcompartments. These mechanisms may be involved in looping with insulator function. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently (Hsieh TS et al. *Nature Genetics* 2022). Among the identified insulator-associated DNA-binding proteins, Maz and MyoD1 form loops without CTCF (Xiao T et al. *Proc Natl Acad Sci USA* 2021 ; Ortabozkoyun H et al. *Nature genetics* 2022 ; Wang R et al. *Nature communications* 2022). I have added the following sentences on lines 571-575: Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. I have included the following explanation on lines 582-584: Maz and MyoD1 among the identified insulator-associated DNA-binding proteins form loops without CTCF.
      
       As for the directionality of CTCF, if chromatin loop anchors have some structural conformation, as shown in the paper entitled "The structural basis for cohesin-CTCF-anchored loops" (Li Y et al. *Nature* 2020), directional DNA binding would occur similarly to CTCF binding sites. Moreover, cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops (Davidson IF et al. *Nature Reviews Molecular Cell Biology* 2021). Regarding loop extrusion, the 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions (Guerin TM et al. *EMBO Journal* 2024). I have added the following sentences on lines 543-547: Cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops. I have included the following sentences on lines 577-582: The 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions.
      
       Another model for the regulation of gene expression by insulators is the boundary-pairing (insulator-pairing) model (Bing X et al. *Elife* 2024) (Ke W et al. *Elife* 2024) (Fujioka M et al. *PLoS Genetics* 2016). Molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies. Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent. I have summarized the model on lines 559-567: Other types of chromatin regulation are also expected to be related to the structural interactions of molecules. As the boundary-pairing (insulator-pairing) model, molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies (Fig. 7). Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent.
      

      8. Do the authors think that the identified DBPs could work in that way as well?

      The boundary-pairing (insulator-pairing) model would be applied to the insulator-associated DNA-binding proteins other than CTCF and cohesin that are involved in the loop extrusion mechanism (Bing X et al. Elife 2024) (Ke W et al. Elife 2024) (Fujioka M et al. PLoS Genetics 2016).

       Liquid-liquid phase separation was shown to occur through CTCF-mediated chromatin loops and to act as an insulator (Lee, R et al. *Nucleic Acids Research* 2022). Among the identified insulator-associated DNA-binding proteins, CEBPA has been found to form hubs that colocalize with transcriptional co-activators in a native cell context, which is associated with transcriptional condensate and phase separation (Christou-Kent M et al. *Cell Reports* 2023). The proposed microcompartment mechanisms are also associated with phase separation. Thus, the same or similar mechanisms are potentially associated with the insulator function of the identified DNA-binding proteins. I have included the following information on line 554: CEBPA in the identified insulator-associated DNA-binding proteins was also reported to be involved in transcriptional condensates and phase separation.
      

      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?

      Snead WT et al. Molecular Cell 2019 mentioned that protein post-transcriptional modifications (PTMs) facilitate the control of molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin (Tang X et al. Nature Communications 2024). I found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Supplementary Fig. 2d). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation by PTMs. I have added the following explanation on lines 584-590: Furthermore, protein post-transcriptional modifications (PTMs) facilitate control over the molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin. We found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Fig. 4f and Supplementary Fig. 3c). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation through PTMs.

      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Structural molecular model of cohesin-CTCF-anchored loops has been published by Li Y et al. Nature 2020. The structural conformation of CTCF and cohesin in the loops would be the cause of the directional bias of CTCF binding sites, which I mentioned in lines 539 - 543 as follows: These results suggest that the directional bias of DNA-binding sites of insulator-associated DBPs may be involved in insulator function and chromatin regulation through structural interactions among DBPs, other proteins, DNAs, and RNAs. For example, the N-terminal amino acids of CTCF have been shown to interact with RAD21 in chromatin loops.

       To investigate the principles underlying the architectural functions of insulator-insulator pairing interactions, two insulators, Homie and Nhomie, flanking the *Drosophila even skipped *locus were analyzed. Pairing interactions between the transgene Homie and the eve locus are directional. The head-to-head pairing between the transgene and endogenous Homie matches the pattern of activation (Fujioka M et al. *PLoS Genetics* 2016).
      

      Reviewer #3

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.

      When a protein complex binds to DNA, one protein of the complex binds to the DNA directory, and the other proteins may not bind to DNA. However, the DNA motif sequence bound by the protein may be registered as the DNA-binding motif of all the proteins in the complex. The molecular structure of the complex of CTCF and Cohesin showed that both CTCF and Cohesin bind to DNA (Li Y et al. Nature 2020). I think there is a possibility that if the molecular structure of a protein complex becomes available, the previous recognition of the DNA-binding ability of a protein may be changed. Therefore, I searched the Pfam database for 99 insulator-associated DNA-binding proteins identified in this study. I found that 97 are registered as DNA-binding proteins and/or have a known DNA-binding domain, and EP300 and SIN3A do not directory bind to DNA, which was also checked by Google search. I have added the following explanation in line 257 to indicate direct and indirect DNA-binding proteins: Among 99 insulator-associated DBPs, EP300 and SIN3A do not directory interact with DNA, and thus 97 insulator-associated DBPs directory bind to DNA. I have updated the sentence in line 20 of the Abstract as follows: We discovered 97 directional and minor nondirectional motifs in human fibroblast cells that corresponded to 23 DBPs related to insulator function, CTCF, and/or other types of chromosomal transcriptional regulation reported in previous studies.

      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.

      As the reviewer mentioned, I recognize enhancers are relatively small regions. In the paper, I intended to examine further upstream and downstream of promoter regions where enhancers are found. Therefore, I have modified the sentence in lines 929 - 931 of the Fig. 2 legend as follows: Enhancer-gene regulatory interaction regions consist of 200 bins of 10 kbp between -1 Mbp and 1 Mbp region from TSS, not including promoter.

      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.

      Following the reviewer's advice, I have added the ChIP-seq data of H3K9me3 as a truck of the UCSC Genome Browser. The distribution of H3K9me3 signal was different from that of H3K27me3 in some regions. I also found the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions and took some screenshots of the UCSC Genome Browser of the regions around the sites in Supplementary Fig. 3b. I have modified the following sentence on lines 974 - 976 in the legend of Fig. 4: a Distribution of histone modification marks H3K27me3 (green color) and H3K9me3 (turquoise color) and transcript levels (pink color) in upstream and downstream regions of a potential insulator site (light orange color). I have also added the following result on lines 356 - 360: The same analysis was performed using H3K9me3 marks, instead of H3K27me3 (Fig. S3b). We found that the distribution of H3K9me3 signal was different from that of H3K27me3 in some regions, and discovered the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions (Fig. S3b).

      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      The resolution of the Micro-C assay is considered to be 100 bp and above, as the human nucleome core particle contains 145 bp (and 193 bp with linker) of DNA. However, internucleosomal DNA is cleaved by endonuclease into fragments of multiples of 10 nucleotides (Pospelov VA et al. Nucleic Acids Research 1979). Highly nested focal interactions were observed (Goel VY et al. Nature Genetics 2023). Base pair resolution was reported using Micro Capture-C (Hua P et al. Nature 2021). Sub-kilobase (20 bp resolution) chromatin topology was reported using an MNase-based chromosome conformation capture (3C) approach (Aljahani A et al. Nature Communications 2022). On the other hand, Hi-C data was analyzed at 1 kb resolution. (Gu H et al. bioRxiv 2021). If the resolution of Micro-C interactions is at best at 1 kb, the binding sites of a DNA-binding protein will not show a peak around the center of the genomic locations of interaction edges. Each panel shows the number of binding sites of a specific DNA-binding protein at a specific distance from the midpoint of all chromatin interaction edges. I have modified and added the following sentences in lines 593-597: High-resolution chromatin interaction data from a Micro-C assay indicated that most of the predicted insulator-associated DBPs showed DNA-binding-site distribution peaks around chromatin interaction sites, suggesting that these DBPs are involved in chromatin interactions and that the chromatin interaction data has a high degree of resolution. Base pair resolution was reported using Micro Capture-C.

      Minor Comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., ____https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2____or ____https://pubmed.ncbi.nlm.nih.gov/37486787____/). The authors should discuss how that would impact their results.

      The directional bias of CTCF binding sites was identified by ChIA-pet interactions of CTCF binding sites. The analysis of the contribution scores of DNA-binding sites of proteins considering the binding sites of CTCF as an insulator showed the same tendency of directional bias of CTCF binding sites. In the analysis, to remove the false-positive prediction of DNA-binding sites, I used the binding sites that overlapped with a ChIP-seq peak of the DNA-binding protein. This result suggests that the DNA-binding sites of CTCF obtained by the current analysis have sufficient quality. Therefore, if the accuracy of prediction of DNA-binding sites is improved, although the number of DNA-binding sites may be different, the overall tendency of the directionality of DNA-binding sites will not change and the results of this study will not change significantly.

       As for the first reference in the reviewer's comment, chromatin interaction data from Micro-C assay does not include all chromatin interactions in a cell or tissue, because it is expensive to cover all interactions. Therefore, it would be difficult to predict all chromatin interactions based on machine learning. As for the second reference in the reviewer's comment, pioneer factors such as FOXA are known to bind to closed chromatin regions, but transcription factors and DNA-binding proteins involved in chromatin interactions and insulators generally bind to open chromatin regions. The search for the DNA-binding motifs is not required in closed chromatin regions.
      

      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      In the DeepLIFT paper, the authors explain that DeepLIFT is a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input (Shrikumar A et al. ICML 2017). DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

       Truly explainable AI would be able to find cause and reason, and to make choices and decisions like humans. DeepLIFT does not perform causal inferences. I did not use the term "Explainable AI" in our manuscript, but I briefly explained it in Discussion. I have added the following explanation in lines 623-628: AI (Artificial Intelligence) is considered as a black box, since the reason and cause of prediction are difficult to know. To solve this issue, tools and methods have been developed to know the reason and cause. These technologies are called Explainable AI. DeepLIFT is considered to be a tool for Explainable AI. However, DeepLIFT does not answer the reason and cause for a prediction. It calculates scores representing the contribution of the input data to the prediction.
      
       Furthermore, to improve the readability of the manuscript, I have included the following explanation in lines 159-165: we computed DeepLIFT scores of the input data (i.e., each binding site of the ChIP-seq data of DNA-binding proteins) in the deep leaning analysis on gene expression levels. DeepLIFT compares the importance of each input for predicting gene expression levels to its 'reference or background level' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.
      
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Osato and Hamada propose a systematic approach to identify DNA binding proteins that display directional binding. They used a modified Deep Learning method (DEcode) to investigate binding profiles of 1356 DBP from GTRD database at promoters (30 of 100bp bins around TSS) and enhancers (200 bins of 10Kb around eSNPs) and use this to predict expression of 25,071 genes in Fibroblasts, Monocytes, HMEC and NPC. This method achieves a good prediction power (Spearman correlation between predicted and actual expression of 0.74). They then use PIQ, and overlap predicted binding sites with actual ChIP-seq data to investigate the motifs of TFs that are controlling gene expression. They find 99 insulator proteins showing either a specific directional bias or minor non-directional bias, corresponding to 23 DBP previously reported to have insulator function. Of the 23 proteins they identify as regulating enhancer promoter interactions, 13 are associated with CTCF. They also show that there are significantly more insulator proteins binding sites at borders of polycomb domains, transcriptionally active or boundary regions based on chromatin interactions than other proteins.

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.
      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.
      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.
      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.
      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Referee Cross-Commenting

      I would like to mention that I agree with the comments of reviewers 1 and 2.

      Significance

      General assessment:

      This is the first study to my knowledge that attempts to use Deep Learning to identify insulators and directional biases in binding. One of the limitations is that no additional methods were used to show that these DBP have directional binding bias. It is not necessarily to employ additional methods, but it would definitely strengthen the paper.

      Advancements:

      This is a useful catalogue of potential DNA binding proteins of interest, beyond just CTCF. Some known TFs are there, but also new ones are found.

      Audience:

      Basic research mainly, with particular focus on chromatin conformation and TF binding fields.

      My expertise:

      ML/AI methods in genomics, TF binding models, epigenetics and 3D chromatin interactions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see my points below). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the following list.

      Also, I encourage the authors to integrate the current presentation of the data with other (published) data about chromatin architecture, to make more robust the claims and go deeper into the biological implications of the current work. Se my list below.

      It follows a specific list of relevant points to be addressed:

      Specific points:

      1. Introduction, line 95: CTCF appears two times, it seems redundant;
      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?
      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS;
      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details;
      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.
      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?
      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?
      8. Do the authors think that the identified DBPs could work in that way as well?
      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?
      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Significance

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, chromatin organization is an important topic in the context of a constantly expanding research field. Therefore, the work is timely and could be useful for the community. The paper appears overall well written and the figures look clear and of good quality. Nevertheless, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see list of specific points). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the above reported points.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Osato and Hamada aims at computationally identifying a set of novel putative insulator-associated DNA binding proteins (DBPs) via estimation of their contribution to the expression of genes in the same chromosome region of their binding sites (+- 1Mbp from TSS). To achieve this, the authors leverage a deep learning architecture already published via which ChIP-seq peaks of DBPs in the TSS of a given gene are used to predict its expression level in four human cell lines.

      Building on this, the authors used another tool called DeepLIFT to evaluate the weight of each DBP binding site on the final gene expression value. Hence they made the assumption that if a given DBP had an insulator function they could restrict the prediction of the gene's expression to the region included between pairs of that DBP binding sites, and evaluate the pair's motif directionality bias in the distribution of weights. They exemplify their approach's validity by the fact that they can predict the known directionality bias of CTCF/cohesin-bound sites as the highest of the lot, with the F-R orientation of the pairs the most enriched, recapitulating what already known in literature: i.e., that F-R chromatin interaction peaks are the most enriched. In addition, they find several new DBPs showing significant directionality bias; hence they could be candidates for insulation activity. They then provide correlation between these putative insulator binding sites and sites of transition between euchromatin and heterochromatin by independently using histone mark and gene expression datasets. This, of course, is not surprising because (a) there is insulation between regions with heterotypic chromatin identities, and (b) it was already known from the first papers describing insulated chromatin domains that their boundaries were well-enriched for active transcription and transcriptional regulators (e.g., Dixon et al, Nature 2012).

      Finally, they use chromatin interaction (looping) sites to check the overlap between CTCF and all other DBPs and define a subset of putative insulator DBPs not overlapping CTCF peaks, suggesting potentially new insulatory mechanisms. These factors were all known transcriptional activators, but this part of the findings carry most of the novelty in the work and have the potential of opening up new directions for research in chromatin organization.

      Overall, the methodology applied here is adequate, clear, and reproducible. The major issue, in our view, is that the entire manuscript's findings relies on the usage of deepLIFT, a tool which was not benchmarked previously or by the current study. In fact, deepLIFT is public as regards its code, and also appears as a preprint from 2017 on biorXiv and published in the Proceedings of Machine Learning Research conference. Also, this key tool was developed by the Kundaje lab (who produce high quality alogrithms), and not by the authors. Therefore, the manuscript is predominantly based on the execution of existing workflows to publicly-available data. This does not take anything away from the interesting question posed here, but at the same time does not provide the community with any new algorithm/workflow.

      Finally, although I appreciate that the authors are purely computational and have likely no capacity for experimental validation of their claims of new DBPs having insulator roles, I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning. Using this kind of data, effects on gene expression can at least be tested in regard to the authors' predictions. Moreover, in terms of validation, Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As secondary issues, we would point out that:

      • The suggested alternative transcripts function, also highlighted in the manuscript;s abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.
      • Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.
      • Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Significance

      The scientific novelty of the work lies primarily in the identification of a set of DBPs that are proposed to confer insulator activity genome-wide. This has been long sought after in human data (whilst it is well understood and defined in Drosophila). The authors produce a quantitative ranking of the putative insulation effect of these DBPs and, most importantly, go on to identify a smaller subset that are apparently non-overlapping with anchors of CTCF-cohesin loop anchors; the presence of strong motif orientation biases in many DBPs can also be of broad interest, especially those that cannot be trivially ascribable to the loop extrusion process.

      However, although these findings open the way for speculation on multiple insulation mechanisms via proteins with multiple regulatory functions, the manuscript provide no experimental or computational means to test the proposed roles of these DBPs - and, as such, this limits the potential impact of the work and mostly targets researchers in the field of genome organization that can test these findings. Having said this, if validated, this work can significantly broaden our understanding of how chromatin is organized in 3D nuclear space.

      I typically identify myself to the authors: A. Papantonis, expertise in 3D genome architecture, chromatin biology, and genomics/bioinformatics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments


      Reviewer #1 (Evidence, reproducibility and clarity):

      SUMMARY: The manuscript is well written, with excellent explanation and documentation of experimental approaches. All conclusions are well supported by the data. The discussion is balanced and appropriate. The data, including images and movies, are of high quality and beautifully presented. The experimental design and analysis, including quantification of parameters in the images, is rigorous. Additional rigor is provided by comparing different cell types. The rapalog and iLID dimerization strategies have been described previously, as has their use to recruit kinesin motors to membranous organelles. However, this is the first application of these strategies to recruit motors to intermediate filaments. The evidence that vimentin filaments can be redistributed locally is clear and convincing and offers appealing potential for future experimentation. The redistribution was not fully reversible in all cells, but this is not surprising given the entanglement that must result from the action of motors along the length of these long flexible polymers.

      In terms of the biology of intermediate filaments, the authors show that vimentin redistribution had negligible effect on microtubule or F-actin organization, cell area, or the number of focal adhesions. Depletion of vimentin filaments locally reduced cell stiffness. Both ER and mitochondria segregated with vimentin filaments, but not lysosomes. These findings are consistent with published reports (e.g. comparing vimentin null and wildtype cell lines), but the acute and reversible nature of the motor recruitment strategy is a more elegant experimental approach, and the selectivity of the observed effects is evidence of its specificity. It is interesting that the ER network segregated with vimentin even in the absence of RNF26. While this is not explored further, it points to the potential power of this motor recruitment strategy for future studies on intermediate filament interactions.

      • *

      The following are some major and minor issues, which should all be easy for the authors to address.

      MAJOR COMMENTS:

        • Fig. S1 shows that the Vim-mCherry-FKBP construct coassembles with endogenous vimentin, but similar data for the iLID constructs appears to be lacking. I would like to see data demonstrating the incorporation of the Vim-mCherry-SspB constructs into the vimentin filaments. This should include high magnification images of single filaments in the cytoplasm of the cells.*
      • *

      Response:

      We have included a new Figure 2D, which illustrates the incorporation of the vimentin-mCherry-SspB construct into the vimentin network stained for endogenous vimentin.

        • The authors do not discuss the density of motor recruitment along the filaments. To address this, I'd like to see images showing the extent of recruitment of motors to the filaments using the rapalog and LID strategies. This should include high magnification images of single filaments in the cytoplasm of the cells.*
      • *

      Response:

      We have included new Figure S1B,C and Figure S2A, which illustrate the recruitment of kinesin motors to vimentin filaments upon induction with rapalog or light, respectively, by using super-resolution imaging with an Airyscan microscope. The motors were stained with antibodies against GFP. These data are discussed in the text, lines 126-132 and 165-168.

        • For the experiments on vimentin and keratin organization, the authors do not explain that these proteins form distinct networks and do not coassemble. The authors should show this in the cell types examined. This should also be explained explicitly in the body of the manuscript, though the data could be placed in the supplementary data. This is important because many intermediate filaments can coassemble freely, and coassembled proteins would be expected to segregate together.*
      • *

      Response:

      To address this important comment, we have now included images of vimentin and keratin in the three studied cell types using super-resolution imaging, both for cells expressing vimentin constructs (updated Figure 5) and endogenous filament staining in untransfected cells (updated Figure S4). These images illustrate that vimentin and keratin mostly form distinct filaments in HeLa cells. However, we do observe some degree of co-assembly of vimentin and keratin in COS-7 and U2OS cells. We were really surprised by this observation as, to our knowledge, it has not been clearly documented in the literature. These data help to explain why vimentin pulling causes keratin co-clustering in COS-7 and U2OS cells. We note that in a study where kinesin-1 mediated transport of vimentin and keratin has been previously investigated by the Gelfand lab in RPE1 cells, the two networks also appear to overlap quite strongly (Robert et al, 2019, FASEB J). Since no super-resolution microscopy was performed in that study, potential co-assembly of keratin and vimentin filaments was not discussed. Colocalization and coprecipitation of vimentin and keratin have been also described by Velez-delValle et al. in epithelial cells (Sci Rep 2016). Cell type-specific co-assembly of keratin and vimentin would require more investigation, and we make no strong conclusions about it, but we think that our data illustrate the usefulness of our methodology to address the co-dependence of different types of intermediate filaments.

      MINOR COMMENTS:

        • The authors refer to selecting cells within an "optimized expression range" for their transiently expressed recombinant proteins. They should state the proportion of the cells that met this criterion in their transient transfection experiments as this is important information for other researchers that might wish to use this approach in their own studies*. Response:

      These numbers are now included in lines 137 -142 and 173-176 of the revised paper. For the FRB-FKP system, ~50% of transfected cells could be used for analysis, for the light-induced system, ~40% were in the optimal range.

        • In Fig. 1F there should be a statistical comparison between cells transfected with the Kin14 construct and control (untransfected) cells in the absence of rapalog*
      • *

      Response:

      This comparison has been added.

        • In Fig. 1G there should be a statistical comparison between cells expressing Kin14 and KIF5A in the absence of rapalog.*
      • *

      Response:

      This comparison has been added.

        • The depletion of the ER network in the cell periphery is not evident in Fig. 7B, though the perinuclear accumulation is evident. Perhaps the authors could select another example or explain to the reader what exactly to look for in these images.*
      • *

      Response:

      We note that Figure 7B is a line scan of the image shown in Figure 7A. We assume that the reviewer meant Figure 7C, which is discussed in detail below.

        • In Fig. 7C, the intensity of the mCherry declines markedly over time. This is presumably due to photobleaching but should be explained in the legend.*
      • *

      Response:

      We have now improved Figure 7 by adding additional quantifications of ER and vimentin intensity and distribution in Figures 7D and E. We also extended the corresponding text (lines 288-297), which now reads; “Using the optogenetic tool, we observed that ER sheets and matrices, but not tubules, were pulled along with vimentin, confirming their previously described direct connections (Cremer et al., 2023) (black arrows, Figure 7C; Video S5). Most of the vimentin and ER repositioning occurred within approximately 10 minutes (Figure 7C, D, Video S5). While initially this resulted in a sparser tubular ER network at the cell periphery, over time, the network became denser, with smaller polygonal structures. This effect could also be observed in the ratio of perinuclear to peripheral intensity, where a subset of ER initially follows vimentin to the perinuclear region but then redistributes again towards the cell periphery (Figure 7D). It should be noted that while photobleaching of the ER channel was negligible, there was a 40% reduction in total Vim-mCh-SspB intensity over the course of the experiment due to photobleaching (Figure 7E).”

      • *

      Reviewer #1 (Significance):

      SUMMARY: The authors show that chemical-induced and light-induced dimerization strategies can be used to recruit microtubule motors to vimentin filaments, allowing rapid and reversible experimental manipulation of vimentin filament organization either locally or globally in cells. These strategies provide an experimental approach for investigating the physical interaction of intermediate filaments with organelles and other cytoskeletal component, as well as a method for probing the role of intermediate filaments in cell mechanics, cytoskeletal dynamics, etc. This is a technical improvement over previous experimental strategies, which have relied largely on chronic manipulation such as global disassembly or genetic deletion of intermediate filaments, e.g. comparison of vimentin null and wild type cells.

      The principal weakness of this study is that it offers limited insight into intermediate filament biology. As such, it might be most appropriate for a tools or techniques section of a journal. The dimerization strategies have been reported previously, so that is not new, but the application to intermediate filaments is novel.

      • *

      Response:

      We agree that our paper is primarily of technical nature and thus would be most appropriate for the tools and techniques section of a journal. We also agree that we used motor recruitment strategies that we and others have employed previously. However, we would like to emphasize that the demonstration that the tools work very well for intermediate filaments is entirely novel, as are the observations that these tools can be used to very rapidly alter cell stiffness or probe the links between intermediate filaments and organelles. Most importantly, the intermediate filament field currently lacks rapid specific manipulation strategies, and our tools will allow revisiting many important pending questions in the field. For example, they will allow to distinguish short-term and direct effects of intermediate filaments on cell polarity, adhesion and migration from their function in signaling and gene expression. We also report some new biology, such as evidence of some degree of co-assembly of vimentin and keratin.

      AUDIENCE: This paper will be of interest to cell biologists who study cytoskeletal interactions, particularly the interaction of intermediate filaments with other cellular organelles or cytoskeletal polymers, or the role of intermediate filaments in cellular mechanics.

      REVIEWER EXPERTISE: This reviewer has expertise on the cytoskeleton, cytoskeletal dynamics, and intracellular transport including intermediate filament biology.

      __ __


      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary: The manuscript presents a novel methodology for acute manipulation of vimentin intermediate filaments (IFs) using chemical genetic and optogenetic tools. By recruiting microtubule-based motors to vimentin via inducible dimerization systems, the authors achieve precise temporal and spatial control over vimentin distribution. Apart from the significant advancement in terms of methods development, key findings include:

      * Vimentin's role in organelle positioning: Mitochondria and ER are repositioned with vimentin, while lysosomes are less dependent on its organization.

      * Cytoskeletal interactions: Vimentin clustering minimally impacts actin and microtubule networks in the short term.

      * Cell stiffness: Vimentin repositioning reduces cell stiffness, indicating its significant role in cellular mechanics.

      * Cell-type-specific keratin interactions: The study highlights diverse interactions between vimentin and keratin-8 across cell lines.

      The study demonstrates methodological advancements enabling rapid vimentin manipulation and provides insights into vimentin's interactions with cellular structures.

      A major shortcoming is the unclear narrative, what do the authors want to present? This aspect requires significant attention.

      Response:

      By “unclear narrative” the reviewer meant that we should have provided a more balanced discussion of the insights that could be obtained using our new method compared to previously published literature, and we have modified our narrative accordingly.

      General Comments and Overall Assessment

      The manuscript represents an interesting contribution to the cytoskeletal field, addressing limitations of long-term perturbation methods. The tools developed are innovative, allowing controlled and reversible vimentin reorganization with minimal off-target effects. The findings are robust and provide important insights into the role of vimentin in cellular mechanics and organelle positioning.

      Strengths:

      Methodological novelty with broad applicability - this is the most exciting aspect.

      Comprehensive validation of the tools in multiple cell lines.

      Clear differentiation between vimentin's short- and long-term roles.

      Addressing gaps in understanding vimentin-organelle interactions.

      Limitations:

      * The manuscript is a little bit all over the place. While the method development is clear, the manuscript makes claims way beyond the method development. The message and narrative needs to be improved, and in the respect the whole structure needs an overhaul.

      Response:

      We have carefully modified the manuscript to avoid the impression that we make any claims that go beyond the immediate and quantifiable effects of vimentin repositioning on different cellular structures.

      * Unclear how much the differences in expression levels impact results and reproducibility.

      Response:

      Quantifications of expression levels and their discussion are included in Figures 1G-I, 2G-H, S2B and lines 137-142 and 173-176.

      * Would be good to discuss some findings that are specific to a given experimental cell line. How generalizable are these results?

      Response:

      Cell line-specific findings concerned mostly the co-displacement of keratin together with vimentin, which occurred in COS-7 and U2OS cells but in in HeLa cells. This interesting finding is discussed in the text, lines 246-269 and 375-383 (see also our answers on page 3 above and page 7 below).

      Major Comments

      Evidence and Claims:

      * While the methodological aspect is very strong the balance between presenting a novel method and presenting specific cell biological findings needs to be improved. Now it is quite unclear what the manuscript wants to present.

      * The abstract needs a complete overhaul. From reading the abstract, it is not clear what the manuscript wants to present.

      Response:

      We have modified the abstract to make it more clear that we do not make any general claims on the impact of vimentin on the interactions and functions of different organelles, but rather describe what can be directly observed after the acute displacement of vimentin and which conclusions can be made from these observations.

      Regarding the research findings there are a number of things for the authors to consider. Since the methods aspect is, in the eyes of this reviewer, in focus, I have not stringently assessed the experimental findings. Hence, the comments below are things to be considered in order to make the findings related to IF research stronger:

      • *

      * Cell-specific keratin interactions: The manuscript could benefit from some further validation of the physical interactions between vimentin and keratin-8 across different cell types.

      Response:

      We have improved the images of keratin and vimentin by using super-resolution (Airyscan) microscopy to show that they indeed form distinct filaments in HeLa cells, whereas in COS-7 and U2OS cells, where their co-displacement occurs, they can also incorporate into the same filaments. This observation was very surprising but agrees with the data published by the Gelfand lab on similarity in the distribution pattern and co-transport of vimentin and keratin in RPE1 cells (Robert et al, 2019, FASEB J). Colocalization and coprecipitation of vimentin and keratin has been also described by Velez-delValle at al. in epithelial cells (Sci Rep 2016).

      * Impact on microtubules: The disorganization of stable microtubules in cells expressing KIF5A was attributed to overexpression effects. It would be helpful to include additional controls, such as expressing KIF5A without vimentin constructs, to confirm this claim.

      Response:

      This control has been included in the new Figure S3. We note that this observation fully aligns with data published by another lab (Andreu-Carbó et al, 2024, Nat Comm).

      * ER-vimentin linkages: The observation that ER-vimentin interactions persist in RNF26 knockout cells is intriguing. The manuscript would benefit from a discussion on possible candidates for alternative linkers.

      Response:

      We have added a short discussion (lines 394-398) about the potential involvement of nesprins, such as nesprin-3, because they can connect the nuclear envelope to intermediate filaments, and might also partly participate in ER sheet-IF connections because ER and nuclear membranes are continuous and show some overlap in proteome.

      * Construct variability: Do the authors have some data on how much Expression level differences significantly affect the outcomes (e.g., incomplete recovery)?

      Response:

      We have added a figure (Figure S2B), which shows that incomplete recovery of vimentin clustering does not correlate with protein expression levels and likely depends on other factors, which could possibly be the cell cycle phase or degree of vimentin entanglement after repositioning. This point is discussed in revised text, lines 194-197.


      Reviewer #2 (Significance):

      Significance

      General Assessment: The study represents a significant technical advance in the study of cytoskeletal dynamics. The tools developed address critical limitations of traditional vimentin perturbation methods, allowing for spatiotemporally precise manipulation without long-term effects on gene expression or signaling pathways.

      Novelty:

      This is, to my knowledge, the first demonstration of reversible and acute vimentin repositioning using optogenetics. The study extends understanding of vimentin's short-term mechanical and organizational roles, distinguishing them from compensatory effects observed in knockdown models.

      Audience and Impact: The manuscript will appeal to researchers in cytoskeletal dynamics, cell mechanics, and organelle biology. The tools have broader applicability in studying other cytoskeletal systems and could inspire translational applications, such as investigating the role of vimentin in cancer or fibrosis.

      The reference list provide a relatively representative selection of articles relevant for the article. However, the authors may consider whether there could be relevant information in the relatively recent special edition of Current Opinion in Cell Biology, which focused on IFs, specially featuring vimentin https://www.sciencedirect.com/special-issue/10TFHK2QCKW

      Response:

      We thank the reviewer for this excellent suggestion, and we have included some additional references from this issue.

      Field of Expertise

      I specialize in cell biology, intermediate filaments, post-translational modifications, cytoskeletal dynamics, and advanced microscopy techniques.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:

      This is an excellent paper describing the use of chemical and light-induced heterodimerization of microtubule-based motors to rapidly disrupt the distribution of the vimentin cytoskeletal network. Rapid clustering of vimentin did not significantly affect the microtubule or actin networks, cell spreading or focal adhesions. Other organelles were repositioned together with vimentin. Interestingly, in some cell lines, keratin networks were displaced along with vimentin while in other cells they were not.

      Major comments:

      The conclusions are well supported by the data presented and appropriate controls are included.

      Optional comments:

        • The authors should expand on why they think the plus end directed KIF5A gives such a strong localization of vimentin to the perinuclear area.* Response:

      We think that two factors can contribute to this counterintuitive effect. First, vimentin is strongly concentrated and entangled in the perinuclear region, and displacement of some vimentin filaments to the cell periphery can cause the collapse of the rest to the cell center, with kinesins being unable to pull the perinuclear network apart. Second, kinesin-1 KIF5A is a motor that strongly prefers stable, post-translationally modified microtubules, and our previous study has shown that a significant proportion of such microtubules are located with their minus ends facing towards the cell periphery (Chen et al., Elife 2016). This could contribute to the accumulation of vimentin in the cell center upon KIF5A recruitment. These considerations were added to the revised text, lines 344-347.

      • Consideration should be given to the idea that the pulling of ER and mitochondria along with the vimentin could be due to trapping of these organelles within the vimentin matrix and not necessarily due to direct interactions. Such reasoning could explain the transient localization of lysosomes with the center aggregate since lysosomes are generally not thought to significantly bind to vimentin networks.*

      Response:

      This is an excellent point, and we have included it in the revised article, lines 333-335 and 405.

      Reviewer #3 (Significance):

      This study describes some valuable tools that should be useful to cell biologists interested in determining the role of the cytoskeleton and possibly other organelles in a variety of cellular contexts. It overcomes some of the existing shortcomings of the pharmacological reagents currently available for studying intermediate filament biology and will provide a useful adjunct to other more long-term manipulations of the cytoskeleton. While much of the data presented confirm results obtained by other methods, this is a significant technical advance as it provides a short time scale, and in one instance, reversible manipulation of the cytoskeleton.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This is an excellent paper describing the use of chemical and light-induced heterodimerization of microtubule-based motors to rapidly disrupt the distribution of the vimentin cytoskeletal network. Rapid clustering of vimentin did not significantly affect the microtubule or actin networks, cell spreading or focal adhesions. Other organelles were repositioned together with vimentin. Interestingly, in some cell lines, keratin networks were displaced along with vimentin while in other cells they were not.

      Major comments:

      The conclusions are well supported by the data presented and appropriate controls are included.

      Optional comments:

      1. The authors should expand on why they think the plus end directed KIF5A gives such a strong localization of vimentin to the perinuclear area.
      2. Consideration should be given to the idea that the pulling of ER and mitochondria along with the vimentin could be due to trapping of these organelles within the vimentin matrix and not necessarily due to direct interactions. Such reasoning could explain the transient localization of lysosomes with the center aggregate since lysosomes are generally not thought to significantly bind to vimentin networks.

      Significance

      This study describes some valuable tools that should be useful to cell biologists interested in determining the role of the cytoskeleton and possibly other organelles in a variety of cellular contexts. It overcomes some of the existing shortcomings of the pharmacological reagents currently available for studying intermediate filament biology and will provide a useful adjunct to other more long-term manipulations of the cytoskeleton. While much of the data presented confirm results obtained by other methods, this is a significant technical advance as it provides a short time scale, and in one instance, reversible manipulation of the cytoskeleton.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The manuscript presents a novel methodology for acute manipulation of vimentin intermediate filaments (IFs) using chemical genetic and optogenetic tools. By recruiting microtubule-based motors to vimentin via inducible dimerization systems, the authors achieve precise temporal and spatial control over vimentin distribution. Apart from the significant advancement in terms of methods development, key findings include:

      • Vimentin's role in organelle positioning: Mitochondria and ER are repositioned with vimentin, while lysosomes are less dependent on its organization.
      • Cytoskeletal interactions: Vimentin clustering minimally impacts actin and microtubule networks in the short term.
      • Cell stiffness: Vimentin repositioning reduces cell stiffness, indicating its significant role in cellular mechanics.
      • Cell-type-specific keratin interactions: The study highlights diverse interactions between vimentin and keratin-8 across cell lines.

      The study demonstrates methodological advancements enabling rapid vimentin manipulation and provides insights into vimentin's interactions with cellular structures. A major shortcoming is the unclear narrative, what do the authors want to present? This aspect requires significant attention.

      General Comments and Overall Assessment

      The manuscript represents an interesting contribution to the cytoskeletal field, addressing limitations of long-term perturbation methods. The tools developed are innovative, allowing controlled and reversible vimentin reorganization with minimal off-target effects. The findings are robust and provide important insights into the role of vimentin in cellular mechanics and organelle positioning.

      Strengths:

      Methodological novelty with broad applicability - this is the most exciting aspect. Comprehensive validation of the tools in multiple cell lines. Clear differentiation between vimentin's short- and long-term roles. Addressing gaps in understanding vimentin-organelle interactions.

      Limitations:

      • The manuscript is a little bit all over the place. While the method development is clear, the manuscript makes claims way beyond the method development. The message and narrative needs to be improved, and in the respect the whole structure needs an overhaul.
      • Unclear how much the differences in expression levels impact results and reproducibility.
      • Would be good to discuss some findings that are specific to a given experimental cell line. How generalizable are these results?

      Major Comments

      Evidence and Claims:

      • While the methodological aspect is very strong the balance between presenting a novel method and presenting specific cell biological findings needs to be improved. Now it is quite unclear what the manuscript wants to present.
      • The abstract needs a complete overhaul. From reading the abstract, it is not clear what the manuscript wants to present.

      Regarding the research findings there are a number of things for the authors to consider. Since the methods aspect is, in the eyes of this reviewer, in focus, I have not stringently assessed the experimental findings. Hence, the comments below are things to be considered in order to make the findings related to IF research stronger:

      • Cell-specific keratin interactions: The manuscript could benefit from some further validation of the physical interactions between vimentin and keratin-8 across different cell types.
      • Impact on microtubules: The disorganization of stable microtubules in cells expressing KIF5A was attributed to overexpression effects. It would be helpful to include additional controls, such as expressing KIF5A without vimentin constructs, to confirm this claim.
      • ER-vimentin linkages: The observation that ER-vimentin interactions persist in RNF26 knockout cells is intriguing. The manuscript would benefit from a discussion on possible candidates for alternative linkers.
      • Construct variability: Do the authors have some data on how much Expression level differences significantly affect the outcomes (e.g., incomplete recovery)?

      Significance

      General Assessment: The study represents a significant technical advance in the study of cytoskeletal dynamics. The tools developed address critical limitations of traditional vimentin perturbation methods, allowing for spatiotemporally precise manipulation without long-term effects on gene expression or signaling pathways.

      Novelty:

      This is, to my knowledge, the first demonstration of reversible and acute vimentin repositioning using optogenetics. The study extends understanding of vimentin's short-term mechanical and organizational roles, distinguishing them from compensatory effects observed in knockdown models.

      Audience and Impact: The manuscript will appeal to researchers in cytoskeletal dynamics, cell mechanics, and organelle biology. The tools have broader applicability in studying other cytoskeletal systems and could inspire translational applications, such as investigating the role of vimentin in cancer or fibrosis.

      The reference list provide a relatively representative selection of articles relevant for the article. However, the authors may consider whether there could be relevant information in the relatively recent special edition of Current Opinion in Cell Biology, which focused on IFs, specially featuring vimentin https://www.sciencedirect.com/special-issue/10TFHK2QCKW

      Field of Expertise

      I specialize in cell biology, intermediate filaments, post-translational modifications, cytoskeletal dynamics, and advanced microscopy techniques.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary The manuscript is well written, with excellent explanation and documentation of experimental approaches. All conclusions are well supported by the data. The discussion is balanced and appropriate. The data, including images and movies, are of high quality and beautifully presented. The experimental design and analysis, including quantification of parameters in the images, is rigorous. Additional rigor is provided by comparing different cell types. The rapalog and iLID dimerization strategies have been described previously, as has their use to recruit kinesin motors to membranous organelles. However, this is the first application of these strategies to recruit motors to intermediate filaments. The evidence that vimentin filaments can be redistributed locally is clear and convincing and offers appealing potential for future experimentation. The redistribution was not fully reversible in all cells, but this is not surprising given the entanglement that must result from the action of motors along the length of these long flexible polymers.

      In terms of the biology of intermediate filaments, the authors show that vimentin redistribution had negligible effect on microtubule or F-actin organization, cell area, or the number of focal adhesions. Depletion of vimentin filaments locally reduced cell stiffness. Both ER and mitochondria segregated with vimentin filaments, but not lysosomes. These findings are consistent with published reports (e.g. comparing vimentin null and wildtype cell lines), but the acute and reversible nature of the motor recruitment strategy is a more elegant experimental approach, and the selectivity of the observed effects is evidence of its specificity. It is interesting that the ER network segregated with vimentin even in the absence of RNF26. While this is not explored further, it points to the potential power of this motor recruitment strategy for future studies on intermediate filament interactions.

      The following are some major and minor issues, which should all be easy for the authors to address.

      Major Comments:

      • Fig. S1 shows that the Vim-mCherry-FKBP construct coassembles with endogenous vimentin, but similar data for the iLID constructs appears to be lacking. I would like to see data demonstrating the incorporation of the Vim-mCherry-SspB constructs into the vimentin filaments. This should include high magnification images of single filaments in the cytoplasm of the cells.
      • The authors do not discuss the density of motor recruitment along the filaments. To address this, I'd like to see images showing the extent of recruitment of motors to the filaments using the rapalog and LID strategies. This should include high magnification images of single filaments in the cytoplasm of the cells.
      • For the experiments on vimentin and keratin organization, the authors do not explain that these proteins form distinct networks and do not coassemble. The authors should show this in the cell types examined. This should also be explained explicitly in the body of the manuscript, though the data could be placed in the supplementary data. This is important because many intermediate filaments can coassemble freely, and coassembled proteins would be expected to segregate together.

      Minor Comments:

      • The authors refer to selecting cells within an "optimized expression range" for their transiently expressed recombinant proteins. They should state the proportion of the cells that met this criterion in their transient transfection experiments as this is important information for other researchers that might wish to use this approach in their own studies.
      • In Fig. 1F there should be a statistical comparison between cells transfected with the Kin14 construct and control (untransfected) cells in the absence of rapalog
      • In Fig. 1G there should be a statistical comparison between cells expressing Kin14 and KIF5A in the absence of rapalog
      • The depletion of the ER network in the cell periphery is not evident in Fig. 7B, though the perinuclear accumulation is evident. Perhaps the authors could select another example or explain to the reader what exactly to look for in these images.
      • In Fig. 7C, the intensity of the mCherry declines markedly over time. This is presumably due to photobleaching but should be explained in the legend.

      Referees cross-commenting

      This session contains comments of Reviewer 1 and Reviewer 2

      Reviewer 1:

      I don't understand what Reviewer 2 means by "A major shortcoming is the unclear narrative, what do the authors want to present? This aspect requires significant attention." I found the narrative, purpose and conclusions of this study very clear to me. I also do not understand Reviewer 2's concern with the abstract. I re-read it and it still seems very clear and appropriate to me. For example, the authors state "Here, we present tools that allow rapid manipulation of vimentin IFs in the whole cytoplasm or within specific subcellular regions by inducibly coupling them to microtubule motors, either pharmacologically or using light". This seems clear and correct to me. It would be helpful if Reviewer 2 could point to specific language and explain why it is problematic.

      Reviewer 2:

      The strength of this paper is clearly the strong methods development and I find this aspect very intriguing and attractive. There is an imbalance in the narrative presenting on one hand the method and on the other hand presenting concrete research results. In my view, although interesting, the different experimental results serve more as proof-of-concept and they should not be presented as bona fide evidence of an existing or lacking bilateral interrealtionship.

      Indeed, the cited sentence makes sense: "Here, we present tools that allow rapid manipulation of vimentin IFs in the whole cytoplasm or within specific subcellular regions by inducibly coupling them to microtubule motors, either pharmacologically or using light." as it features the methods aspect of the paper. However, the following sentences: "Perinuclear clustering of vimentin had no strong effect on the actin or microtubule organization, cell spreading, and focal adhesions, but reduced cell stiffness. Mitochondria and endoplasmic reticulum sheets were repositioned together with vimentin, whereas lysosomes were only briefly repositioned and rapidly regained their normal distribution. Keratin was displaced along with vimentin in some cell lines but remained intact in others. " embraces everything from actin to microtubules to cell spreading to focal ahdesions to cell stiffness to mitochondrial function to lysosomes to interactions with other IF family members etc. This gives the impression that the authors want to make claims on how vimentin affects or does not affect these cellular functions and structures and once just cannot make such sweeping claims with so little evidence. With the experimental setting included, non of these claims can be really made without rigidly examining each and every interaction (which has been done separately for many of these bilateral interactions during the past 20 years or so).

      Hence, it should be made clear that these observations are used and mentioned as proof of concept that the tool is working, not as evidence that this or that interaction takes place or does not take place. As I indicated in my review, such claims on any of these bilateral interactions would require a lot more evidence to be properly substantiated.

      My comment is to be regarded as a positive one. If I would judge the paper based on how one could interpret the abstract and the text regarding, for example, that vimentin does not affect focal adhesions but changes cellular stiffness, my review would be significantly more stringent. However, I would really like to see this paper being published, but the claims on revealing new vimentin functions or disproving earlier observations based on these very limited data are just not sufficiently substantiated to be acceptable. Hence, I urge the authors to adjust the narrative to be clear on the methods development, which is also the focus of the title. I believe this is a justified recommendation and also, overall, a fair shake of the study and a constructive approach on how to publish this manuscript without extensive experiments.

      Reviewer 1:

      I thank Reviewer 2 for this explanation. I do understand their point. However, while not the end of the story, I do feel the authors' data are a bit more than just a proof of principle and do offer important insights into the biology which the field will need to grapple with. Each graph includes measurements on dozens of cells from multiple experiments and there is clearly selectivity to what segregates with the vimentin filaments and what does not. I would just ask the authors to be a bit more nuanced in their interpretation and conclusions about the biology to address Reviewer 2's concerns. Reviewer 2:

      That sounds like a fair assessment. Main thing is that this data is presented in a balanced way, with emphasis on the model development. Some of the presented data are in contradiction with quite established concepts by several researchers and the data presented here does not substantiate a paradigm shift. Regardless of this, some pieces of the data are intriguing, for example, the live cell imaging.

      Significance

      Summary: The authors show that chemical-induced and light-induced dimerization strategies can be used to recruit microtubule motors to vimentin filaments, allowing rapid and reversible experimental manipulation of vimentin filament organization either locally or globally in cells. These strategies provide an experimental approach for investigating the physical interaction of intermediate filaments with organelles and other cytoskeletal component, as well as a method for probing the role of intermediate filaments in cell mechanics, cytoskeletal dynamics, etc. This is a technical improvement over previous experimental strategies, which have relied largely on chronic manipulation such as global disassembly or genetic deletion of intermediate filaments, e.g. comparison of vimentin null and wild type cells.

      The principal weakness of this study is that it offers limited insight into intermediate filament biology. As such, it might be most appropriate for a tools or techniques section of a journal. The dimerization strategies have been reported previously, so that is not new, but the application to intermediate filaments is novel.

      Audience: This paper will be of interest to cell biologists who study cytoskeletal interactions, particularly the interaction of intermediate filaments with other cellular organelles or cytoskeletal polymers, or the role of intermediate filaments in cellular mechanics.

      Reviewer Expertise This reviewer has expertise on the cytoskeleton, cytoskeletal dynamics, and intracellular transport including intermediate filament biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Reviewer #1 __

      (Evidence, reproducibility and clarity (Required)):

      The manuscript identified a novel role of Intraflagellar Transport Protein 20 (IFT20) in the function and homeostasis of lymphatic endothelial junctions. The authors showed that IFT20 regulates VE-cadherin localization at adherens junctions in lymphatic endothelial cells. The authors performed impressive in vivo work that shows the requirement for IFT20 for the homeostasis of intercellular junctions, lymphangiogenesis, and drainage function of lymphatic vessels. In contrast, the cell biology part of the paper was underwhelming and will need significant revisions to support the proposed model. In the result section, several conclusions have to be toned down to match the actual results. The study employs in vivo mouse models, immunofluorescence, biochemical assays, and loss-of-function experiments to support their conclusions.

      Major comments - The authors present disrupted localization of VE-cadherin. Is this a mislocalization and/or protein stability issue in IFT20 KD cells? A western blot can help assess protein levels, and a phase-chase endocytosis assay of VE-cadherin can strengthen evidence. The authors did not confirm the permeability phenotype seen in vivo.

      We thank the reviewer for this helpful suggestion.

      Planned 1: Western blot to assess total VE-cadherin protein levels in IFT20 WT and KD cells.

      Planned 2: Immunofluorescence staining for cell-surface VE-cadherin using permeabilized and non-permeabilized IFT20 WT and KD cells during VEGF-C stimulation and washout.

      Together, these two experiments will assess VE-cadherin stability and more directly test the hypothesis that VE-cadherin does not recycle effectively back to the cell surface in the absence of IFT20.

      • While the authors focused on IFT20 and rab5, we do not have a clear idea about the vesicular dynamics as well as the status of early, late, and recycling endosomes in IFT20 KD cells. Is IFT 20 localized to non-rab5+ endosomes, and if yes, what are the species? A more general endosomal profiling would help strengthen the authors' message. For example, in Fig. 4-5, the authors will have to stain for other early endosomal markers as well as late, and recycling endosomal markers in control and IFT20 KD cells.

      Thank you for this helpful suggestion.

      Planned 3: Immunofluorescence staining for EEA1 (early endosome), RAB7 (late endosome), RAB4 (fast recycling), RAB11 (recycling endosome) along with IFT20 to determine its localization pattern.

      This experiment will determine the localization of IFT20 relative to various endosomal compartments.

      • In fig. 6C, a majority of VE-cadherin is not associated with Rab5. Staining with additional endosomal markers might help identify other endosomal species colocalizing with VE-cadherin. It will be critical to add to Fig. 6c the intensity profiles depicting colocalizations. The authors can also live image a fluorescently (f)-tagged VE-cadherin (maybe with another f-tagged rab5) and assess their association dynamics in IFT20 KD cells (similar to fig6C).

      Thank you for this helpful suggestion.

      Planned 4: Immunofluorescence staining for EEA1 (early endosome), RAB7 (late endosome), RAB4 (fast recycling), RAB11 (recycling endosome) along with VE-cadherin in IFT20 WT and KD cells to determine its localization pattern.

      Planned 5: Additional colocalization analysis such as adding intensity profiles and possibly proximity ligation assay.

      Beyond the scope of this manuscript 1: While we agree that imaging the dynamics of FP-tagged VE-cadherin in live cells would provide more detail about its localization, we feel that this is beyond the scope of the current manuscript.

      These experiments will determine the localization of VE-cadherin across various endosomal compartments and strengthen the current colocalization data.

      • Primary cilia do seem to regulate vascular plexus in the mouse retina as well as endothelial permeability through mediating subcellular localization of junction proteins. The authors do not clearly exclude the ciliary function of IFT20 in mediating lymphatic endothelial cell-cell junctions. A rescue experiment can help settle this question by targeting IFT20 exclusively to cilia (or not) and assessing, for example, VE-cadherin localization. The following is optional: It is also unclear whether the described regulation is specific to IFT20 or can be phenocopied by the ablation of another IFT subunit and/or cilia ablation through the depletion of a non-IFT cilia assembly regulator.

      Thank you for this helpful suggestion. We propose an alternative strategy.

      Planned 6: To determine the role of ciliary vs. nonciliary functions, we will knockdown IFT74, an IFT protein in the same IFT complex B as IFT20 that is required for cilia assembly and function but is not known to participate in vesicular trafficking. We will assess VE-cadherin localization in IFT74 WT and KD cells by immunofluorescence.

      Beyond the scope of the manuscript 2: We have not optimized reagents for targeting IFT20 to the cilium (e.g. ciliary targeting sequence) and believe that assessing the effects of a protein from the same IFT complex (IFT74) without known nonciliary functions will alleviate the reviewer’s concern.

      • Figs. 7A and B do not seem very convincing. The control vs. IFT20 KD western blot levels look mostly similar between the two conditions. The result section does not translate the actual data in Fig. 7A and B. Additionally, there are no statistical comparisons between control and KD conditions in the graphs. Except for a potential pVEGFR-3 increase at 30 min VEGF-C in IFT20 KD cells, but after washout the level is similar to control. This figure does not support well the model presented in fig. 8. The conclusion in lines 456-459 has to be toned down.

      Thank you for this helpful suggestion.

      Planned 7: We will remove these western blot data with the exception of pVEGFR-3 and add phospho-tyrosine immunofluorescence. We will use immunofluorescence to quantify phosphorylated tyrosine levels and repeat western blots for pVEGFR-3 at different concentrations and time points of VEGF-C stimulation in IFT20 WT and KD cells. We will remove the other western blot data and revise the text accordingly. We will also attempt to pull down total VEGFR-3 and then blot for pVEGFR-3 to improve sensitivity of this assay.

      These experiments will focus our analysis on the activation of VEGFR-3.

      • The authors were not able to stain for pVEGFR3. It would still be helpful to see a colocalization between total VEGFR3, IFT20, and VE-cadherin in control cells and IFT20 KD cells (VEGFR3 and VE-cadherin).

      Thank you for this helpful suggestion.

      Planned 8: We will perform immunofluorescence for VEGFR-3, IFT20, and VECAD and assess their localization.

      Minor comments - The control used in Figures 1 and 2 does not seem ideal. The proper control would be IFT20fl/fl cre neg. Is there a reason why the authors excluded a lox allele in control? Also, the authors have to provide the mice age used in these figures and when the Cre kicks in in the result section.

      Thank you for this helpful suggestion.

      Planned 9: We will clarify the use of control genotypes, and add mouse ages and Cre details to results/methods. This is a constitutive LYVE-1 Cre.

      • Please describe the overall mouse phenotype(s) of the LYVE1 CRE-IFT20 flox.

      Thank you for pointing out this oversight.

      Planned 10: We will include a description of the overall phenotypes of LYVE1 Cre IFT20 KOs in the text. One notable phenotype is abdominal ascites.

      • Line 109:'By expression, the authors probably mean immunostained.

      Thank you for pointing out this oversight.

      Planned 11: We will change to “immunostaining for”.

      • Many graphs exhibit undefined Y-axis labels and units. Please clarify these as well as the way they were quantified. Include such information in figure legends and/or in the materials and methods section. The figures in question are fig1C, E and F, fig2E, fig3E, fig4b and D, fig6b and D, fig7B and C.

      Thank you for this helpful suggestion.

      Planned 12: We will clarify the quantification strategies and units in the text and figure legends and make sure the axes are clearly labeled.

      • Line 295:'homeostasis"-the authors probably mean in a serum-rich condition.

      Thank you for this helpful suggestion.

      Planned 13: That is indeed what we meant. We will merge this sentence and the next sentence to be clearer.

      • Fig4C specifically the lower two images on the right side: the images do not seem to represent the corresponding graphs.

      Thank you for this helpful suggestion.

      Planned 14: We will double check these images and adjust if necessary.

      • Please add the statistical tests used to evaluate significance in all figure legends.

      Thank you for pointing out this oversight.

      Planned 15: We will be sure statistical tests are named in all figure legends.

      Reviewer #1 (Significance (Required)):

      This study provides novel insights into IFT20's role in VE-cadherin trafficking and endothelial junction stability, with its strongest aspect being the in vivo data in Figures 1 and 2, demonstrating lymphatic defects upon IFT20 loss. This represents a conceptual advance by extending IFT protein function beyond cilia (if one of the major comments is addressed) to vascular integrity. However, mechanistic depth is lacking, and ciliary role was not tested-additional rescue and colocalization experiments are needed to confirm the model. The study will interest vascular and lymphatic biologists, as well as cell biologists studying intracellular trafficking and cilia.

      Expertise: cilia and mouse genetics

      __Reviewer #2 __

      (Evidence, reproducibility and clarity (Required)):

      Paulson et al. use an in vivo model of IFT20 deletion (Lyve1-Cre) and primary lymphatic endothelial cell (LEC) cultures to investigate the role of IFT20 in controlling LEC-LEC junction dynamics. The key findings/suggestions include: i) Authors show alterations in the VE-cadherin (or ZO-1) staining at the LEC junctions upon IFT20 deletion or silencing. ii) They also show evidence of the IFT20 localization to RAB5 endosomes and alteration of RAB5 endosome dynamics upon IFT20 silencing.

      In the current manuscript, some of the key data are not convincing. Further experimentation and analysis (also of the existing data) are needed to solidify the authors' statements as detailed below. I expect that the suggested experiments can be executed in 3-to-6 months and require, at least, antibodies, which have not been used in the current manuscript.

      Major comments

      1. The data information, presented in the figure legends, is difficult to understand. The authors should always indicate how many biological replicates and independent experiments the data is derived from. This holds also for the representative images. Now, it seems that some of the quantified data are derived from only 1 experiment (see, for example, rows 423-425: "Graphs show one representative biological replicate of two, each comprising two technical replicates with 100+ cells per condition"). The quantifications should be based on data from at least three independent experiments.

      Often data points represent the field of views from a single sample, thus, biasing the statistical testing. The data points should represent biological replicates or independent experiments to allow the reader to make conclusions, about whether the findings are statistically significant and can be repeated.

      Thank you for this helpful critique.

      Planned 16: We will be sure to indicate biological and technical replicates and ensure that quantifications are representative of at least three independent experiments. We will also ensure that quantifications are statistically robust.

      The Lyve1-Cre is not specific for lymphatic vasculature (for example https://www.jax.org/strain/012601# and Lee LK et al. 2020, Cell Reports), as also stated by the authors (row 112). However, this is not shown in the data and complicates the interpretation of the data. Here, authors can stain the IFT20 with their existing mouse IFT20-specific antibody to show the loss in the lymphatic and/or blood vasculature. If IFT20 is lost in both vasculature types, it is not possible to say "lymphatic specific" (for example, row 143) and draw conclusions that the observed phenotypes would be primary to IFT20 loss in the lymphatic vasculature.

      Thank you for this helpful suggestion.

      Planned 17: We will assess IFT20 KO in blood vasculature and tone down lymphatic-specific language in the text.

      The authors write (rows 164-168) "Lymphatic vessels in the IFT20 KO or VE-cadherin KO embryonic dorsal skin exhibited increased and variable lumen size and excessive branching, suggesting that impaired lymphatic organization and function contributed to the fluid homeostasis defect. Here, immunofluorescence staining for LYVE-1 in the ear skin revealed similar patterning defects in adult IFT20 KO lymphatic vessels (Figure 2A), that have also been described in VE-cadherin KO mice (Hägerling et al., 2018)." However, based on Figure 2A, it is not obvious that there would be excessive lymphatic vessel branching, impaired organization or similarities to VE-cadherin deleted lymphatic vessels. To justify their statement, the authors should provide quantification of the branching (at least 3 mice/genotype).

      Thank you for this helpful critique.

      Planned 18: Based on the suggestion from Reviewer 3, we will remove these morphological and skin drainage data.

      IFT20 deletion or silencing causes alterations in the cell junction pattern/VE-cadherin intensity. The authors' interpretation that IFT20 deletion/silencing would cause discontinuous or "button-like" junctions is not supported by the provided images (Figures 1E, 3F, 6A, 6C). Rather, it seems that the levels of VE-cadherin in vivo are decreased, whereas the "continuity" of the junction is not altered. In cell culture, IFT20 silencing seems to cause wider and, to some extent, overlapping VE-cadherin junctions and not "discontinuous". These junctions may represent a more immature state. The authors should change the nomenclature accordingly or provide additional data. Using the existing cell culture experiment images, it would be more appropriate to analyze the width of the VE-cadherin junctions, instead of the "granularity".

      Thank you for this helpful suggestion.

      To assess VE-cadherin levels in vitro, we will perform western blots as described in Planned 1 above.

      Planned 19: We will measure widths of junctions from IFT20 KD and WT images and adjust the language in the text.

      Paulson et al. show images of IFT20 and RAB5 double-stained samples. The co-localization seems to happen mostly at the weakly IFT20 positive puncta (Figure 3A-B). Authors should show the disappearance of the signal in the siIFT20 treated samples (in comparison to siControl samples) to highlight the specificity of the weak signal.

      Thank you for this helpful suggestion.

      Planned 20: We will add data showing the IFT20 KD more clearly at high magnification.

      1. The Authors analyze the co-localization of VE-cadherin and RAB5 as co-localization area (Figure 6C-D). The images show that the co-localization is stated to happen at LEC periphery/junctions. LEC periphery is notoriously thin and microscope Z-resolution does not allow distinction of truly co-localizing or "on top of each other" signal. Based on row 607 co-localization would be expected to happen at least in EEA1+ vesicles, which are located perinuclearly (not at the junctions) in LECs (Korhonen et al. 2022, JCI). Authors could use EEA1, RAB5, and VE-cadherin triple staining for the quantification.

      Thank you for this helpful suggestion.

      Please see Planned 3 and Planned 4 above where we propose experiments to address this concern.

      In the current experiments, authors cannot conclude whether the VE-cadherin signal is at the cell junction (non-internalized), in endosomes (internalized during the experiment), or newly produced VE-cadherin on its way to the plasma membrane. To allow conclusions about the internalized VE-cadherin, and its localization in RAB5 vesicles, authors should conduct, for example, a classical endocytosis assay: incubation of live cells with non-blocking anti-VE-cadherin antibody, followed by acid wash to remove the non-internalized antibody, fixation and staining for RAB5. Also, shorter VEGF-C treatment would allow conclusions about the VE-cadherin dynamics.

      Thank you for this helpful suggestion.

      In Planned 2 above, we will perform immunofluorescence staining for cell-surface VE-cadherin using permeabilized and non-permeabilized IFT20 WT and KD cells during VEGF-C stimulation at various timepoints and washout to address this concern.

      siRNAs can have off-target effects and, thus, the use of at least two independent methods/oligos for silencing is needed. Paulson et al. use a pool of 4 oligos for silencing. They should rather test the efficacy of the single oligos and then use the two best oligos (1/sample) to show and quantify the same phenotype. This is needed at least for the key experiments shown in Figures 4C-D, Figure 6A-B (see also comment #3), Figure 7A-B

      Thank you for this helpful suggestion. We chose these reagents based on pooled siRNAs at low concentration minimizing off-target effects while still achieving strong KD vs. single siRNAs at higher concentration. Please see this technical note for further information about minimizing off-target effects by the use of pooled siRNAs vs. single siRNAs: https://horizondiscovery.com/-/media/Files/Horizon/resources/Application-notes/off-target-tech-review-technote.pdf?sc_lang=en

      1. “SMARTpool siRNA reagents pool four highly functional SMARTselection designed siRNAs targeting the same gene. Studies show that strong on-target gene knockdown can be achieved with minimal off-target effects if a pool consisting of highly functional multiple siRNA is subsituted for individual duplexes. This finding is in contrast to speculation that mixtrues of siRNAs can compound off-target effects. … [Their data show that] while individual duplexes delivered at 100 nM can induce varying numbers of off-targeted genes, transfection of the corresponding SMARTpool siRNA (100 nM total concentration) induces only a fraction of the total off-target profile.”
      2. “Our scientists have identified a unique combination of [chemical] modifications that eliminate as much as 80% of off-target effects.”
      3. “The ON-TARGETplus product line is comprised of four individual siRNAs, and SMARTpool reagents which are chemically modified and rationally designed to minimize off-target effects.”

        OPTIONAL: Paulson et al stated in the first article (2021, Front. Cell Dev. Biol.) that IFT20 deletion/silencing causes lymphatic endothelial phenotypes due to its role in primary cilia, whereas here the authors conclude that IFT20 controls VE-cadherin dynamics at the RAB5 vesicles. However, the current experiments cannot dissect the role of IFT20 in these two distinct locations. For this, authors could delete/silence another gene required for primary cilia or RAB5 endosomes and then analyze, which IFT20 phenotypes are recapitulated.

      Thank you for this helpful suggestion. Please see Planned 6 above where we propose to determine the role of ciliary vs. nonciliary IFT functions by knocking down IFT74, an IFT protein in the same IFT complex B as IFT20 that is required for cilia assembly and function but is not known to participate in vesicular trafficking. We will assess VE-cadherin localization in IFT74 WT and KD cells by immunofluorescence.

      The data shown in Figure 2 B-E (Lymphatic drainage) is not necessary for the current manuscript ("IFT20 regulates VE-cadherin traffic in LECs") and can be removed. As the authors state in the manuscript, the drainage phenotype may be due to lymphatic vessel valve defects (rows 584-585) rather than primary for LEC-LEC junction defects. The data does not justify the abstract sentence "and lymph transport is impaired by intracellular sequestration of VE-cadherin" (row 42).

      Thank you for this helpful suggestion. Please see Planned 18 above, where we propose to remove these data.

      Minor comments

      1. For some of the images, the signal should be enhanced to allow visual inspection also in the paper version (Figures 5A-B and 6C, magenta).

      Thank you for this helpful suggestion.

      Planned 21: We will enhance the signal in the indicated figures.

      Authors show representative Western Blots and quantification of several biological replicates/sample types to investigate signaling responses upon VEGF-C treatment of control and siIFT20 cells. The authors state that the P-levels of VEGFR3, ERK, VE-cadherin, and AKT have different dynamics in control and IFT20-silenced cells. To justify this conclusion, authors should test the statistical significance between the siControl and siIFT20 samples at each time point. The current quantification (Figure 7B) shows that there is, at least, a trend of increased p-VEGFR3, p-VE-cadherin, p-ERK, and p-AKT in IFT20 silenced cells. However, the representative Western Blot image does not display a clear difference (Figure 7A). Authors should include the original western blots, used for quantification, as supplements.

      Thank you for this helpful suggestion. Please see Planned 7 above where we propose to remove these data with the exception of pVEGFR-3 and add corresponding immunofluorescence data. We will ensure blots are included as supplemental figures.

      The authors use western blot quantification to show that the altered LEC junctions affect VEGFR3 signaling. They further hypothesize that the increased VEGFR3 signaling may be a consequence of VEGFR3 localization in endosomes. The authors did not detect any signal using the phospho-specific VEGFR3 antibody (rows 441-442). To analyze the location of VEGFR3 upon VEGF-C treatment in siControl and siIFT20 LECs, the authors should use anti-VEGFR3 (total) antibodies that have been shown to detect VEGFR3 in similar assays.

      Thank you for this helpful suggestion.

      Please see Planned 8 above where we will perform immunofluorescence for VEGFR-3, IFT20, and VECAD and assess their localization.

      The normality of the data should be tested before the selection of the statistical test. If this has been done, please, indicate it in the materials and methods or re-run the statistical analysis, if some of the data is not normally distributed.

      Thank you for this helpful suggestion.

      Planned 22: We will double check the statistics and normality for all quantifications.

      The authors should use arrows, arrowheads, etc. to highlight examples of relevant features in the images. For example, in Figure 3C, the increased stress fiber formation is not obvious to the reader.

      Thank you for this helpful suggestion.

      Planned 23: We will add arrows etc. where appropriate.

      Reviewer #2 (Significance (Required)):

      Lymphatics are essential for fluid, leukocyte, and lipid trafficking to lymph nodes and/or systemic circulation. Recent findings have promoted lymphatics as a potential target to control the level of adaptive immunity in inflammation-associated diseases, including tumorigenesis (for example Song et al 2020, Nature). Early work on lymphatic endothelium in vivo, highlighted the dynamics of lymphatic endothelial junction, which, reversibly, can alter between continuous and discontinuous ("button-like") states (Baluk 2007, Am. Jour. Pathol.; Yao 2012, Am J. Pathol.). These changes may have an effect on fluid drainage capacity, lymphatic vessel growth, and prevention of pathogen dissemination to the systemic circulation. Recently, lymphatic junctions have been shown to present hubs of VEGFR3 signaling, VEGFR3 and VE-cadherin dynamics, and leukocyte transmigration (Sung et al. 2022, Nat. Cardiovasc. Res.; Hagerling et al. 2018, EMBO J.; Liaqat et al. 2024, EMBO J.). Thus, the manuscript by Paulson et al. investigates a topical subject.

      The authors suggest a role for IFT20 in the control of VE-cadherin dynamics. Based on my expertise in lymphatic endothelial biology, I envision that the manuscript can potentially increase knowledge on the regulators of the lymphatic endothelial junctions, which might have physiological, and in the long term, translational significance. However, in the current manuscript, the exact mechanisms of how IFT20 controls lymphatic endothelial junctions are left open. In addition to the lymphatic research field, the study is, potentially of interest to researchers working on blood vasculature or, even, epithelium, i.e. tissues where junctional dynamics play a major role in health and disease.

      Furter controls, analysis, and experimentation are needed to warrant the authors' statements. In their future work, the authors should also consider means to rigorously dissect the IFT20 functions in primary cilia and endosomes.

      __Reviewer #3 __

      (Evidence, reproducibility and clarity (Required)):

      In this manuscript, the group of Fink and coworkers investigates mechanistic aspects of the intraflagellar transport protein 20 (IFT20) function in lymphatic endothelial cells (LECs). In a previous study, this group had demonstrated the presence of primary cilia on LECs and shown that loss of IFT20 during development resulted in edema, lymphatic vessel dilation and altered branching. Lymphatic-specific deletion of IFT20 cell-autonomously exacerbated acute lymphangiogenesis after corneal suture. In this manuscript, Paulson et al. recapitulate the suture-induced hyper-lymphangiogenesis after lymphatic-specific IFT20 KO using a LYVE1-Cre delete strain and demonstrate a reduced, more discontinuous VE-Cadherin (VECad) staining in newly formed lymphatic vessels (LVs). Prompted by distended and hyperbranching dermal vessels, the performed functional tracer injection experiments and demonstrate increased lymphatic backflow and leakage into the interstitium. To gain further mechanistic insights the authors turned to reductionist cell culture models, starting with a mouse LEC line, in which IFT20 had been deleted using CRISPR/Cas9 resulting in loss of primary cilia, increased stress fibre formation and impaired junctional integrity. More importantly, similar effects were detected in human dermal (HD)LECs after IFT20 KD. Further IFT20 KD HDLECs showed accumulation of RAB5+ vesicles indicating defective endosome maturation. Indistinguishable formation of RAB5+ endosomes after VEGF-C stimulation in HDLECs and IFT20KD HDLECs indicated that endocytosis and formation of early endosomes occur independent if IFT20. Through starvation, stimulation and wash-out experiments the authors provide colocalization data suggesting that after VEGF-C stimulation IFT20 is recruited to endosomes where it contributes to VECad recycling. Finally, the authors addressed if the increase in RAB5+ endosomes following VEGF-C stimulation resulted in prolonged retention of signaling-active VEGFR-3 in endosomes. Western blotting for phosphorylation of VEGFR-3 and its downstream signaling components after activation of starved HDLECs or IFT20KD HDLECs and subsequent factor wash out provided evidence towards this model.

      Subsequently open question and potential suggestions for improvement are listed: The authors describe a slight leakiness of the LYVE1-Cre deleter strain to result in massive hemangiogenesis (line112). How extensive is the resulting deletion in blood endothelial cells? What are the consequences for VECad distribution in BEC junctions i.e. for blood vessels and vascular permeability? Are the defects described specific for LECs or are the manifestation of generic defects in LECs?

      Thank you for these helpful suggestions.

      Please see Planned 17 above where we will assess IFT20 KO in blood vasculature and tone down lymphatic-specific language in the text.

      Fig. 1 E, what is the distribution of LYVE1 in IFT20 KO LECs at higher magnification, is LYVE1 excluded from the VECad expression domain?

      Thank you for this helpful suggestion.

      Planned 24: We will review our corneal confocal data to address this question.

      Fig.1 F, what does VECad-positive LV (%) area (line 154 - 155) refer to, given that all LECs are VECad+ but the junctional distribution of the protein is distinctly different?

      Thank you for pointing out our need to clarify. This quantification measures the overlap of VE-cadherin with LYVE-1 as a way to measure the area covered by adherens junctions between lymphatic endothelial cells. Where junctions are punctate, they have smaller area vs. long continuous junctions.

      Planned 25: We will update the text to clarify this measurement.

      In the discussion, the authors speculate that the development of valves could be potentially impaired in IFT20 LEC KO mice. Ear skin would be an excellent tissue to stain the valves and analyse their structure in collecting LVs. Of particular interest in this context are Int a9, VECad, FOXC2 and PROX1 expression. The later two are required for valve formation and upregulated in valve forming areas in response to oscillatory shear stress (Sabine A et al. (2012) Dev Cell 22 (2):430-445. doi:10.1016/j.devcel.2011.12.020).

      Thank you for this helpful suggestion. Based on the suggestion from Reviewer 2, we will remove the ear lymph drainage data and focus on the cell biology in this manuscript. Our current experiments focus more on lymphatic valve formation in this context and these data can be moved to a separate manuscript.

      Planned 26: We will revise the text to remove speculation about valve development in this model and address this in a later manuscript.

      Does IFT20 KO and loss of the primary cilium impair OSS sensing and result in a failure to express sufficient levels of PROX1 for valve formation (Fig. 7 C).

      Thank you for this helpful comment. We will address the role of cilia in OSS sensing and valve formation in a forthcoming manuscript.

      A larger area view including pre-collectors and collectors would be informative and reveal changes in the overall structure of the lymphatic vessel bed in absence of IFT20.

      Based on the suggestion from Reviewer 2, we will remove these data.

      Fig. 2 A, (line 187 - 190) please indicate the age of analysed animals.

      Planned 27: We will add the ages of mice used.

      With respect to Fig.1, LVs in the area are mainly capillaries, what is the distribution of VECad? Are the LVs comprised of oak-leave shaped LECs, higher magn. pictures would be required.

      Thank you for this helpful suggestion.

      Planned 28: We will include higher magnification images of capillaries.

      Fig. 2 (C - E) Line 201 - 203 the description of retrograde flow using a clock terminology is unusual and not clear to the reader. Is this meant relative to the point of injection with 12 being at the top or relative to the injection axis (i.e. forward / backward direction)? It would seem that indication of the angle in combination with a sketch of the analysis would help the reader to interpret these data.

      Thank you for this helpful critique. We will remove these data based on this suggestion and that of Reviewer 2.

      The application of cell culture models is appropriate, however, the value of the mLEC model is questionable given that VECad is not detectable in these cells and PROX1 and VEGFR-3 staining are not shown. Therefore, the HDLEC model bears significantly more relevance. In Fig. 3D, were mLECS mitotically arrested during the 24hrs transwell migration, to exclude division and crowding effects during the observation time?

      Thank you for this helpful critique.

      Planned 29: We will clarify the methods for this experiment in the text.

      Fig. 6 It is commendable that the authors report their lack of success to directly visualize VEGFR-3 endocytosis by IF and attempt a WB analysis instead. However given the spread of the results normalization to ß-actin as a loading control appears inappropriate. Phosphorylated forms of VEGFR-3 and VECad should be normalized to the expression of the total protein as measured with a non-phospospecific antibody, exactly the way done here for ERK1/2 and AKT. Generally, IP-WB experiments provide superior data in this type of setting.

      Thank you for this helpful suggestion. Based on suggestions from the other reviewers, we will remove these WB data with the exception of pVEGFR-3 and add corresponding immunofluorescence. We will include additional time points and include blots used for quantifications as supplements.

      Line 597 - 599: "VEGFR-3 signaling is required for the establishment of VE-cadherin button junctions as lymphatic collecting vessels mature but is not required for their maintenance (Jannaway et al., 2023)." Collecting LVs are characterized by zipper junctions, but not button junctions. Therefore, this sentence needs clarification.

      Thank you for this helpful suggestion.

      Planned 30: We will clarify this text.

      Reviewer #3 (Significance (Required)):

      The role of IFT20 in formation of the primary cilium and endocytic vesicle transport warrants its investigation in lymphatic endothelial cells. Therefore, this study addresses relevant questions and provides important first insights into the cell biological function of IFT20 in this cell type. IFT20 has so far not been implicated in endocytosis and recycling of VECad and VEGFR-3 and the model suggested by the authors is compelling and adds to the mechanistic understanding of previous studies on the role of VECad in LECs. In particular, it could be of relevance for the enigmatic formation of button junction in lymphatic capillaries and the mechano-response of LECS underlying valve formation. At this point, the picture obtained from the endocytosis assays is more conclusive compared to the analysis of the impact of IFT20 loss on button junction formation. Clearly the study is of interest for a general cell biological audience as well as vascular biologists.

      • *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, the group of Fink and coworkers investigates mechanistic aspects of the intraflagellar transport protein 20 (IFT20) function in lymphatic endothelial cells (LECs). In a previous study, this group had demonstrated the presence of primary cilia on LECs and shown that loss of IFT20 during development resulted in edema, lymphatic vessel dilation and altered branching. Lymphatic-specific deletion of IFT20 cell-autonomously exacerbated acute lymphangiogenesis after corneal suture. In this manuscript, Paulson et al. recapitulate the suture-induced hyper-lymphangiogenesis after lymphatic-specific IFT20 KO using a LYVE1-Cre delete strain and demonstrate a reduced, more discontinuous VE-Cadherin (VECad) staining in newly formed lymphatic vessels (LVs). Prompted by distended and hyperbranching dermal vessels, the performed functional tracer injection experiments and demonstrate increased lymphatic backflow and leakage into the interstitium. To gain further mechanistic insights the authors turned to reductionist cell culture models, starting with a mouse LEC line, in which IFT20 had been deleted using CRISPR/Cas9 resulting in loss of primary cilia, increased stress fibre formation and impaired junctional integrity. More importantly, similar effects were detected in human dermal (HD)LECs after IFT20 KD. Further IFT20 KD HDLECs showed accumulation of RAB5+ vesicles indicating defective endosome maturation. Indistinguishable formation of RAB5+ endosomes after VEGF-C stimulation in HDLECs and IFT20KD HDLECs indicated that endocytosis and formation of early endosomes occur independent if IFT20. Through starvation, stimulation and wash-out experiments the authors provide colocalization data suggesting that after VEGF-C stimulation IFT20 is recruited to endosomes where it contributes to VECad recycling. Finally, the authors addressed if the increase in RAB5+ endosomes following VEGF-C stimulation resulted in prolonged retention of signaling-active VEGFR-3 in endosomes. Western blotting for phosphorylation of VEGFR-3 and its downstream signaling components after activation of starved HDLECs or IFT20KD HDLECs and subsequent factor wash out provided evidence towards this model.

      Subsequently open question and potential suggestions for improvement are listed: The authors describe a slight leakiness of the LYVE1-Cre deleter strain to result in massive hemangiogenesis (line112). How extensive is the resulting deletion in blood endothelial cells? What are the consequences for VECad distribution in BEC junctions i.e. for blood vessels and vascular permeability? Are the defects described specific for LECs or are the manifestation of generic defects in LECs? Fig. 1 E, what is the distribution of LYVE1 in IFT20 KO LECs at higher magnification, is LYVE1 excluded from the VECad expression domain? Fig.1 F, what does VECad-positive LV (%) area (line 154 - 155) refer to, given that all LECs are VECad+ but the junctional distribution of the protein is distinctly different?

      In the discussion, the authors speculate that the development of valves could be potentially impaired in IFT20 LEC KO mice. Ear skin would be an excellent tissue to stain the valves and analyse their structure in collecting LVs. Of particular interest in this context are Int a9, VECad, FOXC2 and PROX1 expression. The later two are required for valve formation and upregulated in valve forming areas in response to oscillatory shear stress (Sabine A et al. (2012) Dev Cell 22 (2):430-445. doi:10.1016/j.devcel.2011.12.020). Does IFT20 KO and loss of the primary cilium impair OSS sensing and result in a failure to express sufficient levels of PROX1 for valve formation (Fig. 7 C). A larger area view including pre-collectors and collectors would be informative and reveal changes in the overall structure of the lymphatic vessel bed in absence of IFT20. Fig. 2 A, (line 187 - 190) please indicate the age of analysed animals. With respect to Fig.1, LVs in the area are mainly capillaries, what is the distribution of VECad? Are the LVs comprised of oak-leave shaped LECs, higher magn. pictures would be required. Fig. 2 (C - E) Line 201 - 203 the description of retrograde flow using a clock terminology is unusual and not clear to the reader. Is this meant relative to the point of injection with 12 being at the top or relative to the injection axis (i.e. forward / backward direction)? It would seem that indication of the angle in combination with a sketch of the analysis would help the reader to interpret these data.

      The application of cell culture models is appropriate, however, the value of the mLEC model is questionable given that VECad is not detectable in these cells and PROX1 and VEGFR-3 staining are not shown. Therefore, the HDLEC model bears significantly more relevance. In Fig. 3D, were mLECS mitotically arrested during the 24hrs transwell migration, to exclude division and crowding effects during the observation time?

      Fig. 6 It is commendable that the authors report their lack of success to directly visualize VEGFR-3 endocytosis by IF and attempt a WB analysis instead. However given the spread of the results normalization to ß-actin as a loading control appears inappropriate. Phosphorylated forms of VEGFR-3 and VECad should be normalized to the expression of the total protein as measured with a non-phospospecific antibody, exactly the way done here for ERK1/2 and AKT. Generally, IP-WB experiments provide superior data in this type of setting. Line 597 - 599: "VEGFR-3 signaling is required for the establishment of VE-cadherin button junctions as lymphatic collecting vessels mature but is not required for their maintenance (Jannaway et al., 2023)." Collecting LVs are characterized by zipper junctions, but not button junctions. Therefore, this sentence needs clarification.

      Significance

      The role of IFT20 in formation of the primary cilium and endocytic vesicle transport warrants its investigation in lymphatic endothelial cells. Therefore, this study addresses relevant questions and provides important first insights into the cell biological function of IFT20 in this cell type. IFT20 has so far not been implicated in endocytosis and recycling of VECad and VEGFR-3 and the model suggested by the authors is compelling and adds to the mechanistic understanding of previous studies on the role of VECad in LECs. In particular, it could be of relevance for the enigmatic formation of button junction in lymphatic capillaries and the mechano-response of LECS underlying valve formation. At this point, the picture obtained from the endocytosis assays is more conclusive compared to the analysis of the impact of IFT20 loss on button junction formation. Clearly the study is of interest for a general cell biological audience as well as vascular biologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Paulson et al. use an in vivo model of IFT20 deletion (Lyve1-Cre) and primary lymphatic endothelial cell (LEC) cultures to investigate the role of IFT20 in controlling LEC-LEC junction dynamics. The key findings/suggestions include: i) Authors show alterations in the VE-cadherin (or ZO-1) staining at the LEC junctions upon IFT20 deletion or silencing. ii) They also show evidence of the IFT20 localization to RAB5 endosomes and alteration of RAB5 endosome dynamics upon IFT20 silencing.

      In the current manuscript, some of the key data are not convincing. Further experimentation and analysis (also of the existing data) are needed to solidify the authors' statements as detailed below. I expect that the suggested experiments can be executed in 3-to-6 months and require, at least, antibodies, which have not been used in the current manuscript.

      Major comments

      1. The data information, presented in the figure legends, is difficult to understand. The authors should always indicate how many biological replicates and independent experiments the data is derived from. This holds also for the representative images. Now, it seems that some of the quantified data are derived from only 1 experiment (see, for example, rows 423-425: "Graphs show one representative biological replicate of two, each comprising two technical replicates with 100+ cells per condition"). The quantifications should be based on data from at least three independent experiments.

      Often data points represent the field of views from a single sample, thus, biasing the statistical testing. The data points should represent biological replicates or independent experiments to allow the reader to make conclusions, about whether the findings are statistically significant and can be repeated. 2. The Lyve1-Cre is not specific for lymphatic vasculature (for example https://www.jax.org/strain/012601# and Lee LK et al. 2020, Cell Reports), as also stated by the authors (row 112). However, this is not shown in the data and complicates the interpretation of the data. Here, authors can stain the IFT20 with their existing mouse IFT20-specific antibody to show the loss in the lymphatic and/or blood vasculature. If IFT20 is lost in both vasculature types, it is not possible to say "lymphatic specific" (for example, row 143) and draw conclusions that the observed phenotypes would be primary to IFT20 loss in the lymphatic vasculature. 3. The authors write (rows 164-168) "Lymphatic vessels in the IFT20 KO or VE-cadherin KO embryonic dorsal skin exhibited increased and variable lumen size and excessive branching, suggesting that impaired lymphatic organization and function contributed to the fluid homeostasis defect. Here, immunofluorescence staining for LYVE-1 in the ear skin revealed similar patterning defects in adult IFT20 KO lymphatic vessels (Figure 2A), that have also been described in VE-cadherin KO mice (Hägerling et al., 2018)." However, based on Figure 2A, it is not obvious that there would be excessive lymphatic vessel branching, impaired organization or similarities to VE-cadherin deleted lymphatic vessels. To justify their statement, the authors should provide quantification of the branching (at least 3 mice/genotype). 4. IFT20 deletion or silencing causes alterations in the cell junction pattern/VE-cadherin intensity. The authors' interpretation that IFT20 deletion/silencing would cause discontinuous or "button-like" junctions is not supported by the provided images (Figures 1E, 3F, 6A, 6C). Rather, it seems that the levels of VE-cadherin in vivo are decreased, whereas the "continuity" of the junction is not altered. In cell culture, IFT20 silencing seems to cause wider and, to some extent, overlapping VE-cadherin junctions and not "discontinuous". These junctions may represent a more immature state. The authors should change the nomenclature accordingly or provide additional data. Using the existing cell culture experiment images, it would be more appropriate to analyze the width of the VE-cadherin junctions, instead of the "granularity". 5. Paulson et al. show images of IFT20 and RAB5 double-stained samples. The co-localization seems to happen mostly at the weakly IFT20 positive puncta (Figure 3A-B). Authors should show the disappearance of the signal in the siIFT20 treated samples (in comparison to siControl samples) to highlight the specificity of the weak signal. 6. The Authors analyze the co-localization of VE-cadherin and RAB5 as co-localization area (Figure 6C-D). The images show that the co-localization is stated to happen at LEC periphery/junctions. LEC periphery is notoriously thin and microscope Z-resolution does not allow distinction of truly co-localizing or "on top of each other" signal. Based on row 607 co-localization would be expected to happen at least in EEA1+ vesicles, which are located perinuclearly (not at the junctions) in LECs (Korhonen et al. 2022, JCI). Authors could use EEA1, RAB5, and VE-cadherin triple staining for the quantification.

      In the current experiments, authors cannot conclude whether the VE-cadherin signal is at the cell junction (non-internalized), in endosomes (internalized during the experiment), or newly produced VE-cadherin on its way to the plasma membrane. To allow conclusions about the internalized VE-cadherin, and its localization in RAB5 vesicles, authors should conduct, for example, a classical endocytosis assay: incubation of live cells with non-blocking anti-VE-cadherin antibody, followed by acid wash to remove the non-internalized antibody, fixation and staining for RAB5. Also, shorter VEGF-C treatment would allow conclusions about the VE-cadherin dynamics. 7. siRNAs can have off-target effects and, thus, the use of at least two independent methods/oligos for silencing is needed. Paulson et al. use a pool of 4 oligos for silencing. They should rather test the efficacy of the single oligos and then use the two best oligos (1/sample) to show and quantify the same phenotype. This is needed at least for the key experiments shown in Figures 4C-D, Figure 6A-B (see also comment #3), Figure 7A-B 8. OPTIONAL: Paulson et al stated in the first article (2021, Front. Cell Dev. Biol.) that IFT20 deletion/silencing causes lymphatic endothelial phenotypes due to its role in primary cilia, whereas here the authors conclude that IFT20 controls VE-cadherin dynamics at the RAB5 vesicles. However, the current experiments cannot dissect the role of IFT20 in these two distinct locations. For this, authors could delete/silence another gene required for primary cilia or RAB5 endosomes and then analyze, which IFT20 phenotypes are recapitulated. 9. The data shown in Figure 2 B-E (Lymphatic drainage) is not necessary for the current manuscript ("IFT20 regulates VE-cadherin traffic in LECs") and can be removed. As the authors state in the manuscript, the drainage phenotype may be due to lymphatic vessel valve defects (rows 584-585) rather than primary for LEC-LEC junction defects. The data does not justify the abstract sentence "and lymph transport is impaired by intracellular sequestration of VE-cadherin" (row 42).

      Minor comments

      1. For some of the images, the signal should be enhanced to allow visual inspection also in the paper version (Figures 5A-B and 6C, magenta).
      2. Authors show representative Western Blots and quantification of several biological replicates/sample types to investigate signaling responses upon VEGF-C treatment of control and siIFT20 cells. The authors state that the P-levels of VEGFR3, ERK, VE-cadherin, and AKT have different dynamics in control and IFT20-silenced cells. To justify this conclusion, authors should test the statistical significance between the siControl and siIFT20 samples at each time point.

      The current quantification (Figure 7B) shows that there is, at least, a trend of increased p-VEGFR3, p-VE-cadherin, p-ERK, and p-AKT in IFT20 silenced cells. However, the representative Western Blot image does not display a clear difference (Figure 7A). Authors should include the original western blots, used for quantification, as supplements. 12. The authors use western blot quantification to show that the altered LEC junctions affect VEGFR3 signaling. They further hypothesize that the increased VEGFR3 signaling may be a consequence of VEGFR3 localization in endosomes. The authors did not detect any signal using the phospho-specific VEGFR3 antibody (rows 441-442). To analyze the location of VEGFR3 upon VEGF-C treatment in siControl and siIFT20 LECs, the authors should use anti-VEGFR3 (total) antibodies that have been shown to detect VEGFR3 in similar assays. 13. The normality of the data should be tested before the selection of the statistical test. If this has been done, please, indicate it in the materials and methods or re-run the statistical analysis, if some of the data is not normally distributed. 14. The authors should use arrows, arrowheads, etc. to highlight examples of relevant features in the images. For example, in Figure 3C, the increased stress fiber formation is not obvious to the reader.

      Significance

      Lymphatics are essential for fluid, leukocyte, and lipid trafficking to lymph nodes and/or systemic circulation. Recent findings have promoted lymphatics as a potential target to control the level of adaptive immunity in inflammation-associated diseases, including tumorigenesis (for example Song et al 2020, Nature). Early work on lymphatic endothelium in vivo, highlighted the dynamics of lymphatic endothelial junction, which, reversibly, can alter between continuous and discontinuous ("button-like") states (Baluk 2007, Am. Jour. Pathol.; Yao 2012, Am J. Pathol.). These changes may have an effect on fluid drainage capacity, lymphatic vessel growth, and prevention of pathogen dissemination to the systemic circulation. Recently, lymphatic junctions have been shown to present hubs of VEGFR3 signaling, VEGFR3 and VE-cadherin dynamics, and leukocyte transmigration (Sung et al. 2022, Nat. Cardiovasc. Res.; Hagerling et al. 2018, EMBO J.; Liaqat et al. 2024, EMBO J.). Thus, the manuscript by Paulson et al. investigates a topical subject.

      The authors suggest a role for IFT20 in the control of VE-cadherin dynamics. Based on my expertise in lymphatic endothelial biology, I envision that the manuscript can potentially increase knowledge on the regulators of the lymphatic endothelial junctions, which might have physiological, and in the long term, translational significance. However, in the current manuscript, the exact mechanisms of how IFT20 controls lymphatic endothelial junctions are left open. In addition to the lymphatic research field, the study is, potentially of interest to researchers working on blood vasculature or, even, epithelium, i.e. tissues where junctional dynamics play a major role in health and disease.

      Furter controls, analysis, and experimentation are needed to warrant the authors' statements. In their future work, the authors should also consider means to rigorously dissect the IFT20 functions in primary cilia and endosomes.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript identified a novel role of Intraflagellar Transport Protein 20 (IFT20) in the function and homeostasis of lymphatic endothelial junctions. The authors showed that IFT20 regulates VE-cadherin localization at adherens junctions in lymphatic endothelial cells. The authors performed impressive in vivo work that shows the requirement for IFT20 for the homeostasis of intercellular junctions, lymphangiogenesis, and drainage function of lymphatic vessels. In contrast, the cell biology part of the paper was underwhelming and will need significant revisions to support the proposed model. In the result section, several conclusions have to be toned down to match the actual results. The study employs in vivo mouse models, immunofluorescence, biochemical assays, and loss-of-function experiments to support their conclusions.

      Major comments

      • The authors present disrupted localization of VE-cadherin. Is this a mislocalization and/or protein stability issue in IFT20 KD cells? A western blot can help assess protein levels, and a phase-chase endocytosis assay of VE-cadherin can strengthen evidence. The authors did not confirm the permeability phenotype seen in vivo.
      • While the authors focused on IFT20 and rab5, we do not have a clear idea about the vesicular dynamics as well as the status of early, late, and recycling endosomes in IFT20 KD cells. Is IFT 20 localized to non-rab5+ endosomes, and if yes, what are the species? A more general endosomal profiling would help strengthen the authors' message. For example, in Fig. 4-5, the authors will have to stain for other early endosomal markers as well as late, and recycling endosomal markers in control and IFT20 KD cells.
      • In fig. 6C, a majority of VE-cadherin is not associated with Rab5. Staining with additional endosomal markers might help identify other endosomal species colocalizing with VE-cadherin. It will be critical to add to Fig. 6c the intensity profiles depicting colocalizations. The authors can also live image a fluorescently (f)-tagged VE-cadherin (maybe with another f-tagged rab5) and assess their association dynamics in IFT20 KD cells (similar to fig6C).
      • Primary cilia do seem to regulate vascular plexus in the mouse retina as well as endothelial permeability through mediating subcellular localization of junction proteins. The authors do not clearly exclude the ciliary function of IFT20 in mediating lymphatic endothelial cell-cell junctions. A rescue experiment can help settle this question by targeting IFT20 exclusively to cilia (or not) and assessing, for example, VE-cadherin localization. The following is optional: It is also unclear whether the described regulation is specific to IFT20 or can be phenocopied by the ablation of another IFT subunit and/or cilia ablation through the depletion of a non-IFT cilia assembly regulator.
      • Figs. 7A and B do not seem very convincing. The control vs. IFT20 KD western blot levels look mostly similar between the two conditions. The result section does not translate the actual data in Fig. 7A and B. Additionally, there are no statistical comparisons between control and KD conditions in the graphs. Except for a potential pVEGFR-3 increase at 30 min VEGF-C in IFT20 KD cells, but after washout the level is similar to control. This figure does not support well the model presented in fig. 8. The conclusion in lines 456-459 has to be toned down.
      • The authors were not able to stain for pVEGFR3. It would still be helpful to see a colocalization between total VEGFR3, IFT20, and VE-cadherin in control cells and IFT20 KD cells (VEGFR3 and VE-cadherin).

      Minor comments

      • The control used in Figures 1 and 2 does not seem ideal. The proper control would be IFT20fl/fl cre neg. Is there a reason why the authors excluded a lox allele in control? Also, the authors have to provide the mice age used in these figures and when the Cre kicks in in the result section.
      • Please describe the overall mouse phenotype(s) of the LYVE1 CRE-IFT20 flox.
      • Line 109:'By expression, the authors probably mean immunostained.
      • Many graphs exhibit undefined Y-axis labels and units. Please clarify these as well as the way they were quantified. Include such information in figure legends and/or in the materials and methods section. The figures in question are fig1C, E and F, fig2E, fig3E, fig4b and D, fig6b and D, fig7B and C.
      • Line 295:'homeostasis"-the authors probably mean in a serum-rich condition.
      • Fig4C specifically the lower two images on the right side: the images do not seem to represent the corresponding graphs.
      • Please add the statistical tests used to evaluate significance in all figure legends.

      Significance

      This study provides novel insights into IFT20's role in VE-cadherin trafficking and endothelial junction stability, with its strongest aspect being the in vivo data in Figures 1 and 2, demonstrating lymphatic defects upon IFT20 loss. This represents a conceptual advance by extending IFT protein function beyond cilia (if one of the major comments is addressed) to vascular integrity. However, mechanistic depth is lacking, and ciliary role was not tested-additional rescue and colocalization experiments are needed to confirm the model. The study will interest vascular and lymphatic biologists, as well as cell biologists studying intracellular trafficking and cilia.

      Expertise: cilia and mouse genetics

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Kemp et al. aimed to explore the transcriptional cell cycle regulation of replication-dependent (RD) histone genes at histone locus body (HLB) in Drosophila. They evaluate the accumulation of RNA pol II and RD histone transcripts at HLB during the cell cycle using live and fixed imaging of Drosophila tissues at different stages of development. They find that RNA pol II is enriched at HLB, not only during S phase when RD histone genes are transcribed but throughout the cell cycle. Outside of S phase, they detect short but not full-length RD histone transcripts suggesting a mechanism of RNA pol II pausing. Full length RD transcripts are only produced upon cyclin E/Cdk2 activation when cells enter S phase, arguing that Cyclin E/Cdk2 can activate transcription elongation. They propose that the elongation release triggered by Cyclin E/Cdk2 is the critical step linking RD histone gene expression and cell cycle progression rather than the recruitment of RNA pol II to HLB. The data are interesting and robust, however, additional experiments could reinforce the findings and the proposed model.

      Specific comments/concerns are listed below.

      1. In Figure 3, quantifications of the fluorescence at HLBs for mCherry-RBP1 and MXC-mScarlet should be provided.
      2. In Figure 5C, both 5' and 3' transcripts are observed in G214 cells. However, their accumulation in the cytoplasm is not visible. How do the authors explain this result? What happens in S14 cells?
      3. In Figure 6, the authors observed RD histone 3' transcripts only in replicating cells (EdU positive) while they detected 5' transcripts in both replicating and non-replicating cells. They argue that the appearance of 3' transcripts is due to the release from transcriptional pausing. To further support particular states in the transcriptional arrest, data by immunofluorescence using specific antibodies recognizing either RNA pol II ser5P or ser2P would determine whether the presence of 3' transcripts is associated with the accumulation at HLB of RNA pol II ser2P (elongating polymerase). Moreover, is there a correlation between P-MXC and RNA pol II ser2P?
      4. In Figure 7 panels C and D, the 5' transcripts should be shown. Although RD histone 3' transcripts accumulate in CyE+ embryonic cells, unfortunately, their presence at HLBs (pointed by arrows) is not visible in the image of panel E. To firm up conclusions quantifications of the 3' and 5' transcripts should be provided for CycE+ and CycEnull cells. In Hur et al., 2020, the authors looked at RD histone transcripts in WT embryo and CycE+/-/Cdk2+/- mutant. They found that the amount of H3 transcripts using a probe corresponding to the coding sequence is not changed in the mutant as compared to the WT. In contrast, they found that there is an increase of transcripts that are not correctly processed using probes downstream the stem-loop region. This seems inconsistent with the results presented here where a decrease of 3' transcripts is observed. This needs an explanation/discussion. Are such incorrectly processed transcripts observed in CycEnull mutant?
      5. The authors suggest that active Cyclin E/Cdk2 triggers the release of RNA pol II promoter-proximal pausing and therefore induces transcriptional elongation at RD histone genes when cells enter S phase. To further support this hypothesis, determining whether there is an enrichment of the elongation factor p-TEFb at HLB when Cyclin E/Cdk2 is active would help.
      6. Instead of using cycling E mutants, to determine whether it is the phosphorylation of MXC which directly impacts the elongation of RD histone genes, it would be interesting to generate phospho-null or phospho-mimetic mutant of MXC.
      7. In Suzuki et al., 2022, the authors described 3' RNA pol II pausing at RD histone genes. Although this study used human cells, it would be interesting to discuss that in addition to a promoter-proximal pausing that regulates transcription elongation, a 3' pausing could also regulate the transcription termination and 3' processing.
      8. In the discussion, the authors should point out some limitations of their studies linked to the method and could propose for the future that a more precise and molecular view of the pausing mechanism could be carried out using sequencing methods such as ChIP-seq of various isoforms of the RNA pol II (total, ser2P, ser5P) and elongation regulators (p-TEFb.....) and PRO-seq.

      Minor points:

      1. In Figure 1, for panels B and D as well as for panels C and E, to falicitate comparison of the localization of the different proteins, it would help to show the same developmental stages and the same image scales.
      2. In Figures 3 and 7 (C-F), the developmental stages should be indicated on the images, as it is done in the other figures.
      3. In the legend of Figure 7, it is indicated (D) and (E) instead of (C) and (D) in the sentence: "Endocycling midgut cells in (D) contain cytoplasmic histone mRNA which is absent in (E) (boxed regions)."

      Significance

      Kemp et al. aimed to explore the transcriptional cell cycle regulation of replication-dependent (RD) histone genes at histone locus body (HLB) in Drosophila. They evaluate the accumulation of RNA pol II and RD histone transcripts at HLB during the cell cycle using live and fixed imaging of Drosophila tissues at different stages of development. They find that RNA pol II is enriched at HLB, not only during S phase when RD histone genes are transcribed but throughout the cell cycle. Outside of S phase, they detect short but not full-length RD histone transcripts suggesting a mechanism of RNA pol II pausing. Full length RD transcripts are only produced upon cyclin E/Cdk2 activation when cells enter S phase, arguing that Cyclin E/Cdk2 can activate transcription elongation. They propose that the elongation release triggered by Cyclin E/Cdk2 is the critical step linking RD histone gene expression and cell cycle progression rather than the recruitment of RNA pol II to HLB.

      The data are interesting and robust, however, additional experiments could reinforce the findings and the proposed model.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Kemp et al. seek to define the molecular interactions that limit replication-dependent histone gene transcription to S-phase of the cell cycle. They use the Drosophila model system and leverage live-imaging tools, such as tagged proteins and Jabba trap, and RNA FISH in several tissues to determine that RNA Pol II is enriched at the locus throughout the cell cycle and is paused outside of S-phase. Therefore, they conclude that it is not Pol II recruitment to the locus that couples histone transcription to S-phase, but release of Pol II pausing.

      Major comments:

      The data presented are clean and well-presented. The claims are supported by the data without exaggeration. It would be helpful to provide -omics support for this entirely image-based analysis (e.g. PRO- or GRO-seq data from synchronized, sorted Drosophila cells may already exist- OPTIONAL).

      A major requirement is that the authors make clear in Introduction and Discussion that the observation of Pol Ii pausing at RD histone genes is not novel. This requires, at minimum, a discussion of Liu (2024) and Suzuki (2022). This allows readers to focus on the advance novel to this work, which is specifically the cell cycle coupling of Pol II pausing.

      As the authors are claiming different dynamics between Spt6 and RPB1 in Figure 1, they should provide similarly-staged embryos for comparison. For example, the authors should show RPB1 in early/mid S of NC 14, as this is when they see Spt6 variability. In theory, this should be relatively easy as these are stills from the live videos.

      Minor comments:

      The use of Spt6 live imaging early on was slightly confusing. The authors should consider moving this data later in the results or providing more written justification for why they investigated Spt6 (further than "to further explore the regulation of RNA pol II dynamics... p6). Similarly, Spt6 is included in the model figure, which might be a stretch given the only Spt6 data involves the timing of Spt6 colocalization with Mxc during the cell cycle.

      Misleading language/missed citations:

      p3: "600 kB array" is misleading. The whole locus is ~ 600 kB.

      p3: Mxc may remain at the locus throughout the cell cycle, so the whole HLB does not disassemble (Terzo, 2015).

      p4: H1-specific factors include cramped (Gibert and Karch, 2011; Bodner et al. 2024 bioRxiv)

      p4: Hodkinson, 2023 is not the correct reference. The correct reference is Hodkinson, 2024, Genetics.

      p5: The Drosophila HLB is detectable at NC 10 (White, 2011; Terzo, 2015) not White, 2007

      p5: A typo: "imagining"

      p7: The section title "RNA pol II is necessary for HLB assembly" is incorrect, as Figure 3 shows that pol II is NOT necessary for Mxc recruitment, but for HLB growth. Mxc, however, is necessary for pol II recruitment.

      p9: The authors should clarify what "HisC" means as this is the first usage.

      Figures/experiments:

      Fig 2: The authors should show the gating in Figure I that led to the three categories in Figure J. The legend/colors in Figure J are not necessary.

      An "easy" experiment would be to use the FUCCI cell lines and 5'/3' RNA FISH in combination (assuming fluorophores allow) - OPTIONAL

      Discussion:

      p13: The reference to the work of Gugliemlmi, 2013 should first come up in the Introduction, as it provides rationale.

      p13: "without engaging in transcription" is misleading, as pol II is transcribing, but paused.

      p15: It makes sense for pol II to pause at histone genes in G1, as they are preparing for the rapid burst of histone transcription needed in S phase. But what might be the functional rationale for pol II pausing in G2, if the HLB disassembles in M?

      Methods:

      It should be made clear how embryos were staged for live imaging, as it is likely by timing after cell cycle events. What is this timing? It would be best if this detail is not just mentioned in the methods, but also in the main text. This is especially important for readers not familiar with Drosophila embryogenesis. Please cite/acknowledge DGRC for Fly-FUCCI line (if appropriate)

      Significance

      This study provides convincing evidence that pol II is enriched at the histone locus and paused outside of S-phase. What limits the significance is that several prior studies concluded that Pol II is paused at the histone locus:

      Lu et al. bioRxiv 2024, "Integrator-mediated clustering of poised RNA polymerase II synchronizes histone transcription"

      Suzuki et al. Nat Comm 2022, "The 3' Pol II pausing at replication-dependent histone genes is regulated by Mediator through Cajal bodies' association with histone locus bodies"

      Neither of these studies is discussed or even cited in the manuscript, which is disappointing. Therefore, the advance is limited to the cell-cycle coupling of pausing. This is still important, as a major knowledge gap as outlined by the authors is that it is not clear how histone transcription is coupled to S-phase and they rule out Pol II pausing as a possible mechanism, and point toward Pol II pausing release.

      Moreover, there is also evidence (from these authors) that Mxc phosphorylation is not always coupled to histone transcription in Drosophila ovaries. This work is also not discussed or cited:

      Potter-Birriel et al. J Cell Sci 2021, "A region of SLBP outside the mRNA-processing domain is essential for deposition of histone mRNA into the Drosophila egg"

      The current research may be of interest to the broad cell cycle field, but it may also be useful as a model for those conducting basic, foundational research who seek to describe how Pol II is released from pausing. The histone locus may be of interest as a novel, facile model for pausing.

      Reviewer expertise: Drosophila, chromatin, gene expression

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      __Evidence, reproducibility and clarity __

      This is a well-written manuscript that describes a thorough study of the functionality of individual residues of a central component of the ESX-3 type VII secretion system of Mycobacterium smegmatis, EccD3, in the essential role of this protein transport system in iron acquisition. Using the powerful and unbiased approach of deep mutational scanning (DMS), the authors assessed the impact of different mutations on a large number of residues of this component. This carefully executed research highlights the importance of hydrophobic residues at the center the ubiquitin-like domain, specific residues of the linker domain that connects this domain with the transmembrane domains and specific residues that connect EccD3 with the MycP3 component.

      Major comments

      Since the LOF effects in the iron-sufficient and iron-deficient condition differ less than expected, the differences of the DMS results between these two conditions should be better presented, explained and discussed: 1. The authors discuss: "Of the 270 LOF mutations seen in the iron-deficient condition, 37 (13.7%) were tolerant in the iron sufficient condition, and 39 (14.44%) had strong LOF effects but weak LOF effects in the iron sufficient condition." Do the authors mean that 39 (14.44%) had strong LOF effects in the iron-deficient condition, but weak LOF effects in the iron-sufficient condition. In turn, does this mean that the remaining mutants (71.9%) had similar LOF effects in the two conditions?

      We thank this reviewer for their comment and for highlighting a lack of clarity. We have updated the main text to more effectively communicate our point - that 270 mutants had LOF effects in the iron-deficient media. 37 of these 270 mutants were tolerant in the iron-sufficient media. 39 of these 270 mutants had strong LOF effects in iron-deficient media, but were weak LOF in iron-sufficient media. The remaining 124/270 mutants had weak LOF effects in both conditions. The larger point is that removing iron leads to stronger selection - tolerant mutants become LOF, weak LOF become strong LOF. Removing iron pushes mutants at the bounds over the limit.

      __ The diagonal shape of the scatter plot in Fig. 2C, which shows the correlation of the Enrich2 scores of all mutants in the two conditions, indicates that the growth of most mutants is affected similarly in these conditions, but in Fig. 2D lower graph, which shows only the Enrich2 scores of missense mutants, there are clear differences between the two conditions. How can this be explained?__

      We apologize for any confusion created by this presentation of our data. We hoped to highlight that while effects are largely similar across conditions, there are some differences. As communicated in our first response, 270 out of our ~2700 missense mutations had LOF effects in the iron-deficient condition. 37 of these 270 mutants were tolerant in the iron-sufficient media. 39 of these 270 mutants had strong LOF effects in iron-deficient media, but were weak LOF in iron-sufficient media. The remaining 124 mutations had weak LOF effects in both conditions.

      While Figure 2C shows this difference, it is hard to see by nature of using a scatter plot. We have added contours to highlight how our data is distributed. Our density plots in Figure 2D are meant to try to highlight these differences, where the top plot is showing the effects of all missense mutations. Negatively scored mutations represent LOF effects, mutations with scores around 0 are considered tolerant, and the extremely rare scores with positive scores have GOF effects. Our bottom plot specifically zooms into the negatively scored mutations, to show the 270 LOF mutants we discussed. Specifically, we were hoping to highlight the 39 mutations that have strong LOF effects in iron-deficient media (so the purple line scores are more negative), but weak LOF effects in iron-sufficient media (the green line scores are less negative).

      __ Regarding the authors' explanation for the observed LOF effects in the permissive condition, "This speaks to the sensitivity of next-generation sequencing compared to the strong differences observed between conditions in phenotypic growth curves." But this sensitivity does not explain the observed large LOF effects but no growth difference in the permissive condition, unless the analysis is less quantitative than expected? Could it be that there is local iron depletion in this mixed culture, causing selection pressure even in the iron-sufficient condition? Moreover, the severity of the growth defect at the time of sampling, i.e., after 24 hours of growth, is unclear. Indeed, the growth curve in Fig. 1 shows that the growth of the double mutant in iron-deficient conditions is significantly impaired at that timepoint. In the growth curve in Fig. 2B (and also slightly in Fig. 2F), however, the growth defect is less pronounced: the double mutant has a similar OD600 as the WT strain, although the error bar is larger. Is this variability between replicates also seen in the DMS analysis? In general, no statistics are shown for the DMS analysis and there is no information on the significance of the observed LOF effects. In addition, the legend should explain how many replicates the DMS data are based on.__

      We thank this reviewer for their comment and for highlighting a point of confusion. In addition to increased sensitivity in next generation sequencing compared to our growth curve experiments, our data analysis and variant scoring was performed by comparing growth rates of our mutant strains to our wild type strain. So, any effect on viability or growth rates seen by expression mutant variants will be more notable in our DMS scoring, as they are relative to wild type. In contrast, our growth curves are plotted as the raw OD600 values of each strain. We believe this difference underlies the difference seen in our heatmaps and growth rates.

      It is also a relevant and important point that our libraries are grown as mixed cultures, where there is competition over the limited iron in their growth media, as we highlight in our discussion.

      While the double mutant does show a stark growth defect at 24 hours in Figure 1 compared to the WT and complement, it grows just as well as those strains in Figure 2B. The growth defect becomes notable after 24 hours. Within this experiment, we observed variability in growth at the 24hr timepoint for the negative control strain, but also selection when compared to the positive control and library growth at later time points. We analyzed our DMS data in accordance with typical methods used in the field (see: https://doi.org/10.1186/s13059-017-1272-5). We include statistics for the DMS analysis as supplemental Figure 1. We apologize for any confusion regarding the figure caption, however in our manuscript we do point out that our library growth in Figure 2B was repeated in triplicate in the figure caption, and the samples collected during that experiment were the ones used to generate the DMS data.

      Minor comments

      1. Line and page numbering should be added to the manuscript to facilitate the reviewing process.

      We have updated our manuscript to include line and page numbering.

      __ "Knockout of the entire ESX-3 operon leads to inhibited M. smegmatis growth in a low-iron environment. When individual components of the ESX-3 system are deleted, growth is only available under impaired if the additional siderophore exochelin formyltransferase fxbA is also knocked out20." First, a reference should be added to the first sentence. Second, Siegrist et al. did not exactly show this. They showed that the fxbA/eccC3 double mutant grows slower that the fxbA single mutant. To my knowledge there is no publication showing that single esx-3 component mutants grow as WT in iron-deficient conditions. Do the authors have data demonstrating this? If true, it is surprising that mutating EccD3 has a milder phenotype compared the complete region deletion, as it is a crucial ESX-3 component.__

      We apologize for any confusion. We had the relevant reference two lines prior, and have since added it to that sentence as well.

      The reviewer is correct that Siegrest et al did not show the effects of just ESX-3 component single deletions. However, Siegrest et al. 2009 demonstrated that deleting the entire ESX-3 operon results in growth similar to the wild type strain in low-iron media. In contrast, the fxbA single knockout exhibits a notable growth defect, and the fxbA/ESX-3 double knockout has an even more severe growth defect. Following the logic that a double knockout is needed to observe a growth defect in low-iron media, Siegrest et al. 2014 demonstrated this also extends to single ESX-3 component knockouts, such as the fxbA/eccD3 double knockout strain. To ensure clarity and accuracy, I will edit the sentence to say "When individual components of the ESX-3 system are deleted, growth is significantly impaired when the additional siderophore exochelin formyltransferase fxbA is also knocked out."

      __ Reference to Table 1, should be a reference to Table S1.__

      We have updated our manuscript to correct this reference.

      __ "Our heatmaps surprisingly reveal residues where substitutions are deleterious specifically in the iron-sufficient condition" Refer here to Fig. S2.__

      We have updated our manuscript to include this reference.

      __ "In the iron-deficient condition, 6/551 (1.08%) missense mutations have a weak LOF effect, and 0 have strong effects." More clearly explain this refers to the residues of the transmembrane region.__

      We have updated our manuscript to provide more clarity.

      __ "The MycP transmembrane helix has been hypothesized to be required for ESX complex specificity, targeting MycP to associate with the correct ESX homologue." I miss a reference here. And I thought that the transmembrane domain of MycP was required for complex stability not for specificity?__

      We thank the reviewer for pointing out our missing citation, and asking us to clarify our point. I believe the literature suggests that both the protease and transmembrane domains of MycP are required for both complex stability and specificity. van Winden et al. 2016 https://doi.org/10.1128/mbio.01471-16 show that MycP5 needs to be present for secretion. The protease activity can be abolished and the ESX-5 complex can still secrete and be pulled down, as seen by BN-PAGE. van Winden et al. 2019 https://doi.org/10.1074/jbc.RA118.007090 show that truncated mutants missing either the protease domain or the transmembrane domain cannot rescue ESX-5 secretion or complex stability in a MycP knockout strain. More relevant, they attempted to rescue MycP1 and MycP5 mutants by creating chimeric proteins that either had the MycP1 protease domain and MycP5 transmembrane domain, or the MycP5 protease domain and MycP1 transmembrane domain. If the protease and transmembrane domains were required for complex stability and NOT specificity, we would see MycP5 rescue ESX-1 secretion in the MycP1 mutant strains and vice versa. We would also see the chimera proteins rescue both ESX-1 and ESX-5 secretion and complex stability. Instead, we see that neither chimera rescued ESX-1 nor ESX-5 secretion or complex stability, implying that both MycP domains are necessary.

      We will amend our paper text to reference MycP's role in complex stability instead of specificity, and soften the language: "The MycP transmembrane helix has been shown to be required for ESX complex stability, as MycP knockouts and truncated mutants abolish ESX secretion and pulldowns of the entire complex."

      __ "....role in ESX function relating to EccB3 and EccC3. In the transmembrane, ..... we" Insert "region" after "transmembrane"__

      We have updated our manuscript to include this update.

      Significance

      The study provides insight into individual residues of a central component of the ESX-3 type VII secretion system for functionality, which is useful for those studying the functioning of mycobacterial type VII secretion systems. Moreover, because this system is essential for the growth of the important pathogen M. tuberculosis, this knowledge can be used to design new anti-tuberculosis compounds that block the ESX-3 system. Although the results mainly confirm previous observations (highlighting specific residues important for the stability of ubiquitin and residues of other parts of EccD important for protein-protein interactions within the ESX-3/ESX-5 membrane complex), to my knowledge this is the first time DMS has been applied to mycobacteria. This study is therefore of interest to mycobacteriologists.


      Reviewer #2

      __Evidence, reproducibility and clarity __

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system. 1. The authors engineered an M. smegmatis knockout strain with deletions of fxbA and eccD3. Deletion of fxbA renders the exocholin iron uptake system non-functional, forcing the bacteria to rely on siderophore-mediated iron uptake under iron-limiting conditions. This process, in turn, depends on ESX-3 secretion activity, as PPE4, a known ESX-3 substrate, has been previously implicated in iron utilization in M. tuberculosis (Tufariello et al., 2016). This experimental setup links EccD3 function to a growth phenotype under iron-limiting conditions, as mutations impairing ESX-3 secretion disrupt iron utilization and mycobacterial growth. 2. By complementing the knockout strain with a library of EccD3 mutant variants, the authors systematically identify residues essential for protein-protein interactions within the ESX-3 core complex. Structural analysis corroborates the functional relevance of these residues, specifically those mediating interactions between EccD3 and other ESX-3 components, or those disrupting the hydrophobic core of the EccD3 ubiquitin-like (Ubl) domain. 3. Structural comparisons with the MycP5-bound ESX-5 complex allow the authors to predict residues within EccD3 that may interact with MycP3 during ESX-3 core complex assembly. Furthermore, comparisons with the ESX-5 hexamer suggest residues that may stabilize or drive oligomerization of the ESX-3 dimer into its putative hexameric state. These insights are significant and provide testable hypotheses for future studies. 4. The methodology is limited to ESX-3. The authors exploit the essentiality of ESX-3 for siderophore-dependent growth under iron-limiting conditions. However, this functional readout cannot be directly transferred to other ESX systems (ESX-1, ESX-2, ESX-4, ESX-5), which have distinct substrates, biological roles, and regulatory mechanisms.

      Significance

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system.

      Thank you for your thoughtful and supportive feedback. We appreciate your time and effort in reviewing our study.


      Reviewer #3

      __Evidence, reproducibility and clarity __

      The manuscript by Trinidad et al. provides a deep mutational scanning (DMS) analysis to investigate the functional roles of residues from the EccD3 subunit of the Type VII ESX-3 secretion apparatus from M. smegmatis. A previously published structure of ESX-3 from M. smegmatis by the Rosenberg group (Oren Rosenberg is also an author of this paper) is used as basis for structural interpretation of the DMS data presented in this contribution. A shortcoming of the previous structure, despite being very rich in terms of structural details, was in the lack of hexameric pore formation, which has been established more recently by structures of the related ESX-5 system.

      Technically, DMS is state-of-the art and a powerful approach to systematically scan residues of potential functional interest. Therefore, the data presented here, provide a remarkable repository for further interpretation in this contribution and by other future investigations. The experimental data have been deposited in Github enabling access by others in the future.

      Overall, the paper would benefit from an improved overall organisation. I found in part hard to extract some of the main points from the way the data are presented. In essence, two separate screens were performed, the first one focusing on the EccD3 Ubl domain and adjacent linker regions and a second one on the EccD3 TM region. I think the paper could be better structured accordingly. Tables of residues with strong effects in iron-deficient and iron-sufficient media, together with their structural annotation, would facilitate extracting main messages from this manuscript. Without going too much in detail, there is also scope for improvement of most of the structural figures. More consistency in terms of color coding with the previous paper by Powileit et al. (2019) would also help navigation.

      A potential weakness of the paper is in the limited scope of interpretation of the data in the context of the dimeric ESX-3 assembly, which is actually acknowledged by the authors. Computational AI-based methods should allow generating a complete pore model of ESX-3, which would allow interpretation of some of the data in a more functional relevant context. This would enhance the validity of the current interpretations presented.

      We acknowledge the lack of a hexameric ESX-3 structure, and would love to base our analysis on such a structure. Unfortunately, experimentally purifying and determining such a structure is beyond the scope of this manuscript. While AI-based methods are certainly exciting and helpful to make sense of mutational data, they are not able to computationally predict such large structures. The AlphaFold3 server website is commonly used for these purposes and allows predictions of up to 5000 tokens (or amino acids). An ESX-3 hexamer would be composed of 6x EccB proteins (519 AA each), 6x EccC proteins (1326 AA each), 12x EccD proteins (476 AA each), and 6x EccE proteins (310 AA each). Together, this complex would be made up of 18,642 amino acids.

      We tried using alphafold to predict an ESX-5 dimer complex, as well as reproduce the ESX-3 dimer complex, and were unable to produce these structures. Each ESX protomer is assembled correctly, as each protein within the complex makes appropriate contacts with each other. We see the EccD-dimers still form the membrane vestibule within each ESX complex. The issue is the ESX dimer complex has not assembled correctly: the EccC transmembrane helix 1 of a protomer should interact with the EccB transmembrane helix of the neighboring protomer; and, the N-terminus of EccB in one protomer should interact with the loop between the EccD transmembrane helices 10 and 11 in the neighboring protomer. Instead, Alphafold creates contacts along the EccD proteins from both complexes. We have included a "top-down" view of the ESX-5 dimer, where the periplasmic domains of EccB have been cleaved off for clarity.

      A side view:

      Here we have the ESX-3 dimer structure published by Poweleit et al. side-by-side with the ESX-3 dimer predicted by alphafold, visualized in Pmyol. The alphafold structure largely has each proteins' domains and folds properly predicted, including even the EccD3 dimer found in each ESX protomer. However, the protomers are not assembled into a dimer properly as compared to the purified ESX-3 dimer from PDB: 6umm. We included a "front" and "side view", as well as a "top down" view where the cytoplasmic domains have been hidden for visual clarity.

      The use of full names and acronyms needs to be more consistent. As an example, the terms "ubiquitin-like" and ubiquitin-like (Ubl) and UBl are used in parallel throughout the manuscript. The percentages given in various places of the paper could be reduced to integers, as they generally relate to relatively small data sets. Please express numbers with a precision, reasonable matching expected statistical significance.

      We apologize for the lack of consistency in how we referred to the ubiquitin-like domain. I originally wrote "ubiquitin-like (Ubl)" once per section (intro, results, discussion). I have edited these all to just "Ubl" after the introduction, except for figure and section titles. We have also reduced our percentages to integers.

      Some of the DMS experiments have been repeated three-fold, which should be a minimal number to allow extracting statistical significance, other experiments have only been repeated two-fold. Could this be clarified, please?

      We apologize for this oversight, and thank the reviewer for pointing this out. All experiments were done in triplicate, the exception being the site-directed mutant growth curves, which were performed in duplicate. We have repeated this experiment in triplicate in response to this point. As we repeated this experiment, mutant R134A dropped out due to technical reasons, and so we did not include it in the updated growth curves.

      Specific comments on text and figures:

      Figure 1: The EM densities shown considerably deviate from those that were shown in the original publication by Poweleit et al (2019). If there is an aim is to reinterpret the data this needs to be described in sufficient technical detail. There may be a case for this, in light of recent advances in computational AI-vased structural biology.

      We acknowledge this may be confusing and we apologize for that, as the EM density I have shown in this manuscript uses the same map we used to create the one seen in the original publication Poweleit et al 2019. There are existing crystal structures of EccB1 and the ATPase domains of EccC1 that we used to create homology models of EccB3 and EccC3 using the structure-prediction software RaptorX for the 2019 publication. These homology models were then combined with a low resolution EM density to create the model seen in the 2019 eLife paper. I did not include those homology models in this manuscript, as I did not believe those predictions were relevant to this study. I wanted to include the highest resolution and thus most accurate depiction of our ESX-3 structure.

      Introduction, statement "We made comparisons to a prior DMS on ubiquitin to increase signal-to-noise in our interpretation of the Ubl domain mutagenesis data." Could this be further explained please? I could not find anything in addition in the Methods section and elsewhere.

      __ __We apologize for the confusion!

      EccD3 Ubl domain and ubiquitin DMS dataset comparisons

      To compare the DMS data of EccD3 Ubl with that of ubiquitin, we first identified homologous residues in each structure. This was achieved by aligning the EccD3 Ubl domain with ubiquitin (PDB: 1ubq) using PyMOL and assessing the positional correspondence of side chains (e.g., ubiquitin residue I3 aligned with EccD3 residue V12). Next, we referenced missense mutation datasets to calculate the average DMS score for each residue position in both proteins. We then generated a scatter plot to compare the average missense scores for ubiquitin and EccD3 Ubl using ggplot2. Data points were color-coded according to the functional roles assigned to ubiquitin, with residues forming the hydrophobic patch and core highlighted, while all other residues were represented in grey.

      Description of "vestibule" as a core feature of the ESX-3 structure. As mentioned above, this is very much a result of the presented dimeric arrangement. In the context of a complete pore model, these features may change or even disappear.

      While we would certainly welcome an ESX-3 hexamer model to definitively determine whether this feature persists, such a model is not currently available. However, the highly homologous ESX-5 complex retains these EccD vestibules, and there is no reason to believe these features would change or disappear. Therefore, based on our interpretation of the ESX-3 dimer and ESX-5 hexamer we believe that the EccD membrane vestibule is not just an artifact of the ESX-3 dimer complex.

      It is possible that the reviewer misunderstood what we were referring to as the vestibule. We updated the language in the text to improve clarity. However the vestibule is not a consequence of ESX-3 complex dimer formation. It is an inherent feature of the ESX monomer complexes, where two EccD proteins dimerize to form said vestibule. Furthermore, there is no evidence to suggest that this feature would be lost in a hexameric state.

      Structurally, the ESX-3 dimer consists of two ESX-3 monomer complexes, each containing one EccB, one EccC, one EccE, and two EccD proteins. Therefore, each ESX-3 monomer inherently includes an EccD dimer. The presence of the EccD dimer is not exclusive to the ESX-3 dimer but is a fundamental component of each ESX-3 complex. Similarly, the ESX-5 hexamer retains the EccD dimer within each ESX-5 complex, further supporting the idea that this structural feature is conserved.

      Figure 2, panel B: Isn't right that "positive" and "negative" need to exchanged? Perhaps, there is something I misunderstood.

      We apologize for the confusion, and appreciate the reviewer pointing out this inconsistency. We have updated the manuscript to correct this.

      Figure 2, panel F: it is hard to extract the assignments from the overlaid curves.

      We apologize for a lack of clarity in how this growth curve was presented. We have included labels at the end point to show where each sample is.

      Figure 3, caption "from low (red) to white (tolerant)": for the sake of consistency, please either put the color in parentheses, or functional description. Does this statement relate to panel A or B? "All other residues are colored white". I can't see this.

      We apologize for the inconsistency, and have updated this label. We hope we have clarified the fact that the entire structure is white except for the residues we colored red.

      Results text "In contrast to ubiquitin, all hydrophobic core residues in the EccD3 Ubl domain are equally intolerant to charged residue swaps. Unsurprisingly, residues important for ubiquitin's specific degradation interactions are not sensitive to substitutions in the EccD3 Ubl domain." Does this mean that proper folding of Ubl is less critical for ESX_3 function? Please elaborate on this further.

      We apologize for any confusion. Our data shows that residues which side chains extend into the hydrophobic core of the Ubl domain are intolerant to swaps to charge residues. We hypothesize these missense mutations disrupt this hydrophobic core, and lead to destabilization of this domain. These intolerant missense mutations each have negative Enrich2 scores, implying a loss of ESX-3 function, and that proper folding of the Ubl is critical for ESX-3 function. We have updated our text to clarify this point:

      Unsurprisingly, residues important for ubiquitin function's specific interactions are not sensitive to substitutions in the EccD3 Ubl domain. There is no simple discernable preference within the Ubl domain to any side that maintains protein-protein interactions, implying that the scores are dominated by stability effects and that the Ubl domain must maintain a stable β-grasp fold for ESX-3 function.

      Figure 4, panel C: the surface does not provide residue-specific information, hence this panel is not very informative.

      We agree with the reviewer that Figure 4 panel C was not very informative, and so we have removed it from Figure 4 for the sake of brevity.

      Results text "T148 extends out from transmembrane helix 1 into a hydrophobic pocket between transmembrane helices 1, 2, and 3." Could this please be illustrated in one of the structural presentations?

      We have updated figure 5 to include a snapshot of this residue and the hydrophobic pocket it extends into, as panel E.

      Results text, last paragraph, Figure 5C-D: interpretation of the experimental ESX-3 data based on ESX-5 models is problematic, without showing proof of conservation of relevant sequence/structural features. As mentioned above, I would encourage the authors to establish a hexameric ESX-3 model and interpret the data starting from there. Extrapolation of the interpretation of data to other ESX systems, including ESX-5, would expand the scope by generalization, which however would open another chapter. The ESX-5 structure does not explain e.g. why W227 when mutated is less sensitive to iron depletion as opposed to iron being present.

      We do not believe we can use AI to predict a hexameric ESX-3 model. We will update our supplement to include a figure showing proof of conservation between the EccD3 and EccD5 sequences. We can superpose the ESX-3 dimer structure onto the ESX-5 hexamer structure, and see that this dimeric complex overlays quite well on top of an ESX-5 subcomplex. We can imagine this hexamer as a trimer of dimers, where three copies of this dimeric complex interact to form the hexamer. The superposition is not perfect and there are slight rearrangements to different helices to allow for hexamer formation, but these do not imply we cannot compare these two homologous structures.

      We have included a new structure snapshot in Figure 5, where panel D is the ESX-3 dimer (PDB: 6umm) shown as a side and top-down view. This allows for a comparison with panel C, the snapshot of the ESX-5 complex (PDB: 7np7) where in two protomers the EccB, EccC, and EccD proteins are colored the same way as ESX-3, and the other ESX-5 protomers are colored white. Note that in this hexamer, EccE is missing. We see the EccD membrane vestibule is conserved in both structures.

      Significance

      Strength and Limitations: already assessed under "Evidence, reproducibility and clarity".

      There is scope for further interpretation using experimental structural and modeling data. There is also scope for applying complementary assays for selected mutants, most likely within a lower throughput format.

      Advance: The contribution demonstrates well the power of DMS for systematic screening, in the context of Type VII secretion. The main advance is in the raw data generated and deposited.

      Audience: microbiology with a specific interest in secretion, structural biology

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Trinidad et al. provides a deep mutational scanning (DMS) analysis to investigate the functional roles of residues from the EccD3 subunit of the Type VII ESX-3 secretion apparatus from M. smegmatis. A previously published structure of ESX-3 from M. smegmatis by the Rosenberg group (Oren Rosenberg is also an author of this paper) is used as basis for structural interpretation of the DMS data presented in this contribution. A shortcoming of the previous structure, despite being very rich in terms of structural details, was in the lack of hexameric pore formation, which has been established more recently by structures of the related ESX-5 system.

      Technically, DMS is state-of-the art and a powerful approach to systematically scan residues of potential functional interest. Therefore, the data presented here, provide a remarkable repository for further interpretation in this contribution and by other future investigations. The experimental data have been deposited in Github enabling access by others in the future.

      Overall, the paper would benefit from an improved overall organisation. I found in part hard to extract some of the main points from the way the data are presented. In essence, two separate screens were performed, the first one focusing on the EccD3 Ubl domain and adjacent linker regions and a second one on the EccD3 TM region. I think the paper could be better structured accordingly. Tables of residues with strong effects in iron-deficient and iron-sufficient media, together with their structural annotation, would facilitate extracting main messages from this manuscript. Without going too much in detail, there is also scope for improvement of most of the structural figures. More consistency in terms of color coding with the previous paper by Powileit et al. (2019) would also help navigation.

      A potential weakness of the paper is in the limited scope of interpretation of the data in the context of the dimeric ESX-3 assembly, which is actually acknowledged by the authors. Computational AI-based methods should allow generating a complete pore model of ESX-3, which would allow interpretation of some of the data in a more functional relevant context. This would enhance the validity of the current interpretations presented.

      The use of full names and acronyms needs to be more consistent. As an example, the terms "ubiquitin-like" and ubiquitin-like (Ubl) and UBl are used in parallel throughout the manuscript. The percentages given in various places of the paper could be reduced to integers, as they generally relate to relatively small data sets. Please express numbers with a precision, reasonable matching expected statistical significance.

      Some of the DMS experiments have been repeated three-fold, which should be a minimal number to allow extracting statistical significance, other experiments have only been repeated two-fold. Could this be clarified, please?

      Specific comments on text and figures:

      Figure 1: The EM densities shown considerably deviate from those that were shown in the original publication by Poweleit et al (2019). If there is an aim is to reinterpret the data this needs to be described in sufficient technical detail. There may be a case for this, in light of recent advances in computational AI-vased structural biology.

      Introduction, statement "We made comparisons to a prior DMS on ubiquitin to increase signal-to-noise in our interpretation of the Ubl domain mutagenesis data." Could this be further explained please? I could not find anything in addition in the Methods section and elsewhere.

      Description of "vestibule" as a core feature of the ESX-3 structure. As mentioned above, this is very much a result of the presented dimeric arrangement. In the context of a complete pore model, these features may change or even disappear.

      Figure 2, panel B: Isn't right that "positive" and "negative" need to exchanged? Perhaps, there is something I misunderstood.

      Figure 2, panel F: it is hard to extract the assignments from the overlaid curves.

      Figure 3, caption "from low (red) to white (tolerant)": for the sake of consistency, please either put the color in parentheses, or functional description. Does this statement relate to panel A or B? "All other residues are colored white". I can't see this.

      Results text "In contrast to ubiquitin, all hydrophobic core residues in the EccD3 Ubl domain are equally intolerant to charged residue swaps. Unsurprisingly, residues important for ubiquitin's specific degradation interactions are not sensitive to substitutions in the EccD3 Ubl domain." Does this mean that proper folding of Ubl is less critical for ESX_3 function? Please elaborate on this further.

      Figure 4, panel C: the surface does not provide residue-specific information, hence this panel is not very informative.

      Results text "T148 extends out from transmembrane helix 1 into a hydrophobic pocket between transmembrane helices 1, 2, and 3." Could this please be illustrated in one of the structural presentations?

      Results text, last paragraph, Figure 5C-D: interpretation of the experimental ESX-3 data based on ESX-5 models is problematic, without showing proof of conservation of relevant sequence/structural features. As mentioned above, I would encourage the authors to establish a hexameric ESX-3 model and interpret the data starting from there. Extrapolation of the interpretation of data to other ESX systems, including ESX-5, would expand the scope by generalization, which however would open another chapter. The ESX-5 structure does not explain e.g. why W227 when mutated is less sensitive to iron depletion as opposed to iron being present.

      Referee cross-commenting

      I especially second the comments of referee #1, major comments, point 3 (statistical significance of the data). Addressing this point is crucial for the paper. Referee #2, significance section "The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system." I was considering making the same point but did not at the end. Of course, ultimately, it would be great if all components of ESX-3 could be analyzed they way it was done for the EccD3 component. However, I am afraid such exercise could become quite open ended. Already by now, there is some compromise on the depth of mechanistic interpretation in light of a large data set generated.

      Significance

      Strength and Limitations: already assessed under "Evidence, reproducibility and clarity".

      There is scope for further interpretation using experimental structural and modeling data. There is also scope for applying complementary assays for selected mutants, most likely within a lower throughput format.

      Advance: The contribution demonstrates well the power of DMS for systematic screening, in the context of Type VII secretion. The main advance is in the raw data generated and deposited.

      Audience: microbiology with a specific interest in secretion, structural biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system.

      1. The authors engineered an M. smegmatis knockout strain with deletions of fxbA and eccD3. Deletion of fxbA renders the exocholin iron uptake system non-functional, forcing the bacteria to rely on siderophore-mediated iron uptake under iron-limiting conditions. This process, in turn, depends on ESX-3 secretion activity, as PPE4, a known ESX-3 substrate, has been previously implicated in iron utilization in M. tuberculosis (Tufariello et al., 2016). This experimental setup links EccD3 function to a growth phenotype under iron-limiting conditions, as mutations impairing ESX-3 secretion disrupt iron utilization and mycobacterial growth.
      2. By complementing the knockout strain with a library of EccD3 mutant variants, the authors systematically identify residues essential for protein-protein interactions within the ESX-3 core complex. Structural analysis corroborates the functional relevance of these residues, specifically those mediating interactions between EccD3 and other ESX-3 components, or those disrupting the hydrophobic core of the EccD3 ubiquitin-like (Ubl) domain.
      3. Structural comparisons with the MycP5-bound ESX-5 complex allow the authors to predict residues within EccD3 that may interact with MycP3 during ESX-3 core complex assembly. Furthermore, comparisons with the ESX-5 hexamer suggest residues that may stabilize or drive oligomerization of the ESX-3 dimer into its putative hexameric state. These insights are significant and provide testable hypotheses for future studies.
      4. The methodology is limited to ESX-3. The authors exploit the essentiality of ESX-3 for siderophore-dependent growth under iron-limiting conditions. However, this functional readout cannot be directly transferred to other ESX systems (ESX-1, ESX-2, ESX-4, ESX-5), which have distinct substrates, biological roles, and regulatory mechanisms.

      Significance

      This work provides valuable insights into EccD3 function, a core component of the ESX-3 secretion system. The strength of this study lies in the development of a robust functional assay for the systematic mapping of functionally relevant amino acids in EccD3. The approach could potentially be expanded to analyze other ESX-3 components but remains limited to the ESX-3 secretion system.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a well-written manuscript that describes a thorough study of the functionality of individual residues of a central component of the ESX-3 type VII secretion system of Mycobacterium smegmatis, EccD3, in the essential role of this protein transport system in iron acquisition. Using the powerful and unbiased approach of deep mutational scanning (DMS), the authors assessed the impact of different mutations on a large number of residues of this component. This carefully executed research highlights the importance of hydrophobic residues at the center the ubiquitin-like domain, specific residues of the linker domain that connects this domain with the transmembrane domains and specific residues that connect EccD3 with the MycP3 component.

      Major comments

      Since the LOF effects in the iron-sufficient and iron-deficient condition differ less than expected, the differences of the DMS results between these two conditions should be better presented, explained and discussed:

      1. The authors discuss: "Of the 270 LOF mutations seen in the iron-deficient condition, 37 (13.7%) were tolerant in the iron sufficient condition, and 39 (14.44%) had strong LOF effects but weak LOF effects in the iron sufficient condition." Do the authors mean that 39 (14.44%) had strong LOF effects in the iron-deficient condition, but weak LOF effects in the iron-sufficient condition. In turn, does this mean that the remaining mutants (71.9%) had similar LOF effects in the two conditions?
      2. The diagonal shape of the scatter plot in Fig. 2C, which shows the correlation of the Enrich2 scores of all mutants in the two conditions, indicates that the growth of most mutants is affected similarly in these conditions, but in Fig. 2D lower graph, which shows only the Enrich2 scores of missense mutants, there are clear differences between the two conditions. How can this be explained?
      3. Regarding the authors' explanation for the observed LOF effects in the permissive condition, "This speaks to the sensitivity of next-generation sequencing compared to the strong differences observed between conditions in phenotypic growth curves." But this sensitivity does not explain the observed large LOF effects but no growth difference in the permissive condition, unless the analysis is less quantitative than expected? Could it be that there is local iron depletion in this mixed culture, causing selection pressure even in the iron-sufficient condition? Moreover, the severity of the growth defect at the time of sampling, i.e., after 24 hours of growth, is unclear. Indeed, the growth curve in Fig. 1 shows that the growth of the double mutant in iron-deficient conditions is significantly impaired at that timepoint. In the growth curve in Fig. 2B (and also slightly in Fig. 2F), however, the growth defect is less pronounced: the double mutant has a similar OD600 as the WT strain, although the error bar is larger. Is this variability between replicates also seen in the DMS analysis? In general, no statistics are shown for the DMS analysis and there is no information on the significance of the observed LOF effects. In addition, the legend should explain how many replicates the DMS data are based on.

      Minor comments

      1. Line and page numbering should be added to the manuscript to facilitate the reviewing process.
      2. "Knockout of the entire ESX-3 operon leads to inhibited M. smegmatis growth in a low-iron environment. When individual components of the ESX-3 system are deleted, growth is only available under impaired if the additional siderophore exochelin formyltransferase fxbA is also knocked out20." First, a reference should be added to the first sentence. Second, Siegrist et al. did not exactly show this. They showed that the fxbA/eccC3 double mutant grows slower that the fxbA single mutant. To my knowledge there is no publication showing that single esx-3 component mutants grow as WT in iron-deficient conditions. Do the authors have data demonstrating this? If true, it is surprising that mutating EccD3 has a milder phenotype compared the complete region deletion, as it is a crucial ESX-3 component.
      3. Reference to Table 1, should be a reference to Table S1.
      4. "Our heatmaps surprisingly reveal residues where substitutions are deleterious specifically in the iron-sufficient condition" Refer here to Fig. S2.
      5. "In the iron-deficient condition, 6/551 (1.08%) missense mutations have a weak LOF effect, and 0 have strong effects." More clearly explain this refers to the residues of the transmembrane region.
      6. "The MycP transmembrane helix has been hypothesized to be required for ESX complex specificity, targeting MycP to associate with the correct ESX homologue." I miss a reference here. And I thought that the transmembrane domain of MycP was required for complex stability not for specificity?
      7. "....role in ESX function relating to EccB3 and EccC3. In the transmembrane, ..... we" Insert "region" after "transmembrane"

      Significance

      The study provides insight into individual residues of a central component of the ESX-3 type VII secretion system for functionality, which is useful for those studying the functioning of mycobacterial type VII secretion systems. Moreover, because this system is essential for the growth of the important pathogen M. tuberculosis, this knowledge can be used to design new anti-tuberculosis compounds that block the ESX-3 system. Although the results mainly confirm previous observations (highlighting specific residues important for the stability of ubiquitin and residues of other parts of EccD important for protein-protein interactions within the ESX-3/ESX-5 membrane complex), to my knowledge this is the first time DMS has been applied to mycobacteria. This study is therefore of interest to mycobacteriologists.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02655

      Corresponding author(s): Thierry SOLDATI

      1. General Statements [optional]

      The emergence of powerful model organisms for infection studies accelerates discoveries in innate immunity and conserved cell-autonomous defence mechanisms. Using the genetically tractable Dictyostelium discoideum/Mycobacterium marinum infection platform, we explored the critical interplay between pathogen-induced membrane damage and host repair pathways.

      Recent findings highlight evolutionarily conserved membrane repair pathways as crucial for cellular integrity against both sterile and pathogenic insults. We previously demonstrated the involvement of ESCRT and autophagy machineries in repairing membrane damage and containing pathogenic mycobacteria within vacuoles. Crucially, we uncovered that TrafE, an evolutionarily conserved TRAF-like E3 ubiquitin ligase, coordinates these machineries to repair membrane damage, preventing cell death.

      Here, we reveal that pathogenic mycobacteria manipulate host membrane microdomain scaffolding proteins and sterols to enhance toxin activity and facilitate bacterial escape. Genetic knockout of these microdomain organizers and sterol depletion significantly reduce membrane damage and bacterial escape, effectively containing mycobacteria and increasing host resistance. The conserved roles of flotillin and sterols are confirmed in murine microglial cells, underscoring evolutionary conservation.

      These discoveries significantly advance understanding of intracellular host-pathogen interactions, offering broad implications for cellular microbiology and immunology and have already attracted wide interest at major international scientific meetings.

      Thanks to the constructive criticisms and suggestions of the referees, we were able to significantly enhance the manuscript by integrating novel experimental strategies and improving presentation and discussion of previous results that together further strengthen our evidence.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The proposed study aims to elucidate the role of membrane microdomains and associated proteins-Vacuolin A, B, and C-during the infection of Dictyostelium discoideum (Dd) amoebae by Mycobacterium marinum (Mm). The results demonstrate that Vacuolins are required for Mm virulence, and that the presence of membrane microdomains is essential for phagosome membrane damage and bacillary escape into the cytosol-key steps in establishing a successful infection and subsequent bacterial proliferation. The study is well-designed, employing methodologies with which the authors have demonstrated expertise. Overall, it is methodologically sound, and most conclusions are well-supported by the presented data. However, some points require clarification.

      We thank the referee for their positive evaluation of the scope and strengths of our manuscript. The constructive criticisms of the referees were important to guide our revisions. We are convinced that the new data now integrated further strengthen our evidence.

      Major Points:

      The study aims to link the function of Dd Vacuolins to their potnetial facilitating role in phagosome escape and overall infection by Mm. To phenocopy the effect of Vac-KO, the authors used MβCD. Strikingly, this compound had a more significant impact on phagosome escape compared to Vac-KO, which either did not affect or only mildly affected this process. This likely reflects a difference in the underlying mechanisms being studied. Vac-KO cells may lack well-organized membrane domains but could retain a similar overall membrane composition. In contrast, MβCD disrupts these domains by chelating cholesterol, thus altering both the membrane composition and the domains themselves. This may explain why EsxA partitioning is more affected by MβCD than by triple KO. Consequently, this suggests that cholesterol, rather than the mere presence of membrane domains, plays a critical role in EsxA partitioning and activity in the phagosome. And even if LLOMe activity was lower in Vac-KO cells, this might be explained by the compartment targeted, the lysosomes which membrane composition may differ from the MCV. These points should be further discussed in the discussion section.

      The referee is right on target, these are all excellent points, and we fully agree with the argumentation. If we compare EsxA to a cholesterol-dependent PFT such as SLO, the presence of sterol is an absolute requirement for pore formation, but the local concentration of sterols achieved via clustering and the organisation of lipids/sterols in microdomains "only" increases efficiency (see for example PMID: 39835825). Therefore, the respective impacts of vac-KO and CD treatment differ in "intensity", and are additive in most assays, but are not resulting from "different underlying mechanisms". The simplest and most plausible interpretation of the combined results is that EsxA requires a threshold of local concentration/clustering of sterols to act and vacuolins/flotillins is one of the means to achieve it. In other words, membrane composition inhomogeneities exist in physiological membranes, particularly sterol and sphingolipid clustering in rafts, and microdomain organisers probably regulate their size and dynamics. Without vacuolin/flotillin, these inhomogeneities persist. Only when sterol is depleted and/or redistributed, do they disappear. In brief, the local sterol concentration is the trigger for EsxA preferential partitioning and activity, and many factors besides microdomain organisers influence it.

      The second interesting point is that LLOMe is a lysosomotropic membrane damaging agent, whereas EsxA targets the MCV membrane. We have already documented that the MCV has some endo-lysosomal properties and potentially resembles most the "post-lysosomal" compartment, characterized by a mildly acidic pH (pH ~6), the presence of Rab7 and zinc, ammonium and cupper transporters, for example. Our experiments also show that LLOMe is active in the whole endo-lysosomal pathway, including these post-lysosomes (PMID: 30596802, PMID: 37070811). The exact lipid composition of the MCV and post-lysosomes is not known, but both accumulate sterols in a similar manner. Both compartments are also akin to multivesicular bodies. These data are no direct proof but are compatible with our conclusions that both LLOMe and EsxA require similar threshold of local sterol concentration and that vacuolins are a mean to achieve this.

      The presentation of these conclusions has been revised and enhanced in the discussion (for example lines 396-400 and 437-439).

      Despite these similarities between LLOMe and EsxA activities, note that the mature MCV can be distinguished from all other endo-lysosomal compartments by the use of a Flipper probe that is sensitive to lipid composition and packing (Fig. 7C, and see below). In addition, RNAseq analyses of the impact of vac-KO and sterol depletion on infected and non-infected cells also highlight the interdependence between sterol concentration and vacuolin expression (Fig. 3G, 4G and H, Fig. EV5 and 6, and see below).

      Based on this observation, in figure 2, does the D4H/filipin signal or association increase over time as the Vac signal "solidifies"? In Vac-KO cells, does the mScarlet-D4H signal change in intensity or pattern (building on fig. S1)? These insights could provide valuable information on cholesterol levels at the MCV in KO versus wild-type cells. If possible, the authors should quantify fluorescence or the frequency of signal association.

      Qualitatively, sterols, as visualised by filipin and D4H, are present at all stages of the endo-lysosomal pathway and of MCV biogenesis. Now, there are many technical difficulties linked to a quantitative assessment, and therefore, please, let me present the framework. First, despite their wide use, the exact mechanism of binding of both reporters and which pool of sterol they visualise is still a mystery. This is often expressed as "they detect the accessible pool" of sterol, whatever it is. In addition, filipin detects sterols in both leaflets (and in intra-lumenal vesicles and other lipidic structures), while D4H detects sterols only in the cytosolic leaflet, and it is not known whether both leaflets have the same concentration of sterols. It is also known that filipin signal is only indirectly proportional to the sterol quantity in a cell, as measured by other quantitative methods. One of the best examples comes from studying the cellular phenotype of Niemann-Pick Type C disease, because many publications report a strong increase of filliping staining, whereas lipidomic analyses show at best a two-fold increase in cholesterol in NPC deficient cells. Moreover, technically speaking, D4H is a live probe, and fixation leads to some loss of localisation, probably because sterols are not fixable. On the other hand, filipin is mainly used after chemical fixation, but again sterols are not fixable, and the signal is very likely restricted to the membrane of origin, but not necessarily to the microdomains.

      All this to admit that, despite numerous and rigorous tentatives, we have not been able to reliably obtain quantitative measurements of neither filipin nor D4H signals. Also, these features likely also explain why we were not able to document changes in "patterns" of signals during MCV maturation. We ask for the referee's indulgence about this. Vacuolins remain the best microdomain morphology reporters.

      We nevertheless present additional qualitative D4H and VacC colocalization images in Fig. EV1C.

      Additionally, since Vacuolins do not have a significant impact on phagosome damage or escape, the difference in intracellular growth may be indirect, as suggested in the team's previous study on Vacuolins (DOI: 10.1242/jcs.242974). The authors measured MCV pH in figure S6-could they repeat this experiment to test whether Vacuolins affect MCV maturation? This was investigated in a previous version of the pre-print (DOI: 10.1101/2021.11.16.468763), and if the results still hold, it would strengthen the hypothesis that Vacuolins promote escape by modulating membrane organization, rather than influencing phagosome maturation.

      First, we respectfully disagree that vacuolins have no impact on membrane damage, we explained above why this impact is limited, but nevertheless additive with sterol depletion in most assays, during infection and sterile damage.

      We thank the referee for their excellent knowledge of the literature. Indeed, we previously went to extreme experimental sophistication to interrogate the impact of vac-KO on endo-phagosomal maturation. We were able to demonstrate that the major impact is on the recycling of phagocytic receptors and therefore on the cytoskeleton- and motor-induced deformation of the membrane in a cup that is essential for efficient phagocytosis (but not macropinocytosis). We also demonstrated a minimal effect on maturation, on the kinetics of pH change and delivery/recycling of hydrolases, but these cell biological differences did not translate in an impact on bacteria killing and digestion. As mentioned above, the MCV shares characteristics with post-lysosomes but minimal alterations of endo-lysosomal maturation in vac-KO cells should not be responsible for the strong effect on Mm infection. In other words, we are convinced that these minimal (mainly loss-of-function) perturbations that do not impact killing of food bacteria do not lead to an increased phagosomal "ferocity" and restriction of tough mycobacteria.

      Consequently, we decided not to repeat experiments to measure the pH around wt Mm in vac-KO cells, as it is anyway only slightly and transiently acidified in wt host cells, and previous work did not reveal major differences in endolysosomal compartment pH control (PMID: 32482795). But we agree with the referee that some of the MCV maturation data presented in the previous bioRxiv version are interesting for specialists, despite the indications of extremely small alterations between wt and vac-KO host cells. These data document that in absence of vacuolins, MCV characteristics are slightly altered, but we found no indication that they are more bactericidal in vac-KO cells (Fig. EV8F-H).

      Finally, as a substantial part of this manuscript relies on microscopy and image analysis, the methods section should detail how these analyses were performed. Specifically, for figure 1f, it is unclear how the cells were segmented and fluorescence quantified-was total fluorescence per cell measured, or was an average value used? Figures 5c and 5h could be moved to the supplementary material, and the segmentation method should be explained in the methods section. Additionally, statistical analysis should be more clearly described, justifying the use of one-way or two-way ANOVA, and specifying the post-hoc tests used for group comparisons.

      We fully agree with the referee and have therefore improved the detailed description of image analyses. For example, details for cell segmentation in images originating from infection and LLOMe experiments are succinctly described in the Materials & Methods section (lines 585-588, 594-597, and 639-640), but we now also refer to a methods chapter in press that describe in detail the whole segmentation pipeline (Perret et al. 2025).

      Concerning specifically Fig. 1F, we distinguished infected or bystander cells by the presence of bacteria and quantitated the maximal fluorescence intensity for each cell. Then, we decided on an arbitrary threshold of intensity of 5,000, that corresponds to the maximal signal observed for cells in mock conditions. Then, we quantified the percentage of bystander and infected cells with a higher-than-threshold (>5,000) vacuolin signal intensity. This clarification is now added to the legend of Fig. 1F.

      The statistical analyses applied are described in more detail in each figure legend.

      Reviewer #1 (Significance (Required)):

      This study provides the first direct evidence of the importance of membrane composition and organization in the virulence of Mycobacterium marinum, particularly in facilitating phagosome damage and bacillary escape. Using the well-established model of Dictyostelium discoideum infected with M. marinum, which has frequently been predictive of Mycobacterium tuberculosis behavior within phagosomes, the authors contribute critical insights into the mechanisms of mycobacterial phagosome escape-a key step in cellular invasion and dissemination. These findings have the potential to inform strategies aimed at blocking this escape mechanism, which, as demonstrated in this study, could prevent intracellular bacterial growth.

      This work is significant for advancing our understanding of mycobacterial pathogenesis, particularly by linking membrane microdomain composition to bacterial virulence. It will be highly relevant to researchers studying mycobacteria, intracellular pathogens, and host-pathogen interactions. While the study's use of M. marinum provides valuable insights, a limitation is that these results may not fully translate to M. tuberculosis, and further testing with the latter pathogen will be essential.

      We sincerely thank the referee for their very strong appraisal of our contributions, past and present, much appreciated. We agree that the translation of our findings to Mtb and macrophages is not guaranteed ... but has turned to be surprisingly and satisfyingly consistent in the past. To our delight, a recent article in Nature Communications reports about "Paired analysis of host and pathogen genomes identifies determinants of human tuberculosis" and clearly identified flotillin-1 as a susceptibility factor for tuberculosis (PMID: 39613754). We have introduced a sentence in the discussion that reads "Importantly and consistently with our findings, recent work has revealed flotillins as a major determinant of the fate of Mtb infection in patients, because overexpression of flotillin-1, resulting from particular allele variants, is a host susceptibility factor for Mtb infection (PMID: 39613754)." (Lines 477-480)

      I am an expert in the infection of macrophages by Mycobacterium tuberculosis, the phagosome escape mechanism, and its associated effectors. I also have expertise in microscopy and image analysis. However, I do not specialize in Dictyostelium discoideum biology.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      the authors of this manuscript reported that EsxA, a secreted virulent factor of Mtb or Mm, causes membrane lysis in sterol-rich micro domain. They used the Mm-infected amoeba as an infection model, and characterized the effects of microdomain in Mycobacterium-containing Vacuole (MCV) on EsxA-mediated membrane disruption. They found that disruption of the micro domain through knockout of vacuolins or sterol depletion diminished Mm-induced membrane damage and cytosolic escape. They also found that vacuolins and sterol are essential for EsxA inserting into the membranes in vitro, and flotillin knockdown and sterol depletion conferred the resistance of murine microglial cells to Mm infection. The experiments were well designed and controlled, and the data were convincing.

      We thank the referee for this snappy summary of our main findings and for the positive comment on study design.

      My major comment is that the authors need to justify the use of BV-2 cells that are murine microglial cells, instead of macrophage cell lines, which are more relevant to Mtb/Mm infection.

      We understand the referee's concerns about the host used for Mm infection. First, we would like to argue that it is very true that the detailed biological processes accompanying the infection by Mtb, Mm or in fact any other pathogen depend on the origin and status of the host cell. In the TB field, a plethora of host macrophages, from murine and human origins, primary or immortalised, alveolar or interstitial, M1 or M2 have been used through the decades. Beside a robust agreement on many processes (phagosome maturation arrest, MCV membrane damage, role of xenophagy etc...), some of the crucial outcomes, for example the susceptibility or resistance to Mtb infection and the type of host cell death, have been hotly debated and depend on the host phagocyte identity and status.

      Now, it is true that microglial cells have only rarely been used for Mtb (or Mm) research, but it does not mean that this is not relevant. First, we would like to remind the referee that TB is not only a pulmonary disease, and that among the most disastrous extra-pulmonary sites of infection is the brain, resulting in the devastating tuberculous meningitis. In fact, tuberculous meningitis is the most severe form of tuberculosis with a fatality rate of 20-50% in treated individuals (doi: https://doi.org/10.1101/2025.03.04.641394). A quick literature survey on the topic reveals over 9,000 publications, including very significant contributions, using both Mtb and Mm in animal and human models (PMID: 38745656, PMID: 38264653, PMID: 36862557, PMID: 32057291, PMID: 30645042, PMID: 29352446, PMID: 27935825, PMID: 26041993).

      We have introduced a brief mention of these arguments in the discussion (Lines 456-459).

      In addition, we have already shown that this BV-2 cell line is reliable, they are adherent, motile and constitutively phagocytic and thus do not need to be differentiated with mega-doses of PMA, or any other stimulus. They beautifully recapitulate our findings in the Dd-Mm model (PMID: 38270456, PMID: 25772333), including when used as a host phagocyte to validate anti-infective compounds that were primarily identified using the Dd-Mm platform (PMID: 29500372).

      We have introduced a brief mention of these arguments in the results section (Lines 329-334).

      We also introduced two novel experimental evidence to strengthen the link between the Dd and BV-2 model systems. First, we show using qRT-PCR that, like vacuolins, flotillin-1 is upregulated in BV-2 at 32hpi (Fig. EV9B). Excitingly, as mentioned as response to referee #1, a recent article in Nature Communications reports about "Paired analysis of host and pathogen genomes identifies determinants of human tuberculosis" and clearly identified flotillin-1 as a susceptibility factor for tuberculosis (PMID: 39613754). We have introduced a sentence in the discussion that reads "Importantly and consistently with our findings, recent work has revealed flotillins as a major determinant of the fate of Mtb infection in patients, because overexpression of flotillin-1, resulting from particular allele variants, is a host susceptibility factor for Mtb infection (PMID: 39613754)." (Lines 477-480)

      Second, we used for the first time the LysoFlipper probe to monitor MCV lipid composition and packing during infection (Fig. 7C). These results indicate that in BV-2 cells, as in Dd, the membrane characteristics of the MCV are profoundly different from the standard endo-lysosomal compartments.

      Reviewer #2 (Significance (Required)):

      It is well known that EsxA is membrane-lytic protein playing a role in Mtb/Mm-mediated phagosomal escape. There are other studies that have indicated lipid raft or micro domains in the membrane may play a role in EsxA-mediated membrane damage. This study further confirmed that the sterol-rich micro domain on the membrane has significant influence on the EsxA-mediated membrane disruption both in vitro and in vivo. While this finding is expected, but confirmation with solid experimental evidence is welcomed. This study also identified the genes or proteins required for micro domain organization, vacuolins and flotillin, which could be a target of host-directed therapy. Overall, this study is performed well and the results are convincing.

      We thank the referee for their expert views and comments on the function of EsxA and the lipidic environment in which it is supposed to act. We agree that EsxA has been the centre of attention for decades, but we respectfully disagree that its precise mode of action is known, neither in vitro nor in vivo. First, historically, it took the best of a decade for the field to accept that Mtb was not a strictly vacuolar pathogen. And even when the escape to the cytosol became a fact, the implication of EsxA remained extremely debated. For example, a "petition" was signed and published, arguing against its direct membrane damaging activity (PMID: 28119503). We agree that cumulated evidence now converges against a canonical "pore-forming" activity, but in favour of a "membrane-disrupting" activity. On the other hand, it is true that researchers have reached a form of consensus on the role of low pH to dissociate the EsxA-B dimer, and on the importance of the "physiological" composition of the acceptor membrane (PMID: 31430698, PMID: 35271388, PMID: 17557817). We are convinced that our evidence is not merely expected and confirmatory, but represents a novel, complete, solid, biochemical in vitro, molecular and genetics in vivo demonstration of the role of sterols clustering and microdomain organisers as susceptibility factors for Mm infection in evolutionary distant phagocytes.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Bosmani, Perret et al examines the role of Dictyodistelium discoideum vacuolin proteins in the integrity of the Mycobacterium marinum vacuole membrane. The data demonstrates that loss of vacuolins, similar to sterol depletion, reduced vacuole membrane damage meaning less cytosolic escape of the pathogen and subsequently reduced bacterial replication. The authors demonstrate functional analogy in a mammalian model of infection - where flotillin plays a similar role to the vacuolins - and this is an important demonstration of the utility of the D. discoideum model. The data is well presented and clear.

      We thank the referee for this positive summary of our main findings and of the clarity of results, interpretations and working model.

      Major Comments:

      There is no evidence presented in the manuscript of "microdomains" - while I believe this is likely a true description of what is happening on the vacuole membrane there is no visualisation of this. Both the GFP-Vac vacuole staining and the filipin staining show complete coverage of the vacuole. Perhaps at the 1 hour time points this is more convincing but I think it is worth looking at more of these earlier time points and quantifying these "microdomains" - i.e. proportion of vacuole membrane that is positive for the Vacs. Is it possible to look at the GFP-Vac signal and filipin staining at the same time? And other vacuole markers too?

      We agree with the referee that microdomains are the central characters of our study. Now, we would like to argue with the referee that one has to distinguish between structural, morphological evidence for the existence of microdomains and the biochemical and genetic evidence of their functional implication.

      On the one hand, microdomains are in fact nanometer-scale and are thus under the resolution limit of most optical microscopies. We and others already documented that during phagosome maturation, vacuolin distribution is patchy, reflecting the clustering of nanometer-scale inhomogeneities, and that the coating becomes more continuous with progressing maturation. The transition we observed here for vacuolins, as microdomain organisers, from a patchy to continuous coating reflects indirectly their macroscopic coalescence. As discussed above in response to the first referee, visualisation of the underlying lipidic clusters and microdomains is for technical reasons almost undoable. One cannot fix sterols. As replied to the first referee, we have not been able to improve much on the spatial resolution of lipidic microdomains, and, despite numerous and rigorous tentatives, we have not been able to reliably obtain quantitative measurements of neither filipin nor D4H signals, nor to document changes in "patterns" of signals during MCV maturation. We nevertheless present additional qualitative D4H and VacC colocalization images Fig. EV1C.

      On the other hand, we respectfully disagree that our manuscript lacks in strong and direct evidence for the functionality of sterol-rich microdomains as susceptibility factors required for a full mycobacteria infection in evolutionary distant phagocytes.

      In addition to the evidence presented previously, we have now added a large set of RNAseq analyses of the impact of vac-KO and sterol depletion on infected and non-infected cells, which also highlight the interdependence between sterol concentration and vacuolin expression (Fig. 3G, 4G and H, Fig. EV5 and 6). Moreover, we have now used a Flipper probe sensitive to lipid composition and packing to distinguish the mature MCV from all other endo-lysosomal compartments in microglial cells (Fig. 7C). Altogether, the simplest and most plausible interpretation of our cumulated evidence is that sterol-rich microdomains are necessary for EsxA-mediated MCV damage and escape to the cytosol.

      I really like the data presented in Figure 1 that demonstrates the specific upregulation of Vacuolin C during M. marinum infection. This is an intriguing result that brings up a lot of new questions e.g. how is this regulated? In response to membrane damage? Sensed by what? Does this upregulation also hold true for flotillin in the mammalian model? (and more!) however none of these ideas are pursued in the manuscript and by the end I was wondering why this data was included in the manuscript because all of the phenotypic data uses either a VacBC or ABC mutant. The link between figure 1 and the rest of the manuscript would be aided by characterisation of a specific VacC mutant.

      We share the referee's fascination with these data showing that VacC is a specific reporter of virulent mycobacteria infection. First, VacC expression at the transcriptional level, but also at the protein accumulation level both point toward a correlation with an infection with damage-causing mycobacteria. Specifically, one can distinguish two stages, one transient upregulation of all three isoforms that becomes sustained only for VacC and only when wt Mm causes damage (as opposed to the DRD1 mutant or M. smegmatis). This is clearly presented in multiple places in the manuscript (for example lines 377-380).

      Now, how is MCV damage sensed is extremely interesting and is the focus of numerous past and on-going studies in our laboratory but is out of the scope of this article. Just to mention a few lines of research as food for thoughts, membrane damage (by EsxA and by LLOMe) triggers the recruitment of the E3 ubiquitin ligase TrafE (PMID: 37070811), and subsequently of the ESCRT and autophagy machineries (PMID: 37070811, PMID: 30596802). Upstream of TrafE, we know that decrease of membrane tension is one parameter, because transient hyperosmolar shock also recruits TrafE to endo-lysosomal compartments (PMID: 37070811). On-going experiments demonstrate that calcium leakage from endo-lysosomes and MCV is another major triggering factor.

      As mentioned above, and in more direct response to the referee's questioning, we have now included RNAseq experiments that unequivocally indicate the link between vac-KO and sterol depletion and the direct effect on reducing membrane damage, because the two conditions lead to a down-regulation of the damage-dependent transcriptomic signatures of the ESCRT and autophagy related genes (Fig. 4G-H and Fig. EV5). Moreover, it clearly establishes that sterol depletion, which decreases sterile and EsxA-mediated damage, decreases vacuolin expression in infected cells (Fig 3G). Finaly, qRT-PCR on infected BV-2 microglial cells indeed documents an up-regulation of flotillin-1, reminiscent of vacC regulation in Dd (Fig. EV9B).

      All in all, we would like to respectfully ask the editor and referee to acknowledge that the signalling pathway between damage sensing and the vacuolin responses will be the focus of future studies.

      We understand that investigating the phenotypic consequences of only a single vacC-KO might be interesting, but we would like to argue that it is superfluous. First, for intricate biological reasons, KO of single and combinations of vacuolin genes result in very qualitatively and quantitatively similar phenotypes associated to motility, phagocytosis, endosome maturation etc... (PMID: 32482795). The present study extends this remarkable phenomenon by interrogating multiple parameters, reporters and phenotypes linked to infection, some shown and some unpublished (for example Fig. EV3B and Fig. 4D-E).

      Are the MMVs positive for all three vacuolins? It would be great if you could quantify which are present together or whether there are more distinct populations that are positive for just one or all three for example.

      The referee points to an interesting mechanistic aspect. We have therefore directly assessed the colocalization of pairs of vacuolin isoforms (Fig. EV1B), which clearly indicate that every MCV is coated with two vacuolins, which therefore arithmetically implies that all three isoforms are present together and that there is no isoform-specific MCV (Fig 2B). This is potentially also corroborated by earlier studies that showed vacuolin hetero-oligomerization (PMID: 16750281), a characteristic shared by flotillins (PMID: 38985763).

      Minor Comments:

      Fig 1F - this graph is quite striking but I think the individual data points should be presented as it is unclear whether this intensity threshold is an arbitrary value or genuinely represents two different populations. Perhaps better represented as a scatter plot?

      We fuly agree with the referee and have accordingly replotted all the graphs where this improved the visualisation and contributed to the interpretation of the data. We did not change the representation in Fig. 7E and G, Fig. EV3C, because the error bar already represents the deviation of the Area Under the Curve (AUC) that was calculated for the average curves resulting from a biological triplicate of experiments.

      The bar graphs early in the manuscript should shoe the individual data points from replicates. While the presentation is clear and differences are striking I think this article explains why showing the replicate data is important: https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002128

      We fully agree with the referee and have accordingly replotted all the graphs where this improved the visualisation and contributed to the interpretation of the data.

      In Figure 2: F and G should include quantification, in G the arrow on the24 hpi filipin panel is not in the right location

      As mentioned in response to referee #1 and #2, qualitatively, sterols, as visualised by filipin and D4H, are present at all stages of the endo-lysosomal pathway and of MCV biogenesis. Now, there are many technical difficulties linked to a quantitative assessment, and therefore, please, let me present the framework. First, despite their wide use, the exact mechanism of binding of both reporters and which pool of sterol they visualise is still a mystery. This is often expressed as "they detect the accessible pool" of sterol, whatever it is. In addition, filipin detects sterols in both leaflets (and in intra-lumenal vesicles and other lipidic structures), while D4H detects sterols only in the cytosolic leaflet, and it is not known whether both leaflets have the same concentration of sterols. It is also known that filipin signal is only indirectly proportional to the sterol quantity in a cell, as measured by other quantitative methods. One of the best examples comes from studying the cellular phenotype of Niemann-Pick Type C disease, because many publications report a strong increase of filliping staining, whereas lipidomic analyses show at best a two-fold increase in cholesterol in NPC deficient cells. Moreover, technically speaking, D4H is a live probe, and fixation leads to some loss of localisation, probably because sterols are not fixable. On the other hand, filipin is mainly used after chemical fixation, but again sterols are not fixable, and the signal is very likely restricted to the membrane of origin, but not necessarily to the microdomains.

      We corrected the arrow localisation.

      Reviewer #3 (Significance (Required)):

      The key strength of this manuscript is the use of the Dictyostelium model to dissect host-pathogen interactions. This provides an interesting evolutionary lens to the research findings presented here and is further strengthened by the data demonstrating that these findings are relevant in a mammalian model as well. The weaknesses are articulated in my "major comments" section. The phenotypic data presented here is strong - it is clear that these vacuolin proteins are important for the intracellular success of M. marinum however the data demonstrating the mechanism for this is less clear.

      We thank the referee for this overall positive summary of our main findings and of the clarity of results, interpretations and working model. As detailed above, we respectfully disagree with the final conclusion and are pleased to note that the other two referees are more satisfied with the level of mechanistic evidence.

      I am an academic researcher who is interested in the molecular host-pathogen interactions mediated by intracellular microbial pathogens. Scientists in my research field will be a key audience for this research. Predominantly this is basic researchers but the interest will be broader than host-pathogen interactions as researchers in the membrane integrity and membrane dynamics field will be interested here.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Bosmani, Perret et al examines the role of Dictyodistelium discoideum vacuolin proteins in the integrity of the Mycobacterium marinum vacuole membrane. The data demonstrates that loss of vacuolins, similar to sterol depletion, reduced vacuole membrane damage meaning less cytosolic escape of the pathogen and subsequently reduced bacterial replication. The authors demonstrate functional analogy in a mammalian model of infection - where flotillin plays a similar role to the vacuolins - and this is an important demonstration of the utility of the D. discoideum model. The data is well presented and clear.

      Major Comments:

      1. There is no evidence presented in the manuscript of "microdomains" - while I believe this is likely a true description of what is happening on the vacuole membrane there is no visualisation of this. Both the GFP-Vac vacuole staining and the filipin staining show complete coverage of the vacuole. Perhaps at the 1 hour time points this is more convincing but I think it is worth looking at more of these earlier time points and quantifying these "microdomains" - i.e. proportion of vacuole membrane that is positive for the Vacs. Is it possible to look at the GFP-Vac signal and filipin staining at the same time? And other vacuole markers too?
      2. I really like the data presented in Figure 1 that demonstrates the specific upregulation of Vacuolin C during M. marinum infection. This is an intriguing result that brings up a lot of new questions e.g. how is this regulated? In response to membrane damage? Sensed by what? Does this upregulation also hold true for flotillin in the mammalian model? (and more!) however none of these ideas are pursued in the manuscript and by the end I was wondering why this data was included in the manuscript because all of the phenotypic data uses either a VacBC or ABC mutant. The link between figure 1 and the rest of the manuscript would be aided by characterisation of a specific VacC mutant.
      3. Are the MMVs positive for all three vacuolins? It would be great if you could quantify which are present together or whether there are more distinct populations that are positive for just one or all three for example.

      Minor Comments:

      1. Fig 1F - this graph is quite striking but I think the individual data points should be presented as it is unclear whether this intensity threshold is an arbitrary value or genuinely represents two different populations. Perhaps better represented as a scatter plot?
      2. The bar graphs early in the manuscript should shoe the individual data points from replicates. While the presentation is clear and differences are striking I think this article explains why showing the replicate data is important: https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.1002128
      3. In Figure 2: F and G should include quantification, in G the arrow on the24 hpi filipin panel is not in the right location

      Significance

      The key strength of this manuscript is the use of the Dictyostelium model to dissect host-pathogen interactions. This provides an interesting evolutionary lens to the research findings presented here and is further strengthened by the data demonstrating that these findings are relevant in a mammalian model as well. The weaknesses are articulated in my "major comments" section. The phenotypic data presented here is strong - it is clear that these vacuolin proteins are important for the intracellular success of M. marinum however the data demonstrating the mechanism for this is less clear.

      I am an academic researcher who is interested in the molecular host-pathogen interactions mediated by intracellular microbial pathogens. Scientists in my research field will be a key audience for this research. Predominantly this is basic researchers but the interest will be broader than host-pathogen interactions as researchers in the membrane integrity and membrane dynamics field will be interested here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      the authors of this manuscript reported that EsxA, a secreted virulent factor of Mtb or Mm, causes membrane lysis in sterol-rich micro domain. They used the Mm-infected amoeba as an infection model, and characterized the effects of microdomain in Mycobacterium-containing Vacuole (MCV) on EsxA-mediated membrane disruption. They found that disruption of the micro domain through knockout of vacuolins or sterol depletion diminished Mm-induced membrane damage and cytosolic escape. They also found that vacuolins and sterol are essential for EsxA inserting into the membranes in vitro, and flotillin knockdown and sterol depletion conferred the resistance of murine microglial cells to Mm infection. The experiments were well designed and controlled, and the data were convincing.

      My major comment is that the authors need to justify the use of BV-2 cells that are murine microglial cells, instead of macrophage cell lines, which are more relevant to Mtb/Mm infection.

      Significance

      It is well known that EsxA is membrane-lytic protein playing a role in Mtb/Mm-mediated phagosomal escape. There are other studies that have indicated lipid raft or micro domains in the membrane may play a role in EsxA-mediated membrane damage. This study further confirmed that the sterol-rich micro domain on the membrane has significant influence on the EsxA-mediated membrane disruption both in vitro and in vivo. While this finding is expected, but confirmation with solid experimental evidence is welcomed. This study also identified the genes or proteins required for micro domain organization, vacuolins and flotillin, which could be a target of host-directed therapy. Overall, this study is performed well and the results are convincing.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The proposed study aims to elucidate the role of membrane microdomains and associated proteins-Vacuolin A, B, and C-during the infection of Dictyostelium discoideum (Dd) amoebae by Mycobacterium marinum (Mm). The results demonstrate that Vacuolins are required for Mm virulence, and that the presence of membrane microdomains is essential for phagosome membrane damage and bacillary escape into the cytosol-key steps in establishing a successful infection and subsequent bacterial proliferation. The study is well-designed, employing methodologies with which the authors have demonstrated expertise. Overall, it is methodologically sound, and most conclusions are well-supported by the presented data. However, some points require clarification. Major Points: The study aims to link the function of Dd Vacuolins to their potnetial facilitating role in phagosome escape and overall infection by Mm. To phenocopy the effect of Vac-KO, the authors used MβCD. Strikingly, this compound had a more significant impact on phagosome escape compared to Vac-KO, which either did not affect or only mildly affected this process. This likely reflects a difference in the underlying mechanisms being studied. Vac-KO cells may lack well-organized membrane domains but could retain a similar overall membrane composition. In contrast, MβCD disrupts these domains by chelating cholesterol, thus altering both the membrane composition and the domains themselves. This may explain why EsxA partitioning is more affected by MβCD than by triple KO. Consequently, this suggests that cholesterol, rather than the mere presence of membrane domains, plays a critical role in EsxA partitioning and activity in the phagosome. And even if LLOMe activity was lower in Vac-KO cells, this might be explained by the compartment targeted, the lysosomes which membrane composition may differ from the MCV. These points should be further discussed in the discussion section.

      Based on this observation, in figure 2, does the D4H/filipin signal or association increase over time as the Vac signal "solidifies"? In Vac-KO cells, does the mScarlet-D4H signal change in intensity or pattern (building on fig. S1)? These insights could provide valuable information on cholesterol levels at the MCV in KO versus wild-type cells. If possible, the authors should quantify fluorescence or the frequency of signal association. Additionally, since Vacuolins do not have a significant impact on phagosome damage or escape, the difference in intracellular growth may be indirect, as suggested in the team's previous study on Vacuolins (DOI: 10.1242/jcs.242974). The authors measured MCV pH in figure S6-could they repeat this experiment to test whether Vacuolins affect MCV maturation? This was investigated in a previous version of the pre-print (DOI: 10.1101/2021.11.16.468763), and if the results still hold, it would strengthen the hypothesis that Vacuolins promote escape by modulating membrane organization, rather than influencing phagosome maturation. Finally, as a substantial part of this manuscript relies on microscopy and image analysis, the methods section should detail how these analyses were performed. Specifically, for figure 1f, it is unclear how the cells were segmented and fluorescence quantified-was total fluorescence per cell measured, or was an average value used? Figures 5c and 5h could be moved to the supplementary material, and the segmentation method should be explained in the methods section. Additionally, statistical analysis should be more clearly described, justifying the use of one-way or two-way ANOVA, and specifying the post-hoc tests used for group comparisons.

      Significance

      This study provides the first direct evidence of the importance of membrane composition and organization in the virulence of Mycobacterium marinum, particularly in facilitating phagosome damage and bacillary escape. Using the well-established model of Dictyostelium discoideum infected with M. marinum, which has frequently been predictive of Mycobacterium tuberculosis behavior within phagosomes, the authors contribute critical insights into the mechanisms of mycobacterial phagosome escape-a key step in cellular invasion and dissemination. These findings have the potential to inform strategies aimed at blocking this escape mechanism, which, as demonstrated in this study, could prevent intracellular bacterial growth.

      This work is significant for advancing our understanding of mycobacterial pathogenesis, particularly by linking membrane microdomain composition to bacterial virulence. It will be highly relevant to researchers studying mycobacteria, intracellular pathogens, and host-pathogen interactions. While the study's use of M. marinum provides valuable insights, a limitation is that these results may not fully translate to M. tuberculosis, and further testing with the latter pathogen will be essential.

      I am an expert in the infection of macrophages by Mycobacterium tuberculosis, the phagosome escape mechanism, and its associated effectors. I also have expertise in microscopy and image analysis. However, I do not specialize in Dictyostelium discoideum biology.

  2. Mar 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility, and clarity)

      The manuscript by Song et al presents evidence to show that the predicted cysteine protease type 6 secretion system (T6SS) effector Cpe1 inhibits target cell growth by cleaving type II DNA Topoisomerases GyrB and ParE. The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

      We thank the reviewer for their positive remarks and valuable suggestions to improve this manuscript.


      Major comments

      To better establish that GyrB and ParE are the sole targets of Cpe1, the authors should express the GG mutant in target cells and determine whether these cells become resistant to Cpe1-mediated killing (inhibition). They can also determine whether co-expression of the cleavage resistant mutants suppresses the toxicity of Cpe1.

      We appreciate the reviewer’s suggestion to investigate additional substrates of Cpe1 beyond GyrB and ParE, which may not have been fully captured in our crosslinking-mass spectrometry experiments due to technical limitations or low protein abundance. To address this topic, we generated target cells heterologously expressing cleavage-resistant GyrB and ParE variants (GyrBΔG102 and ParEΔG98) that are not susceptible to Cpe1, as described in our original manuscript (Figures 3h, i). We performed both Cpe1 expression assay and competition assay to assess if expression of the cleavage-resistant variants suppresses Cpe1 toxicity (Author Response Figures 1a, b). However, we did not observe a substantial protective effect. While this outcome could suggest that GyrB and ParE are not the sole targets of Cpe1, alternative explanations are also plausible. In the Cpe1 expression assay, high levels of Cpe1 could still act on endogenous wild-type GyrB and ParE, and although we attempted to increase variant expression, precise quantification remains challenging. In the competition assay, highly active Cpe1 may have continued to target wild-type substrates throughout the experiment, potentially masking any protective effect. Additionally, reduced activity of the mutant proteins could contribute to the observed results. Finally, deletion of the global repressor H-NS in the Cpe1-producing E. coli strain may have induced other interbacterial competition mechanisms1, leading to growth inhibition independently of Cpe1. Addressing these questions comprehensively would require a more systematic investigation under a wider range of conditions. We consider this an important avenue for future studies.

      Results in Figure 7 clearly show that Cpi1 is capable of displacing ParE from Cpe1 due to higher affinity. Yet, the "competitive inhibition model" described in the last result section does not completely match what is really happening in Cpe1-mediated interbacterial competition. If Cpi1 is in the target cell, it would more likely engage the incoming Cpe1 before it can interact with ParE or GyrB, so competition does not occur in this scenario. Similarly, in the predatory cells expressing Cpe1 and Cpi1, these two proteins will form a stably protein complex, and no competition with the target will occur. The authors should reconsider their model.

      We thank the reviewer for their comments and appreciate the opportunity to clarify this point. First, we believe the reviewer is referring to Figure 5 rather than Figure 7. In our model, the primary role of immunity proteins in interbacterial competition is to neutralize cognate toxins and prevent self- or kin-intoxication. These immunity proteins exhibit high specificity and strong binding affinity toward their associated toxins, ensuring effective protection2. In predatory cells, immunity proteins are typically co-expressed with their corresponding toxins, likely enabling immediate suppression upon translation. During kin competition, immunity proteins can protect cells even after foreign toxins engage their substrates.

      Our results demonstrate that Cpi1 binds Cpe1 with higher affinity than its substrates and can displace them from pre-formed Cpe1-substrate complexes (Figures 5b-f). This aligns with the established function of immunity proteins in interbacterial competition and provides a mechanistic basis for how they confer protection, even when toxins have initially engaged their targets2. We acknowledge the reviewer’s point that in both scenarios—whether in the recipient cell or the toxin-producing cell—Cpe1 may first encounter Cpi1. However, our model underscores that Cpi1 not only binds at the substrate site but also exhibits superior affinity for Cpe1, ensuring robust protection against Cpe1-mediated toxicity.

      Minor comments

      "Intoxication" was used throughout the text numerous times to describe the activity of Cpe1. Looking in the Marriam-Webster dictionary, "Intoxication" means "a condition of being drunk". This word should be replaced with "toxicity" or some other terms in this line.

      We thank the reviewer for this comment. We acknowledge that the term "intoxication" is commonly associated with alcohol consumption, yet the Merriam-Webster dictionary also defines it as "an abnormal state that is essentially a poisoning" (https://www.merriam-webster.com/dictionary/intoxication). This definition aligns with its well-established usage in the field of interbacterial competition to describe the effects of interbacterial toxins during antagonism3-5, which we have adopted in our manuscript. However, we appreciate the reviewer’s concern and remain open to revising the terminology if deemed necessary for clarity.

      Lines 46-48, references on contact-dependent killings by these systems mentioned should cited. Ref. 9 cited does NOT cover the information at all.

      We thank the reviewer for this comment. We have revised the citation and now reference studies that specifically describe contact-dependent killing systems in the relevant sentences (Lines 45–____50)

      "characterizations" should be "characterization".

      We have now modified the sentence as requested (Line 69)

      Line 229 "Cpe1-Bpa monomers" should be " apo Cpe1-Bpa". The results cannot distinguish whether these bands are monomers or multimers.

      We appreciate the reviewer’s careful assessment of our manuscript. The results in Line 233 (Figure 3c) show the enrichment of His-tagged proteins, including crosslinked complexes and overproduced Cpe1-Bpa. Based on the molecular weight marker, the Cpe1-Bpa bands appear between 10–15 kDa, consistent with the molecular weight of Cpe1 monomers (Figure 3a). Therefore, we have labeled this band as “Cpe1-Bpa monomers” and maintained this terminology throughout the text. This designation aligns with previous studies utilizing site-specific crosslinking via Bpa incorporation6,7

      Line 283, was the mutation deletion? Substitution was used I think.

      We thank the reviewer for highlighting this point. The GyrB and ParE mutants used to confirm the cleavage sites were deletion mutants, with a single glycine removed from the predicted double-glycine motifs. We have now revised the text for clarity (Lines 285–290)

      Lines 439-444 the discussion should be extended to include other bacterial toxins that target type II DNA topoisomerases (e.g. PMID: 26299961 and PMID: 26814232).

      We appreciate the reviewer’s suggestion. The studies referenced (PMID: 26299961 and PMID: 26814232) describe FicT toxin with adenylyl transferase activity that target and post-translationally modify GyrB and ParE at their ATPase domains, highlighting a potential hotspot for topoisomerase inhibition. We have now incorporated an additional paragraph in the Discussion section to describe these findings (Lines 424–439).

      Reviewer #1 (Significance)

      The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

      We sincerely thank the reviewer for their positive comments and for the suggestions to improve our manuscript.

      Reviewer #2 (Evidence, reproducibility, and clarity)

      The manuscript, titled "An Interbacterial Cysteine Protease Toxin Inhibits Cell Growth by Targeting Type II DNA Topoisomerases GyrB and ParE", describes how an effector family was identified and characterized as a papain-like cysteine protease (PLCP) that negatively impacts bacterial growth in the absence of its co-encoded immunity protein. This thorough report includes (1) bioinformatic analysis of prevalence, finding this PLCP effector encoded in many gram-negative bacteria, (2) confirming conservation of catalytic active site via structural (crystallographic) analysis, as well as visualizing contacts with the immunity protein, (3) validation of results using growth studies combined with mutagenesis, (4) using a cell-based cross-linking method to pull out potential targets, which were subsequently identified via mass spectrometry, (5) validation of these results using in vitro protease assays with purified (potential) substrates, including verification of the motif recognized on the substrate(s), and cell-based phenotype analyses, and finally, (6) demonstrating competition between immunity protein and ParE substrate using an in vitro pull-down approach. Overall, this is a strong body of work with compelling conclusions that are well supported by multiple experimental approaches.

      We appreciate the reviewer for their positive comments regarding our original submission.

      Major comments

      The claims made based on the presented results are well supported, including that this PLCP effector toxin is widespread, is neutralized in a competitive mechanism by its immunity partner, and that it effectively cleaves both GyrB and ParE (subunits of bacterial type II topoisomerases) at a conserved motif, resulting in suppression of bacterial cell growth via mis-regulating chromosome segregation. No additional experiments are needed to further validate these results, and the authors are commended on the cell-based and in vitro studies to deduce very specific mechanisms and structural details.

      We appreciate the reviewer’s positive feedback.

      Minor comments

      While the writing and data presentation are extremely clear, in general I recommend the authors indicate the level(s) of replication for experiments. Figure legends generally note that mean values with standard deviations are shown, but I did not find where the number of replicates (and independent versus technical) were listed.

      We appreciate the reviewer’s suggestion. We have now revised the manuscript to specify the levels of replication (independent vs. technical) for each experiment in the figure legends, particularly in Figures 2 and 3.

      The figures are very clear, but in many instances the addition of PLCP toxin is indicated as "before" and "after"; while a modest change, I recommend altering this to some type of "-" and "+" type nomenclature rather than a time-based notation (especially as presumably both samples were treated identically, just with or without protease).

      We thank the reviewer for this helpful comment. In Figures 3 and Supplementary Figures 5, 9, we used "before" and "after" to indicate the time points for in vitro cleavage assays verifying Cpe1 cleavage. To minimize variations between reactions, the catalytic mutant Cpe1tox (Cpe1toxC362A) was used as a comparison rather than a reaction without Cpe1tox. In these assays, duplicate reaction mixtures were prepared: one was denatured immediately after preparation ("before" reaction) to serve as a baseline, while the other was incubated to allow enzymatic activity ("after" reaction). This labeling clarifies the comparison between initial and processed samples. We believe this approach clearly distinguishes the effects of Cpe1 activity and provides a reliable basis for assessing proteolysis in our assays.

      I also suggest quantifying the intensities of the gel images presented in Figure 5c, d (for example, Cpe1 intensity as a ratio to that of the ParE ATPase domain), to make the interpretation even more evident.

      We thank the reviewer for the valuable suggestion to quantify the signal intensities of the gel images presented in Figures 5c, d. We have now included the quantification results in Supplementary Figures 9e, f and have updated the respective text in the manuscript (Lines 826-828 and 1066-1087).

      Crystallographic structure: the PDB report notes some higher-than-expected RZR (RSRZ) scores; I interpret this to mean that there was strain around the catalytic site of one of the two toxins in the asymmetric unit, or that this copy was less well ordered. The RZR outliers likely arise from non-optimal weighting for geometric restraints. While no figures of electron density are presented, these modest outliers are not expected to alter the conclusions reached in the current work. One point of interest that is not addressed, however, is if any variance between the two complexes in the asymmetric unit are noted? A passage compares the current toxins to others in the larger subfamily and notes a rotation of a side chain is needed to superpose (Line 159). Can the authors please clarify around which bond this rotation is needed, and if both copies in the asymmetric unit are in the same orientation at this site?

      We appreciate the reviewer’s insightful comments.

      1. We have provided the electron density map for the RSR-Z outlier residues along with the model (Author response Figure 2a). These outlier residues are located at the loop regions of a molecule within the asymmetric unit in the crystal (Chain B). As a result, the electron density for their side chains appears to be noisier compared to residues in the well-folded regions, leading to higher RSR-Z scores. Notably, when we superimposed the models of two complexes within the asymmetric unit, the calculated RMSD value was 0.402 Å (Author response Figure 2b), indicating that the two models are structurally very similar and that these residues are properly assigned. Therefore, the RSR-Z outliers do not significantly impact the overall structure.
      2. Here, we provide a zoomed-in view of Figure 2d, highlighting the superimposed crystal structures of Cpe1 and the closely related PLCPs, ComA and LahT (Author response Figure 2c). As shown, the side chain of the catalytic cysteine residue in ComA adopts a different orientation, positioning it slightly farther from the homologous residues in Cpe1 and LahT. However, since the backbone and catalytic pockets remain structurally intact, we believe that this deviation arises due to results from crystal packing effects rather than an inherent functional distinction. We have now modified the main text (Lines 159-166) to clarify this and prevent any potential misinterpretation.

      Reviewer #2 (Significance)

      Bacteria encode numerous effectors to successfully compete in natural environments or to mediate virulence; these effectors are typically associated with type VI secretion system machinery or referred to as contact dependent inhibition systems. The current work has identified a sub-family of papain-like cysteine protease effectors that are unique by targeting type II topoisomerases. Among the actionable findings is the identification of both the specific site of interaction with the topo substrates, as well as the specific motif recognized for cleavage. This should enable the field to move forward probing for this activity with other toxins and substrates. The insights provided by the competitive neutralization mechanism also stand out as an important contribution that can be more broadly applied. Within the literature, few effector targets are identified, making the current study stand out as impactful by the well-executed experiments that directly support the conclusions.

      While the current study has strong elements of novelty and is complete, it also nicely sets up future studies for remaining open questions. For example, does the nucleotide-bound status of the ATPase domain, or other catalytic intermediate, impact the susceptibility of topoisomerases to cleavage? Is this identified motif found in other ATPase domains? Is the negative supercoiling activity unique to gyrase also impacted, or is the phenotypic mechanism of cell toxicity reliant only on chromosome segregation? What types of kinetic parameters do this class of toxins demonstrate, and does sequence variability alter this? These ideas are a testament to the intriguing study as presented, capturing the readers' curiosity for additional details that are clearly beyond the scope of the current work.

      I anticipate this work will be of interest to the broad field of microbiologists that study interbacterial communication as well as pathogenic mechanisms. While the research is largely fundamental in nature, it is wide in scope with applications to many gram-negative bacteria that inhabit a myriad of niches. The work will also be of interest to specialists in topoisomerases, as the list of toxins that target these essential enzymes is growing and the therapeutic utility of topoisomerase inhibition remains vital. My interest lies in the latter, in toxin-mediated inhibition of topoisomerase enzymes as a means to alter bacterial cell growth. While I have strong expertise in structural biology, I am lacking in expertise for mass spectrometry. I note this because this method was used for the identification of the target substrate.

      We appreciate the reviewer’s insightful discussion and interest in our study. We agree that further investigations are crucial to address the open questions posed, and we have initiated work on some of these avenues.

      For example, considering Cpe1's specificity for the ATPase domain of GyrB and ParE, we have begun examining whether Cpe1 targets other ATPase domains by searching for the consensus sequence or double glycine motifs in the sequences of ATPase domains beyond GyrB and ParE. Among the 42 E. coli ATPase domains identified by the PEC database8, we found several with double glycine residues. However, none contained the exact LHAGGKF consensus sequence identified in GyrB and ParE, which are targeted by Cpe1 (Author Response Figure 3). These findings suggest that Cpe1 is less likely to target other ATPase domains. Nonetheless, due to Cpe1’s potential tolerance of certain variations within the consensus sequence, we cannot draw a definitive conclusion without further investigation into the cleavage sites.

      Another critical open question is the impact of Cpe1-mediated cleavage on the function of GyrB and ParE. To address this topic, we have begun investigating if Cpe1 cleavage affects the ATPase activity of these proteins. As expected, our biochemical analysis has demonstrated a significant decrease in ATP hydrolysis in the presence of active Cpe1tox, but not in the presence of the catalytic mutant Cpe1toxC362A (Author response Figures 4a, b). These results confirm that the ATP-dependent activities of both GyrB and ParE are disrupted following Cpe1 cleavage9. Previous work on FicT toxin that inhibits GyrB and ParE ATPase activity through post-translational modification found that ATP-dependent activities such as DNA supercoiling, relaxation, and decatenation were inhibited10,11. Interestingly, GyrB’s relaxation of negative supercoiled DNA, which does not require ATP, was also affected to some extent. This outcome raises the question as to whether Cpe1-cleaved GyrB results in similar downstream defects. Investigating this possibility would provide valuable insights into Cpe1’s mode of action, although we feel doing so is beyond the scope of the current study. Consequently, we view this as an important area for future research.

      Finally, regarding the potential applications of Cpe1, we are interested in further investigating its enzymatic specificity and properties. In this study, we analyzed the binding kinetics between Cpe1 and its substrate (Figure 5f) and currently we are endeavoring to characterize the kinetics of Cpe1-mediated proteolysis. To better probe hydrolytic dynamics, we plan to utilize a substrate with a reporting group (such as a chromogenic or fluorogenic leaving group) to monitor cleavage over time. We could achieve this by designing a recombinant substrate based on our knowledge of Cpe1’s native substrates (GyrB and ParE) and the target sequence (“LHAGGKF”). Alternatively, a secondary reaction leading to colorimetric changes could be employed for detection. We consider this an exciting research direction and an important next step for this study.

      Overall, we are grateful for the reviewer’s recognition of the novelty and importance of our work in advancing the understanding of interbacterial toxins and their inhibitory effects on topoisomerases. We plan to further investigate the consequences of Cpe1 cleavage on GyrB and ParE and to explore Cpe1 kinetics and its mechanistic actions in more detail. This will not only deepen our understanding of bacterial toxin-mediated inhibition but may also provide critical insights into strategies for targeting type II DNA topoisomerases. The reviewer’s insightful feedback has proven invaluable in shaping our ongoing and future research directions.

      Reviewer #3 (Evidence, reproducibility, and clarity)

      Bacterial warfare in microbial communities has become illuminated by recent discoveries on molecular weapons that allow contact-dependent injection of bacterial toxins between competitors. Among the best characterized systems are the type VI secretion system (T6SS) or the contact-dependent inhibition (CDI) system (i.e. some of the T5SSs). These systems are delivering a plethora of toxins with various biochemical activities and a broad range of targets. In recent years many such toxins have been characterized and their relevance in pointing at appropriate drug targets is increasing.

      In this study the authors built on a previously published association of a family of proteins, papain-like cysteine proteases (PLCPs), with their delivery by T6SS or CDI into target bacterial cells. Whereas this observation is not particularly novel, the findings that this set of proteins, that the authors called now Cpe1, can specifically target bacterial proteins such as ParE and GyrB, so that it affects chromosome partitioning and cell division, is groundbreaking. The authors are clearly demonstrating that Cpe1 cleaves their target proteins at double glycine recognition site which is in line with previous characterization of such proteases when fused to a particular category of ABC transporters. Even more remarkably they can show using biochemical approaches that Cpi1 is a cognate immunity for CpeI, preventing its activity, not by interfering with the catalytic site, but instead with the substrate binding site. The mechanism of competitive inhibition between immunity and substrate is also substantiated by biochemical data.

      We sincerely appreciate the reviewer’s interest in and support of our study.

      Major comments

      • This is a very well conducted study which combines bacterial genetics and phenotypes with excellent biochemical evidence.

      We thank the reviewer for their positive comments.

      • There are 8 targets identified for Cpe1 and yet only two are cleaved by the enzyme. It is intriguing that FtsZ is one identified target by the pull down but not confirmed for cleavage. The authors rules this as false positive but the cell division defect associated with Cpe1 activity would be consistent here. Are there any double glycine in FtsZ that could be identified as cleavage site? Is it possible that slightly different incubation conditions may promote degradation of FtsZ?

      We appreciate the reviewer’s thoughtful comment regarding FtsZ as a potential substrate of Cpe1. This was indeed an intriguing possibility, especially given the cell division defects observed following Cpe1 intoxication. Early on in the project, we also identified FtsZ as a Cpe1 interactor in our proteomic crosslinking assays, which further fueled the hypothesis that FtsZ might be a target.

      To explore this possibility, first we examined the FtsZ protein sequence for potential Cpe1 cleavage sites and identified several double glycine motifs (Author response Figure 5a). However, none of these motifs matched the consensus sequence identified in GyrB and ParE, which is LHAGGKF, a sequence that we have shown to be critical for Cpe1 cleavage activity. In an effort to better understand if FtsZ could still be cleaved by Cpe1, we conducted additional cleavage assays under various conditions (Author response Figure 5b). We tested different incubation temperatures, including increasing the temperature to 37 °C, and extended the reaction time to overnight. However, we did not observe any cleavage of FtsZ under these conditions. Given that FtsZ undergoes significant conformational changes upon binding to GTP12, we also considered the possibility that the GTP-bound form of FtsZ might be cleaved by Cpe1. However, even under those conditions, no significant cleavage of FtsZ was detected (Author response Figure 5b). Based on these results, we do not have any evidence to support that FtsZ is a target of Cpe1. The observed cell division defects are more likely a secondary effect resulting from the cleavage of GyrB and ParE, direct targets of Cpe1 that are crucial for chromosome segregation.

      • Could it be structurally predicted whether the GG of ParE or GyrB is fitted into the catalytic site of Cpe1.

      We appreciate the reviewer’s insightful question regarding the structural prediction of the GG motif of ParE and GyrB fitting into the catalytic site of Cpe1. To address this possibility, we used Alphafold 3 to predict the interaction structure between Cpe1 and its substrates13. The resulting model of Cpe1 interacting with the ATPase domain of GyrB (GyrBATPase) is shown in Supplementary Figure 9c. As illustrated, the loop of the GyrB ATPase domain containing the consensus targeting sequence (“LHAGGKF”) fits into the catalytic site of Cpe1, with the GG motif positioned closest to the catalytic cysteine residue, which likely facilitates hydrolysis. We also attempted to model the interaction between Cpe1 and the ATPase domain of ParE. However, confidence for this model was lower (ipTM = 0.74, pTM = 0.71), possibly due to Alphafold’s preference for certain protein configurations. To gain a more accurate understanding of how Cpe1 binds and recognizes its substrates, we are currently working on co-crystallizing Cpe1tox with GyrB and ParE. This long-term project aims to provide precise structural insights into the Cpe1-substrate interaction and further elucidate the mechanism of cleavage.

      Minor comments

      • The authors described a family of proteases, PLPCs, and characterized one here called Cpe1. Not clear whether this is a generic name or one specific protein from one particular bacterial species. Indeed, it is unclear from which bacterial strain the Cpe1 protein studied here originates.

      We thank the reviewer for this comment and apologize for the lack of clarity. To provide better context, we have now revised the manuscript (Lines 136-137 and 141-145) to clearly state that the Cpe1 protein characterized in this study originates from E. coli strain ATCC 11775.

      • It may be worth to emphasize that the Cpe1 domain is found in all possible configurations as T6SS cargo and that is to be linked to VgrG, PAAR or Rhs.

      Thank you for this suggestion. We have revised the manuscript accordingly to emphasize this point (Lines 106-109).

      • Line 49 the authors could indicate that the Esx system is also known as type VII secretion system (T7SS).

      Thank you for this suggestion. We have revised the manuscript accordingly (Line 48-50).

      • Line 113 it may be better to use Proteobacteria instead of Pseudomonadota

      We have revised the manuscript (Lines 114-115) as suggested by the reviewer. It is important to note that following the recent decision by the International Committee on Systematics of Prokaryotes (ICSP) to amend the International Code of Nomenclature of Prokaryotes (ICNP) and formally recognize "phylum" under official nomenclature rules14,15, the taxonomy database used in our analysis has adopted the updated nomenclature. To ensure consistency, we followed this updated nomenclature throughout the original manuscript.

      Reviewer #3 (Significance)

      This is an excellent piece of work. The characterization of Cpe1 might look poorly novel at the start when compared to previous studies. Yet the findings go crescendo by characterizing original mechanisms of action of the cognate immunity, and by identifying the molecular target of Cpe1. This is providing real conceptual advance in the T6SS field and not just reporting yet another T6SS toxin.

      As a T6SS expert I genuinely feel that these findings are groundbreaking and could be targeted to broad audience since the possible implications of these observations for future antimicrobial drugs discovery or therapeutic approaches is highly relevant.

      We sincerely appreciate the reviewer’s positive remarks and support of our study.

      References

      1. Ishihama, A., and Shimada, T. (2021). Hierarchy of transcription factor network in Escherichia coli K-12: H-NS-mediated silencing and Anti-silencing by global regulators. FEMS Microbiol Rev 45. 10.1093/femsre/fuab032.
      2. Hersch, S.J., Manera, K., and Dong, T.G. (2020). Defending against the Type Six Secretion System: beyond Immunity Genes. Cell Rep 33, 108259. 10.1016/j.celrep.2020.108259.
      3. Russell, A.B., Singh, P., Brittnacher, M., Bui, N.K., Hood, R.D., Carl, M.A., Agnello, D.M., Schwarz, S., Goodlett, D.R., Vollmer, W., and Mougous, J.D. (2012). A widespread bacterial type VI secretion effector superfamily identified using a heuristic approach. Cell Host Microbe 11, 538-549. 10.1016/j.chom.2012.04.007.
      4. Jana, B., Fridman, C.M., Bosis, E., and Salomon, D. (2019). A modular effector with a DNase domain and a marker for T6SS substrates. Nat Commun 10, 3595. 10.1038/s41467-019-11546-6.
      5. Halvorsen, T.M., Schroeder, K.A., Jones, A.M., Hammarlof, D., Low, D.A., Koskiniemi, S., and Hayes, C.S. (2024). Contact-dependent growth inhibition (CDI) systems deploy a large family of polymorphic ionophoric toxins for inter-bacterial competition. PLoS Genet 20, e1011494. 10.1371/journal.pgen.1011494.
      6. Nguyen, T.T., Sabat, G., and Sussman, M.R. (2018). In vivo cross-linking supports a head-to-tail mechanism for regulation of the plant plasma membrane P-type H(+)-ATPase. J Biol Chem 293, 17095-17106. 10.1074/jbc.RA118.003528.
      7. Liu, Y., Yu, J., Wang, M., Zeng, Q., Fu, X., and Chang, Z. (2021). A high-throughput genetically directed protein crosslinking analysis reveals the physiological relevance of the ATP synthase 'inserted' state. FEBS J 288, 2989-3009. 10.1111/febs.15616.
      8. Yamazaki, Y., Niki, H., and Kato, J. (2008). Profiling of Escherichia coli Chromosome database. Methods Mol Biol 416, 385-389. 10.1007/978-1-59745-321-9_26.
      9. Reece, R.J., and Maxwell, A. (1991). DNA gyrase: structure and function. Crit Rev Biochem Mol Biol 26, 335-375. 10.3109/10409239109114072.
      10. Harms, A., Stanger, F.V., Scheu, P.D., de Jong, I.G., Goepfert, A., Glatter, T., Gerdes, K., Schirmer, T., and Dehio, C. (2015). Adenylylation of Gyrase and Topo IV by FicT Toxins Disrupts Bacterial DNA Topology. Cell Rep 12, 1497-1507. 10.1016/j.celrep.2015.07.056.
      11. Lu, C., Nakayasu, E.S., Zhang, L.Q., and Luo, Z.Q. (2016). Identification of Fic-1 as an enzyme that inhibits bacterial DNA replication by AMPylating GyrB, promoting filament formation. Sci Signal 9, ra11. 10.1126/scisignal.aad0446.
      12. Matsui, T., Han, X., Yu, J., Yao, M., and Tanaka, I. (2014). Structural change in FtsZ Induced by intermolecular interactions between bound GTP and the T7 loop. J Biol Chem 289, 3501-3509. 10.1074/jbc.M113.514901.
      13. Abramson, J., Adler, J., Dunger, J., Evans, R., Green, T., Pritzel, A., Ronneberger, O., Willmore, L., Ballard, A.J., Bambrick, J., et al. (2024). Accurate structure prediction of biomolecular interactions with AlphaFold 3. Nature 630, 493-500. 10.1038/s41586-024-07487-w.
      14. Oren, A., Arahal, D.R., Rossello-Mora, R., Sutcliffe, I.C., and Moore, E.R.B. (2021). Emendation of Rules 5b, 8, 15 and 22 of the International Code of Nomenclature of Prokaryotes to include the rank of phylum. Int J Syst Evol Microbiol 71. 10.1099/ijsem.0.004851.
      15. Oren, A., and Garrity, G.M. (2021). Valid publication of the names of forty-two phyla of prokaryotes. Int J Syst Evol Microbiol 71. 10.1099/ijsem.0.005056.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      Bacterial warfare in microbial communities has become illuminated by recent discoveries on molecular weapons that allow contact-dependent injection of bacterial toxins between competitors. Among the best characterized systems are the type VI secretion system (T6SS) or the contact-dependent inhibition (CDI) system (i.e. some of the T5SSs). These systems are delivering a plethora of toxins with various biochemical activities and a broad range of targets. In recent years many such toxins have been characterized and their relevance in pointing at appropriate drug targets is increasing. In this study the authors built on a previously published association of a family of proteins, papain-like cysteine proteases (PLCPs), with their delivery by T6SS or CDI into target bacterial cells. Whereas this observation is not particularly novel, the findings that this set of proteins, that the authors called now Cpe1, can specifically target bacterial proteins such as ParE and GyrB, so that it affects chromosome partitioning and cell division, is groundbreaking. The authors are clearly demonstrating that Cpe1 cleaves their target proteins at double glycine recognition site which is in line with previous characterization of such proteases when fused to a particular category of ABC transporters. Even more remarkably they can show using biochemical approaches that Cpi1 is a cognate immunity for CpeI, preventing its activity, not by interfering with the catalytic site, but instead with the substrate binding site. The mechanism of competitive inhibition between immunity and substrate is also substantiated by biochemical data.

      Major comments

      • This is a very well conducted study which combines bacterial genetics and phenotypes with excellent biochemical evidence.
      • There are 8 targets identified for Cpe1 and yet only two are cleaved by the enzyme. It is intriguing that FtsZ is one identified target by the pull down but not confirmed for cleavage. The authors rules this as false positive but the cell division defect associated with Cpe1 activity would be consistent here. Are there any double glycine in FtsZ that could be identified as cleavage site? Is it possible that slightly different incubation conditions may promote degradation of FtsZ?
      • Could it be structurally predicted whether the GG of ParE or GyrB is fitted into the catalytic site of Cpe1.

      Minor comments

      • The authors described a family of proteases, PLPCs, and characterized one here called Cpe1. Not clear whether this is a generic name or one specific protein from one particular bacterial species. Indeed, it is unclear from which bacterial strain the Cpe1 protein studied here originates.
      • It may be worth to emphasize that the Cpe1 domain is found in all possible configurations as T6SS cargo and that is to be linked to VgrG, PAAR or Rhs.
      • Line 49 the authors could indicate that the Esx system is also known as type VII secretion system (T7SS).
      • Line 113 it may be better to use Proteobacteria instead of Pseudomonadota

      Significance

      This is an excellent piece of work. The characterization of Cpe1 might look poorly novel at the start when compared to previous studies. Yet the findings go crescendo by characterizing original mechanisms of action of the cognate immunity, and by identifying the molecular target of Cpe1. This is providing real conceptual advance in the T6SS field and not just reporting yet another T6SS toxin. As a T6SS expert I genuinely feel that these findings are groundbreaking and could be targeted to broad audience since the possible implications of these observations for future antimicrobial drugs discovery or therapeutic approaches is highly relevant.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript, titled "An Interbacterial Cysteine Protease Toxin Inhibits Cell Growth by Targeting Type II DNA Topoisomerases GyrB and ParE", describes how an effector family was identified and characterized as a papain-like cysteine protease (PLCP) that negatively impacts bacterial growth in the absence of its co-encoded immunity protein. This thorough report includes (1) bioinformatic analysis of prevalence, finding this PLCP effector encoded in many gram-negative bacteria, (2) confirming conservation of catalytic active site via structural (crystallographic) analysis, as well as visualizing contacts with the immunity protein, (3) validation of results using growth studies combined with mutagenesis, (4) using a cell-based cross-linking method to pull out potential targets, which were subsequently identified via mass spectrometry, (5) validation of these results using in vitro protease assays with purified (potential) substrates, including verification of the motif recognized on the substrate(s), and cell-based phenotype analyses, and finally, (6) demonstrating competition between immunity protein and ParE substrate using an in vitro pull-down approach. Overall, this is a strong body of work with compelling conclusions that are well supported by multiple experimental approaches.

      Major comments:

      The claims made based on the presented results are well supported, including that this PLCP effector toxin is widespread, is neutralized in a competitive mechanism by its immunity partner, and that it effectively cleaves both GyrB and ParE (subunits of bacterial type II topoisomerases) at a conserved motif, resulting in suppression of bacterial cell growth via mis-regulating chromosome segregation. No additional experiments are needed to further validate these results, and the authors are commended on the cell-based and in vitro studies to deduce very specific mechanisms and structural details.

      Minor comments:

      While the writing and data presentation are extremely clear, in general I recommend the authors indicate the level(s) of replication for experiments. Figure legends generally note that mean values with standard deviations are shown, but I did not find where the number of replicates (and independent versus technical) were listed.

      The figures are very clear, but in many instances the addition of PLCP toxin is indicated as "before" and "after"; while a modest change, I recommend altering this to some type of "-" and "+" type nomenclature rather than a time-based notation (especially as presumably both samples were treated identically, just with or without protease). I also suggest quantifying the intensities of the gel images presented in Figure 5c, d (for example, Cpe1 intensity as a ratio to that of the ParE ATPase domain), to make the interpretation even more evident.

      Crystallographic structure: the PDB report notes some higher-than-expected RZR scores; I interpret this to mean that there was strain around the catalytic site of one of the two toxins in the asymmetric unit, or that this copy was less well ordered. The RZR outliers likely arise from non-optimal weighting for geometric restraints. While no figures of electron density are presented, these modest outliers are not expected to alter the conclusions reached in the current work. One point of interest that is not addressed, however, is if any variance between the two complexes in the asymmetric unit are noted? A passage compares the current toxins to others in the larger subfamily and notes a rotation of a side chain is needed to superpose (Line 159). Can the authors please clarify around which bond this rotation is needed, and if both copies in the asymmetric unit are in the same orientation at this site?

      Significance

      Bacteria encode numerous effectors to successfully compete in natural environments or to mediate virulence; these effectors are typically associated with type VI secretion system machinery or referred to as contact dependent inhibition systems. The current work has identified a sub-family of papain-like cysteine protease effectors that are unique by targeting type II topoisomerases. Among the actionable findings is the identification of both the specific site of interaction with the topo substrates, as well as the specific motif recognized for cleavage. This should enable the field to move forward probing for this activity with other toxins and substrates. The insights provided by the competitive neutralization mechanism also stand out as an important contribution that can be more broadly applied. Within the literature, few effector targets are identified, making the current study stand out as impactful by the well-executed experiments that directly support the conclusions.

      While the current study has strong elements of novelty and is complete, it also nicely sets up future studies for remaining open questions. For example, does the nucleotide-bound status of the ATPase domain, or other catalytic intermediate, impact the susceptibility of topoisomerases to cleavage? Is this identified motif found in other ATPase domains? Is the negative supercoiling activity unique to gyrase also impacted, or is the phenotypic mechanism of cell toxicity reliant only on chromosome segregation? What types of kinetic parameters do this class of toxins demonstrate, and does sequence variability alter this? These ideas are a testament to the intriguing study as presented, capturing the readers' curiosity for additional details that are clearly beyond the scope of the current work.

      I anticipate this work will be of interest to the broad field of microbiologists that study interbacterial communication as well as pathogenic mechanisms. While the research is largely fundamental in nature, it is wide in scope with applications to many gram-negative bacteria that inhabit a myriad of niches. The work will also be of interest to specialists in topoisomerases, as the list of toxins that target these essential enzymes is growing and the therapeutic utility of topoisomerase inhibition remains vital. My interest lies in the latter, in toxin-mediated inhibition of topoisomerase enzymes as a means to alter bacterial cell growth. While I have strong expertise in structural biology, I am lacking in expertise for mass spectrometry. I note this because this method was used for the identification of the target substrate.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Song et al presents evidence to show that the predicted cysteine protease type 6 secretion system (T6SS) effector Cpe1 inhibits target cell growth by cleaving type II DNA Topoisomerases GyrB and ParE. The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

      Specific comments:

      Main points:

      1. To better establish that GyrB and ParE are the sole targets of Cpe1, the authors should express the GG mutant in target cells and determine whether these cells become resistant to Cpe1-mediated killing (inhibition). They can also determine whether co-expression of the cleavage resistant mutants suppresses the toxicity of Cpe1.
      2. Results in Figure 7 clearly show that Cpi1 is capable of displacing ParE from Cpe1 due to higher affinity. Yet, the "competitive inhibition model" described in the last result section does not completely match what is really happening in Cpe1-mediated interbacterial competition. If Cpi1 is in the target cell, it would more likely engage the incoming Cpe1 before it can interact with ParE or GyrB, so competition does not occur in this scenario. Similarly, in the predatory cells expressing Cpe1 and Cpi1, these two proteins will form a stably protein complex, and no competition with the target will occur. The authors should reconsider their model.

      Minor points:

      1. "Intoxication" was used throughout the text numerous times to describe the activity of Cpe1. Looking in the Marriam-Webster dictionary, "Intoxication" means "a condition of being drunk". This word should be replaced with "toxicity" or some other terms in this line.
      2. Lines 46-48, references on contact-dependent killings by these systems mentioned should cited. Ref. 9 cited does NOT cover the informatin at all.
      3. "characterizations" should be "characterization".
      4. Line 229 "Cpe1-Bpa monomers" should be " apo Cpe1-Bpa". The results cannot distinguish whether these bands are monomers or multimers.
      5. Line 283, was the mutation deletion? Substitution was used I think.
      6. Lines 439-444 the discussion should be extended to include other bacterial toxins that target type II DNA topoisomerases (e.g. PMID: 26299961 and PMID: 26814232).

      Significance

      The authors determined the structure of the protein complex formed by Cpe1 and its immunity protein Cpi1, which allowed them to reveal the mechanism of inhibition. Moreover, the authors identified type II DNA topoisomerases GyrB and ParE as the targets of Cpe1. Overall, the major conclusions were well supported by experimental data of high quality. The findings have expanded our appreciation of the mechanism utilized by T6SS effectors to inhibit target cell growth.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We thank the reviewers for their evaluation of our previous submission and have responded to each point in detail below. Overall, we have revised the manuscript with the addition of several new data and corresponding figure panels that strengthen our previous conclusions and add new insights allowing us to extend the conclusions of the study. Important additions include new data showing the impact of loss of CLU on adapting to additional stressors during metabolic transitions that supports a mechanistic understanding of our omics results; by poly(dT) FISH we show that fly Clu granules indeed contain mRNAs; FRAP microscopy analysis supports that Clu1 granules have dynamic content similar to other LLPS membraneless organelles; and we have re-analysed our data to demonstrate more clearly the impact of Clu1 on translation efficiency and also the relative binding of mRNAs during translation. In addition, we provide some extra control analyses for completeness.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript the authors study the Clustered mitochondrial proteins Clu of Drosophila melanogaster and Clu1 of Saccharomyces cerevisiae, two homologues of the mammalian protein CLUH. They show in compelling microscopy analysis that both proteins form granules. This was the case for flies fed on yeast paste after starvation and in yeast in post-diauxic phase, in respiratory media or during mitochondrial stress. They show that these granules are found in proximity to mitochondria and that they behave like liquid-liquid-phase separated condensates. They show by co-staining for P-bodies and stress granules that Clu1-granules are distinct from these RNA granules. Furthermore, they found that the formation required active translation. In the second part, they show that Clu1 interacts with ribosomal and mitochondrial proteins by BioID. The deletion of Clu1 leads to slightly impaired growth on media containing Ethanol as a carbon source. They find that nascent polypeptides of some mitochondrial precursor proteins are decreased in the deletion of Clu1 and conclude that Clu1 regulates translation of these proteins. Using RNA immunoprecipitation of Clu1-GFP in presence of cycloheximid, EDTA and puromycin. The mRNAs of nuclear-encoded mitochondrial proteins found to be interacting with Clu1 were purified in conditions when the ribosomes are intact and the RNAs showed no interaction when ribosomes were disassembled. They show in sucrose gradients that Clu1 co-migrates with polysomes independent of its distribution state or carbon source. However, when cells are grown in conditions of granule formation, then polysomes and Clu1 run less deeply into the gradient. Form these data, the authors conclude that Clu/Clu1 regulates the translation of nuclear-encoded mitochondrial proteins.

      Major comments:

      -The authors state that Clu1 is regulating translation during metabolic shifts. However, it is not clear what the real impact on mitochondrial function is. They show that there is a minor growth defect on ethanol media when CLU1 is deleted. However, if Clu1 is necessary mainly for adaptation, the phenotype will be strongest observed in conditions where cells switch carbon sources. Growth curves would be suitable in which the lag-phase of yeast cells precultured either in glucose or glycerol switched to media of different carbon sources (glucose to glycerol or glycerol to glucose) are measured. One would expect that the deletion mutant shows a longer lag-phase compared to the wild type when shifted from glucose to glycerol media.

      We agree that this is an important question, and, duly, we previously attempted to address this exactly as the reviewer described. Surprisingly, we were not able to observe any substantial differences in the duration of the lag phase between the wild-type and CLU1 knockout strains under these conditions. However, we did note that CLU1 knockout cells consistently reached stationary phase with a lower optical density when switched to ethanol media, consistent with these cells having a different metabolic efficiency during growth on ethanol media.

      To further explore the role of Clu1, we noted that several of the Clu1 mRNA interactors were mitochondrial heat shock proteins (HSPs), which are crucial for mitochondrial protein folding and import during the transition from fermentation to respiration. Hence, we hypothesised that the absence of Clu1 might lead to increased sensitivity to heat shock during the metabolic shift.

      To test this, we subjected both wild-type and CLU1 knockout cells to heat shock under three different conditions: (1) during growth on glucose-containing media (fermentation), (2) after shifting cells to media containing ethanol during the lag phase, when cells are adapting to respiration, and (3) after cells had fully adapted to ethanol and resumed growth. Interestingly, CLU1 knockout cells were more sensitive to heat shock selectively during the adaptation to respiration, which involves the translation of an extensive number of mitochondrial proteins. We think that the small difference in translation of mitochondrial HSPs becomes evident only upon additional heat shock, likely due to a deficient mitochondrial protein folding and import. These findings support our hypothesis that Clu1 is essential for optimal mitochondrial function during metabolic shifts.

      These results have been added to the manuscript and shown in Fig. S6 and described on page 9.

      -In line with this, how different is the mitochondrial proteome of the WT and the mutant? Do hits of the BioID, RIP and Punch-P experiments change at steady state or during metabolic shifts? Either proteomics of isolated mitochondria or western blots of whole cells or isolated mitochondria of WT and the deletion mutant grown in conditions of Clu1-granule formation or no granules for the hits could answer this question.

      We also considered this question during the course of the work. However, in exploratory analyses we saw no obvious differences in overall mitochondrial proteomics at steady-state which is what prompted us to look at more subtle effects on translation. Considering this further, changes in steady-state levels can be complex to interpret as they represent the combined effects of protein production and degradation. Small changes arising from altered production could be masked by compensatory changes in turnover rate. In light of this, we believe that the translational regulation differences identified in our study remain central to understanding the role of Clu1, and any downstream proteomic changes would not alter our primary conclusions.

      -The authors analyze RNAs bound in polysomes to assess translation efficiency. Translation efficiency is usually calculated by the fraction of RNA bound by ribosomes to the total RNA amount of an RNA species. Thus, doing RT-qPCR from whole cells would be necessary to assess if the occupancy of ribosomes on the transcripts is due to changes in RNA abundance or other regulatory pathways and would help to further assess what causes the observed changes.

      Thanks for this recommendation. To address this and expand our analysis to other proteins differentially translated in clu1Δ cells, we measured the mRNA steady-state levels by performing RNAseq on WT and clu1Δ strains grown under the same conditions as used for Punch-P. We then calculated the translation efficiency by dividing the nascent protein levels (Punch-P) by steady-state mRNA levels (RNAseq), as previously described for Punch-P data (PMID: 26824027). The translation efficiency for the majority of proteins with reduced translation in the clu1Δ cells by Punch-P analysis was lower. Similarly, the majority of proteins with increased translation had higher translation efficiency.

      The mRNA quantification in polysomes we originally presented in the manuscript, further showed that the decrease in translation efficiency is not caused by a simple decrease of mRNA engaged in translation and that Clu1 is regulating protein translation at the ribosome level. In contrast, for higher translated proteins, we detected an increase in mRNAs engaged in polysomes, likely underlying the increased translation. These results further support our conclusions regarding the regulatory effects of Clu1 on translation.

      These results have been added to the manuscript and shown in Fig. 7E and described on page 9.

      OPTIONAL:

      -The authors show a co-localization of Clu/Clu1 with mitochondrial fission factors and conclude that the granules appear likely near fission sites. Indeed, CLUH has been implied in the past to play a role in mitochondrial fission (Yang, H., Sibilla, C., Liu, R. et al. Clueless/CLUH regulates mitochondrial fission by promoting recruitment of Drp1 to mitochondria. Nat Commun 13, 1582 (2022). https://doi.org/10.1038/s41467-022-29071-4). Thus, are fission sites required for Clu-granule localizations? What is the role of the mitochondrial network integrity for the granule distribution? Expressing Clu-GFP/Clu1-GFP in cells depleted for the fission factors would provide information on that.

      Thanks for this suggestion. We agree that it would be interesting to know whether Clu1 granules still appear when mitochondrial fission is blocked. We tried to address this question but encountered some technical limitations. First, overexpression of Clu1-GFP via a plasmid did not replicate the endogenous Clu1 behaviour, making it necessary to delete the fission factors in the Clu1-GFP background. While crossing the Clu1-GFP strain with already available knockout strains would be straightforward, we would need access to a tetrad dissecting microscope, which unfortunately was not available to us. We also attempted PCR-based gene deletion but the sequence homology between the GFP-tagging cassette and the deletion cassettes made this very challenging. Given these limitations, and as the lab's yeast expert had already left, we were not able to pursue this experiment further and have removed these observations from our manuscript. We hope that future studies will explore this question in more detail.

      -The author assess convincingly that Clu1 interacts with ribosomes and runs with polysomal fractions. However, how it actually regulates translation is not clear. To answer this question, selective ribosomal profiling would be necessary. The authors have established conditions which would be suitable for the experiment. They could use crosslinking and sucrose cushions to IP ribosomes with Clu1-GFP bound to be used for ribosomal profiling. However, this experiment is quite time-intensive (3-4 months) and expensive, thus, an optional suggestion.

      We thank the reviewer for this suggestion. We agree that ribosome profiling could provide novel insights into the function of Clu1/Clu. While we recognise the potential of this approach, as the reviewer points out, this experiment would indeed be time- and resource-intensive. Based on our initial tests, where we included cross-linked samples (UV and formaldehyde) we anticipate that it could even take longer than the estimated 3-4 months, as the IP using cross-linked lysates was not as successful as the IP using non-cross-linked samples: we were not able to immunoprepitate Clu1 so efficiently likely to the epitope being poorly exposed to the antibody. Although we have optimised working conditions for co-immunoprecipitating Clu1 with ribosomes, performing ribosome profiling using our setup within the timeframe and resources of this study is unfortunately not currently feasible.

      Minor comments:

      Fig1: B, C, please add scale bars into the zoom ins.

      These have been added.

      Fig 2 would profit from inlets of zoom ins to visualize the distribution better.

      These have been added.

      Fig.3: Panel C does not really add much information. I would rather remove it or put it into supplements and therefore show a zoom of Panel E with a line plot showing the rings. It is not clear from the represented images where the rings are formed.

      We think some confusion has arisen from the text description. It seems that the reviewer was under the impression that Fig. 3C and 3E were intended to be showing the Clu1 rings around the mitochondria, but this was shown only in Fig. S3A. We have re-written these sentences for better clarity. To be clear, Fig. 3C is a 3D rendering of the left-hand cell in 3B (3D is a line plot of part of the right-hand cell) and 3E is a different experiment showing the formation of Clu1 granules under a different respiratory stress (galactose plus CCCP). We have also added a line plot showing Clu1-GFP and mito-mCherry fluorescence intensity to highlight the Clu1 rings around the mitochondria in Fig. S3A.

      Fig.3 panel F: Max projections are not appropriate to show colocalization as they can lead to false-positive overlaps. Just remove the max projections.

      We tried a number of different approaches to improve this analysis but, ultimately, we were not able to generate sufficiently robust data to be convincing so we decided to remove this from the manuscript. The coincidence of Clu1 granules with mitochondrial fission factors was an adjunct observation and not a major part of the story and has been discussed by others relating to fly Clu (PMID: 35332133), so removal from the current manuscript does not impact the key conclusions of the study.

      References 21 and 22 are the same.

      Thanks. This has been fixed.

      Reviewer #1 (Significance (Required)):

      This manuscript shows in a convincing way that Clu and Clu1 form RNA granules and that Clu1 interacts with ribosomes. It is written in a clear way and the figures support the conclusions drawn in the text. The finding that Clu/Clu1 is important for metabolic adaptation has not been shown in fly or yeast to my knowledge. It is in line with findings for the mammalian homologue CLUH. Thus, the findings are supported by earlier work. This study is of value for a broader audience of the basic research field, especially of the mitochondrial and RNA granule field, as it supports the idea of post-transcriptional regulation of nuclear-encoded mitochondrial protein gene expression for dynamic adaptation of mitochondrial function. The conditions when Clu granules form is studied in detail, followed up by identification of target RNAs and interaction partners. Though the interaction of Clu1 with ribosomes is shown in a compelling way, a detailed mechanism of the function of Clu/Clu1 is missing and would require more experiments. Thus, even though a detailed mechanism is missing, the study does expand on our understanding of Clu/Clu1 in regulating mitochondrial biogenesis and is therefore of high interest of the mitochondrial field.

      Expertise: mitochondria, yeast, RNA granules, mitochondrial biogenesis, next-generation sequencing, fluorescence microscopy

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript the authors use D. melanogaster and S. cerevisiae to study the role of CLUH in the translation of nuclear-encoded mitochondrial proteins. During conditions requiring aerobic respiration, CLUH forms RNA-dependent granules that localise in the proximity to mitochondria. Furthermore, the authors demonstrate that CLUH interacts with translating ribosomes to facilitate the translation of specific target mRNAs. For this, the authors use a combination of GFP-tagged CLUH models. BioID, polysome translating proteomics, RNA-IP. The authors' main conclusions are that (i) CLUH forms dynamic, membrane-less, RNA-dependent granules under conditions that demand aerobic respiration, (ii) CLUH interacts with specific mRNAs encoding metabolic factors, and (iii) CLUH interacts with the translating ribosome. The manuscript is well written and the conclusions stand in proportion to the experimental output and the results. The main concern is with regards to lack of advancement in relationship to published data.

      We appreciate the reviewer's feedback and specific comments which we respond to individually below. However, we would like to first address the point regarding "lack of advancement" and the use of the "CLUH" terminology which the reviewer uses throughout their critique. We would like to reiterate, as the reviewer states, our work focussed exclusively on yeast Clu1 and Drosophila Clu. None of our data relates to mammalian CLUH. While these proteins share substantial sequence homology, it is imprudent and scientifically unsound to assume cross-species equivalence without directly testing. Indeed, one of the central aims of our study was to characterise the molecular function of yeast Clu1, which remains almost entirely unstudied.

      We acknowledge that some of the observations contained within our study have been described by others and we have appropriately noted and cited these in context. Nevertheless, (a) independent replication is always valuable but easily criticised as lacking novelty, and (b) the majority of the work was analysing the molecular dynamics and function of yeast Clu1 which is almost completely unstudied and may help provide hypotheses for others to test for conservation in mammalian CLUH. Hence, we consider that summarising the work as 'lacking advancement' is misplaced.

      Comments:

      To this reviewer it is not clear how CLUH can regulate the translation of specific mRNAs while being bound to ribosomes, regardless of being in a diffuse or granular state. The authors suggest that under metabolically active conditions, CLUH might aggregate translating ribosomes, forming the granular structures. How CLUH though can both be bound to translating ribosomes and recruit specific mRNAs at the same time is not explained.

      It was indeed surprising to us that the data indicate that Clu1 can bind both mRNAs and ribosomes to affect translation, and we share the reviewer's curiosity about the precise mechanism of how this occurs. While we have provided novel insights into this situation, dissecting the precise molecular mechanisms is beyond the scope of the current study.

      The authors might want to discuss how changes in metabolic demands signal the aggregation of CLUH, and how CLUH can recognise its target mRNAs.

      We appreciate the reviewer's point here but as this would be pure speculation we have made only brief comments on this at the end of the Discussion.

      What was the rationale to perform the RIP or the PUNCH-P experiments only under non-challenged conditions, but not under conditions demanding aerobic respiration?

      We appreciate the reviewer's question. In fact, the Punch-P analysis was carried out on cells that had been transferred to ethanol to induce respiration. This was stated in the Methods, but we appreciate that this may have been missed so we have now clarified this in the main text (p9).

      Regarding the RIP, our initial tests showed that mRNAs encoding proteins found to interact with Clu1 by BioID were interacting with Clu1 in both fermenting and respiring conditions. Due to this consistency, it did not seem necessary to perform the RIP experiments under both metabolic conditions, so we chose to conduct the experiment under the simpler growth condition.

      If CLUH is ubiquitously bound to ribosomes, has CLUH been seen in any structural representation of the cytosolic ribosome?

      This is a good question, and we wondered the same. To our knowledge, Clu1/Clu/CLUH has not been observed in any structural studies of the ribosome, and no formal structure of any Clu family proteins has been resolved.

      Nevertheless, we would like to clarify that we do not think, or suggest in the manuscript, that Clu/Clu1 is ubiquitously bound to ribosomes. First, current evidence supports that Clu/Clu1 only regulates a specific subset of mRNAs. Second, our work, particularly the sucrose gradient experiments, shows that Clu1 interacts transiently with ribosomes, as cross-linking was required to capture the full extent of this interaction. This transient and selective interaction of Clu/Clu1 with the ribosome, together with the fact that transient interactors are often lost during ribosome purification, makes Clu/Clu1 detection in structural studies unlikely. Due to the transient interaction and dynamic localisation of Clu/Clu1, capturing Clu/Clu1 in ribosomal structures will require significant work in the future.

      Reviewer #2 (Significance (Required)):

      CLUH has been studied in various publications, showing data very similar to that presented in this manuscirpt. However, the authors provide a comprehensive analysis on both yeast and fly CLUH. The strength of the manuscript is the combination of several elegant methods and genetically modified model systems in two species to elucidate the role of CLUH during the translation of specific mRNA. In my view through, the advancement of understanding the function of CLUH is limited.

      Although the authors work in yeast and DM, the results seem applicable to other species, including humans, and thus, the presented results will be of interest in a range of researchers working in the field of metabolic regulation and gene expression.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: This study from Miller-Fleming et al. employs yeast and Drosophila as model systems to explore the function of the RNA-binding protein Clu1, which is involved in mitochondrial biogenesis. The first part of the manuscript characterizes so called "Clu1 granules", and their dependance from metabolic transitions. In particular, using yeast, they find a relocalisation of Clu1 upon starvation and several mitochondrial stress conditions. These granules are not stress granules, and are dissolved by RNAse and puromycin treatment. The second part of the study aims to understand the molecular function of the protein and its link to translation. The results confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and in interacting with mitochondrial proteins, ribosomal components and polysomes. In addition, the authors claim that binding of Clu1 to RNA is enhanced when mRNAs are trapped in polysomes by treatment with cycloheximide (CHX), leading to the proposal that Clu1 binds mRNAs during active translation.

      Major comments:

      -The claim of Clu1 granule localization next to mitochondria (Figure 3) would be more convincing if any of the experiment would be quantified. Especially in the case of panel 3G in Drosophila egg chambers where there are a lot of mitochondria, one wonders whether the closeness to mitochondria is just random. Furthermore, mdv1-signal does not look very convincing, being blurry and not dotty as expected. Thus, the conclusion that Clu1 granules partially colocalization with site of fission appears premature.

      The claim that Clu/Clu1 granules are often found in close proximity to mitochondria was inferred from observations from multiple analyses from yeast (we looked at hundreds of cells in several different conditions) and flies, where it had already been demonstrated (Cox and Spradling, 2009). We agree that observations of the fly egg chambers are challenging due to the very high density of mitochondria (and other cellular components - see the new analysis of poly(A) mRNAs) in these highly active cells. These considerations motivated us to take the CLEM approach (in addition to investigating the membraneless nature), to gain a much higher resolution view of the localisation of the granules. This analysis unequivocally showed that the Clu granules were exactly juxtaposed to several mitochondria. It is noteworthy that even in the TEM images shown, there is ample cytoplasm in which the Clu granule could be located if the association with mitochondria was coincidental and all granules had mitochondria in close proximity.

      Regarding the possible coincidence of Clu1 with mitochondrial fission factors, as mentioned above for Reviewer 1, we tried a number of different approaches to improve this analysis but, ultimately, we were not able to generate sufficiently robust data to be convincing so have decided to remove this from the manuscript. Since this was an adjunct observation and not a major part of the story and has been discussed by others relating to fly Clu (PMID: 35332133), removal from the current manuscript does not impact the key conclusions of the study.

      Based on the ability of 1,6-hexanediol to dissolve the granules (Figure 4), the authors conclude that: "Clu1 foci have membraneless nature". As they correctly state in the discussion, treatment with 1,6-hexanediol can have other effects. I suggest to be more cautious with the conclusions or add additional experiments. Are the granules dynamics if using FRAP? Do they fuse?

      The inference that the Clu1 granules are membraneless organelles was not solely based on the observation that they disassemble upon 1,6-hexanediol treatment but was made in conjunction with the CLEM analysis that showed unambiguously that Clu granules are not associated with any detectable membrane, which is strong evidence that these granules are membraneless in nature. Indeed, as the reviewer mentioned, we are cautious in concluding they have been formed by liquid-liquid phase separation (LLPS) and we do acknowledge that 1,6-hexanediol can have other effects in cells. Nevertheless, following the reviewer's suggestion we have analysed Clu1 granule dynamics using FRAP, even though we are aware that FRAP is also not a definitive proof that a structure is formed by LLPS. The FRAP analysis, shown in new Figure 4C, D, revealed approximately 50% recovery over 10 min imaging timeframe. As discussed on page 13, this indicates a dynamic nature of these granules, but this dynamism can vary widely between different types of granules and even different proteins within the same granule. Further work is warranted to fully investigate the dynamic nature of Clu/Clu1 granule components.

      The experiment in which the granules are dissolved by treatment with RNAse is very interesting. However, per se this does not directly demonstrate that the granules contain mRNA. To state this the author should perform FISH experiments for example using a probe to detect poly-A.

      We thank the reviewer for this suggestion. We have performed poly(dT) FISH in egg chambers. Initial analysis showed that the fluorescence was diffuse and widely distributed, as expected for these highly active cells, but with no specific accumulation in Clu granules. Interestingly, we observed that treatment with RNase A, which we initially used to demonstrate probe specificity, revealed an enrichment of poly(A) RNAs in Clu granules. So, while treating the live egg chambers with RNase revealed that granules depend on RNA for their stability, treating fixed egg chambers revealed more directly the presence of RNAs in granules.

      These results have been added to the manuscript and shown in Fig. 5 and described on page 7.

      The authors show that puromycin prevents the granule formation before insulin addition in the fly. Are these results (upon RNAse treatment and puromycin treatment) recapitulated in the yeast system? The authors conclude that Clu1 formation depends on mRNAs being engaged in translation, but never show that the granules are site of active translation. More experiments in this direction (for example using puro-PLA of specific mRNAs) are missing and would clearly improve the manuscript.

      Thanks for this very interesting consideration. We agree that we have not formally shown that the Clu1 granules are sites of active translation. A major limitation to addressing this is that puromycin is not able to penetrate the yeast cell wall, so cannot be used for analysis of intact cells as would be needed in this case. We agree that this would be a welcome addition but is beyond the scope of the current study.

      The interactome of Clu1-neighbouring proteins (Figure 6) is interesting and a valuable addition to data in other organisms. I am wondering why the authors have not used as a control a cytosolic BirA-GFP, which would have been the right control for this experiment, especially since GFP tends to form aggregates.

      We thank the reviewer for this comment. With hindsight, we agree that a cytosolic BirA-GFP would have been a better control. However, we are confident in our results for the following reasons:

      1. The levels of GFP obtained from Clu1-GFP expression are low, and under these conditions, we observed no evidence of GFP aggregation. Even in experiments where GFP is overexpressed from a high-copy 2µ plasmid under a strong promoter, we do not detect aggregation. Aggregation is not a concern in our experimental setup.
      2. Our conclusions are not solely based on the interactome analysis (BioID) but are supported by complementary findings. Specifically, several proteins identified in the proximity to Clu1 in the BioID analysis showed reduced translation in Clu1 knockout cells, and their corresponding mRNAs were found to interact with Clu1 during translation. These complementary results from independent techniques provide strong evidence for Clu1's role and validate the findings of the interactome analysis. Given this robust and complementary dataset, having BirA as a control strain was sufficient to validate our conclusions.

      Figure 7B: The log 2 FC for the changed proteins are in many cases small, implying that the difference in translation for these proteins is not so large. For this reason, it is relevant to know how was the statistical significance calculated for these MS measurements. In the supplementary Tables and in Fig 7B, a p value is indicated and it is not clear if this is a simple p value or an adjusted p value (FDR or q value). If not shown, I recommend showing the adjusted p value, so that one can have an idea of the solidity of the data and the claim. Again, this is an important piece of evidence, since the authors base on this experiment the conclusion that Clu1 controls translation of these mRNAs.

      Thanks for this comment. We have now included the q-value in the supplementary table.

      Minor comments:

      -Figure 1: The change in Clu1 localisation in post-diauxic phase or upon changing of the medium is evident from the images shown. However, it seems that the experiment has been performed only once (the same for Figure 2). Is this the case? An important information would be to show the expression levels of Clu1-GFP in the different conditions. Does recruitment of CLU1 to granules associate to increased expression levels?

      The experiments shown in figures 1 and 2 were performed independently at least three times, as stated in the figure legends. The numbers shown are indicative values from one of the replicate experiments. This has now been added to the figure legends.

      We agree that providing the information regarding the expression levels of Clu1-GFP is important to address whether the recruitment of Clu1 to granules is associated with changes in its abundance. To this end, we have performed an additional experiment to quantify Clu1-GFP levels under the conditions where Clu1 is diffuse (log growth phase in glucose-containing media) and when Clu1 is in granules (sodium azide treatment).

      These results have been added to the manuscript and shown in Fig. S2 and described on page 4.

      Figure 2 A-B. The authors claim that the only stressor capable of inducing Clu1 granules formation alone is inhibition of complex IV activity via sodium azide treatment. Other mitochondrial stresses like CCCP treatment or OA treatment are efficient only when combined to starvation. It should be mentioned that sodium azide treatment is not only capable of inhibiting complex IV but has also uncoupling function.

      Thanks for this comment. We have now mentioned this (p4).

      Figure 2 D-E: investigation of colocalization with Bre5 would help to understand how similar the yeast Clu1 granules are compared to the mammalian CLUH granules (Pla-Martin et al., 2020).

      This is an interesting suggestion and one that we also considered, but with limited time and resources we were not able to pursue this line of inquiry as well.

      Figure 8. This figure summarizes one of the most novel pieces of data about Clu1, the interaction with mRNAs via the ribosome. The way how panel A-C are represented is however a bit misleading. The Y axis in Figure B and C has the same amplitude as the one in A. Therefore, potential differences in Clu1-RNA pull-down in presence of EDTA or puromycin cannot be assessed. It is true that in presence of CHX there is much more pulled down RNA, but one cannot judge from these panels if there is any difference between Clu1 targets and controls also in the other conditions. The graphs should be modified and statistics added.

      We appreciate the reviewer's feedback regarding the presentation of the RIP-qPCR data in Fig. 8. Based on the comments, we have revised how the results are represented, improved the normalisation of the data and added statistical analysis.

      First, it is worth clarifying that the presentation of the original charts was done specifically to highlight the huge differences between RNA-pulldown in CHX versus disrupted ribosomes. It is also important to note that these RIP experiments were performed simultaneously under identical experimental conditions, so any differences lie in the treatments applied. To improve cross-comparison between treatments we have now incorporated an additional normalisation step. We normalised the enrichment levels of each mRNA tested against the non-specific binding observed with the negative control housekeeping genes (UBC6 and TAF10). This ensures that differences in bead loss or other technical variations are accounted for.

      We now show the comparison of the six positive hits and two negative controls normalised as described above, on the same scale (Fig. 8A). We now also present the relative effects of the three conditions (CHX, EDTA, and puromycin) within the same graph for each mRNA tested (Fig. 8B). This format enables direct comparison of Clu1 target mRNA enrichment and two negative controls across treatments, which is the relevant comparison for testing the hypothesis of ribosome-dependent interactions. We have adjusted the Y-axis scaling for each mRNA, as requested by the reviewer, and added statistical comparisons. For clarity, the data shown in Fig. 8A are also represented in the panels of Fig. 8B (CHX). We have amended the text appropriately and hope that these changes improve the comparisons between treatments and more readily demonstrate that Clu1 target enrichment is lost upon ribosome disassembly, either by EDTA or by puromycin.

      In addition, RNAse treatment in panel L does not seem to have really worked.

      These samples were cross-linked prior to treatment to preserve the transient interaction of Clu1 with the ribosome, hence, the normal dramatic effect of RNase to collapse the polysomes is much less pronounced. Nevertheless, the purpose of this experiment was to monitor whether Clu1 co-migrated with ribosomes, which it does.

      The authors should cite Vornlocher et al. (PMID: 10358023), who were the first to implicate Clu1 (Tif31) with translation.

      Thank you for this prompt. We have now added a comment on this in the Discussion (page 13).

      References 21 and 22 are the same.

      Thanks. This has been fixed.

      Reviewer #3 (Significance (Required)):

      The data reported in this manuscript are valuable, because they confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and regulating their translation. It is also interesting that in yeast, similar to Drosophila and mammalian cells, Clu1 can form granular structures upon metabolic rewiring. A limitation of the study is that direct experiments to support the claim that Clu1 concentrates ribosomes engaged in translation are not provided. Furthermore, it is not clear what is the functional role of the Clu1 granules, since the proximity interactome and the binding of Clu1 to the polysomes is not affected by treatments that dissolve or stimulate granule formation.

      The study is of interest to a general cell biology audience.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study from Miller-Fleming et al. employs yeast and Drosophila as model systems to explore the function of the RNA-binding protein Clu1, which is involved in mitochondrial biogenesis. The first part of the manuscript characterizes so called "Clu1 granules", and their dependance from metabolic transitions. In particular, using yeast, they find a relocalisation of Clu1 upon starvation and several mitochondrial stress conditions. These granules are not stress granules, and are dissolved by RNAse and puromycin treatment. The second part of the study aims to understand the molecular function of the protein and its link to translation. The results confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and in interacting with mitochondrial proteins, ribosomal components and polysomes. In addition, the authors claim that binding of Clu1 to RNA is enhanced when mRNAs are trapped in polysomes by treatment with cycloheximide (CHX), leading to the proposal that Clu1 binds mRNAs during active translation.

      Major comments:

      • The claim of Clu1 granule localization next to mitochondria (Figure 3) would be more convincing if any of the experiment would be quantified. Especially in the case of panel 3G in Drosophila egg chambers where there are a lot of mitochondria, one wonders whether the closeness to mitochondria is just random. Furthermore, mdv1-signal does not look very convincing, being blurry and not dotty as expected. Thus, the conclusion that Clu1 granules partially colocalization with site of fission appears premature.
      • Based on the ability of 1,6-hexanediol to dissolve the granules (Figure 4), the authors conclude that: "Clu1 foci have membraneless nature". As they correctly state in the discussion, treatment with 1,6-hexanediol can have other effects. I suggest to be more cautious with the conclusions or add additional experiments. Are the granules dynamics if using FRAP? Do they fuse?
      • The experiment in which the granules are dissolved by treatment with RNAse is very interesting. However, per se this does not directly demonstrate that the granules contain mRNA. To state this the author should perform FISH experiments for example using a probe to detect poly-A.
      • The authors show that puromycin prevents the granule formation before insulin addition in the fly. Are these results (upon RNAse treatment and puromycin treatment) recapitulated in the yeast system? The authors conclude that Clu1 formation depends on mRNAs being engaged in translation, but never show that the granules are site of active translation. More experiments in this direction (for example using puro-PLA of specific mRNAs) are missing and would clearly improve the manuscript.
      • The interactome of Clu1-neighbouring proteins (Figure 6) is interesting and a valuable addition to data in other organisms. I am wondering why the authors have not used as a control a cytosolic BirA-GFP, which would have been the right control for this experiment, especially since GFP tends to form aggregates.
      • Figure 7B: The log 2 FC for the changed proteins are in many cases small, implying that the difference in translation for these proteins is not so large. For this reason, it is relevant to know how was the statistical significance calculated for these MS measurements. In the supplementary Tables and in Fig 7B, a p value is indicated and it is not clear if this is a simple p value or an adjusted p value (FDR or q value). If not shown, I recommend showing the adjusted p value, so that one can have an idea of the solidity of the data and the claim. Again, this is an important piece of evidence, since the authors base on this experiment the conclusion that Clu1 controls translation of these mRNAs.

      Minor comments:

      • Figure 1: The change in Clu1 localisation in post-diauxic phase or upon changing of the medium is evident from the images shown. However, it seems that the experiment has been performed only once (the same for Figure 2). Is this the case? An important information would be to show the expression levels of Clu1-GFP in the different conditions. Does recruitment of CLU1 to granules associate to increased expression levels?
      • Figure 2 A-B. The authors claim that the only stressor capable of inducing Clu1 granules formation alone is inhibition of complex IV activity via sodium azide treatment. Other mitochondrial stresses like CCCP treatment or OA treatment are efficient only when combined to starvation. It should be mentioned that sodium azide treatment is not only capable of inhibiting complex IV but has also uncoupling function.
      • Figure 2 D-E: investigation of colocalization with Bre5 would help to understand how similar the yeast Clu1 granules are compared to the mammalian CLUH granules (Pla-Martin et al., 2020).
      • Figure 8. This figure summarizes one of the most novel pieces of data about Clu1, the interaction with mRNAs via the ribosome. The way how panel A-C are represented is however a bit misleading. The Y axis in Figure B and C has the same amplitude as the one in A. Therefore, potential differences in Clu1-RNA pull-down in presence of EDTA or puromycin cannot be assessed. It is true that in presence of CHX there is much more pulled down RNA, but one cannot judge from these panels if there is any difference between Clu1 targets and controls also in the other conditions. The graphs should be modified and statistics added. In addition, RNAse treatment in panel L does not seem to have really worked.
      • The authors should cite Vornlocher et al.. ( PMID: 10358023), who were the first to implicate Clu1 (Tif31) with translation.
      • References 21 and 22 are the same.

      Significance

      The data reported in this manuscript are valuable, because they confirm an evolutionary conserved role of Clu1 in binding mRNAs for mitochondrial proteins and regulating their translation. It is also interesting that in yeast, similar to Drosophila and mammalian cells, Clu1 can form granular structures upon metabolic rewiring. A limitation of the study is that direct experiments to support the claim that Clu1 concentrates ribosomes engaged in translation are not provided. Furthermore, it is not clear what is the functional role of the Clu1 granules, since the proximity interactome and the binding of Clu1 to the polysomes is not affected by treatments that dissolve or stimulate granule formation. The study is of interest to a general cell biology audience.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors use D. melanogaster and S. cerevisiae to study the role of CLUH in the translation of nuclear-encoded mitochondrial proteins. During conditions requiring aerobic respiration, CLUH forms RNA-dependent granules that localise in the proximity to mitochondria. Furthermore, the authors demonstrate that CLUH interacts with translating ribosomes to facilitate the translation of specific target mRNAs. For this, the authors use a combination of GFP-tagged CLUH models. BioID, polysome translating proteomics, RNA-IP. The authors' main conclusions are that (i) CLUH forms dynamic, membrane-less, RNA-dependent granules under conditions that demand aerobic respiration, (ii) CLUH interacts with specific mRNAs encoding metabolic factors, and (iii) CLUH interacts with the translating ribosome. The manuscript is well written and the conclusions stand in proportion to the experimental output and the results. The main concern is with regards to lack of advancement in relationship to published data.

      Comments:

      • To this reviewer it is not clear how CLUH can regulate the translation of specific mRNAs while being bound to ribosomes, regardless of being in a diffuse or granular state. The authors suggest that under metabolically active conditions, CLUH might aggregate translating ribosomes, forming the granular structures. How CLUH though can both be bound to translating ribosomes and recruit specific mRNAs at the same time is not explained.
      • The authors might want to discuss how changes in metabolic demands signal the aggregation of CLUH, and how CLUH can recognise its target mRNAs.
      • What was the rationale to perform the RIP or the PUNCH-P experiments only under non-challenged conditions, but not under conditions demanding aerobic respiration?
      • If CLUH is ubiquitously bound to ribosomes, has CLUH been seen in any structural representation of the cytosolic ribosome?

      Significance

      CLUH has been studied in various publications, showing data very similar to that presented in this manuscirpt. However, the authors provide a comprehensive analysis on both yeast and fly CLUH. The strength of the manuscript is the combination of several elegant methods and genetically modified model systems in two species to elucidate the role of CLUH during the translation of specific mRNA. In my view through, the advancement of understanding the function of CLUH is limited.

      Although the authors work in yeast and DM, the results seem applicable to other species, including humans, and thus, the presented results will be of interest in a range of researchers working in the field of metabolic regulation and gene expression.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors study the Clustered mitochondrial proteins Clu of Drosophila melanogaster and Clu1 of Saccharomyces cerevisiae, two homologues of the mammalian protein CLUH. They show in compelling microscopy analysis that both proteins form granules. This was the case for flies fed on yeast paste after starvation and in yeast in post-diauxic phase, in respiratory media or during mitochondrial stress. They show that these granules are found in proximity to mitochondria and that they behave like liquid-liquid-phase separated condensates. They show by co-staining for P-bodies and stress granules that Clu1-granules are distinct from these RNA granules. Furthermore, they found that the formation required active translation. In the second part, they show that Clu1 interacts with ribosomal and mitochondrial proteins by BioID. The deletion of Clu1 leads to slightly impaired growth on media containing Ethanol as a carbon source. They find that nascent polypeptides of some mitochondrial precursor proteins are decreased in the deletion of Clu1 and conclude that Clu1 regulates translation of these proteins. Using RNA immunoprecipitation of Clu1-GFP in presence of cycloheximid, EDTA and puromycin. The mRNAs of nuclear-encoded mitochondrial proteins found to be interacting with Clu1 were purified in conditions when the ribosomes are intact and the RNAs showed no interaction when ribosomes were disassembled. They show in sucrose gradients that Clu1 co-migrates with polysomes independent of its distribution state or carbon source. However, when cells are grown in conditions of granule formation, then polysomes and Clu1 run less deeply into the gradient. Form these data, the authors conclude that Clu/Clu1 regulates the translation of nuclear-encoded mitochondrial proteins.

      Major comments:

      • The authors state that Clu1 is regulating translation during metabolic shifts. However, it is not clear what the real impact on mitochondrial function is. They show that there is a minor growth defect on ethanol media when CLU1 is deleted. However, if Clu1 is necessary mainly for adaptation, the phenotype will be strongest observed in conditions where cells switch carbon sources. Growth curves would be suitable in which the lag-phase of yeast cells precultured either in glucose or glycerol switched to media of different carbon sources (glucose to glycerol or glycerol to glucose) are measured. One would expect that the deletion mutant shows a longer lag-phase compared to the wild type when shifted from glucose to glycerol media. In line with this, how different is the mitochondrial proteome of the WT and the mutant? Do hits of the BioID, RIP and Punch-P experiments change at steady state or during metabolic shifts? Either proteomics of isolated mitochondria or western blots of whole cells or isolated mitochondria of WT and the deletion mutant grown in conditions of Clu1-granule formation or no granules for the hits could answer this question.
      • The authors analyze RNAs bound in polysomes to assess translation efficiency. Translation efficiency is usually calculated by the fraction of RNA bound by ribosomes to the total RNA amount of an RNA species. Thus, doing RT-qPCR from whole cells would be necessary to assess if the occupancy of ribosomes on the transcripts is due to changes in RNA abundance or other regulatory pathways and would help to further assess what causes the observed changes.

      Optional:

      • The authors show a co-localization of Clu/Clu1 with mitochondrial fission factors and conclude that the granules appear likely near fission sites. Indeed, CLUH has been implied in the past to play a role in mitochondrial fission (Yang, H., Sibilla, C., Liu, R. et al. Clueless/CLUH regulates mitochondrial fission by promoting recruitment of Drp1 to mitochondria. Nat Commun 13, 1582 (2022). https://doi.org/10.1038/s41467-022-29071-4). Thus, are fission sites required for Clu-granule localizations? What is the role of the mitochondrial network integrity for the granule distribution? Expressing Clu-GFP/Clu1-GFP in cells depleted for the fission factors would provide information on that.
      • The author assess convincingly that Clu1 interacts with ribosomes and runs with polysomal fractions. However, how it actually regulates translation is not clear. To answer this question, selective ribosomal profiling would be necessary. The authors have established conditions which would be suitable for the experiment. They could use crosslinking and sucrose cushions to IP ribosomes with Clu1-GFP bound to be used for ribosomal profiling. However, this experiment is quite time-intensive (3-4 months) and expensive, thus, an optional suggestion.

      Minor comments:

      Fig1: B, C, please add scale bars into the zoom ins.

      Fig 2 would profit from inlets of zoom ins to visualize the distribution better.

      Fig.3: Panel C does not really add much information. I would rather remove it or put it into supplements and therefore show a zoom of Panel E with a line plot showing the rings. It is not clear from the represented images where the rings are formed.

      Fig.3 panel F: Max projections are not appropriate to show colocalization as they can lead to false-positive overlaps. Just remove the max projections.

      References 21 and 22 are the same.

      Significance

      This manuscript shows in a convincing way that Clu and Clu1 form RNA granules and that Clu1 interacts with ribosomes. It is written in a clear way and the figures support the conclusions drawn in the text. The finding that Clu/Clu1 is important for metabolic adaptation has not been shown in fly or yeast to my knowledge. It is in line with findings for the mammalian homologue CLUH. Thus, the findings are supported by earlier work. This study is of value for a broader audience of the basic research field, especially of the mitochondrial and RNA granule field, as it supports the idea of post-transcriptional regulation of nuclear-encoded mitochondrial protein gene expression for dynamic adaptation of mitochondrial function. The conditions when Clu granules form is studied in detail, followed up by identification of target RNAs and interaction partners. Though the interaction of Clu1 with ribosomes is shown in a compelling way, a detailed mechanism of the function of Clu/Clu1 is missing and would require more experiments. Thus, even though a detailed mechanism is missing, the study does expand on our understanding of Clu/Clu1 in regulating mitochondrial biogenesis and is therefore of high interest of the mitochondrial field.

      Expertise: mitochondria, yeast, RNA granules, mitochondrial biogenesis, next-generation sequencing, fluorescence microscopy

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      1. The structures of the lamina propria of murine colon mucosa are nicely described. However, in the introduction of the manuscript the structures of fibroblasts, myofibroblasts and ECM are not described. The structures of the lamina propria of murine colon mucosa should be well described in the induction and discussed in the discussion.

      We will revise the Introduction to include a more detailed description of fibroblasts, myofibroblasts, and the ECM within the lamina propria of the murine colon mucosa. We will also expand the Discussion section to address these structures in the context of our findings.

      2. The UMAP plot suggests potential heterogeneity within Cluster 1, raising questions about whether the chosen clustering resolution (e.g., parameter settings in Seurat's "FindClusters") optimally captures subpopulations.

      We appreciate this insightful observation. We agree that the UMAP plot suggests potential heterogeneity within Cluster 1 and that the current clustering resolution may not fully capture underlying subpopulations. We could revisit the clustering parameters and explore reclustering at a lower resolution. However, we note that lowering the resolution often increases the total number of clusters, which may introduce noise and complicate biological interpretation. To more precisely dissect the heterogeneity within Cluster 1 while minimizing artificial subdivisions, we propose to perform subclustering specifically within Cluster 1.

      3. Some subpopulations express marker genes characteristic of pericytes and smooth muscle cells (e.g., Desmin). How did the authors ensure proper discrimination between fibroblasts and these other cell types?

      We thank the reviewer for this important comment. We acknowledge the challenge in distinguishing between fibroblasts, pericytes, and smooth muscle cells (SMCs) based solely on single-cell RNA sequencing data, particularly given the overlapping expression of markers such as Desmin.

      Pericytes vs. Myofibroblast/SM-Pericyte-Like Fibroblasts: Due to the highly similar transcriptional profiles of pericytes and pericyte-like fibroblasts, scRNA-seq alone does not allow for unambiguous discrimination between these populations. However, we were able to distinguish them based on morphology and spatial localization observed in high-resolution imaging. Notably, we identified a population of large (50–150 µm), elongated myofibroblast/SM-pericyte-like fibroblasts that, unlike typical pericytes, are not positioned directly on blood vessels but are distributed around the crypts. Some of these cells also appear to contact both blood vessels and the muscle layer, raising the possibility that they represent a specialized pericyte-like population. While their precise function remains uncertain, we agree that further characterization is warranted. To address this, we propose additional staining for canonical pericyte markers to help clarify their identity and spatial relationship to the vasculature.

      Smooth Muscle Cells vs. Myofibroblast/SM-Pericyte-Like Fibroblasts: We are confident that the analyzed fibroblast populations do not include smooth muscle cells. The mucosa was carefully dissected and separated from the underlying smooth muscle layer prior to RNA sequencing, which was performed exclusively on the mucosal compartment. Therefore, contamination by SMCs is unlikely.

      4. The manuscript also did not show the distribution and structures of ECM. It is better to show the relationships of fibroblasts and myofibroblasts with in the lamina propria of murine colon mucosa.

      In the supplementary material we show distribution of main ECM proteins such as Laminin, Collagen I, Collagen IV, and Fibronectin1.

      5. The integration with previously published datasets lacks clear connection to the authors' own findings. A more detailed comparison and discussion of how these integrated analyses relate to the newly generated data would improve the manuscript's coherence.

      We thank the reviewer for this helpful comment. Our RNA-seq dataset shows strong consistency with previously published datasets, supporting the robustness of our fibroblast isolation and transcriptional profiling strategy. We agree that a more explicit integration and comparison will improve the manuscript. We have now revised the Discussion to better highlight the spatial localization and organization of the different fibroblast populations identified in our study, with an emphasis on the duality of their functions. In particular, we discuss how our findings extend existing datasets by providing spatial context and functional insights that were not previously resolved. These comparisons underscore the novelty and value of our integrated approach.

      6. While the authors focus on colonic mucosa, the integrated public datasets include data from both colon and small intestine. Were these distinct tissue sources accounted for in the analysis? Clarification on this point is necessary to ensure the validity of comparisons.

      We thank the reviewer for raising this important point. Among the integrated datasets, only one—McCarthy et al. (GEO GSE130681)—originates from the small intestine; all others, including our own, were derived from the colon. Specifically, we used the following datasets:

      • GEO GSE113043 (Degirmenci et al., PMID: 29875413) – Colon (1 sample)
      • GEO GSE114374 (Kinchen et al., PMID: 30270042) – Colon (3 samples)
      • GEO GSE130681 (McCarthy et al., PMID: 32884148) – Small intestine (2 samples)
      • GEO GSE142431 (Roulis et al., PMID: 32322056) – Colon (5 samples) We selected these datasets based on their relevance to fibroblast biology, particularly those that specifically focused on mural fibroblasts. The inclusion of the McCarthy dataset was guided by its high-quality profiling of fibroblast populations and its utility in expanding our comparative framework.

      Importantly, review by McCarthy et al. (https://doi.org/10.1038/s41556-020-0567-z) reported minimal differences in fibroblast clustering between the small intestine and colon. Our integrated analysis supports this conclusion: fibroblasts from both regions consistently co-cluster, indicating a high degree of transcriptional similarity. This suggests that inclusion of the small intestine dataset did not bias or compromise the integrity of our colon-focused findings.

      Nevertheless, our primary emphasis remains on the colon, particularly due to the relative scarcity of studies addressing fibroblast localization and morphology in this tissue compared to the small intestine. Additionally, at the time of analysis, the datasets we used represented the most comprehensive publicly available single-cell profiles of intestinal mural fibroblasts.

      7. Many aspects of the described fibroblast subpopulations, including their single-cell expression profiles and physiological functions, appear to have been previously reported. The authors should more explicitly highlight the novel contributions of their work to advance our understanding of intestinal fibroblast biology.

      We thank the reviewer for this important observation. While it is true that aspects of fibroblast heterogeneity have been previously reported, our study provides several novel contributions that advance the current understanding of intestinal fibroblast biology. We will revise the manuscript to more explicitly highlight the following key findings:

      1. Functional distinction between ECM production and contractility: Our integrative analysis reveals a clearer separation between fibroblast subpopulations based on their functional specializations—specifically, ECM production versus contractile properties. This distinction has not been well delineated in prior studies and is particularly relevant in the context of inflammatory bowel disease, where fibrosis remains a major complication. Our findings may help identify specific fibroblast subtypes that contribute to pathological remodeling.
      2. Detailed characterization of fibroblast localization and morphology: We provide new spatial insights by demonstrating the lack of overlap between GFP⁺ and CD34⁺ basket cell populations in vivo. Additionally, we highlight the presence of large, elongated myofibroblasts and pericyte/smooth muscle-like fibroblasts that span from the vasculature to the underlying muscle layer—morphologies and arrangements that have not been thoroughly described before. These observations offer a more refined anatomical and functional framework for understanding fibroblast roles within the colonic mucosa. We will revise both the Results and Discussion sections to more explicitly emphasize these novel contributions.

      Reviewer 2:

      Major points:

      1. The order of the present manuscript should be reconstructed. The main message is in the discussion part. It is worth bringing it to the front.

      We appreciate this thoughtful suggestion. We agree that the main message of the manuscript is currently more prominent in the Discussion section, and bringing it forward would improve the overall clarity and impact of the work. We will restructure the manuscript accordingly to ensure that the key findings and their significance are introduced earlier and more clearly communicated throughout the text.

      2. Figure 1A, the authors employed the "vimentin+" filter to distinguish between fibroblasts and other cell types in the single-cell RNA sequencing (scRNA-seq) data. However, they did not provide a rationale for this choice in the manuscript. It would be worthwhile to consider the incorporation of an "Epcam-" or "E-cadherin-" filter as well, given the potential impact on the subsequent analysis's significance. Notably, the original UMAP plot generated before the application of the "vimentin+, Krt8-" filter, is absent from both the main figures and the supplementary data. The availability of this data is crucial for the identification of specific fibroblast populations among the sorted cells.

      The rationale for using the “vimentin⁺” filter is based on its long-standing use as a canonical marker for fibroblasts and mesenchymal cells in both developmental and adult tissues, including the intestinal lamina propria. Vimentin has consistently been used to distinguish fibroblasts from epithelial and immune cell populations in scRNA-seq studies.

      Regarding the exclusion of epithelial cells, we chose to apply a “Krt8⁻” filter instead of “Epcam⁻” or “E-cadherin⁻”, as Krt8 is a highly specific marker for colonocytes in the intestinal epithelium. We found this to be a reliable criterion for excluding epithelial cells in our dataset. We will revise the Methods section to clearly explain this rationale and selection.

      Additionally, we agree that the original UMAP plot—prior to the application of the “vimentin⁺, Krt8⁻” filter—would provide valuable context. We will include this plot in the supplementary figures to allow better visualization of the initial clustering and to support the identification of fibroblast populations among the sorted cells.

      3. Page4 line 12, the authors claim that they did not find specific markers for the cluster 1, despite the fact that cluster 1 is distinctly separated from clusters 0, 5, 4 and 3 in figure 1B. Furthermore, the cells in the cluster 1 do not cluster together based on the resolution applied in the present manuscript. The authors claim that cells in cluster 1 are in a transition state, and therefore, they did not include them in the functional analysis. However, later they claim that the cluster 1 are multipotent progenitors, which is not clear.

      We appreciate the reviewer’s careful reading and valuable critique. We acknowledge the confusion regarding the identity and interpretation of Cluster 1 and would like to clarify our reasoning and planned revisions.

      When identifying marker genes using Seurat’s FindMarkers() or FindAllMarkers() functions, the output highlights genes that are significantly enriched in a given cluster relative to others—but these genes are not necessarily uniquely or exclusively expressed in that cluster. This is the case with Cluster 1: although it is spatially distinct in the UMAP (Figure 1B), many of the top-ranked marker genes are also expressed in other clusters, albeit at lower levels. As a result, defining Cluster 1 based solely on unique gene expression signatures is challenging, and we initially interpreted this cluster as a “transitional population” due to its ambiguous marker profile.

      However, we acknowledge the apparent inconsistency in referring to Cluster 1 as both "in transition" and "multipotent progenitors." We will clarify our interpretation and terminology in the revised manuscript. Specifically, we will refer to Cluster 1 as a __ transitory population__, and provide a more nuanced discussion of its potential roles.

      As mentioned in our response to Reviewer 1 (Comment 2), we will also perform reclustering within Cluster 1 to better explore its internal heterogeneity. Additionally, we will now include Cluster 1 in the functional enrichment analysis to further assess its biological relevance and contribution to fibroblast diversity.

      4. Figure 1E and F, authors only use gene ontology to define the functions of different clusters of fibroblasts which constrain the present manuscript at the hypothesis stage. To substantiate the claims, it is imperative to conduct more precise experiments. At the very least, co-staining with cluster marker genes and candidate genes identified in GO analysis is necessary. In the event that antibodies are not available, RNA scope can serve as a viable alternative. Further functional experiments will be required to prove their unique function. For instance, the identification of specific cell surface markers to isolate different clusters of fibroblasts for coculture with intestinal organoids in vitro can be facilitated by scRNA-seq data.

      We appreciate the reviewer’s insightful suggestions regarding the functional validation of GO-based predictions.

      While we recognize that RNAscope is a valuable alternative when antibodies are unavailable, its use requires much thinner tissue sections than those employed in our current imaging approach. Our analysis is based on thicker sections, which preserve the 3D architecture and spatial relationships of fibroblasts within the colonic mucosa—an essential aspect of our study. Transitioning to thinner sections would compromise our ability to visualize these cells in their full anatomical context.

      To suppor the GO analysis with experimental validation, we will include __co-staining for cluster __marker genes along with representative candidate genes____ identified through GO analysis to better substantiate the predicted functions of different fibroblast clusters.

      We acknowledge the importance of functional studies such as co-culture assays with intestinal organoids, and indeed, several such experiments have been reported by other groups. Additionally, isolating specific fibroblast populations via FACS sorting for in vitro studies presents practical challenges, including low cell survival rates, which limit the feasibility of downstream functional assays. Thus, we believe that these types of experiments are beyond the scope of the current manuscript. We hope that our integrative approach and spatial validation will serve as a valuable foundation for future functional investigations into fibroblast biology.

      5. DAPI staining is absent in the majority of the images, which complicates the task of distinguishing cells from different clusters. Multiplex staining is necessary to show all specific markers: EGFP, SMA, CD34, Desmin, Pdgfra, Pil6, and Clu, regarding six clusters in one section or image.

      We appreciate the reviewer’s comment and the emphasis on the importance of cellular context in multiplex imaging.

      We acknowledge that DAPI staining is absent in some of the presented images, which may limit nuclear visualization and make it more challenging to distinguish cell boundaries. However, to achieve high-content multiplexing, we employed protocols allowing up to 5–6 fluorophores per section, as previously demonstrated by Chikina et al. (Cell, 2020). Due to spectral limitations and the risk of fluorophore overlap and signal bleed-through, we occasionally excluded DAPI to allocate the 405 nm channel for markers of greater relevance to our study. In these cases, Tomato or EGFP signals served as effective surrogates for cellular localization, as they label cell membranes, providing sufficient morphological context.

      Regarding multiplex staining for Pi16 and Clu, we tested several commercially available antibodies, but unfortunately, none yielded specific or reproducible signals in our hands. As a result, we were unable to include these markers reliably in our multiplex panels.

      6. Figure 4, the authors utilize supervised methods to execute trajectory analysis, defining cluster 1 as the initial point based on its hybrid expression state of genes. This assertion, however, lacks sufficient substantiation, as cluster 1 could also function as a transition point, not necessarily an initial point.The data presented in the current manuscript is inadequate to support the conclusion of multipotency in cluster 1.To substantiate these claims, the authors should employ additional evidence, such as SENIC analysis, to demonstrate the expression of specific transcription factors for each lineage along the trajectory. In order to substantiate the assertion that cluster 1 is a multipotent progenitor capable of differentiating into other specific populations, such as fibroblasts, further functional experiments are required. These experiments could include isolating the population in question and conducting a differentiation test in vitro or tracking the population's response to wound healing.The absence of immunofluorescence images or gene signatures for this cluster in the study is a cause of confusion for the reader.

      We thank the reviewer for this thoughtful and constructive comment. We agree that Cluster 1 could plausibly represent either an initial or transitional state. In trajectory analysis, the starting point must be defined, and we selected Cluster 1 due to its hybrid gene expression profile—exhibiting low-level expression of marker genes associated with multiple other clusters—suggesting a less differentiated or “primed” state. However, we fully acknowledge that this assignment does not preclude its interpretation as a transitional population, and we will revise the manuscript text to reflect both possibilities more clearly and cautiously.

      We appreciate the suggestion to perform __SENIC __analysis (https://www.nature.com/articles/nmeth.4463). This algorithm aims to identify gene regulatory networks and their associated transcription factors for each cell cluster. While interpreting such analysis can be quite challenging, it could provide interesting insights and thus we propose to apply it.

      Regarding functional validation, we agree that experiments such as isolation and in vitro differentiation assays, or in vivo lineage tracing during injury models, would offer more definitive insights. However, as we noted, the lack of specific surface markers currently makes it challenging to isolate Cluster 1 by FACS for such assays.

      We also acknowledge the reviewer’s concern about the __lack of immunofluorescence images or distinct gene signatures__for Cluster 1. We will revise the text to clearly communicate that this limitation.

      7. Figure 5B, the data set of Kinchen et al is from human samples. Is it relevant and significant to merge murine data and the human data together?

      We appreciate the reviewer’s attention to detail. To clarify, the dataset from Kinchen et al__.__ used in Figure 5B refers exclusively to their murine samples, not the human data. Only murine datasets were included in our analysis to ensure consistency and biological relevance. Therefore, merging the Kinchen murine data with other murine datasets in Figure 5B is both appropriate and justified.

      We will revise the figure legend and Methods section to clearly state that only mouse data were used throughout the analysis.

      Minor points:

      8. The chapter entitled "Subepithelial Fibroblast Do Not Proliferate" is not necessary to be an independent chapter. It can be considered a fusion chapter, as it is combined with Chapter 2. Further experiments such as Brdu or Edu are needed to strengthen the current hypothesis.

      We agree with the reviewer that this section does not require a standalone chapter and would be better integrated into Chapter 2. We will revise the manuscript accordingly.

      In addition, to further support our observations regarding fibroblast proliferation, we will perform a 2-hour EdU pulse-chase experiment and include the results in the revised manuscript. We believe this will strengthen our conclusions and provide more direct evidence regarding the proliferative status of subepithelial fibroblasts.

      1. DAPI staining is absent in majority of the images.

      indeed this is a limitation of the unmixing technique we use.

      10. Put the number of each cluster next to the arrow in all the IF images.

      We will do this.

      11. Immunofluorescent staining of cluster markers identified in the previous studies should be included in the present study such as: CD81, FoxL1, Myh11, Pdgfrb and Gli1.

      We will include those markers.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The summitted article entitle "Intestinal fibroblast heterogeneity: unifying RNA-seq studies and introducing consensus-driven nomenclature" by Glisovic et al., identify six distinct populations of fibroblast with unique molecular signatures, spatial localization and specific function in mouse colon using scRNA-seq. Moreover, with different bioinformatic methods, they show the potential differentiation trajectories of fibroblast in mouse colon mucosa. Finally, they propose a standardized nomenclature for colonic fibroblast by integrating the data of this manuscript and the four published scRNA-seq data of mouse and human intestinal colonic fibroblast. Several similar studies cited by the authors in the present manuscript have been done and the different populations of colonic fibroblasts have been well characterized in these previous studies. Here the authors utilized another mouse model, the "aSMAcreERT2" to target the murine colonic fibroblast population which is novel compared to previous published data. Although the authors have provided multiple bioinformatic analyses and immunofluorescent staining of certain markers to support their conclusions, many points are overclaimed or not clear based on the data of the present manuscript, especially for the differentiation trajectories and unique function of different clusters of subepithelial colonic fibroblast. Functional experiment data are absent from the present manuscript.

      Major comments

      • The order of the present manuscript should be reconstructed. The main message is in the discussion part. It is worth bringing it to the front.
      • Figure 1A, the authors employed the "vimentin+" filter to distinguish between fibroblasts and other cell types in the single-cell RNA sequencing (scRNA-seq) data. However, they did not provide a rationale for this choice in the manuscript. It would be worthwhile to consider the incorporation of an "Epcam-" or "E-cadherin-" filter as well, given the potential impact on the subsequent analysis's significance. Notably, the original UMAP plot generated before the application of the "vimentin+, Krt8-" filter, is absent from both the main figures and the supplementary data. The availability of this data is crucial for the identification of specific fibroblast populations among the sorted cells.
      • Page4 line 12, the authors claim that they did not find specific markers for the cluster 1, despite the fact that cluster 1 is distinctly separated from clusters 0, 5, 4 and 3 in figure 1B. Furthermore, the cells in the cluster 1 do not cluster together based on the resolution applied in the present manuscript. The authors claim that cells in cluster 1 are in a transition state, and therefore, they did not include them in the functional analysis. However, later they claim that the cluster 1 are multipotent progenitors, which is not clear.
      • Figure 1E and F, authors only use gene ontology to define the functions of different clusters of fibroblasts which constrain the present manuscript at the hypothesis stage. To substantiate the claims, it is imperative to conduct more precise experiments. At the very least, co-staining with cluster marker genes and candidate genes identified in GO analysis is necessary. In the event that antibodies are not available, RNA scope can serve as a viable alternative. Further functional experiments will be required to prove their unique function. For instance, the identification of specific cell surface markers to isolate different clusters of fibroblasts for coculture with intestinal organoids in vitro can be facilitated by scRNA-seq data.
      • DAPI staining is absent in the majority of the images, which complicates the task of distinguishing cells from different clusters. Multiplex staining is necessary to show all specific markers: EGFP, SMA, CD34, Desmin, Pdgfra, Pil6, and Clu, regarding six clusters in one section or image.
      • Figure 4, the authors utilize supervised methods to execute trajectory analysis, defining cluster 1 as the initial point based on its hybrid expression state of genes. This assertion, however, lacks sufficient substantiation, as cluster 1 could also function as a transition point, not necessarily an initial point. The data presented in the current manuscript is inadequate to support the conclusion of multipotency in cluster 1.To substantiate these claims, the authors should employ additional evidence, such as SENIC analysis, to demonstrate the expression of specific transcription factors for each lineage along the trajectory. In order to substantiate the assertion that cluster 1 is a multipotent progenitor capable of differentiating into other specific populations, such as fibroblasts, further functional experiments are required. These experiments could include isolating the population in question and conducting a differentiation test in vitro or tracking the population's response to wound healing. The absence of immunofluorescence images or gene signatures for this cluster in the study is a cause of confusion for the reader.
      • Figure 5B, the data set of Kinchen et al is from human samples. Is it relevant and significant to merge murine data and the human data together?

      Minor comments

      • The chapter entitled "Subepithelial Fibroblast Do Not Proliferate" is not necessary to be an independent chapter. It can be considered a fusion chapter, as it is combined with Chapter 2. Further experiments such as Brdu or Edu are needed to strengthen the current hypothesis.
      • DAPI staining is absent in majority of the images.
      • Put the number of each cluster next to the arrow in all the IF images.
      • Immunofluorescent staining of cluster markers identified in the previous studies should be included in the present study such as: CD81, FoxL1, Myh11, Pdgfrb and Gli1.

      Significance

      In this study, the researchers employed an alternative mouse model, the "aSMAcreERT2," to target the murine colonic fibroblast population. This approach represents a novel contribution to the field, offering a fresh perspective on previous findings. While the authors have presented several bioinformatic analyses and immunofluorescent staining of specific markers to support their conclusions, certain aspects of their argument require further elaboration or clarification, particularly regarding the differentiation trajectories and unique functions of the various clusters of subepithelial colonic fibroblasts. The present manuscript is constrained at the descriptive level due to an absence of functional experiment data.

      Strengths: The authors utilize "aSMAcreETR2" as a research model to target murine colonic fibroblasts, a novel approach that complements previously published data. By comparing and combining four published single-cell RNA sequencing (scRNA-seq) of colonic fibroblasts, they proposed a novel classification with five distinct subpopulations: telocytes, trophocytes/extracellular matrix (ECM) fibroblast, fibroblast, myofibroblast, and smooth muscle/pericyte-like fibroblast.This new classification, together with their unique molecular signature, can be useful for people in the colon and intestine research field. However, the manuscript is not without its limitations. First, the novel classification and unique molecular signature are not substantiated by functional experimentation, which is essential for validating the fibroblast subcluster's functionality. Additionally, the characterization of cluster 1 is lacking, particularly concerning its ability to differentiate into the five distinct subcultures, which is crucial for confirming its status as a multipotent progenitor. Despite the proposal of a novel classification and detailed molecular signature of the colonic fibroblasts, no isolation strategy is proposed in the present manuscript to allow further characterization. If the authors can address these points, the manuscript can make a significant contribution to the field. This study might interest people who perform basic research in the intestine and colon.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study utilizes scRNA-seq to delineate six fibroblast subpopulations in mouse colonic mucosa, revealing their molecular heterogeneity, functional specialization, and spatial distribution. The high-quality confocal microscopy images effectively illustrate the spatial distribution of cells within the colon mucosa. However, several concerns should be addressed:

      1. The structures of the lamina propria of murine colon mucosa are nicely described. However, in the introduction of the manuscript the structures of fibroblasts, myofibroblasts and ECM are not described. The structures of the lamina propria of murine colon mucosa should be well described in the induction and discussed in the discussion.
      2. The UMAP plot suggests potential heterogeneity within Cluster 1, raising questions about whether the chosen clustering resolution (e.g., parameter settings in Seurat's "FindClusters") optimally captures subpopulations.
      3. Some subpopulations express marker genes characteristic of pericytes and smooth muscle cells (e.g., Desmin). How did the authors ensure proper discrimination between fibroblasts and these other cell types?
      4. The manuscript also did not show the distribution and structures of ECM. It is better to show the relationships of fibroblasts and myofibroblasts with in the lamina propria of murine colon mucosa.
      5. The integration with previously published datasets lacks clear connection to the authors' own findings. A more detailed comparison and discussion of how these integrated analyses relate to the newly generated data would improve the manuscript's coherence.
      6. While the authors focus on colonic mucosa, the integrated public datasets include data from both colon and small intestine. Were these distinct tissue sources accounted for in the analysis? Clarification on this point is necessary to ensure the validity of comparisons.
      7. Many aspects of the described fibroblast subpopulations, including their single-cell expression profiles and physiological functions, appear to have been previously reported. The authors should more explicitly highlight the novel contributions of their work to advance our understanding of intestinal fibroblast biology

      Significance

      The proposed standardized nomenclature for intestinal fibroblasts represents a valuable contribution toward unifying classification in the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the editor for handling our manuscript and the reviewers for their constructive critiques. We are deeply convinced that the reviewers’ suggestions have substantially raised the quality and possible impact of our manuscript. We also like to thank the reviewers for their judgements that the subject of our manuscript is biologically and clinically significant and of high importance, and that our manuscript might help to increase focus and visibility for affected individuals.

      New text passages in the manuscript are colored in red. Below is a point-by-point response to the reviewers’ comments.

      2. Point-by-point description of the revisions

      Response to reviewer 1 comments

      Major comments

      Point 1-1

      The authors performed qRT-PCR validation for markers of differentiation and hypoxia, with a major absence of VEGF and HIF1a. The paper would be strengthened by mention of these factors, especially by qRT-PCR or Western blot.

      We thank the reviewer for the suggestion to include the bona fide hypoxia markers Vegfa and Hif1-alpha. We followed the suggestion and performed qRT-PCR on Vegfa transcripts at each tested condition (Figs. 1A,2A,3A,4A,5A,5D,5I,5N). As Hif1α is rather regulated on protein than on transcript level, we followed the advice to perform Western blots. We analyzed Hif1α protein levels on proliferating cells and quantified by normalization to actin (Figs. 1B,C and 5 B,C).

      Point 1-2

      Please provide justification of selection 0.5% as their hypoxic condition or perhaps repeat experiments in a less extreme environment to see if their conclusions still hold true.

      We admit that our approach to use 0.5% hypoxia was a drastic challenge for the cells. It should be noted, however, that physiologic oxygen levels during pregnancy at times drop to lower than 1% (Hansen et al, 2020; Ng et al, 2017). In the first place, we had used oxygen levels lower than this, because we had wanted to ensure that we can detect responses by bulk RNA-seq with a limited number of samples. As we had many conditions to compare, we did not want to use more than 3-4 samples per condition. The fact that the cells showed normal proliferation underscores the fact that 0.5% O2 per se was not so low that it would be overly stressful to the cells.

      Nevertheless, we are very grateful to the reviewer for the suggestion to include a milder hypoxic condition. We chose 2% O2, because this equals the physiological oxygen concentration shortly before the onset of cranial neural crest cell (CNCC) differentiation. We could recapitulate the phenomenon of impaired differentiation to chondrocytes, osteoblasts and smooth muscle cells at these mild hypoxic conditions, as shown by qRT-PCR and immunofluorescence of typical markers (Figs. 5D-R). Moreover, the differentiation-specific induction of the two central hypoxia-attenuated risk genes associated with orofacial clefts that we had identified by our bioinformatic analyses at 0.5% O2 (Boc and Cdo1), was still observable at 2% O2 (Figs. S6C,D). Interestingly, in some rare cases, the attenuation of induction was lost or not as drastic as in 0.5% O2.

      We are convinced that the experiments at 2% O2 strongly increased the relevance of our manuscript, because we thus detected that oxygen levels prevailing shortly before the onset of CNCC differentiation still can influence their differentiation. This leads to the conclusion that only slight decreases of intra-uterine oxygen levels indeed might interfere with correct differentiation of CNCC.

      Point 1-3

      Standard immunohistochemistry or histology of differentiated cells would strengthen the authors' claims of reduced differentiation under hypoxic conditions, e.g., Alcian blue, alk-phos or Alizarin red, and smooth muscle actin or other indicator.

      We are grateful to the reviewer for the suggestion to include stainings of cells, as these stainings visualized the drastic effects of hypoxia on the cells. We performed immunofluorescent stainings against at least one marker protein for each differentiation paradigm. At 0.5% O2, each protein signals were nearly completely absent and cell morphology was disrupted (Figs. 2E,F, 3E, 4E). At 2% O2, we detected some more protein deposition than at 0.5%. Importantly, cells had retained their normal shape at mild hypoxia (Figs. 5H,M,R, S5A).

      Point 1-4

      The authors identify a few genes that appear down-regulated in all three differentiation conditions. If it is within the scope of the study, it would strengthen the claim of these genes' function to show the effect of knock-down or knock-out for validation.

      We thank the reviewer for the suggestion of gene knock-down or knock-out in order to prove functional relevance of our findings. As this would have been too much effort and beyond the scope of our study, we rather followed the suggestion of reviewer 2 (cf. points 2-6, and 2-8) that headed to the same direction: we mined publicly available sequence data on orofacial development for gene expression or marks of active enhancers. We found robust expression of the two central hypoxia-attenuated OFC risk genes Boc and Cdo1 during human craniofacial development (Fig. 7A) and we identified enhancers that are active in embryonic craniofacial mouse tissue (Fig. 7B). Moreover, we detected expression of both genes during murine craniofacial development in undifferentiated mesenchymal cells, osteoblasts, chondrocytes and smooth muscle cells with the help of a single cell RNA-seq dataset (Figs. 7C-E, S6B).

      Thus, we found evidence for the in vivo relevance of Boc and Cdo1 and could rule out a possible important role of Actg2, the third gene we had identified. We therefore are grateful for the suggestion to circumvent gene knockouts by reviewer 2, as we think these data strongly emphasized the importance of our findings.

      Point 1-5

      Another major critique lies in the initial claim that proliferation of O9-1 cells is not significantly impacted by hypoxia. In figures 1E-H, photograms of the cells cultured 24 -72 hours and quantifications of live vs dead cells are shown as evidence for this argument. However, the increased density of cells in normoxic conditions may be a confounding variable in this assay. It would be interesting for the researchers to assess the percent of dead vs alive cells between normoxic and hypoxic conditions when the plates reach equivalent densities.

      We apologize for the use of image sections from photographs with different cell densities. Of course, as demonstrated by our quantification, cell densities between 0.5% and 21% O2 in total were equal (cf. Figs. 1D,E). We therefore replaced the formerly used sections with new image sections with equal cell numbers.

      We thank the reviewer for the suggestion to examine if cell numbers influence cell death rates. We followed this advice by several approaches: first, we seeded cells at different densities, incubated them for 72 h (the same time span where a minimal difference had been detected) and performed live/dead stainings (Fig. S1B). The seeding density did not affect percentages of dead cells and the values were in the same range as in our initial experiment (Fig. 1J). Moreover, we performed TUNEL stainings of apoptotic cells at different time points to have an additional readout of cell death (Figs. 1K,L). As expected, the percentages of TUNEL-positive cells were identical between hypoxic and normoxic cells at all analyzed time points.

      We therefore concluded that hypoxia does not influence the rate of cell death of proliferating CNCC and accordingly specified our wording in the results section.

      Point 1-6

      At end of Fig 1 section authors attempt to tie phenotypes observed in a cell line in vitro to the complex biological processes. They are not comparable and in vivo models would be better suited for these types of comparisons.

      We apologize for the overconfident wording in our manuscript. Of course, our in vitro experiments cannot fully simulate the complex developmental processes taking place in vivo. We therefore changed the text to a more careful formulation. Moreover, we kept the wording in the discussion section that we cannot exclude that in the in vivo situation proliferation of CNCC is also affected by low oxygen levels because nutrients might not be available in such excess as they are in cell culture.

      Point 1-7

      Fig 2: if qRT-PCR did not show statistically different results between experimental and control groups why move on to bulk RNA seq?

      We apologize that the sentence about statistical significance was misleading. What we wanted to express is that there was only a little difference (if any at all) between differentiated cells at 0.5% O2 and proliferating cells at 0.5% O2 or 21% O2. For the sake of clarity and readability, we deleted this misleading sentence.

      Point 1-8

      Fig 5: hypoxia this intense is going to affect broad range of biological processes and genes. Finding a few genes that are affected in extreme hypoxia that are also risk genes is highly unlikely. How can the authors be assured that these overlaps are actually significant and not just by chance?

      We thank the reviewer for the suggestion to test for statistical significance. We tested significance of the overlap of respective gene sets (nsOFC vs. hyp-a; OFC vs. hyp-a) by Fisher’s exact test. We included Venn diagrams depicting the overlap and present the exact p-values (Figs. S5C,D). In each case where overlap of genes occurred, p-values indicated significance.

      Point 1-9

      Would appreciate discussion on how examination of neural crest is relevant for OFC, as most animal models of OFC demonstrate the pathogenesis in embryonic epithelium or periderm, not in the neural crest. Defects in neural crest are associated with other congenital craniofacial anomalies such as craniosynostosis or complex (Tessier) clefts, not the typical orofacial cleft. Please revise rationale of study, interpretation of data and Discussion to specifically state how neural crest cells are involved in the pathogenesis of orofacial cleft.

      We apologize for not pointing out enough the role of epithelial cells in the emergence of orofacial clefts. We revised our introduction, results and discussion sections in this regard and emphasized the role of epithelial cells. Importantly, we addressed the possible influence of the results gained in CNCC on epithelial cells by analyzing scRNA-seq data with the algorithm CellChat, as suggested by reviewer 2 (cf. point 2-8). We detected several cell communication pathways from CNCC to epithelial cells which contain components that are misexpressed upon hypoxia in our dataset (Figs. 7F-I). Therefore, during hypoxia, these pathways might influence epithelial cells and therefore indirectly cause orofacial clefts. We outlined this possible interplay in the discussion and briefly mentioned it in the abstract.

      We have not discussed more strongly the role of CNCC in the emergence of OFC in the revised manuscript, because we did not want to put even more emphasis on this matter. Numerous studies have proven the contribution of cranial neural crest tissue to the emergence of orofacial clefts. This fact is also pointed out in several review articles about orofacial clefts. In most cases, this knowledge was achieved by mouse models, because tissue-specific conditional knockouts are feasible (in contrast to genetic studies on patients), usually via deletion with the Wnt1-Cre driver. Funato et al. give an excellent (but quite old) overview of mouse models in which the neural crest-specific knockout of a gene leads to emergence of OFC and lists 17 genes for which this is the case (Funato et al, 2015). Moreover, several recent studies also report on the emergence of orofacial clefts upon neural crest-specific deletion (Forman et al, 2024; Li et al, 2025). These include genes responsible for DNA methylation (Ulschmid et al, 2024), and a study on subunits of chromatin remodeling complexes that are necessary for correct transcription of their target genes, which was conducted by our group (Gehlen-Breitbach et al, 2023).

      Minor comments

      __Point 1-10 __

      The author should replace "Final proof" in the introduction with "further evidence supporting."

      We apologize for the incorrect wording. Of course, it is highly questionable if there is such a thing as final proof in life sciences. We re-phrased the text according to the reviewer’s suggestion.

      Point 1-11

      Authors are inconsistent when referring to Figures- sometimes they capitalize (i.e. 1J) and other times they leave lower case (i.e. 1i). Needs to be consistent throughout. Figures are not numbered.

      We apologize for the inconsistency. We corrected the references to figures. Moreover, we apologize for the missing figure numbers. We also corrected this and included figure numbers.

      Point 1-12

      In figures authors would sometimes list 21% O2 first then 0.5% O2 or vice versa. (i.e. Fig on page 21 panels I, J, K). Needs to be consistent.

      We again apologize for being inconsistent. We corrected the inconsistency in Fig. 1D. Now, 21% O2 is presented before/above 0.5% O2.

      Point 1-13

      Figures on pages 28, 29, 30 panel J and page 31 panel F: there is no legend on what the scale/measurement is for the difference in expression level other than it ranges from -1 to +3.

      We thank the reviewer for the hint. We are aware that from the heatmaps we used one cannot infer relative expression rates of different genes or similar. If we would have considered expression strength of single genes, many of the gene-specific differing expression rates under the different conditions would have been hard to detect, as presentation would have been dominated by the differences in expression rates between genes. We therefore plotted gene-wise scaled expression.

      We included an explanation of the procedure in the materials and methods section.

      Point 1-14

      Will the authors please comment on the one normoxic sample in Figure 1I that did not cluster with the others? Did this meet the standards to merit exclusion as an outlier?

      We regret that the default scale of our plot of the principal component analysis is a bit misleading. This is the case because x-axis accounts for 80.3% of variance and y-axis only accounts for 6.1%. Therefore, the sample that might seem as an outlier actually met our standards. Nevertheless, we decided to keep the default scaling as is, in order not to embellish the graph (Fig. 1M).

      Point 1-15

      The authors refer to DEG as deregulated genes; while not strictly incorrect, the more standard usage is "differentially expressed genes." Please address.

      We apologize for the incorrect explanation of the acronym. Of course, this was corrected in the revised manuscript.

      Significance

      This work on neural crest cells and hypoxia are biologically and clinically significant.

      We are deeply grateful to the reviewer for considering our manuscript significant for both biologists and clinicians. We are convinced that the additional data we gathered in the course of the revision has significantly increased the importance of our work. Therefore, we once again express our gratitude to the reviewer for the valuable suggestions.

      Response to reviewer 2 comments

      Major comments

      Point 2-1

      The conclusions drawn from the experimental data are carefully formulated for the most part. One of the main concerns is that the cells were subjected to extreme hypoxic conditions, while it may be more biologically relevant to include a condition representing more mild hypoxia (e.g. 10%).

      Please refer to the response to point 1-2.

      Point 2-2

      One of the opening claims regarding severe hypoxia only mildly affecting cell proliferation is not shown clearly, since no mitotic markers have been analyzed (i.e. KI67 or PCNA staining or a simple EdU incorporation assay). Thus, the claim that they assessed cell proliferation is not very convincing, even though cell death was analyzed.

      We appreciate the reviewer’s suggestion to include a more thorough analysis of proliferation rates. We followed the advice and performed immunofluorescent stainings against Ki67 (accounting for cells in proliferative state) and phospho-histone H3 (accounting for cells undergoing mitosis). We performed this assay at different time points of culture in order to address the question if cell density might influence proliferation rates (Figs. 1F-H). Neither for Ki67 nor for pHH3 a difference was detected between 21% and 0.5% O2.

      We are convinced that these analyses strengthened our initial findings and provide strong evidence that hypoxia does not influence proliferation rates of CNCC.

      Point 2-3

      Additionally, cellular morphology of the cells could be assessed (brightfield images), since previous studies observed that hypoxia can be an inducive factor in cranial neural crest and driving EMT (Scully et al. 2016; Barriga et al. 2013).

      We thank the reviewer’s hint and followed the advice. We analyzed cellular morphology by the parameters cell length, total number of pseudopodia, number of filopodia and number of lobopodia (Figs. S1C-F). As outlined in the results section, we did not detect a difference in these parameters between 21% and 0.5% O2.

      We included the second reference mentioned by the reviewer (Barriga et al, 2013) additionally to Scully et al. 2016 that had already been cited.

      Point 2-4

      Furthermore, in the RNA seq analysis of chondrogenic fate biased cells the authors draw a conclusion based on the proximity of the samples on the PCA plot, which is not very convincing. More careful analysis of the bulk RNA seq data sets they have generated for key marker genes will be more convincing (for example, a heatmap with selected genes would be a helpful representation).

      We apologize for the rash and inaccurate conclusion based on proximity on PCA plots. We are grateful to the reviewer for the suggestion to include heatmaps with selected marker genes. Following this advice, we generated heatmaps on our bulk RNA-seq data with the GO terms specific for each differentiation paradigm (Figs. S2F, S3F, S4F).

      We are convinced that these maps are perfect additions to the heatmaps of the 200 top differentially-expressed genes that already had been included in the manuscript (Figs. 2K, 3J, 4J) and helped to strengthen our findings. For chondrocytes and smooth muscle cells, the new, GO-specific heatmaps perfectly recapitulated the phenomenon of hypoxia-attenuated induction. Interestingly, for osteoblasts, about half of the induced genes were hypoxia-attenuated, while the other half was induced stronger than under normoxia. This pointed to gene-specific mechanisms of hypoxia-dependent attenuation of transcription. Moreover, it shed light on a hypoxia-evoked complete dysregulation of transcriptional induction in osteoblasts, as nearly none of the genes was induced similar to normoxia.

      __ __

      Point 2-5

      As mentioned above, a straight-forward and not time consuming experiment (given that it was assessed for a maximum of 72 hrs) would be to repeat the culture of NCCs and stain for mitotic markers, and quantify the number of positively stained cells over total cell numbers. Furthermore, it is not that demanding to add an experimental condition of less severe hypoxia in this assay.

      We thank the reviewer for the suggestion and followed the advice (cf. point 2-2). The conducted experiments straightened our results, because the initially detected slight tendency to lower cell numbers at 0.5% O2 could thus be falsified: We did not detect any difference for Ki67 and pHH3 between 0.5% and 21% O2 at any analyzed time point (Figs. 1F-H). Moreover, percentages of dead or apoptotic cells at 0.5% O2 did not vary from 21% (Figs. 1I-L, S1B). As we could not detect any difference in proliferation between 21% and 0.5% O2, we skipped the analysis of proliferating cells at 2% O2.

      Point 2-6

      Without underestimating how time consuming this would be, a major lack of experimental validation of the key genes they identify as important across all conditions may be the limitation of the study (this would be the difference between correlation and a probable underlying mechanism). This can be circumvented by more extensive reference to in situ data sets from mouse or existing data sets of single cell and spatial transcriptomics. A suggested targeted knock-down (for example with siRNA, shRNA or CRISPR) to validate a few of the key genes revealed as important could take a few months, with an estimated cost up to 5,000 euros per targeted gene and replicate.

      We thank the reviewer for the notion that targeted knockdowns are beyond the scope of our manuscript. We are deeply grateful for the reviewer’s constructive criticism and for the suggestion to analyze publicly available data sets in order to gather data depicting in vivo relevance of our identified central hypoxia-attenuated OFC risk genes Boc, Cdo1 and Actg2 (cf. point 1-4). We detected robust expression of Boc and Cdo1 during human craniofacial development (Fig. 7A) and we identified enhancers that are active in embryonic craniofacial mouse tissue (Fig. 7B). Moreover, we detected expression of both genes during murine craniofacial development in undifferentiated mesenchymal cells, osteoblasts, chondrocytes and smooth muscle cells by reanalysis of a scRNA-seq dataset (Figs. 7C-E, S6B). This data comprised scRNA-seq of mouse embryonic maxillary prominence at stages E11.5 and E14.5 (Sun et al, 2023).

      Thus, we found evidence for the in vivo relevance of Boc and Cdo1 and could rule out a possible important role of Actg2, the third gene we had identified. We therefore are deeply grateful for the suggestion, as we think these data strongly emphasize the importance of our findings.

      Point 2-7

      On methods, replicates and statistics: The experimental methods and approach are described efficiently and seem reproducible. All biological and technical replicates are of a minimum of N=3 from independent experiments and statistical tests have been run in all cases.

      We thank the reviewer for the appreciation of our methodology, descriptions and statistical analyses.

      Minor points

      Point 2-8

      One of the key implications of NCCs in palate formation is interaction with orofacial epithelial cells, which the authors also mention. It may be interesting to check if any signaling pathways involved in this crosstalk are affected under hypoxic conditions in their existing data sets of bulk RNA SEQ. This can be done by using available algorithms such as CellChat (Jin et al. 2021; Jin, Plikus, and Nie 2023), which has been reported to work also in bulk RNA seq data analysis (according to GitHub). The authors could mine the literature for existing RNA sequencing data that include osteoblasts, chondrocytes and epithelial cells (Ozekin, O'Rourke, and Bates 2023; Piña et al. 2023).

      We are very grateful to the reviewer for this suggestion. Moreover, we like to thank the reviewer for mentioning exemplary references. We followed the advice by the methodology lined out in results and materials and methods sections: we applied the CellChat algorithm on a scRNA-seq dataset (Pina et al, 2023; Sun et al., 2023) to identify pathways containing components that are hypoxia-attenuated (and associated with a risk for OFC) in our bulk RNA-seq dataset (Figs. 7F-I). We did not use the datasets the reviewer had suggested, because the data were not available for us or the file format was not well-suited for the analysis with CellChat. Importantly, the dataset from Sun et al. has the following advantages over the suggested references: the complete maxillary prominence was used (instead of palatal shelves only), and different time points were included. Thus, we were able to follow the expression of genes of interest at different developmental stages before the onset of differentiation and after (Figs. 7C-E and S6B). By our approach, we identified several OFC-related pathways that contain hypoxia-attenuated components such as BMP and FGF signaling and deposition of collagen and fibronectin (Figs. 7F-I). Importantly, the named pathways (and others) send outgoing communication patterns to epithelial cells. Therefore, hypoxia-attenuated gene induction in CNCC could influence epithelial cells via these pathways.

      We believe that the use of the CellChat algorithm has brought a deeper understanding of how hypoxia can have indirect consequences on the important topic of epithelial cells and thus could also evoke OFC. We therefore once again like to express our gratitude to the reviewer.

      Point 2-9

      Additionally, another process that may be affected is EMT (epithelial-to-mesenchymal-transition) and is possible to assess by re-analysis of bulk RNA-seq data while focusing on key genes implicated in this process (i.e. E-cadherin, vimentin, EpCAM, Snail, Twist, PRRX1).

      We thank the reviewer for the advice. We followed the advice and analyzed cellular morphology by the parameters cell length, total number of pseudopodia, number of filopodia and number of lobopodia (Figs. S1C-F) (cf. point 2-3). As we did not detect any differences between 21% and 0.5% O2, and because the cells we used for our analyses represent mesenchymal cells, i.e. cells that had already undergone EMT, we did not re-analyze our dataset with the focus on EMT.

      Point 2-10

      Lastly, when the authors report on the significantly up- or down-regulated genes, it may be interesting to categorize them by ligands, receptors, intracellular molecules and transcription factors (and use separate plots to visualize them). While a big focus of the manuscript are down-regulated genes, less emphasis was given in upregulated genes (other than the response to hypoxia gene module).

      We thank the reviewer for the advice. Following this advice, we categorized genes according to Panther protein classes "intercellular signal molecule" (PC00207), "transmembrane signal receptor" (PC00197) and "gene-specific transcriptional regulator" (PC00264) and depicted the results with violin plots (Fig. S5B). We could not analyze intracellular molecules, because this protein class does not exist in the Panther database. We had not focused on the genes with stronger induction in hypoxic condition, because the number of genes was low in each differentiation paradigm (7 in chondrocytes, less than 30 in osteoblasts, none in smooth muscle cells) and the transcriptional changes were mostly not as drastic as for the attenuated genes. In order to achieve a broader overview of deregulated processes, we now included GO term analyses of genes downregulated during the differentiation regimes both at 21% and 0.5% O2 (Figs. S2D,E, S3D,E, S4D,E).

      Point 2-11

      The authors are referencing extensively and accurately existing studies in the field and the manuscript is exceptionally well-written, with only a few points of limited clarity or increased complexity. Such an example is when the authors refer to OFC risk genes, because it is not clearly stated how the referenced studies reached their conclusions (for example, are they mouse studies, do they involve mutants, are any of these studies based on GWAS on human cohorts). This matter would significantly improve the flow of the text and highlight the importance of the study and their findings.

      We would like to thank the reviewer very much for the appreciation of our scientific writing. We apologize for not explaining exactly how our OFC risk gene lists had been curated. We included this information for both non-syndromic and other OFC risk genes at the respective sites in the results section. Moreover, we included the Human Phenotype Ontology terms that had been used in the search in the materials and methods section.

      We thank the reviewer for this suggestion, as we agree that this information significantly highlights the importance of our findings.

      Point 2-12

      The figures could be redesigned to be more intuitive to interpret. For example, using violin plots and heatmaps, as discussed, and including references or re-analysis/re-use of existing spatial transcriptomics and in situs for marker genes.

      In all cases where there is a comparison of gene expression levels, violin plots would be a better representation of up- and down-regulated genes (i.e. selected genes from Fig1K, comparison of gene expression between normoxic and hypoxic NCCs, Fig 2G when analyzing chondrogenesis and the respective analysis for osteoblasts and smooth muscle cells, as well as when comparing the three fate-biasing conditions to identify common genes that are misregulated).

      We thank the reviewer for the advice and for the appreciation of the usage of heatmaps (Figs. 2K, 3J, 4J, 6F). Unfortunately, as the number of biological replicates is only three to four, the visualization of gene expression data from our bulk RNA-seq data with violin plots was not intuitive. We therefore retained the heatmaps rather than choosing bar graphs, because they are much clearer when presenting expression data of several to many genes. We included violin plots whenever possible due to high numbers of data points (Figs. S1C, S1D, S1E, S1F, S5B). Moreover, we added additional heatmaps to depict transcriptional changes of genes associated with GO terms with the various differentiation regimes (Figs. S2F, S3F, S4F). Unfortunately, we did not detect the three central hypoxia-attenuated genes in spatial transcriptomics data on craniofacial development. But we used scRNA-seq data of different stages of orofacial mouse tissue where we could identify expression of Boc and Cdo1 (cf. points 1-4 and 2-6). These data helped, together with other in vivo data to gain evidence for the in vivo function of Boc and Cdo1 during CNCC differentiation and helped to dismiss Actg2 as another central player.

      Significance

      Several pieces of evidence have pointed to hypoxia as an environmental factor contributing to congenital orofacial clefts, ranging from studies in mouse to observations in human. The authors are doing an excellent job in putting this information together and the question they are trying to answer is of high importance, given the prevalence of such congenital syndromes.

      We are deeply grateful to the reviewer for the appreciation of our work and for classifying our research topic as highly important.

      In terms of the methods and model employed, there are some limitations, related to the choice of a mouse cell line over one from human, the severe hypoxia induced (over a more mild), and the conditions of directed differentiation not allowing for simultaneous examination of more complex lineage transitions. The methods as a whole are not that up-to-date, given the single cell and multiplexed transcriptomic advances the last couple of decades, advanced bioinformatics that could be used in combination with in vitro lineage tracing methods.

      We thank the reviewer for the honest evaluation of our methods, especially for the constructive suggestions that were given to address our hypotheses with more up-to-date methods and at milder hypoxic conditions. As outlined above, we followed the advice and re-analyzed existing scRNA-seq datasets (cf. points 2-6 and 2-8) and checked our central hypotheses at milder hypoxic conditions (cf. response to point 1-3).

      We are deeply convinced that both significantly increased the biological relevance of our results, because we thus (1) gathered evidence for the in vivo function of Boc and Cdo1 and (2) were able to show that the phenomenon of hypoxia-attenuated gene induction still holds true at biologically relevant hypoxic conditions.

      The audience this work will reach are neural crest experts, developmental biologists, and potentially clinical doctors. The general public outreach of such a paper is also diverse, as more focus and visibility is required for the individuals affected by those syndromes and their families.

      We thank the reviewer for the judgement that our manuscript will not only reach neural crest experts, but also developmental biologists in general and potentially also clinicians. We are very much pleased that the reviewer shares our opinion that affected individuals should be more in the focus of public attention. We like to express our gratitude for the judgement that our manuscript might help to increase focus and visibility for them.

      References

      Barriga EH, Maxwell PH, Reyes AE, Mayor R (2013) The hypoxia factor Hif-1α controls neural crest chemotaxis and epithelial to mesenchymal transition. The Journal of cell biology 201: 759-776, 10.1083/jcb.201212100.

      Forman TE, Sajek MP, Larson ED, Mukherjee N, Fantauzzo KA (2024) PDGFRα signaling regulates Srsf3 transcript binding to affect PI3K signaling and endosomal trafficking. Elife 13, 10.7554/eLife.98531.

      Funato N, Nakamura M, Yanagisawa H (2015) Molecular basis of cleft palates in mice. World journal of biological chemistry 6: 121-138, 10.4331/wjbc.v6.i3.121.

      Gehlen-Breitbach S, Schmid T, Fröb F, Rodrian G, Weider M, Wegner M, Gölz L (2023) The Tip60/Ep400 chromatin remodeling complex impacts basic cellular functions in cranial neural crest-derived tissue during early orofacial development. International Journal of Oral Science 15: 16, 10.1038/s41368-023-00222-7.

      Hansen JM, Jones DP, Harris C (2020) The Redox Theory of Development. Antioxid Redox Signal 32: 715-740, 10.1089/ars.2019.7976.

      Li D, Tian Y, Vona B, Yu X, Lin J, Ma L, Lou S, Li X, Zhu G, Wang Y et al (2025) A TAF11 variant contributes to non-syndromic cleft lip only through modulating neural crest cell migration. Hum Mol Genet 34: 392-401, 10.1093/hmg/ddae188.

      Ng KYB, Mingels R, Morgan H, Macklon N, Cheong Y (2017) In vivo oxygen, temperature and pH dynamics in the female reproductive tract and their importance in human conception: a systematic review. Human Reproduction Update 24: 15-34, 10.1093/humupd/dmx028.

      Pina JO, Raju R, Roth DM, Winchester EW, Chattaraj P, Kidwai F, Faucz FR, Iben J, Mitra A, Campbell K et al (2023) Multimodal spatiotemporal transcriptomic resolution of embryonic palate osteogenesis. Nature communications 14: 5687, 10.1038/s41467-023-41349-9.

      Sun J, Lin Y, Ha N, Zhang J, Wang W, Wang X, Bian Q (2023) Single-cell RNA-Seq reveals transcriptional regulatory networks directing the development of mouse maxillary prominence. J Genet Genomics 50: 676-687, 10.1016/j.jgg.2023.02.008.

      Ulschmid CM, Sun MR, Jabbarpour CR, Steward AC, Rivera-González KS, Cao J, Martin AA, Barnes M, Wicklund L, Madrid A et al (2024) Disruption of DNA methylation-mediated cranial neural crest proliferation and differentiation causes orofacial clefts in mice. Proc Natl Acad Sci U S A 121: e2317668121, 10.1073/pnas.2317668121.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Schmidt and colleagues are addressing the effects of severe hypoxia on proliferation and differentiation potential of (mouse) cranial neural crest, using a neural crest cell line subjected to hypoxic conditions, assessed by transcriptomics analysis (quantitative reverse transcription PCR, bulk RNA sequencing and bioinformatics). They are reporting a mild effect of cell proliferation and an extensive inhibition of differentiation towards osteoblasts, chondrocytes and smooth muscle cells. They reveal affected biological processes shared between the three fate biasing conditions related to cytoskeleton organization and amino acid metabolism. Lastly, among affected genes upon hypoxic conditions in vitro, they authors identified risk genes linked to non-syndromic (non-genetic) orofacial clefts exclusively downregulated in osteoblasts and smooth muscle cells, namely Fgfr2, Gstt1 and Tbxa2. Similarly, hypoxia-driven downregulation of genes implicated in syndromic orofacial clefts was observed in all three chondrocyte, osteoblast and smooth muscle differentiation scenarios. Lastly, STRING analysis of downregulated genes cross-validated their findings related to affected differentiation.

      Major comments:

      The conclusions drawn from the experimental data are carefully formulated for the most part. One of the main concerns is that the cells were subjected to extreme hypoxic conditions, while it may be more biologically relevant to include a condition representing more mild hypoxia (e.g. 10%). One of the opening claims regarding severe hypoxia only mildly affecting cell proliferation is not shown clearly, since no mitotic markers have been analyzed (i.e. KI67 or PCNA staining or a simple EdU incorporation assay). Thus, the claim that they assessed cell proliferation is not very convincing, even though cell death was analyzed. Additionally, cellular morphology of the cells could be assessed (brightfield images), since previous studies observed that hypoxia can be an inducive factor in cranial neural crest and driving EMT (Scully et al. 2016; Barriga et al. 2013).

      Furthermore, in the RNA seq analysis of chondrogenic fate biased cells the authors draw a conclusion based on the proximity of the samples on the PCA plot, which is not very convincing. More careful analysis of the bulk RNA seq data sets they have generated for key marker genes will be more convincing (for example, a heatmap with selected genes would be a helpful representation). As mentioned above, a straight-forward and not time consuming experiment (given that it was assessed for a maximum of 72 hrs) would be to repeat the culture of NCCs and stain for mitotic markers, and quantify the number of positively stained cells over total cell numbers. Furthermore, it is not that demanding to add an experimental condition of less severe hypoxia in this assay. Without underestimating how time consuming this would be, a major lack of experimental validation of the key genes they identify as important across all conditions may be the limitation of the study (this would be the difference between correlation and a probable underlying mechanism). This can be circumvented by more extensive reference to in situ data sets from mouse or existing data sets of single cell and spatial transcriptomicsA suggested targeted knock-down (for example with siRNA, shRNA or CRISPR) to validate a few of the key genes revealed as important could take a few months, with an estimated cost up to 5,000 euros per targeted gene and replicate. On methods, replicates and statistics: The experimental methods and approach are described efficiently and seem reproducible.All biological and technical replicates are of a minimum of N=3 from independent experiments and statistical tests have been run in all cases.

      Minor comments:

      One of the key implications of NCCs in palate formation is interaction with orofacial epithelial cells, which the authors also mention. It may be interesting to check if any signaling pathways involved in this crosstalk are affected under hypoxic conditions in their existing data sets of bulk RNA SEQ. This can be done by using available algorithms such as CellChat (Jin et al. 2021; Jin, Plikus, and Nie 2023), which has been reported to work also in bulk RNA seq data analysis (according to GitHub). The authors could mine the literature for existing RNA sequencing data that include osteoblasts, chondrocytes and epithelial cells (Ozekin, O'Rourke, and Bates 2023; Piña et al. 2023).

      Additionally, another process that may be affected is EMT (epithelial-to-mesenchymal-transition) and is possible to assess by re-analysis of bulk RNA-seq data while focusing on key genes implicated in this process (i.e. E-cadherin, vimentin, EpCAM, Snail, Twist, PRRX1). Lastly, when the authors report on the significantly up- or down-regulated genes, it may be interesting to categorize them by ligands, receptors, intracellular molecules and transcription factors (and use separate plots to visualize them). While a big focus of the manuscript are down-regulated genes, less emphasis was given in upregulated genes (other than the response to hypoxia gene module).

      The authors are referencing extensively and accurately existing studies in the field and the manuscript is exceptionally well-written, with only a few points of limited clarity or increased complexity. Such an example is when the authors refer to OFC risk genes, because it is not clearly stated how the referenced studies reached their conclusions (for example, are they mouse studies, do they involve mutants, are any of these studies based on GWAS on human cohorts). This matter would significantly improve the flow of the text and highlight the importance of the study and their findings. The figures could be redesigned to be more intuitive to interpret. For example, using violin plots and heatmaps, as discussed, and including references or re-analysis/re-use of existing spatial transcriptomics and in situs for marker genes.

      In all cases where there is a comparison of gene expression levels, violin plots would be a better representation of up- and down-regulated genes (i.e. selected genes from Fig1K, comparison of gene expression between normoxic and hypoxic NCCs, Fig 2G when analyzing chondrogenesis and the respective analysis for osteoblasts and smooth muscle cells, as well as when comparing the three fate-biasing conditions to identify common genes that are misregulated).

      References:

      Barriga, Elias H., Patrick H. Maxwell, Ariel E. Reyes, and Roberto Mayor. 2013. "The Hypoxia Factor Hif-1α Controls Neural Crest Chemotaxis and Epithelial to Mesenchymal Transition." The Journal of Cell Biology 201 (5): 759-76. https://doi.org/10.1083/jcb.201212100.

      Jin, Suoqin, Christian F. Guerrero-Juarez, Lihua Zhang, Ivan Chang, Raul Ramos, Chen-Hsiang Kuan, Peggy Myung, Maksim V. Plikus, and Qing Nie. 2021. "Inference and Analysis of Cell-Cell Communication Using CellChat." Nature Communications 12 (1). https://doi.org/10.1038/s41467-021-21246-9.

      Jin, Suoqin, Maksim V. Plikus, and Qing Nie. 2023. "CellChat for Systematic Analysis of Cell-Cell Communication from Single-Cell and Spatially Resolved Transcriptomics." bioRxiv. https://doi.org/10.1101/2023.11.05.565674.

      Ozekin, Yunus H., Rebecca O'Rourke, and Emily Anne Bates. 2023. "Single Cell Sequencing of the Mouse Anterior Palate Reveals Mesenchymal Heterogeneity." Developmental Dynamics : An Official Publication of the American Association of Anatomists 252 (6): 713-27. https://doi.org/10.1002/dvdy.573.

      Piña, Jeremie Oliver, Resmi Raju, Daniela M. Roth, Emma Wentworth Winchester, Parna Chattaraj, Fahad Kidwai, Fabio R. Faucz, et al. 2023. "Multimodal Spatiotemporal Transcriptomic Resolution of Embryonic Palate Osteogenesis." Nature Communications 14 (September):5687. https://doi.org/10.1038/s41467-023-41349-9.

      Scully, Deirdre, Eleanor Keane, Emily Batt, Priyadarssini Karunakaran, Debra F. Higgins, and Nobue Itasaki. 2016. "Hypoxia Promotes Production of Neural Crest Cells in the Embryonic Head." Development 143 (10): 1742-52. https://doi.org/10.1242/dev.131912.

      Significance

      Several pieces of evidence have pointed to hypoxia as an environmental factor contributing to congenital orofacial clefts, ranging from studies in mouse to observations in human. The authors are doing an excellent job in putting this information together and the question they are trying to answer is of high importance, given the prevalence of such congenital syndromes. In terms of the methods and model employed, there are some limitations, related to the choice of a mouse cell line over one from human, the severe hypoxia induced (over a more mild), and the conditions of directed differentiation not allowing for simultaneous examination of more complex lineage transitions. The methods as a whole are not that up-to-date, given the single cell and multiplexed transcriptomic advances the last couple of decades, advanced bioinformatics that could be used in combination with in vitro lineage tracing methods.

      The audience this work will reach are neural crest experts, developmental biologists, and potentially clinical doctors. The general public outreach of such a paper is also diverse, as more focus and visibility is required for the individuals affected by those syndromes and their families.

      Reviewer's expertise: mouse neural crest lineage and multipotency, lineage tracing, single cell transcriptomics, NGS, immunofluorescence, molecular methods (RNA, DNA based). Limited expertise with in vitro studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Title: Hypoxia impedes differentiation of cranial neural crest cells into derivatives relevant for craniofacial development

      Synopsis: Cleft lip w/ or w/o cleft palate is the second-most common birth defect worldwide. Defects are often traceable to cranial neural crest cells through genetics or environmental factors. Schmid and coauthors focus on the environmental factor of hypoxia and investigate the effects of hypoxic conditions on the ability of CNCCs to differentiate and migrate. They performed RNA-seq analysis with qRT-PCR validation for specific markers and show that hypoxia appears to repress differentiation without markedly affecting proliferation. Hypoxic conditions did not demonstrated significant perturbations in cell proliferation; however, chondrocyte, osteoblast, and smooth muscle differentiation was significantly reduced for cell lines cultured under hypoxia. Bulk RNA-seq and PCA revealed dysregulation of genes implicated in cytoskeletal integrity (such as actin γ-2), neural crest cell migration (hedgehog co-receptor brother of CDO) and amino acid metabolism (cysteine dioxygenase), which Schmid and colleagues termed OFC risk genes.

      Major comments

      • The authors performed qRT-PCR validation for markers of differentiation and hypoxia, with a major absence of VEGF and HIF1a. The paper would be strengthened by mention of these factors, especially by qRT-PCR or Western blot.
      • Please provide justification of selection 0.5% as their hypoxic condition or perhaps repeat experiments in a less extreme environment to see if their conclusions still hold true.
      • standard immunohistochemistry or histology of differentiated cells would strengthen the authors' claims of reduced differentiation under hypoxic conditions, e.g., Alcian blue, alk-phos or Alizarin red, and smooth muscle actin or other indicator.
      • The authors identify a few genes that appear down-regulated in all three differentiation conditions. If it is within the scope of the study, it would strengthen the claim of these genes' function to show the effect of knock-down or knock-out for validation.
      • Another major critique lies in the initial claim that proliferation of O9-1 cells is not significantly impacted by hypoxia. In figures 1E-H, photograms of the cells cultured 24 -72 hours and quantifications of live vs dead cells are shown as evidence for this argument. However, the increased density of cells in normoxic conditions may be a confounding variable in this assay. It would be interesting for the researchers to assess the percent of dead vs alive cells between normoxic and hypoxic conditions when the plates reach equivalent densities.
      • At end of Fig 1 section authors attempt to tie phenotypes observed in a cell line in vitro to the complex biological processes. They are not comparable and in vivo models would be better suited for these types of comparisons.
      • Fig 2: if qRT-PCR did not show statistically different results between experimental and control groups why move on to bulk RNA seq?
      • Fig 5: hypoxia this intense is going to affect broad range of biological processes and genes. Finding a few genes that are affected in extreme hypoxia that are also risk genes is highly unlikely. How can the authors be assured that these overlaps are actually significant and not just by chance?
      • Would appreciate discussion on how examination of neural crest is relevant for OFC, as most animal models of OFC demonstrate the pathogenesis in embryonic epithelium or periderm, not in the neural crest. Defects in neural crest are associated with other congenital craniofacial anomalies such as craniosynostosis or complex (Tessier) clefts, not the typical orofacial cleft. Please revise rationale of study, interpretation of data and Discussion to specifically state how neural crest cells are involved in the pathogenesis of orofacial cleft.

      Minor comments

      • The author should replace "Final proof" in the introduction with "further evidence supporting."
      • Authors are inconsistent when referring to Figures- sometimes they capitalize (i.e. 1J) and other times they leave lower case (i.e. 1i). Needs to be consistent throughout. Figures are not numbered
      • In figures authors would sometimes list 21% O2 first then 0.5% O2 or vice versa. (i.e. Fig on page 21 panels I, J, K). Needs to be consistent.
      • Figures on pages 28, 29, 30 panel J and page 31 panel F: there is no legend on what the scale/measurement is for the difference in expression level other than it ranges from -1 to +3.
      • Will the authors please comment on the one normoxic sample in Figure 1I that did not cluster with the others? Did this meet the standards to merit exclusion as an outlier?
      • The authors refer to DEG as deregulated genes; while not strictly incorrect, the more standard usage is "differentially expressed genes." Please address.

      Significance

      This work on neural crest cells and hypoxia are biologically and clinically significant.

    4. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the editor for handling our manuscript and the reviewers for their constructive critiques. We are deeply convinced that the reviewers’ suggestions have substantially raised the quality and possible impact of our manuscript. We also like to thank the reviewers for their judgements that the subject of our manuscript is biologically and clinically significant and of high importance, and that our manuscript might help to increase focus and visibility for affected individuals.

      New text passages in the manuscript are colored in red. Below is a point-by-point response to the reviewers’ comments.

      2. Point-by-point description of the revisions

      Response to reviewer 1 comments

      Major comments


      Point 1-1

      The authors performed qRT-PCR validation for markers of differentiation and hypoxia, with a major absence of VEGF and HIF1a. The paper would be strengthened by mention of these factors, especially by qRT-PCR or Western blot.

      We thank the reviewer for the suggestion to include the bona fide hypoxia markers Vegfa and Hif1-alpha. We followed the suggestion and performed qRT-PCR on Vegfa transcripts at each tested condition (Figs. 1A,2A,3A,4A,5A,5D,5I,5N). As Hif1α is rather regulated on protein than on transcript level, we followed the advice to perform Western blots. We analyzed Hif1α protein levels on proliferating cells and quantified by normalization to actin (Figs. 1B,C and 5 B,C).

      Point 1-2

      Please provide justification of selection 0.5% as their hypoxic condition or perhaps repeat experiments in a less extreme environment to see if their conclusions still hold true.

      We admit that our approach to use 0.5% hypoxia was a drastic challenge for the cells. It should be noted, however, that physiologic oxygen levels during pregnancy at times drop to lower than 1% (Hansen et al, 2020; Ng et al, 2017). In the first place, we had used oxygen levels lower than this, because we had wanted to ensure that we can detect responses by bulk RNA-seq with a limited number of samples. As we had many conditions to compare, we did not want to use more than 3-4 samples per condition. The fact that the cells showed normal proliferation underscores the fact that 0.5% O2 per se was not so low that it would be overly stressful to the cells.

      Nevertheless, we are very grateful to the reviewer for the suggestion to include a milder hypoxic condition. We chose 2% O2, because this equals the physiological oxygen concentration shortly before the onset of cranial neural crest cell (CNCC) differentiation. We could recapitulate the phenomenon of impaired differentiation to chondrocytes, osteoblasts and smooth muscle cells at these mild hypoxic conditions, as shown by qRT-PCR and immunofluorescence of typical markers (Figs. 5D-R). Moreover, the differentiation-specific induction of the two central hypoxia-attenuated risk genes associated with orofacial clefts that we had identified by our bioinformatic analyses at 0.5% O2 (Boc and Cdo1), was still observable at 2% O2 (Figs. EV6C,D). Interestingly, in some rare cases, the attenuation of induction was lost or not as drastic as in 0.5% O2.

      We are convinced that the experiments at 2% O2 strongly increased the relevance of our manuscript, because we thus detected that oxygen levels prevailing shortly before the onset of CNCC differentiation still can influence their differentiation. This leads to the conclusion that only slight decreases of intra-uterine oxygen levels indeed might interfere with correct differentiation of CNCC.

      Point 1-3

      Standard immunohistochemistry or histology of differentiated cells would strengthen the authors' claims of reduced differentiation under hypoxic conditions, e.g., Alcian blue, alk-phos or Alizarin red, and smooth muscle actin or other indicator.

      We are grateful to the reviewer for the suggestion to include stainings of cells, as these stainings visualized the drastic effects of hypoxia on the cells. We performed immunofluorescent stainings against at least one marker protein for each differentiation paradigm. At 0.5% O2, each protein signals were nearly completely absent and cell morphology was disrupted (Figs. 2E,F, 3E, 4E). At 2% O2, we detected some more protein deposition than at 0.5%. Importantly, cells had retained their normal shape at mild hypoxia (Figs. 5H,M,R, EV5A).

      Point 1-4

      The authors identify a few genes that appear down-regulated in all three differentiation conditions. If it is within the scope of the study, it would strengthen the claim of these genes' function to show the effect of knock-down or knock-out for validation.

      We thank the reviewer for the suggestion of gene knock-down or knock-out in order to prove functional relevance of our findings. As this would have been too much effort and beyond the scope of our study, we rather followed the suggestion of reviewer 2 (cf. points 2-6, and 2-8) that headed to the same direction: we mined publicly available sequence data on orofacial development for gene expression or marks of active enhancers. We found robust expression of the two central hypoxia-attenuated OFC risk genes Boc and Cdo1 during human craniofacial development (Fig. 7A) and we identified enhancers that are active in embryonic craniofacial mouse tissue (Fig. 7B). Moreover, we detected expression of both genes during murine craniofacial development in undifferentiated mesenchymal cells, osteoblasts, chondrocytes and smooth muscle cells with the help of a single cell RNA-seq dataset (Figs. 7C-E, EV6B).

      Thus, we found evidence for the in vivo relevance of Boc and Cdo1 and could rule out a possible important role of Actg2, the third gene we had identified. We therefore are grateful for the suggestion to circumvent gene knockouts by reviewer 2, as we think these data strongly emphasized the importance of our findings.

      Point 1-5

      Another major critique lies in the initial claim that proliferation of O9-1 cells is not significantly impacted by hypoxia. In figures 1E-H, photograms of the cells cultured 24 -72 hours and quantifications of live vs dead cells are shown as evidence for this argument. However, the increased density of cells in normoxic conditions may be a confounding variable in this assay. It would be interesting for the researchers to assess the percent of dead vs alive cells between normoxic and hypoxic conditions when the plates reach equivalent densities.

      We apologize for the use of image sections from photographs with different cell densities. Of course, as demonstrated by our quantification, cell densities between 0.5% and 21% O2 in total were equal (cf. Figs. 1D,E). We therefore replaced the formerly used sections with new image sections with equal cell numbers.

      We thank the reviewer for the suggestion to examine if cell numbers influence cell death rates. We followed this advice by several approaches: first, we seeded cells at different densities, incubated them for 72 h (the same time span where a minimal difference had been detected) and performed live/dead stainings (Fig. EV1B). The seeding density did not affect percentages of dead cells and the values were in the same range as in our initial experiment (Fig. 1J). Moreover, we performed TUNEL stainings of apoptotic cells at different time points to have an additional readout of cell death (Figs. 1K,L). As expected, the percentages of TUNEL-positive cells were identical between hypoxic and normoxic cells at all analyzed time points.

      We therefore concluded that hypoxia does not influence the rate of cell death of proliferating CNCC and accordingly specified our wording in the results section.

      Point 1-6

      At end of Fig 1 section authors attempt to tie phenotypes observed in a cell line in vitro to the complex biological processes. They are not comparable and in vivo models would be better suited for these types of comparisons.

      We apologize for the overconfident wording in our manuscript. Of course, our in vitro experiments cannot fully simulate the complex developmental processes taking place in vivo. We therefore changed the text to a more careful formulation. Moreover, we kept the wording in the discussion section that we cannot exclude that in the in vivo situation proliferation of CNCC is also affected by low oxygen levels because nutrients might not be available in such excess as they are in cell culture.


      Point 1-7

      Fig 2: if qRT-PCR did not show statistically different results between experimental and control groups why move on to bulk RNA seq?

      We apologize that the sentence about statistical significance was misleading. What we wanted to express is that there was only a little difference (if any at all) between differentiated cells at 0.5% O2 and proliferating cells at 0.5% O2 or 21% O2. For the sake of clarity and readability, we deleted this misleading sentence.

      Point 1-8

      Fig 5: hypoxia this intense is going to affect broad range of biological processes and genes. Finding a few genes that are affected in extreme hypoxia that are also risk genes is highly unlikely. How can the authors be assured that these overlaps are actually significant and not just by chance?

      We thank the reviewer for the suggestion to test for statistical significance. We tested significance of the overlap of respective gene sets (nsOFC vs. hyp-a; OFC vs. hyp-a) by Fisher’s exact test. We included Venn diagrams depicting the overlap and present the exact p-values (Figs. EV5C,D). In each case where overlap of genes occurred, p-values indicated significance.

      Point 1-9

      Would appreciate discussion on how examination of neural crest is relevant for OFC, as most animal models of OFC demonstrate the pathogenesis in embryonic epithelium or periderm, not in the neural crest. Defects in neural crest are associated with other congenital craniofacial anomalies such as craniosynostosis or complex (Tessier) clefts, not the typical orofacial cleft. Please revise rationale of study, interpretation of data and Discussion to specifically state how neural crest cells are involved in the pathogenesis of orofacial cleft.

      We apologize for not pointing out enough the role of epithelial cells in the emergence of orofacial clefts. We revised our introduction, results and discussion sections in this regard and emphasized the role of epithelial cells. Importantly, we addressed the possible influence of the results gained in CNCC on epithelial cells by analyzing scRNA-seq data with the algorithm CellChat, as suggested by reviewer 2 (cf. point 2-8). We detected several cell communication pathways from CNCC to epithelial cells which contain components that are misexpressed upon hypoxia in our dataset (Figs. 7F-I). Therefore, during hypoxia, these pathways might influence epithelial cells and therefore indirectly cause orofacial clefts. We outlined this possible interplay in the discussion and briefly mentioned it in the abstract.

      We have not discussed more strongly the role of CNCC in the emergence of OFC in the revised manuscript, because we did not want to put even more emphasis on this matter. Numerous studies have proven the contribution of cranial neural crest tissue to the emergence of orofacial clefts. This fact is also pointed out in several review articles about orofacial clefts. In most cases, this knowledge was achieved by mouse models, because tissue-specific conditional knockouts are feasible (in contrast to genetic studies on patients), usually via deletion with the Wnt1-Cre driver. Funato et al. give an excellent (but quite old) overview of mouse models in which the neural crest-specific knockout of a gene leads to emergence of OFC and lists 17 genes for which this is the case (Funato et al, 2015). Moreover, several recent studies also report on the emergence of orofacial clefts upon neural crest-specific deletion (Forman et al, 2024; Li et al, 2025). These include genes responsible for DNA methylation (Ulschmid et al, 2024), and a study on subunits of chromatin remodeling complexes that are necessary for correct transcription of their target genes, which was conducted by our group (Gehlen-Breitbach et al, 2023).

      Minor comments

      __Point 1-10 __

      The author should replace "Final proof" in the introduction with "further evidence supporting."

      We apologize for the incorrect wording. Of course, it is highly questionable if there is such a thing as final proof in life sciences. We re-phrased the text according to the reviewer’s suggestion.

      Point 1-11

      Authors are inconsistent when referring to Figures- sometimes they capitalize (i.e. 1J) and other times they leave lower case (i.e. 1i). Needs to be consistent throughout. Figures are not numbered.

      We apologize for the inconsistency. We corrected the references to figures. Moreover, we apologize for the missing figure numbers. We also corrected this and included figure numbers.

      Point 1-12

      In figures authors would sometimes list 21% O2 first then 0.5% O2 or vice versa. (i.e. Fig on page 21 panels I, J, K). Needs to be consistent.

      We again apologize for being inconsistent. We corrected the inconsistency in Fig. 1D. Now, 21% O2 is presented before/above 0.5% O2.

      Point 1-13

      Figures on pages 28, 29, 30 panel J and page 31 panel F: there is no legend on what the scale/measurement is for the difference in expression level other than it ranges from -1 to +3.

      We thank the reviewer for the hint. We are aware that from the heatmaps we used one cannot infer relative expression rates of different genes or similar. If we would have considered expression strength of single genes, many of the gene-specific differing expression rates under the different conditions would have been hard to detect, as presentation would have been dominated by the differences in expression rates between genes. We therefore plotted gene-wise scaled expression.

      We included an explanation of the procedure in the materials and methods section.

      Point 1-14

      Will the authors please comment on the one normoxic sample in Figure 1I that did not cluster with the others? Did this meet the standards to merit exclusion as an outlier?

      We regret that the default scale of our plot of the principal component analysis is a bit misleading. This is the case because x-axis accounts for 80.3% of variance and y-axis only accounts for 6.1%. Therefore, the sample that might seem as an outlier actually met our standards. Nevertheless, we decided to keep the default scaling as is, in order not to embellish the graph (Fig. 1M).

      Point 1-15

      The authors refer to DEG as deregulated genes; while not strictly incorrect, the more standard usage is "differentially expressed genes." Please address.

      We apologize for the incorrect explanation of the acronym. Of course, this was corrected in the revised manuscript.

      Significance

      This work on neural crest cells and hypoxia are biologically and clinically significant.

      We are deeply grateful to the reviewer for considering our manuscript significant for both biologists and clinicians. We are convinced that the additional data we gathered in the course of the revision has significantly increased the importance of our work. Therefore, we once again express our gratitude to the reviewer for the valuable suggestions.

      Response to reviewer 2 comments

      Major comments


      Point 2-1

      The conclusions drawn from the experimental data are carefully formulated for the most part. One of the main concerns is that the cells were subjected to extreme hypoxic conditions, while it may be more biologically relevant to include a condition representing more mild hypoxia (e.g. 10%).

      Please refer to the response to point 1-2.

      Point 2-2

      One of the opening claims regarding severe hypoxia only mildly affecting cell proliferation is not shown clearly, since no mitotic markers have been analyzed (i.e. KI67 or PCNA staining or a simple EdU incorporation assay). Thus, the claim that they assessed cell proliferation is not very convincing, even though cell death was analyzed.

      We appreciate the reviewer’s suggestion to include a more thorough analysis of proliferation rates. We followed the advice and performed immunofluorescent stainings against Ki67 (accounting for cells in proliferative state) and phospho-histone H3 (accounting for cells undergoing mitosis). We performed this assay at different time points of culture in order to address the question if cell density might influence proliferation rates (Figs. 1F-H). Neither for Ki67 nor for pHH3 a difference was detected between 21% and 0.5% O2.

      We are convinced that these analyses strengthened our initial findings and provide strong evidence that hypoxia does not influence proliferation rates of CNCC.

      Point 2-3

      Additionally, cellular morphology of the cells could be assessed (brightfield images), since previous studies observed that hypoxia can be an inducive factor in cranial neural crest and driving EMT (Scully et al. 2016; Barriga et al. 2013).


      We thank the reviewer’s hint and followed the advice. We analyzed cellular morphology by the parameters cell length, total number of pseudopodia, number of filopodia and number of lobopodia (Figs. EV1C-F). As outlined in the results section, we did not detect a difference in these parameters between 21% and 0.5% O2.

      We included the second reference mentioned by the reviewer (Barriga et al, 2013) additionally to Scully et al. 2016 that had already been cited.

      Point 2-4

      Furthermore, in the RNA seq analysis of chondrogenic fate biased cells the authors draw a conclusion based on the proximity of the samples on the PCA plot, which is not very convincing. More careful analysis of the bulk RNA seq data sets they have generated for key marker genes will be more convincing (for example, a heatmap with selected genes would be a helpful representation).

      We apologize for the rash and inaccurate conclusion based on proximity on PCA plots. We are grateful to the reviewer for the suggestion to include heatmaps with selected marker genes. Following this advice, we generated heatmaps on our bulk RNA-seq data with the GO terms specific for each differentiation paradigm (Figs. EV2F, EV3F, EV4F).

      We are convinced that these maps are perfect additions to the heatmaps of the 200 top differentially-expressed genes that already had been included in the manuscript (Figs. 2K, 3J, 4J) and helped to strengthen our findings. For chondrocytes and smooth muscle cells, the new, GO-specific heatmaps perfectly recapitulated the phenomenon of hypoxia-attenuated induction. Interestingly, for osteoblasts, about half of the induced genes were hypoxia-attenuated, while the other half was induced stronger than under normoxia. This pointed to gene-specific mechanisms of hypoxia-dependent attenuation of transcription. Moreover, it shed light on a hypoxia-evoked complete dysregulation of transcriptional induction in osteoblasts, as nearly none of the genes was induced similar to normoxia.

      __ __


      Point 2-5

      As mentioned above, a straight-forward and not time consuming experiment (given that it was assessed for a maximum of 72 hrs) would be to repeat the culture of NCCs and stain for mitotic markers, and quantify the number of positively stained cells over total cell numbers. Furthermore, it is not that demanding to add an experimental condition of less severe hypoxia in this assay.

      We thank the reviewer for the suggestion and followed the advice (cf. point 2-2). The conducted experiments straightened our results, because the initially detected slight tendency to lower cell numbers at 0.5% O2 could thus be falsified: We did not detect any difference for Ki67 and pHH3 between 0.5% and 21% O2 at any analyzed time point (Figs. 1F-H). Moreover, percentages of dead or apoptotic cells at 0.5% O2 did not vary from 21% (Figs. 1I-L, EV1B). As we could not detect any difference in proliferation between 21% and 0.5% O2, we skipped the analysis of proliferating cells at 2% O2.

      Point 2-6

      Without underestimating how time consuming this would be, a major lack of experimental validation of the key genes they identify as important across all conditions may be the limitation of the study (this would be the difference between correlation and a probable underlying mechanism). This can be circumvented by more extensive reference to in situ data sets from mouse or existing data sets of single cell and spatial transcriptomics. A suggested targeted knock-down (for example with siRNA, shRNA or CRISPR) to validate a few of the key genes revealed as important could take a few months, with an estimated cost up to 5,000 euros per targeted gene and replicate.

      We thank the reviewer for the notion that targeted knockdowns are beyond the scope of our manuscript. We are deeply grateful for the reviewer’s constructive criticism and for the suggestion to analyze publicly available data sets in order to gather data depicting in vivo relevance of our identified central hypoxia-attenuated OFC risk genes Boc, Cdo1 and Actg2 (cf. point 1-4). We detected robust expression of Boc and Cdo1 during human craniofacial development (Fig. 7A) and we identified enhancers that are active in embryonic craniofacial mouse tissue (Fig. 7B). Moreover, we detected expression of both genes during murine craniofacial development in undifferentiated mesenchymal cells, osteoblasts, chondrocytes and smooth muscle cells by reanalysis of a scRNA-seq dataset (Figs. 7C-E, EV6B). This data comprised scRNA-seq of mouse embryonic maxillary prominence at stages E11.5 and E14.5 (Sun et al, 2023).

      Thus, we found evidence for the in vivo relevance of Boc and Cdo1 and could rule out a possible important role of Actg2, the third gene we had identified. We therefore are deeply grateful for the suggestion, as we think these data strongly emphasize the importance of our findings.

      Point 2-7

      On methods, replicates and statistics: The experimental methods and approach are described efficiently and seem reproducible. All biological and technical replicates are of a minimum of N=3 from independent experiments and statistical tests have been run in all cases.


      We thank the reviewer for the appreciation of our methodology, descriptions and statistical analyses.

      Minor points

      Point 2-8

      One of the key implications of NCCs in palate formation is interaction with orofacial epithelial cells, which the authors also mention. It may be interesting to check if any signaling pathways involved in this crosstalk are affected under hypoxic conditions in their existing data sets of bulk RNA SEQ. This can be done by using available algorithms such as CellChat (Jin et al. 2021; Jin, Plikus, and Nie 2023), which has been reported to work also in bulk RNA seq data analysis (according to GitHub). The authors could mine the literature for existing RNA sequencing data that include osteoblasts, chondrocytes and epithelial cells (Ozekin, O'Rourke, and Bates 2023; Piña et al. 2023).

      We are very grateful to the reviewer for this suggestion. Moreover, we like to thank the reviewer for mentioning exemplary references. We followed the advice by the methodology lined out in results and materials and methods sections: we applied the CellChat algorithm on a scRNA-seq dataset (Pina et al, 2023; Sun et al., 2023) to identify pathways containing components that are hypoxia-attenuated (and associated with a risk for OFC) in our bulk RNA-seq dataset (Figs. 7F-I). We did not use the datasets the reviewer had suggested, because the data were not available for us or the file format was not well-suited for the analysis with CellChat. Importantly, the dataset from Sun et al. has the following advantages over the suggested references: the complete maxillary prominence was used (instead of palatal shelves only), and different time points were included. Thus, we were able to follow the expression of genes of interest at different developmental stages before the onset of differentiation and after (Figs. 7C-E and EV6B). By our approach, we identified several OFC-related pathways that contain hypoxia-attenuated components such as BMP and FGF signaling and deposition of collagen and fibronectin (Figs. 7F-I). Importantly, the named pathways (and others) send outgoing communication patterns to epithelial cells. Therefore, hypoxia-attenuated gene induction in CNCC could influence epithelial cells via these pathways.

      We believe that the use of the CellChat algorithm has brought a deeper understanding of how hypoxia can have indirect consequences on the important topic of epithelial cells and thus could also evoke OFC. We therefore once again like to express our gratitude to the reviewer.

      Point 2-9

      Additionally, another process that may be affected is EMT (epithelial-to-mesenchymal-transition) and is possible to assess by re-analysis of bulk RNA-seq data while focusing on key genes implicated in this process (i.e. E-cadherin, vimentin, EpCAM, Snail, Twist, PRRX1).

      We thank the reviewer for the advice. We followed the advice and analyzed cellular morphology by the parameters cell length, total number of pseudopodia, number of filopodia and number of lobopodia (Figs. EV1C-F) (cf. point 2-3). As we did not detect any differences between 21% and 0.5% O2, and because the cells we used for our analyses represent mesenchymal cells, i.e. cells that had already undergone EMT, we did not re-analyze our dataset with the focus on EMT.

      Point 2-10

      Lastly, when the authors report on the significantly up- or down-regulated genes, it may be interesting to categorize them by ligands, receptors, intracellular molecules and transcription factors (and use separate plots to visualize them). While a big focus of the manuscript are down-regulated genes, less emphasis was given in upregulated genes (other than the response to hypoxia gene module).

      We thank the reviewer for the advice. Following this advice, we categorized genes according to Panther protein classes "intercellular signal molecule" (PC00207), "transmembrane signal receptor" (PC00197) and "gene-specific transcriptional regulator" (PC00264) and depicted the results with violin plots (Fig. EV5B). We could not analyze intracellular molecules, because this protein class does not exist in the Panther database. We had not focused on the genes with stronger induction in hypoxic condition, because the number of genes was low in each differentiation paradigm (7 in chondrocytes, less than 30 in osteoblasts, none in smooth muscle cells) and the transcriptional changes were mostly not as drastic as for the attenuated genes. In order to achieve a broader overview of deregulated processes, we now included GO term analyses of genes downregulated during the differentiation regimes both at 21% and 0.5% O2 (Figs. EV2D,E, EV3D,E, EV4D,E).

      Point 2-11

      The authors are referencing extensively and accurately existing studies in the field and the manuscript is exceptionally well-written, with only a few points of limited clarity or increased complexity. Such an example is when the authors refer to OFC risk genes, because it is not clearly stated how the referenced studies reached their conclusions (for example, are they mouse studies, do they involve mutants, are any of these studies based on GWAS on human cohorts). This matter would significantly improve the flow of the text and highlight the importance of the study and their findings.

      We would like to thank the reviewer very much for the appreciation of our scientific writing. We apologize for not explaining exactly how our OFC risk gene lists had been curated. We included this information for both non-syndromic and other OFC risk genes at the respective sites in the results section. Moreover, we included the Human Phenotype Ontology terms that had been used in the search in the materials and methods section.

      We thank the reviewer for this suggestion, as we agree that this information significantly highlights the importance of our findings.

      Point 2-12

      The figures could be redesigned to be more intuitive to interpret. For example, using violin plots and heatmaps, as discussed, and including references or re-analysis/re-use of existing spatial transcriptomics and in situs for marker genes.

      In all cases where there is a comparison of gene expression levels, violin plots would be a better representation of up- and down-regulated genes (i.e. selected genes from Fig1K, comparison of gene expression between normoxic and hypoxic NCCs, Fig 2G when analyzing chondrogenesis and the respective analysis for osteoblasts and smooth muscle cells, as well as when comparing the three fate-biasing conditions to identify common genes that are misregulated).

      We thank the reviewer for the advice and for the appreciation of the usage of heatmaps (Figs. 2K, 3J, 4J, 6F). Unfortunately, as the number of biological replicates is only three to four, the visualization of gene expression data from our bulk RNA-seq data with violin plots was not intuitive. We therefore retained the heatmaps rather than choosing bar graphs, because they are much clearer when presenting expression data of several to many genes. We included violin plots whenever possible due to high numbers of data points (Figs. EV1C, EV1D, EV1E, EV1F, EV5B). Moreover, we added additional heatmaps to depict transcriptional changes of genes associated with GO terms with the various differentiation regimes (Figs. EV2F, EV3F, EV4F). Unfortunately, we did not detect the three central hypoxia-attenuated genes in spatial transcriptomics data on craniofacial development. But we used scRNA-seq data of different stages of orofacial mouse tissue where we could identify expression of Boc and Cdo1 (cf. points 1-4 and 2-6). These data helped, together with other in vivo data to gain evidence for the in vivo function of Boc and Cdo1 during CNCC differentiation and helped to dismiss Actg2 as another central player.

      Significance

      Several pieces of evidence have pointed to hypoxia as an environmental factor contributing to congenital orofacial clefts, ranging from studies in mouse to observations in human. The authors are doing an excellent job in putting this information together and the question they are trying to answer is of high importance, given the prevalence of such congenital syndromes.

      We are deeply grateful to the reviewer for the appreciation of our work and for classifying our research topic as highly important.

      In terms of the methods and model employed, there are some limitations, related to the choice of a mouse cell line over one from human, the severe hypoxia induced (over a more mild), and the conditions of directed differentiation not allowing for simultaneous examination of more complex lineage transitions. The methods as a whole are not that up-to-date, given the single cell and multiplexed transcriptomic advances the last couple of decades, advanced bioinformatics that could be used in combination with in vitro lineage tracing methods.

      We thank the reviewer for the honest evaluation of our methods, especially for the constructive suggestions that were given to address our hypotheses with more up-to-date methods and at milder hypoxic conditions. As outlined above, we followed the advice and re-analyzed existing scRNA-seq datasets (cf. points 2-6 and 2-8) and checked our central hypotheses at milder hypoxic conditions (cf. response to point 1-3).

      We are deeply convinced that both significantly increased the biological relevance of our results, because we thus (1) gathered evidence for the in vivo function of Boc and Cdo1 and (2) were able to show that the phenomenon of hypoxia-attenuated gene induction still holds true at biologically relevant hypoxic conditions.

      The audience this work will reach are neural crest experts, developmental biologists, and potentially clinical doctors. The general public outreach of such a paper is also diverse, as more focus and visibility is required for the individuals affected by those syndromes and their families.

      We thank the reviewer for the judgement that our manuscript will not only reach neural crest experts, but also developmental biologists in general and potentially also clinicians. We are very much pleased that the reviewer shares our opinion that affected individuals should be more in the focus of public attention. We like to express our gratitude for the judgement that our manuscript might help to increase focus and visibility for them.

      References


      Barriga EH, Maxwell PH, Reyes AE, Mayor R (2013) The hypoxia factor Hif-1α controls neural crest chemotaxis and epithelial to mesenchymal transition. The Journal of cell biology 201: 759-776, 10.1083/jcb.201212100.

      Forman TE, Sajek MP, Larson ED, Mukherjee N, Fantauzzo KA (2024) PDGFRα signaling regulates Srsf3 transcript binding to affect PI3K signaling and endosomal trafficking. Elife 13, 10.7554/eLife.98531.

      Funato N, Nakamura M, Yanagisawa H (2015) Molecular basis of cleft palates in mice. World journal of biological chemistry 6: 121-138, 10.4331/wjbc.v6.i3.121.

      Gehlen-Breitbach S, Schmid T, Fröb F, Rodrian G, Weider M, Wegner M, Gölz L (2023) The Tip60/Ep400 chromatin remodeling complex impacts basic cellular functions in cranial neural crest-derived tissue during early orofacial development. International Journal of Oral Science 15: 16, 10.1038/s41368-023-00222-7.

      Hansen JM, Jones DP, Harris C (2020) The Redox Theory of Development. Antioxid Redox Signal 32: 715-740, 10.1089/ars.2019.7976.

      Li D, Tian Y, Vona B, Yu X, Lin J, Ma L, Lou S, Li X, Zhu G, Wang Y et al (2025) A TAF11 variant contributes to non-syndromic cleft lip only through modulating neural crest cell migration. Hum Mol Genet 34: 392-401, 10.1093/hmg/ddae188.

      Ng KYB, Mingels R, Morgan H, Macklon N, Cheong Y (2017) In vivo oxygen, temperature and pH dynamics in the female reproductive tract and their importance in human conception: a systematic review. Human Reproduction Update 24: 15-34, 10.1093/humupd/dmx028.

      Pina JO, Raju R, Roth DM, Winchester EW, Chattaraj P, Kidwai F, Faucz FR, Iben J, Mitra A, Campbell K et al (2023) Multimodal spatiotemporal transcriptomic resolution of embryonic palate osteogenesis. Nature communications 14: 5687, 10.1038/s41467-023-41349-9.

      Sun J, Lin Y, Ha N, Zhang J, Wang W, Wang X, Bian Q (2023) Single-cell RNA-Seq reveals transcriptional regulatory networks directing the development of mouse maxillary prominence. J Genet Genomics 50: 676-687, 10.1016/j.jgg.2023.02.008.

      Ulschmid CM, Sun MR, Jabbarpour CR, Steward AC, Rivera-González KS, Cao J, Martin AA, Barnes M, Wicklund L, Madrid A et al (2024) Disruption of DNA methylation-mediated cranial neural crest proliferation and differentiation causes orofacial clefts in mice. Proc Natl Acad Sci U S A 121: e2317668121, 10.1073/pnas.2317668121.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Schmidt and colleagues are addressing the effects of severe hypoxia on proliferation and differentiation potential of (mouse) cranial neural crest, using a neural crest cell line subjected to hypoxic conditions, assessed by transcriptomics analysis (quantitative reverse transcription PCR, bulk RNA sequencing and bioinformatics). They are reporting a mild effect of cell proliferation and an extensive inhibition of differentiation towards osteoblasts, chondrocytes and smooth muscle cells. They reveal affected biological processes shared between the three fate biasing conditions related to cytoskeleton organization and amino acid metabolism. Lastly, among affected genes upon hypoxic conditions in vitro, they authors identified risk genes linked to non-syndromic (non-genetic) orofacial clefts exclusively downregulated in osteoblasts and smooth muscle cells, namely Fgfr2, Gstt1 and Tbxa2. Similarly, hypoxia-driven downregulation of genes implicated in syndromic orofacial clefts was observed in all three chondrocyte, osteoblast and smooth muscle differentiation scenarios. Lastly, STRING analysis of downregulated genes cross-validated their findings related to affected differentiation.

      Major comments:

      The conclusions drawn from the experimental data are carefully formulated for the most part. One of the main concerns is that the cells were subjected to extreme hypoxic conditions, while it may be more biologically relevant to include a condition representing more mild hypoxia (e.g. 10%). One of the opening claims regarding severe hypoxia only mildly affecting cell proliferation is not shown clearly, since no mitotic markers have been analyzed (i.e. KI67 or PCNA staining or a simple EdU incorporation assay). Thus, the claim that they assessed cell proliferation is not very convincing, even though cell death was analyzed. Additionally, cellular morphology of the cells could be assessed (brightfield images), since previous studies observed that hypoxia can be an inducive factor in cranial neural crest and driving EMT (Scully et al. 2016; Barriga et al. 2013).

      Furthermore, in the RNA seq analysis of chondrogenic fate biased cells the authors draw a conclusion based on the proximity of the samples on the PCA plot, which is not very convincing. More careful analysis of the bulk RNA seq data sets they have generated for key marker genes will be more convincing (for example, a heatmap with selected genes would be a helpful representation). As mentioned above, a straight-forward and not time consuming experiment (given that it was assessed for a maximum of 72 hrs) would be to repeat the culture of NCCs and stain for mitotic markers, and quantify the number of positively stained cells over total cell numbers. Furthermore, it is not that demanding to add an experimental condition of less severe hypoxia in this assay. Without underestimating how time consuming this would be, a major lack of experimental validation of the key genes they identify as important across all conditions may be the limitation of the study (this would be the difference between correlation and a probable underlying mechanism). This can be circumvented by more extensive reference to in situ data sets from mouse or existing data sets of single cell and spatial transcriptomicsA suggested targeted knock-down (for example with siRNA, shRNA or CRISPR) to validate a few of the key genes revealed as important could take a few months, with an estimated cost up to 5,000 euros per targeted gene and replicate. On methods, replicates and statistics: The experimental methods and approach are described efficiently and seem reproducible.All biological and technical replicates are of a minimum of N=3 from independent experiments and statistical tests have been run in all cases.

      Minor comments:

      One of the key implications of NCCs in palate formation is interaction with orofacial epithelial cells, which the authors also mention. It may be interesting to check if any signaling pathways involved in this crosstalk are affected under hypoxic conditions in their existing data sets of bulk RNA SEQ. This can be done by using available algorithms such as CellChat (Jin et al. 2021; Jin, Plikus, and Nie 2023), which has been reported to work also in bulk RNA seq data analysis (according to GitHub). The authors could mine the literature for existing RNA sequencing data that include osteoblasts, chondrocytes and epithelial cells (Ozekin, O'Rourke, and Bates 2023; Piña et al. 2023).

      Additionally, another process that may be affected is EMT (epithelial-to-mesenchymal-transition) and is possible to assess by re-analysis of bulk RNA-seq data while focusing on key genes implicated in this process (i.e. E-cadherin, vimentin, EpCAM, Snail, Twist, PRRX1). Lastly, when the authors report on the significantly up- or down-regulated genes, it may be interesting to categorize them by ligands, receptors, intracellular molecules and transcription factors (and use separate plots to visualize them). While a big focus of the manuscript are down-regulated genes, less emphasis was given in upregulated genes (other than the response to hypoxia gene module).

      The authors are referencing extensively and accurately existing studies in the field and the manuscript is exceptionally well-written, with only a few points of limited clarity or increased complexity. Such an example is when the authors refer to OFC risk genes, because it is not clearly stated how the referenced studies reached their conclusions (for example, are they mouse studies, do they involve mutants, are any of these studies based on GWAS on human cohorts). This matter would significantly improve the flow of the text and highlight the importance of the study and their findings. The figures could be redesigned to be more intuitive to interpret. For example, using violin plots and heatmaps, as discussed, and including references or re-analysis/re-use of existing spatial transcriptomics and in situs for marker genes.

      In all cases where there is a comparison of gene expression levels, violin plots would be a better representation of up- and down-regulated genes (i.e. selected genes from Fig1K, comparison of gene expression between normoxic and hypoxic NCCs, Fig 2G when analyzing chondrogenesis and the respective analysis for osteoblasts and smooth muscle cells, as well as when comparing the three fate-biasing conditions to identify common genes that are misregulated).

      References:

      Barriga, Elias H., Patrick H. Maxwell, Ariel E. Reyes, and Roberto Mayor. 2013. "The Hypoxia Factor Hif-1α Controls Neural Crest Chemotaxis and Epithelial to Mesenchymal Transition." The Journal of Cell Biology 201 (5): 759-76. https://doi.org/10.1083/jcb.201212100.

      Jin, Suoqin, Christian F. Guerrero-Juarez, Lihua Zhang, Ivan Chang, Raul Ramos, Chen-Hsiang Kuan, Peggy Myung, Maksim V. Plikus, and Qing Nie. 2021. "Inference and Analysis of Cell-Cell Communication Using CellChat." Nature Communications 12 (1). https://doi.org/10.1038/s41467-021-21246-9.

      Jin, Suoqin, Maksim V. Plikus, and Qing Nie. 2023. "CellChat for Systematic Analysis of Cell-Cell Communication from Single-Cell and Spatially Resolved Transcriptomics." bioRxiv. https://doi.org/10.1101/2023.11.05.565674.

      Ozekin, Yunus H., Rebecca O'Rourke, and Emily Anne Bates. 2023. "Single Cell Sequencing of the Mouse Anterior Palate Reveals Mesenchymal Heterogeneity." Developmental Dynamics : An Official Publication of the American Association of Anatomists 252 (6): 713-27. https://doi.org/10.1002/dvdy.573.

      Piña, Jeremie Oliver, Resmi Raju, Daniela M. Roth, Emma Wentworth Winchester, Parna Chattaraj, Fahad Kidwai, Fabio R. Faucz, et al. 2023. "Multimodal Spatiotemporal Transcriptomic Resolution of Embryonic Palate Osteogenesis." Nature Communications 14 (September):5687. https://doi.org/10.1038/s41467-023-41349-9.

      Scully, Deirdre, Eleanor Keane, Emily Batt, Priyadarssini Karunakaran, Debra F. Higgins, and Nobue Itasaki. 2016. "Hypoxia Promotes Production of Neural Crest Cells in the Embryonic Head." Development 143 (10): 1742-52. https://doi.org/10.1242/dev.131912.

      Significance

      Several pieces of evidence have pointed to hypoxia as an environmental factor contributing to congenital orofacial clefts, ranging from studies in mouse to observations in human. The authors are doing an excellent job in putting this information together and the question they are trying to answer is of high importance, given the prevalence of such congenital syndromes. In terms of the methods and model employed, there are some limitations, related to the choice of a mouse cell line over one from human, the severe hypoxia induced (over a more mild), and the conditions of directed differentiation not allowing for simultaneous examination of more complex lineage transitions. The methods as a whole are not that up-to-date, given the single cell and multiplexed transcriptomic advances the last couple of decades, advanced bioinformatics that could be used in combination with in vitro lineage tracing methods.

      The audience this work will reach are neural crest experts, developmental biologists, and potentially clinical doctors. The general public outreach of such a paper is also diverse, as more focus and visibility is required for the individuals affected by those syndromes and their families.

      Reviewer's expertise: mouse neural crest lineage and multipotency, lineage tracing, single cell transcriptomics, NGS, immunofluorescence, molecular methods (RNA, DNA based). Limited expertise with in vitro studies.

    6. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Title: Hypoxia impedes differentiation of cranial neural crest cells into derivatives relevant for craniofacial development

      Synopsis: Cleft lip w/ or w/o cleft palate is the second-most common birth defect worldwide. Defects are often traceable to cranial neural crest cells through genetics or environmental factors. Schmid and coauthors focus on the environmental factor of hypoxia and investigate the effects of hypoxic conditions on the ability of CNCCs to differentiate and migrate. They performed RNA-seq analysis with qRT-PCR validation for specific markers and show that hypoxia appears to repress differentiation without markedly affecting proliferation. Hypoxic conditions did not demonstrated significant perturbations in cell proliferation; however, chondrocyte, osteoblast, and smooth muscle differentiation was significantly reduced for cell lines cultured under hypoxia. Bulk RNA-seq and PCA revealed dysregulation of genes implicated in cytoskeletal integrity (such as actin γ-2), neural crest cell migration (hedgehog co-receptor brother of CDO) and amino acid metabolism (cysteine dioxygenase), which Schmid and colleagues termed OFC risk genes.

      Major comments

      • The authors performed qRT-PCR validation for markers of differentiation and hypoxia, with a major absence of VEGF and HIF1a. The paper would be strengthened by mention of these factors, especially by qRT-PCR or Western blot.
      • Please provide justification of selection 0.5% as their hypoxic condition or perhaps repeat experiments in a less extreme environment to see if their conclusions still hold true.
      • standard immunohistochemistry or histology of differentiated cells would strengthen the authors' claims of reduced differentiation under hypoxic conditions, e.g., Alcian blue, alk-phos or Alizarin red, and smooth muscle actin or other indicator.
      • The authors identify a few genes that appear down-regulated in all three differentiation conditions. If it is within the scope of the study, it would strengthen the claim of these genes' function to show the effect of knock-down or knock-out for validation.
      • Another major critique lies in the initial claim that proliferation of O9-1 cells is not significantly impacted by hypoxia. In figures 1E-H, photograms of the cells cultured 24 -72 hours and quantifications of live vs dead cells are shown as evidence for this argument. However, the increased density of cells in normoxic conditions may be a confounding variable in this assay. It would be interesting for the researchers to assess the percent of dead vs alive cells between normoxic and hypoxic conditions when the plates reach equivalent densities.
      • At end of Fig 1 section authors attempt to tie phenotypes observed in a cell line in vitro to the complex biological processes. They are not comparable and in vivo models would be better suited for these types of comparisons.
      • Fig 2: if qRT-PCR did not show statistically different results between experimental and control groups why move on to bulk RNA seq?
      • Fig 5: hypoxia this intense is going to affect broad range of biological processes and genes. Finding a few genes that are affected in extreme hypoxia that are also risk genes is highly unlikely. How can the authors be assured that these overlaps are actually significant and not just by chance?
      • Would appreciate discussion on how examination of neural crest is relevant for OFC, as most animal models of OFC demonstrate the pathogenesis in embryonic epithelium or periderm, not in the neural crest. Defects in neural crest are associated with other congenital craniofacial anomalies such as craniosynostosis or complex (Tessier) clefts, not the typical orofacial cleft. Please revise rationale of study, interpretation of data and Discussion to specifically state how neural crest cells are involved in the pathogenesis of orofacial cleft.

      Minor comments

      • The author should replace "Final proof" in the introduction with "further evidence supporting."
      • Authors are inconsistent when referring to Figures- sometimes they capitalize (i.e. 1J) and other times they leave lower case (i.e. 1i). Needs to be consistent throughout. Figures are not numbered
      • In figures authors would sometimes list 21% O2 first then 0.5% O2 or vice versa. (i.e. Fig on page 21 panels I, J, K). Needs to be consistent.
      • Figures on pages 28, 29, 30 panel J and page 31 panel F: there is no legend on what the scale/measurement is for the difference in expression level other than it ranges from -1 to +3.
      • Will the authors please comment on the one normoxic sample in Figure 1I that did not cluster with the others? Did this meet the standards to merit exclusion as an outlier?
      • The authors refer to DEG as deregulated genes; while not strictly incorrect, the more standard usage is "differentially expressed genes." Please address.

      Significance

      This work on neural crest cells and hypoxia are biologically and clinically significant.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Major comments

      Unfortunately the major conclusions of the article are not well supported by the provided data. Including:

      1. That interhemispheric remodelling occurs in non-mammalian amniotes. It would not surprise me that this may be the case, however, the major evidence for this is a series of horizontal insets that do not evidence this point well. There are broad morphological changes during development that can change the proportions and regionalisation of tissue, and therefore the IHF becoming apparently smaller as development progresses (qualitatively, in single sectioning planes, and without clear n numbers) may easily be explained by sutble differences in sectioning planes, or, for example, more caudal territories of the brain expanding at faster rates than the rostral territories. Quantification of the ratio between the IHF and total midline length across ages and between species may go some way to helping to clarify the degree of potential midline remodelling. Very high quality live imaging of the process would be the definitive way to evidence the claim, although I appreciate this is highly technically difficult and may not be possible. A key opportunity seems to be missed in the Satb2 knockout geckoes, where midline remodelling is purported to not occur. This is shown only qualitatively in a single plane of sectioning and again is not convincing. If the IHF length in these animals was quantified to be longer than wildtype at a comparable age, this would help to evidence the claim that remodelling occurs in these species.

      Our responses

      We take seriously the critique that the series of horizontal section images in the figures do not sufficiently substantiate our claim that interhemispheric remodeling occurs in non-mammalian amniotes. To address this, we plan to create a simplified atlas composed of adjacent sections of various wild-type amniotes as well as Satb2-knockout geckos.

      Additionally, in response to the suggestion that the IHF (interhemispheric fissure) should be quantified relative to the total midline length across developmental stages and species, we note that Figure 1 already presents such an analysis. Specifically, we have quantified changes in the midline collagen content using Principal Component Analysis (PCA) in Satb2 Crispants in geckos(FigureS4). However, if necessary, we also plan to perform a similar analysis on wild-type soft-shelled turtles at developmental stages before and after interhemispheric remodeling.

      That similar cell types contribute to remodelling in non-mammalian amniotes as mice/eutherian mammals. The microphotographs presented are not of very high quality, and it is often difficult to be convinced that the data is showing the strong claims made in the paper. For instance the "MZG-like cells" may in fact be astrocytes or another cell type as it is hard to visualise morphology, and the "intercalation of GFP-positive radial glial fibres" is very unclear from the photos. The colocalization of MMPsense with laminin positive cells is very hard to appreciate from the figure, and again not quantified. Similarly, there is a claim that there was degeneration of laminin-positive leptomeninges during astroglial intercalation, which is an active process that is difficult to infer from a single microphotograph. From the data, I can appreciate that some of the similar broad categories of cell types that exist at the mouse midline (glia, radial glia) are also present in non-mammalian amniote midlines, but it is difficult to be convinced of much more than this from the data presented.

      Our responses

      We take seriously the critique that the degeneration of Laminin-positive leptomeninges close to astroglial components is not accepted and that the evidence for glial fiber intercalation is insufficient.

      Verifying the degeneration of Laminin-positive leptomeninges is highly challenging. However, we have recently developed a method to visualize collagen in the pia mater using μCT and a CHP probe (3Helix Inc.). Preliminary experiments have already revealed pan-collagen deposition in the midline of the telencephalon (with lower amounts in the fusion region) and degeneration of the collagen composing the pia mater. We plan to incorporate these findings into the revised manuscript.


      That the gecko RPC and CPC connect distinct parts of the brain (rostral and caudal). These tracer injections lacked visualisation of the deposition site to confirm specificity, as well as appropriate quantification. Importantly, the absence of axons in the CPC following the rostral dye deposition (and vice versa) was not shown, which is essential to make the claim that these commissures carry axons from specific parts of the brain. The alternative hypothesis is that all axons are intermixed and traverse both commissures, independent of brain area of origin, which is not at all tested or disproved by the data presented.

      Our responses

      Thank you for the valuable critique suggestion. To support our claim that the pallial commissure in geckos consists of axons derived from specific brain regions, we should carefully eliminate the possibility that all axons are intermixed and cross both RPC and CPC regardless of brain region.

      To address this, we are planning additional experiments and will include a schematic diagram clearly indicating the labeling sites.


      Overall, the major conclusions of the study are not well supported by the data. A major effort to quantify phenomena and/or dramatically soften conclusions would be needed in order to make the conclusions well supported.

      Our responses

      We will thoroughly reconsider our conclusions and make significant efforts to revise the manuscript.

      Minor comments

      1. The n numbers are not always clearly reported

      Our responses

      We plan to address the clarification of quantitative data and the exact number of replicates.

      At times important points reference reviews or articles that do not support the statements as well as the most important primary articles might.

      Our responses

      We plan to carefully review the manuscript and, in addition to citing the most important primary papers, revise any descriptions that are not sufficiently supported by the cited reviews or articles, as per the suggestions.

      Figures showing the entire section that insets were taken from would help to convince that sectioning planes were equivalent, and also show the deposition site of neurovue experiments.

      Our response

      We will add a schematic showing the locations labeled in NeuroVue and additional experiments as a similar point made in Major comment 3.

      The fibre direction of GFAP+ fibres in figure 6 is confusing - It seems from the labelling on the figures as if red is used for the WT condition in mouse, but for the Satb2del condition in Gecko? If this is the case, then it would appear that the fibres are more specifically oriented in the del condition in mice, but in the WT condition of geckoes? There are several instances of this where clearer description and labelling would help the reader to interpret the results.

      Our response

      We plan to add clarification and indication of the direction of GFAP+ fibers in Figure 6 to make it easier to understand.

      Reviewer #1 (Significance (Required)):

      This study attempts to address a highly significant, novel and important question, that, if well achieved, would be publishable at a high degree of interest and impact to the basic research fields of brain development and evolution. Unfortunately the major conclusions made by the study are stronger than the data provided is able to evidence, and I remain unconvinced by many of them.

      Our responses

      We take seriously the suggestion that the major claims made by this study are excessive and so strong that they cannot be proven with the data provided. We will revise the manuscript as necessary.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      The authors provide a comparative analysis of interhemispheric (IHF) remodeling and its potential role in the generation of commissural axons. Based on histological material from mice, chickens, turtles, and geckos, the IHF remodeling of the midline is divided in two events: caudal and rostral. It is suggested that the rostral event is a preliminary step to the crossing of commissural axons, as it is characteristic of eutherian mammals with a corpus callosum (CC). However, the authors describe similar histologic features in other amniotes during development, particularly reptiles. This is in contrast with the case of the chick, which does not show signs of IHF remodeling nor a rostral pallial commissure. Additionally, deficient transgenic mice and geckos illustrate a potential role of Satb2 in rostral IHF remodeling and subsequent commissural formation. Whereas the topic and the conclusions of the analysis are interesting and provide new knowledge to the evo-devo field, several issues should be addressed prior to publication, such as data precision and presentation to support the main statements in the manuscript.

      Major comments:

      ____-A central point of this article is the splitting of the IHF into rostral and caudal events. The authors suggest that each one can be regulated differentially, and they attribute the rostral remodeling as a step prior to corpus callosum (CC) formation, in contrast to the caudal remodeling. In my opinion, these two events are not sufficiently characterized either in the figures or the manuscript. It is necessary to better describe these two processes that the authors mention. For instance, the authors could add or re-organize information in Figures 1-3 to include wide-field images showing the whole septum from rostral to caudal, and representative dorsoventral sections at important stages (with insets pointing at specific features). Otherwise, a table summarizing the rostral and caudal events would also be helpful to the reader.

      Our responses

      We take the suggestion seriously that the distinction between rostral and caudal remodeling may not be clear, especially regarding rostral remodeling, which is prior to the stage of corpus callosum (CC) formation, in contrast to caudal remodeling. Specifically, we plan to add or restructure the information from Figures 1 to 3 by including wide-field images that show the entire septum from rostral to caudal, as well as representative sagittal sections along the dorsal-ventral axis at key stages, with insets highlighting specific features. These will be added to the Supplementary data. Additionally, a table summarizing the events in both the rostral and caudal regions will also be created and included in the revised manuscript.

      When the authors refer to the reptilian rostral pallial commissure (RPC) and caudal pallial commissure (CPC), are these the same structures as the pallial commissure and anterior commissure described by Lanuza and Halpern (1997), Butler and Hodos (2005) and Puelles et al. (2019)? It is necessary to clarify the nomenclature, given that they are providing data from several species. Also, structures with the same names among species may not be truly homologous. A simple atlas with some horizontal and transverse planes highlighting anatomical landmarks and important structures (commissural tracts in this case) of the non-mammalian species would be extremely useful for the reader.

      Our responses

      As suggested by the reviewer, we are considering to provide a more detailed definition of the nomenclature of the pallial commissure in the revised manuscript, specifically in the introduction. Additionally, as mentioned earlier, we plan to create a simplified atlas with several horizontal and transverse sections, emphasizing anatomical landmarks and important structures (in this case, the commissural pathways) in species other than mammals.

      ____I wonder if the authors tested Fgf8 as marker on any of their sauropsidian tissue samples, as this gene has a known role in murine MZG development, which is required for IHF remodeling (Gobius et al. 2016, already cited in the manuscript). It would be beneficial to test this marker for the study, and if positive, it would open the possibility of designing loss-of-function experiments in avian or reptilian development models to identify mechanisms common to eutherians and support the statements of this work.

      Our responses

      We plan to verify the gene expression necessary for mouse MZG development and IHF remodeling, including Fgf8, DCC, and MMP2, through immunohistochemical staining as suggested.

      It would be really interesting to provide a more elaborate discussion on whether authors consider the sauropsidian IHF as a homologous process to eutherian IHF, and the reptilian RPC as an homologous of the CC.

      Our responses

      Since 3 out of the 4 reviewers consider IHF remodeling in sauropods to be homologous to that in placental mammals, we plan to further emphasize this claim in the revised manuscript. Additionally, we will expand on the discussion regarding whether the process of RPC formation in reptiles is considered homologous to that of the corpus callosum, and I will approach this from the context of character identity mechanisms claimed by Dr. Günter Wagner.

      Data and methods are presented in such a way that, in principle, they could be reproduced. Authors should indicate the number of animals/replicates of each species used in each experiment.


      Our responses

      As suggested, we plan to provide more detailed descriptions of the methods to ensure reproducibility. This will include adding the number of samples and trial repetitions for each animal species used in the experiments, including those for the additional experiments, in the revised manuscript.


      Minor comments:

      In the results section, paragraph 2, line 3: "We detected the accumulation of GFAP-positive cells and phosphorylated vimentin (Ser55) -positive mitotic radial glia in the IHF and telencephalic hinge in developing turtles, geckoes and chicks (Figure 2A)". Figure 2A shows sections from the four analyzed species labeled with radial glia markers at the end of the IHF remodeling. It would be beneficial to have analogous sections at several time points (perhaps before or after the process) to compare and show more clearly the accumulation of glial cells at that location.

      Our responses

      We have prepared serial sections before and after the developmental stages when interhemispheric remodeling occurs, in order to compare and more clearly show the accumulation of glial cells at their respective locations in mice, geckos, and soft-shelled turtles. I plan to add these results to Figure 2A in the revised manuscript.

      The article will improve its quality by adding more comparative information in the introduction about the analyzed sauropsidian structures (rostral pallial commissure and caudal pallial commissure), their relations with the pallial and anterior commissures, the structures/cells connected by them, and homologies previously proposed.

      Our responses

      We will add comparative information regarding the brain structures in sauropod, including the rostral and caudal pallial commissures and their relationship to the pallial commissure and anterior commissure, and the structures they connect, such as pyramidal cells, along with previously proposed homologies. This information will be included in the introduction and summarized in a table.


      In Figure 1 panels A-D, there is a lot of disparity in brain sizes and scales both between sections of the same species and between species. Placing the insets next to their source images is very necessary for clarity.


      Our responses

      As mentioned earlier, I will create a simplified atlas using adjacent sections and continuous μCT tomography images. Additionally, I will adjust the placement of the inset images in the revised manuscript to more visually accessible positions, improving their visibility.

      In the results section, paragraph 2, line 11: "In addition, it was suggested that astroglial intercalation occurs in conjunction with the aforementioned regression of the IHF from st.21 to st.26 in the developing turtle (Figure 2C)." In Figure 2C, all images are at different scales,

      which makes it very hard to properly compare between stages.

      Our responses

      By creating inset images based on the low-magnification images in the upper panel, we will enhance the visibility of GFAP intercalation. Additionally, we will improve the visibility in the revised manuscript by adding scale bars, referencing the simplified atlas in the figure legends, and standardizing the tissue specimen scale. we also plan to correct any typographical errors in the figures.

      In Figure 2D, the authors show the presence of MMP around the leptomeninges, suggesting MMP-mediated degradation. In the images, MMP labeling is revealed in dark blue, which is largely invisible against the black background. Colors should be used properly to allow visualization of this MMP labeling.

      Our responses

      In Figure 2D, we will reconsider the selection of pseudo-colors and use cyan to represent MMPsense.

      In Figure 4, it would really help if the authors provided wide-field images and DAPI counterstaining of the anterograde and retrograde tracings, to provide anatomical landmarks that help readers to identify the midline and understand the orientation of images.


      Our responses

      In addition to the previously mentioned schematic diagram of the gecko's pallial commissure and the additional experiments, we plan to include wide-field images along with forward and retrograde tracing using Hoechst counterstaining.

      In Figure 5B, I understand that the images in the red and blue squares correspond to brain areas in the squares in A. However, some confusion remains, especially with the image in B, which does not seem to be at the same angle as in the diagram representation. This makes it difficult to understand the results.

      Our responses

      According to the comment, we will revise the design of the Figure 5B to be more easily understand, and modify the scheme to match the angle of sections with actual figures.

      In Figure 6D, to better visualize defects in the RPC formation, the asterisk in the middle of the deficient structure needs to be replaced with a more lateral arrow pointing to the malformation.


      Our responses

      To better visualize the absence of RPC formation in Figure 6D, we will replace the asterisk in the center of the missing structure with a horizontal arrow indicating the malformation.

      In Figure S5, violin plots in panel C do not correspond with data in A and B. This needs correction or clarification.

      Our responses

      In Figure S5, the inconsistency between the violin plot in panel C and the data in panels A and B is a clear error, and we will correct this in the revised manuscript.

      In the article, a section appears solely to explain spatial transcriptomics results in a chick coronal section. The conclusion of this experiment is that three markers associated with midline remodeling are present in chick, suggesting that interhemispheric remodeling is conserved between mouse and chick. As these are complementary results and are not deeply analyzed in this manuscript, I think it would be better to summarize these findings in a dedicated paragraph and transfer some of the key images from Figure S2 to one of the main figures. Other problems with Figure S2: color contrast between clusters in the tSNE projection in B is very poor, should be enhanced; color intensity in FeaturePlots of panels D-F is too weak, and it seems that there is not really much expression at all in any cluster for any of these genes.

      Our responses

      In the revised manuscript, we will move some of the key images from Figure S2 to Main Figure 3 to demonstrate that the three markers related to midline remodeling are also present in chickens, showing that interhemispheric remodeling is conserved between mice and chickens. Additionally, we will enhance the contrast between clusters in the tSNE projection of the FeaturePlots in S2B and D-F by increasing the pseudo-color intensity or adjusting the intensity levels to emphasize the color contrast, and incorporate this updated figure into the revised manuscript.

      Reviewer #2 (Significance (Required)):

      The authors identify in the developing brain of sauropsids an event similar to IHF remodeling in eutherians, and suggest a causal relation between the rostral IHF remodeling and the formation of the pallial commissure in reptilian brains. This implies a potential homology between the pallial commissure and the corpus callosum of placental mammals. If this is the intention of the authors, this conclusion should be addressed explicitly and at length in the Discussion section. Whereas the results and conclusions described in the manuscript will be valuable in the field, the data presented in the manuscript needs quite some improvement, particularly for some of the images in the previously mentioned figures. Otherwise, the original data cannot be properly judged and may set reasonable doubt to readers.

      Advance: The findings described in this report are new to my knowledge. The description of the IHF remodeling event prior to corpus callosum development in mice has been published (Gobius et al. 2016, Cell Reports), but not in other mammalian branches or non-mammalian vertebrates. For this reason, the data in this report should be very convincing and better presented.

      Audience: This research will be interesting for a specialized and basic research audience, particularly for researchers in the evo-devo fields.

      My expertise: neuroanatomy, development, evolution, brain, cerebral cortex

      Our responses

      Thank you for your positive feedback on the novelty and high evaluation of identifying phenomena in reptilian development that resemble interhemispheric fissure (IHF) remodeling in placental mammals and demonstrating a causal relationship between rostral IHF remodeling and the formation of the reptilian pallial commissure. we will incorporate the concept of the potential homology between the corpus callosum in placental mammals and the brain commissures in reptiles into the revised manuscript, reflecting this in the context of character Identity mechanisms claimed by Dr. Günter Wagner. This will be clearly and thoroughly discussed in the discussion section. Additionally, we sincerely appreciate the constructive comment about the room for significant improvement, particularly in some of the figures, and we will address these points in the revised manuscript.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Conserved interhemispheric morphogenesis in amniotes preceded the evolution

      of the corpus callosum. Noji Kaneko et al., 2025

      The CC is formed exclusively in placental mammals. In other amniotes species, the communication of the two hemispheres is mediated by other structures such as the anterior commissure or the hippocampal commissure. The authors perform anatomical comparisons between species to conclude that interhemispheric fissure remodeling, a prior developmental step for CC formation, is highly conserved in non-mammalian amniotes, such as reptiles and birds. They suggest that might have contributed to the evolution of eutherian-specific CC formation. In an attempt to test their hypothesis, the authors investigate the role of Satb2 in interhemispheric fissure remodeling. They show IH fissure defects in both mice and geckoes. This is a nice manuscript that bridges a gap in the current understanding of CC formation. The study is mostly anatomical and directed at a specialized community.

      Our response

      We appreciate for positive comments on the manuscript.

      I suggest some changes that might contribute to improving the manuscript.

      Main

      1. Much of the most important conclusions are extracted from the anatomical observation of the dynamics of IHF closure and the emergence of the Hinge. It is very clear that the researchers are specialists in the field but for a broader audience, the images they provide are not always easy to interpret. It takes a lot of effort to visualize the anatomical data they use for their conclusions. As an example, perhaps the authors can find ways to explain how to identify the hinge specifically. It is very clear what the hinge is in the schemes (drawings)but forms one picture to the other at different developmental stages neither in the same animal species nor from different species. In Figure 1, it is difficult to see how the hinge in the mouse is similar (i.e. the same structure) to the hinge in the Gecko and chick. Moreover, in panels C , chick brain sections are shown at much greater magnification than the gecko, and thus is very difficult. In addition, in the manuscript text, the authors refer to sequential sectioning, but only one image for each stage is shown. They can show more images in supplementary Figures, otr they can just explain that they show the relevant images of the sectioning. As another example, in Fig2A, in the text, the authors explain that they detect the same specific glial components, but the images show very different co-localizations and distributions. In Figures 1 and 3, there are lines indicating Dorsal to ventral. This refers to the sectioning but in reality, what the sections are illustrating is the anterior-to-posterior differences in the IHF. maybe they can clarify it, because at quick sight it can be confusing.

      Our responses

      We sincerely appreciate the feedback regarding the interpretation of images that show the dynamics of interhemispheric remodeling and the emergence of the hinge, which is central to the most important conclusions of this study, as it may not always be easy to interpret. In the revised manuscript, we plan to address this by making the following revisions. For example, to clarify how the hinge corresponds across different species, we will create a simplified atlas to explain that the sections from the main figure are at the same level within the continuous slices.

      The authors have to revise the manuscript text to be more precise. For example, In the result section quote "To address whether the interhemispheric remodeling in non-mammalian amniotes is dependent on midline glial activities, we next examined the expression of several glial markers in the reptilian and avian midline regions". the anatomical comparison does not address the role of glial.

      Our responses

      Thank you for your feedback. I will correct the expression "midline glial activities" to "midline glial components" and incorporate this more accurate terminology into the revised manuscript.


      As an option to increase the relevance of their work, the authors might want to consider to describe in more detail and moving the results of the RNAseq and the analysis of the Stab2 mutants to the main figures.


      Our responses

      Thank you for your feedback. we will move the RNAseq results and the analysis of Satb2 mutants to the main figures and will describe them in more detail to enhance the relevance of the study. Specifically, we plan to separate Figure 6A-C as independent figures and add Supplementary Figure 5, corresponding to mice and geckos, to the main figures in the revised manuscript.


      Minor:

      Please indicate the length of the scale bars in the figure legends, and not only in the figure panels Fig5. Indicate the animal model in the panel Perhaps they can draw a model of the different mechanisms of caudal and anterior remodeling.


      Our responses

      Thank you for your feedback. I plan to revise the figure legend for Figure 5 by clearly indicating the scale bar length and increasing the font size, as well as including the information in each panel. Additionally, I will add a graphical abstract that illustrates the different mechanisms of caudal and rostral remodeling to enhance visual comprehension.


      Reviewer #3 (Significance (Required)):

      The study addresses a gap in knowledge from an evolutionary perspective. It provides novel hypotheses and an innovative framework for the understanding of cortical development and evolution. however, most of the conclusions are inferred from anatomical observations and the experimental testing of the hypothesis (Mutants and RNAseq analysis) are minor part of the study that could be further developed. The study is interesting for investigators with expertise in brain development and evolution but requires familiarity with comparative anatomy and even then it is difficult to go through the work.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Overall, this is a well-written manuscript focusing on the evolution of mid-line interhemispheric fusion related to corpus callosum development and evolution from amniotes to eutherian species. The authors also demonstrated that Satb2 plays a critical role in interhemispheric remodeling, which is essential for corpus callosum development. This is a nicely organized and interesting study and the data are compelling. The following are suggestions for improvement, mostly for clarity:

      Minor comments:

      1. Figure 1A: While the E14 and E17 horizontal sections are informative, the addition of the E12 horizontal section does not provide further information. It would be better to place the inset and the whole image side by side, rather than having them far apart across the whole figure. For Figures 1C-D, is it possible to include horizontal sections for chick at

      E14 and Gecko at 45 dpo, as shown in the subsequent images?

      Our responses

      In Figure 1A, we will replace the current figure with a new one that visually enhances the comparison by placing the inset and the full image side by side. we will also add new horizontal sections of the whole image for chicken E14 and gecko 45 dpo, obtained from μCT tomography images and HE staining, to improve visibility between the images.


      When comparing across species it is sometimes helpful to use a standard staging system so that different developmentally staged tissue can be compared. A timeline of how embryonic day or dpo equates to stage might be helpful.

      Our response

      To clarify the developmental stages, I plan to incorporate a time scale into the graphical abstract in the revised manuscript.


      Figure 2B: It is difficult to discern the perspective without a full, lower power section of Gecko at 45 dpo. Adding a full image with an inset would be helpful. In Figure 1C, it would be helpful to define the magnified area by placing a box on the low magnification image.

      Our responses

      We plan to add a low-magnification μCT tomography image or HE-stained whole image of the gecko at 45 dpo in the revised manuscript. As for Figure 1C, it has already been included in the preprint.


      Figures 3B-E: Include the staining methods used for these sections.

      Our response

      We plan to add a note specifying that the image is stained with HE.


      Figure 4B: Add a low magnification image with an inset. The current image is a bit confusing as it is unclear what is being shown.

      Our responses

      We plan to add a low-magnification image showing the entire section and use an inset to indicate the positional relationship of the section's plane in a schematic diagram.

      Figures 6A-E: It would be helpful to denote the genotype as Satb2+/- or heterozygous, rather than Satb2 WT/del, which can be confusing. Ensure consistency in genotyping notation throughout all figures. It is noted that some are CRISPR knockdown and could be denoted as such.

      Our responses

      For CRISPR knockdown, I will adopt the term "CRISPANT" in the revised manuscript. This terminology will be used consistently throughout all figures to avoid confusion in genotype notation.


      Reviewer #4 (Significance (Required)):

      The corpus callosum evolved only in eutherian mammals and its development relies critically on an earlier developmental process known as interhemispheric remodeling. Nomura and colleagues investigate the evolution of these processes and identify that interhemispheric remodeling occurs in reptiles and birds and was therefore already present in the common ancestor of amniotes. This highly conserved developmental process likley evolved early and provided a substrates for major commissures to form throughout evolution.

      3.____Description of the revisions that have already been incorporated in the transferred manuscript.

      Currently we do not incorporate the revision in the transferred manuscript.


      __ Description of analyses that authors prefer not to carry out__

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Major

      That similar cell types contribute to remodelling in non-mammalian amniotes as mice/eutherian mammals. The microphotographs presented are not of very high quality, and it is often difficult to be convinced that the data is showing the strong claims made in the paper. For instance the "MZG-like cells" may in fact be astrocytes or another cell type as it is hard to visualise morphology, and the "intercalation of GFP-positive radial glial fibres" is very unclear from the photos. The colocalization of MMPsense with laminin positive cells is very hard to appreciate from the figure, and again not quantified. Similarly, there is a claim that there was degeneration of laminin-positive leptomeninges during astroglial intercalation, which is an active process that is difficult to infer from a single microphotograph. From the data, I can appreciate that some of the similar broad categories of cell types that exist at the mouse midline____ ____(glia, radial glia) are also present in non-mammalian amniote midlines, but it is difficult to be convinced of much more than this from the data presented.

      Our responses

      We are confident that this paper provides sufficient evidence that cell types similar to those in non-mammalian amniotes, mice, and placental mammals contribute to interhemispheric remodeling and that glial fiber intercalation occurs. This point is also supported by other reviewers.

      In the present study, we have not conducted the MMPsense experiments with the aim of showing the co-localization of MMPsense and laminin-positive cells or pia mater. Contrary to the reviewer's claim, it is important that the non-continuous regions of MMPsense and laminin-positive areas (pia mater), which are extracellular components, are adjacent to each other. Unfortunately, establishing a quantification system using MMPsense is practically impossible.

      Major

      The implication that Satb2 expression at the midline is necessary for appropriate interhemispheric remodeling. Alternative hypotheses for an inappropriately remodeled midline upon whole-brain Satb2 knockout is that it is not dependent on expression at the midline region. Rather, it could be that, for example, the appropriately timed interaction between ingrowing callosal axons and the midline territory is needed for the timely differentiation and/or behavior of midline cells. Other alternatives include that the lack of axonal midline crossing changes the morphology of the midline territory, including potentially "unfusing" the midline. Given the high prevalence of midline remodelling defects concomitant with callosal agenesis referred to be the authors in the literature, it seems like these alternatives would be worth considering. Indeed, the only article the authors reference in their statement that "several studies implicated that agenesis of CC in Satb2-deficient mice is also associated with defects in midline fusion" is an article where Satb2 was knocked out specifically in the cortex and hippocampus. This result is difficult to interpret, as some Emx1 promotors do label some of the midline territory, however the point stands that it is difficult to interpret solely that Satb2 action at the midline is responsible for the effects. I understand that this is a hard question to investigate, so I would suggest allusion to the alternative hypotheses/interpretations as the main priority when interpreting the data.

      Our responses

      This study does not aim to demonstrate the detailed molecular function of Satb2 in the developmental processes of the corpus callosum or pallial commissure. We plan to clearly state this point in the revised manuscript and focus on the findings obtained as a result. Based on the histological relationships, we will classify interhemispheric remodeling and consider adding a section in the Discussion to identify the common character identity mechanisms underlying the development of the pallial commissure and corpus callosum. This addition will help provide a more detailed understanding of the remodeling mechanisms. As is well known, discussions of homology are complex, and we understand that providing concrete evidence is even more challenging. When discussing homology, we will emphasize that it must be handled cautiously, and that discussions on molecular features and homology will rely heavily on future research. As an alternative, we plan to position the results of Satb2 Crispants in mice and geckos as evidence of the disruption of character identity mechanisms. By incorporating this perspective into the revised manuscript, we believe it will deepen our understanding of the role of Satb2 and its molecular mechanisms.

      Reviewer4

      Minor comment 7. There is very valuable data in the supplementary figures. As suggestion is to incorporate Supp. figures S1, S2 and S5 in the main figures.

      Our responses

      Due to space constraints, we plan to move only Supplementary Figure S5 to the supplementary section, and Figures S1 and S2 will not be included in the main figures of the revised manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Overall, this is a well-written manuscript focusing on the evolution of mid-line interhemispheric fusion related to corpus callosum development and evolution from amniotes to eutherian species. The authors also demonstrated that Satb2 plays a critical role in interhemispheric remodeling, which is essential for corpus callosum development. This is a nicely organized and interesting study and the data are compelling. The following are suggestions for improvement, mostly for clarity:

      Minor comments:

      1. Figure 1A: While the E14 and E17 horizontal sections are informative, the addition of the E12 horizontal section does not provide further information. It would be better to place the inset and the whole image side by side, rather than having them far apart across the whole figure. For Figures 1C-D, is it possible to include horizontal sections for chick at E14 and Gecko at 45 dpo, as shown in the subsequent images?
      2. When comparing across species it is sometimes helpful to use a standard staging system so that different developmentally staged tissue can be compared. A timeline of how embryonic day or dpo equates to stage might be helpful.
      3. Figure 2B: It is difficult to discern the perspective without a full, lower power section of Gecko at 45 dpo. Adding a full image with an inset would be helpful. In Figure 1C, it would be helpful to define the magnified area by placing a box on the low magnification image.
      4. Figures 3B-E: Include the staining methods used for these sections.
      5. Figure 4B: Add a low magnification image with an inset. The current image is a bit confusing as it is unclear what is being shown.
      6. Figures 6A-E: It would be helpful to denote the genotype as Satb2 +/- or heterozygous, rather than Satb2 WT/del, which can be confusing. Ensure consistency in genotyping notation throughout all figures. It is noted that some are CRISPR knockdown and could be denoted as such.
      7. There is very valuable data in the supplementary figures. As suggestion is to incorporate Supp. figures S1, S2 and S5 in the main figures.

      Significance

      The corpus callosum evolved only in eutherian mammals and its development relies critically on an earlier developmental process known as interhemispheric remodeling. Nomura and colleagues investigate the evolution of these processes and identify that interhemispheric remodeling occurs in reptiles and birds and was therefore already present in the common ancestor of amniotes. This highly conserved developmental process likley evolved early and provided a substrates for major commissures to form throughout evolution.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Conserved interhemispheric morphogenesis in amniotes preceded the evolution of the corpus callosum. Kaneko et al., 2025

      The CC is formed exclusively in placental mammals. In other amniotes species, the communication of the two hemispheres is mediated by other structures such as the anterior commissure or the hippocampal commissure. The authors perform anatomical comparisons between species to conclude that interhemispheric fissure remodeling, a prior developmental step for CC formation, is highly conserved in non-mammalian amniotes, such as reptiles and birds. They suggest that might have contributed to the evolution of eutherian-specific CC formation. In an attempt to test their hypothesis, the authors investigate the role of Satb2 in interhemispheric fissure remodeling. They show IH fissure defects in both mice and geckoes. This is a nice manuscript that bridges a gap in the current understanding of CC formation. The study is mostly anatomical and directed at a specialized community.

      I suggest some changes that might contribute to improving the manuscript.

      Main

      1. Much of the most important conclusions are extracted from the anatomical observation of the dynamics of IHF closure and the emergence of the Hinge. It is very clear that the researchers are specialists in the field but for a broader audience, the images they provide are not always easy to interpret. It takes a lot of effort to visualize the anatomical data they use for their conclusions. As an example, perhaps the authors can find ways to explain how to identify the hinge specifically. It is very clear what the hinge is in the schemes (drawings)but forms one picture to the other at different developmental stages neither in the same animal species nor from different species. In Figure 1, it is difficult to see how the hinge in the mouse is similar (i.e. the same structure) to the hinge in the Gecko and chick. Moreover, in panels C , chick brain sections are shown at much greater magnification than the gecko, and thus is very difficult In addition, in the manuscript text, the authors refer to sequential sectioning, but only one image for each stage is shown. They can show more images in supplementary Figures, otr they can just explain that they show the relevant images of the sectioning. As another example, in Fig2A, in the text, the authors explain that they detect the same specific glial components, but the images show very different co-localizations and distributions. In Figures 1 and 3, there are lines indicating Dorsal to ventral. This refers to the sectioning but in reality, what the sections are illustrating is the anterior-to-posterior differences in the IHF. maybe they can clarify it, because at quick sight it can be confusing.
      2. The authors have to revise the manuscript text to be more precise. For example, In the result section quote "To address whether the interhemispheric remodeling in non-mammalian amniotes is dependent on midline glial activities, we next examined the expression of several glial markers in the reptilian and avian midline regions". the anatomical comparison does not address the role of glial.
      3. As an option to increase the relevance of their work, the authors might want to consider to describe in more detail and moving the results of the RNAseq and the analysis of the Stab2 mutants to the main figures.

      Minor:

      Please indicate the length of the scale bars in the figure legends, and not only in the figure panels

      Fig5 .Indicate the animal model in the panel

      Perhaps they can draw a model of the different mechanisms of caudal and anterior remodeling.

      Significance

      The study addresses a gap in knowledge from an evolutionary perspective. It provides novel hypotheses and an innovative framework for the understanding of cortical development and evolution. however, most of the conclusions are inferred from anatomical observations and the experimental testing of the hypothesis (Mutants and RNAseq analysis) are minor part of the study that could be further developed. The study is interesting for investigators with expertise in brain development and evolution but requires familiarity with comparative anatomy and even then it is difficult to go through the work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors provide a comparative analysis of interhemispheric (IHF) remodeling and its potential role in the generation of commissural axons. Based on histological material from mice, chickens, turtles, and geckos, the IHF remodeling of the midline is divided in two events: caudal and rostral. It is suggested that the rostral event is a preliminary step to the crossing of commissural axons, as it is characteristic of eutherian mammals with a corpus callosum (CC). However, the authors describe similar histologic features in other amniotes during development, particularly reptiles. This is in contrast with the case of the chick, which does not show signs of IHF remodeling nor a rostral pallial commissure. Additionally, deficient transgenic mice and geckos illustrate a potential role of Satb2 in rostral IHF remodeling and subsequent commissural formation. Whereas the topic and the conclusions of the analysis are interesting and provide new knowledge to the evo-devo field, several issues should be addressed prior to publication, such as data precision and presentation to support the main statements in the manuscript.

      Major comments:

      • A central point of this article is the splitting of the IHF into rostral and caudal events. The authors suggest that each one can be regulated differentially, and they attribute the rostral remodeling as a step prior to corpus callosum (CC) formation, in contrast to the caudal remodeling. In my opinion, these two events are not sufficiently characterized either in the figures or the manuscript. It is necessary to better describe these two processes that the authors mention. For instance, the authors could add or re-organize information in Figures 1-3 to include wide-field images showing the whole septum from rostral to caudal, and representative dorsoventral sections at important stages (with insets pointing at specific features). Otherwise, a table summarizing the rostral and caudal events would also be helpful to the reader.
      • When the authors refer to the reptilian rostral pallial commissure (RPC) and caudal pallial commissure (CPC), are these the same structures as the pallial commissure and anterior commissure described by Lanuza and Halpern (1997), Butler and Hodos (2005) and Puelles et al. (2019)? It is necessary to clarify the nomenclature, given that they are providing data from several species. Also, structures with the same names among species may not be truly homologous. A simple atlas with some horizontal and transverse planes highlighting anatomical landmarks and important structures (commissural tracts in this case) of the non-mammalian species would be extremely useful for the reader.
      • I wonder if the authors tested Fgf8 as marker on any of their sauropsidian tissue samples, as this gene has a known role in murine MZG development, which is required for IHF remodeling (Gobius et al. 2016, already cited in the manuscript). It would be beneficial to test this marker for the study, and if positive, it would open the possibility of designing loss-of-function experiments in avian or reptilian development models to identify mechanisms common to eutherians and support the statements of this work
      • It would be really interesting to provide a more elaborate discussion on whether authors consider the sauropsidian IHF as a homologous process to eutherian IHF, and the reptilian RPC as an homologous of the CC.
      • Data and methods are presented in such a way that, in principle, they could be reproduced. Authors should indicate the number of animals/replicates of each species used in each experiment.

      Minor comments:

      • In the results section, paragraph 2, line 3: "We detected the accumulation of GFAP-positive cells and phosphorylated vimentin (Ser55) -positive mitotic radial glia in the IHF and telencephalic hinge in developing turtles, geckoes and chicks (Figure 2A)". Figure 2A shows sections from the four analyzed species labeled with radial glia markers at the end of the IHF remodeling. It would be beneficial to have analogous sections at several time points (perhaps before or after the process) to compare and show more clearly the accumulation of glial cells at that location.
      • The article will improve its quality by adding more comparative information in the introduction about the analyzed sauropsidian structures (rostral pallial commissure and caudal pallial commissure), their relations with the pallial and anterior commissures, the structures/cells connected by them, and homologies previously proposed.
      • In Figure 1 panels A-D, there is a lot of disparity in brain sizes and scales both between sections of the same species and between species. Placing the insets next to their source images is very necessary for clarity.
      • In the results section, paragraph 2, line 11: "In addition, it was suggested that astroglial intercalation occurs in conjunction with the aforementioned regression of the IHF from st.21 to st.26 in the developing turtle (Figure 2C)." In Figure 2C, all images are at different scales, which makes it very hard to properly compare between stages.
      • In Figure 2D, the authors show the presence of MMP around the leptomeninges, suggesting MMP-mediated degradation. In the images, MMP labeling is revealed in dark blue, which is largely invisible against the black background. Colors should be used properly to allow visualization of this MMP labeling.
      • In Figure 4, it would really help if the authors provided wide-field images and DAPI counterstaining of the anterograde and retrograde tracings, to provide anatomical landmarks that help readers to identify the midline and understand the orientation of images.
      • In Figure 5B, I understand that the images in the red and blue squares correspond to brain areas in the squares in A. However, some confusion remains, especially with the image in B, which does not seem to be at the same angle as in the diagram representation. This makes it difficult to understand the results.
      • In Figure 6D, to better visualize defects in the RPC formation, the asterisk in the middle of the deficient structure needs to be replaced with a more lateral arrow pointing to the malformation.
      • In Figure S5, violin plots in panel C do not correspond with data in A and B. This needs correction or clarification.
      • In the article, a section appears solely to explain spatial transcriptomics results in a chick coronal section. The conclusion of this experiment is that three markers associated with midline remodeling are present in chick, suggesting that interhemispheric remodeling is conserved between mouse and chick. As these are complementary results and are not deeply analyzed in this manuscript, I think it would be better to summarize these findings in a dedicated paragraph and transfer some of the key images from Figure S2 to one of the main figures. Other problems with Figure S2: color contrast between clusters in the tSNE projection in B is very poor, should be enhanced; color intensity in FeaturePlots of panels D-F is too weak, and it seems that there is not really much expression at all in any cluster for any of these genes.

      Significance

      The authors identify in the developing brain of sauropsids an event similar to IHF remodeling in eutherians, and suggest a causal relation between the rostral IHF remodeling and the formation of the pallial commissure in reptilian brains. This implies a potential homology between the pallial commissure and the corpus callosum of placental mammals. If this is the intention of the authors, this conclusion should be addressed explicitly and at length in the Discussion section. Whereas the results and conclusions described in the manuscript will be valuable in the field, the data presented in the manuscript needs quite some improvement, particularly for some of the images in the previously mentioned figures. Otherwise, the original data cannot be properly judged and may set reasonable doubt to readers.

      Advance: The findings described in this report are new to my knowledge. The description of the IHF remodeling event prior to corpus callosum development in mice has been published (Gobius et al. 2016, Cell Reports), but not in other mammalian branches or non-mammalian vertebrates. For this reason, the data in this report should be very convincing and better presented.

      Audience: This research will be interesting for a specialized and basic research audience, particularly for researchers in the evo-devo fields.

      My expertise: neuroanatomy, development, evolution, brain, cerebral cortex

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors study several valuable developmental series of non-mammalian amniotes, reaching the conclusion that interhemispheric remodeling occurs in these species and that it is dependent on transcription factor Satb2

      Major comments

      Unfortunately the major conclusions of the article are not well supported by the provided data. Including:

      1. That interhemispheric remodelling occurs in non-mammalian amniotes. It would not surprise me that this may be the case, however, the major evidence for this is a series of horizontal insets that do not evidence this point well. There are broad morphological changes during development that can change the proportions and regionalisation of tissue, and therefore the IHF becoming apparently smaller as development progresses (qualitatively, in single sectioning planes, and without clear n numbers) may easily be explained by sutble differences in sectioning planes, or, for example, more caudal territories of the brain expanding at faster rates than the rostral territories. Quantification of the ratio between the IHF and total midline length across ages and between species may go some way to helping to clarify the degree of potential midline remodelling. Very high quality live imaging of the process would be the definitive way to evidence the claim, although I appreciate this is highly technically difficult and may not be possible. A key opportunity seems to be missed in the Satb2 knockout geckoes, where midline remodelling is purported to not occur. This is shown only qualitatively in a single plane of sectioning and again is not convincing. If the IHF length in these animals was quantified to be longer than wildtype at a comparable age, this would help to evidence the claim that remodelling occurs in these species.
      2. That similar cell types contribute to remodelling in non-mammalian amniotes as mice/eutherian mammals. The microphotographs presented are not of very high quality, and it is often difficult to be convinced that the data is showing the strong claims made in the paper. For instance the "MZG-like cells" may in fact be astrocytes or another cell type as it is hard to visualise morphology, and the "intercalation of GFP-positive radial glial fibres" is very unclear from the photos. The colocalization of MMPsense with laminin positive cells is very hard to appreciate from the figure, and again not quantified. Similarly, there is a claim that there was degeneration of laminin-positive leptomeninges during astroglial intercalation, which is an active process that is difficult to infer from a single microphotograph. From the data, I can appreciate that some of the similar broad categories of cell types that exist at the mouse midline (glia, radial glia) are also present in non-mammalian amniote midlines, but it is difficult to be convinced of much more than this from the data presented.
      3. That the gecko RPC and CPC connect distinct parts of the brain (rostral and caudal). These tracer injections lacked visualisation of the deposition site to confirm specificity, as well as appropriate quantification. Importantly, the absence of axons in the CPC following the rostral dye deposition (and vice versa) was not shown, which is essential to make the claim that these commissures carry axons from specific parts of the brain. The alternative hypothesis is that all axons are intermixed and traverse both commissures, independent of brain area of origin, which is not at all tested or disproved by the data presented.
      4. The implication that Satb2 expression at the midline is necessary for appropriate interhemispheric remodeling. Alternative hypotheses for an inappropriately remodeled midline upon whole-brain Satb2 knockout is that it is not dependent on expression at the midline region. Rather, it could be that, for example, the appropriately timed interaction between ingrowing callosal axons and the midline territory is needed for the timely differentiation and/or behavior of midline cells. Other alternatives include that the lack of axonal midline crossing changes the morphology of the midline territory, including potentially "unfusing" the midline. Given the high prevalence of midline remodelling defects concomitant with callosal agenesis referred to be the authors in the literature, it seems like these alternatives would be worth considering. Indeed, the only article the authors reference in their statement that "several studies implicated that agenesis of CC in Satb2-deficient mice is also associated with defects in midline fusion" is an article where Satb2 was knocked out specifically in the cortex and hippocampus. This result is difficult to interpret, as some Emx1 promotors do label some of the midline territory, however the point stands that it is difficult to interpret solely that Satb2 action at the midline is responsible for the effects. I understand that this is a hard question to investigate, so I would suggest allusion to the alternative hypotheses/interpretations as the main priority when interpreting the data.

      Overall, the major conclusions of the study are not well supported by the data. A major effort to quantify phenomena and/or dramatically soften conclusions would be needed in order to make the conclusions well supported.

      Minor comments

      1. The n numbers are not always clearly reported
      2. At times important points reference reviews or articles that do not support the statements as well as the most important primary articles might.
      3. Figures showing the entire section that insets were taken from would help to convince that sectioning planes were equivalent, and also show the deposition site of neurovue experiments.
      4. The fibre direction of GFAP+ fibres in figure 6 is confusing - It seems from the labelling on the figures as if red is used for the WT condition in mouse, but for the Satb2del condition in Gecko? If this is the case, then it would appear that the fibres are more specifically oriented in the del condition in mice, but in the WT condition of geckoes? There are several instances of this where clearer description and labelling would help the reader to interpret the results.

      Significance

      This study attempts to address a highly significant, novel and important question, that, if well achieved, would be publishable at a high degree of interest and impact to the basic research fields of brain development and evolution. Unfortunately the major conclusions made by the study are stronger than the data provided is able to evidence, and I remain unconvinced by many of them.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This study by Zimyanin et al. examines the role of the C. elegans chromokinesin KLP-19 in the formation and architecture of the anaphase central spindle in C. elegans zygotes. Through a combination of electron and light microscopy, along with RNAi-mediated perturbations, the authors propose that KLP-19 influences central spindle stiffness by regulating microtubule dynamics.

      In Figure 5, the effect of KLP-19 depletion on central spindle microtubules appears unconvincing. The FRAP results show no significant difference with or without KLP-19, and overall microtubule density does not consistently respond to its depletion. Additionally, the double klp-19; gpr-1/2 (RNAi) condition does not exhibit a strong increase in microtubule density, though a statistical test is missing for this condition. Furthermore, the spd-1; gpr-1/2 double depletion produces a similar increase in microtubule density to most klp-19 depletion conditions, suggesting that the effect cannot be solely attributed to the absence of KLP-19.

      Figure 5A shows that depletion of KLP-19 leads to an increase in tubulin signal in the spindle midzone. The reviewer is correct, that there are differences in the microtubule density between KLP-19 depletion alone and KLP-19 + GPR-1/2 depletion. While depletion of KLP-19 alone leads to a significant increase, co-depletion of GPR-1/2 and KLP-19 leads to a slight, but not significant increase. Along this line, we have added Supplementary Table 1 that contains all p-Values for the different conditions for Figure 5A. However, depletion of GPR-1/2 alone does not affect the microtubule density in the midzone, arguing that changes in pulling forces do not affect the microtubule density in the midzone. It is possible, that the double RNAi leads to a decrease in efficiency and thus a reduced effect on microtubule intensity. We will demonstrate the RNAi efficiency by western blot. Another possibility is that there are some feedback mechanisms that responds to presence/ absence of pulling forces and some of our data (not from this manuscript) hints in this direction, but we have not yet worked out the details of this. We are planning to publish this in a follow up publication.

      • *

      In response to the spd-1 + gpr-1/2 (RNAi), the reviewer is correct, that the microtubule density in the midzone is not significantly different from klp-19 (RNAi) conditions and we think it is interesting to note that spd-1 + gpr-1/2 (RNAi) leads to an increased microtubule density in the midzone. This could be, as above mentioned caused by some feedback mechanisms that responds to pulling forces, or also due to some functions of SPD-1 that affects microtubules in the midzone. Interestingly, our data also shows that metaphase spindles are significantly shorter in the absence of SPD-1 in comparison to spindles in control embryos, suggesting that SPD-1 plays a role in regulating microtubules or force transmission. We are currently working on understanding SPD-1's role in this process.

      • *

      We also agree that there is no significant effect on the microtubule turn-over as shown in Figure 5B and we have stated this in the text. Our data does show a trend to a decreased turn-over, but the difference is not significant. This could be due to the low sample number.

      • *

      Overall, we think our data, the light microscopy and even more so the EM data does show a clear effect on midzone microtubules.

      • *

      The use of hcp-6 depletion to argue that KLP-19 depletion affects central spindle elongation independently of stretched chromatin is problematic. hcp-6 encodes a component of the Condensin II complex in C. elegans, and its depletion leads to chromatin decompaction rather than the stretched, dense chromatin observed in the midzone during anaphase in klp-19 (RNAi) embryos. These conditions are not equivalent and do not effectively rule out the possibility that chromatin stretching contributes to the observed phenotype.

      We agree with the reviewer that the HCP-6 experiments do not entirely rule out effects from lagging chromosomes. Proving that the reduced spindle and chromosome separation is not due to lagging chromosomes is challenging. Most of the depletions that lead to lagging chromosomes are based on defective kinetochore microtubule connections, such as depletion of KNL-1, NDC-80 or CLS-2 (CLASP). In C. elegans, this leads to the mass of Chromosomes staying behind in anaphase and increased spindle pole separation, which is not comparable to KLP-19 depletion. Perturbations that do not affect kinetochore microtubules but still lead to lagging chromosomes are often targeting cohesin or condensin. Ultimately none of these conditions are directly comparable.

      A probably better way to test this would be to deplete KLP-19 only after metaphase to prevent its effect on chromosome alignment. However, this is currently not possible as the time window is about 1 minute or less. We currently do not have the tools to conduct this type of experiment. As other reviewers also criticized this experiment and its significance for the paper, we have removed this entirely and have added the following part to the discussion about the potential effect of lagging chromosomes:

      " *We can not unambiguously rule out that failure to properly align chromosomes and the resulting lagging chromosomal material could also lead to some of the observed effects on spindle dynamics, such as slow chromosome segregation and pole separation rates as well as preventing spindle rupture in absence of SPD-1. However, several observations argue in favor of KLP-19 actively changing the midzone cytoskeleton network and thus affecting spindle dynamics. *

      Most of the protein depletions in C. elegans that lead to lagging chromosomes are based on defective kinetochore microtubule connections, such as depletion of CeCENP-A, CeCENP-C, KNL-1 or NDC-80 (70-72). This mostly leads to the Chromosome mass staying behind in anaphase and increased spindle pole separation (70-72), which is not comparable to KLP-19 depletion. Perturbations that do not affect kinetochore microtubules but still lead to lagging chromosomes are often targeting cohesin or condensin, which depletion leads to chromatin decompaction (73-74) rather than the stretched, dense chromatin as observed in the midzone during anaphase in klp-19 (RNAi) embryos. Ultimately none of these conditions are directly comparable, making it difficult to completely rule out an effect of lagging chromosomes. A better way to test this would be to deplete KLP-19 only after metaphase to prevent its effect on chromosome alignment. However, this is currently not possible as the time window is about 1 minute or less and we do not have the tools to conduct this type of experiment.

      *Based on our results we hypothesize that the observed spindle dynamics in absence of KLP-19 are not only caused by lagging chromosomes. Instead, KLP-19 RNAi results in a global rearrangement of the spindle and leads to a significant reduction of the spindle size, microtubule overlap, growth rate, and stability. Furthermore, the increase of microtubule interactions after klp-19 (RNAi) could also contribute to lagging of chromosomes and exacerbation of fragmented extrachromosomal material." *

      Additionally, the authors report that KLP-19 influences astral microtubule dynamics (Figure 5E), yet in Figure 3E, they show that KLP-19 localizes exclusively to kinetochores and spindle microtubules, excluding astral microtubules and spindle poles. How do they reconcile this apparent contradiction?

      We think that KLP-19 localizes also to astral Microtubules. Our KLP-19 GFP CRISPR line is very dim and this makes it hard to see. We are proposing to use a TIRF approach to image KLP-19 GFP on the C. elegans cortex, which we will include in the revised version. In addition, in support of our hypothesis of KLP-19 binding to astral Microtubules as well we would like to note that there is a PhD thesis available from Jack Martin in Josana Rodriguez Sanchez's Lab in Newcastle (LINK, will lead to a download of the thesis! ) that has reported KLP-19s localization to cortical Microtubules in C. elegans. In this thesis the author also reports an effect on astral microtubule growth.

      Figure legends lack consistency and do not adhere to standard C. elegans nomenclature conventions (e.g., protein names should not be capitalized, and genetic perturbations should be italicized). Standardizing these elements would improve clarity and readability.

      We have checked our figure legend and to our best knowledge the legends adhere to the C. elegans nomenclature. All RNAi conditions are lower case italicized and Protein names are capitalized as it is standard in other C. elegans publications. We have however noticed some variation in our Figures, i.e. EB-2 instead of EBP-2 and have corrected this in all figures.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Zimyanin et al, Chromokinesin Klp-19 regulates microtubule overlap and dynamics during anaphase in C. elegans.

      The authors used a myriad of techniques, including confocal live-cell imaging, 2-photon microscopy, second harmonic generation imaging, FRAP, microfluidic-coupled TIRF, EM-tomography, to study spindle midzone assembly dynamics in C. elegans one-cell stage embryos. In particular, they illuminated the role of kinesin-4 KLP-19 in maintaining proper midzone length and organization. Inhibition of KLP-19 results in longer more stable midzones, implying KLP-19 functions in depolymerizing microtubules.

      Indeed, much of the results in the current study are consistent with previously published results elsewhere. Nevertheless, the current work represents a tour-de-force showcase of diverse and state-of-the-art technology application to address spindle assembly dynamics. How KLP-19 functions to define microtubule length at the midzone is still not known. But the current work, with diverse and solid data, serves to highlight where future work should focus.

      Minor comments:

      Fig 3E / There is an unusual diagonal line bisecting the embryo. Visually this does not affect viewing of the His::GFP and KLP-19::GFP signals. However, when these signals are quantified and normalized (as in Fig 3F), the diagonal bisect displaying different background signal may impact the measurements.

      We are very sorry about this line in the images. The line is due to a defect in the camera chip of the spinning disc. We will acquire new images for this Figure using our new spinning disc microscope.

      Fig 4B / While the kymographs clearly show KLP-19::GFP motility on microtubules, they also show that the majority of KLP(-::GFP do not move. Perhaps some quantification and discussion of this result is appropriate?

      The reviewer is correct that only a small fraction small fraction of molecules, maybe ~10%, moves. We will add this quantification to the paper and discussion. This could be due to several reasons: Many of the non-moving particles are not visibly colocalized with microtubules, which could mean they are sticking non-specifically to the surface (or sticking to small tubulin aggregates that aren't long enough to support movement). In addition, as this experiment is done in a lysate it is hard to interpret if the immobile KLP-19 is not moving because other proteins are bound along the microtubule blocking its way or if the KLP-19 requires some activation (i.e. phosphorylations) to become mobiles. We think this could be very interesting and will follow up on this in the future.

      • *

      Reviewer #2 (Significance (Required)):

      Indeed, much of the results in the current study are consistent with previously published results elsewhere. Nevertheless, the current work represents a tour-de-force showcase of diverse and state-of-the-art technology application to address spindle assembly dynamics. How KLP-19 functions to define microtubule length at the midzone is still not known. But the current work, with diverse and solid data, serves to highlight where future work should focus.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      The anaphase spindle midzone is an essential structure for cell division. It consists of antiparallel overlapping microtubules organized by the antiparallel microtubule bundler PRC1, molecular motors and other regulatory proteins. This manuscript investigates the role of KLP-19 (C. elegans ortholog of human kinesin-4 KIF4A) and SPD-1 (C. elegans ortholog of PRC1) for spindle midzone organization in the C. elegans embryo and its relevance for proper spindle function. Advanced fluorescence microscopy, 3D electron tomography, and a fluorescence microscopy-based single molecule assay in embryo lysate are used in a unique combination. The authors confirm several aspects of PRC1 and KIF4A function in anaphase, as reported in previous work, mostly in human cells and Drosophila embryos and also in C. elegans embryos. Measurements are mostly very quantitative and to a high quality standard. The main difference to previous conclusions is that here, the authors propose that KLP-19 does not interact with SPD-1, in contrast to what has been established for other animal kinesin-4s and PRC1, and instead localizes to the spindle midzone independently of PRC1 by a mechanism that remains unknown. The authors provide evidence that KLP-19 nevertheless controls microtubule overlap length as in other species and that it produces outward forces sliding midzone microtubules apart a movement that SPD-1 counteracts (presumably by friction). The manuscript presents a rich resource of carefully measured quantitative structural and dynamic C. elegans anaphase spindle data.

      Major comments:

      Key conclusions convincing?

      (1) The key conclusions that the length of the central anaphase spindle microtubule overlap remains constant as the C.elegans spindle elongates (Fig. 1), that PRC1 indeed localizes quite precisely to these overlaps as previously assumed based on its in vitro (purified protein) behavior (Fig. 3B) and that the kinesin-4 KLP-19 controls overlap length as in other species (Fig. 3B) are all convincingly shown. What's missing are quantitative KLP-19 together with microtubule polarity profiles in the presence and absence of SPD-1, leaving it unclear to which extent this kinesin localizes to microtubule overlaps in the two situations. Such data seem crucial, given the authors' claim that KLP-19 localizes to the midzone and that this localization of KLP-19 is mostly unaffected by the absence of SPD-1.

      If we understand this correctly the reviewer is asking for second harmonic imaging (SHG) together with imaging of KLP-19 GFP. This is currently not possible due to the way this imaging must be done (2-photon of GFP-Tubulin followed by the SHG). The only thing we can do is provide KLP-19 GFP profiles for control and SPD-1 depleted embryos. We can also use the line co-expressing SPD-1 Halo-tag and KLP-19 GFP to plot their respective localizations in control conditions. We are happy to provide such plots. Generally, we see KLP-19 going to the midzone in absence of SPD-1 and the SHG data does show that the overlap is increased. If KLP-19 specifically localizes to microtubule overlap (rather to i.e. microtubule ends) can currently not be distinguished in the spindle midzone. In vitro data from other labs and our in vitro assay suggests that KLP-19 does not specifically bind to antiparallel overlaps but rather microtubules in general.

      (2) 'Normalized KLP-19 intensities' are used to demonstrate that the total amount of this kinesin localizing to the spindle midzone does not depend on the presence of SPD-1 (Fig. 3F). Given that this claim represents a major novelty of the study, the efficiency of the SPD-1 knock-down should be documented, ideally by western blot and fluorescence microscopy.

      We agree with the reviewer and will provide western blots.

      (3) The authors show convincingly that the kinesin KLP-19 contributes to outward microtubule sliding (and can contribute to spindle rupture in the absence of SPD-1) (Fig. 2), which is interesting and in line with the author's main claim.

      (4) The interaction between KIF4a and PRC1 is well established in other species and has been clearly demonstrated both in cells and in vitro (with purified proteins). The authors claim that this concept does not apply to the C. elegans orthologs. To show 'in vitro' (outside of the spindle) that the C. elegans homologs KLP-19 and SPD-1 do not interact, the authors use a novel microfluidic fluorescence-based single-molecule assay in lysate (Fig. 4). Although very original, these experiments do not reach the biochemical standard of previous experiments with purified proteins without appropriate controls. Given that the lysate setup is fairly novel, it's advisable to present at least one positive control demonstrating that interactions between soluble proteins can indeed be detected using this assay. It would also be useful to show the absence of interaction between KLP-19 and SPD-1 by a more conventional method like co-IP, again with a positive control, to support the authors' claim. Eventually, experiments with purified proteins will have to unequivocally demonstrate whether KLP-19 and SPD-1 indeed do not interact - something which appears, however, to be beyond the scope of this study. Without additional experimental proof, the authors may want to indicate that these results are of more preliminary nature.

      *We agree with the reviewer, and we will conduct co-IPs of SPD-1 and KLP-19. We will also add CYK-4 as a positive control as previous publications have shown the interaction of CYK-4 with SPD-1. We are now generating lines co-expressing CYK-4 GFP and SPD-1 Halo-tag for the co-IP experiments. *

      (5) Unfortunately, the authors do not propose an alternative mechanism for KLP-19 localization to the midzone in SPD-1 depleted embryos, limiting the conceptual advance. Does KLP-19 bind directly to antiparallel microtubules, for example in the assay presented in Fig. 4 (where signs of microtubule crosslinking are shown for SPD-1)? If not, how would it accumulate in the midzone (if it does) in the C. elegans embryo anaphase spindle? The authors do also not propose a mechanism explaining why central antiparallel microtubule overlap length does not change as the spindle elongates in anaphase. Moreover, there is no discussion regarding the potential mechanism leading to KLP-19 controlling microtubule dynamics globally instead of locally where the motor accumulates, indicating limitations in mechanistic insight.

      *We agree with the reviewer and will add these points to the discussion. *

      *To address some of the points: *

      *How does KLP-19 end up in the midzone? : Our data shows that localization of KLP-19 does depend on AIR-2 and BUB-1 as previously reported. However, those proteins primarily affect the formation of the midzone. The in vitro assay does not suggest that KLP-19 has a preference for overlaps, unlike SPD-1, but rather binds microtubules in general. One possible mechanism of midzone localization could be microtubule end-tagging, as has been suggested for PRC1 (SPD-1 homolog). This could lead to an accumulation of KLP-19 in the midzone. *

      Why does the central overlap stay constant? : This can be explained by constant microtubule growth at the plus-ends why maintaining the overlap length. Alternatively, this could be explained by some (sophisticated) rearrangements of microtubules that ensure the overlap length stays the same. Generally, this is a very interesting question, as each of this scenario still requires that the overlap length is tightly regulated. Our data suggests that this is correlated with microtubule length in the midzone, as KLP-19 depletion leads to longer microtubules and overlap. This suggests that the regulation of microtubule dynamics might be an important factor in this process. We will add this to the discussion.

      • *

      Potential mechanism leading to KLP-19 controlling microtubule dynamics globally: We think that KLP-19 localizes to spindle and astral microtubules and regulates the dynamics on all of those, leading to a global regulation. By increasing it's concentration locally, microtubule dynamics can be regulated in the midzone. We will add data showing the localization of KLP-19 to astral microtubules.

      Claims justified/preliminary and clearly presented?

      The observation that the spindle length remains constant throughout anaphase in C. elegans is based on elegant, but unconventional fluorescence microscopy data (Fig. 1A & B). It would be helpful to add images of SHG and two-photon microscopy to help the reader understand the graphs. Measurements are presented based on distances between the poles. It is unclear why the distances between 15-20 µm were chosen and how they translate to anaphase progression. Can measurements be carried out across the entire duration of cell division to demonstrate that the overlap's 'constant length' property is unique to anaphase? (This could demonstrate already in Fig. 1 that the method in principle is capable of measuring different overlap lengths.)

      We agree with the reviewer and have moved the SHG images from supplementary Fig. 6 to the main Figure 1A for better visibility. In addition, we have added a plot as an inset in (now) Figure 1B and C explanation of how the used spindle pole distances related to the progression through anaphase. Unfortunately, we can only acquire a single timepoint and not a live movie during the SHG.

      Even though the manuscript contains an impressive amount of data, it appears to be lengthy, the motivation for several experiments is not clearly described, and the order of data presentation can probably be improved. For example, it is unclear why SPD-1 profiles are presented late and why KLP-19 profiles are missing - one would expect to see them early on as an essential characterization of the system under study. The motivation of the paragraph investigating the relation of KLP-19 and SPD-1 to HCP-6 is especially unclear (more than 1 page of text describing supplementary material).

      We will go through our text again and will revise the order of presented experiments. As stated above, we have removed the HCP-6 data.

      The absence of interaction between KLP-19 and SPD-1 is not demonstrated to the same quality standard as the presence of interaction between the orthologs in the literature, which should at least be mentioned.

      Additional experiments essential to support the claims of the paper?

      KLP-19 profiles in the presence and absence of SPD-1 seem to be essential.

      We agree with the reviewer and will add this.

      A co-IP of KLP-19 and SPD-1 (including positive control) to prove that the proteins are not interacting would help to support the claim.

      We agree with the reviewer and will add this

      Data and methods presented so that they can be reproduced? Yes.

      Experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments:

      Generating cellular electron tomography data is very laborious. It is a pity that no raw data is provided; for example, a slice of a reconstructed tomogram or a video of whole volumes without segmentation would be an informative addition and allow assessment of the data quality.

      We agree with the reviewer and will add movies of the raw electron microscopy data.

      The clear evidence for direct interaction between PRC1 and kinesin-4 in other species should be correctly acknowledged throughout the text.

      We agree with the reviewer and have corrected this

      The average (mean or median?) values and STDs reported in the text do not appear to match those in Fig. 1D.

      *We thank the reviewer for pointing this out and have corrected the figure. The violin lot showed the mean and percentiles, we have now changed the plot to show mean and STD. *

      • *

      The kymograph of spd-1 RNAi in Fig. 2A seems stretched, and the size based on the scale bar does not fit the values stated in the text.

      We thank the reviewer for pointing this out and have corrected the figure.

      The figure numbering, as stated in the text, does not seem to agree with those in Supplementary Figure 8.

      *We thank the reviewer for pointing this out and have corrected the text. *

      Page numbers and/or line numbers and figure numbers on the figures would help the reader to navigate the manuscript more easily.

      We agree with the reviewer and have added this.

      Reviewer #3 (Significance (Required)):

      The manuscript is a rich resource of quantitative measurements of C.elegans' structural and dynamic spindle properties, using advanced light microscopy and 3D electron microscopy imaging. In large parts, the work confirms previous conclusions of the function of PRC1 and kinesin-4 in the anaphase spindle, but also reports some interesting differences, namely that the C.elegans proteins differ from their orthologs in that they do not interact with each other, raising the question of how the kinesin-4 KLP-19 localizes to the central spindle in this organism. This work is of interest for researchers studying cell division, and specifically spindle architecture, dynamics, and function.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The anaphase spindle midzone is an essential structure for cell division. It consists of antiparallel overlapping microtubules organized by the antiparallel microtubule bundler PRC1, molecular motors and other regulatory proteins. This manuscript investigates the role of KLP-19 (C. elegans ortholog of human kinesin-4 KIF4A) and SPD-1 (C. elegans ortholog of PRC1) for spindle midzone organization in the C. elegans embryo and its relevance for proper spindle function. Advanced fluorescence microscopy, 3D electron tomography, and a fluorescence microscopy-based single molecule assay in embryo lysate are used in a unique combination. The authors confirm several aspects of PRC1 and KIF4A function in anaphase, as reported in previous work, mostly in human cells and Drosophila embryos and also in C. elegans embryos. Measurements are mostly very quantitative and to a high quality standard. The main difference to previous conclusions is that here, the authors propose that KLP-19 does not interact with SPD-1, in contrast to what has been established for other animal kinesin-4s and PRC1, and instead localizes to the spindle midzone independently of PRC1 by a mechanism that remains unknown. The authors provide evidence that KLP-19 nevertheless controls microtubule overlap length as in other species and that it produces outward forces sliding midzone microtubules apart a movement that SPD-1 counteracts (presumably by friction). The manuscript presents a rich resource of carefully measured quantitative structural and dynamic C. elegans anaphase spindle data.

      Major comments:

      Key conclusions convincing?

      1. The key conclusions that the length of the central anaphase spindle microtubule overlap remains constant as the C.elegans spindle elongates (Fig. 1), that PRC1 indeed localizes quite precisely to these overlaps as previously assumed based on its in vitro (purified protein) behavior (Fig. 3B) and that the kinesin-4 KLP-19 controls overlap length as in other species (Fig. 3B) are all convincingly shown. What's missing are quantitative KLP-19 together with microtubule polarity profiles in the presence and absence of SPD-1, leaving it unclear to which extent this kinesin localizes to microtubule overlaps in the two situations. Such data seem crucial, given the authors' claim that KLP-19 localizes to the midzone and that this localization of KLP-19 is mostly unaffected by the absence of SPD-1.
      2. 'Normalized KLP-19 intensities' are used to demonstrate that the total amount of this kinesin localizing to the spindle midzone does not depend on the presence of SPD-1 (Fig. 3F). Given that this claim represents a major novelty of the study, the efficiency of the SPD-1 knock-down should be documented, ideally by western blot and fluorescence microscopy.
      3. The authors show convincingly that the kinesin KLP-19 contributes to outward microtubule sliding (and can contribute to spindle rupture in the absence of SPD-1) (Fig. 2), which is interesting and in line with the author's main claim.
      4. The interaction between KIF4a and PRC1 is well established in other species and has been clearly demonstrated both in cells and in vitro (with purified proteins). The authors claim that this concept does not apply to the C. elegans orthologs. To show 'in vitro' (outside of the spindle) that the C. elegans homologs KLP-19 and SPD-1 do not interact, the authors use a novel microfluidic fluorescence-based single-molecule assay in lysate (Fig. 4). Although very original, these experiments do not reach the biochemical standard of previous experiments with purified proteins without appropriate controls. Given that the lysate setup is fairly novel, it's advisable to present at least one positive control demonstrating that interactions between soluble proteins can indeed be detected using this assay. It would also be useful to show the absence of interaction between KLP-19 and SPD-1 by a more conventional method like co-IP, again with a positive control, to support the authors' claim. Eventually, experiments with purified proteins will have to unequivocally demonstrate whether KLP-19 and SPD-1 indeed do not interact - something which appears, however, to be beyond the scope of this study. Without additional experimental proof, the authors may want to indicate that these results are of more preliminary nature.
      5. Unfortunately, the authors do not propose an alternative mechanism for KLP-19 localization to the midzone in SPD-1 depleted embryos, limiting the conceptual advance. Does KLP-19 bind directly to antiparallel microtubules, for example in the assay presented in Fig. 4 (where signs of microtubule crosslinking are shown for SPD-1)? If not, how would it accumulate in the midzone (if it does) in the C. elegans embryo anaphase spindle? The authors do also not propose a mechanism explaining why central antiparallel microtubule overlap length does not change as the spindle elongates in anaphase. Moreover, there is no discussion regarding the potential mechanism leading to KLP-19 controlling microtubule dynamics globally instead of locally where the motor accumulates, indicating limitations in mechanistic insight.

      Claims justified/preliminary and clearly presented?

      The observation that the spindle length remains constant throughout anaphase in C. elegans is based on elegant, but unconventional fluorescence microscopy data (Fig. 1A & B). It would be helpful to add images of SHG and two-photon microscopy to help the reader understand the graphs. Measurements are presented based on distances between the poles. It is unclear why the distances between 15-20 µm were chosen and how they translate to anaphase progression. Can measurements be carried out across the entire duration of cell division to demonstrate that the overlap's 'constant length' property is unique to anaphase? (This could demonstrate already in Fig. 1 that the method in principle is capable of measuring different overlap lengths.)

      Even though the manuscript contains an impressive amount of data, it appears to be lengthy, the motivation for several experiments is not clearly described, and the order of data presentation can probably be improved. For example, it is unclear why SPD-1 profiles are presented late and why KLP-19 profiles are missing - one would expect to see them early on as an essential characterization of the system under study. The motivation of the paragraph investigating the relation of KLP-19 and SPD-1 to HCP-6 is especially unclear (more than 1 page of text describing supplementary material).

      The absence of interaction between KLP-19 and SPD-1 is not demonstrated to the same quality standard as the presence of interaction between the orthologs in the literature, which should at least be mentioned.

      Additional experiments essential to support the claims of the paper?

      KLP-19 profiles in the presence and absence of SPD-1 seem to be essential.

      A co-IP of KLP-19 and SPD-1 (including positive control) to prove that the proteins are not interacting would help to support the claim.

      Data and methods presented so that they can be reproduced? Yes.

      Experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments:

      Generating cellular electron tomography data is very laborious. It is a pity that no raw data is provided; for example, a slice of a reconstructed tomogram or a video of whole volumes without segmentation would be an informative addition and allow assessment of the data quality.

      The clear evidence for direct interaction between PRC1 and kinesin-4 in other species should be correctly acknowledged throughout the text.

      The average (mean or median?) values and STDs reported in the text do not appear to match those in Fig. 1D.

      The kymograph of spd-1 RNAi in Fig. 2A seems stretched, and the size based on the scale bar does not fit the values stated in the text.

      The figure numbering, as stated in the text, does not seem to agree with those in Supplementary Figure 8.

      Page numbers and/or line numbers and figure numbers on the figures would help the reader to navigate the manuscript more easily.

      Significance

      The manuscript is a rich resource of quantitative measurements of C.elegans' structural and dynamic spindle properties, using advanced light microscopy and 3D electron microscopy imaging. In large parts, the work confirms previous conclusions of the function of PRC1 and kinesin-4 in the anaphase spindle, but also reports some interesting differences, namely that the C.elegans proteins differ from their orthologs in that they do not interact with each other, raising the question of how the kinesin-4 KLP-19 localizes to the central spindle in this organism. This work is of interest for researchers studying cell division, and specifically spindle architecture, dynamics, and function.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Zimyanin et al, Chromokinesin Klp-19 regulates microtubule overlap and dynamics during anaphase in C. elegans.

      The authors used a myriad of techniques, including confocal live-cell imaging, 2-photon microscopy, second harmonic generation imaging, FRAP, microfluidic-coupled TIRF, EM-tomography, to study spindle midzone assembly dynamics in C. elegans one-cell stage embryos. In particular, they illuminated the role of kinesin-4 KLP-19 in maintaining proper midzone length and organization. Inhibition of KLP-19 results in longer more stable midzones, implying KLP-19 functions in depolymerizing microtubules.

      Indeed, much of the results in the current study are consistent with previously published results elsewhere. Nevertheless, the current work represents a tour-de-force showcase of diverse and state-of-the-art technology application to address spindle assembly dynamics. How KLP-19 functions to define microtubule length at the midzone is still not known. But the current work, with diverse and solid data, serves to highlight where future work should focus.

      Minor comments:

      Fig 3E / There is an unusual diagonal line bisecting the embryo. Visually this does not affect viewing of the His::GFP and KLP-19::GFP signals. However, when these signals are quantified and normalized (as in Fig 3F), the diagonal bisect displaying different background signal may impact the measurements.

      Fig 4B / While the kymographs clearly show KLP-19::GFP motility on microtubules, they also show that the majority of KLP(-::GFP do not move. Perhaps some quantification and discussion of this result is appropriate?

      Significance

      Indeed, much of the results in the current study are consistent with previously published results elsewhere. Nevertheless, the current work represents a tour-de-force showcase of diverse and state-of-the-art technology application to address spindle assembly dynamics. How KLP-19 functions to define microtubule length at the midzone is still not known. But the current work, with diverse and solid data, serves to highlight where future work should focus.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study by Zimyanin et al. examines the role of the C. elegans chromokinesin KLP-19 in the formation and architecture of the anaphase central spindle in C. elegans zygotes. Through a combination of electron and light microscopy, along with RNAi-mediated perturbations, the authors propose that KLP-19 influences central spindle stiffness by regulating microtubule dynamics.

      In Figure 5, the effect of KLP-19 depletion on central spindle microtubules appears unconvincing. The FRAP results show no significant difference with or without KLP-19, and overall microtubule density does not consistently respond to its depletion. Additionally, the double klp-19; gpr-1/2 (RNAi) condition does not exhibit a strong increase in microtubule density, though a statistical test is missing for this condition. Furthermore, the spd-1; gpr-1/2 double depletion produces a similar increase in microtubule density to most klp-19 depletion conditions, suggesting that the effect cannot be solely attributed to the absence of KLP-19.

      The use of hcp-6 depletion to argue that KLP-19 depletion affects central spindle elongation independently of stretched chromatin is problematic. hcp-6 encodes a component of the Condensin II complex in C. elegans, and its depletion leads to chromatin decompaction rather than the stretched, dense chromatin observed in the midzone during anaphase in klp-19 (RNAi) embryos. These conditions are not equivalent and do not effectively rule out the possibility that chromatin stretching contributes to the observed phenotype.

      Additionally, the authors report that KLP-19 influences astral microtubule dynamics (Figure 5E), yet in Figure 3E, they show that KLP-19 localizes exclusively to kinetochores and spindle microtubules, excluding astral microtubules and spindle poles. How do they reconcile this apparent contradiction?

      Edit: In the sentence: "Similar, 60s after anaphase onset, spindles of klp-19 (RNAi) (19.2 μm {plus minus} 0.5 μm) and klp-19/spd-1 (RNAi) treated spindles (16.2 μm {plus minus} 0.6 μm) were significantly shorter in comparison to control (20.6 μm {plus minus} 0.2 μm),".

      Figure legends lack consistency and do not adhere to standard C. elegans nomenclature conventions (e.g., protein names should not be capitalized, and genetic perturbations should be italicized). Standardizing these elements would improve clarity and readability.

      Significance

      The experiments are generally well executed and provide convincing data. However, a key concern is that the role of chromokinesins-particularly Kif4, the vertebrate homolog of KLP-19-in central spindle assembly and microtubule regulation has already been demonstrated (Hu et al., CB 2011). Additionally, the function of KLP-12, a C. elegans paralog of KLP-19, in inhibiting microtubule dynamics was more recently reported and the structural details of this inhibition have been dissected (Taguchi et al., eLife 2022, this prior work should be cited and discussed). Given these considerations, and despite the extensive array of approaches used in this paper, the novelty of the current study appears rather limited and may be of interest for C. elegans researchers mainly.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Revision Plan (Response to Reviewers)

      1. General Statements [optional]

      Response: We are pleased the reviewers appreciate the power of this novel proteomics methodology that allowed us to uncover new depths on the complexity of the ribosome ubiquitination code in response to stress. We also appreciate that the reviewers think that this is a “very timely” study and “interesting to a broad audience” that can change the models of translation control currently adopted in the field. Characterizing complex cellular processes is critical to advance scientific knowledge and our work is the first of its kind using targeted proteomics methods to unveil the integrated complexity of ribosome ubiquitin signals in eukaryotic systems. We also appreciate the fairness of the comments received and below we offer a comprehensive revision plan substantially addressing the main points raised by the reviewers. According to the reviewers’ suggestions, we will also expand our studies to two additional E3 ligases (Mag2 and Not4) known to ubiquitinate ribosomes, which will create an even more complete perspective of ubiquitin roles in translation regulation.

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors present a potentially powerful proteomics platform using parallel reaction monitoring (PRM) to quantitatively profile ribosomal protein (RP) ubiquitylation, with a focus on yeast under hydrogen peroxide (H₂O₂) stress. This approach robustly identifies both known and novel RP modifications, including basal ubiquitylation events previously undetected, and identifies Hel2-dependent mechanisms. The data support the conclusion that RPs are regulated by a multifaceted ubiquitin code, establishing a good foundation for the study.

      However, the study's focus shifts in a manner that introduces several limitations. Following the rigorous PRM-based analyses, the reliance on Western blotting without replication or quantification (e.g., single-experiment data in Figs. 3-5) significantly weakens the evidence. Experimental design becomes inconsistent, with variable combinations of stressors (H₂O₂, MMS, 4-NQO) and genetic backgrounds (WT, hel2Δ, rad6Δ) that preclude systematic comparisons. For instance, Fig. 3C/E and Fig. 4 omit critical controls (e.g., MMS in Fig. 4, rad6Δ in Fig. 3E), while Fig. 5 conflates distinct variables by comparing H₂O₂-treated rad6Δ with MMS-treated hel2Δ-a design that obscures causal relationships. Furthermore, Fig. 3F highlights that 4-NQO and MMS elicit divergent responses in hel2Δ, undermining the rationale for using these stressors interchangeably. These inconsistencies culminate in a fragmented narrative; attempts to link ISR activation or ribosome stalling to RP ubiquitylation become impossible, leaving the primary takeaway as "stress responses are complex" rather than advancing mechanistic insight.

              __Response: __We appreciate the evaluation of our work and that the power of our proteomics method established a good foundation for the study. We also understand the reviewer’s concerns and we will detail below a plan to enhance quantification and increase systematic comparisons. The experiments presented here were conducted with biological replicates, but in several instances, we focused on presence and absence of bands, or their pattern (mono vs poly-ub) because of the semi-quantitative nature of immunoblots. We will revise the figures and present their quantification and statistical analyses. In additional, we did not intend to use these stressors interchangeably, but instead, to use select conditions to highlight the complexity the stress response. In particular, we followed up with H2O2 *versus* 4-NQO because both chemicals are considered sources of oxidative stress. Even though it is unfeasible to compare every single stress condition in every strain background, in the revised version, we will include additional controls to increase the cohesion of the narrative, and expand the comparison between MMS, H2O2, and 4-NQO, as suggested. Details below.
      

      To strengthen the work, the following revisions are essential:

      R1.1. Repeat and quantify immunoblots: All Western blotting data require biological replicates and statistical analysis to support claims.

              __Response: __As requested, we will display quantification and statistical analysis of the suggested and new immunoblots that will be conducted during the revision period.
      

      R1.3. Remove non-parallel comparisons: The mRNA expression analysis in Fig. 5, which compares dissimilar conditions (e.g., rad6Δ + H₂O₂ vs. hel2Δ + MMS), should be omitted or redesigned to enable direct, strain- and stressor-matched contrasts.

              __Response: __We will follow the reviewers’ suggestion and redesign the analysis to increase consistency and prioritize data under identical conditions. To increase confidence in the mRNA data analysis, we intend to perform follow up experiments and analyze protein abundance of *ARG proteins* and *CTT1 *under different conditions. The remaining data using non-parallel comparisons will be moved to supplemental material and de-emphasized in the final version of the manuscript.
      

      R1.4. Standardize experimental variables: Restructure the study to maintain identical genetic backgrounds and stressors across all figures, enabling systematic interrogation of enzyme- or stress-specific effects on the ubiquitin code.

              __Response: __To ensure a better comparison across strains and conditions, we will re-run several experiments and focus on our main stress conditions. Specifically:
      
      • 3D: We plan to re-run this experiment and include MMS

      • 3E: We plan to perform the same panel of experiments in rad6D ,and display WT data as main figure.

      • 4A-B: We plan to perform translation output (HPG incorporation) experiments with MMS as suggested

      • 4C: We plan to re-run blots for p-eIF2a under MMS for improved comparison.

      Reviewer #1 (Significance (Required)):

      The authors present a potentially powerful proteomics platform using parallel reaction monitoring (PRM) to quantitatively profile ribosomal protein (RP) ubiquitylation, with a focus on yeast under hydrogen peroxide (H₂O₂) stress. This approach robustly identifies both known and novel RP modifications, including basal ubiquitylation events previously undetected, and identifies Hel2-dependent mechanisms. The data support the conclusion that RPs are regulated by a multifaceted ubiquitin code, establishing a good foundation for the study.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors use a new target proteomics approach to quantify site-specific ubiquitin modification across the ribosome before and after oxidative stress. Then they validate their findings following in particular ubiquitination of Rps20 and Rps3 and extend their analysis to different forms of oxidative stress. Finally they question the relevance of two known actors of ribosome ubiquitination, Hel2 and Rad6. It is not easy to summarize the observations because in fact the major finding is that the patterns of ribosome ubiquitination occur in a stresser and enyzme specific manner (even when considering only oxidative stress). However, the complexity revealed by this study is very relevant for the field, because it underlies that the ubiquitination code of ribosomes is not easy to interpret with regard to translation dynamics and responses to stress or players involved. It suggests that some of the models that have generally been adopted probably need to be amended or completed. I am not a proteomics expert, so I cannot comment on the validity of the new proteomics approach, of whether the methods are appropriately described to reproduce the experiments. However, for the follow up experiments, the results following Rps20 and Rps3 ubiquitination are well performed, nicely controlled and are appropriately interpreted.

      Maybe what one can regret is that the authors have limited their analysis to the study of Hel2 and Rad6, and not included other enyzmes that have already been associated with regulation of ribosome ubiquitination, to get a more complete picture. It may not take that much time to test more mutants, but of course there is the risk that rather than enable to make a working model it might make things even more complex.

              __Response: __We value the positive evaluation of our work. We also appreciate the notion that it meaningfully expands the knowledge on the complexity of the ribosome ubiquitination code, challenges the current models of translation control, and conducted well-performed, and nicely controlled experiments. To address the main concern of the reviewer, we will expand our work by studying two additional enzymes involved in ribosome ubiquitination (Mag2 and Not4) and provide a more comprehensive picture of this integrated system. Specifically, we will generate yeast strains deleted for *MAG2* and *NOT4*, and evaluate their impact in ribosome ubiquitination under our main conditions of stress. We will investigate the role of these additional E3s in translation output (HPG incorporation), and in inducing the integrated stress response via phosphorylated eIF2α and Gcn4 expression. Additional follow up experiments will be performed according to our initial results.
      

      Reviewer #2 (Significance (Required)):

      In recent years, regulation of translation elongation dynamics has emerged as a much more relevant site of control of gene expression that previously envisonned. The ribosome has emerged as a hub for control of stress responses. Therefore this study is certainly very timely and interesting for a broad audience. However, it does fall short of giving any simple picture, and maybe the only point one can question is whether it is interesting to publish a manuscript that concludes that regulation is complicated, without really being able to provide any kind of suggestive model.

      My feeling is nevertheless that it will impact how scientists in the field design their experiments and what they will conclude. It will certainly also drive new experiments and approaches, and lead to investigations on how all the different players in regulation of ribosome modification talk to each other and signal to signaling pathways.

              __Response: __We appreciate the comments and the balanced view that studies like ours will still be impactful and contribute to a number of fields in multiple and meaningful ways. With the new experiments proposed here, and used of additional mutants and strains, we intend to propose and provide a more unified model that explain this complex and dynamic relationship.
      

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Recent studies have shown that the ubiquitination of uS3 (Rps3) is crucial for the quality control of nonfunctional rRNA, specifically in the process known as 18S noncoding RNA degradation (NRD). Additionally, the ubiquitination of uS10 (Rps20) plays a significant role in ribosome-associated quality control (RQC). However, the dynamics of ribosome ubiquitination in response to oxidative stress are not yet fully understood.

      In this study, the authors developed a targeted proteomics method to quantify the dynamics of ribosome ubiquitination in response to oxidative stress, both relatively and stoichiometrically. They identified 11 ribosomal sites that exhibited increased ubiquitin modification after exposure to hydrogen peroxide (H2O2). This included two known targets: uS10 and uS3 (of Hel2), which recognize collided ribosomes and initiate the processes of 18S NRD and translation quality control (RQC). Using isotope-labeled peptides, the researchers demonstrated that these modifications are non-stoichiometric and display significant variability among different peptides.

      Furthermore, the authors explored how specific enzymes in the ubiquitin system affect these modifications and their impact on global translation regulation. They found that uS3 (Rps3) and uS10 (Rps20) were modified differently by various stressors, which in turn influenced the Integrated Stress Response (ISR). The authors suggest that different types of stressors alter the pattern of ubiquitinated ribosomes, with Rad6 and Hel2 potentially competing for specific subpopulations of ribosomes.

      Overall, this study emphasizes the complexity of the ubiquitin ribosomal code. However, further experiments are necessary to validate these findings before publication.

      Major Comments:

      I consider the additional experiments essential to support the claims of the paper.

      R3.1. To understand the roles of ribosome ubiquitination at the specific sites, the authors must perform stressor-specific suppression of global translation, as demonstrated in Figures 4 and 5. This should include the uS10-K6R/K8R and uS3-K212R mutants.

              __Response: __We understand the importance of the suggested experiment. We have already requested and kindly received strains expressing these mutations, which will reduce the time required to successfully address this point. We will perform our translation and ISR assays such as the one referred by the reviewer in Figs. 4A-C and 5E, and results will determine the role of individual ribosome ubiquitination sites in translation control.
      

      R3.2. It is crucial to ensure that experiments are adequately replicated and that statistical analysis is thorough, with precise quantification. For a more accurate comparison between wild-type (WT) and Hel2 deletion mutants regarding ribosome ubiquitination, the authors should quantify the ubiquitinated ribosomes in both WT and Hel2 mutants under stress conditions. This quantification should be conducted on the same blot, using diluted control samples. Similarly, in Figures 3F and 4C, for an accurate comparison between WT and Hel2 or Rad6 deletion mutants, the authors should quantify the ubiquitinated ribosomes across these conditions. Again, this quantification should be performed on the same blot with the dilution of control samples.

              __Response: __As was also requested by reviewer 1 and discussed above (point R1.1), we will conduct quantification and display statistical analyses for our immunoblots. In addition, we will re-run the aforementioned experiments to improve quantification following the reviewers’ request (same gel & diluted control samples).
      

      Reviewer #3 (Significance (Required)):

      • General assessment:

      Recent studies reveal that the ubiquitination of uS3 (Rps3) is essential for the quality control of nonfunctional rRNA (18S NRD), while the ubiquitination of uS10 (Rps20) plays a crucial role in ribosome-associated quality control (RQC). However, the dynamics of ribosome ubiquitination in response to oxidative stress remain unclear.

      • Advance:

      In this study, the authors developed a targeted proteomics method to quantify ribosome ubiquitination dynamics in response to oxidative stress, both relatively and stoichiometrically. By utilizing isotope-labeled peptides, they demonstrated that these modifications are non-stoichiometric and exhibit significant variability across different peptides. They identified 11 ribosomal sites that showed increased ubiquitin modification following H2O2 exposure, including two known targets of Hel2, which recognize collided ribosomes and induce translation quality control (RQC).

      • Audience: This information will be of interest to a specialized audience in the fields of translation, ribosome function, quality control, ubiquitination, and proteostasis.

      • The field: Translation, ribosome function, quality control, ubiquitination, and proteostasis.

      __ Response:__ We appreciate that our work will be valuable to a number of fields in protein dynamics and that our method advances the field by measuring ribosome ubiquitination relatively and stoichiometrically in response to stress.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Response: All requested changes require experiments and data analyses, and a complete revision plan is delineated above in section #2.

      • *

      4. Description of analyses that authors prefer not to carry out

      • *

      R1.2. Leverage the PRM platform: Apply the established quantitative proteomics approach to validate or extend findings in Fig. 3 (e.g., RAD6-dependent ubiquitylation), ensuring methodological consistency.

              __Response: __Although we understand the interest on the proposed result for consistency, this is the only requested experiment that we do not intend to conduct. Because of the lack of overall ubiquitination of ribosomal proteins in *rad6**D* in response to H2O2 (e.g., Silva et al., 2015, Simoes et al., 2022), we believe that this PRM experiment in unlikely to produce meaningful insight on the ubiquitination code. In this context, we expected that sites regulated by Hel2 will be the ones largely modified in rad6*D *and we followed up on them via immunoblot. Moreover, this experiment would not be time or cost-effective, and resources and efforts could be used to strengthen other important areas of the manuscript, such as including the E3’s Mag2 and Not4 into our work.
      
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Recent studies have shown that the ubiquitination of uS3 (Rps3) is crucial for the quality control of nonfunctional rRNA, specifically in the process known as 18S noncoding RNA degradation (NRD). Additionally, the ubiquitination of uS10 (Rps20) plays a significant role in ribosome-associated quality control (RQC). However, the dynamics of ribosome ubiquitination in response to oxidative stress are not yet fully understood.

      In this study, the authors developed a targeted proteomics method to quantify the dynamics of ribosome ubiquitination in response to oxidative stress, both relatively and stoichiometrically. They identified 11 ribosomal sites that exhibited increased ubiquitin modification after exposure to hydrogen peroxide (H2O2). This included two known targets: uS10 and uS3 (of Hel2), which recognize collided ribosomes and initiate the processes of 18S NRD and translation quality control (RQC). Using isotope-labeled peptides, the researchers demonstrated that these modifications are non-stoichiometric and display significant variability among different peptides.

      Furthermore, the authors explored how specific enzymes in the ubiquitin system affect these modifications and their impact on global translation regulation. They found that uS3 (Rps3) and uS10 (Rps20) were modified differently by various stressors, which in turn influenced the Integrated Stress Response (ISR). The authors suggest that different types of stressors alter the pattern of ubiquitinated ribosomes, with Rad6 and Hel2 potentially competing for specific subpopulations of ribosomes.

      Overall, this study emphasizes the complexity of the ubiquitin ribosomal code. However, further experiments are necessary to validate these findings before publication.

      Major Comments:

      I consider the additional experiments essential to support the claims of the paper.

      1. To understand the roles of ribosome ubiquitination at the specific sites, the authors must perform stressor-specific suppression of global translation, as demonstrated in Figures 4 and 5. This should include the uS10-K6R/K8R and uS3-K212R mutants.
      2. It is crucial to ensure that experiments are adequately replicated and that statistical analysis is thorough, with precise quantification. For a more accurate comparison between wild-type (WT) and Hel2 deletion mutants regarding ribosome ubiquitination, the authors should quantify the ubiquitinated ribosomes in both WT and Hel2 mutants under stress conditions. This quantification should be conducted on the same blot, using diluted control samples. Similarly, in Figures 3F and 4C, for an accurate comparison between WT and Hel2 or Rad6 deletion mutants, the authors should quantify the ubiquitinated ribosomes across these conditions. Again, this quantification should be performed on the same blot with the dilution of control samples.

      Significance

      General assessment:

      Recent studies reveal that the ubiquitination of uS3 (Rps3) is essential for the quality control of nonfunctional rRNA (18S NRD), while the ubiquitination of uS10 (Rps20) plays a crucial role in ribosome-associated quality control (RQC). However, the dynamics of ribosome ubiquitination in response to oxidative stress remain unclear.

      Advance:

      In this study, the authors developed a targeted proteomics method to quantify ribosome ubiquitination dynamics in response to oxidative stress, both relatively and stoichiometrically. By utilizing isotope-labeled peptides, they demonstrated that these modifications are non-stoichiometric and exhibit significant variability across different peptides. They identified 11 ribosomal sites that showed increased ubiquitin modification following H2O2 exposure, including two known targets of Hel2, which recognize collided ribosomes and induce translation quality control (RQC).

      Audience: This information will be of interest to a specialized audience in the fields of translation, ribosome function, quality control, ubiquitination, and proteostasis.

      The field: Translation, ribosome function, quality control, ubiquitination, and proteostasis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript the authors use a new target proteomics approach to quantify site-specific ubiquitin modification across the ribosome before and after oxidative stress. Then they validate their findings following in particular ubiquitination of Rps20 and Rps3 and extend their analysis to different forms of oxidative stress. Finally they question the relevance of two known actors of ribosome ubiquitination, Hel2 and Rad6.

      It is not easy to summarize the observations because in fact the major finding is that the patterns of ribosome ubiquitination occur in a stresser and enyzme specific manner (even when considering only oxidative stress). However, the complexity revealed by this study is very relevant for the field, because it underlies that the ubiquitination code of ribosomes is not easy to interpret with regard to translation dynamics and responses to stress or players involved. It suggests that some of the models that have generally been adopted probably need to be amended or completed. I am not a proteomics expert, so I cannot comment on the validity of the new proteomics approach, of whether the methods are appropriately described to reproduce the experiments. However, for the follow up experiments, the results following Rps20 and Rps3 ubiquitination are well performed, nicely controlled and are appropriately interpreted. Maybe what one can regret is that the authors have limited their analysis to the study of Hel2 and Rad6, and not included other enyzmes that have already been associated with regulation of ribosome ubiquitination, to get a more complete picture. It may not take that much time to test more mutants, but of course there is the risk that rather than enable to make a working model it might make things even more complex.

      Significance

      In recent years, regulation of translation elongation dynamics has emerged as a much more relevant site of control of gene expression that previously envisonned. The ribosome has emerged as a hub for control of stress responses. Therefore this study is certainly very timely and interesting for a broad audience.

      However, it does fall short of giving any simple picture, and maybe the only point one can question is whether it is interesting to publish a manuscript that concludes that regulation is complicated, without really being able to provide any kind of suggestive model.

      My feeling is nevertheless that it will impact how scientists in the field design their experiments and what they will conclude. It will certainly also drive new experiments and approaches, and lead to investigations on how all the different players in regulation of ribosome modification talk to each other and signal to signaling pathways.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors present a potentially powerful proteomics platform using parallel reaction monitoring (PRM) to quantitatively profile ribosomal protein (RP) ubiquitylation, with a focus on yeast under hydrogen peroxide (H₂O₂) stress. This approach robustly identifies both known and novel RP modifications, including basal ubiquitylation events previously undetected, and identifies Hel2-dependent mechanisms. The data support the conclusion that RPs are regulated by a multifaceted ubiquitin code, establishing a good foundation for the study.

      However, the study's focus shifts in a manner that introduces several limitations. Following the rigorous PRM-based analyses, the reliance on Western blotting without replication or quantification (e.g., single-experiment data in Figs. 3-5) significantly weakens the evidence. Experimental design becomes inconsistent, with variable combinations of stressors (H₂O₂, MMS, 4-NQO) and genetic backgrounds (WT, hel2Δ, rad6Δ) that preclude systematic comparisons. For instance, Fig. 3C/E and Fig. 4 omit critical controls (e.g., MMS in Fig. 4, rad6Δ in Fig. 3E), while Fig. 5 conflates distinct variables by comparing H₂O₂-treated rad6Δ with MMS-treated hel2Δ-a design that obscures causal relationships. Furthermore, Fig. 3F highlights that 4-NQO and MMS elicit divergent responses in hel2Δ, undermining the rationale for using these stressors interchangeably. These inconsistencies culminate in a fragmented narrative; attempts to link ISR activation or ribosome stalling to RP ubiquitylation become impossible, leaving the primary takeaway as "stress responses are complex" rather than advancing mechanistic insight.

      To strengthen the work, the following revisions are essential:

      1. Repeat and quantify immunoblots: All Western blotting data require biological replicates and statistical analysis to support claims.
      2. Leverage the PRM platform: Apply the established quantitative proteomics approach to validate or extend findings in Fig. 3 (e.g., RAD6-dependent ubiquitylation), ensuring methodological consistency.
      3. Remove non-parallel comparisons: The mRNA expression analysis in Fig. 5, which compares dissimilar conditions (e.g., rad6Δ + H₂O₂ vs. hel2Δ + MMS), should be omitted or redesigned to enable direct, strain- and stressor-matched contrasts.
      4. Standardize experimental variables: Restructure the study to maintain identical genetic backgrounds and stressors across all figures, enabling systematic interrogation of enzyme- or stress-specific effects on the ubiquitin code.

      Significance

      The authors present a potentially powerful proteomics platform using parallel reaction monitoring (PRM) to quantitatively profile ribosomal protein (RP) ubiquitylation, with a focus on yeast under hydrogen peroxide (H₂O₂) stress. This approach robustly identifies both known and novel RP modifications, including basal ubiquitylation events previously undetected, and identifies Hel2-dependent mechanisms. The data support the conclusion that RPs are regulated by a multifaceted ubiquitin code, establishing a good foundation for the study.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Using human pluripotent stem cell-based Gel-3D model of amniogenesis this study investigates the transcriptional dynamics of amnion differentiation at single cell level. Seven cell clusters are identified that emerge over four days of differentiation, including progressive amniotic fates precursors, among them CLDN10 progenitor located on the boundary of amnion and epiblast, and primordial germ cell-like fate. Mutational studies support the role of CLDN10 in promoting amniotic but limiting primordial germ cell-like.

      Major Comments:

      This generally clearly presented study significantly advances our understanding of human/NHP amniogenesis and should be of broad interest with relevance to human reproduction. However, the there are several questions about how experiments were performed, analyzed, presented and interpreted that need to be answered.

      1. The presented antibody stainings while beautiful are presented without sufficient quantification. A single representative (?) cyst is shown. Please provide information about how many cysts on average have been analyzed in how many experiments, expression levels of should be quantified to support conclusions such as: Line 116-118 "a subset of cells within the cysts displays reduced expression of NANOG, while TFAP2A expression becomes weakly activated"; or Line 129 "the transition from pluripotent to amnion cell types occurs progressively over the cyst, starting from focal initiation sites".
      2. As the the scRNA-seq experiment is one of the main advances of this study and it explores the temporal dynamics and transitional cell populations during amniogenesis this experiment should be performed with two independent biological replicates to investigate the variability of the amniogenesis in this model in terms of the proportion of the 7 distinct cell populations the authors identified in this analysis.
      3. Another interesting parallel between the amnion model and the CS7 human gastrula is most Tyser "Epiblast" cells are seen in the "pluripotency-exiting" population of the amnion model. However, pluripotency exit is a hallmark of epiblast as it initiates gastrulation and primitive streak formation/mesendoderm differentiation. This should be analyzed and discussed further, especially that the authors see in the amnion model some cells expressing TBXT at low level.
      4. How do the authors explain/interpret the difference in CLDN10 expression at RNA and protein level?
      5. Two hESC CLDN10 mutant lines are presented in Figure S4, which are transheterozygous for framesfhit mutations. However, it is not clear how (guideRNAs), in which position of the gene these mutations were generated and what is predicted mutant protein product of each allele. Please provide, gene structure, gRNA position and predicted protein product cartoons. As we do not know the antigen recognized by CLDN10 antibody, these are critical considerations.
      6. What are the consequences of these mutations on CLDN10 transcript? qPCR and also scRNA-seq data the authors have.
      7. Please indicate in the experiments using CLDN10 mutant lines, which KO line has been used for specific experiment and whether same/different results have been obtained with the two lines.
      8. The excess of PGCL cells in CLDN10 KO Gel-3D amnion model is an important observation, but not fully supported by the data. We are presented with single images of mutant cysts at different stages of amniogenesis. Additional data and the number of SOX17+ cells in WT and mutant cysts at should be provided.
      9. The authors propose an interesting concept of CLDN10 at the boundary between the amnion and the epiblast promoting amniogenesis and limiting hHPGLC formation. They speculate about the role of tight junction in this process in agreement with increased hHPGLC formation upon ZO1 reduction in another hPSC model. However, surprisingly little discussion is provided about signaling implications of the reported amniogenic transcriptional cascade, and signals emanating from the different amnion progression cell types. Given the important role of BMP in the formation of amnion and hPGCs, notable is increasing expression of BAMBI in progenitor cell types and high expression in specified and maturing clusters. The expression of signaling pathway components should be analyzed and discussed in more depth.

      Additional comments:

      1. It is not easy to discern the numbers of the seven populations that are detected at D1-D4 from Figure 1C. A panel in Figure 1 illustrating this would be informative.
      2. The similarity of the "Ectoderm" cluster from the CS7 human gastrula Tyser et al., 2021 to extraembryonic cell type with amnion/trophectoderm characteristics in hESC 2D-gastruloid model has been reported by Minn et al., Stem Cell Reports, 2021 and this should be acknowledged.

      Referees cross-commenting

      There is consensus among the reviewers that this is a novel and important work, but additional experiments and their rigorous quantification is needed. Attending to the reviewers comments will significantly elevate this exciting work.

      Significance

      Occurring upon implantation of human blastocyst, amniogenesis, or formation of the amniotic sac from the pluripotent epiblast, is still poorly understood but essential process of human embryogenesis. The key morphogenetic aspects of amniogenesis, i.e. epithelial polarization of epiblast into a cyst and subsequently differentiation of the portion of the cyst abutting the trophectoderm proximal to the uterus into squamous epithelium is in part modeled by the hESC-based amnion models in which BMP stimulation plays a crucial role. In the Gel-3D amnion model model deployed here, no exogenous BMP is added, however, BMP signaling is activated in the cells by a mechanosensitive cue provided by the soft substrate; hESCs initially form a cyst of epithelial cells expressing pluripotent markers that initiate transcriptional cascade and within 4 days of culture differentiate into a cyst of squamous-amnion-like epithelium.

      This work expands on the previous studies by investigating the transcriptional dynamics of amnion differentiation at single cell level combined with additional antibody stainings and compare their findings to distinct cell types in a Carnegie stage 7 human embryo (Tyser et al., 2021) and relevant non-human primate datasets. Based on the resulting data the authors posit contiguous amniogenic cell states: pluripotency-exiting, early progenitor, late progenitor, specified and maturing. Moreover, they also uncover that this model of amniogenesis also produces primordial germ cell-like (hPGC-L) and mesoderm-like cells. A notable finding is that high levels of CLDN10 mark a later transient progenitor state, but CLDN10 expression is downregulated more differentiated cells. Moroever, the authors posit that CLDN10 is a marker of the progenitor population, expression of which is restricted to the boundary between the amnion and the epiblast of the cynomolgus macaque peri-gastrula. Functional interrogation of CLDN10 using hESC mutant lines in the Gel-3D amnion model shows reduced amniogenesis and excess of hPGC-L cells. The authors propose that the CLDN10 the amnion-epiblast boundary is a site of active amniogenesis but limits hPGC-L. This work advances our understanding of amniogenesis, strengthens the concept that amnion and PGC progressing cells initially share acommon intermediate lineage, provides a valuable transcriptomic dataset and should be of broad interest with relevance to human development and reproduction.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Sekulovski et al characterize the transcriptome of human pluripotent stem cells differentiating to an amnion fate in 3D, using single-cell RNA sequencing. This leads to the identification of CLDN10+ cells as amnion progenitors. When CLDN10 is eliminated, amniogenesis is compromised. Moreover, analysis of CLDN10 localization in cynomolgus macaque embryos reveals that this progenitor population is located at the boundary between the epiblast and the amnion.

      Major comments:

      The key results are convincing and supported by clear experiments. However, additional controls, quantifications, and clarifications are needed as follows:

      • The authors identify five amnion-progressing states in vitro and mention that each of these states also shows transcriptional similarities to cell types in a CS7 embryo (Tyser et al, 2021). How do the authors interpret this result? Would this mean that there are amnion cells at all different maturation stages present at a specific time point in development? Given that the available in vivo reference is derived from a single human embryo, it is more likely that the true in vivo counterpart of these states is not captured in the embryo data.
      • The authors stain the 3D amnion model at different stages and conclude that "amniogenesis initiates focally and spreads laterally". This cannot be concluded from the data provided. The images in Figure 1 simply show heterogeneity in the levels of TFAP2A. To support their claims, the authors would need to perform time-lapse experiments using a TFAP2A reporter line.
      • The authors conclude that CLDN10+ cells give rise to amnion during gastrulation of cynomolgus macaque embryos. The data provided does not prove that CLDN10+ cells are the amnion progenitors in vivo.
      • CLDN10 KO cells form amnion cysts like control cells by day 3. However, by day 4 the cysts lose expression of the amnion marker ISL1 and become disorganized. To characterize the epithelial (or lack of) phenotype, the authors should include membrane/polarity/adhesion immunostainings. Is the disorganization observed at day 4 associated with the progressive changes in cell identity, or is it a time-dependent phenotype? The authors should include human PSC cysts as a control. This would allow them to determine whether the role of CLDN10 is specific to amnion cells.
      • Figure 2: is there a correlation between the levels of CLDN10 and TFAP2A based on the scRNAseq data and the immunofluorescence stainings? The IF data would benefit from quantifications.
      • Figure 4: the experiment has not been quantified. What is the % of PGCLCs in WT and KO cells? What are the levels of ISL1 in WT and KO cells? What is the localization of epithelial determinants in WT and KO cells? Is there an anti-correlation between CLDN10 and ISL1?

      Referees cross-commenting

      I think there is a general consensus that additional quantifications and careful analyses are needed before this paper is accepted for publication. I agree with the comments raised by the other reviewers.

      Significance

      This manuscript is a follow-up work of Sekulovski et al, 2024. In this recent manuscript, the authors already provided a temporally resolved transcriptomic characterization of in vitro amniogenesis. The key difference between the two articles is that while Sekulovski et al, 2024 performed a bulk RNAseq experiment, in the current manuscript a single-cell RNAseq experiment has been done. It is fundamental to clearly define what new findings have been obtained thanks to the single-cell experiment, which could not have been obtained using the bulk transcriptomics data. This is a particularly important point given the robustness and synchrony of the model. For example, the authors had already identified five amnion states in vitro in their previous publication. Is CLDN10 differentially expressed in the progenitor population based on the bulk RNAseq data? Are the same dynamics of expression recapitulated? The title of the manuscript does not mention CLDN10 but rather focuses on transcriptional profiling at the single-cell level. In my opinion, the key novelty of this manuscript is the identification of CLDN10 and the role it plays during amniogenesis. Focusing the manuscript on the dynamic transcriptional profile diminishes the novelty, as this had already been done by the authors at the bulk level. Globally, this manuscript provides additional information of the poorly understood process of amniogenesis that will be interesting for those working on early human embryogenesis.

      My area of expertise is early mammalian embryo development and stem cells. I do not have the computational background to evaluate the bioinformatic analyses of the manuscript in-depth.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a novel and important and interesting study that uses one of the best amniogeneisis form PSCs int he field. The authors do scRNA-seq during 4 day course to understand different populations emerging during amniogeneisis, and they identify CLDN10 as a marker for newly emerging new amion cells, and then use their model and monkey real embryos to prove the CLDN10+ population at the amnion-epiblast border. In the final part, the authors knockout CLDN10 and claim it compromises amniogenesis and favours formation.

      Significance

      This is a well conducted study, and conclusions are novel and super exciting and IMPORTANT!!!. I have one-2 major comments to strengthen conclusions in the last part, and will help make this excellent study become superb and a landmark study.

      1. it is not really clear what is the phenotype of CLDN10 KO cells. is amniogenesis totally inhibited? can the authors do scRNA-seq on the KO cells and compare them to WT cells? There is no quantitation to amnion or PGC formation efficiency ? how many structures where analyzed?
      2. in continuation with the above The claim that PGC formation is enhanced in KO is not strong. PGCs should be stained for NANOS3 and blimp1 specific marker and not only SOX17 which can also be a Pre marker. Then quantification should be properly done.
    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Dear Review Commons editorial team,

      Thank you for coordinating the thorough and careful review of our manuscript. We are especially grateful to the four anonymous reviewers for recognizing the value of our work and for their constructive suggestions on how to improve it.

      We are encouraged by the positive reception of our main conclusions on the robustness of adaptation to DNA replication stress and its relevance to multiple fields. All reviewers provided insightful comments, with reviewers #2 and #4 emphasizing that further experimental validation of the hypothesized role of reduced dNTPs in alleviating fitness during constitutive DNA replication stress would strengthen the paper. While the precise molecular mechanisms underlying this suppression are not the primary focus of this manuscript, we are eager to perform additional experiments based on the reviewers’ suggestions.

      Below, we present a detailed revision plan in the form of a point-by-point response to their comments.

      Reviewer #1 (Evidence, reproducibility and clarity):

      This study investigates the compensatory evolutionary response of Saccharomyces cerevisiae to DNA replication stress, focusing on the influence of genotype-environment interactions (GXE). The authors used a range of experimental conditions with varying nutrient levels to assess evolutionary outcomes under replication stress. Their genomic analysis reveals that while glucose levels affect initial adaptation rates, the genetics of adaptation remain robust across all nutritional environments. The research offers new insights into the adaptability of S. cerevisiae, emphasizing the role of the nutritional environment in evolutionary processes related to DNA replication stress. It identifies recurrent advantageous mutations under different macronutrient availabilities and uncovers a novel role for the RNA polymerase II mediator complex in adaptation to replication stress. Overall, this well-designed study adds to the growing recognition of the complexity and robustness of evolutionary responses to environmental stressors. It provides strong evidence that compensatory evolution to replication stress is robust across varying nutritional conditions. It both challenges and reinforces previous findings regarding the resilience of the yeast genetic interaction network to environmental perturbations. The detailed analysis of specific compensatory mutations and their fitness impacts across different conditions offers valuable insights into adaptive dynamics over 1000 generations, contributing a clear empirical framework for understanding how replication-associated stress shapes evolutionary outcomes in diverse environments.

      Based on the analysis:

      1) The conclusions are generally well-supported by the presented data. The evolution experiments and genomic analyses are robust and provide convincing evidence for the study's main claims. The authors took steps to eliminate bias, such as maintaining an adequate Ne, which, if not done, could have compromised their conclusions by affecting genetic drift and limiting the population's access to beneficial mutations.

      2) The figures are well-designed and easy to understand.

      3) The methodology is well-described and appears reproducible. The authors provide sufficient details on experimental procedures. Experimental replication is adequate, with multiple evolutionary lines.

      4) They also made efforts to validate their observations, such as the validation of mutations, the prediction of interactions in the Med14 structure, and its potential implication in gene regulation, as well as the analysis of the cumulative fitness benefit and the reconstruction of the quadruple mutant.

      There are, however, a few results that would benefit from further clarification:

      1) The experimental design is strong, offering a diverse range of conditions. However, the high glucose condition (8%) stands out as significantly different from the neutral 2% condition, both in range and margin, compared to the low glucose conditions (0.25-0.5%). While this mainly affects growth profiles and evolvability in the early generations, a brief explanation in the discussion would strengthen the conclusions. Specifically, addressing:

      1. a) The rationale behind selecting these particular glucose concentrations.

      2. b) How other glucose concentrations might influence the outcomes. Providing this additional context would enhance the reader's understanding of the experimental setup and its potential implications, while also offering insights into the broader applicability of the findings and possible directions for future research.

      We thank the reviewer for pointing out the need to clarify the rationale behind the glucose concentrations used in our study, an aspect we agree should have been better explained. In response, we have added the following text detailing the chosen conditions and their established effects on cellular metabolism.

      Line 67: “Glucose is the most abundant monosaccharide in nature, and represents the preferred source of energy for most cells.”

      Line 110: “...we grew WT and ctf4Δ cells in varying glucose concentrations to induce distinct physiological states. Low glucose levels (0.25% and 0.5%) induce caloric restriction and ultimately glucose starvation (Lin et al 2000, Smith et al. 2009). These conditions elicit increased respiration (Lin et al., 2002), sirtuins expression (Guarente, 2013), autophagy (Bagherniya et al. 2018), DNA repair (Heydari et al., 2007), and reduced recombination at the ribosomal DNA locus (Riesen and Morgan, 2009) ultimately extending lifespan in several organisms (Kapahi et al., 2016). In contrast, standard laboratory conditions typically use 2% glucose, promoting a rapid proliferation environment to which strains have been adapted since laboratory domestication (Lindergren, 1949). Finally, elevated glucose concentrations (such as 8%) result in higher ethanol production (Lin et al., 2012) and reactive oxygen species (ROS) levels (Maslanka et al., 2017).

      2) In the discussion section, a more explicit comparison with similar studies in other model organisms would help contextualize the findings within the broader field of evolutionary biology. While the results appear robust, it would be beneficial to explore how they align with or contrast to previous studies on DNA damage, particularly in bacteria or highly complex eukaryotes.

      We appreciate this suggestion to better contextualize our findings within the broader literature, as it provides an opportunity to highlight the unique aspects of our work. While many studies have explored how environmental factors shape fitness landscapes and influence evolutionary strategies, to our knowledge, only a few have addressed this in the context of compensatory evolution, where cells must recover fitness lost due to intracellular perturbations. To address this point, we have added a discussion of additional examples involving other model organisms, highlighting their difference with the question asked in this work.

      Line 34: “Genotype-by-environment (GxE) interactions are well-documented. For example, several studies on E. coli have demonstrated how different environments influence fitness and epistatic interactions among adaptive mutations in the Lenski Long-Term Evolution Experiment (Ostrowski et al., 2005, 2008; Flynn et al., 2012; Hall et al., 2019). Adaptive mutations in viral genomes similarly exhibit variable fitness effects across different hosts (Lalic and Elena, 2012; Cervera, 2016). Furthermore, interactions between mutations in the Plasmodium falciparum dihydrofolate reductase gene have been shown to predict distinct patterns of resistance to antimalarial drugs (Ogbunugafor et al., 2016). However, the role of environmental factors in shaping evolution within the context of compensatory adaptation, when fitness defects primarily arise from intracellular perturbations, remains much less explored.”

      However, if the reviewer have particular additional studies in mind, we welcome further suggestions to include in the final manuscript.

      Minor comments:

      1) The presentation of data in the figures is clear and informative. However, some figure legends could benefit from more detailed explanations. For example, although the statistical tests used are mentioned in the methods section, it would be helpful to also include them in the figure legends, such as in legend 1acde, as well as in all other figures.

      We are now reporting the statistical test used for each comparison also in figure legends.

      2) In terms of broader conclusions, here are a few suggestions, though they are, of course, optional:

      a) The study could benefit from exploring the potential trade-offs of adaptive mutations in the hypothetical return to environments without replication stress, at least theoretically. This would provide a more comprehensive understanding of the evolutionary constraints.

      We thank the reviewer for the suggestion, we had performed the measurements but did not comment on them explicitly. We are now commenting on them as follows:

      Line 310: “In the WT background, all mutations were nearly neutral, with only minimal deleterious or advantageous effects on fitness depending on glucose concentrations (Fig S4A).”

      Line 468: “The nearly neutral effects on fitness of the core adaptive mutations in WT suggest that they are likely to persist even after the initial replication stress is resolved.”

      b) A brief discussion of the potential limitations of using lab strains versus wild isolates of S. cerevisiae would offer valuable context for the generalizability of the findings.

      This is an excellent point. While addressing it fully would warrant a separate manuscript, we provide our comments here, along with similar observations raised by this and other reviewers, as follows:

      Line 450: “How generalizable are our conclusions about the reproducibility of evolutionary repair to DNA replication stress across other organisms, species, or replication challenges? While dedicated future studies are needed to fully address these important questions, several lines of evidence are encouraging. A recent report demonstrated that the identity of suppressor mutations of lethal alleles was conserved when introduced into highly divergent wild yeast isolates (Paltenghi and van Leeuwen, 2024). Similarly, earlier work showed that even ploidy, which significantly alters the target size for loss- and gain-of-function mutations, affected only the identity of the genes targeted by selection, while the broader cellular modules involved remained consistent (Fumasoni and Murray, 2021). Moreover, divergent organisms experiencing different types of DNA replication stress exhibit some of the adaptive responses described here. For example, the yeast genus Hanseniaspora, which lacks the Pol32 subunit of the replisome, has also been reported to have lost the DNA damage checkpoint (Steenwyk et al., 2019). Human Ewing sarcoma cells carrying the fusion oncogene EWS-FLI1 frequently exhibit adaptive amplification of the cohesin subunit RAD21 (Su et al., 2021). Together, these findings suggest that while the specific details of DNA replication perturbations and the genomic features of organisms may shape the precise targets of compensatory evolution, the overarching principles and cellular modules affected are broadly conserved.”

      Furthermore, we plan to search a recently published database of variants found in natural isolates of S. cerevisiae to assess whether similar evolutionary processes to those described in this study may have occurred in wild strains.

      c) It would be valuable to present the differences in ploidy in the context of other studies, such as the nutrient-limitation hypothesis (e.g., 'The Evolutionary Advantage of Haploid Versus Diploid Microbes in Nutrient-Poor Environments' by Bessho, 2015), since, as previously demonstrated by the authors of this article that is being reviewed, ploidy may influence the evolutionary trajectories of DNA repair.

      d) Interrelating these three terms: nutrient-limitation, ploidy, and DNA repair could be an interesting avenue to explore in the discussion.

      In response to comments c and d, we have now commented on the intersection between ploidy and other types of DNA perturbation in the paragraph starting in line 491 (see response above)

      3) Specific details:

      a) Line 116: To improve clarity, it would be beneficial to refer to the figure right after the statement: 'However, their relative fitness improved compared to the WT reference as the initial glucose levels (Figure X).'

      b) Line 404: The statement about antibiotics and cancer progression is somewhat brief here; it might be helpful to provide more context on why this mechanism influences these processes (here or before).

      c) Line 418: "were re-suspended in water containing zymolyase (Zymo Research, Irvine, CA, US, 0.025 μ/μL), incubated at". Something is missing in the units.

      d) Line 459: "and G2 phases for each genotype was estimated by deriving the the relative cell distribution". The article "the" is repeated.

      e) 1a: The x-axis ticks appear misaligned, which makes it difficult to interpret the boxplots. For example, at 0.25, the tick is closer to the orange boxplot than to the black one. In contrast, at 2%, the tick seems well-centered."

      f) Figure 3 could benefit from a general legend at the top regarding the colors, as finding it in 2c was not intuitively easy.

      The typos and suggestions raised in points 3a-f have now been corrected in the manuscript.

      g) I didn't review the code on GitHub.

      Reviewer #1 (Significance):

      The main strength of the study is that it shows robustness of compensatory evolution across varying nutrient conditions. The study adds to the growing body of literature on DNA replication stress and evolutionary adaptation by showing that compensatory evolution can occur regardless of nutrient availability. This fundamental finding challenges prior assumptions that nutrient conditions significantly alter evolutionary outcomes, contributing to a more nuanced understanding of how cells respond to stress. Furthermore, the discovery of the RNA polymerase II mediator complex's role in this process is particularly novel and opens new lines of investigation.

      Advance in the field: The results advance our understanding of evolutionary biology, particularly in the context of DNA replication stress and compensatory evolution. The study demonstrates that evolutionary repair mechanisms are predictable, even under variable environmental conditions, which has key implications for evolutionary biology and therapeutic applications.

      Audience:

      This paper will be of interest to a specialized audience in evolutionary biology, genomics, and cell biology, particularly those interested in DNA replication stress and adaptive evolution. Researchers studying stress responses in model organisms, such as S. cerevisiae, will find the findings valuable, as will those working in applied fields where stress adaptation is a critical factor (e.g., industrial yeast fermentation, drug development, disease resistance, cancer research, or aging studies).

      Expertise:

      Evolutionary biology, genomic analysis, and cellular stress responses, with a particular focus on experimental evolution under DNA damage stress in Saccharomyces cerevisiae. Recently graduated and beginner reviewer.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The paper addresses the effect of sugar availability in shaping compensatory evolution. The first observation of the paper is that cell physiology changes by modulating glucose availability also in strains that come with defective DNA replication (ctf4-null previously studied by the authors). An intriguing result is that ctf4-null grows comparatively better in low concentrations of glucose. This is hypothesized to be a consequence of both the decrease in dNTPs in low glucose, which causes slow down of fork progression, and/or reduced fork collapse at rDNA locus. Hence, wild types and ctf4-null show an opposite trend: in the mutant, the lowest concentration of glucose is the least affected by the mutation; in wild type, the highest concentration is the least affected. Adaptation rate is inversely related with the initial fitness. The effect on physiology and adaptation rate is a starting point for asking the key question: are evolutionary trajectories influnced by the growth conditions? The answer is negative: evolution experiments show the very same core of genetic changes at all sugar concentrations. The result is apparently at odds with previous publications, and the authors conclude that, in this particular setting, availability of carbon sources plays a minor role compared to impaired DNA replication. The different rates of adaptation in WT and mutant is rather explained by the initial fitness at the different glucose concentrations, which, as mentioned, is opposite in WT and ctf4-null mutants. The paper also reports a new mutation in MED14, component of the transcription mediator complex, which rescues the lack of Ctf4 activity. The study is interesting and asks a relevant question. The experiments are well executed and convincing, but the paper can be strengthened by testing some of the hypotheses which are put forward.

      Main points

      1- The raw data for evolutionary dynamics (Figure S2C) are fitted with the power law suggested by Wiser and Lenski, and return different values of the parameter 'b'. The authors say that the result depends greatly on the initial conditions ("due to the varying initial fitness of ctf4Δ cells across different glucose environments, they display an opposite trend to WT"). Around the initial values, however, the curves are non-monotonic, especially for low glucose availability. Both for WT and ctf4-null there is an initial drop in fitness, after which fitness increases. If one would neglect this initial dynamics, the value of the parameter 'b' would likely be different.

      The non-monotonic trend in fitness highlighted by the reviewer is likely due to technical factors: Fitness at Generation 0 was measured with high precision in a low-throughput manner early in the project. In contrast, fitness from Generation 100 to 1000 was measured later in the study in a high-throughput fashion, necessitated by the large number of competitions conducted (96 wells × 4 time points × 6 replicates = 2304 assays). This difference in methodologies may have introduced a slight offset when the datasets were combined at Generation 100. Following the reviewer’s suggestion, we have excluded the data point at Generation 100 responsible for this non-monotonic behavior and re-fitted the curves. While this adjustment has caused minor changes in the parameter ‘b’, the qualitative trends, particularly the opposing trends between WT and ctf4Δ as glucose increases, remain consistent (Figure_rev_only 1). To ensure transparency, we have retained all recorded fitness values in the original figure for reference.

      In general, one can question whether curves with this shape are best fitted by the power law proposed by Wiser and Lenski. For example, for the WT 0.25% glucose the linear fit gives a better R2 (why do the authors show the linear fit anyway?). This impression is further reinforced by the observation that Wiser and Lenski fit dynamics that last 50.000 generation, here the curves last 1/50th of it. In conclusion, I would question whether the parameter 'b' is a solid measurement of 'rate of adaptation'. Also, normalizations makes it difficult to appreciate the result shown in Figure 2B. I think the authors should look for a different way to show the different trend in adaptation dynamics for different glucose concentrations between wild types and mutants. For example, they could move Figure S2C in the main text to stress the result shown in Figure 2C, which already shows the difference between WT and mutant. This is especially true if what Figure 2C shows is (evo-anc)/evo. This is not fully clear to me: in the legend it refers to the delta, in the label of the y-axis I read that this is a percentage.

      We thank the reviewer for prompting us to clarify our methods for reporting fitness changes over time. The fitness values are reported, throughout the paper, as a percentage change relative to the reference WT strain. The gain in fitness during evolution (reported as Δ) represents the difference between the evolved strain (evo%) and the ancestral strain (anc%), calculated as Δ = evo% - anc%. This represents the absolute gain, rather than the relative gain. This value is still reported as a percentage as it’s the same scale and unit as the two values being subtracted. We have included additional details to clarify this aspect in the figure legend.

      “(C) Absolute fitness gains (Δ) at generation 1000 for evolved WT (upper panel, black) and ctf4Δ (lower panel, orange) populations. Box plots show median, IQR, and whiskers extending to 1.5×IQR, with individual data points beyond whiskers considered outliers. Absolute fitness gains were calculated by subtracting the ancestral relative fitness from the relative fitness of the evolved (Δ = evo% - anc%), both calculated as percentages relative to the same reference strain in the same glucose concentration.”

      To conclude: the data show a different trend between wild types and mutants, which is interesting. Fitting it with the power law seems to be neither required nor appropriate. I suggest the authors to show the WT vs mutant pattern differently.

      We followed the reviewer’s suggestion and moved Figure S2C, which depicts the detailed fitness trajectories over time, into the main manuscript as Figure 2D. We agree that presenting these trajectories alongside the absolute fitness gains (now in Figure S2C) provides a more intuitive and effective depiction of the evolutionary dynamics of WT and ctf4Δ strains without relying solely on the power-law fit. Additionally, we quantified the mean adaptation rate, calculated as the absolute fitness gain (Δ) divided by the total number of generations (now Figure 2B). While no individual method definitively captures the adaptation rates across the experiment, these complementary analyses consistently highlight the same trends noted by the reviewer. We have re-written the main text as follows:

      Line 171: “By generation 1000, both WT and ctf4Δ evolved lines achieved, on average, slightly higher fitness in low glucose compared to high glucose conditions (Fig S2B). However, due to the varying initial fitness of ctf4Δ cells across different glucose environments, they recovered the same extent of the original defect (Fig S2C). ctf4Δ lines displayed an opposite trend to WT, with increasing absolute fitness throughout the experiment as glucose concentration rose (Fig S2B vs S2D). The differint absolute fitness gains over the same number of generations highlight distinct mean adaptation rates (Fig 2B). These differences are evident when examining the evolutionary dynamics of the evolved lines over time (Fig 2C). Additionally, we approximated the fitness trajectories using the power law function (Fig 2C, dashed purple lines), previously proposed to describe long-term evolutionary dynamics in constant environments (Wiser et al., 2013). The parameter b in this formula determines the curve's steepness, and can be used to quantify the global adaptation rate over generations (Fig S2E). Collectively, these analyses demonstrate that, unlike WT cells, ctf4Δ lines adapt faster in the presence of high glucose. This evidence aligns with the declining adaptability observed in other studies (Moore et al., 2000; Kryazhimskiy et al., 2014; Couce & Tenaillon, 2015), where low-fitness strains consistently adapt faster than their more fit counterparts (Fig S2F).”

      Overall, these results demonstrate that cells can recover from fitness defects caused by constitutive DNA replication stress regardless of the glucose environment. However, adaptation rates under DNA replication stress exhibit opposing trends compared to WT cells, with faster adaptation yielding greater fitness gains in higher glucose conditions.”

      2- In Figure S2C, the individual trajectories for WT at 2% glucose are strangely variable. In this case, plotting the average does not make too much sense. This result is strange, since this is the default condition, where cells are grown without any change of sugar concentration. Can the authors give any rationale? Are there other available results to replace those published in Figure S2C?

      We agree with the reviewer that the individual trajectories for WT at 2% glucose are intriguing. However, we do not find these results necessarily “strange” as they could be explained by the following rationale: WT cells have been cultivated in 2% glucose since the 1950s, likely fixing most beneficial mutations for this condition. When many isogenic strains are evolved in parallel, (a) some lines show no improvement due to the scarcity of available beneficial mutations, (b) others exhibit slight decreases in fitness due to genetic drift fixing deleterious mutations, and (c) a few lines discover rare beneficial mutations, leading to fitness increases. In contrast, other conditions represent “newer” environments with larger mutational target sizes, resulting in more consistent outcomes.

      Prompted by the reviewer’s comment, we look for other studies reporting detailed fitness measurements of evolved WT strains in standard laboratory media. We downloaded and plotted the fitness data from Johnson et al. 2021, where authors studied the evolution of WT strains over 10.000 generations. Interestingly, we see that in the early phase of the evolution (generations 500-1400) evolved lines show similar levels of variability in fitness as the one reported in our study (Figure_rev_only 2). Of note is that in Johnson et al. 2021 most of the adaptive mutations alleviate the toxicity of the ade2-1 allele. In our WT strain the gene was preemptively restored, furter reducing the target size for adaptation in YPD.

      We believe it is important to report these measurements and decided to leave the original data, with the appropriate quantifications of variability, in Figure 2.

      3- The molecular explanation given for the rescue of ctf4-null proposes a very relevant role for dNTPs downregulation. Particularly, both for Irx1 and med14-H919P, the authors propose that this happens via Rnr1 downregulation. At this stage, this is only a hypothesis. The molecular verification of the central role of Rnr1 downregulation would make the conclusion much stronger. For example, a preliminary test would imply that duplicating RNR1 in ctf4-null irx1-null and/or ctf4-null med14-H919P would revert the rescue. Any other experiment addressing this point would be useful to improve the paper.

      We agree that the experiment suggested by the reviewer, or similar tests, would substantiate our hypotheses and strengthen the paper. Specifically, we plan to perturb dNTP production in both ctf4Δ ixr1Δ and ctf4Δ med14-H919P mutants through genetic manipulation of known factors involved in dNTP synthesis. We will then compare the resulting fitness to the expectations based on our hypotheses: reduced fitness benefits of the double mutants upon increasing dNTP levels and/or increased fitness in ctf4Δ mutants by decreasing dNTP levels through alternative mechanisms.

      4- The authors propose from Figure S4B that the rescue of ixr1-null is less evident at low sugar concentration since both conditions trigger a reduction of dNTPs. I think this is interesting, since it would provide a link between glucose concentration and evolutionary trajectories to adaptation, which is what the authors wanted to study. In particular, one would predict that 0.25% glucose would see less ixr1-null than the other glucose conditions. I could not (was not able to) confute this hypothesis from the data shown in the paper. Likewise, for med14-H919P. If the authors have not tested it, it would be worth trying.

      We had reported the appearance and frequency of all ‘core adaptive mutations’ (Figure S6C) but did not explicitly test the likelihood of their appearance under different glucose conditions. Following the reviewer’s suggestion, we have now performed χ2 tests (on the presence or absence of mutations) and ANOVA tests (on their mean frequency) to determine whether any mutation is particularly enriched or depleted in a given glucose environment. At first glance, the results do not support the hypothesis proposed by the reviewer. However, we note that although ixr1 mutants are less beneficial in low glucose than in high glucose, they still confer an 8% fitness advantage, which is likely sufficient to drive clones to fixation. We believe the reviewer’s reasoning is correct but is potentially masked by the still elevated fitness advantage of ixr1 in low glucose.

      To better convey the results of this analysis, we have included a visual representation of the presence and frequency of the mutations in Figure 6A, and the results of the χ2 and ANOVA tests in Supplementary File 5. We also comment on the analysis as follows:

      Line 314: “Similarly, we did not detect differences in the frequency of occurrence (χ2 tests) or average fractions (ANOVA test) achieved by the mutations in the populations evolved under different glucose environments (Fig 6A, Fig S4C and Supplementary File 5. The presence of all mutations in the final evolved lines correlated with their fitness benefits, suggesting how their selection in all glucose conditions was mostly dictated by their relative fitness benefits, rather than the environment (Fig 6A).”

      5- The combination of the four genetic adaptation (Fig 6B) would benefit from an experimental verification to show that the different solutions are not mutually exclusive. This is not obvious: if more than one solution acts by reducing dNTPs, maybe their combined effect is less strong than what measured theoretically. The authors could derive some clones at the end of the experiment and Sanger sequencing some of the four genes, to confirm the co-presence of some of them in the same cell.

      The co-occurrence of nearly every combination of the four core adaptive mutations we identified can be inferred from their relative frequencies, as revealed by deep whole-genome sequencing of the evolved populations (Fig. S4C). In these data, we observe populations carrying each pairwise combination of mutations at frequencies exceeding 50%, implying their coexistence. Moreover, many combinations of mutations approach or reach fixation. A particularly striking example is ctf4Δ Population 11, evolved in 8% glucose, where all core adaptive mutations are present at 100% frequency. These findings provide robust evidence that the different adaptive solutions are not mutually exclusive and can coexist within the same genetic background.

      Nevertheless, we agree that experimentally verifying the compatibility and fitness of the four genetic adaptations described in Figure 6B (now Fig 6C) would further strengthen our conclusions. To this end, we plan to reconstruct all combinations of mutations observed at high frequency in the final evolved populations. We will then measure their fitness and compare it to that of the evolved populations, as well as to the theoretical expectations based on additivity currently presented in Figure 6C.

      Minor points

      Figures

      • S4B: in the legend it should be explained that it is compared to ctf4D

      We now report how the values were obtained in the figure legend:

      (D = |anc%|-|reconstraucted%|)

      -2A: the color code is not fully clear to me: what does green and blue indicate? higher and lower than 2%?

      We apogise for not having included an explicit description of the color code in Figure 2A. Throughout the paper blue refers to glucose starvation (light blue for 0,25%, dark blue for 0,5%), while green refers to glucose abundance (light blue for 2%, dark blue for 8%). We now include a detailed description of the color code when it first appears (Fig 1B) and make sure is properly reported in all figure legends.

      • S3A: the authors should show the statistical difference between WT and ctf4-null, which is mentioned as non-existent in p.6

      The p value is now represented in Fig S3A

      Text

      • RNR1 is not really the gene with the highest score in Figure 5D, not even close: can you give a rationale for pin-pointing it (see also main point 3)?

      The reviewer is correct. Perturbations of the mediator complex, which regulate the expression of most of RNA PolII transcripts, is expected to result in changes in the expression of a large set of genes. However, our focus on dNTPs and RNR1 is based on the following rationale:

      1. Gene Ontology Enrichment Analysis: The downregulated genes in our dataset are enriched for the 'nucleotide metabolism' term, which includes pathways critical for dNTP production and directly linked to DNA replication and repair.

      2. Role of RNR1: Among the downregulated genes, RNR1 stands out as it encodes the major subunit of ribonucleotide reductase, the rate-limiting enzyme in dNTP synthesis. This enzyme is essential for DNA replication, and cells experiencing constitutive DNA replication stress, as in our system, are particularly sensitive to changes in dNTP levels.

      To make this rationale more explicit to the reader, we are adding the following sentence in the discussion:

      Line 404: “Nucleotide metabolism, particularly ribonucleotide reductase, is essential for dNTP production. Given the role of dNTPs in regulating DNA replication and repair, the advantage of med14-H919P mutants in the ctf4Δ background may stem from reduced dNTP levels caused by the perturbed TID domain."

      In addition, following the reviewers’ suggestions, we are conducting additional experiments to investigate the role of med14-H919P mutants in enhancing fitness under conditions of constitutive DNA replication stress (See response to reviewer #4). We anticipate that the final revised manuscript will offer further insights into the role of dNTPs or present alternative explanations for the observed phenomena.

      • The med14-H919P mutation is observed in 22/48 wells. I guess the authors checked already: are some of these wells close to each other in the plate?

      Correct. We took significant precautions in our experimental design to prevent cross-contamination, as outlined in the Materials and Methods section. Specifically, rows of ctf4Δ samples were alternated with rows of WT samples. Daily dilutions were then performed row by row using a 12 channels pipette. This approach ensured that any potential carry-over of cells would result in them being placed in wells containing a different genotype, where they would be eliminated by the consistent use of genotype-specific drugs.

      As a result of these measures, we do not observe any distinct pattern of core genetic adaptation corresponding to the plate layout (Figure_rev_only 3). The only exception are mutations in IXR1, which appear in all ctf4Δ strains (albeit with different alleles, see supplementary File 3). Moreover, we reasoned that if a highly fit strain had invaded other wells, all the pre-existing mutations from its lineage would have been detected in those wells. However, apart from the recurrent ixr1 and rad9 mutations, which are also strongly adaptive, we find no evidence of shared mutations in wells carrying the med14-H919P allele (Figure_rev_only 4).

      • Compensatory evolution of ctf4-null in 2% glucose is the experiment published by Fumasoni and Murray in eLife. In that paper, there is no trace of mutations in MED14. I think the authors should comment on this (different method for detecting putative compensatory mutations?).

      We also noticed the absence of MED14 mutations in the eLife study by Fumasoni and Murray and find this discrepancy intriguing. One possible explanation lies in methodological differences. Our current study employed an improved version of the mutational analysis pipeline. However, we have not yet reanalyzed the original data from the previous study to determine whether MED14 mutations were present but undetected.

      Interestingly, in the current study, we observed that in 2% glucose, MED14 mutations arose in only 3 out of 12 populations, a frequency lower than in other glucose conditions (Figure S6C). Assuming a similar frequency occurred in the 8 populations evolved in 2% glucose by Fumasoni and Murray (2020), one would expect only 2 populations to carry the mutation. This number falls below the threshold required for our algorithm to detect statistically significant parallelism.

      Additionally, two significant experimental differences may also contribute to the observed discrepancy. First, the culture volumes and vessels differed: 10 mL cultures in tubes were used previously, whereas 1.5 mL cultures in 96-well plates were used in the current study.

      • I may be mistaken, but Szamecz et al do not actually investigate whether different conditions result in different evolutionary trajectories (i.e., different genetics), and so their results may not be at odds with those presented here.

      The reviewer is correct that Szamecz et al. do not explicitly test whether different conditions result in different evolutionary trajectories. However, in the section titled “Compensatory Evolution Generates Diverse Growth Phenotypes across Environments,” they examine how lines evolved in 2% YPD perform across various environments. They report how in roughly 50% of the cases tested, evolved lines showed either no improvement or even some lower fitness than the ancestor (Figure 5A).

      While this could be explained by the accumulation of detrimental non-adaptive mutations in specific contexts, it likely implies that the adaptive strategies compensating for the original mutation in one environment do not confer similar benefits in other environments. This observation contrasts with our findings in Figure 6D, where we demonstrate that the main adaptive strategies provide a consistent benefit across diverse environments, including those with glucose, nitrogen, or phosphate abundance or starvation.

      We have now modified the introduction, results and discussion to avoid misleading interpretations:

      Line 42: “Szamecz and colleagues examined the evolutionary trajectories of 180 haploid yeast gene deletions over 400 generations (Szamecz et al., 2014). They found that, while fitness recovery occurred in the environment where evolution took place, the evolved lines often showed no improvement over their ancestors in other environments. This suggests that compensatory mutations beneficial in one environment often fail to restore fitness in others.”

      Line 327: “A previous study in yeast showed how evolved lines which compensate for detrimental defects of gene deletions in standard laboratory conditions often failed to show fitness benefits compared to their ancestor when tested in other environments (Szamecz et al., 2014). We thus investigated the extent to which the core genetic adaptation to DNA replication stress was beneficial under alternative nutrient conditions.”

      Line 422: “What could explain the discrepancies between our results, and previous studies on evolutionary repair highlighting the role of the environment in shaping evolutionary trajectories (Filteau et al., 2015), and the heterogeneous behavior of evolved lines in various environments (Szamecz et al., 2014)?”

      typos

      p.18, line 564 preformed -> performed

      1. 6 line 189 with a strongly skew -> with a strong skew ?

      Typos are now corrected in the main text

      Reviewer #2 (Significance):

      This is a well-done paper that could be of interest for the community of evolutionary biologists, scientists working on metabolism and cell division. It addresses an interesting problem, how metabolism affects compensatory evolution. Among the strengths: experiments are well done, the results are novel, the cross-talk between metabolism and evolutionary repair is intriguing. Among the weaknesses, the fact that the molecular explanations for the observations are only hypothesized and not tested experimentally. This is where the authors could improve the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This paper combines phenotypic and genomic data from an experimental evolution study in yeast to assess how repeatable evolution is in response to DNA replication stress. Importantly, the authors ask whether genotype by environment interactions influence repeatability of their evolved lines. To this end, the authors have constructed an elegant highly-replicated experiment in which two yeast genotypes (WT and CTF4 KO) were evolved under a variety of glucose levels for 1,000 generations. Recurrent mutations are found across many replicates, suggesting that repeatability is robust to GxE interactions. Of course, the authors correctly identify that these results are dependent on many particulars, as is always the case in biology, but provide a comprehensive discussion to accompany their results. I do not have any major comments to give, but simply some suggestions and points of clarification.

      Major comments: N/A

      Minor comments:

      L19: I found the definition for compensatory evolution/mutations to be somewhat vague in the introduction (and subsequently throughout the text). It's clear that this was written for a more medical/physiological audience, but without a more explicit explanation of compensatory evolution/mutations, it became difficult to properly weigh some claims/discussions made by the authors later on. Do you define compensatory mutations as those which completely recover WT function/fitness, or are simply of opposite effect to the altered genotype? Others define "compensatory evolution" as simply any epistastically interacting amino acid substitutions (Ivankov et al, 2014). It would be nice to see more explicitly defined.

      We thank the reviewer for highlighting the need for a precise definition of compensatory evolution and compensatory mutations. We recognize that the literature encompasses multiple definitions, including the one cited by the reviewer, which emphasizes compensatory mutations within the context of structural biology. This particular definition, prevalent in molecular evolution, was introduced by Kimura (Kimura, 1985) and is frequently used to explain the co-occurrence of amino acid mutations within a protein. These mutations offset each other’s defects, restoring or maintaining protein function. Here, however, we are using an older and broader definition of compensatory mutation, first introduced by Wright (Wright, 1964, 1977, 1982) and frequently used in evolutionary genomics (e.g., Moore et al., 2000; Szamecz et al., 2014; Rajon and Mazel, 2013; Eckartt et al., 2024). This definition includes any mutation in the rest of the genome that compensates (fully or partially) for another mutation's detrimental effects on fitness.

      We have now included this definition in the introduction:

      Line 19: “Compensatory evolution is a process by which cells mitigate the negative fitness effects of persistent perturbations in cellular processes across generations. This adaptation occurs through spontaneously arising compensatory mutations anywhere in the genome (Wright, 1964, 1977, 1982) that partially or fully alleviate the negative fitness effects of perturbations (Moore et al., 2000). The successive accumulation of compensatory mutations over evolutionary timescales progressively repair the cellular defects, ultimately restoring fitness.”

      Line 361: “Our findings demonstrate that while glucose availability significantly affects the physiology and adaptation speed of cells under replication stress, it does not alter the fundamental genome-wide compensatory mutations that drive fitness recovery and evolutionary repair.”

      Along these lines, I would have liked to see a more direct comparison/discussion of the degree to which deletion lines recovered. I can see from Fig 2E and Fig S2B that fitness increased quite a bit; would it not be possible to include a figure on the degree of compensation (basically relative fitness of evolved deletion lines - relative fitness of ancestral deletion lines)?

      If the reviewer is suggesting calculating the difference between the evolved and ancestor fitness, the data is already in Figure S2B and S2D, defined as ‘Absolute fitness gains Δ’ and calculated as Δ = evo% - anc%.

      If instead is suggesting to plot the fitness of evolved deletion lines (Y axis) against the relative fitness of ancestral deletion lines (X axis), we have now produced the plot is Figure S2F.

      To better understand the extent of the fitness recovery in Ctf4 strains, we have also calculated and plotted the ‘relative fitness gain’ calculated as |evo%| / |anc%| *100 (Figure S2C)

      We are now commenting on these comparisons in the following paragraph:

      Line 171: “By generation 1000, both WT and ctf4Δ evolved lines achieved, on average, slightly higher fitness in low glucose compared to high glucose conditions (Fig S2B). However, due to the varying initial fitness of ctf4Δ cells across different glucose environments, they recovered the same extenct of the original defect (Fig S2C), displaying an opposite trend to WT, with increasing absolute fitness throughout the experiment as glucose concentration rose (Fig S2B vs S2D). The differint absolute fitness gains over the same number of generations highlight distinct mean adaptation rates (Fig 2B). These differences are evident when examining the evolutionary dynamics of the evolved lines over time (Fig 2C). Additionally, we approximated the fitness trajectories using the power law function (Fig 2C, dashed purple lines), previously proposed to describe long-term evolutionary dynamics in constant environments (Wiser et al., 2013). The parameter b in this formula determines the curve's steepness, and can be used to quantify the global fitness change over generations (Fig S2E). Collectively, these analyses demonstrate that, unlike WT cells, ctf4Δ lines adapt faster in the presence of high glucose. This evidence aligns with the declining adaptability observed in other studies (Moore et al., 2000; Kryazhimskiy et al., 2014; Couce & Tenaillon, 2015), where low-fitness strains consistently adapt faster than their more fit counterparts (Fig S2F).”

      L57: Another minor nitpick that just comes down to semantics. When discussing "96 parallel populations", it invokes a higher sense of replication than is actually present in the study. I would rephrase this to something along the lines of "12 replicate populations across 8 treatments under conditions of [...]".

      We changed the sentence as follows:

      Line 66: “We evolved 96 parallel populations of budding yeast, organized into 12 replicate lines, across four conditions of glucose availability (from starvation to abundance) with or without replication stress.”

      L185-187: The wording here needs to be clarified. Be explicit in that are examine the ratio (or count) of synonymous to non-synonymous mutations here, otherwise the interpretations appears to be direct contradiction to the (as written) results. Only after viewing the supplemental figure was I able to figure out what exactly was meant here.

      We changed the sentence as follows:

      Line 212: “We found no significant differences in the numbers of synonymous mutations detected in evolved populations in WT and ctf4∆ populations (Fig. S3A). These results support the hypothesis that replication stress in ctf4∆ lines favors the retention of beneficial mutations, rather than simply increasing the overall mutation rate.”

      L349-350: The authors observe higher rates of adaptation in deletion lines than WT lines, and discuss this in adequate detail. Although not explicitly mentioned, this is consistent with a diminishing returns epistasis model (that could be beneficial to discuss, but is not necessary), which has been implicated in modulating the degree of repeatability observed along evolutionary trajectories (Wünsche et al. 2017). Although definitely not required for this already very nice manuscript, I think it would be very rewarding if the authors were to eventually analyze fine-scale dynamics of phenotypic and genomic adaptation to mine for these putative interactions and their influence on repeatability.

      We agree with the reviewer on how our results align with a model of diminishing returns epistasis. This pattern is apparent not only between ctf4Δ and WT lines but also among ctf4Δ lines evolved in different glucose conditions. This phenomenon likely arises from the interaction of various adaptive mutations, which we aim to explore further in a dedicated manuscript. However, until we do so, we prefer to refer generally to a pattern of declining adaptability. To explicit this trend we have now included Fig S2F and commented on it in the manuscript:

      Line 181: “This evidence aligns with the declining adaptability observed in other studies (Moore et al., 2000; Kryazhimskiy et al., 2014; Couce & Tenaillon, 2015), where low-fitness strains consistently adapt faster than their more fit counterparts (Fig S2F).”

      Line 388: "Our results are consistent with declining adaptability, as evidenced by the reduced rates of adaptation observed both between ctf4Δ and WT lines and among ctf4Δ lines evolved in different glucose conditions (Fig S2F)"

      Reviewer #3 (Significance):

      It is clear to me that a great deal of time and care has been put into this study and the preparation of this manuscript. The science and analyses are appropriate to answer the questions at hand, and it bodes well that whenever I had a question pop up while reading, they were typically answered immediately after. I think that this manuscript will be broadly relevant to both biologists both evolutionary and clinical, and was written in a way to be accessible to both.

      As someone with an expertise in repeatable evolution, I felt most excited by the observation of so many parallel substitutions at a single amino acid across deletion lines. As the authors rightfully point out in the results and discussion, it's likely that this degree of robustness is highly dependent on the particular mechanism of disruption that cells experience. The authors then go above and beyond to functionally validate the putative molecular mechanisms of (repeatable) adaptation in this system. While it may not always be possible to accomplish in non-model organisms, such multi-modal approaches will be crucial to advance the field of repeatable evolution.

      Reviewer #4 (Evidence, reproducibility and clarity):

      The authors investigated the effects of DNA replication stress on adaptation in different nutrient availabilities by passaging wild-type and ctf4Δ Saccharomyces cerevisiae in media with varying levels of glucose over ~1000 generations. The ctf4Δ strain experiences increased DNA replication stress due to the deletion of a non-essential replication fork protein. The authors found differences in evolution between wild-type and ctf4Δ yeast, which held across different growth media. This study identified a compensatory single amino acid variant in Med14, a protein in the mediator complex of RNA polymerase II, that was specifically selected in ctf4Δ strains. The authors conclude that while environmental nutrient availability has implications for cell fitness and physiology, adaptation is largely independent and instead dependent on genetic background. The data provide excellent support for the key aspects of the models, although some details are (to me) overstated.

      Major comments:

      • A ctf4Δ mutant strain was used to investigate the effects of replication stress. Why was this mutant chosen instead of other deletions that cause different types of replication stress?

      We appreciate the opportunity to clarify our rationale for choosing the ctf4Δ mutant. The following are the main reasons why we believe ctf4Δ strains represent an ideal tool to study a global perturbation of the DNA replication program over evolutionary timescales:

      1. General replication stress: The absence of Ctf4 perturbs replication fork progression, leading to a spectrum of replication stress-related phenotypes, including DNA damage sensitivity, single-stranded DNA gaps, reversed forks (Abe et al., 2018; Fumasoni et al., 2015), checkpoint activation (Poli et al., 2012), cell cycle delays (Miles and Formosa, 1992), increased recombination (Alvaro et al., 2007), and chromosome instability (Kouprina et al., 1992). This broad disruption makes it an excellent model for observing global perturbations in replication processes. In contrast, other mutants typically affect specific enzymatic (e.g., POL32 and RRM3) or signaling (e.g., MRC1) functions, making them better suited to address specific questions.
      2. Constitutive stress: Unlike drug-induced stress (e.g., Hydroxyurea; Krakoff et al., 1968) or conditional depletion systems (e.g., GAL1-POLε; Zhang et al., 2022), which cells can easily circumvent through single mutations, ctf4Δ enforces persistent replication stress. Its deletion cannot be complemented by a single mutation, ensuring a robust and consistent stress environment for evolutionary studies.

      We have now modified the main text to convey these advantages in a concise form:

      Line 91: “In the absence of Ctf4, cells exhibit multiple defects commonly associated with DNA replication stress, such as single-stranded DNA gaps and altered replication forks (Fumasoni et al., 2015), leading to basal cell cycle checkpoint activation (Poli et al., 2012). These defects result in severe and persistent growth impairments, cell cycle delays, elevated nucleotides pools and chromosome instability (Miles and Formosa, 1992; Kouprina et al., 1992; Poli at al., 2012), making ctf4Δ mutants an ideal model for studying the cellular consequences of general and constitutive replication stress over evolutionary time.”

      It's not clear from the study that the effects are generalizable to other forms of replication stress.

      As with any method to induce DNA replication stress (including commonly used drugs like HU) each approach inevitably affects replication in a specific manner. Testing the broader applicability of our conclusions would require evolving additional strains with different replisome perturbations. For instance, mutations in ELG1 and CTF18 (affecting the alternative Replication Factor C), POL30 (affecting the sliding clamp PCNA), POL32 (affecting Polε), RRM3 (protective helicase) and (MRC1 (coordinating leading strand activities and signalling to the checkpoint) would have to be taken into account. Furthermore, specific mutant alleles of Ctf4 that disrupt interactions with particular binding partners (Such as ctf4–4E and ctf4–3E, perturbing the interaction with the CMG helicase and accessory factors respectively) will be highly informative on which specific aspects of the replication stress generated by the lack of Ctf4 each adaptive mutation alleviate.

      However, accommodating such extensive variability would inflate the sample size to an extent that will become unfeasible within the experimental design focused on capturing parallel evolution over a nutrient gradient (the primary focus of this study). We agree that this is an important question and intend to address it comprehensively in a dedicated future study.

      • The authors could be clearer that a (the?) cause of the ctf4∆ fitness defect is spurious upregulation of RNR1. I don't think it is mentioned until the Discussion, but it is highly relevant to Fig 4, and to the adaptations one would expect from ctf4∆.

      We thank the reviewer for the opportunity to clarify this aspect. We do not think that the fitness defects of ctf4∆ cells stem solely from the spurious upregulation of RNR1. However, we believe that a major aspect of the evolutionary adaptation is aimed at decreasing dNTP levels, potentially through different mechanisms. We are now mentionig increased dNTPs as major phenotype of ctf4∆ and commenting on the hypothesis more clearly in the discussion.

      Line 93: “These defects result in severe and persistent growth impairments, cell cycle delays, elevated nucleotides pools and chromosome instability (Miles and Formosa, 1992; Kouprina et al., 1992; Poli at al., 2012)”

      Line 409: “This condition will, in turn, be detrimental when proliferation rates are high (as in WT in high glucose) but beneficial under constitutive DNA replication stress (ctf4Δ), where cells experience spurious upregulation of dNTP production (Poli et al., 2012; Davidson et al., 2012).

      • In Figure 1E, there is a very large spread in the relative fitness at 2% and 8% glucose, but this was not commented on. Is this heteroscedasticity expected?

      The observed heteroscedasticity is expected. Our competition assays tend to exhibit increased variability when a strain approaches very low fitness levels. Specifically, as one strain nears extinction by the third day of competition, its abundance is estimated based on a much smaller number of events in the flow cytometer. Furthermore, we noticed a small number of reference cells carrying pACT1-yCerulean not showing strong fluorescence in 8% glucose. The nature of this effect is uncertain, and possibly linked to metabolism-linked changes in the cytoplasm. The combination of these two phenomena amplifies the impact of noise inherent to the methodology, leading to increased variability across replicates.

      Nontheless, the overall decreasing fitness trend across glucose conditions, combined with the statistical significance observed between high and low glucose levels, collectively convey a roboust phenotype

      • The med14-H919P mutant was highly selected in ctf4Δ strains, independent of glucose availability. Is this variant found in any natural yeast strains (i.e., are there environments that select for this variant)? Also, if this variant is found in natural strains, does it co-occur with other mutations that could affect DNA replication?

      We agree that this is an intriguing question. To address it, we plan to explore existing databases of variants identified in S. cerevisiae natural isolates. Specifically, we will investigate whether the med14-H919P mutation is present in these strains, identify any potential environmental factors that may select for it, and assess whether it co-occurs with other mutations that could influence DNA replication processes.

      • The statement on lines 271-273 is not particularly well-supported. The analysis of the Warfield data suggest that reduced expression of RNR1 could be causal, but the data don't go as far as showing how the med14 mutation is advantageous in ctf4∆. Further experimentation would be necessary to support the possibilities that the authors discuss.

      The sentence the reviewer refers to is: “Overall, these results show how an amino acid substitution in the Med14 subunit of the mediator complex, putatively affecting transcription, is strongly selected, and advantageous, in the presence of constitutive DNA replication stress.” We are unsure which aspect of the statement is seen as unsupported. The mutation's strong selection in ctf4∆ is demonstrated in Figures 5A, 6A, and S4C, while its advantageous nature is supported by Figures 5B and S4B. Regarding the mechanism, we have been cautious with our phrasing, describing its effect on transcription as "putative" (Line 272) and suggesting that our observations “are compatible with” reduced dNTP availability in med14-H919P cells due to RNR1 downregulation (Line 361).

      The main focus of this study is to explore how nutrient availability influences evolutionary dynamics and compensatory adaptation in cells lacking Ctf4. We believe the identification of a novel selected allele (Fig. 5A) and confirmation of its benefit across glucose conditions (Fig. 5B) serves as an excellent complement to the primary conclusions (present in the title). We invite the reviewer to consider that the molecular basis of such a phenotype is not mentioned in our abstract, as we believe that its precise characterization would require a dedicated study on Med14.

      Nonetheless, we are encouraged by the reviewer’s interest in this newly identified compensatory mutant (also noted by Reviewer #2), and we are eager to perform further experiments to better understand the biological processes affected by this mutation. We plan to extend our work as follows:

      Based on known phenotypes associated with perturbations of Med14, we propose the following novel hypotheses regarding the mechanism by which med14-H919P alleviates ctf4Δ defects:

      1. Decreased replication-transcription conflicts: Conflicts between the transcription machinery and replication forks are known to cause fragile sites, leading to increased chromosome breaks and genomic instability (Garcia-Muse and Aguilera, 2016). A general reduction in PolII transcription during replication, resulting from perturbations of the mediator complex, could reduce these conflicts and mitigate the fitness defects observed in ctf4Δ cells.
      2. Increased cohesin loading: We have demonstrated that amplification of the cohesin loader SCC2 is beneficial in the absence of Ctf4. Recent findings (Mattingly et al., 2022) indicate that the mediator complex recruits SCC2 to PolII-transcribed genes. The med14-H919P mutation may enhance the fitness of ctf4Δ cells by facilitating cohesin loading during DNA replication.
      3. Decreased dNTP levels: As discussed in the manuscript, perturbations of Med14 subunits in the mediator complex reduce the expression of genes, including those associated with nucleotide metabolism. Notably, these include RNR1, the major subunit of ribonucleotide reductase. The med14-H919P mutation could benefit the ctf4Δ background by counteracting the reported spurious increase in dNTPs, which affects replication fork speed (Poli et al., 2012).

      We plan to distinguish between these hypotheses using the following approaches. First, the proposed mechanisms underlying Hypotheses 1 and 3 suggest that med14-H919P is a loss-of-function mutation, while Hypothesis 2 implies a gain-of-function effect. Testing the impact of a heterozygous med14-H919P allele in a homozygous ctf4Δ strain will allow us to differentiate between these two categories of mechanisms. Additionally, we aim to investigate the molecular process affected by the med14-H919P allele by analyzing its genetic interactions with genes involved in replication-transcription conflicts, cohesin loading, and dNTP production (See also response to reviewer #2).

      We believe that the results of these experiments will provide further insights on the mechanism of suppression exerted by med14-H919P in the presence of constitutive DNA replication stress, without diverting the reader from the main message of the paper.

      • The authors comment that the med14-H919P mutant could have implications for the stability of Med14, based on computational modelling. Verifying the stability of the med14-H919P in vivo would strengthen this discussion.

      We believe that in vivo and in vitro structural studies investigating the effect of this mutation on the stability and function of the Mediator complex are beyond the scope of this manuscript. These investigations would be more appropriately addressed in future, dedicated studies focused on these specific aspects.

      • In the discussion, the authors propose that the context of the perturbation may influence the robustness of adaptation. A more detailed explanation of this point (including a discussion of the findings of other similar studies investigating different conditions) would be helpful to further bolster this section.

      We are now supporting this concept more explicitly by commenting on other studies as follows:

      Line 429: “Third, the environment’s influence on compensatory evolution may depend on the specific cellular module perturbed and its genetic interactions with other modules that are significantly influenced by environmental conditions. For example, the actin cytoskeleton, which must rapidly respond to extracellular stimuli, is likely to be more directly influenced by environmental factors (Filateau et al., 2015) compared to the DNA replication machinery, which operates within the nucleus and is relatively insulated from such changes. Supporting this idea, a study examining mutants’ fitness across diverse environments found that conditions such as different carbon sources or TOR inhibition, similar to those used in this study, primarily affected genes involved in vesicle trafficking, transcription, protein metabolism, and cell polarity. In contrast, genes associated with genome maintenance, as well as their epistatic interactions, were largely unaffected (Costanzo et al., 2021)”.

      In addition, to further substantiate this hypothesis, we plan to re-analyze published datasets on fitness and epistatic interactions among genes in various environments, testing whether specific cellular modules are more prone to changes following shifts in nutrient conditions.

      Minor comments: - Competitions were performed between ctf4Δ strains and a constructed strain with yCerulean integrated at ACT1. Is the fitness of the fluorescent strain comparable to the ancestral wild-type strain (i.e., in a competition between the ancestral WT and the fluorescent strain, does either have an advantage)?

      We noticed a slight disadvantage of the reference strain compare to WT, likely due to the costs of the extra fluorescence reporter. However, the disadvantage is minimal, ranging from -0.5 to -2.5 depending on the glucose environment (raw measurments are reported supplementary file 1, sheet 5). To take this into account, all fitness reported in figures are normalized for the WT value measured in the same environment line 613: “Relative fitness of the ancestral WT strain was used to normalize fitness across conditions.​​”

      • In Figure 3, the legends for panels B and C appear to be swapped. Discussion of Figure 3 on pages 6 and 7 appear to reference the wrong panels.

      We are unsure about this typo. Main text and figure legend seem to refer to the appropriate panels, 3B for mutation fractions and 3C for mutation counts. Perhaps the organization of the panels with B being under A instead of on its right confounds the reader?

      • In Figure 4A and B, having the same colour scale between both heatmaps is misleading, as the scales are different. Consider having the same scale across both heatmaps so that enrichments are visually comparable.

      Following the reviewer’s suggestion we have have chosen a uniform heatmap to visually represent GO terms enrichment in WT and ctf4∆ genetic backgrounds.

      • In Figure 4C, having a legend in the figure for node size would be helpful to understand the actual number of populations with mutations in each gene.

      A legend for node size has now being added next to Figure 4C.

      Reviewer #4 (Significance):

      In this study, a high-throughput evolution experiment uncovered the effects of genetic background on the development of adaptive mutations. The authors were able to identify a single amino acid variant of Med14 (med14-H919P) that was positively selected in ctf4Δ. Furthermore, they demonstrated the causality of med14-H919P in conferring a fitness advantage in ctf4Δ. The novelty of this mechanistic finding opens future avenues of investigation regarding the interaction network of the mediator complex in conditions of DNA replication stress. A limitation of the study is that only one mechanism of replication stress was assessed (ctf4Δ). Other gene mutations that cause replication stress would be interesting to assess and would provide a more thorough investigation of the effects of DNA replication factors on evolvability. This work will be of interest to researchers in the population genetics and genotype-by-environment fields, as it suggests the robustness of evolvability to environmental factors in the specific condition of DNA replication stress. As discussed by the authors, this finding differs from other works that have linked environmental conditions to adaptive evolution to different conditions, and is concordant with work that indicates the robustness of genetic interactions to environmental stresses. Furthermore, the identification of the highly-selected med14-H919P variant will be of interest to the DNA replication field. There is the potential for future work investigating the role of Med14 in mediating the response to DNA replication stress in both yeast and mammalian cell contexts, since the authors note that there are links between altered mediator complex regulation and cancers. Although I suspect that the very different regulation of RNR in mammalian cells makes it unlikely that the kind of upregulation of dNTP pools seen in ctf4∆ would be induced by replication stress in mammalian cells.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The authors investigated the effects of DNA replication stress on adaptation in different nutrient availabilities by passaging wild-type and ctf4Δ Saccharomyces cerevisiae in media with varying levels of glucose over ~1000 generations. The ctf4Δ strain experiences increased DNA replication stress due to the deletion of a non-essential replication fork protein. The authors found differences in evolution between wild-type and ctf4Δ yeast, which held across different growth media. This study identified a compensatory single amino acid variant in Med14, a protein in the mediator complex of RNA polymerase II, that was specifically selected in ctf4Δ strains. The authors conclude that while environmental nutrient availability has implications for cell fitness and physiology, adaptation is largely independent and instead dependent on genetic background. The data provide excellent support for the key aspects of the models, although some details are (to me) overstated.

      Major comments:

      • A ctf4Δ mutant strain was used to investigate the effects of replication stress. Why was this mutant chosen instead of other deletions that cause different types of replication stress? It's not clear from the study that the effects are generalizable to other forms of replication stress.
      • The authors could be clearer that a (the?) cause of the ctf4∆ fitness defect is spurious upregulation of RNR1. I don't think it is mentioned until the Discussion, but it is highly relevant to Fig 4, and to the adaptations one would expect from ctf4∆.
      • In Figure 1E, there is a very large spread in the relative fitness at 2% and 8% glucose, but this was not commented on. Is this heteroscedasticity expected?
      • The med14-H919P mutant was highly selected in ctf4Δ strains, independent of glucose availability. Is this variant found in any natural yeast strains (i.e., are there environments that select for this variant)? Also, if this variant is found in natural strains, does it co-occur with other mutations that could affect DNA replication?
      • The statement on lines 271-273 is not particularly well-supported. The analysis of the Warfield data suggest that reduced expression of RNR1 could be causal, but the data don't go as far as showing how the med14 mutation is advantageous in ctf4∆. Further experimentation would be necessary to support the possibilities that the authors discuss.
      • The authors comment that the med14-H919P mutant could have implications for the stability of Med14, based on computational modelling. Verifying the stability of the med14-H919P in vivo would strengthen this discussion.
      • In the discussion, the authors propose that the context of the perturbation may influence the robustness of adaptation. A more detailed explanation of this point (including a discussion of the findings of other similar studies investigating different conditions) would be helpful to further bolster this section.

      Minor comments:

      • Competitions were performed between ctf4Δ strains and a constructed strain with yCerulean integrated at ACT1. Is the fitness of the fluorescent strain comparable to the ancestral wild-type strain (i.e., in a competition between the ancestral WT and the fluorescent strain, does either have an advantage)?
      • In Figure 3, the legends for panels B and C appear to be swapped. Discussion of Figure 3 on pages 6 and 7 appear to reference the wrong panels.
      • In Figure 4A and B, having the same colour scale between both heatmaps is misleading, as the scales are different. Consider having the same scale across both heatmaps so that enrichments are visually comparable.
      • In Figure 4C, having a legend in the figure for node size would be helpful to understand the actual number of populations with mutations in each gene.

      Significance

      In this study, a high-throughput evolution experiment uncovered the effects of genetic background on the development of adaptive mutations. The authors were able to identify a single amino acid variant of Med14 (med14-H919P) that was positively selected in ctf4Δ. Furthermore, they demonstrated the causality of med14-H919P in conferring a fitness advantage in ctf4Δ. The novelty of this mechanistic finding opens future avenues of investigation regarding the interaction network of the mediator complex in conditions of DNA replication stress. A limitation of the study is that only one mechanism of replication stress was assessed (ctf4Δ). Other gene mutations that cause replication stress would be interesting to assess and would provide a more thorough investigation of the effects of DNA replication factors on evolvability.<br /> This work will be of interest to researchers in the population genetics and genotype-by-environment fields, as it suggests the robustness of evolvability to environmental factors in the specific condition of DNA replication stress. As discussed by the authors, this finding differs from other works that have linked environmental conditions to adaptive evolution to different conditions, and is concordant with work that indicates the robustness of genetic interactions to environmental stresses. Furthermore, the identification of the highly-selected med14-H919P variant will be of interest to the DNA replication field. There is the potential for future work investigating the role of Med14 in mediating the response to DNA replication stress in both yeast and mammalian cell contexts, since the authors note that there are links between altered mediator complex regulation and cancers. Although I suspect that the very different regulation of RNR in mammalian cells makes it unlikely that the kind of upregulation of dNTP pools seen in ctf4∆ would be induced by replication stress in mammalian cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper combines phenotypic and genomic data from an experimental evolution study in yeast to assess how repeatable evolution is in response to DNA replication stress. Importantly, the authors ask whether genotype by environment interactions influence repeatability of their evolved lines. To this end, the authors have constructed an elegant highly-replicated experiment in which two yeast genotypes (WT and CTF4 KO) were evolved under a variety of glucose levels for 1,000 generations. Recurrent mutations are found across many replicates, suggesting that repeatability is robust to GxE interactions. Of course, the authors correctly identify that these results are dependent on many particulars, as is always the case in biology, but provide a comprehensive discussion to accompany their results. I do not have any major comments to give, but simply some suggestions and points of clarification.

      Major comments: N/A

      Minor comments:

      L19: I found the definition for compensatory evolution/mutations to be somewhat vague in the introduction (and subsequently throughout the text). It's clear that this was written for a more medical/physiological audience, but without a more explicit explanation of compensatory evolution/mutations, it became difficult to properly weigh some claims/discussions made by the authors later on. Do you define compensatory mutations as those which completely recover WT function/fitness, or are simply of opposite effect to the altered genotype? Others define "compensatory evolution" as simply any epistastically interacting amino acid substitutions (Ivankov et al, 2014). It would be nice to see more explicitly defined.

      Along these lines, I would have liked to see a more direct comparison/discussion of the degree to which deletion lines recovered. I can see from Fig 2E and Fig S2B that fitness increased quite a bit; would it not be possible to include a figure on the degree of compensation (basically relative fitness of evolved deletion lines - relative fitness of ancestral deletion lines)?

      L57: Another minor nitpick that just comes down to semantics. When discussing "96 parallel populations", it invokes a higher sense of replication than is actually present in the study. I would rephrase this to something along the lines of "12 replicate populations across 8 treatments under conditions of [...]".

      L185-187: The wording here needs to be clarified. Be explicit in that are examine the ratio (or count) of synonymous to non-synonymous mutations here, otherwise the interpretations appears to be direct contradiction to the (as written) results. Only after viewing the supplemental figure was I able to figure out what exactly was meant here.

      L349-350: The authors observe higher rates of adaptation in deletion lines than WT lines, and discuss this in adequate detail. Although not explicitly mentioned, this is consistent with a diminishing returns epistasis model (that could be beneficial to discuss, but is not necessary), which has been implicated in modulating the degree of repeatability observed along evolutionary trajectories (Wünsche et al. 2017). Although definitely not required for this already very nice manuscript, I think it would be very rewarding if the authors were to eventually analyze fine-scale dynamics of phenotypic and genomic adaptation to mine for these putative interactions and their influence on repeatability.

      Significance

      It is clear to me that a great deal of time and care has been put into this study and the preparation of this manuscript. The science and analyses are appropriate to answer the questions at hand, and it bodes well that whenever I had a question pop up while reading, they were typically answered immediately after. I think that this manuscript will be broadly relevant to both biologists both evolutionary and clinical, and was written in a way to be accessible to both.

      As someone with an expertise in repeatable evolution, I felt most excited by the observation of so many parallel substitutions at a single amino acid across deletion lines. As the authors rightfully point out in the results and discussion, it's likely that this degree of robustness is highly dependent on the particular mechanism of disruption that cells experience. The authors then go above and beyond to functionally validate the putative molecular mechanisms of (repeatable) adaptation in this system. While it may not always be possible to accomplish in non-model organisms, such multi-modal approaches will be crucial to advance the field of repeatable evolution.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of "Compensatory evolution to DNA replication stress is robust to nutrient availability" from Natalino and Fumasoni.

      The paper addresses the effect of sugar availability in shaping compensatory evolution. The first observation of the paper is that cell physiology changes by modulating glucose availability also in strains that come with defective DNA replication (ctf4-null previously studied by the authors). An intriguing result is that ctf4-null grows comparatively better in low concentrations of glucose. This is hypothesized to be a consequence of both the decrease in dNTPs in low glucose, which causes slow down of fork progression, and/or reduced fork collapse at rDNA locus. Hence, wild types and ctf4-null show an opposite trend: in the mutant, the lowest concentration of glucose is the least affected by the mutation; in wild type, the highest concentration is the least affected. Adaptation rate is inversely related with the initial fitness.

      The effect on physiology and adaptation rate is a starting point for asking the key question: are evolutionary trajectories influnced by the growth conditions? The answer is negative: evolution experiments show the very same core of genetic changes at all sugar concentrations. The result is apparently at odds with previous publications, and the authors conclude that, in this particular setting, availability of carbon sources plays a minor role compared to impaired DNA replication. The different rates of adaptation in WT and mutant is rather explained by the initial fitness at the different glucose concentrations, which, as mentioned, is opposite in WT and ctf4-null mutants.

      The paper also reports a new mutation in MED14, component of the transcription mediator complex, which rescues the lack of Ctf4 activity. The study is interesting and asks a relevant question. The experiments are well executed and convincing, but the paper can be strengthened by testing some of the hypotheses which are put forward.

      Main points

      1. The raw data for evolutionary dynamics (Figure S2C) are fitted with the power law suggested by Wiser and Lenski, and return different values of the parameter 'b'. The authors say that the result depends greatly on the initial conditions ("due to the varying initial fitness of ctf4Δ cells across different glucose environments, they display an opposite trend to WT"). Around the initial values, however, the curves are non-monotonic, especially for low glucose availability. Both for WT and ctf4-null there is an initial drop in fitness, after which fitness increases. If one would neglect this initial dynamics, the value of the parameter 'b' would likely be different. In general, one can question whether curves with this shape are best fitted by<br /> the power law proposed by Wiser and Lenski. For example, for the WT 0.25% glucose the linear fit gives a better R2 (why do theauthors show the linear fit anyway?). This impression is further reinforced by the observation that Wiser and Lenski fit dynamics that last 50.000 generation, here the curves last 1/50th of it. In conclusion, I would question whether the parameter 'b' is a solidmeasurement of 'rate of adaptation'. Also, normalizations makes it difficult to appreciate the result shown in Figure 2B.

      I think the authors should look for a different way to show the different trend in adaptation dynamics for different glucose concentrations between wild types and mutants. For example, they could move Figure S2C in the main text to stress the result shown in Figure 2C, which already shows the difference between WT and mutant. This is especially true if what Figure 2C shows is (evo-anc)/evo. This is not fully clear to me: in the legend it refers to the delta, in the label of the y-axis I read that this is a percentage.

      To conclude: the data show a different trend between wild types and mutants, which is interesting. Fitting it with the power law seems to be neither required nor appropriate. I suggest the authors to show the WT vs mutant pattern differently.<br /> 2. In Figure S2C, the individual trajectories for WT at 2% glucose are strangely variable. In this case, plotting the average does not make too much sense. This result is strange, since this is the default condition, where cells are grown without any change of sugar concentration. Can the authors give any rationale? Are there other available results to replace those published in Figure S2C?<br /> 3. The molecular explanation given for the rescue of ctf4-null proposes a very relevant role for dNTPs downregulation. Particularly, both for Irx1 and med14-H919P, the authors propose that this happens via Rnr1 downregulation.

      At this stage, this is only a hypothesis. The molecular verification of the central role of Rnr1 downregulation would make the conclusion much stronger. For example, a preliminary test would imply that duplicating RNR1 in ctf4-null irx1-null and/or ctf4-null med14-H919P would revert the rescue. Any other experiment addressing this point would be useful to improve the paper.<br /> 4. The authors propose from Figure S4B that the rescue of ixr1-null is less evident at low sugar concentration since both conditions trigger a reduction of dNTPs. I think this is interesting, since it would provide a link between glucose concentration and evolutionary trajectories to adaptation, which is what the authors wanted to study.

      In particular, one would predict that 0.25% glucose would see less ixr1-null than the other glucose conditions. I could not (was not able to) confute this hypothesis from the data shown in the paper. Likewise, for med14-H919P. If the authors have not tested it, it would be worth trying.<br /> 5. The combination of the four genetic adaptation (Fig 6B) would benefit from an experimental verification to show that the different solutions are not mutually exclusive. This is not obvious: if more than one solution acts by reducing dNTPs, maybe their combined effect is less strong than what measured theoretically. The authors could derive some clones at the end of the experiment and Sanger sequencing some of the four genes, to confirm the co-presence of some of them in the same cell.

      Minor points

      Figures

      • S4B: in the legend it should be explained that it is compared to ctf4D .
      • 2A: the color code is not fully clear to me: what does green and blue indicate? higher and lower than 2%?
      • S3A: the authors should show the statistical difference between WT and ctf4-null, which is mentioned as non-existent in p.6

      Text

      • RNR1 is not really the gene with the highest score in Figure 5D, not even close: can you give a rationale for pin-pointing it (see also main point 3)?
      • The med14-H919P mutation is observed in 22/48 wells. I guess the authors checked already: are somee of these wells<br /> close to each other in the plate?
      • Compensatory evolution of ctf4-null in 2% glucose is the experiment published by Fumasoni and Murray in eLife. In that paper,<br /> there is no trace of mutations in MED14. I think the authors should comment on this (different method for detecting<br /> putative compensatory mutations?).
      • I may be mistaken, but Szamecz et al do not actually investigate whether different conditions result in different<br /> evolutionary trajectories (i.e., different genetics), and so their results may not be at odds with those presented here.

      typos

      p.18, line 564 preformed -> performed

      p. 6 line 189 with a strongly skew -> with a strong skew ?

      Significance

      This is a well-done paper that could be of interest for the community of evolutionary biologists, scientists working on metabolism and cell division. It addresses an interesting problem, how metabolism affects compensatory evolution. Among the strengths: experiments are well done, the results are novel, the cross-talk between metabolism and evolutionary repair is intriguing. Among the weaknesses, the fact that the molecular explanations for the observations are only hypothesized and not tested experimentally. This is where the authors could improve the manuscript.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study investigates the compensatory evolutionary response of Saccharomyces cerevisiae to DNA replication stress, focusing on the influence of genotype-environment interactions (GXE). The authors used a range of experimental conditions with varying nutrient levels to assess evolutionary outcomes under replication stress. Their genomic analysis reveals that while glucose levels affect initial adaptation rates, the genetics of adaptation remain robust across all nutritional environments.

      The research offers new insights into the adaptability of S. cerevisiae, emphasizing the role of the nutritional environment in evolutionary processes related to DNA replication stress. It identifies recurrent advantageous mutations under different macronutrient availabilities and uncovers a novel role for the RNA polymerase II mediator complex in adaptation to replication stress.

      Overall, this well-designed study adds to the growing recognition of the complexity and robustness of evolutionary responses to environmental stressors. It provides strong evidence that compensatory evolution to replication stress is robust across varying nutritional conditions. It both challenges and reinforces previous findings regarding the resilience of the yeast genetic interaction network to environmental perturbations. The detailed analysis of specific compensatory mutations and their fitness impacts across different conditions offers valuable insights into adaptive dynamics over 1000 generations, contributing a clear empirical framework for understanding how replication-associated stress shapes evolutionary outcomes in diverse environments. Based on the analysis:

      1. The conclusions are generally well-supported by the presented data. The evolution experiments and genomic analyses are robust and provide convincing evidence for the study's main claims. The authors took steps to eliminate bias, such as maintaining an adequate Ne, which, if not done, could have compromised their conclusions by affecting genetic drift and limiting the population's access to beneficial mutations.
      2. The figures are well-designed and easy to understand.
      3. The methodology is well-described and appears reproducible. The authors provide sufficient details on experimental procedures. Experimental replication is adequate, with multiple evolutionary lines.
      4. They also made efforts to validate their observations, such as the validation of mutations, the prediction of interactions in the Med14 structure, and its potential implication in gene regulation, as well as the analysis of the cumulative fitness benefit and the reconstruction of the quadruple mutant.

      There are, however, a few results that would benefit from further clarification:

      1. The experimental design is strong, offering a diverse range of conditions. However, the high glucose condition (8%) stands out as significantly different from the neutral 2% condition, both in range and margin, compared to the low glucose conditions (0.25-0.5%). While this mainly affects growth profiles and evolvability in the early generations, a brief explanation in the discussion would strengthen the conclusions. Specifically, addressing:

      a) The rationale behind selecting these particular glucose concentrations.

      b) How other glucose concentrations might influence the outcomes.<br /> Providing this additional context would enhance the reader's understanding of the experimental setup and its potential implications, while also offering insights into the broader applicability of the findings and possible directions for future research.<br /> 2. In the discussion section, a more explicit comparison with similar studies in other model organisms would help contextualize the findings within the broader field of evolutionary biology. While the results appear robust, it would be beneficial to explore how they align with or contrast to previous studies on DNA damage, particularly in bacteria or highly complex eukaryotes.

      Minor comments:

      1. The presentation of data in the figures is clear and informative. However, some figure legends could benefit from more detailed explanations. For example, although the statistical tests used are mentioned in the methods section, it would be helpful to also include them in the figure legends, such as in legend 1acde, as well as in all other figures.
      2. In terms of broader conclusions, here are a few suggestions, though they are, of course, optional:

      a) The study could benefit from exploring the potential trade-offs of adaptive mutations in the hypothetical return to environments without replication stress, at least theoretically. This would provide a more comprehensive understanding of the evolutionary constraints.

      b) A brief discussion of the potential limitations of using lab strains versus wild isolates of S. cerevisiae would offer valuable context for the generalizability of the findings.

      c) It would be valuable to present the differences in ploidy in the context of other studies, such as the nutrient-limitation hypothesis (e.g., 'The Evolutionary Advantage of Haploid Versus Diploid Microbes in Nutrient-Poor Environments' by Bessho, 2015), since, as previously demonstrated by the authors of this article that is being reviewed, ploidy may influence the evolutionary trajectories of DNA repair. Interrelating these three terms: nutrient-limitation, ploidy, and DNA repair could be an interesting avenue to explore in the discussion.<br /> 3. Specific details:

      a) Line 116: To improve clarity, it would be beneficial to refer to the figure right after the statement: 'However, their relative fitness improved compared to the WT reference as the initial glucose levels (Figure X).'

      b) Line 404: The statement about antibiotics and cancer progression is somewhat brief here; it might be helpful to provide more context on why this mechanism influences these processes (here or before).

      c) Line 418: "were re-suspended in water containing zymolyase (Zymo Research, Irvine, CA, US, 0.025 μ/μL), incubated at". Something is missing in the units.

      d) Line 459: "and G2 phases for each genotype was estimated by deriving the the relative cell distribution". The article "the" is repeated.

      e) Fig. 1a: The x-axis ticks appear misaligned, which makes it difficult to interpret the boxplots. For example, at 0.25, the tick is closer to the orange boxplot than to the black one. In contrast, at 2%, the tick seems well-centered."

      f) Figure 3 could benefit from a general legend at the top regarding the colors, as finding it in 2c was not intuitively easy.

      g) I didn't review the code on GitHub.

      Significance

      The main strength of the study is that it shows robustness of compensatory evolution across varying nutrient conditions. The study adds to the growing body of literature on DNA replication stress and evolutionary adaptation by showing that compensatory evolution can occur regardless of nutrient availability. This fundamental finding challenges prior assumptions that nutrient conditions significantly alter evolutionary outcomes, contributing to a more nuanced understanding of how cells respond to stress. Furthermore, the discovery of the RNA polymerase II mediator complex's role in this process is particularly novel and opens new lines of investigation.

      Advance in the field: The results advance our understanding of evolutionary biology, particularly in the context of DNA replication stress and compensatory evolution. The study demonstrates that evolutionary repair mechanisms are predictable, even under variable environmental conditions, which has key implications for evolutionary biology and therapeutic applications.

      Audience:

      This paper will be of interest to a specialized audience in evolutionary biology, genomics, and cell biology, particularly those interested in DNA replication stress and adaptive evolution. Researchers studying stress responses in model organisms, such as S. cerevisiae, will find the findings valuable, as will those working in applied fields where stress adaptation is a critical factor (e.g., industrial yeast fermentation, drug development, disease resistance, cancer research, or aging studies).

      Expertise:

      Evolutionary biology, genomic analysis, and cellular stress responses, with a particular focus on experimental evolution under DNA damage stress in Saccharomyces cerevisiae. Recently graduated and beginner reviewer.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Wasilewska and colleagues generated tmbim5-/- zebrafish line and demonstrated that tmbim5 loss of function leads to decrease in zebrafish size and induces muscle atrophy. Authors used immunohistochemistry to suggest that tmbim5-/- zebrafish shows reduced glycogen levels in muscle and liver. However, most of the immunohistochemistry is not quantitated and only qualitative differences are shown. Next, the authors measured mitochondrial calcium levels in the brain of tmbim5-/- zebrafish but there was no behavioral phenotype in the fish. It would have be better to measure mitochondrial calcium levels in the muscles of tmbim5-/- zebrafish as phenotype is muscle atrophy. Further, it is reported that the mitochondrial membrane potential and glycogen levels were perturbed in tmbim5-/- zebrafish.

      Next, the authors generated a scl8b1-/- (a probable NCLX ortholog in zebrafish) zebrafish, which did not show any drastic phenotype. However, neither slc8b1 function nor the phenotype of scl8b1-/- zebrafish was well characterized. Further, authors created two double knockout zebrafish lines i.e. tmbim5-/-/mcu-/- and tmbim5-/-/slc8b1-/-. Interestingly, both these lines were viable and do not show any drastic phenotypes. The authors concluded that in these transgenic fishes compensatory and/or alternative mitochondrial Ca2+ mobilization pathways counterbalance the effects of silencing of these proteins.

      Although it is an interesting study, the conclusions are not well supported with the data. At several places only qualitative images are shown and quantitative data is missing. Similarly, Ca2+ imaging in muscles of tmbim5-/- zebrafish is not performed. Finally, no molecular mechanism or molecular details are provided. Though Tmbim5's potential role in EMRE degradation is discussed, it is not experimentally investigated. The quality of the manuscript would significantly enhance if authors perform the suggested experiments.

      Major Comments:

      1. As a potential mechanism, Tmbim5's potential role in EMRE degradation is discussed but it is not experimentally investigated. It is very easy to test this hypothesis. If this is the case, it would be a very good contribution to the field.
      2. On Page 16, authors state that slc8b1 does not constitutes the major mitochondrial Ca2+ efflux transport system. Authors should do calcium imaging experiments just like they did with tmbim5 and mcu double knockouts (data presented in Figure 4C) to make any comments on functioning of slc8b1 in mitochondrial Ca2+ transport. This is important because slc8b1 is only a predictive ortholog of human NCLX and it is not experimentally examined yet.
      3. The data presented in Fig. 4C is very important but it is not fully explained and discussed in the results. Please discuss all the data sets presented in Fig4C in detail. As such, it is very difficult to follow and interpret the data.
      4. In tmbim5-/- zebrafish, what happens to mitochondrial Ca2+ signaling in muscle as phenotype is muscle atrophy only?
      5. Please validate the observation of decreased glycogen levels in tmbim5-/- fish by one more way. Only immunohistochemistry that too without quantitation is not convincing (Fig. 2E-H).

      Minor Comments:

      1. Authors state that tmbim5 loss of function leads to metabolic changes but the only data provided is decrease in glycogen levels. It would be helpful for the authors to focus comments specifically on the data presented in the manuscript to avoid potential over-interpretation.
      2. While discussing Fig4., authors mention that Tmbim5 may act as a MCU independent Ca2+ uptake mechanism and therefore they crossed tmbim5 mutants with mcu KO fish. But from the data presented in Fig.3 and as concluded by the authors themselves tmbim5 mutants do not show changes in the mitochondrial Ca2+ levels. Authors may clarify this point.
      3. Does tmbim5 contributes to mitochondrial Ca2+ uptake in presence or along with MCU. Further analysis of Fig4C may shed some light on this. Authors should test significance between tmbim5-/- and WT as well as between tmbim5-/- and tmbim5+/+ in mcu-/- background.
      4. Please check the labeling on traces in Fig3D.
      5. Please include quantitation of data presented in EV2E-F.
      6. Please include quantitation of immunohistochemistry data presented in 2E-H.

      Referee cross-commenting

      Several comments are common between the reviewers highlighting that those experiments are critical. Secondly, I agree with the concerns raised by other two reviewers.

      Significance

      In this study, authors report couple of new transgenic zebrafish lines. However, further characterization of slc8b1-/- is required. This study reinforces the existing idea that there are very robust compensatory mechanisms that maintain mitochondrial Ca2+ homeostasis. While the work provides useful insights, it could benefit from a broader scope to provide substantial advancement to existing knowledge.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The work of Wasilewska et al. focusses on the MCU independent basal Ca2+ uptake mechanisms and the effects of MCU, NCLX, and TMBIM5 KO on Zebrafish Ca2+ homeostasis, mortality, anatomy and metabolism. The authors found evidence that tmbim5 potentially has a bidirectional mode of operation and is able to extrude Ca2+ from the matrix as well as transfer Ca2+ into mitochondria. Further, a reduced membrane potential in tmbim5-/- fish and altered metabolism was found. While the conclusion drawn are well argumented, a few points have to be addressed.

      Major Points:

      1. While all mitochondrial genes seem collectively reduced compared to control, it would be interesting to assess the mitochondrial mass and/or mitochondrial turnover rate in regard to e.g. mitophagy. The reduced membrane potential could lead to PINK1 accumulation on the outer mitochondrial membrane to mediate mitophagy leading overall to reduced mitochondrial count and mass.
      2. The characterization of slc8b1-KO fish needs some improvement to facilitate a better understanding of the molecular interactions of slc8b1 and tmbim5. This would also greatly improve the understanding of the phenotypical characterization and behavioral response to CGP.
      3. Functional Ca2+ measurements of the activity of slc8b1 gene product have to be done to ensure a KO phenotype. Especially in light of the surprising results presented in Figure 6A showing an effect of CGP on slc8b1-KO fish but not on tmbim5-KO fish I advise mitochondrial isolation to conduct mitochondrial basal and extrusion Ca2+experiments of slc8b1-KO fish, tmbim5-KO fish, and double KO-fish.

      Minor Points:

      The authors claim that mRNA levels of mitochondrial proteins involved in Ca2+ transport in tmbim5-/- are unaffected (Figure EV3). While the T-tests show no significant alteration, what happens if a 2-way ANOVA shows a more general effect revealed between WT and TMBIM5-/-?

      Significance

      This is a well-designed and carefully executed piece of work. The experimental design is thoughtfully elaborated, and the topic is worthy of investigation. The strengths of this study lie in translating our knowledge of TMBIN5 from single cells to organism and organ function. Moreover, the work provides important new information that will help the scientific community working on mitochondrial regulation AND muscle diseases to understand how ions coordinately regulate mitochondrial function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Although the experimental approach is promising (see below), the results do not significantly expand our current understanding. This is partly due to the challenges of interpreting negative results, which are nonetheless worth reporting. Some of the conclusions and interpretations of the results could benefit from further clarification and contextualization to enhance their impact:

      • Figure 1D: The distribution of fiber size in wt vs. Tmbim5-ko fish shows a notable difference limited to one size range. Can the authors clarify this observation? Could this indicate a switch in fiber type? Is there a correlation between this finding and the differential PAS staining?
      • Figure 3: one of the advantages of the zebrafish model is its transparency, allowing for fluorescence imaging. Unfortunately, this proves to be impossible in the case of cepia2mt. The data provided by the authors show that the fluorescence of this probe does not vary following physiological stimuli. The only change is that induced by CCCP (Fig 3C-D), which according to the authors causes a discharge of mitochondrial calcium. However, the use of CCCP with GFP-based probes should be avoided, as the acidification caused by CCCP treatment leads to quenching of the fluorophore, resulting in a fluorescence decrease which is independent of Ca2+ levels. Although the experimental approach aims to detect dynamic changes in mitochondrial Ca2+ levels, the presented results in Figure 3 do not provide conclusive evidence to support this capability. While significant experimental effort is evident, these findings may require further validation or additional data to strengthen their impact. Alternatively, the authors could remove this Figure 3 and relevant text from the manuscript.
      • Figure 6A: In my opinion, this dataset is impossible to understand. To my knowledge, the precise molecular target of CGP-37157 remains elusive. While CGP is often considered an NCLX inhibitor, this classification lacks definitive experimental support. Although CGP is known to inhibit mitochondrial Na+-dependent Ca2+ extrusion, direct binding of CGP to NCLX has yet to be conclusively demonstrated. With this in mind, the authors show that pharmacological intervention with CGP elicits a distinct phenotype in the fish model. While this effect appears to persist in SLC8B1-KO fish, it is absent in Tmbim5-KO fish, suggesting Tmbim5 as a potential molecular target for CGP. However, this interpretation is inconsistent with the following observations: i) CGP remains effective in Tmbim5/Slc8b1 double-KO fish and ii) Tmbim5-KO fish exhibit no discernible phenotype. A comprehensive explanation that reconciles these findings is sought.
      • Figure 6B: according to the authors, the phenotype induced by CGP treatment is specific because a different substance with a completely different effect, CCCP, causes the same phenotype in both wt and Tmbim5-KO fish. Also in this case, the rationale and reasoning behind this experiment in not very evident. As I see it, CCCP blocks zebrafish motility because it is a metabolic poison, and its effect does not depend on any transporter.

      Significance

      The manuscript submitted by Wasilewska et al investigates the functional relationship between different mitochondrial calcium transporters using zebrafish as a model. The topic is of great interest. In the last 15 years, many mitochondrial calcium transporters have been identified. In some cases, their mechanism is not fully understood, such as in the case of TMBIM5, recently described by some as an H/Ca exchanger, or as a Ca channel by others. Furthermore, the functional relationship between different transporters has so far been studied in a partial and superficial way. I believe that this work is therefore of great interest because it aims to contribute to a fundamental problem that is still poorly studied. The idea of using zebrafish is interesting, as it is an organism that is easy to manipulate and phenotype, and because it is transparent, making it possible to use specific biosensors to characterize mitochondrial calcium dynamics, at least in principle. The paper therefore deserves attention.

    5. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    6. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Wasilewska and colleagues generated tmbim5-/- zebrafish line and demonstrated that tmbim5 loss of function leads to decrease in zebrafish size and induces muscle atrophy. Authors used immunohistochemistry to suggest that tmbim5-/- zebrafish shows reduced glycogen levels in muscle and liver. However, most of the immunohistochemistry is not quantitated and only qualitative differences are shown. Next, the authors measured mitochondrial calcium levels in the brain of tmbim5-/- zebrafish but there was no behavioral phenotype in the fish. It would have be better to measure mitochondrial calcium levels in the muscles of tmbim5-/- zebrafish as phenotype is muscle atrophy. Further, it is reported that the mitochondrial membrane potential and glycogen levels were perturbed in tmbim5-/- zebrafish.

      Next, the authors generated a scl8b1-/- (a probable NCLX ortholog in zebrafish) zebrafish, which did not show any drastic phenotype. However, neither slc8b1 function nor the phenotype of scl8b1-/- zebrafish was well characterized. Further, authors created two double knockout zebrafish lines i.e. tmbim5-/-/mcu-/- and tmbim5-/-/slc8b1-/-. Interestingly, both these lines were viable and do not show any drastic phenotypes. The authors concluded that in these transgenic fishes compensatory and/or alternative mitochondrial Ca2+ mobilization pathways counterbalance the effects of silencing of these proteins.

      Although it is an interesting study, the conclusions are not well supported with the data. At several places only qualitative images are shown and quantitative data is missing. Similarly, Ca2+ imaging in muscles of tmbim5-/- zebrafish is not performed. Finally, no molecular mechanism or molecular details are provided. Though Tmbim5's potential role in EMRE degradation is discussed, it is not experimentally investigated. The quality of the manuscript would significantly enhance if authors perform the suggested experiments.

      Major Comments:

      1. As a potential mechanism, Tmbim5's potential role in EMRE degradation is discussed but it is not experimentally investigated. It is very easy to test this hypothesis. If this is the case, it would be a very good contribution to the field.
      2. On Page 16, authors state that slc8b1 does not constitutes the major mitochondrial Ca2+ efflux transport system. Authors should do calcium imaging experiments just like they did with tmbim5 and mcu double knockouts (data presented in Figure 4C) to make any comments on functioning of slc8b1 in mitochondrial Ca2+ transport. This is important because slc8b1 is only a predictive ortholog of human NCLX and it is not experimentally examined yet.
      3. The data presented in Fig. 4C is very important but it is not fully explained and discussed in the results. Please discuss all the data sets presented in Fig4C in detail. As such, it is very difficult to follow and interpret the data.
      4. In tmbim5-/- zebrafish, what happens to mitochondrial Ca2+ signaling in muscle as phenotype is muscle atrophy only?
      5. Please validate the observation of decreased glycogen levels in tmbim5-/- fish by one more way. Only immunohistochemistry that too without quantitation is not convincing (Fig. 2E-H).

      Minor Comments:

      1. Authors state that tmbim5 loss of function leads to metabolic changes but the only data provided is decrease in glycogen levels. It would be helpful for the authors to focus comments specifically on the data presented in the manuscript to avoid potential over-interpretation.
      2. While discussing Fig4., authors mention that Tmbim5 may act as a MCU independent Ca2+ uptake mechanism and therefore they crossed tmbim5 mutants with mcu KO fish. But from the data presented in Fig.3 and as concluded by the authors themselves tmbim5 mutants do not show changes in the mitochondrial Ca2+ levels. Authors may clarify this point.
      3. Does tmbim5 contributes to mitochondrial Ca2+ uptake in presence or along with MCU. Further analysis of Fig4C may shed some light on this. Authors should test significance between tmbim5-/- and WT as well as between tmbim5-/- and tmbim5+/+ in mcu-/- background.
      4. Please check the labeling on traces in Fig3D.
      5. Please include quantitation of data presented in EV2E-F.
      6. Please include quantitation of immunohistochemistry data presented in 2E-H.

      Referee cross-commenting

      Several comments are common between the reviewers highlighting that those experiments are critical. Secondly, I agree with the concerns raised by other two reviewers.

      Significance

      In this study, authors report couple of new transgenic zebrafish lines. However, further characterization of slc8b1-/- is required. This study reinforces the existing idea that there are very robust compensatory mechanisms that maintain mitochondrial Ca2+ homeostasis. While the work provides useful insights, it could benefit from a broader scope to provide substantial advancement to existing knowledge.

    7. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The work of Wasilewska et al. focusses on the MCU independent basal Ca2+ uptake mechanisms and the effects of MCU, NCLX, and TMBIM5 KO on Zebrafish Ca2+ homeostasis, mortality, anatomy and metabolism. The authors found evidence that tmbim5 potentially has a bidirectional mode of operation and is able to extrude Ca2+ from the matrix as well as transfer Ca2+ into mitochondria. Further, a reduced membrane potential in tmbim5-/- fish and altered metabolism was found. While the conclusion drawn are well argumented, a few points have to be addressed.

      Major Points:

      1. While all mitochondrial genes seem collectively reduced compared to control, it would be interesting to assess the mitochondrial mass and/or mitochondrial turnover rate in regard to e.g. mitophagy. The reduced membrane potential could lead to PINK1 accumulation on the outer mitochondrial membrane to mediate mitophagy leading overall to reduced mitochondrial count and mass.
      2. The characterization of slc8b1-KO fish needs some improvement to facilitate a better understanding of the molecular interactions of slc8b1 and tmbim5. This would also greatly improve the understanding of the phenotypical characterization and behavioral response to CGP.
      3. Functional Ca2+ measurements of the activity of slc8b1 gene product have to be done to ensure a KO phenotype. Especially in light of the surprising results presented in Figure 6A showing an effect of CGP on slc8b1-KO fish but not on tmbim5-KO fish I advise mitochondrial isolation to conduct mitochondrial basal and extrusion Ca2+experiments of slc8b1-KO fish, tmbim5-KO fish, and double KO-fish.

      Minor Points:

      The authors claim that mRNA levels of mitochondrial proteins involved in Ca2+ transport in tmbim5-/- are unaffected (Figure EV3). While the T-tests show no significant alteration, what happens if a 2-way ANOVA shows a more general effect revealed between WT and TMBIM5-/-?

      Significance

      This is a well-designed and carefully executed piece of work. The experimental design is thoughtfully elaborated, and the topic is worthy of investigation. The strengths of this study lie in translating our knowledge of TMBIN5 from single cells to organism and organ function. Moreover, the work provides important new information that will help the scientific community working on mitochondrial regulation AND muscle diseases to understand how ions coordinately regulate mitochondrial function.

    8. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Although the experimental approach is promising (see below), the results do not significantly expand our current understanding. This is partly due to the challenges of interpreting negative results, which are nonetheless worth reporting. Some of the conclusions and interpretations of the results could benefit from further clarification and contextualization to enhance their impact:

      • Figure 1D: The distribution of fiber size in wt vs. Tmbim5-ko fish shows a notable difference limited to one size range. Can the authors clarify this observation? Could this indicate a switch in fiber type? Is there a correlation between this finding and the differential PAS staining?
      • Figure 3: one of the advantages of the zebrafish model is its transparency, allowing for fluorescence imaging. Unfortunately, this proves to be impossible in the case of cepia2mt. The data provided by the authors show that the fluorescence of this probe does not vary following physiological stimuli. The only change is that induced by CCCP (Fig 3C-D), which according to the authors causes a discharge of mitochondrial calcium. However, the use of CCCP with GFP-based probes should be avoided, as the acidification caused by CCCP treatment leads to quenching of the fluorophore, resulting in a fluorescence decrease which is independent of Ca2+ levels. Although the experimental approach aims to detect dynamic changes in mitochondrial Ca2+ levels, the presented results in Figure 3 do not provide conclusive evidence to support this capability. While significant experimental effort is evident, these findings may require further validation or additional data to strengthen their impact. Alternatively, the authors could remove this Figure 3 and relevant text from the manuscript.
      • Figure 6A: In my opinion, this dataset is impossible to understand. To my knowledge, the precise molecular target of CGP-37157 remains elusive. While CGP is often considered an NCLX inhibitor, this classification lacks definitive experimental support. Although CGP is known to inhibit mitochondrial Na+-dependent Ca2+ extrusion, direct binding of CGP to NCLX has yet to be conclusively demonstrated. With this in mind, the authors show that pharmacological intervention with CGP elicits a distinct phenotype in the fish model. While this effect appears to persist in SLC8B1-KO fish, it is absent in Tmbim5-KO fish, suggesting Tmbim5 as a potential molecular target for CGP. However, this interpretation is inconsistent with the following observations: i) CGP remains effective in Tmbim5/Slc8b1 double-KO fish and ii) Tmbim5-KO fish exhibit no discernible phenotype. A comprehensive explanation that reconciles these findings is sought.
      • Figure 6B: according to the authors, the phenotype induced by CGP treatment is specific because a different substance with a completely different effect, CCCP, causes the same phenotype in both wt and Tmbim5-KO fish. Also in this case, the rationale and reasoning behind this experiment in not very evident. As I see it, CCCP blocks zebrafish motility because it is a metabolic poison, and its effect does not depend on any transporter.

      Significance

      The manuscript submitted by Wasilewska et al investigates the functional relationship between different mitochondrial calcium transporters using zebrafish as a model. The topic is of great interest. In the last 15 years, many mitochondrial calcium transporters have been identified. In some cases, their mechanism is not fully understood, such as in the case of TMBIM5, recently described by some as an H/Ca exchanger, or as a Ca channel by others. Furthermore, the functional relationship between different transporters has so far been studied in a partial and superficial way. I believe that this work is therefore of great interest because it aims to contribute to a fundamental problem that is still poorly studied. The idea of using zebrafish is interesting, as it is an organism that is easy to manipulate and phenotype, and because it is transparent, making it possible to use specific biosensors to characterize mitochondrial calcium dynamics, at least in principle. The paper therefore deserves attention.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The paper nicely shows that PP2A antagonizes Crb-dependent and Crb-independent phosphorylation and degradation of Expanded (Ex), in cell culture and in wing discs. The authors focus on the Mts catalytic subunit of PP2A, but also demonstrate the involvement of the Wrd and Tws B regulatory subunits. They also show via use of transcriptional reporters that PP2A directly affects Hpo signaling in vivo. Finally, they show a potential role for Merlin and Kibra in regulating Ex levels, and that Kib binds to Mts and Wrd. The experiments are on the whole well executed and quantified.

      Major comments:- (1) I am not convinced that the authors can entirely rule out a role for the STRIPAK complex. Mutation of MtsR268A reduces binding of Wrd by 60% and abrogates the effect of Mts on Ex. However mutation of MtsL186A reduces binding of Cka by less than 50% and doesn't disrupt Mts regulation of Ex. Perhaps Cka is more abundant than Wrd, and 50% of Mts/Cka complex is more than sufficient for it to carry out its enzymatic function.

      To further investigate whether PP2A can indeed stabilise Ex independently of the STRIPAK complex we will conduct the following experiments in response to the comments from Reviewers 1 and 3:

      • Test whether knocking down other components of the STRIPAK complex such as FGOP2 and Mob4 affects the ability of Mts to stabilise Ex degradation in the presence or absence of Crbintra in vitro using S2 cells. If we do observe any effect, we will also test whether knocking these components in the posterior compartment of the wing disc also has an effect on the Ex stability reporter levels.
      • The reviewers raised the point that the MtsL186A mutant results in 50% reduction in binding with Cka and that a 50% reduction in the Mts/Cka complex may still be sufficient to stabilise Ex levels. To address this, we will knock down either Wrd or Cka and test whether this affects the ability of MtsL186A to stabilise Ex both in the presence/absence of Crbintra. This will test whether the stabilisation of Ex by MtsL186A can be attributed to the function of the MtsL186A::Cka holoenzyme or the MtsL186A::Wrd holoenzyme. We will test this both in vitro and in vivo.

        I also note that in Fig 1H, Ex levels in Crb/Mts+Cka RNAi appear to be intermediate between those in Crb and Crb/Mts. Ideally this would be quantified. Similarly in 4J, mtsL186A (while not significant) appears intermediate between mtsH118N and mts-WT. What is the actual P value for the comparison to Mts-WT? In any case I would suggest the authors tone down these conclusions.

      We have now provided quantification for the blot in Fig. 1H (now Fig. 1I) in Fig. 1J. We will tone down our conclusions regarding the role of STRIPAK based on our results from the experiments detailed above.

      (2) I also found it rather confusing that the authors discuss the Cka B subunit in the context of the STRIPAK complex in Figure 1, then don't look at the other B subunits until Figures 3/4. In my opinion, it would be easier to follow the flow of the manuscript if the authors discussed Crb-dependent and independent regulation of Ex, then the roles of Gish/CKI, then the role of the B subunits including Cka. In this context, it would also be interesting to see if there was any redundancy between Cka and Wrd - have the authors tried any double knockdown experiments (with appropriate controls for RNAi dosage)?

      We thank the reviewer for their suggestion to potentially alter the order by which some of the results of the paper are presented. At the moment, we believe the current description of the results fits well with the observations and their significance, but we will assess this after the revisions are completed and, if required, we will change the order of the results to improve the clarity of the manuscript. To test for any redundancy between Cka and Wrd, we will undertake knock down both Cka and Wrd using S2 cells.

      (3) The authors examine Crb-independent Ex regulation in the wing disc, which appears to be wing discs that do not overexpress Crb. I would expect that wing discs do express Crb - or is this not the case? Please clarify whether this is in the absence of Crb, or the absence of overexpressed Crb.

      This is now clarified in the text Line 358.

      (4) I was confused by the section 'CKIs and Slmb regulate Ex proteostasis via the 452-457 Slmb consensus sequence'. The authors conclude that 'these results show that the machinery that facilitates Crb-mediated Ex phosphorylation and degradation is also partly involved in the Crb-independent regulation of Ex protein stability.' However, I had concluded the opposite, as it appeared that Slimb and gish RNAi only affected Ex1-468, and similarly Slmb only affected Ex1-468, but not Ex1-450 (which in the previous section was shown to be regulated by Mts independent of Crb). Please could the authors explain/clarify this.

      We have previously shown that, in the presence of Crbintra, Gish/Ck1α/Slmb act on Ex via the Ex452-457 aa sequence, which corresponds to a b-TrCP/Slmb consensus sequence (Fulford et al., 2019). In the absence of Crbintra, we observed that Gish/Ck1α/Slmb require the 452-457 site to be present to be able to phosphorylate and degrade Ex (i.e. the Ex1-450 truncation that lacks this site is refractory to the regulation by Gish/Ck1α/Slmb). This suggests that Gish/Ck1α/Slmb regulate Ex via the 452-457 site, both in absence and presence of Crbintra. We have now clarified this in the text: Lines 387-388 and Lines 405-406.

      (5) The regulation of Ex by Merlin and Kibra is potentially interesting, but a bit preliminary. This part of the manuscript could be strengthened by showing for example if Mts or Wrd knockdown affects the stabilization of Ex by Kib.

      As suggested by the reviewer we will further characterise the interaction between Kib and Mts in stabilising Ex. We will test whether Kib can stabilise Ex when either mts or wrd is knocked down. We will also test whether Kib can stabilise Ex in the absence of ectopic Crb expression in vivo and whether this is indeed dependent on the Wrd subunit.

      Minor comments: (1) The Introduction gives a quite comprehensive review of known interactions between STRIPAK, Expanded and Hippo pathway components. However, it is hard to keep track of all the components and interactions if you are not deeply into the field. To improve accessibility, I would suggest a summary diagram of the key interactions (currently the manuscript has no introductory figures at all!) and if possible the authors might consider whether there are details they could leave out or which could just be mentioned as necessary in the results sections.

      We have now added an introductory figure, Fig.1A, detailing the key elements of Hpo regulation that is pertinent for this study.

      (2) Could the authors show a shorter exposure of the Ex blot in Figure 1A, in order to better visualize the loss of band shift?

      A shorter exposure of the Ex blot has now been added to the Fig. 1B (previously Fig. 1A).

      (3) Line 307 '(Fig. 1B,D,G,I)' the call-out to Fig.1I appears to be in strike-through font, presumably because 1I shouldn't be cited here? It also looks like Fig.1I is wrongly cited on line 342 as that sentence only describes action of L168A in wing discs. I think a sentence describing the experiment in Fig.1I is missing?

      The Figures have now been cited appropriately. Fig. 1J (previously Fig. 1I) is now referred to in Line 336.

      (4) Line 355 ambiguous, should this read low expression of Crb in S2 cells?

      This has now been changed from extremely low expression to low expression.

      (5) Line 369 reads 'PP2A was able to stabilize full-length Ex', Mts-WT would be more precise.

      This has now been changed to MtsWT was able to stabilise full-length Ex.

      (6) The blot in panel 2O is mislabeled Ex1-468, I think this should be Ex1-450.

      The blot in panel 2O is now correctly labelled as Ex1-450.* *

      (7) The nomenclature of 'Mts-WT' for their own transgene and 'Mts-BL' for the Bloomington transgene. is confusing, as both are, I believe, wild type. Maybe leave this detail for the M&M, at least if the authors believe there is no difference in behavior.

      We are happy to change this if required.

      (8) Figure S6 appears to be missing from the uploaded version.

      We thank the reviewer for noticing this. Fig. S6 is now included in the supplementary figure file.

      (9) Lines 480-481: 'Using co-IP analyses, we observed that Mts interacts with Ex, both in the presence and absence of Crbintra.' No figure call-out is given for this statement, and I can't see the data anywhere, but from the figure legends it seems to be in the missing Fig.S6? And everything that follows in this paragraph should have call-outs for Fig.4K?

      Fig. S6 has now been appended and the call-outs to Fig. 4K have been added to in the paragraph Line 475-490.

      (10) Lines 503-504: 'we found that Kib associated with Mts (Fig. 5C)' - Fig.5B?

      This has now been changed.

      (11) Lines 504-505: 'no interaction was observed between Mts and Mer (Fig.5B)' - Fig.5C?

      This has now been changed.

      (12) In Figure 6G, authors note that 'the mean diap1GFP4.3 levels of MtsWT+Crb-Intra were lower than those of Crb-Intra, this difference was not statistically significant when all genotypes were included in the comparisons, but only when the Control, crbintra and mtsWT+crbintra conditions were considered.' It might be useful to have a table showing the actual P values of all the comparisons (or maybe better still just put actual P values on the graphs?). Sometimes an arbitrary cut-off of 0.05 for significant can be misleading.

      We have now added the actual p-values for those >0.05 to the graph.

      Reviewer #1 (Significance (Required)):

      The Hippo signaling pathway is a conserved regulator of tissue growth, and understanding how this pathway is activated and modulated is of great importance. Levels of the upstream activator Expanded are known to be regulated by phosphorylation/degradation, but whether dephosphorylation of Ex is important for growth control has not been widely investigated. This paper utilizes cell culture and the fruit fly model organism to provide clear evidence for a role for PP2A in regulation of Ex levels, independent of its known role in regulating phosphorylation of Hpo. It will therefore be of interest to biologists working in the fields of growth control and tissue homeostasis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors show that the protein phosphatase PP2A antagonizes Crb-mediated phosphorylation and subsequent degradation of Expanded in vivo. Using Drosophila imaginal wing discs and the GAL4-UAS system, the authors provide evidence that the PP2A holoenzyme dephosphorylates Ex, stabilizing its protein levels, in a manner independent of the STRIPAK complex and identifies Wrd as a key regulatory subunit of PP2A in this process. Importantly, the study also shows that PP2A stabilizes Ex protein levels independent of Crb-driven phosphorylation and that, via this stabilization, PP2A activates Hpo pathway signaling to repress transcriptional targets of Yki.

      Major comments: Overall, the study is strong, and the conclusions are supported by the data. The data does largely lean on overexpression models in the wing disc and it would strengthen the biological relevance to include genomic alleles (i.e., do Ex-GFP levels go down in PP2A/mts mutant clones?). Materials and methods are thoroughly presented, and statistical analyses are adequate. OPTIONAL: While not necessarily required for publication, note that full in vivo confirmation would require altering the PP2A target sites in Ex by generating phospho-deficient and phospho-mimetic versions and seeing if they match the model. This would push the conclusions to the highest degree of confidence and rigor.

      We agree with the reviewer and indeed have tried to undertake MARCM experiments with mts null mutant clones. However, since mts is an essential gene, even when MtsWT was expressed in the presence of mts mutant, we were only able to obtain few single cell clones, which was difficult to analyse. Hence, clonal analysis using mts mutant clones will not be feasible in this case. (see also revision plan for figure illustrating the data referred to here).

      Minor comments: Text and figures are clear and accurate. It may be helpful to include a modified version of the Mts mutants table in SF1 in a main figure for easier reference but is not necessary.

      If required, we can move the table to one of the main figures based on whether additional data will be presented in the revised manuscript.

      Reviewer #2 (Significance (Required)):

      The studies strengths include biochemical and in vivo validation of the effect of PP2A and its various regulatory subunits on Ex phosphorylation and stabilization. The study very methodically parses out the context in which PP2A is stabilizing Ex (i.e., both in the context of Crb stimuli and independently, and it does so independently of the STRIPAK complex). As noted previously, recapitulating the major results in clones using genomic alleles would strengthen the biological relevance. The study advances our understanding of mechanisms tightly controlling downstream transcriptional outputs of the Hpo pathway via regulating Ex protein stability/turnover. Though the primary audience may be those well-versed in the Hpo field and Drosophila genetics, the implications for the research are broad.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The authors hypothesized that Crb mediated Ex phosphorylation and degradation, that they previously established, should be countered and set on to identify the phosphatase involved. Surprisingly, they find that Mts, the catalytic subunit of PP2A, counters the effect of ectopically expressed intracellular domain of Crb on Ex stability. This was surprising because PP2A and the STRIPAK complex was shown to counter Hippo activity previously, suggesting that PP2A would inject both positive and negative inputs into Hpo activity. The title reflects this finding.

      Overall, the experiments are well controlled and are of high quality. I especially appreciate the effort to show results of parallel experiments both in S2 cells and in vivo in wing discs.

      The manuscript convincingly demonstrates that Mts expression stabilizes Ex1-468::GFP in the presence or absence of ectopic Crb-intra. This effect is mainly mediated by the Wrd adaptor subunit, and requires the catalytic activity of Mts. However, results shown in Fig4K highlights the Tws adaptor as the main one that binds to and stabilizes Ex in S2 cells, in the presence or absence of Crb-intra expression. This is slightly at odds with Wrd-RNAi experiments nicely reversing the effects of Crb-intra expression.

      We would like to highlight that results shown in Fig. 4K were obtained upon the transfection of HA-tagged Wrd/Tws and, hence, they are not necessarily indicative of the levels of binding between the endogenous Ex and the regulatory subunits. Additionally, we would argue that the Ex:Tws interaction is merely indicative of the steady state regulation of Ex, which occurs both in the presence and absence of Crbintra, thereby explaining why we can detect the interaction in both settings. As for Wrd, given that we have shown that it is involved in the regulation of Ex only in the presence of Crbintra and antagonises its effect on Ex protein stability, it is only interacting with Ex in conditions where Crbintra is affecting Ex protein levels.

      The manuscript is not easy to read given the vast amount of data using many different constructs, but there is little the authors can do about it as the story is complex and layered.

      The argument that the effects of Mts are independent of the STRIPAK complex is less convincing. This conclusion is based on Mts-L186A mutant which should not bind Cka which is the PP2A adaptor subunit found in the STRIPAK complex. Fig S3F and G show that Cka binding to Mts is reduced by half when Mts-L186A mutant is expressed in lieu wt Mts. Consistent with this in Fig1F rescue of Ex degradation by Mts-L186A is half as effective as the rescue seen in 1F by the wt Mts.

      We will conduct the experiments mentioned in the reply to Major comments 1 of Reviewer 1 to address this.

      Towards the same argument, data shown on S3A-D is deemed inconclusive based on quantification in S3E which does not reflect the clear reduction in Ex that is seen in S3B. Hence FigS3 is in favour of Cka4 being involved in the rescue effect.

      In Fig. S3 we show that expression of either Crbintra or MtsWT+Crbintra does not cause any changes in the levels of the Ex reporter when the crosses were raised at 18°C. Hence, we believe that in this setting, we are unable to fully study PP2A-mediated stabilisation of Ex in the presence of Crbintra. Cka RNAi causes dramatic effects on tissue growth at 18°C (where Crbintra cannot modulate Ex protein levels), and lethality prior to the late L3 stage (where Crbintra modulates Ex protein levels), and this precludes us from testing the role of Cka. However, the results shown with the Mts mutant that has reduced binding to the STRIPAK complex strongly suggest that Cka is not essential for the role of PP2A in regulating Ex protein levels.

      In Figures 5A and 3A, Crb-intra expression does not destabilize Ex1-468::GFP, why is that?

      This is due to the expression levels of Crbintra in this particular biological repeat of the experiment. We will repeat this experiment to obtain a more representative image of the effect of Crbintra.

      The authors connect effects on Ex stability to the influence on Hippo pathway activity in Fig 6, which is a very nice touch.

      Finally, I wonder whether the dual effect of PP2A on Hippo activity (inhibiting Hippo and stabilizing Ex) could be a single effect. I am guessing the Ex1-468::GFP construct, having its own regulatory elements, would act independently of the transcriptional activity of Hippo. However, I was not able to find this demonstrated in the literature. Can the authors show that? For example, make hpo or wts mutant clones in the presence of the Ex1-468::GFP construct. Otherwise, an alternative explanation could be that PP2A, with its various adaptor subunits, counters Hippo activity which translates into higher levels of expanded transcription and Ex protein production.

      Since the reporter is under the control of the ubiquitin 63E promoter as opposed to the endogenous promoter, we do not envisage that its transcription is regulated by Yki. Indeed, a similar method of decoupling potential transcriptional and post-translational effects of Hpo signalling has been successfully used in studies that have focused on other Hpo pathway components, such as Kibra (Tokamov et al., 2021) and Salvador (Aerne et al., 2015). The reviewer suggests that we should assess the effect of hpo or wts mutant clones and determine of these affect the levels of the ubi-Ex1-468::GFP reporter. However, we believe this may lead to results that will be difficult to interpret. Although hpo or wts clones are expected to result in higher Yki activity, they will also remove Hpo or Wts function, and these proteins may be involved in the molecular mechanisms that regulate Ex protein stability. Therefore, as an alternative approach to assess the impact of Hpo signalling on the Ex reporter, we will perform RT-PCR experiments to monitor the transcriptional regulation of the transgenic reporter in the presence or absence of Yki overexpression.

      It was also demonstrated that there are higher levels of Crb in hippo mutants likely due to the expansion of the apical domain. This would be consistent with the stabilized Crb-intra seen in Figures 1A&3A upon Mts expression. Stabilization of Crb upon Mts expression (not commented on in the manuscript) is very interesting as extra Crb should further push the balance towards Ex degradation but Mts seems to be able to reverse the effect. I agree that this alternative explanation may be far-fetched, yet it is also easily tested, and would greatly simplify the model put forward.

      The reviewer suggests that Mts may potentially be involved in regulating Crbintra levels. To test this, we will test whether overexpression of various doses of either MtsWT or MtsH118N affects the stability of Crbintra using S2 cells.

      Finally, if indeed various PP2A complexes, depending on the adaptor subunits they contain, have a range of effects on Ex stability and Hippo pathway activity, this brings in the question of what regulates the availability of various adaptor subunits and the PP2A complexes they form? The question is outside the scope of the manuscript but it is worth discussing.

      We agree with the reviewer that this is a crucial question. However, tackling this experimentally would be challenging at this stage and we believe this is beyond the scope of the current manuscript. However, we will address this point in the discussion of the revised manuscript.

      Reviewer #3 (Significance (Required)):

      A vast amount of data is presented in both in vivo and in vitro settings. The study uses biochemical and genetic approaches and combines them aptly.

      I think the findings showing multiple and various effects on PP2A on the same pathway would be of higher interest to the PP2A enthusiasts than the Hippo researchers.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors hypothesized that Crb mediated Ex phosphorylation and degradation, that they previously established, should be countered and set on to identify the phosphatase involved. Surprisingly, they find that Mts, the catalytic subunit of PP2A, counters the effect of ectopically expressed intracellular domain of Crb on Ex stability. This was surprising because PP2A and the STRIPAK complex was shown to counter Hippo activity previously, suggesting that PP2A would inject both positive and negative inputs into Hpo activity. The title reflects this finding.

      Overall, the experiments are well controlled and are of high quality. I especially appreciate the effort to show results of parallel experiments both in S2 cells and in vivo in wing discs.

      The manuscript convincingly demonstrates that Mts expression stabilizes Ex1-468::GFP in the presence or absence of ectopic Crb-intra. This effect is mainly mediated by the Wrd adaptor subunit, and requires the catalytic activity of Mts. However, results shown in Fig4K highlights the Tws adaptor as the main one that binds to and stabilizes Ex in S2 cells, in the presence or absence of Crb-intra expression. This is slightly at odds with Wrd-RNAi experiments nicely reversing the effects of Crb-intra expression.

      The manuscript is not easy to read given the vast amount of data using many different constructs, but there is little the authors can do about it as the story is complex and layered.

      The argument that the effects of Mts are independent of the STRIPAK complex is less convincing. This conclusion is based on Mts-L186A mutant which should not bind Cka which is the PP2A adaptor subunit found in the STRIPAK complex. Fig S3F and G show that Cka binding to Mts is reduced by half when Mts-L186A mutant is expressed in lieu wt Mts. Consistent with this in Fig1F rescue of Ex degradation by Mts-L186A is half as effective as the rescue seen in 1F by the wt Mts. Towards the same argument, data shown on S3A-D is deemed inconclusive based on quantification in S3E which does not reflect the clear reduction in Ex that is seen in S3B. Hence FigS3 is in favour of Cka4 being involved in the rescue effect.

      In Figures 5A and 3A, Crb-intra expression does not destabilize Ex1-468::GFP, why is that?

      The authors connect effects on Ex stability to the influence on Hippo pathway activity in Fig 6, which is a very nice touch.

      Finally, I wonder whether the dual effect of PP2A on Hippo activity (inhibiting Hippo and stabilizing Ex) could be a single effect. I am guessing the Ex1-468::GFP construct, having its own regulatory elements, would act independently of the transcriptional activity of Hippo. However, I was not able to find this demonstrated in the literature. Can the authors show that? For example, make hpo or wts mutant clones in the presence of the Ex1-468::GFP construct. Otherwise, an alternative explanation could be that PP2A, with its various adaptor subunits, counters Hippo activity which translates into higher levels of expanded transcription and Ex protein production. It was also demonstrated that there are higher levels of Crb in hippo mutants likely due to the expansion of the apical domain. This would be consistent with the stabilized Crb-intra seen in Figures 1A&3A upon Mts expression. Stabilization of Crb upon Mts expression (not commented on in the manuscript) is very interesting as extra Crb should further push the balance towards Ex degradation but Mts seems to be able to reverse the effect. I agree that this alternative explanation may be far-fetched, yet it is also easily tested, and would greatly simplify the model put forward.

      Finally, if indeed various PP2A complexes, depending on the adaptor subunits they contain, have a range of effects on Ex stability and Hippo pathway activity, this brings in the question of what regulates the availability of various adaptor subunits and the PP2A complexes they form? The question is outside the scope of the manuscript but it is worth discussing.

      Significance

      A vast amount of data is presented in both in vivo and in vitro settings. The study uses biochemical and genetic approaches and combines them aptly.

      I think the findings showing multiple and various effects on PP2A on the same pathway would be of higher interest to the PP2A enthusiasts than the Hippo researchers.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The authors show that the protein phosphatase PP2A antagonizes Crb-mediated phosphorylation and subsequent degradation of Expanded in vivo. Using Drosophila imaginal wing discs and the GAL4-UAS system, the authors provide evidence that the PP2A holoenzyme dephosphorylates Ex, stabilizing its protein levels, in a manner independent of the STRIPAK complex and identifies Wrd as a key regulatory subunit of PP2A in this process. Importantly, the study also shows that PP2A stabilizes Ex protein levels independent of Crb-driven phosphorylation and that, via this stabilization, PP2A activates Hpo pathway signaling to repress transcriptional targets of Yki.

      Major comments: Overall, the study is strong, and the conclusions are supported by the data. The data does largely lean on overexpression models in the wing disc and it would strengthen the biological relevance to include genomic alleles (i.e., do Ex-GFP levels go down in PP2A/mts mutant clones?). Materials and methods are thoroughly presented, and statistical analyses are adequate. OPTIONAL: While not necessarily required for publication, note that full in vivo confirmation would require altering the PP2A target sites in Ex by generating phospho-deficient and phospho-mimetic versions and seeing if they match the model. This would push the conclusions to the highest degree of confidence and rigor.

      Minor comments: Text and figures are clear and accurate. It may be helpful to include a modified version of the Mts mutants table in SF1 in a main figure for easier reference but is not necessary.

      Significance

      The studies strengths include biochemical and in vivo validation of the effect of PP2A and its various regulatory subunits on Ex phosphorylation and stabilization. The study very methodically parses out the context in which PP2A is stabilizing Ex (i.e., both in the context of Crb stimuli and independently, and it does so independently of the STRIPAK complex). As noted previously, recapitulating the major results in clones using genomic alleles would strengthen the biological relevance. The study advances our understanding of mechanisms tightly controlling downstream transcriptional outputs of the Hpo pathway via regulating Ex protein stability/turnover. Though the primary audience may be those well-versed in the Hpo field and Drosophila genetics, the implications for the research are broad.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper nicely shows that PP2A antagonizes Crb-dependent and Crb-independent phosphorylation and degradation of Expanded (Ex), in cell culture and in wing discs. The authors focus on the Mts catalytic subunit of PP2A, but also demonstrate the involvement of the Wrd and Tws B regulatory subunits. They also show via use of transcriptional reporters that PP2A directly affects Hpo signaling in vivo. Finally, they show a potential role for Merlin and Kibra in regulating Ex levels, and that Kib binds to Mts and Wrd. The experiments are on the whole well executed and quantified.

      Major comments:

      1. I am not convinced that the authors can entirely rule out a role for the STRIPAK complex. Mutation of MtsR268A reduces binding of Wrd by 60% and abrogates the effect of Mts on Ex. However mutation of MtsL186A reduces binding of Cka by less than 50% and doesn't disrupt Mts regulation of Ex. Perhaps Cka is more abundant than Wrd, and 50% of Mts/Cka complex is more than sufficient for it to carry out its enzymatic function. I also note that in Fig 1H, Ex levels in Crb/Mts+Cka RNAi appear to be intermediate between those in Crb and Crb/Mts. Ideally this would be quantified. Similarly in 4J, mtsL186A (while not significant) appears intermediate between mtsH118N and mts-WT. What is the actual P value for the comparison to Mts-WT? In any case I would suggest the authors tone down these conclusions.
      2. I also found it rather confusing that the authors discuss the Cka B subunit in the context of the STRIPAK complex in Figure 1, then don't look at the other B subunits until Figures 3/4. In my opinion, it would be easier to follow the flow of the manuscript if the authors discussed Crb-dependent and independent regulation of Ex, then the roles of Gish/CKI, then the role of the B subunits including Cka. In this context, it would also be interesting to see if there was any redundancy between Cka and Wrd - have the authors tried any double knockdown experiments (with appropriate controls for RNAi dosage)?
      3. The authors examine Crb-independent Ex regulation in the wing disc, which appears to be wing discs that do not overexpress Crb. I would expect that wing discs do express Crb - or is this not the case? Please clarify whether this is in the absence of Crb, or the absence of overexpressed Crb.
      4. I was confused by the section 'CKIs and Slmb regulate Ex proteostasis via the 452-457 Slmb consensus sequence'. The authors conclude that 'these results show that the machinery that facilitates Crb-mediated Ex phosphorylation and degradation is also partly involved in the Crb-independent regulation of Ex protein stability.' However, I had concluded the opposite, as it appeared that Slimb and gish RNAi only affected Ex1-468, and similarly Slmb only affected Ex1-468, but not Ex1-450 (which in the previous section was shown to be regulated by Mts independent of Crb). Please could the authors explain/clarify this.
      5. The regulation of Ex by Merlin and Kibra is potentially interesting, but a bit preliminary. This part of the manuscript could be strengthened by showing for example if Mts or Wrd knockdown affects the stabilization of Ex by Kib.

      Minor comments:

      1. The Introduction gives a quite comprehensive review of known interactions between STRIPAK, Expanded and Hippo pathway components. However, it is hard to keep track of all the components and interactions if you are not deeply into the field. To improve accessibility, I would suggest a summary diagram of the key interactions (currently the manuscript has no introductory figures at all!) and if possible the authors might consider whether there are details they could leave out or which could just be mentioned as necessary in the results sections.
      2. Could the authors show a shorter exposure of the Ex blot in Figure 1A, in order to better visualize the loss of band shift?
      3. Line 307 '(Fig. 1B,D,G,I)' the call-out to Fig.1I appears to be in strike-through font, presumably because 1I shouldn't be cited here? It also looks like Fig.1I is wrongly cited on line 342 as that sentence only describes action of L168A in wing discs. I think a sentence describing the experiment in Fig.1I is missing?
      4. Line 355 ambiguous, should this read low expression of Crb in S2 cells?
      5. Line 369 reads 'PP2A was able to stabilize full-length Ex', Mts-WT would be more precise.
      6. The blot in panel 2O is mislabeled Ex1-468, I think this should be Ex1-450.
      7. The nomenclature of 'Mts-WT' for their own transgene and 'Mts-BL' for the Bloomington transgene. is confusing, as both are, I believe, wild type. Maybe leave this detail for the M&M, at least if the authors believe there is no difference in behavior.
      8. Figure S6 appears to be missing from the uploaded version.
      9. Lines 480-481: 'Using co-IP analyses, we observed that Mts interacts with Ex, both in the presence and absence of Crbintra.' No figure call-out is given for this statement, and I can't see the data anywhere, but from the figure legends it seems to be in the missing Fig.S6? And everything that follows in this paragraph should have call-outs for Fig.4K?
      10. Lines 503-504: 'we found that Kib associated with Mts (Fig. 5C)' - Fig.5B?
      11. Lines 504-505: 'no interaction was observed between Mts and Mer (Fig.5B)' - Fig.5C?
      12. In Figure 6G, authors note that 'the mean diap1GFP4.3 levels of MtsWT+Crb-Intra were lower than those of Crb-Intra, this difference was not statistically significant when all genotypes were included in the comparisons, but only when the Control, crbintra and mtsWT+crbintra conditions were considered.' It might be useful to have a table showing the actual P values of all the comparisons (or maybe better still just put actual P values on the graphs?). Sometimes an arbitrary cut-off of 0.05 for significant can be misleading.

      Referees cross-commenting

      *this session contains comments from ALL the reviewers" Rev1

      All comments look very fair and we seem to have similar views, so nothing further to add on our part. Rev 2

      Agreed. We think the reviews provide a consistent guide for revisions/additions that would enhance impact of the studies and rigor of the conclusions. Rev 3

      I also find the other reviewers' comments to be fair. Major issues that stick out are: 1. is the effect really independent of STRIPAK? 2. do the effects seen on ectopic Ex1-468 apply to endogenous Ex?

      A relatively simple experiment could possibly address both issues. If the model is correct and PP2A can target both Hippo and Ex using different adaptor proteins, then we would expect modulating the levels of Tws and Wrd adaptors to influence Ex stability, but not Hpo phosphorylation. Could the authors test this hypothesis in vivo, looking at the endogenous proteins?

      Do the other reviewers think that this would be a fair experiment to ask for? Rev 1 With regard to points of rev 3, I think it's perfectly fair to ask for more data to support the conclusions, and specifically what they suggest regarding separating effects on Hippo and Ex is obviously helpful. The broader question (which I'm unsure how to address in the context of Review Commons) is 'what is necessary for publication' as that depends on where the authors aspire to publish. I would be fine with the authors softening their conclusions and adding caveats instead of adding more data. However, it is also true that adding more data would increase the certainty of their conclusions and lead to a more valuable publication. This is a question for the editor of the journal that they finally submit to, but I'm not sure as reviewers how we lay out these options. Do we add an extra review comment saying either (i) soften conclusions for less valuable paper, (ii) add more data for more valuabe paper, and then leave the authors to argue the point with an editor. In particular the STRIPAK dependence was raised in 2 reviews, so an editor would probably pick up on this. Rev 2 In past reviews for Review Commons, we've distinguished between three levels of review requests: (1) what is minimally necessary to publish (ie egregious gaps); (2) what would enhance confidence in the conclusions, and finally (3) what, if anything, would turn it into a high impact/visibility paper.

      I think most of our suggestions for additional expts fall into category #2 as "either tone down the language or add expt X". Rev 1 That sounds reasonable.

      Significance

      The Hippo signaling pathway is a conserved regulator of tissue growth, and understanding how this pathway is activated and modulated is of great importance. Levels of the upstream activator Expanded are known to be regulated by phosphorylation/degradation, but whether dephosphorylation of Ex is important for growth control has not been widely investigated. This paper utilizes cell culture and the fruit fly model organism to provide clear evidence for a role for PP2A in regulation of Ex levels, independent of its known role in regulating phosphorylation of Hpo. It will therefore be of interest to biologists working in the fields of growth control and tissue homeostasis.

      Expertise: developmental biology, Drosophila research, cell biology

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-0284z

      Corresponding author(s): Bérénice, Benayoun A

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This paper by McGill and colleagues explores sex differences in murine macrophages from different niches. They use a combination of publicly available, and newly developed datasets, and combine these using meta-analysis approaches. They explore DEGs between sexes - both common across niches, and specific to certain niches - and use enrichment analyses to identify pathways linked to these genes. Their overall conclusions are that gene expression changes in females are more consistent across niches, than for males, and are enriched in extracellular matrix-related genes. The paper is easy to follow and very well written.

      Major Comments:

      1. I would suggest Figure 1 be moved to a supplemental figure. We agree that the Xist and Ddx3y is QC and can be removed. However, we believe that the separation of macrophage transcriptomes based on sex in the Multidimensional Scaling plot is an important result. Thus, we have revised Figure 1 to only include the MDS plots and have moved the Xist/Ddx3y plots to the supplement (new Supplemental Figure S1) in line with the reviewer’s suggestion.

      Line 106 - It should be clarified why 50 DEGs was selected as the cut off for exclusion.

      We apologize that our cut off criteria was not explained clearly enough. Because these are publicly available datasets, every lab used different numbers of biological replicates, methods, and sequencing depths, impacting the power of the assay to detect differences in gene expression robustly. Since we were interested in functions that were sex-dimorphic, and that requires running functional enrichment analysis, we needed to have a minimum gene set size to be able to run these analyses, which, in the field, is usually accepted to be 50 genes for robustness. Thus, we used 50 DEGs and have updated the methods to explain our reasoning: “Applying a cutoff for the number of differentially expressed genes (DEGs) helps ensure data consistency and comparability across datasets with varying methodologies and sequencing depths. This prevents datasets with excessively low DEG counts from disproportionately influencing downstream analyses. A cutoff also reduces noise from spurious findings, prioritizing datasets with robust transcriptional changes that are more likely to be biologically meaningful. The excluded microglia dataset contained only 11 DEGs (whereas all other microglia datasets had hundreds of DEGs), the pleural macrophage dataset had 37 (whereas all other lung-related macrophage datasets had above 50), and the spleen macrophage dataset had only 30.” (page 12, lines 381-388).

      Optional - would suggest sex chromosome-linked genes are excluded and the analysis redone to see if there are other autosomal genes that are statistically shadowed by the X and Y linked genes.

      We thank the reviewer for this great suggestion, and we now added this point to the discussion (page 9, lines 260-268). However, we think that genes on the X and Y chromosomes will impact overall function of the macrophages and that they are necessary to understand how macrophages from males and females may support differences in immune function throughout life. We now add this in the discussion as a potential future direction: “We find that a majority of genes similarly differential across sexes among the macrophage niches are sex chromosome linked. X-linked genes like Tlr7, Cxcr3, and Kdm6a enhance immune responses in female macrophages, potentially increasing inflammation with age (Feng et al., 2024). Meanwhile, Y-linked genes such as Uty and Sry influence transcriptional regulation and inflammatory signaling in male macrophages, which may contribute to chronic low-grade inflammation (Lusis, 2019). These genetic differences affect macrophage activity, tissue-specific immune responses, and susceptibility to age-related diseases, highlighting the importance of sex-specific factors in immune research. Future research should also explore how non-sex chromosome-linked genes interact with these sex-specific mechanisms to further shape macrophage and immune function.” (page 9, lines 260-268).

      More metadata about the included studies should be included eg mouse ages, strains, experimental manipulations etc. I can't seem to access all of the Supplemental tables so this may already be included in Table S1.

      We agree that this information is important to take into consideration and have now included this information in Supplemental Table S1A, along with the accession numbers to each dataset. All mice were aged between 2 to 24 weeks and all on variations of the C57BL/6 background.

      How relevant the findings in mice are for humans should be explained further in the discussion.

      We agree that our discussion needs to better explain broader implications. Our findings are relevant for human health because macrophages play key roles in immunity, inflammation, and tissue homeostasis, and their functions are known to differ between sexes. Understanding these sex-specific transcriptional differences in mice can provide insights into how male and female immune systems respond differently to infections, autoimmune diseases, and aging in humans. Since macrophage phenotypes are influenced by both systemic factors (e.g., hormones) and tissue-specific environments, studying multiple macrophage subtypes from different organs helps identify conserved and context-dependent sex differences. Indeed, our findings suggest the ECM may be a potential mechanism underlying sex-biased diseases, such as higher autoimmune prevalence in females or increased susceptibility to certain infections in males. We have added this detail to the discussion (page 10, lines 269-275).

      Minor Comments:

      1. Lines 63-66 - need references here. This mirrors Reviewer 2’s major point #2. We agree with the reviewer that references are needed and now cite PMID: 31541153, PMID: 29533975, PMID: 37863894, PMID: 33415105, and PMID: 37491279 (page 4 line 68-69).

      Line 61 and 69 - repeated.

      We thank the reviewer for catching this oversight and have deleted the first instance of the sentence.

      Reviewer #1 (Significance (Required)):

      Although this study is primarily descriptive, it adds to the current knowledge about sex differences in macrophages, an important and relatively understudied area. Those interested in sex differences and in the innate immune system generally, plus those who study macrophages in any context, should be interested in this work.

      We thank the reviewer for their interest in our work and their helpful suggestions.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The study investigates sex-specific differences in macrophage gene expression across various tissue niches by analyzing both newly generated and publicly available datasets of varying quality. The key finding is the identification of three consistently differentially expressed genes (DEGs) across all macrophage niches: the Y-chromosome-encoded genes Ddx3y and Eif2s3y, and the X-chromosome-specific gene Xist. However, the number of sex-dimorphic DEGs varied significantly between macrophage niches, with female-biased genes showing more consistency across datasets. To further explore these sex-specific differences, the authors performed an overrepresentation analysis of the DEGs across datasets. They found enriched gene sets associated with specific biological terms in female-biased macrophages from peritoneal macrophages, bone marrow-derived macrophages (BMDMs), and osteoclast progenitors (OCPs), while male-biased enrichment was observed in microglia, exudate macrophages, OCPs, and BMDMs. Notably, extracellular matrix (ECM)-related genes were specifically enriched in female peritoneal macrophages and OCPs, whereas the term "nucleic acid binding" was more prominent in male samples from microglia, BMDMs, and OCPs, driven by the Y-chromosome genes Uty and Kdm5d. A gene set enrichment analysis (GSEA) using Gene Ontology (GO) and Reactome databases further confirmed the enrichment of sex-biased pathways. Based on these findings, the authors conclude that three sex chromosome-associated genes are consistently differentially expressed across all datasets and that female-associated gene expression appears to be more stable, particularly in relation to ECM-associated processes.

      Major Comments:

      Are the key conclusions convincing?

      1. The study provides valuable insights into sex-dimorphic gene expression in macrophages across different niches. However, some conclusions appear overinterpreted due to the limited number of differentially expressed genes (DEGs) driving specific terms in the overrepresentation analysis. The reliance on only a few recurring genes (e.g., Kdm5d, Eif2s3y, Uty, and Ddx3y) raises concerns about the biological significance of some enriched terms. A clearer discussion on the limitations of such findings is necessary. We apologize for the confusion. Although the Venn Diagram may give the impression that our comparisons are limited to those few genes, we only highlight them with bold text because they are a good quality control mechanism for our analyses.

      Importantly, methods like gene set enrichment analysis [GSEA] use whole-transcriptome ranking, which means the results we obtain are driven by the entire transcriptome and not just a few genes (GSEA results are reported in Figure 5). We agree that further explanation of these methodologies would improve interpretation of our findings for readers unfamiliar with these analytical techniques. To address this, we have now added the following to the methods: “GSEA relies on whole-transcriptome ranking, ensuring that the results reflect global transcriptomic patterns rather than being influenced by only a few genes.” (page 13, lines 415-417).

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?Some claims, particularly those regarding the role of macrophages in diseases such as AD, histiocytosis, and osteoporosis, lack relevant references.

      This mirrors minor point #1 from Reviewer #1. We apologize for not originally including references for this statement and have now updated the introduction and discussion with appropriate references: “Excessive macrophage activation is associated with numerous conditions, including neurodegeneration, atherosclerosis, osteoporosis, and cancer, many of which exhibit sex-biased tendencies (Chen et al., 2020; Hou et al., 2023; Li et al., 2023; Mammana et al., 2018)” (page 4, lines 67-69) and “Thus, investigating female and male-biased processes in macrophages, including the contribution of the ECM, will be an important step in developing treatments for diseases including, but not limited to, AD, histiocytosis, and osteoporosis(Chen et al., 2020; Cox et al., 2021; Hou et al., 2023; Li et al., 2023; Mammana et al., 2018)” (page 10, lines 285-288).

      Would additional experiments be essential to support the claims of the paper? While additional wet-lab experiments are not strictly necessary, a deconvolution analysis of the datasets could be highly beneficial. This would allow the identification of enriched macrophage subtypes and help assess whether differences between datasets are driven by specific macrophage populations rather than global sex differences. Since peritoneal macrophage origin is influenced by age and inflammation status, deconvolution could also clarify dataset comparability.

      The reviewer makes an interesting point. We apologize for the confusion regarding the purity and origin of these datasets. All the datasets we curated from public repositories for our analysis are from purified populations of macrophages. To clarify this, we now include a column with the purification method used for each of the datasets based on the original manuscript in revised Supplemental Table S1A.

      Since all the used datasets were derived from pure macrophage populations, deconvolution (which is used to identify cellular proportions in heterogeneous contexts) would not accomplish much, predicting that all the cells in the data are macrophages. While some people have argued that deconvolution may be used to identify different cell states, this is very controversial, especially since the “pure” reference and the heterogeneous query are subject to batch effects (i.e. either from differences in bench processing, sex of provenance for target/query datasets, transcriptional impact of sorting methods, differences in transcriptomic quantification methods, etc.) which overshadow most differences beyond cell types. Thus, due to the known batch sensitivity of deconvolution methods and the fact that we only selected pure macrophage transcriptomic profiling datasets, using deconvolution to identify macrophage subtypes would not be informative/feasible. Importantly, we focused our analyses on datasets derived only from young, healthy, naïve animals (2 to 24 weeks), without any interference from age-related inflammation.

      To make this caveat clearer, we have added sentences to the results section indicating the age range of the animals (page 6, lines 100-101), as well as in the discussion to discuss how inflammation states and age may change some of our findings (page 10, lines 295-299).

      Are the suggested experiments realistic in terms of time and resources? Performing cell-type deconvolution using established computational tools (e.g., CIBERSORT, BisqueRNA, or single-cell deconvolution methods) would be a realistic approach within a few weeks and would significantly strengthen the study. This analysis would not require additional experimental work but could refine the interpretation of the dataset. Additionally, a PCA of all datasets could help identify potential similarities among macrophages from different niches and between sexes.

      As explained in our response to point #4, the use of only datasets from purified macrophages from young animals (before any influence of age or disease) makes deconvolution analysis meaningless, especially due to batching concerns. Specifically, it would require us to generate paired single-cell and bulk datasets on all macrophage subtypes in house to remove batch-inducing experimental biases, which we believe is outside of the scope of this small bioinformatics study.

      To the second point, doing a PCA of all the datasets together would not provide much new information beyond cell type of origin due to batching concerns that could not be corrected, which are a known problem in transcriptomics analyses (PMID:20838408, PMID:28351613). Since datasets come from different labs, using different isolation methods, RNA capture choices, library construction kits and sequencing platforms, the main separating effects overall will be batch/dataset, not biology (PMID:20838408, PMID:28351613). Indeed, this is what we observe (Reviewer Figure 1), with broad separation of datasets by tissue of origin, then dataset of origin. Additionally, the top 10 loadings for PC1 and PC2 are primarily associated to autosomal genes (i.e. not on the sex chromosomes; Reviewer Table 1).

      Reviewer Figure 1. (A) PCA of all samples across datasets. Read counts were processed together through R package sva v.3.46.0 for surrogate variable estimation, and surrogate variables were removed using the removeBatchEffect function from ‘limma’ v.3.54.2. DESeq2 normalized counts were used to make the PCA. (B) Zoomed in PCA excluding three outlier sample to enable easier visual discrimination of samples.

      Principal Component – Gene

      Loading

      Chromosome

      PC1- Srcin1

      0.013601

      11

      PC1- Cacna1c

      0.013593

      6

      PC1- Pclo

      0.01357

      5

      PC1- Tro

      0.013547

      X

      PC1- Ppp4r4

      0.013541

      12

      PC1- Ppp1r1a

      0.01354

      15

      PC1- Homer2

      0.013538

      7

      PC1- Caskin1

      0.013535

      17

      PC1- Arhgef9

      0.013527

      X

      PC1- Slc4a3

      0.013499

      1

      PC2- Gm15446

      0.017978

      5

      PC2- 1810034E14Rik

      0.017897

      13

      PC2- Gm19557

      0.017871

      19

      PC2- Pkd1l2

      0.017792

      8

      PC2- H60b

      0.017274

      10

      PC2- Appbp2os

      0.01723

      11

      PC2- Mir7050

      0.017221

      7

      PC2- Nkapl

      0.017166

      13

      PC2- Tmem51os1

      0.017083

      4

      PC2- Dpep3

      0.016962

      8

      Reviewer Table 1. Top 10 loadings for principal component 1 and principal component 2 with their respective chromosomal location.

      Thus, since batch effects can only be accounted for rigorously when they are not confounded by biology (and in our case since each dataset only looks at one type of macrophage), this cannot be corrected in a rigorous manner to yield the desired results.

      We have added a sentence to the discussion to highlight how future work where macrophages from diverse niches would be profiled in parallel may give greater insights into niche-specific sex-dimorphic effects (page 10, line 295-296).

      Are the data and the methods presented in such a way that they can be reproduced? Some methodological details are missing, particularly regarding:

      The isolation of mouse peritoneal macrophages (details on injection and harvesting procedure needed). Quality control of isolated macrophages (How were contaminating cells excluded? Was additional validation performed beyond using the kit?)

      The age of mice used for bone marrow-derived macrophages (BMDMs) is not provided, which is important given that immune responses can be age-dependent.

      We appreciate the reviewer’s request for additional methodological details. We apologize for not being clear with our details and have updated the methods to be clearer (page 11, lines 320-346), as well as added this information in revised Supplemental Table S1A (e.g. age of animals and purification method as described in the original papers). For all our in house datasets, mice were 4-months old, and the text is now updated to reflect this: “Long bones (tibia and femur) of young (4-months-old) from both sexes were collected and bone marrow was flushed into 1.5mL Eppendorf tubes via centrifugation (30 seconds, 10,000g) (Amend et al., 2016)” (page 11, lines 334-336).

      While we couldn’t check the purity post hoc for published datasets we identified for meta-analysis, we performed a purity check on our isolated peritoneal macrophages using Cd11b-F4/80 staining by flow cytometry and have now included this data (including gating strategy) in Supplemental Figure S4. For BMDMs, no purity check was performed, as there is extensive literature on the efficiency of this differentiation protocol which consistently yields > 90% of macrophages. This has been added to the methods: “We used a protocol that is expected to yield ~90% Cd11b+ F4/80+ cells (Mendoza et al., 2022; Toda et al., 2021)” (page 11, lines 336-337).

      Are the experiments adequately replicated and statistical analysis adequate? The statistical analysis appears generally appropriate, but there are concerns about dataset inconsistencies that should be addressed. Some datasets were not used across all analyses, which is not clearly indicated in figures or text. This should be explicitly mentioned to avoid misleading interpretations.

      We appreciate the reviewer’s careful evaluation of our statistical analysis and the concern regarding dataset inconsistencies.

      We believe that the reviewer is referring to the omission of the exudate dataset from the Venn Diagram analysis (Figure 2C), as this is the only time that we did not report the results from all datasets. We originally chose not to include the exudate dataset in the shared differentially expressed gene (DEG) analysis, because it contained over 1,300 DEGs, whereas all other datasets had between 4–30 DEGs, resulting in an unreadable figure.

      However, we agree that it is important to include for the readers, and while we have decided to still exclude the exudate dataset from Figure 1C for readability purposes, we now include the overlap analyses for all datasets in Supplemental Figure S2 using an upset plot (an alternative visualization method) showing all 6 niches, as well as a table panel that lists the shared genes across niches “Three genes were found to be differentially expressed across all six niches: Xist, Ddx3y, and Eif2s3y (Figure 2C, Supplemental Figure 2A,B)” (page 6, lines 124-126). We thank the reviewer for drawing our attention to this and making our analysis clearer for future readers.

      Minor Comments

      1. Figures are included twice in the manuscript. We apologize for this, and figures are now only included once.

      The use of stereotypic colors in figures (e.g., blue for male, pink for female) could be reconsidered for better readability and to avoid reinforcing gender stereotypes.

      While we understand that this color choice might feel gender normative, we respectfully disagree with the reviewer, as we believe that for the expediency of scientific communication it is important to choose a color palette that is easily understandable without confusion without even needing to consult a legend.

      Importantly, we have been using the same color palette in all publications from the lab on sex-differences for consistency (Lu et al, Nat aging 2021 PMID: 34514433; McGill et al, PLoS ONE, 2023 PMID: 38032907; Kang et al, J Neuroinflammation, 2024 PMID: 38840206; McGill et al, STAR Protocols, 2021 PMID: 34820637), which is crucial for scientific rigor and communication consistency.

      Results - Section 1

      Line 92: The word 'identified' may not be the most appropriate choice here, as it implies discovery rather than selection. Consider rephrasing to 'compiled' or 'gathered' to more accurately reflect the process of assembling the datasets. Additionally, the sentence structure could be refined for clarity, such as specifying that the datasets include both newly generated and publicly available data.

      We have changed two instances of using the word identified to “collected” and “gathered” (page 4, line 83 and page 6, line 98). We also adjusted the sentence to say, “Although we initially collected 21 datasets, both newly generated and publicly available, for our study, only 18 datasets were retained after various quality filtering steps for downstream analysis” (page 4, lines 83-85).

      Line 95: Specify the source of exudate-derived macrophage data.

      We have updated Supplemental Table S1A to make sure it was comprehensively describing the datasets we used in our analysis and double checked that it was complete (including for the exudate data). We have updated the text to reflect this: “All accession numbers and corresponding manuscripts are found in Supplemental Table S1A” (page 6, lines 103-104).

      Figure 1/2A: The scheme overview lacks clarity-its purpose is unclear. The two identical boxes are redundant and do not provide additional insight. Consider illustrating the origins of different macrophage subtypes instead. The cutoff of >50 DEGs should be included in the schematic to improve clarity. Overrepresentation and GSEA analysis should not be illustrated multiple times across different figures-it is redundant.

      In Figure 1A, we included the identical boxes to indicate that no datasets were excluded for incorrect labeling of males/females. However, we agree that this is unnecessary and have removed the second box as suggested.

      In Figure 2A, we agree the identical boxes are unneeded as the Xist/Ddx3y quality control step was listed in Figure 1A, and we have modified the figure accordingly.

      We also agree that including the DEG cutoff and removing the GSEA mention will streamline the figures and have updated them accordingly as well.

      Line 100: The mention of R software should be moved to the Methods section instead of appearing in the Results section.

      We have now updated the text to say, “Expression levels of male-specific Ddx3y and female-specific Xist genes across all samples were examined to ensure proper sex labeling of samples (Supplemental Figure 1A-U)” (page 6, lines 111-112).

      Figure 1B-V: The current figure layout is visually cluttered. Consider plotting male and female datasets together in a single graph with different point shapes instead of separate panels for each specific niche.

      This seems to echo the above request for a global PCA in Reviewer 2’s Major Point #4, which unfortunately cannot be included due to the disproportionate impact of batch effects that has been well documented in the literature (Reviewer Figure 1; PMID:20838408, PMID:28351613). However, to make the figure clearer and less cluttered, and to address related Reviewer 1’s Major Point #1, we have moved the Xist/Ddx3y plots to Supplemental Figure S1 and only include the Multidimensional Scaling plots in Figure 1 to showcase the sex separation in each dataset.

      Text-Figure alignment: The text describes male/female-specific gene expression levels first, while the figure starts with MDS analysis. The order should be consistent.

      We agree and have adjusted the text accordingly (lines 109-112).

      Figure 2C: Exudate data is missing-explain why.

      This point echoes major point #6. As explained above, we have clarified this and included new data panels for clarity (New Supplemental Figure S2).

      Results - Section 2

      Line 151: Use consistent terminology-either "DEGs" or "DE genes", not both.

      We replaced all instances of “DE genes” with DEGs (lines 132, 137, 141, 147, 149, 163, and 397).

      Figure 3A: The text suggests not all datasets were included in this analysis-this should be explicitly indicated in the figure.

      We apologize for the confusion. All datasets were included in this analysis; however, some niches did not have any GO terms passing the FDR

      Show the number of DEGs used for analysis.

      We apologize for the confusion. For the ORA analyses (Figures 3 and 4), we indicate the number of DEGs used for analysis in the panel header. For the GSEA analysis (Figure 5, Supplemental Figure S3), all expressed genes are ranked based on effect size without any prior filter (see response to major point #1), so DEGs are irrelevant for these analyses.

      Figure 3B: Smaller pale dots in the bubble plot are difficult to distinguish-consider using a darker outline.

      We have now added outlines to all the bubbles in the plots to help improve visibility.

      Line 158: The term "phagocytosis" appears inconsistent with the figure, where it is labeled "phagocytosis, recognition".

      We have updated the text accordingly (page 7, line 170).

      Figure 4B, D, E: The overrepresentation analysis is based on very few genes (often only 1-2 genes per term), which may lead to overinterpretation.

      We apologize for the lack of clarity of our previous manuscript. The number of genes used for DEG analysis is in the panel titles of Figure 3 and 4. While the overlap is small, this is unlikely to be spurious since all of the pathways we discuss show significant enrichment with FDR

      Consider explicitly naming these genes and discussing their biological role instead of assigning terms based on minimal evidence.

      We now discuss these genes in the results: “Male-biased GO terms for microglia, OCPs, and BMDMs derived from four genes: Kdm5d, Uty, Ddx3y, and Eif2s3y. All of these are Y-linked genes and play crucial roles in regulating innate and adaptive immune responses (Meester et al., 2020). Kdm5d and Uty influence adaptive immunity through chromatin remodeling and histone modification, while Ddx3y and Eif2s3y shape innate immune responses by modulating macrophage activation and cytokine production via translation initiation and RNA processing (Bloomer et al., 2013; Hamlin et al., 2024; Meester et al., 2020) “(page 8, lines 195-200).

      Figures S3G and S3H seem to be switched.

      We are puzzled by this comment, as our original manuscript did not include a Supplemental Figure S3. Out of an abundance of caution, however, we checked that Supplemental Table S3G and H were correctly labelled, and independently confirmed that they are not switched.

      Results - Section 3

      Figure 5A does not add significant new insights. Consider refining its content to highlight key findings more effectively.

      We respectfully disagree and believe that schematic overviews help readers understand what is accomplished in any specific figure and have thus decided to keep it.

      Number of genes included in the analysis is not provided-this is important to assess significance and should be stated in methods and figure legends.

      We apologize for the lack of clarity. As explained above, GSEA uses all the genes in rank order (PMID: 16199517), we now explain GSEA more explicitly in the text “GSEA relies on whole-transcriptome ranking, ensuring that the results reflect global transcriptomic patterns rather than being influenced by only a few genes” (page 13, lines 415-417).

      Discussion 20. Line 201-203: Missing reference.

      We have now updated the text with the proper reference: “Tissue-resident macrophages are crucial to proper immune system function (Guilliams et al., 2020). While all macrophages share the responsibility of clearing cellular debris and foreign bodies, tissue-resident macrophages also have unique responsibilities that facilitate homeostasis throughout the body (Guilliams et al., 2020; Varol et al., 2015)” (page 9, lines 227-230).

      Reference 23 (1999) is outdated. Newer literature should be cited to reflect modern insights into sex differences in macrophages.

      We have now updated the text with an updated reference for two outdated references: (i) “Sex differences have previously been reported in macrophages, with female macrophages having higher phagocytic activity than males (Scotland et al., 2011)” (page 9, lines 232-233) and (ii) “Dysfunctional OCPs are associated with development of osteoporosis, a disease that is four times more prevalent in women (Alswat, 2017)” (page 10, lines 284-285).

      Peritoneal macrophages and OCPs originate from monocytes. Would deconvolution help identify enriched subtypes and assess dataset comparability?

      As noted in Reviewer 2’s Major Points #3 and #4, deconvolution analysis is not meaningful for subtype analysis without paired isolated/bulk datasets, which are outside of the scope of this study to generate.

      The 'more consistent' pathways found for female datasets are not discussed.

      We now discuss pathways found among the female datasets: “In addition, GSEA analysis of REACTOME gene sets showed male-biased expression for cell cycle related pathways (average set size 499), and female-biased expression for G protein-coupled receptor (GPCR) signaling (average set size 122) and extracellular matrix organization (average set size 127) (Figure 5C, Supplemental Table S4S-AJ; consistent with our ECM observation, Supplemental Figure S3A). Macrophages express a wide variety of GPCRs that allow them to respond to different stimuli. The expression of specific GPCRs influences macrophage polarization toward either a pro-inflammatory or anti-inflammatory state (Wang et al., 2019). A manual review of the genes contributing to this GPCR enrichment reveals the presence of several chemokine-related genes (such as Ccl4, Ccr4, Cxcl1, and others) (Supplemental Table S4). This suggests that females may have an increased abundance of chemokine GPCRs, potentially contributing to heightened autoimmune activity, among other factors.” (page 8, lines 212-222).

      Methods - Peritoneal macrophage isolation:

      Details on injection and harvesting are missing.

      We apologize for not being clear with our details and have modified the methods to be clearer (page 11, lines 320-331).

      How was contamination from other cell types assessed? F4/80 selection may not be fully macrophage-specific, and contamination could occur due to insufficient washing or the presence of non-macrophage F4/80+ cells.

      For the peritoneal macrophage datasets we generated, the macrophages were checked for purity through flow cytometry using Cd11b and F4/80 antibodies. We considered double positive Cd11b+ F4/80+ cells to be macrophages, which represents >95% of cells using our methodology (Supplemental Figure S4), without a difference between sexes.

      For the BMDMs, we utilize a protocol that is expected to yield ~90% Cd11b+ F4/80+ cells (PMID: 35212988 and PMID: 33458708).

      Finally, we now include the purification method for all publicly available datasets according to their original manuscript in Supplemental Table S1A and explicitly discuss the information for our in-house datasets in the methods (page 11, lines 321-346).

      • Bone marrow macrophages:

      Mouse age is not provided in the results part.

      We now provide this information in the methods (page 11, line 334). All ages for all datasets are now included in Supplemental Table S1A.

      Figure Legends

      Figure 2: Peritoneal macrophages are abbreviated as PeriMac-consider using this abbreviation consistently in the text.

      We respectfully disagree with the reviewer and choose to keep Peritoneal Macrophages spelled out in the text for clarity. We use the shorthand “PeriMac” in Figure 2 and Figure 5 solely for spacing purposes, but these are explained in the figure legend.

      Reviewer #2 (Significance (Required)):

      The study's strengths include the integration of multiple datasets, the use of both overrepresentation and GSEA, and the exploration of tissue-specific macrophage niches. These findings have relevance for diverse communities, including immunologists, sex-difference researchers, and those studying macrophage-driven diseases such as osteoporosis, neurodegeneration, and chronic inflammation. The work provides a foundation for further studies on sex-specific macrophage biology and may have implications for sex-specific therapeutic strategies. However, the study has limitations. The conclusions regarding enriched pathways rely heavily on a small number of DEGs, raising concerns about overinterpretation. Additionally, dataset variability and missing data for some analyses (e.g., exudate macrophages) could affect the robustness of the results.

      Despite these limitations, the study makes a meaningful but incremental advance by highlighting stable sex-dimorphic patterns in macrophage biology. It provides insights for both fundamental and translational research, particularly for audiences focused on immune regulation, sex-specific gene expression, and tissue-specific macrophage function.

      We thank the reviewer for understanding the importance of our work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: McGill et al. explore sex-based differences in macrophage gene expression across various tissues. Using a meta-analysis of publicly available and newly generated datasets, they identify conserved and divergent sex-dimorphic genes and pathways between tissues. Overall, the report is easy to follow and guides the reader through the analysis. The authors highlight the relevance of the report by noting sex differences in immune responses to infection, autoimmunity, and chronic diseases. The inclusion of 17 independent transcriptomic datasets provides a robust and extensive analysis of sex-based transcriptional differences. The authors explore potential biological implications of sex-based transcriptional differences using pathway analysis. Despite the overall strengths, there are some points for which further clarification and analysis would improve the manuscript. Detailed comments are listed below.

      Major comments:

      1. A comparison of the overall transcriptomic profiles of macrophages regardless of sex would be additive. Knowing the degree of similarities and differences among macrophages from different niches would help the reader determine what genetic programs vary by compartment. If macrophages are very different by niche, it is not surprising that they share few sex-dimorphic patterns. This mirrors Reviewer 2’s Major Point #4. While this approach may seem valuable, it would only be feasible if all datasets were generated simultaneously by the same lab using identical sequencing and library preparation protocols to avoid batch effects. In this case, biology and batch effects are confounded, making any global analysis misleading. Although the reviewer may find the limited overlap unsurprising, given that macrophages are generally considered to be the same cell type, our goal was to explore the extent of shared versus distinct features across datasets, which we believe to be an invaluable question for the field.

      Although it would not be possible to do this rigorously with the data we curated, the question of niche specific gene regulation of macrophages has been studied, showing extensive niche-specific regulation: “While the question of niche-specific gene regulation has been studied, showing extensive niche-specific regulation (Gosselin et al., 2014; Lavin et al., 2014), a comprehensive and systematic study of sex-differences across macrophage subtypes has not yet been performed” (page 4, lines 78-81).

      It is unclear what age and strain the mice were and the number of samples that were included (n) for each dataset. This information should be included in S1A. If different ages or strains were used, how might this impact findings?

      This mirrors Reviewer 1’s Major Point #4. We agree that this information is important to take into consideration and have now included this information in Supplemental Table 1A, along with the accession numbers to each dataset. Because there is no aging effect (all mice are aged between 2 to 24 weeks) and all mice are on a variation of the C57BL/6 background, we don’t expect this to be a major problem impacting our findings.

      The authors used a Jaccard index to examine similarities in sex-based differences across tissue compartments. They claim that there are more similarities in females. However, the male are female graphs (Fig. 1E,D) do not look that different. Is there a better way to display this?

      We apologize for the lack of clarity. We clustered the Jaccard matrices using hierarchical clustering to determine patterns of sharing. Thus, in these figures, the samples cluster based on the degree of similarity in sex-biased genes. In the females, there is clear separation by macrophage origin (yolk sac or circulating monocytes); whereas males have some separation but also have some mixing (e.g. Peritoneal Macrophage 2 clustering with the yolk-sac derived macrophage datasets). Additionally, four microglia datasets are together in the females with only one separate, whereas in the males they are split into three. We included colored bars by the dataset names to help highlight clear separation by niche of origin.

      We have added this detail to the text to better explain the similarities: “Our results indicate that female-biased genes were more consistent among the cell types compared to male-biased genes (Figures 2D,E). In females, there is clear separation by macrophage origin (yolk sac or circulating monocytes), with all the peritoneal macrophages clustering together, followed by bone-related macrophages, then microglia and lung macrophages. In the males, the five microglia datasets are split into three groups, and Peritoneal Macrophage 2 clusters with the yolk-sac derived macrophage datasets” (page 7, lines 155-160).

      In the Gene Ontology analysis, it is unclear what type of GO pathways were included (biological process, cellular component, molecular function). Also, some of the GO analyses were done with very few genes (as little as 4).

      This echoes Reviewer #2’s Major Comment #1. For the Overrepresentation analysis (ORA) using Gene Ontology, we use the “ALL” option to include biological process, cellular component, and molecular function terms. We used ORA to look at shared DEGs across datasets of the same niche which is why some have very low input. For this reason, we also performed Gene Set Enrichment Analysis that uses all genes, not just those differentially expressed at FDR 5%, to examine gene changes at a broader level. In the methods we have added this information: “The differentially expressed genes shared within each niche were divided into up and down-regulated based on the sign of the DEseq2 log2 fold change. These gene lists were used as the shared genes and all expressed genes across datasets in that specific niche were used as the universe for the clusterProfiler function ‘enrichGO’, using the “ALL” option to include biological process, cellular component, and molecular function terms” (page 13, lines 405-410) and “GSEA relies on whole-transcriptome ranking, ensuring that the results reflect global transcriptomic patterns rather than being influenced by only a few genes.” (page 13, lines 415-417)”.

      Is it possible to combine datasets by tissue to remove potential batch effects before downstream analyses? At the very least, PCA on combined data may help determine if some biological (e.g., age, strain) or technical (batch) differences are contributing to identifying few common sex differences.

      This mirrors Reviewer #2’s Major Point #4. Unfortunately, since every dataset only examined a single niche, biology and batches are confounded, and thus performing a PCA on all datasets together will be driven by technical rather than biological drivers. Batch effects are a well-documented issue in genomics (PMID:20838408, PMID:28351613) Indeed, this is largely observed when we attempt this analysis, with datasets clustering by batch (Reviewer Figure 1). Due to the issue of uncorrectable batch effects, we do not believe this analysis meets the rigor required to be included in the revised manuscript and have chosen to not include it.

      Validation of key results would further strengthen the manuscript.

      We agree that future validation is important but is beyond the scope of this purely bioinformatic analysis. We have included text in the revision to highlight the importance of future validation studies: “Thus, investigating female- and male-biased processes in macrophages, including the contribution of the ECM, will be an important step in developing treatments for diseases including, but not limited to, AD, histiocytosis, and osteoporosis, and future research will be essential to validate these findings and further refine therapeutic strategies (Chen et al., 2020; Cox et al., 2021; Hou et al., 2023; Li et al., 2023; Mammana et al., 2018)” (page 10, lines 285-289).

      Further contextualization of key results would enhance the discussion. For example, ECM-related differences in female macrophages could have broader roles in wound healing, fibrosis, and migration.

      We agree with the reviewers and have added this detail to the discussion: “ECM components are emerging as key regulators of innate immune responses (García-García & Martin, 2019). Macrophages contribute to ECM remodeling by producing and degrading collagens (Sutherland et al., 2023), and ECM-related differences in female macrophages may impact wound healing, fibrosis, and migration. In lung and kidney tissues, macrophages recruit and activate fibroblasts, influencing fibrosis through direct interactions and ECM-degrading enzymes (Nikolic-Paterson et al., 2014). The balance between ECM deposition and degradation is crucial for tissue homeostasis, as excessive fibrosis leads to pathology (Nikolic-Paterson et al., 2014; Ran et al., 2025). Mechanical properties of the ECM, such as stiffness and collagen crosslinking, enhance macrophage adhesion, migration, and inflammatory activation (Hsieh et al., 2019). These ECM cues direct macrophage behavior during injury response, influencing their ability to reach inflammation sites and promote repair. Thus, female-biased expression of ECM-related genes may contribute to phenotypes such as enhanced wound healing or even fibrosis(Balakrishnan et al., 2021; Harness-Brumley et al., 2014; Rønø et al., 2013) “ (page 9, lines 248-259).

      Minor comments:

      1. Line 51: In the introduction, the authors state that macrophages produce chemokines. There are other signaling molecules produced by macrophages (e.g., cytokines) that also contribute to immune responses. We apologize for this and have updated the text to say: “Macrophages are a key component of the mammalian immune system and are responsible for producing a diverse array of signaling molecules including (but not limited to) cytokines, chemokines, and interferons that activate the rest of the immune system to combat infection (Shapouri-Moghaddam et al., 2018)” (page 4, lines 49-52).

      Line 53: The authors state that after birth the primary source of new macrophages come from differentiation of monocytes. However, some tissue resident macrophages are self-renewing.

      We apologize for this oversight and have adjusted the text to say: “After birth, the primary source of new macrophages comes from the differentiation of monocytes, which can be recruited to tissues throughout life. However, some tissue resident macrophages can self-renew, including those from the pleural and peritoneal cavities (Röszer, 2018)” (page 4, lines 53-56).

      Line 123: "spermatogenial" should be "spermatogonial"

      We have updated the text accordingly (page 6, line 130).

      Reviewer #3 (Significance (Required)):

      Significance: • General assessment: The study provides a novel and comprehensive analysis of sex-dimorphic gene expression in macrophages, with key findings that emphasize the importance of ECM remodeling in female macrophages. The strengths include the broad dataset inclusion, rigorous quality control, and methodological rigor. However, consideration of potential confounding variables (e.g., age, strain) should be included and validation of key results would strengthen the manuscript. • Advance: This study advances knowledge by analyzing sex differences across multiple macrophage niches rather than focusing on a single tissue type. It extends findings from previous immune studies. • Audience: This report would be of interest to immunologists and researchers studying sex differences. Expertise: Immunology, sex differences in disease, macrophage biology, transcriptomics, and inflammation research.

      We thank the reviewer for their positive comments on the impact of our work and for their useful feedback.

      __ __


      References

      Alswat, K. A. (2017). Gender Disparities in Osteoporosis. J Clin Med Res, 9(5), 382-387. https://doi.org/10.14740/jocmr2970w

      Amend, S. R., Valkenburg, K. C., & Pienta, K. J. (2016). Murine Hind Limb Long Bone Dissection and Bone Marrow Isolation. J Vis Exp(110). https://doi.org/10.3791/53936

      Balakrishnan, M., Patel, P., Dunn-Valadez, S., Dao, C., Khan, V., Ali, H., El-Serag, L., Hernaez, R., Sisson, A., Thrift, A. P., Liu, Y., El-Serag, H. B., & Kanwal, F. (2021). Women Have a Lower Risk of Nonalcoholic Fatty Liver Disease but a Higher Risk of Progression vs Men: A Systematic Review and Meta-analysis. Clin Gastroenterol Hepatol, 19(1), 61-71.e15. https://doi.org/10.1016/j.cgh.2020.04.067

      Bloomer, L. D., Nelson, C. P., Eales, J., Denniff, M., Christofidou, P., Debiec, R., Moore, J., Zukowska-Szczechowska, E., Goodall, A. H., Thompson, J., Samani, N. J., Charchar, F. J., & Tomaszewski, M. (2013). Male-specific region of the Y chromosome and cardiovascular risk: phylogenetic analysis and gene expression studies. Arterioscler Thromb Vasc Biol, 33(7), 1722-1727. https://doi.org/10.1161/atvbaha.113.301608

      Chen, K., Jiao, Y., Liu, L., Huang, M., He, C., He, W., Hou, J., Yang, M., Luo, X., & Li, C. (2020). Communications Between Bone Marrow Macrophages and Bone Cells in Bone Remodeling. Front Cell Dev Biol, 8, 598263. https://doi.org/10.3389/fcell.2020.598263

      Cox, N., Pokrovskii, M., Vicario, R., & Geissmann, F. (2021). Origins, Biology, and Diseases of Tissue Macrophages. Annu Rev Immunol, 39, 313-344. https://doi.org/10.1146/annurev-immunol-093019-111748

      Gosselin, D., Link, V. M., Romanoski, C. E., Fonseca, G. J., Eichenfield, D. Z., Spann, N. J., Stender, J. D., Chun, H. B., Garner, H., Geissmann, F., & Glass, C. K. (2014). Environment drives selection and function of enhancers controlling tissue-specific macrophage identities. Cell, 159(6), 1327-1340. https://doi.org/10.1016/j.cell.2014.11.023

      Hamlin, R. E., Pienkos, S. M., Chan, L., Stabile, M. A., Pinedo, K., Rao, M., Grant, P., Bonilla, H., Holubar, M., Singh, U., Jacobson, K. B., Jagannathan, P., Maldonado, Y., Holmes, S. P., Subramanian, A., & Blish, C. A. (2024). Sex differences and immune correlates of Long Covid development, symptom persistence, and resolution. Sci Transl Med, 16(773), eadr1032. https://doi.org/10.1126/scitranslmed.adr1032

      Harness-Brumley, C. L., Elliott, A. C., Rosenbluth, D. B., Raghavan, D., & Jain, R. (2014). Gender differences in outcomes of patients with cystic fibrosis. J Womens Health (Larchmt), 23(12), 1012-1020. https://doi.org/10.1089/jwh.2014.4985

      Hou, P., Fang, J., Liu, Z., Shi, Y., Agostini, M., Bernassola, F., Bove, P., Candi, E., Rovella, V., Sica, G., Sun, Q., Wang, Y., Scimeca, M., Federici, M., Mauriello, A., & Melino, G. (2023). Macrophage polarization and metabolism in atherosclerosis. Cell Death Dis, 14(10), 691. https://doi.org/10.1038/s41419-023-06206-z

      Lavin, Y., Winter, D., Blecher-Gonen, R., David, E., Keren-Shaul, H., Merad, M., Jung, S., & Amit, I. (2014). Tissue-resident macrophage enhancer landscapes are shaped by the local microenvironment. Cell, 159(6), 1312-1326. https://doi.org/10.1016/j.cell.2014.11.018

      Li, M., Yang, Y., Xiong, L., Jiang, P., Wang, J., & Li, C. (2023). Metabolism, metabolites, and macrophages in cancer. J Hematol Oncol, 16(1), 80. https://doi.org/10.1186/s13045-023-01478-6

      Mammana, S., Fagone, P., Cavalli, E., Basile, M. S., Petralia, M. C., Nicoletti, F., Bramanti, P., & Mazzon, E. (2018). The Role of Macrophages in Neuroinflammatory and Neurodegenerative Pathways of Alzheimer's Disease, Amyotrophic Lateral Sclerosis, and Multiple Sclerosis: Pathogenetic Cellular Effectors and Potential Therapeutic Targets. Int J Mol Sci, 19(3). https://doi.org/10.3390/ijms19030831

      Meester, I., Manilla-Muñoz, E., León-Cachón, R. B. R., Paniagua-Frausto, G. A., Carrión-Alvarez, D., Ruiz-Rodríguez, C. O., Rodríguez-Rangel, X., & García-Martínez, J. M. (2020). SeXY chromosomes and the immune system: reflections after a comparative study. Biol Sex Differ, 11(1), 3. https://doi.org/10.1186/s13293-019-0278-y

      Rønø, B., Engelholm, L. H., Lund, L. R., & Hald, A. (2013). Gender affects skin wound healing in plasminogen deficient mice. PLoS One, 8(3), e59942. https://doi.org/10.1371/journal.pone.0059942

      Röszer, T. (2018). Understanding the Biology of Self-Renewing Macrophages. Cells, 7(8). https://doi.org/10.3390/cells7080103

      Scotland, R. S., Stables, M. J., Madalli, S., Watson, P., & Gilroy, D. W. (2011). Sex differences in resident immune cell phenotype underlie more efficient acute inflammatory responses in female mice. Blood, 118(22), 5918-5927. https://doi.org/10.1182/blood-2011-03-340281

      Shapouri-Moghaddam, A., Mohammadian, S., Vazini, H., Taghadosi, M., Esmaeili, S. A., Mardani, F., Seifi, B., Mohammadi, A., Afshari, J. T., & Sahebkar, A. (2018). Macrophage plasticity, polarization, and function in health and disease. J Cell Physiol, 233(9), 6425-6440. https://doi.org/10.1002/jcp.26429

      Wang, X., Iyer, A., Lyons, A. B., Körner, H., & Wei, W. (2019). Emerging Roles for G-protein Coupled Receptors in Development and Activation of Macrophages. Front Immunol, 10, 2031. https://doi.org/10.3389/fimmu.2019.02031

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      McGill et al. explore sex-based differences in macrophage gene expression across various tissues. Using a meta-analysis of publicly available and newly generated datasets, they identify conserved and divergent sex-dimorphic genes and pathways between tissues. Overall, the report is easy to follow and guides the reader through the analysis. The authors highlight the relevance of the report by noting sex differences in immune responses to infection, autoimmunity, and chronic diseases. The inclusion of 17 independent transcriptomic datasets provides a robust and extensive analysis of sex-based transcriptional differences. The authors explore potential biological implications of sex-based transcriptional differences using pathway analysis. Despite the overall strengths, there are some points for which further clarification and analysis would improve the manuscript. Detailed comments are listed below.

      Major comments:

      • A comparison of the overall transcriptomic profiles of macrophages regardless of sex would be additive. Knowing the degree of similarities and differences among macrophages from different niches would help the reader determine what genetic programs vary by compartment. If macrophages are very different by niche, it is not surprising that they share few sex-dimorphic patterns.
      • It is unclear what age and strain the mice were and the number of samples that were included (n) for each dataset. This information should be included in S1A. If different ages or strains were used, how might this impact findings?
      • The authors used a Jaccard index to examine similarities in sex-based differences across tissue compartments. They claim that there are more similarities in females. However, the male are female graphs (Fig. 1E,D) do not look that different. Is there a better way to display this?
      • In the Gene Ontology analysis, it is unclear what type of GO pathways were included (biological process, cellular component, molecular function). Also, some of the GO analyses were done with very few genes (as little as 4).
      • Is it possible to combine datasets by tissue to remove potential batch effects before downstream analyses? At the very least, PCA on combined data may help determine if some biological (e.g., age, strain) or technical (batch) differences are contributing to identifying few common sex differences.
      • Validation of key results would further strengthen the manuscript.
      • Further contextualization of key results would enhance the discussion. For example, ECM-related differences in female macrophages could have broader roles in wound healing, fibrosis, and migration.

      Minor comments:

      • Line 51: In the introduction, the authors state that macrophages produce chemokines. There are other signaling molecules produced by macrophages (e.g., cytokines) that also contribute to immune responses.
      • Line 53: The authors state that after birth the primary source of new macrophages come from differentiation of monocytes. However, some tissue resident macrophages are self-renewing.
      • Line 23: "spermatogenial" should be "spermatogonial"

      Significance

      Significance:

      • General assessment: The study provides a novel and comprehensive analysis of sex-dimorphic gene expression in macrophages, with key findings that emphasize the importance of ECM remodeling in female macrophages. The strengths include the broad dataset inclusion, rigorous quality control, and methodological rigor. However, consideration of potential confounding variables (e.g., age, strain) should be included and validation of key results would strengthen the manuscript.
      • Advance: This study advances knowledge by analyzing sex differences across multiple macrophage niches rather than focusing on a single tissue type. It extends findings from previous immune studies.
      • Audience: This report would be of interest to immunologists and researchers studying sex differences.

      Expertise: Immunology, sex differences in disease, macrophage biology, transcriptomics, and inflammation research.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The study investigates sex-specific differences in macrophage gene expression across various tissue niches by analyzing both newly generated and publicly available datasets of varying quality. The key finding is the identification of three consistently differentially expressed genes (DEGs) across all macrophage niches: the Y-chromosome-encoded genes Ddx3y and Eif2s3y, and the X-chromosome-specific gene Xist. However, the number of sex-dimorphic DEGs varied significantly between macrophage niches, with female-biased genes showing more consistency across datasets. To further explore these sex-specific differences, the authors performed an overrepresentation analysis of the DEGs across datasets. They found enriched gene sets associated with specific biological terms in female-biased macrophages from peritoneal macrophages, bone marrow-derived macrophages (BMDMs), and osteoclast progenitors (OCPs), while male-biased enrichment was observed in microglia, exudate macrophages, OCPs, and BMDMs. Notably, extracellular matrix (ECM)-related genes were specifically enriched in female peritoneal macrophages and OCPs, whereas the term "nucleic acid binding" was more prominent in male samples from microglia, BMDMs, and OCPs, driven by the Y-chromosome genes Uty and Kdm5d. A gene set enrichment analysis (GSEA) using Gene Ontology (GO) and Reactome databases further confirmed the enrichment of sex-biased pathways. Based on these findings, the authors conclude that three sex chromosome-associated genes are consistently differentially expressed across all datasets and that female-associated gene expression appears to be more stable, particularly in relation to ECM-associated processes.

      Major Comments:

      Are the key conclusions convincing?

      The study provides valuable insights into sex-dimorphic gene expression in macrophages across different niches. However, some conclusions appear overinterpreted due to the limited number of differentially expressed genes (DEGs) driving specific terms in the overrepresentation analysis. The reliance on only a few recurring genes (e.g., Kdm5d, Eif2s3y, Uty, and Ddx3y) raises concerns about the biological significance of some enriched terms. A clearer discussion on the limitations of such findings is necessary.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? Some claims, particularly those regarding the role of macrophages in diseases such as AD, histiocytosis, and osteoporosis, lack relevant references.

      Would additional experiments be essential to support the claims of the paper?

      While additional wet-lab experiments are not strictly necessary, a deconvolution analysis of the datasets could be highly beneficial. This would allow the identification of enriched macrophage subtypes and help assess whether differences between datasets are driven by specific macrophage populations rather than global sex differences. Since peritoneal macrophage origin is influenced by age and inflammation status, deconvolution could also clarify dataset comparability.

      Are the suggested experiments realistic in terms of time and resources?

      Performing cell-type deconvolution using established computational tools (e.g., CIBERSORT, BisqueRNA, or single-cell deconvolution methods) would be a realistic approach within a few weeks and would significantly strengthen the study. This analysis would not require additional experimental work but could refine the interpretation of the dataset. Additionally, a PCA of all datasets could help identify potential similarities among macrophages from different niches and between sexes.

      Are the data and the methods presented in such a way that they can be reproduced?

      Some methodological details are missing, particularly regarding: The isolation of mouse peritoneal macrophages (details on injection and harvesting procedure needed). Quality control of isolated macrophages (How were contaminating cells excluded? Was additional validation performed beyond using the kit?) The age of mice used for bone marrow-derived macrophages (BMDMs) is not provided, which is important given that immune responses can be age-dependent.

      Are the experiments adequately replicated and statistical analysis adequate?

      The statistical analysis appears generally appropriate, but there are concerns about dataset inconsistencies that should be addressed. Some datasets were not used across all analyses, which is not clearly indicated in figures or text. This should be explicitly mentioned to avoid misleading interpretations.

      Minor Comments

      Figures are included twice in the manuscript. The use of stereotypic colors in figures (e.g., blue for male, pink for female) could be reconsidered for better readability and to avoid reinforcing gender stereotypes.

      Results - Section 1

      • Line 92: The word 'identified' may not be the most appropriate choice here, as it implies discovery rather than selection. Consider rephrasing to 'compiled' or 'gathered' to more accurately reflect the process of assembling the datasets. Additionally, the sentence structure could be refined for clarity, such as specifying that the datasets include both newly generated and publicly available data.
      • Line 95: Specify the source of exudate-derived macrophage data.
      • Figure 1/2A: The scheme overview lacks clarity-its purpose is unclear. The two identical boxes are redundant and do not provide additional insight. Consider illustrating the origins of different macrophage subtypes instead. The cutoff of >50 DEGs should be included in the schematic to improve clarity. Overrepresentation and GSEA analysis should not be illustrated multiple times across different figures-it is redundant.
      • Line 100: The mention of R software should be moved to the Methods section instead of appearing in the Results section.
      • Figure 1B-V: The current figure layout is visually cluttered. Consider plotting male and female datasets together in a single graph with different point shapes instead of separate panels for each specific niche.
      • Text-Figure alignment: The text describes male/female-specific gene expression levels first, while the figure starts with MDS analysis. The order should be consistent.
      • Figure 2C: Exudate data is missing-explain why.

      Results - Section 2

      • Line 151: Use consistent terminology-either "DEGs" or "DE genes", not both.
      • Figure 3A: The text suggests not all datasets were included in this analysis-this should be explicitly indicated in the figure.
        • Show the number of DEGs used for analysis.
      • Figure 3B: Smaller pale dots in the bubble plot are difficult to distinguish-consider using a darker outline.
      • Line 158: The term "phagocytosis" appears inconsistent with the figure, where it is labeled "phagocytosis, recognition".
      • Figure 4B, D, E: The overrepresentation analysis is based on very few genes (often only 1-2 genes per term), which may lead to overinterpretation.
      • Consider explicitly naming these genes and discussing their biological role instead of assigning terms based on minimal evidence.
      • Figures S3G and S3H seem to be switched.

      Results - Section 3

      • Figure 5A does not add significant new insights. Consider refining its content to highlight key findings more effectively.
        • Number of genes included in the analysis is not provided-this is important to assess significance and should be stated in methods and figure legends.

      Discussion

      • Line 201-203: Missing reference.
      • Reference 23 (1999) is outdated. Newer literature should be cited to reflect modern insights into sex differences in macrophages.
      • Peritoneal macrophages and OCPs originate from monocytes. Would deconvolution help identify enriched subtypes and assess dataset comparability?
      • The 'more consistent' pathways found for female datasets are not discussed.

      Methods

      • Peritoneal macrophage isolation:
        • Details on injection and harvesting are missing.
        • How was contamination from other cell types assessed? F4/80 selection may not be fully macrophage-specific, and contamination could occur due to insufficient washing or the presence of non-macrophage F4/80+ cells.
      • Bone marrow macrophages:
        • Mouse age is not provided in the results part.

      Figure Legends

      • Figure 2: Peritoneal macrophages are abbreviated as PeriMac-consider using this abbreviation consistently in the text.

      Significance

      The study's strengths include the integration of multiple datasets, the use of both overrepresentation and GSEA, and the exploration of tissue-specific macrophage niches. These findings have relevance for diverse communities, including immunologists, sex-difference researchers, and those studying macrophage-driven diseases such as osteoporosis, neurodegeneration, and chronic inflammation. The work provides a foundation for further studies on sex-specific macrophage biology and may have implications for sex-specific therapeutic strategies.

      However, the study has limitations. The conclusions regarding enriched pathways rely heavily on a small number of DEGs, raising concerns about overinterpretation. Additionally, dataset variability and missing data for some analyses (e.g., exudate macrophages) could affect the robustness of the results.

      Despite these limitations, the study makes a meaningful but incremental advance by highlighting stable sex-dimorphic patterns in macrophage biology. It provides insights for both fundamental and translational research, particularly for audiences focused on immune regulation, sex-specific gene expression, and tissue-specific macrophage function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper by McGill and colleagues explores sex differences in murine macrophages from different niches. They use a combination of publicly available, and newly developed datasets, and combine these using meta-analysis approaches. They explore DEGs between sexes - both common across niches, and specific to certain niches - and use enrichment analyses to identify pathways linked to these genes. Their overall conclusions are that gene expression changes in females are more consistent across niches, than for males, and are enriched in extracellular matrix-related genes. The paper is easy to follow and very well written.

      Major Comments:

      1. I would suggest Figure 1 be moved to a supplemental figure.
      2. Line 106 - It should be clarified why 50 DEGs was selected as the cut off for exclusion.
      3. Optional - would suggest sex chromosome-linked genes are excluded and the analysis redone to see if there are other autosomal genes that are statistically shadowed by the X and Y linked genes.
      4. More metadata about the included studies should be included eg mouse ages, strains, experimental manipulations etc. I can't seem to access all of the supplementary tables so this may already be included in Table S1.
      5. How relevant the findings in mice are for humans should be explained further in the discussion.

      Minor Comments:

      1. Lines 63-66 - need references here.
      2. Line 61 and 69 - repeated.

      Significance

      Although this study is primarily descriptive, it adds to the current knowledge about sex differences in macrophages, an important and relatively understudied area. Those interested in sex differences and in the innate immune system generally, plus those who study macrophages in any context, should be interested in this work.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'The authors do not wish to provide a response at this time.'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript from Craig et al., (2023) leverages a previously reported atoh1a reporter to drive expression of lifeact-egfp in Merkel cells (MC) to assess MC morphology during both scale development and regeneration, in the optically tractable zebrafish. Using a combination of live-imaging approaches and genetic perturbations, the authors show that MCs arise from a more immature population of dendritic Merkel cells (dMC) and that dMCs themselves derive from basal keratinocytes. The authors show that following injury, dMCs are the major cell type to infiltrate the regenerating scale region, with MCs becoming the predominant cell type at later stages of regeneration (presumably as the dMCs mature). The authors present evidence suggesting that dMCs are molecularly similar to both keratinocytes and MCs and argue that dMCs may represent an intermediate cell type. Data in the manuscript suggests MC and dMC protrusions are differently polarized, and that MC and dMC dynamics are also different. The authors provide direct evidence that dMCs mature into MCs morphologically and suggest that the reverse may also occur. Finally, the authors show that MC microvilli morphology is impaired in eda-/- mutants, suggesting a role for eda in the normal morphology of MCs, more specifically in the trunk.

      Major comments:

      1. The discovery and characterization of dMCs in this study relies entirely on their labeling by an atoh1a-lifeact transgenic reporter. Given the striking similarity of dMCs to melanocytes, it is important to confirm the atoh1a reporter labels dMCs and MCs specifically, and not melanocytes. For example, it would be useful to see confirmation of cell type by double labelling of dMCs, e.g. with atoh1a:lifeact-egfp together with an antibody for atoh1a or preferably, another MC/dMC marker. dMCs look morphologically similar to melanocytes, which also display many of the behaviors noted in this manuscript. According to RNA-seq data (see https://hair-gel.net/), atoh1 is expressed in melanocytes in embryonic mouse skin and hair follicle stem cell precursors in post-natal skin. We recommend that the authors mine a similar dataset for zebrafish to ascertain whether atho1a is also expressed in pigment cells (e.g. https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?&acc=GSE190115). We would also recommend that the authors run a stain for a melanocyte marker such as Mitf/Tyr/Dct to show this is not expressed in dMCs.
      2. A major conclusion of the paper is that dMCs display molecular properties that overlap with both MCs and basal keratinocytes based on expression of three markers. I feel this conclusion is a little strong given the evidence presented; global transcriptomic analysis of these cells (RNA-seq) would better define where along a differentiation trajectory dMCs lie.
      3. More data regarding the function of the dMC intermediate cell type would greatly strengthen the significance of the study. The characterization of dMCs forms the core of the report, yet little is shown/discussed regarding the function of this cell population. For example, why is this intermediary even required? Presumably this is to facilitate the migration of MCs from the basal layer into the upper strata and their dispersion upon arrival. In this case, one could argue that the morphology of the dMC is directly related to its migratory function, as the authors suggest dMCs arise from basal keratinocytes, then migrate upwards towards the more superficial strata, where mature MCs are located. However, very little evidence in support of this upward migration is presented - most of the migratory data are related to lateral movement. Experiments to alter the migratory properties of dMCs, for example using inhibitors of Arp2/3, would address whether migration is the key function of dMCs. Finally, there is insufficient evidence to suggest contact-inhibition is occurring, and in the cell division movie 5, it doesn't appear to happen (or the movie isn't long enough to show it). More examples are required or this observation should be reworded accordingly.
      4. Eda is shown to be important for MC morphology, especially in MCs located in the trunk. More discussion of how eda may function would be helpful to the reader. For example, in what cells are Eda and Edar expressed? Do the authors think Edar signaling is cell autonomous within the MCs? Or does the loss of Eda indirectly affect MC morphology as a result of impaired scale formation? Additionally, the authors state that corneal MCs in both WT and eda-/- have similar microvilli morphologies. The figure, however, shows that corneal MCs from these genotypes do look different, with eda-/- corneal MCs having a more evenly distributed microvilli than the polarized microvilli of their WT counterparts. The metric '% of MCs with microvilli' does not capture this aspect of their morphology.
      5. In several places, the number of biological replicates is unclear. A major concern is that data presented as 'number of cells' may only have been collated from n=1 animal. The authors should specify the number of both biological and technical replicates per experiment and consider displaying the data in superplots. Where stats are undertaken, particularly on percentages, it should be made clear whether the stats test was perfomed on raw numbers or the % (particularly true for Chi square). Examples of this issue can be found in figures 3C-H, 4F-H, 5B-C and supplemental.

      Minor comments:

      • Line 124. Why did the authors choose developmental stages 11mm and 28mm for the quantification? The images in Figure 1 show 8, 10 and 12mm but not 11mm.
      • Line 126. It is unclear what the difference is between circularity and roundness.
      • Line 645 and Fig 1I. 'Cells manually classified as MC or dMC'. Please provide further clarification on this categorization (e.g. number of protrusions/roundness value etc.)
      • Line 141 and Fig 1O. The authors comment on the mosaic nature of DsRed expression, but it seems particularly sparse in the image. Similarly, there are numerous GFP+ cells that do not express DsRed, and the ones that do are found at a distance from the DsRed+ basal keratinocytes. Further explanation is required here. For example, if MCs ultimately arise from dMCs, why are so few of them labelled? It would be useful to know the % of cre-recombination that is actually occurring (i.e. how efficient the cre driver is in keratinocytes by DsRed+/total number) to put these data in context.
      • Line 170 and 179. The authors do not comment on the possibility of de/trans-differentiation of mature MCs as an explanation of how dMCs and 'new' MCs arise on regenerating scales.
      • Line 176. Can the authors comment on how quickly the nls-Eos protein turns over? This is pertinent given it is possible that by 7 dpp all the red nls-Eos could potentially have been replaced by green nls-Eos in an 'existing' atoh1a+ cell.
      • Figure 2M-P. Both channels (green and magenta) should be shown here. Cells will express both and it is unclear from the image panel what this looks like.
      • Line 186, 200 and 206. 'regenerating dMCs' this is confusing. Perhaps reword to 'dMCs associated with regenerating scales'.
      • Line 186. Why did the authors focus on 5dpp, particularly given that at 3 dpp the proportion of dMCs:MCs is more evenly spread?
      • Figure 3A-B. An additional panel with DAPI is needed here to enable Tp63 negative nuclei to be visualized. Also, what is the cell in the top right of 3B? It has a red nucleus but is not marked by an asterisk.
      • Figure 3D-E. This data panel also needs to show a dMC that is negative for SV2.
      • Figure 4D-E and line 235. It is intuitive that dMCs will not have basal facing processes if they are already in the basal layer of keratinocytes - there simply isn't the physical space (unless they penetrate the scales/basement membrane which presumably they don't). Also, the authors need to comment on, and quantify dMC protrusions in relation to the directionality of dMC migration in the main text. This is referred to in line 762 as part of the figure legend (Fig 5) and Movie 3 legend (line 809), but this is not quantified anywhere.
      • Line 258. How do these unipolar protrusions correlate with directionality?
      • Line 287 and Figure 5G. There is insufficient evidence to conclude that MCs can revert back to dMCs, particularly given that MCs are considered post-mitotic. N=2 (cells/fish?) is not sufficient without further evidence, and the MC depicted in Figure 5G doesn't resemble a bona fide MC at the start of imaging. Suggest removing this conclusion and data or increasing n and providing further evidence.
      • Line 394. 'These protrusions extended from lateral-facing membranes and interdigitated between basal and suprabasal keratinocytes'. Did the authors specifically show this? It is not clear from the data.
      • Line 430. The reference to Merkel Cell carcinoma needs more commentary with regards to the relevance of the authors' findings.
      • Line 491. Denoise.ai was used on images as stated. Can the authors confirm that any image quantification was done on raw images prior to using the Denoise.ai function?
      • Line 528. Include details of the tp63 antibody here.

      Significance

      Overall, the data are novel and of interest to researchers in several fields, including development, skin biology and MC carcinoma. This work provides an important step forward in our understanding of how basal keratinocytes give rise to MCs in zebrafish - via a dMC intermediary cell type. The imaging presented therein is of a high quality, and the movies are beautiful; capturing the cellular behaviors very clearly. This paper does not however, comment on the molecular mechanisms regulating this transition, nor on the cellular mechanisms resulting in the altered morphology and migration of dMCs and maturation into MCs. Inclusion of data as described above in the major comments section would increase the significance and impact of this work. Notwithstanding, the observations made in this work describe, for the first time to my knowledge, a morphologically distinct cell type in zebrafish (dMCs) similar to that having been described in other vertebrates and provide the ground work for future investigation.

      Reviewer expertise: skin biology, live-imaging, zebrafish, mouse, developmental biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This work by Craig et al., defines intermediate steps in Merkel cell (MC) differentiation during development and regeneration in the zebrafish model system. Using live imaging, the authors describe a number of previously unappreciated steps that lead to the MCs differentiation from basal keratinocytes through a dendritic MC (dMC) intermediate. Live imaging of MSs' microvilli as well as dMSc show a previously unrecognized dynamics of dMSs, including the presence of long actin-based protrusions and their dynamics. The authors also carefully analyzed dMCs migration, dynamics of dMC-dMC contacts and their division. Moreover, lineage tracing identified basal keratinocytes as dMC precursors, showing that basal keratinocytes give rise to this intermediate cell population. Their marker expression analysis provides further evidence that dMCs indeed represent a transitional state between basal keratinocytes and MCs. They also look at the MCs renewal during skin regeneration and show that MCs in regenerated epidermis form predominantly de novo. Although the Eda requirement for MCs differentiation is not novel, they show that microvilli are absent in mutant cells. This adds some mechanistic insight into the MC protrusion formation. I found the study rigorous, well-controlled and their conclusions supported by the presented data. It clearly adds to our basic understanding of this important cell type. I only have a few general and minor comments.

      Major comments:

      One burning question is what controls the transition of dMCs into MCs? An obvious candidate is innervation. If the authors can demonstrate that, it would certainly take their work to another level.

      What happens to the MC regeneration in eda mutants? Is it already known? If not, it would help to address its role in the MC differentiation process.

      In their discussion they talk about directionality of MCs' protrusions in other species. Can they resolve MCs in 3D to address special orientation of their protrusions in zebrafish?

      Minor comments:

      The authors should comment on the eda expression; is it present in dMSs and MCs?

      The difference between corneal and trunk dMCs and MCs in eda mutants is striking. The authors should comment on this in their discussion. Can they speculate on the basis of these differences?

      Referees cross-commenting

      Reviewer 3 made an important point about atoh1a expression and the reporter line. I agree that the authors should confirm their atoh1a reporter indeed marks dMCs and MCs.

      Significance

      The strength of this work is the ability to follow MCs' differentiation in a live animal over time. One of its limitations is that the work is mostly descriptive. The main advance is showing that dMCs are the MCs intermediate population derived from basal keratinocytes. The study will be of an interest to sensory neuroscientists as well as those studying various aspects of skin development and regeneration.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors use confocal imaging techniques to morphologically characterize Merkel cells during their maturation process in the zebrafish skin. Using an F-actin reporter, they identify two morphologically distinct populations of atoh1a+ cells: 1) Mature Merkel cells (MCs), which had previously been described in zebrafish, and 2) a transient population sharing morphological characteristics with so called dendritic Merkel Cells (dMCs), that were described in mice and humans but not previously identified in zebrafish. It was unknown whether dMCs represent a developmentally immature MC state or a functionally distinct subpopulation of neuroendocrine cells. The authors go on to show that dMCs represent the primary atoh1a+ cell type during skin regeneration and share features of both basal keratinocytes and Merkel cells, leading them to speculate that they could be MC precursors. Confocal time lapse imaging further showed that MCs and dMCs differ in the polarity of their protrusions. In some of the lapses, dMC can be seen maturing into MCs, providing evidence that they could be precursor cells. MC to dMC reversion events are also observed, albeit less often. Finally, the authors show that loss Ectodysplasin A (Eda) signaling disrupts MC microvilli formation, identifying this pathway as a potential regulator of MC morphology.

      Major comments:

      • The authors conclude that dMCs represent an intermediate state in the MC maturation program. This is based on the observation that the percentage of dMCs decreases over time and the fact that they share characteristics of both keratinocytes and MCs. In addition, dMCs are observed to mature into MCs in time lapses. However, these findings do not completely rule out the possibility that dMCs represent a transient, functionally distinct population of MCs. The authors should discuss this possibility. Additionally, some clarifications on the data could help strengthen their conclusion:
        • Figure 1 I-K: The interpretation of the simultaneous increase of dMCs and MCs is not clear. Shouldn't the percent of dMCs be highest at 8-9mm and then go down, when MCs first start to appear?
        • Fig. 2K: These results could also mean that dMCs numbers stay the same and only MCs increase in number. Does not imply lineage as stated in line 182 where the authors say that dMCs are a transient population. Please also report the total number of dMCs.
        • Figure 5 F and G: In these time lapses, "a small subset of dMCs (n>10)" is observed to adopt MC morphology. Does this mean 10 cells, and if so, out of how many? The authors should clarify how many time lapses were taken, and quantify the percentage of dMCs undergoing this process. The same goes for the reciprocal process, MC to dMC conversion, which happens only "in rare instances (n=2)".
      • Use photoconversion of single cells to establish lineage relationship. The 2 time lapses shown are not statistically significant and the identity of MCs in these movies is solely based on morphology.
      • In the last part of the paper, the authors show that trunk dMCs and MCs adopt abnormal morphologies in the absence of Eda signaling. However, this phenotype is not seen in the corneal epidermis, which is not squamated. Since Eda mutants do not develop scales, could the altered morphology in the trunk be due to the absence of scales? If possible, the authors should inhibit Eda signaling after the formation of scales or tone down their conclusions.
      • Line 264: The authors write: 'Consistent with this notion, dMC-dMC or dMC-MC contacts resulted in lateral dMC movement away from the contact (Movie 4). Together these observations suggest that MCs are immotile, epithelial-like cells, whereas dMCs are motile, mesenchymal-like cells that undergo contact inhibition upon encountering another atoh1a+ cell'. The lateral movement of dMCs after contacting MCs needs to be quantified before it can be interpreted as contact inhibition.

      Minor comments:

      • 'Defects in the morphogenesis of actin-based protrusions are linked to a variety of diseases, including colorectal cancer and deafness'. Please provide refs.
      • Line 145: this experiment does not show motility. Just that basal keratinocytes give rise to them.
      • Line 165. Cells increase by 14dpp and do not seem to plateau at 7dpp. Please discuss.
      • Line 190. Does Figure 3A not show basal keratinocytes? Only Figure 3B is cited.
      • Figure 3: Within individual cells, is there a negative correlation between SV2 staining and tp63 staining in dMCs? Or between sphericity and tp63 staining?
      • If dMCs are immature, are they already innervated by somatosensory axons?
      • Line 284: Indeed, during our live-imaging of juvenile and regenerating adult skin, we observed a small subset of dMCs (n>10) withdraw their long protrusions, round up their cell body, and rapidly extend microvilli reminiscent of the mature "mace-like" MC morphology (Figure 5F; Movies 6,7). I do not think movie 7 shows that. If it does, please indicate which of the cells shows this behavior.

      Optional:

      Published scRNASeq of the zebrafish skin exists and I am wondering if the authors could have searched for dMC and MC genes in these data which then could be used to generate lineage tracing tools or perform a pseudotime analysis that could indicate lineage relationships.

      Significance

      The aim of the study was to test if motile, dividing dMCs are precursors of immotile, post-mitotic MCs or a functionally distinct subpopulation of neuroendocrine cells. The manuscript is largely descriptive, well written and the findings are supported by beautiful imaging. The authors performed a series of experiments that strongly support the interpretation that dMCs are immature MCs. The findings will be of interest to developmental and stem cell biologists who study cell specification and differentiation. The most direct evidence that dMCs and MCs share a lineage relationship are the observations of a few dMCs that acquire the morphology of MCs in time lapse analyses. The other results support this interpretation but are correlative and do not exclude the possibility that dMCs are a functionally distinct cell type. To substantiate their interpretation the authors could take advantage of their photoconvertible line and photoconvert individual dMCs to determine if they differentiate into MCs.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statement:

      We appreciate the reviewers for acknowledging the impact of our work to the field of neurodegeneration and motor neuron diseases as well as for the understanding of the biology and function of VAPB itself; “the idea of assaying the function of ALS-causing VAPB mutants in iPSC-derived neurons is great and would be a great asset to the field” (Reviewer 1) “The new iPSC-derived system to study VAPB mutations in human motor neurons is significant and has led the authors to discover new functions for VAPB (i.e., mitochondria-ER contacts).” (Reviewer 2). The main concern raised by both reviewers is that the doxycycline inducible VAPB iPSC lines may not fully recapitulate the physiological environment found in patients, as patients are heterozygous for the VAPB P56S mutation, and our lines had VAPB under the control of an exogenous doxycycline inducible promoter. While we maintain that the doxycycline inducible lines do provide their own substantial benefits to the interrogation of the ALS pathogenesis, namely the opportunity to identify mutant VAPB interactors compared to wild type VAPB interactors through proteomics, as well as to identify pathogenesis associated to mutant VAPB without the confounding effects of wild type VAPB, we do agree with both reviewers that the inclusion of heterozygous patient iPSC lines would increase the significance of our study. Thus, in this revised manuscript we have included iPSC patient lines harboring the VAPB P56S mutation which we reprogrammed in our lab and to uphold the highest standards in the stem cell field we also performed CRISPR mediated genomic editing to generate the isogenic corrected pair. In our assessment of the ALS patient iPSC-derived motor neurons, we have already observed the same mitochondria and translation dysfunction previously described in our work with the doxycycline-inducible VAPB P56S mutant iPSC lines. Most importantly, these phenotypes were also similarly rescued by the integrated stress response inhibitor (ISRIB). Collectively, these findings suggest that the proposed mechanism initially identified in doxycycline-inducible iPSC-derived motor neurons is preserved in ALS patient iPSC-derived motor neurons.

      Reviewer #1 Major Point 1. The method of knocking out and selecting an inducible line in problematic. VAPB is an essential gene-patients with P56S are always heterozygotes, since nonfunctional VAPB is embryonic lethal. Selecting a knockout cell line is already choosing a parent that is very far from physiological, and the reexpression of P56S VAPB as the sole form also is not a good a model for understanding the contributions of P56S to disease. This approach is unusual, as it seems to overlook the advantages of working with iPSCs and patient-derived neurons. Unfortunately, the value of this amazing and rare system is diminished by the design of the selection method.

      *Reviewer #2 Major Point 1. Why did the authors decide to make VAPB knockouts and then introduce the WT or P56S VAPB constructs on a lentivirus instead of generating the point mutations (or correcting them) directly in the endogenous locus? Data in Extended Fig. 1c and Extended Fig. 2a indicate significant differences in either the kinetics of WT vs. P56S VAPB expression (1c) or levels (2a). It seems important to be able to compare comparable levels of WT and mutant proteins, especially for the interpretation of the subsequent IP-MS experiments to identify PTP151. The authors may wish to consider generating (or obtaining) isogenic lines harboring the mutations at the endogenous locus so that equal levels of expression of WT and mutant VAPB can be assessed. *

      Carried Out Revisions

      The development of the inducible system for VAPB was specifically designed to enable a systematic investigation of the effects of mutant VAPB (VAPB P56S) on cellular homeostasis while minimizing confounding influences from the wild-type (WT) protein. Additionally, this system allowed us to assess VAPB P56S binding partners and compare them to those of VAPB WT, which would not have been feasible in the context of heterozygous ALS8 patient cells.

      In response to Reviewer 2’s concern regarding differences in VAPB WT and VAPB P56S expression levels, we utilized ALS8 patient cells and familial controls to calibrate the doxycycline dose response. This approach allowed us to precisely adjust VAPB protein levels in the inducible system to match those observed in ALS8 patient and familial control iPSCs. As a result, the inducible VAPB P56S iPSCs recapitulate the VAPB expression levels found in ALS8 patient iPSCs, whereas the inducible VAPB WT iPSCs mimic the levels present in familial control iPSCs. Furthermore, the differential expression of VAPB between ALS8 patient and control cells is well documented in the literature (Mitne-Neto, et al., 2011)

      Nonetheless, we acknowledge the significance of studying ALS patient-derived iPSCs. To address this, we obtained fibroblasts from an ALS8 patient carrying the heterozygous VAPB P56S mutation, originating from a genetic background distinct from the cells used in our inducible system. These fibroblasts were reprogrammed into iPSCs in our laboratory, followed by CRISPR/Cas9-mediated genome editing to generate isogenic corrected iPSCs as controls.

      The resulting iPSC isogenic pair was differentiated into motor neurons following the protocol described in our manuscript. Notably, ALS8 patient iPSC-derived motor neurons exhibited reduced mRNA translation, as assessed by the SUnSET assay (Fig. 6A), along with a decrease in mitochondrial membrane potential, as determined using the JC-1 assay (Fig. 6B). These findings confirm that the hypotranslation and mitochondrial dysfunction initially identified in VAPB P56S doxycycline-inducible iPSC-derived motor neurons were successfully recapitulated in ALS8 patient iPSC-derived motor neurons. Furthermore, ISRIB treatment effectively rescued these phenotypic defects.

      Overall, these results demonstrate that the molecular and cellular abnormalities identified in the original inducible system can be reliably reproduced in an ALS patient-derived model with a different genetic background, thereby reinforcing the significance and broader applicability of our findings.

      Currently, we are investigating the electrophysiological properties of ALS8 patient iPSC-derived motor neurons compared to the isogenic control using the multi-electrode array (MEA) system. If a reduction in electrophysiological activity is observed, consistent with our initial findings in doxycycline-inducible VAPB P56S iPSC-derived motor neurons, we plan to treat the heterozygous patient-derived cultures with ISRIB on day 45 of differentiation. This will allow us to determine whether neuronal firing deficits in the heterozygous patient-derived motor neurons can be rescued.

      All other concerns have been addressed in this revision.

      Citation:

      1. Mitne-Neto M, Machado-Costa M, Marchetto MC, Bengtson MH, Joazeiro CA, Tsuda H, Bellen HJ, Silva HC, Oliveira AS, Lazar M et al (2011) Downregulation of VAPB expression in motor neurons derived from induced pluripotent stem cells of ALS8 patients. Hum Mol Genet 20: 3642-3652 Reviewer #1 Major Point 2. The interactome analysis is not controlled properly to interpret. It is not the total amount of VAPB that needs to be used as the normalization control, since it is already known 90+% of the P56S VAPB is in cytoplasmic aggregates. The authors need to normalize to the amount of VAPB that made it to the contact sites-a much smaller amount in the cells expressing the diseased form. For example, the fact that the authors can still pull down detectable PTPIP51 in Fig. 2e actually argues for the opposite conclusion than what the authors have stated-if the authors have actually expressed just P56S in a true knock out condition, this means that P56S CAN still bind to PTPIP51 (and possibly even better than WT, as several previous papers have suggested-since there is ~100-fold less available for binding). Without an appropriate normalization, the authors cannot make any conclusion about how to interpret the results of this part of the paper.

      Carried Out Revisions

      We sincerely thank Reviewer 1 for highlighting this critical point. Previous studies have demonstrated that the VAPB P56S mutation increases its binding affinity for PTPIP51; however, it has been proposed that the overall reduction in VAPB levels in cells harboring the P56S mutation leads to a decrease in ER-mitochondrial contacts despite the enhanced affinity (De Vos et al., 2012).

      To address this, we have repeated the co-immunoprecipitation experiment and normalized the data to VAPB levels. Consistent with Reviewer 1’s hypothesis, when normalized to soluble VAPB, we observe an increased affinity of VAPB P56S for PTPIP51. However, the total amount of PTPIP51 co-immunoprecipitated with VAPB remains significantly lower in the mutant compared to WT, likely due to the overall reduced levels of soluble VAPB P56S. This finding aligns with both Reviewer 1’s comment and the previous observations reported by De Vos et al. (2012).

      Figure 2E has been updated to reflect the normalized co-immunoprecipitation data.

      Citation:

      1. De Vos, K. J. et al. VAPB interacts with the mitochondrial protein PTPIP51 to regulate calcium homeostasis. Hum Mol Genet 21, 1299-1311, doi:10.1093/hmg/ddr559 (2012). *Reviewer #1 Major Point 3. The electron microscopy data is not interpretable in this form. The authors have provided no data at all on how analysis was performed, how contact sites were defined, how samples were collected and ensured to be representative, blinding that was performed, how sources of bias were accounted for, etc. It is clear even from what little is shown that the authors are not focused on what matters to address their own questions. For example, apart from the P56S Day 35 example, none of the "contact sites" selected for the figure are even possible to be mediated by VAPB, since the distance between the ER and the mitochondria is too far for the maximum tethering distance of VAPB-PTPIP51. Since the authors have neglected to include scale bars in their zooms, the reader cannot be sure of the distance, but it is clearly in excess of 50 nm since there are obviously visible ribosomes between the two organelles. Additionally, the authors provide no information on what "% mitochondria in contact with ER" means (By organelle? By unit surface area? Is the data grouped by cell or all comes from a single cell? How do you account for contact sites vs. proximity by crowding? Etc.). *

      2. *

      Carried Out Revisions

      We thank Reviewer 1 for their insightful comments on the analysis of the electron microscopy (EM) data and recognize the need for greater clarity in describing our quantification approach. To address this, we have revised the Electron Microscopy section of the Methods to explicitly detail our methodology for quantifying ER-mitochondria-associated membranes (ER-MAMs), as follows:

      "A series of images at various magnifications were provided, and data were collected from unique images at the highest magnification for each condition: D35 WT (13 unique images), D35 P56S (21 unique images), D60 WT (13 unique images), and D60 P56S (18 unique images). All images for a given condition originated from a single well of a 12 mm Snapwell™ Insert with 0.4 µm Pore Polyester Membranes (Corning). No information on cell grouping or sampling strategy was supplied with the images; therefore, we treated the dataset as a random sampling of the culture. Images were blinded, and quantification was performed using FIJI. Mitochondria were identified based on the presence of cristae and a double membrane. The mitochondrial perimeter was traced using the free-draw tool, and the length of ER membranes within 50 nm of this perimeter was quantified. The final measurement represents the percentage of each mitochondrion’s perimeter in contact with the ER, aggregating all visually distinct ER-MAMs, as continuity beyond the imaging plane cannot be determined (Cosson et al., 2012; Csordás et al., 2010; Stoica et al., 2014). Each data point on the graph corresponds to a single mitochondrion, with data collected from multiple cells across the unique images provided by the Core, originating from a single biological replicate."

      Regarding the quantification of ER-MAM distances, VAPB has not been definitively localized exclusively to either the rough or smooth ER. To ensure comprehensive analysis, we quantified ER-MAMs without restricting our assessment to a specific ER subdomain. We adopted a 50 nm threshold as the maximum distance for defining ER-MAMs, a well-established criterion that Reviewer 1 also referenced.

      Furthermore, we disagree with Reviewer 1’s assertion that the presence of ribosomes should justify extending the ER-MAM threshold beyond 50 nm. Ribosomes in human cells have a well-documented diameter of 20–30 nm (Anger et al., 2013), which does not support the claim that an observed ribosome within the contact site necessitates a redefinition of the ER-MAM boundary.

      We stand by our methodological approach and have updated the manuscript to ensure precision and clarity in our EM data analysis.

      Citations:

      1. Cosson, P., Marchetti, A., Ravazzola, M. & Orci, L. Mitofusin-2 independent juxtaposition of endoplasmic reticulum and mitochondria: an ultrastructural study. PLoS One 7, e46293 (2012).
      2. Csordás, G. et al. Imaging interorganelle contacts and local calcium dynamics at the ER-mitochondrial interface. Mol Cell 39, 121-132 (2010).
      3. Stoica, R. et al. ER–mitochondria associations are regulated by the VAPB–PTPIP51 interaction and are disrupted by ALS/FTD-associated TDP-43. Nat Commun 5, 3996 (2014).
      4. Anger AM, Armache JP, Berninghausen O, Habeck M, Subklewe M, Wilson DN, Beckmann R. Structures of the human and Drosophila 80S ribosome. Nature. 2013 May 2;497(7447):80-5. doi: 10.1038/nature12104. PMID: 23636399. We would like to thank the Editor of Review Commons for clarifying Reviewer #1’s Major Point 4 and will be responding to the Editor’s interpretations as detailed in the Editorial Note.

      Reviewer #1 Major Point 4. The strange pooling of data without explanation, unusual sample sizes, and lack of clarity about statistical testing, false hypothesis testing, and really any clear rigor in statistics of any kind make it impossible for a reader to have any confidence in the results presented here. The fact that every experiment in the paper has just enough n to trigger statistical significance as determined by the authors raises some concerns, suggesting potential biases. The reliability of these conclusions is questionable, especially if the authors were blinded to the identity of their own samples. This is particularly relevant for the EM data, where the determination of contact sites appears to have been made subjectively.

      Reviewer #1: "The strange pooling of data without explanation"

      • *

      - When looking into the figures and their captions in more detail, we could also not understand the nature of the replicates and how the data was aggregated or “pooled”. In Figure 1, the stated number of replicates is "N=8 separate wells”. It is unclear whether these are 8 wells from a single dissociation/replating procedure (the procedure is described in Materials & Methods as follows: "Motor neurons were dissociated on day 25 of differentiation and re-plated onto 48-well MEA plate") or whether the eight are sampled across multiple plates across cultures obtained from independent dissociations procedures.

      • We apologize for the lack of clarity and specificity. We have updated the Multi-Electrode Array Recordings portion of the Methods Section with the following: “iPSC-derived MNs from a single well of a 6-well plate thawed as day 15 MNP were dissociated and plated across 8 wells of the MEA plate. Each point on the graph is an average of the weighted mean firing rate of those 8 wells, normalized for cell count across genotypes, obtained after all firings were recorded by dissociating 2 wells per line, counting and averaging the cell numbers, and then normalizing all firings by the ratio of cell number between WT and P56S. Wells with no firing detected were excluded from quantification.”

      - In Figure 3, the number of replicates is "N=13-21 images”. Here, it is unclear whether these images come from the same or independent samples, how many quantifications were performed per image, and how many images per sample were used.

      • We have updated the Electron Microscopy Methods Section with the following: “We were provided with a series of images and magnifications and were able to gather data from unique images at the highest magnification level for each of the following categories: D35 WT: 13 unique images, D35 P56S: 21 unique images, D60 WT 13 unique images, D60 P56S: 18 unique images. All images for a given line come from a single well of a 12 mm Snapwell™ Insert with 0.4 µm Pore Polyester Membranes (Corning). No indication of cell grouping or sampling techniques was provided with the images, therefore the images were quantified as a random sampling of the culture. *Images were then blinded, and FIJI was used to quantify.” *

      We are happy to make all images publicly available.

      *- We also note that replicates are not mentioned in the proteomics analysis. *

      • We apologize for missing this and thank the editor for mentioning it. The Proteomics portion of the methods section has been updated with the following: “The identification of VAPB binding partners via mass spectrometry was performed with one biological sample, while the validation of VAPB-PTPIP51 binding via co-immunoprecipitation and Western Blot was performed with three separate biological replicates.”

      Reviewer #1: “unusual sample sizes”:

      • *

      - The wording is indeed not very explicit, but we believe it is reasonable to assume that this point refers to "N=13-21 images” and that it is not clear how the data were pooled. The reviewer makes the related point: "Is the data grouped by cell or all comes from a single cell?", which provides further context to this point.

      • We thank the editor for this clarification, our response to Reviewer #1 Major Point 3 details the updates to Electron Microscopy section of the Methods and covers this. All images were provided to us by the Case Western Reserve University Electron Microscopy Core based on the number of quality images their team were able to obtain from our samples. Reviewer #1: “lack of clarity about statistical testing”:

      • *

      - We agree that without a clear description of the nature of the replicates, the statistical analysis is unclear.

      • We hope with the updated clarity on the description of the nature of the replicates as detailed above, the nature of the statistical analysis is clearer. In addition, we have added a Statistical Analysis subsection in the Methods Section. Reviewer #1: "The reliability of these conclusions is questionable, especially if the authors were blinded to the identity of their own samples.”:

      • *

      - This is a typo; the word “not” is missing. It should read: "if the authors were NOT blinded to the identity…” and refers to concerns raised by the reviewers about evaluating the EM images.

      • We apologize for this omission, each unique image was blinded after we received them from the core, and then quantification was performed on the blinded images. The Electron Microscopy portion of the methods section has been updated to include: “We were provided with a series of images and magnifications and were able to gather data from unique images at the highest magnification level for each of the following categories: D35 WT: 13 unique images, D35 P56S: 21 unique images, D60 WT 13 unique images, D60 P56S: 18 unique images. All images for a given line come from a single well of a 12 mm Snapwell™ Insert with 0.4 µm Pore Polyester Membranes (Corning). No indication of cell grouping or sampling techniques was provided with the images, therefore the images were quantified as a random sampling of the culture. Images were then blinded, and FIJI was used to quantify.”

      Reviewer #1: “The figures suggest a lack of appropriate blinding, with cherry-picking evident even in the ‘representative’ images'”

      • *

      - We agree the wording is somewhat problematic. However, we also feel that there is a discrepancy between the differences highlighted between the EM images shown in Fig 3A and a rather modest change of the median by only a few percent, as shown in the respective violin plots. We agree with the reviewer that the images of Fig 3A might, therefore, not be “representative” of the quantified changes.

      • We appreciate the editor's clarification and have selected images that more accurately represent the subtle changes in ER-MAMs observed in our quantification. These images have been included in Figure EV6 and referenced accordingly in the manuscript to ensure a balanced depiction of our findings. Additionally, we are prepared to make all images publicly available. We would like to again thank the editor for their clarification on Reviewer #1’s Major Point 4 as well as their agreement on the inappropriate nature of some of Reviewer #1’s comments.

      *Reviewer#1 Minor points: 1. It is not accurate to describe Day 60 neurons as "aged" in the context of P56S-induced disease or imply they are a model for human aging. I could be mistaking, as I am not an iPSC expert, but I believe the field uses these terms in the context of iPSC-derived neurons to mean something more akin to "mature". The authors try to invoke this to argue for the relevance of their results to patient disease, unless the authors know this is somehow actually representative of neurons from older patients, I think this is misleading. *

      Carried Out Revisions

      We apologize for any confusion. Our use of the term "aged" was intended solely as a relative descriptor, indicating that day 60 motor neurons had been maintained in culture for a longer duration than day 35 motor neurons. It was not meant to suggest that these neurons represent a specific age or disease state, but rather that they had been cultured for an extended period.

      Furthermore, we use the term "mature" specifically in the context of motor neuron differentiation to indicate the expression of motor neuron-specific markers, which occurs by day 25 of differentiation. To avoid ambiguity, we have revised the manuscript to use the term "culture time" instead, ensuring clarity in our terminology.

      *Reviewer #1 Minor Point 2. The JC-1 experiment is not being appropriately controlled. These results are predicted by increased cell or mitochondrial death even if the membrane potentials are identical. The authors need to control for apoptotic signaling if they want to make this conclusion. There is an accepted standard in the mitochondrial field for assaying mitochondrial membrane potential (generally using TMRE or TMRM, but JC-1 can be used with proper controls), but it requires lots of careful controls not performed here (normalization to oligomycin- and FCCP-treated cells as a bare minimum. *

      Carried Out Revisions

      We would like to thank Reviewer 1 for this comment. We apologize for the omission, and we did treat the cells with CCCP provided in the JC-1 kit as a positive control. The JC-1 subsection of the methods has been updated to reflect this with the following: “A separate aliquot of cell suspension was also incubated with 1 uL of the supplied 50mM CCCP for 15 min prior to JC-1 dye addition, to act as a positive control and ensure the JC-1 dye was correctly detecting low MMP populations.”

      • The flow cytometry experiments are problematic in general since the authors state that part of their incentive for studying mitochondria in this model is due to effects at synapses, and the sample preparation for the cytometer involved dissociating the cells (i.e.-removing all of the processes where synapses mostly reside). *

      Carried Out Revisions

      We thank Reviewer #1 for this comment. Our citation of the study by Gómez-Suaga et al. (2019) was not intended to suggest that our investigation focuses exclusively on mitochondria at synapses but rather to provide context on the current understanding of the field. To clarify this point, we have revised the manuscript to include the following statement: "It has also been shown that this interaction can occur at synapses, and disruptions to it may impact synaptic activity (Gómez-Suaga et al., 2019)."

      Citation:

      Gómez-Suaga, P. et al. The VAPB-PTPIP51 endoplasmic reticulum-mitochondria tethering proteins are present in neuronal synapses and regulate synaptic activity. Acta Neuropathologica Communications 7, 35, doi:10.1186/s40478-019-0688-4 (2019).

      • The normalization for VAPB in the inducible lines is unclear-how is normalization performed simultaneously to two genes at once? The authors do not provide enough information for us to understand what they have actually done, and I wonder if the data presented in the supplement on this is actually sufficiently different from random noise to be interpretable, since no statistics of any kind are given.*

      In response, we have added a qPCR section to the Methods, detailing our experimental approach as follows:

      "Quantitative PCR: RNA was extracted using TRIzol Reagent (Thermo Fisher), and the procedure was performed according to their provided protocol. cDNA was generated using SuperScript™ IV VILO™ Master Mix (Thermo Fisher), following the manufacturer’s instructions. qPCR was conducted using PowerTrack™ SYBR Green Master Mix for qPCR (Thermo Fisher), following the provided protocol, on a BioRad CFX96 thermocycler. Samples were run in triplicate. Quantification was performed using CFX Maestro software (BioRad). VAPB expression was normalized to Neomycin and RPL3 using the software, and the resultant expression values were graphed along with the provided SEM, per standards in the field (Livak & Schmittgen, 2001; Wong & Medrano, 2005)."

      Additionally, we have modified the graph to more clearly illustrate the comparison between VAPB WT and P56S, emphasizing that there is no significant difference in mRNA expression.

      Citations

      1. Wong, M. L. & Medrano, J. F. Real-time PCR for mRNA quantitation. Biotechniques 39, 75-85 (2005).
      2. Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) Method. Methods 25, 402-408 (2001).

      3. I don't think the tunicamycin experiments make sense in this context. The authors start with premise that I do not understand: "if the decrease in MERC was underlying the decrease in MMP seen later in differentiation, inducing cell stress early in differentiation could mimic the decreased MMP." Most cell stress pathways enhance ER-mito contact, not decrease it, so I am not sure why they expected this to work this way. They then continue: "We selected tunicamycin, an ER stressor, as VAPB is an ER protein, and if the decreased MMP could be caused, at least partially, by loss of MERCs, ER stress would likely exacerbate it." I don't understand this either- Tunicamycin is not a general ER-stressing agent-it is a specific inhibitor of some N-linked glycosylation-maturation pathways in the ER lumen, which causes ER stress by dysregulation of misfolded protein pathways. Since VAPB has no luminal domains to speak of, is not known to interact with the protein folding and maturation machinery at all, and Tunicamycin has no obvious connection I'm aware of to MERCs, I am not able to follow the authors' intentions or conclusions here. I suspect this needs a major rewrite to explain what the goals were and how the authors controlled for their findings. *

      Carried Out Revisions

      We thank Reviewer 1 for this insightful comment. To provide greater clarity on this point, we have revised the manuscript to include the following statement:

      "MAMs are known to be a hot spot for the transfer of stress signals from the ER to mitochondria (van Vliet & Agostinis, 2018). Consequently, to test whether we could induce mitochondrial dysfunction by exposing iPSC-derived motor neurons to stressors, we selected tunicamycin (TM), an ER stressor, as VAPB is an ER protein, and if the decreased MMP could be caused, at least partially, by loss of ER-MAM, ER stress would likely exacerbate it."

      This revision aims to more clearly articulate the rationale behind our approach and the selection of tunicamycin as an ER stressor.

      Citations

      1. van Vliet AR, Agostinis P (2018) Mitochondria-Associated Membranes and ER Stress. Curr Top Microbiol Immunol 414: 73-102 Minor Adjustments Not in Response to Reviewer Comments

      Several minor adjustments have been made in response to internal reviews and feedback, independent of any specific Reviewer comment. The only modification affecting the presented data resulted from a comment noting a minor discrepancy in the gating of green-fluorescing cells between VAPB WT and VAPB P56S on Day 30 (Figure 3C). To ensure consistency, the gating was redrawn and applied uniformly to both plots, leading to a slight change in values. However, the overall difference remains non-significant, and our interpretation of the data remains unchanged. Additionally, to facilitate visual comparison, the Y-axes of the quantification graphs in Figures 3C and 3D have been standardized, though the data in Figure 3D itself was not modified—only the Y-axis scaling was adjusted.

      Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      We have responded to both of Reviewer #2’s Major Points 2 and 3 together, as the answer applies to both questions and the points raised in each idea.

      • *

      *Reviewer #2 Major Point 2. The authors highlight PTP151 binding to VAPB as a way to promote mitochondria ER contacts (MERC). They provide evidence that this association is diminished by the P56S VAPB mutation. This raises an important question. How does PTPIP51 binding connect with other phenotypes, such as the neuronal firing and ER stress sensitivity? Can the authors consider experiments to test this directly? For example, is there a way to drive PTP151 : VAPB interactions even in the face of mutant VAPB and see if this rescues the MERC defects and other phenotypes? *

      Reviewer #2 Major Point 3. The authors propose that the detachment of the mitochondria from the ER most likely be the cause for why their mutant motor neurons are more sensitive to ER stressors. Along the lines of the above, is there a way to test this hypothesis directly? Can they use other means to promote ER mitochondria association even in the face of VAPB mutation and test if this rescues phenotypes?

      Analyses We Prefer Not or Are Unable to Carry Out

      We thank Reviewer 2 for these insightful suggestions and fully agree that enhancing PTPIP51:VAPB interactions in the presence of mutant VAPB would be an effective approach to directly demonstrate that the loss of this interaction is the causative event underlying the observed phenotypes or to drive increased ER-mitochondria attachment.

      However, we have not identified a method to achieve this without introducing substantial alterations to the model system, which would likely compromise the interpretability of the results. The most promising approach we considered was the use of rapamycin-inducible linkers, as described by Csordás et al. (2010), which facilitate ER-mitochondria tethering upon rapamycin addition. Unfortunately, rapamycin directly affects translational regulation via mTOR (mammalian target of rapamycin) and given that translation dysregulation is a key phenotype in our study, its addition could influence multiple pathways, making it difficult to attribute any observed effects specifically to the intended manipulation.

      If the reviewers or editors have suggestions for alternative approaches, we would greatly appreciate their input. However, based on the current state of the field, we do not believe there is a method to selectively drive ER-mitochondria attachment or specifically enhance VAPB-PTPIP51 interactions without introducing confounding factors that would obscure whether the resulting effects are due to VAPB P56S pathophysiology or the intervention itself.

      Citation:

      1. Csordás G, Várnai P, Golenár T, Roy S, Purkins G, Schneider TG, Balla T, Hajnóczky G. Imaging interorganelle contacts and local calcium dynamics at the ER-mitochondrial interface. Mol Cell. 2010 Jul 9;39(1):121-32. doi: 10.1016/j.molcel.2010.06.029. PMID: 20603080; PMCID: PMC3178184.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Mutations in the VAPB gene are a cause of amyotrophic lateral sclerosis (ALS), a human motor neuron disease. To define the mechanisms by which mutations in VAPB cause motor neuron degeneration, the authors establish a new human iPSC-derived motor neuron model. They start by using CRISPR to knockout the VAPB gene and then introduce a lentivirus encoding a doxycycline-inducible construct to express WT or mutant VAPB. They then phenotypically characterize these WT and mutant motor neurons including using multi-electrode array (MEA), which revealed neuronal firing deficits in mutant motor neurons. They performed protein interaction studies WT vs mutant VAPB motor neuron and identified decreased binding to PTPIP51 in the mutant VAPB motor neurons.

      Phenotypically, the authors report that the VAPB mutant motor neurons exhibit decreased mitochondria / ER contacts (MERC) in mutant motor neurons compared to WT as well as decreased mitochondrial membrane potential. They report that these mitochondrial defects lead to heightened sensitivity to ER stress and activation of the integrated stress response, which could be rescued by treatment with ISRIB. Importantly, the neuronal firing defects are also rescued by ISRIB, providing compelling evidence that these defects are tied to activation of ER stress. Overall, this paper presents novel functional analyses of an important ALS gene, VAPB in disease-relevant cell types (human motor neurons). I have the following comments and suggestions for the authors to consider.

      1. Why did the authors decide to make VAPB knockouts and then introduce the WT or P56S VAPB constructs on a lentivirus instead of generating the point mutations (or correcting them) directly in the endogenous locus? Data in Extended Fig. 1c and Extended Fig. 2a indicate significant differences in either the kinetics of WT vs. P56S VAPB expression (1c) or levels (2a). It seems important to be able to compare comparable levels of WT and mutant proteins, especially for the interpretation of the subsequent IP-MS experiments to identify PTP151. The authors may wish to consider generating (or obtaining) isogenic lines harboring the mutations at the endogenous locus so that equal levels of expression of WT and mutant VAPB can be assessed.

      2. The authors highlight PTP151 binding to VAPB as a way to promote mitochondria ER contacts (MERC). They provide evidence that this association is diminished by the P56S VAPB mutation. This raises an important question. How does PTPIP51 binding connect with other phenotypes, such as the neuronal firing and ER stress sensitivity? Can the authors consider experiments to test this directly? For example, is there a way to drive PTP151 : VAPB interactions even in the face of mutant VAPB and see if this rescues the MERC defects and other phenotypes?

      3. The authors propose that the detachment of the mitochondria from the ER most likely be the cause for why their mutant motor neurons are more sensitive to ER stressors. Along the lines of the above, is there a way to test this hypothesis directly? Can they use other means to promote ER mitochondria association even in the face of VAPB mutation and test if this rescues phenotypes?

      Referee Cross-commenting

      There seems to be concurrence between Reviewer 1 and 2 about the interest in the VAPB gene but that the specific approaches and analyses methods used to study mutations in this gene (knockout and then over expression of WT and mutant version) are not a faithful representation of the in vivo situation (heterozygous mutations) and both provide suggestions for improvement of the study design.

      Editorial Note

      This Editorial Note by the Review Commons editorial team was communicated to the author in response to their request for clarification and contextualization of the referee report of reviewer #1.

      Since reviewer #1 did not clarify what was requested by the editorial office, we included the present Editorial Note in the review process after re-analyzing the manuscript in detail again and the referee report of reviewer #1.

      We agree with the authors that the wording used by reviewer #1 is problematic. However, we also see that the substance of the points raised by this reviewer is relevant and affects the study's conclusions. Below, we have included our comments on the individual points and quotes highlighted in your letter.

      Reviewer #1: "The strange pooling of data without explanation"

      • When looking into the figures and their captions in more detail, we could also not understand the nature of the replicates and how the data was aggregated or "pooled". In Figure 1, the stated number of replicates is "N=8 separate wells". It is unclear whether these are 8 wells from a single dissociation/replating procedure (the procedure is described in Materials & Methods as follows: "Motor neurons were dissociated on day 25 of differentiation and re-plated onto 48-well MEA plate") or whether the eight are sampled across multiple plates across cultures obtained from independent dissociations procedures.

      • In Figure 3, the number of replicates is "N=13-21 images". Here, it is unclear whether these images come from the same or independent samples, how many quantifications were performed per image, and how many images per sample were used.

      • We also note that replicates are not mentioned in the proteomics analysis.

      Reviewer #1: "unusual sample sizes":

      • The wording is indeed not very explicit, but we believe it is reasonable to assume that this point refers to "N=13-21 images" and that it is not clear how the data were pooled. The reviewer makes the related point: "Is the data grouped by cell or all comes from a single cell?", which provides further context to this point.

      "lack of clarity about statistical testing":

      • We agree that without a clear description of the nature of the replicates, the statistical analysis is unclear.

      "false hypothesis testing":

      • We agree with the authors that the reviewer is unclear.

      "The fact that every experiment in the paper has just enough n to trigger statistical significance as determined by the authors raises some concerns, suggesting potential biases."

      • We agree that this is an inappropriate statement in absence of evidence or detailed argumentation; we very much regret not having caught this statement up front.

      "The reliability of these conclusions is questionable, especially if the authors were blinded to the identity of their own samples.":

      • This is a typo; the word "not" is missing. It should read: "if the authors were NOT blinded to the identity..." and refers to concerns raised by the reviewers about evaluating the EM images.

      "The figures suggest a lack of appropriate blinding, with cherry-picking evident even in the 'representative' images'"

      • We agree the wording is somewhat problematic. However, we also feel that there is a discrepancy between the differences highlighted between the EM images shown in Fig 3A and a rather modest change of the median by only a few percent, as shown in the respective violin plots. We agree with the reviewer that the images of Fig 3A might, therefore, not be "representative" of the quantified changes.

      We agree that there are statements in this review that are written in a style and tone that is not appropriate. We greatly apologize for this and, we should have caught these issues beforehand.

      At the same time, this reviewer raises significant issues about the study. In this case, we cannot eliminate the entire review since the points raised are relevant to the conclusiveness of the study.

      Significance

      The new iPSC-derived system to study VAPB mutations in human motor neurons is significant and has led the authors to discover new functions for VAPB (i.e., mitochondria-ER contacts). The significance and impact of the study, in my opinion, would be increased if the authors considered using motor neuron lines expressing comparable levels of WT and mutant VAPB, preferably from the endogenous location under physiological conditions. Their discovery of a role of defective mitochondria-ER contact as making VAPB mutant motor neurons more sensitive to ER stress would be bolstered by experiments to directly test this hypothesis by rescuing the contact defects.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Landry et al. present characterization of iPSC-derived neurons that inducibly express either WT VAPB or P56S VAPB in the context of a VAPB knockout. They do this by first generating a novel iPSC line with a frameshift knockout in a VAPB, and then selecting lentiviral-transduced clones that express either WT or P56S VAPB from an inducible promoter. The resulting lines are then differentiated using conventional protocols, VAPB expression is induced, and the cells are subjected to a battery of cell biological tests to examine mitochondrial function.

      Major Points:

      1. The method of knocking out and selecting an inducible line in problematic. VAPB is an essential gene-patients with P56S are always heterozygotes, since nonfunctional VAPB is embryonic lethal. Selecting a knockout cell line is already choosing a parent that is very far from physiological, and the reexpression of P56S VAPB as the sole form also is not a good a model for understanding the contributions of P56S to disease. This approach is unusual, as it seems to overlook the advantages of working with iPSCs and patient-derived neurons. Unfortunately, the value of this amazing and rare system is diminished by the design of the selection method.
      2. The interactome analysis is not controlled properly to interpret. It is not the total amount of VAPB that needs to be used as the normalization control, since it is already known 90+% of the P56S VAPB is in cytoplasmic aggregates. The authors need to normalize to the amount of VAPB that made it to the contact sites-a much smaller amount in the cells expressing the diseased form. For example, the fact that the authors can still pull down detectable PTPIP51 in Fig. 2e actually argues for the opposite conclusion than what the authors have stated-if the authors have actually expressed just P56S in a true knock out condition, this means that P56S CAN still bind to PTPIP51 (and possibly even better than WT, as several previous papers have suggested-since there is ~100-fold less available for binding). Without an appropriate normalization, the authors cannot make any conclusion about how to interpret the results of this part of the paper.
      3. The electron microscopy data is not interpretable in this form. The authors have provided no data at all on how analysis was performed, how contact sites were defined, how samples were collected and ensured to be representative, blinding that was performed, how sources of bias were accounted for, etc. It is clear even from what little is shown that the authors are not focused on what matters to address their own questions. For example, apart from the P56S Day 35 example, none of the "contact sites" selected for the figure are even possible to be mediated by VAPB, since the distance between the ER and the mitochondria is too far for the maximum tethering distance of VAPB-PTPIP51. Since the authors have neglected to include scale bars in their zooms, the reader cannot be sure of the distance, but it is clearly in excess of 50 nm since there are obviously visible ribosomes between the two organelles. Additionally, the authors provide no information on what "% mitochondria in contact with ER" means (By organelle? By unit surface area? Is the data grouped by cell or all comes from a single cell? How do you account for contact sites vs. proximity by crowding? Etc.).
      4. The strange pooling of data without explanation, unusual sample sizes, and lack of clarity about statistical testing, false hypothesis testing, and really any clear rigor in statistics of any kind make it impossible for a reader to have any confidence in the results presented here. The fact that every experiment in the paper has just enough n to trigger statistical significance as determined by the authors raises some concerns, suggesting potential biases. The reliability of these conclusions is questionable, especially if the authors were blinded to the identity of their own samples. This is particularly relevant for the EM data, where the determination of contact sites appears to have been made subjectively.

      Minor points:

      1. It is not accurate to describe Day 60 neurons as "aged" in the context of P56S-induced disease or imply they are a model for human aging. I could be mistaking, as I am not an iPSC expert, but I believe the field uses these terms in the context of iPSC-derived neurons to mean something more akin to "mature". The authors try to invoke this to argue for the relevance of their results to patient disease, unless the authors know this is somehow actually representative of neurons from older patients, I think this is misleading.
      2. The JC-1 experiment is not being appropriately controlled. These results are predicted by increased cell or mitochondrial death even if the membrane potentials are identical. The authors need to control for apoptotic signaling if they want to make this conclusion. There is an accepted standard in the mitochondrial field for assaying mitochondrial membrane potential (generally using TMRE or TMRM, but JC-1 can be used with proper controls), but it requires lots of careful controls not performed here (normalization to oligomycin- and FCCP-treated cells as a bare minimum.
      3. The flow cytometry experiments are problematic in general since the authors state that part of their incentive for studying mitochondria in this model is due to effects at synapses, and the sample preparation for the cytometer involved dissociating the cells (i.e.-removing all of the processes where synapses mostly reside).
      4. The normalization for VAPB in the inducible lines is unclear-how is normalization performed simultaneously to two genes at once? The authors do not provide enough information for us to understand what they have actually done, and I wonder if the data presented in the supplement on this is actually sufficiently different from random noise to be interpretable, since no statistics of any kind are given.
      5. I don't think the tunicamycin experiments make sense in this context. The authors start with premise that I do not understand: "if the decrease in MERC was underlying the decrease in MMP seen later in differentiation, inducing cell stress early in differentiation could mimic the decreased MMP." Most cell stress pathways enhance ER-mito contact, not decrease it, so I am not sure why they expected this to work this way. They then continue: "We selected tunicamycin, an ER stressor, as VAPB is an ER protein, and if the decreased MMP could be caused, at least partially, by loss of MERCs, ER stress would likely exacerbate it." I don't understand this either- Tunicamycin is not a general ER-stressing agent-it is a specific inhibitor of some N-linked glycosylation-maturation pathways in the ER lumen, which causes ER stress by dysregulation of misfolded protein pathways. Since VAPB has no luminal domains to speak of, is not known to interact with the protein folding and maturation machinery at all, and Tunicamycin has no obvious connection I'm aware of to MERCs, I am not able to follow the authors' intentions or conclusions here. I suspect this needs a major rewrite to explain what the goals were and how the authors controlled for their findings.

      Significance

      While the idea of assaying the function of ALS-causing VAPB mutants in iPSC-derived neurons is great and would be a great asset to the field, the execution here raises significant concerns. It is difficult to draw clear conclusions from the presented data. Necessary controls are either incorrectly applied or missing, the methods section lacks crucial details for reproducibility, and the figures suggest a lack of appropriate blinding, with cherry-picking evident even in the "representative" images. There are also major issues with the entire premise of how the lines were generated, since VAPB knockout cells are highly aberrant lines, the authors have likely selected for all sorts of mitochondrial pathways that would not be operating in an actual patient neuron.

      Claims about mitochondrial dysfunction could potentially mislead the field, as such conclusions do not seem to be supported by the actual data. To be suitable for publication, the study needs substantial revisions, including proper controls, blinding, and detailed methodological information for reproducibility. I understand the challenges and costs associated with using iPSC-derived neurons, but focusing on a few well-controlled experiments would be far more beneficial than presenting numerous, less interpretable findings.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02713

      Corresponding author(s): Igor, Kramnik

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      Dear Editors,

      We are grateful for constructive reviewers’ comments and criticisms and have thoroughly addressed all major and minor comments in the revised manuscript.

      Summary of new data.

      We have performed the following additional experiments to support our concept:

      1. The kinetcs of ROS production in B6 and B6.Sst1S macrophages after TNF stimulation (Fig. ____3I and J, Suppl. Fig. 3G)____;
      2. __ Time course of stress kinase activation (_Fig.3K)_ that clearly demonstrated the persistent stress kinase (phospho-ASK1 and phospho-cJUN) activation exclusively in. the B6.Sst1S macrophages;__
      3. New Fig.4 C – E panels include comparisons of the B6 and B6.Sst1S macrophage responses to TNF and effects of IFNAR1 blockade in both backgrounds.
      4. We performed new experiments demonstrating that the synthesis of lipid peroxidation products (LPO) occurs in TNF-stimulated macrophages earlier than the IFNβ super-induction (__Suppl.Fig.____4A and B). __
      5. We demonstrated that the IFNAR1 blockade 12, 24 and 32 h after TNF stimulation still reduced the accumulation of LPO product (4-HNE) in TNF-stimulated B6.Sst1S BMDMs (Suppl.Fig.4 E – G).
      6. We added comparison of cMyc expression between the wild type B6 and B6.Sst1S BMDMs during TNF stimulation for 6 – 24 h (Fig.__5I–J). __
      7. New data comparing 4-HNE levels in Mtb-infected B6 wild type and B6.Sst1S macrophages and quantification of replicating Mtb was added (Fig.____6B, Suppl.Fig.7C and D).
      8. In vivo data described in Fig.7 was thoroughly revised and new data was included. We demonstrated increased 4-HNE loads in multibacillary lesions (Fig.7A, Suppl. Fig.9A) and the 4-HNE accumulation in CD11b+ myeloid cells (Fig.7B __and __Suppl.Fig.9B). We demonstrated that the Ifnb – expressing cells are activated iNOS+ macrophages (Fig.7D and Suppl.Fig.13A). Using new fluorescent multiplex IHC, we have shown that stress markers phopho-cJun and Chac1 in TB lesions are expressed by Ifnb- and iNOS-expressing macrophages (Fig.7E and Suppl.Fig.13D – F).
      9. We performed additional experiment to demonstrate that naïve (non-BCG vaccinated) lymphocytes did not improve Mtb control by Mtb-infected macrophages in agreement with previously published data (Suppl.Fig.7H). Summary of updates

      Following reviewers requests we updated figures to include isotype control antibodies, effects of inhibitors on non-stimulated cells, positive and negative controls for labile iron pool, additional images of 4-HNE and live/dead cell staining.

      Isotype control for IFNAR1 blockade were included in Fig.3M, Fig.4C -E, Fig.6L-M

      Suppl.Fig.4F -G, 7I.

      Positive and negative controls for labile iron pool measurements were added to Fig.3E, Fig.5D, Suppl.Fig.3B

      Cell death staining images were added Suppl.Fig.3H

      Co-staining of 4-HNE with tubulin was added to Suppl.Fig.3A.

      High magnification images for Figure 7 __were added in __Suppl.Fig.8 to demonstrate paucibacillary and multibacillary image classification.

      Single-channel color images for individual markers were provided in Fig.____7E and Suppl.Fig.13B–F.

      Inhibitor effects on non-stimulated cells were included in Fig.____5 D – H, Suppl.Fig.6A and B.

      Titration of CSF1R inhibitors for non-toxic concentration determination are included in Suppl.Fig.6D.

      In addition, we updated the figure legends in the revised manuscript to include more details about the experiments. We also clarified our conclusions in the Discussion.

      Responses to every major and minor comment of the reviewers are provided below.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary

      The study by Yabaji et al. examines macrophage phenotypes B6.Sst1S mice, a mouse strain with increased susceptibility to M. tuberculosis infection that develops necrotic lung lesions. Extending previous work, the authors specifically focus on delineating the molecular mechanisms driving aberrant oxidative stress in TNF-activated B6.Sst1S macrophages that has been associated with impaired control of M. tuberculosis. The authors use scRNAseq of bone marrow-derived macrophages to further characterize distinctions between B6.Sst1S and control macrophages and ascribe distinct trajectories upon TNF stimulation. Combined with results using inhibitory antibodies and small molecule inhibitors in in vitro experimentation, the authors propose that TNF-induced protracted c-Myc expression in B6.Sst1S macrophages disables the cellular defense against oxidative stress, which promotes intracellular accumulation of lipid peroxidation products, fueled at least in part by overexpression of type I IFNs by these cells. Using lung tissue sections from M. tuberculosis-infected B6.Sst1S mice, the authors suggest that the presence of a greater number of cells with lipid peroxidation products in lung lesions with high counts of stained M. tuberculosis are indicative of progressive loss of host control due to the TNF-induced dysregulation of macrophage responses to oxidative stress. In patients with active tuberculosis disease, the authors suggest that peripheral blood gene expression indicative of increased Myc activity was associated with treatment failure.

      __Major comments __ The authors describe differences in protein expression, phosphorylation or binding when referring to Fig 2A-C, 2G, 3D, 5B, 5C. However, such differences are not easily apparent or very subtle and, in some cases, confounded by differences in resting cells (e.g. pASK1 Fig 3L; c-Myc Fig 5B) as well as analyses across separate gels/blots (e.g. Fig 3K, Fig 5B). Quantitative analyses across different independent experiments with adequate statistical analyses are required to strengthen the associated conclusions.

      Author: We updated our Western blots as follows: 1. Densitometery of normalized bands is included above each lane (Fig.2A – C; Fig.3C – D and 3K; Fig.4A – B; Fig.5B,C,I,J). New data in Fig.3K is added to highlight differences between B6 and B6.Sst1S at individual timepoints after TNF stimulation. In Fig.5I we added new data comparing Myc levels in B6 and B6.Sst1S with and without JNK inhibitor and updated the results accordingly. New Fig.3K clearly demonstrates the persistent activation of p-cJun and p-Ask1 at 24 and 36h of TNF stimulation. In Fig.5B we clearly demonstrate that Myc levels were higher in B6.Sst1S after 12 h of TNF stimulation. At 6h, however, the basal differences in Myc levels are consistently higher in B6.Sst1S and the induction by TNF is 1.6-fold similar in both backgrounds. We noted this in the text.

      A representative experiment is shown in individual panels and the corresponding figure legend contains information on number of biological repeats. Each Western blot was repeated 2 – 4 times.

      The representative images of fluorescence microscopy in Fig 3H, 4H, 5H, S3C, S3I, S5A, S6A seem to suggest that under some conditions the fluorescence signal is located just around the nucleus rather than absent or diminished from the cytoplasm. It is unclear whether this reflects selective translocation of targets across the cell, morphological changes of macrophages in culture in response to the various treatments, or variations in focal point at which images were acquired. Control images (e.g. cellular actin, DIC) should be included for clarification. If cell morphology changes depending on treatments, how was this accounted for in the quantitative analyses? In addition, negative controls validating specificity of fluorescence signals would be warranted.

      Author: Our conclusion of higher LPO production is based on several parameters: 4-HNE staining, measurements of MDA in cell lysates and oxidized lipids using BODIPY C11. Taken together they demonstrate significant and reproducible increase in LPO accumulation in TNF-stimulated B6.Sst1S macrophages. This excludes imaging artefact related to unequal 4-HNE distribution noted by the reviewer. In fact, we also noted that the 4-HNE was spread within cell body of B6.Sst1S macrophages and confirmed it using co-staining with tubulin, as suggested by the reviewer (new Suppl.Fig.3A). Since low molecular weight LPO products, such as MDA and 4-HNE, traverse cell membranes, it is unlikely that they will be strictly localized to a specific membrane bound compartment. However, we agree that at lower concentrations, there might be some restricted localization, explaining a visible perinuclear ring of 4-HNE staining in B6 macrophages. This phenomenon may be explained just by thicker cytoplasm surrounding nucleus in activated macrophages spread on adherent plastic surface or by proximity to specific organelles involved in generation or clearance of LPO products and definitively warrants further investigation.

      We also included images of non-stimulated cells in Fig.3H, Suppl.Fig.3A and 3E. We used multiple fields for imaging and quantified fluorescence signals (Suppl. Fig.3D and 3F, Suppl.Fig.4G, Suppl.Fig.6A and B).

      We used negative controls without primary antibodies for the initial staining optimization, but did not include it in every experiment.

      To interpret the evaluation on the hierarchy of molecular mechanisms in B6.Sst1S macrophages, comparative analyses with B6 control cells should be included (e.g. Fig 4C-I, Fig 5, Fig 6B, E-M, S6C, S6E-F). This will provide weight to the conclusions that the dysregulated processes are specifically associated with the susceptibility of B6.Sst1S macrophages.

      Author: Understanding the sst1-mediated effects on macrophage activation is the focus of our previously published studies Bhattacharya et al., JCI, 2021) and this manuscript. The data comparing B6 and B6.Sst1S macrophage are presented in Fig.1, Fig.2, Fig.3, Fig.4, Fig.5A – C, I and J, Fig.6A – C, 6J and corresponding supplemental figures 1, 2, 3, 4A and B, Suppl.Fig.5, Suppl.Fig.6C, Suppl.Fig.7A-D,7F.

      Once we identified the aberrantly activated pathways in the B6.Sst1S, we used specific inhibitors to correct the aberrant response in B6.Sst1S.

      All experiments using inhibitory antibodies require comparison to the effect of a matched isotype control in the same experiment (e.g. Fig 3J, 4F, G, I; 6L, 6M, S3G, S6F).

      Author: Isotype control for IFNAR1 blockade were included in Fig.3M, Fig.4C -E, Fig.6L-M

      Suppl.Fig.4F -G, 7I.

      Experiments using inhibitors require inclusion of an inhibitor-only control to assess inhibitor effects on unstimulated cells (e.g. Fig 4I, 5D-I)

      Author: Inhibitor effects on non-stimulated cells were included in Fig.5 D – H, Suppl.Fig.6A and B.

      Fig 3K and Fig 5J appear to contain the same images for p-c-Jun and b-tubulin blots.

      Author: Fig.3K and 5J partially overlapped but had different focus – 3K has been updated to reflect the time course of stress kinase activation. Fig.5J is updated (currently Fig.5I and J) to display B6 and B6.Sst1S macrophage data including cMyc and p-cJun levels.

      Data of TNF-treated cells in Fig 3I appear to be replotted in Fig 3J.

      Author: Currently these data is presented in Fig.3L and 3M and has been updated to include comparison of B6 and B6.Sst1S cells (Fig.3L) and effects of inhibitors in Fig.3M.

      Rev.1: It is stated that lungs from 2 mice with paucibacillary and 2 mice with multi-bacillary lesions were analyses. There is contradicting information on whether these tissues were collected at the same time post infection (week 14?) or whether the pauci-bacillary lesions were in lungs collected at earlier time points post infection (see Fig S8A). If the former, how do the authors conclude that multi-bacillary lesions are a progression from paucibacillary lesions and indicative of loss of M. tuberculosis control, especially if only one lesion type is observed in an individual host? If the latter, comparison between lesions will likely be dominated by temporal differences in the immune response to infection. In either case, it is relevant to consider density, location, and cellular composition of lesions (see also comments on GeoMx spatial profiling). Is the macrophage number/density per tissue area comparable between pauci-bacillary and multi-bacillary lesions?

      Author: We did not collect lungs at the same time point. As described in greater detail in our preprints (Yabaji et al., https://doi.org/10.1101/2025.02.28.640830 and https://doi.org/10.1101/2023.10.17.562695) pulmonary TB lesions in our model of slow TB progression are heterogeneous between the animals at the same timepoint, as observed in human TB patients and other chronic TB animal models. Therefore, we perform analyses of individual TB lesions that are classified by a certified veterinary pathologist in a blinded manner based on their morphology (H&E) and acid fast staining of the bacteria, as depicted in Suppl.Fig.8. Currently it is impossible to monitor progression of individual lesions in mice. However, in mice TB is progressive disease and no healing and recovery from the disease have been observed in our studies or reported in literature. Therefore, we assumed that paucibacillary lesions preceded the multibacillary ones, and not vice versa, thus reflecting the disease progression. In our opinion, this conclusion most likely reflects the natural course of the disease. However, we edited the text : instead of disease progression we refer to paucibacillary and multibacillary lesions.

      Rev1: Does 4HNE staining align with macrophages and if so, is it elevated compared to control mice and driven by TNF in the susceptible vs more resistant mice?

      Author: We performed additional staining and analyses to demonstrate the 4-HNE accumulation in CD11b+ myeloid cells of macrophage morphology. Non-necrotic lesions contain negligible proportion of neutrophils (Fig.7B, Suppl.Fig.9B). B6 mice do not develop advanced multibacillary TB lesions containing 4-HNE+ cells. Also, 4-HNE staining was localized to TB lesions and was not found in uninvolved lung areas of the infected mice, as shown in Suppl.Fig.9A (left panel).

      It is well established that TNF plays a central role in the formation and maintenance of TB granulomas in humans and in all animal models. Therefore, TNF neutralization would lead to rapid TB progression, rapid Mtb growth and lesions destruction in both B6 and B6.Sst1S genetic backgrounds.

      Pathway analysis of spatial transcriptomic data (Suppl.Fig.11) identified TNF signaling via NF-kB among dominant pathways upregulated in multibacillary lesions, suggesting that the 4-HNE accumulation paralleled increased TNF signaling. In addition, in vivo other cytokines, including IFN-I, could activate macrophages and stimulate production of reactive oxygen and nitrogen species and lead to the accumulation of LPO products as shown in this manuscript.

      Rev.1: It would be relevant to state how many independent lesions per host were sampled in both the multiplex IHC as well as the GeoMx data. Can the authors show the selected regions of interest in the tissue overview and in the analyses to appreciate within-host and across-host heterogeneity of lesions. The nature of the spatial transcriptomics platform used is such that the data are derived from tissue areas that contain more than just Iba1+ macrophages. At later stages of infection, the cellular composition of such macrophage-rich areas will be different when compared to lesions earlier in the infection process. Hence, gene expression profiles and differences between tissue regions cannot be attributed to macrophages in this tissue region but are more likely a reflection of a mix of cellular composition and per-cell gene expression.

      Author: We used Iba1 staining to identify macrophages in TB lesions and programmed GeoMx instrument to collect spatial transcriptomics probes from Iba1+ cells within ROIs. Also, we selected regions of interest (ROI) avoiding necrotic areas (depicted in Suppl.Fig.10). We agree that Iba1+ macrophage population is heterogenous – some Iba1+ cells are activated iNOS+ macrophages, other are iNOS-negative (Fig.7C and D, and Suppl.Fig.13A). Multibacillary lesions contain larger areas occupied by activated (iNOS+) macrophages (Fig.7D, Suppl.Fig.13B and 13F). Although the GeoMx spatial transcriptomic platform does not provide single cell resolution, it allowed us to compare populations of Iba1+ cells in paucibacillary and multibacillary TB lesions and to identify a shift in their overall activation pattern.

      It is stated that loss of control of M. tuberculosis in multibacillary lesions was associated with "downregulation of IFNg-inducible genes". If the authors base this on the tissue expression of individual genes, this requires further investigation to support such conclusion (also see comment on GeoMx above). Furthermore, how might this conclusion be compatible with significantly elevated iNOS+ cells (Fig 7D) in multibacillary lesions?

      Author: We demonstrated that Ciita gene expression is specifically induced by IFN-gamma and is suppressed by IFN-I (Fig.6M). The expression of Ciita in paucibacillary lesions suggest the presence of the IFN-gamma activated cells and its disappearance in the multibacillary lesion is consistent with massive activation of IFN-I pathway (Fig.7C).

      Rev1. It is appreciated that the human blood signature analyses contain Myc-signatures but the association with treatment failure is not very strong based on the data in Fig 13B and C (Suppl.Fig.15B and C now). The authors indicate that they have no information on disease severity, but it should perhaps not be assumed that treatment failure is indicative of poor host control of the infection. Perhaps independent analyses in separate cohort/data set can add strength and provide -additional insights (e.g. PMID: 35841871; PMID: 32451443, PMID: 17205474, PMID: 22872737). In addition, the human data analyses could be strengthened by extension to additional signatures such as IFN, TNF, oxidative stress. Details of the human study design are not very clear and are lacking patient demographics, site of disease, time of blood collection relative to treatment onset, approving ethics committees.

      Author: X axis of Suppl.Fig.15A represent pre-defined molecular signature gene sets (MSigDB) in Gene Set Enrichment Analysis (GSEA) database (https://www.gsea-msigdb.org/gsea/msigdb). On Y axis is area under curve (AUC) score for each gene set. The Myc upregulated gene set myc_up was identified among top gene sets associated with treatment failure using unbiased ssGSEA algorithm. The upregulation of Myc pathway in the blood transcriptome associated with TB treatment failure most likely reflects greater proportion of immature cells in peripheral blood, possibly due to increased myelopoiesis.

      Pathway analysis of the differentially expressed genes revealed that treatment failures were associated with the following pathways relevant to this study: NF-kB Signaling, Flt3 Signaling in Hematopoietic Progenitor Cells (indicative of common myeloid progenitor cell proliferation), SAPK/JNK Signaling and Senescence (indicative of oxidative stress). The upregulation of these pathways in human patients with poor TB treatment outcomes correlates with our findings in TB susceptible mice. The detailed analysis of differentially regulated pathways in human TB patients is beyond the scope of this study and is presented in another manuscript entitled “ Tuberculosis risk signatures and differential gene expression predict individuals who fail treatment” by Arthur VanValkenburg et al., submitted for publication.

      Blood collection for PBMC gene expression profiling of TB patients was prior to TB treatment or within a first week of treatment commencement. Boxplot of bootstrapped ssGSEA enrichment AUC scores from several oncogene signatures ranked from lowest to highest AUC score, with myc_up and myc_dn genes highlighted in red.

      We agree with the reviewer that not every gene in the myc_up gene set correlates with the treatment outcome. But the association of the gene set is statistically significant, as presented in Suppl.Fig.15B – C.

      We updated the details of the study, including study sites and the ethics committee approval statement and references describing these cohorts. __ Other comments__

      It is excellent that the authors provide individual data points. Choosing a colour other than black would increase clarity when black bars are used.

      Author: We followed this useful suggestion and selected consistent color codes for B6 and B6.Sst1S groups to enhance clarity throughout the revised manuscript.

      Error bars are inconsistently depicted as either bi-directional or just unidirectional.

      Author: We used bi-directional error bars in the revised manuscript.

      Fig 1E, G, H- please include a scale to clarify what the heat map is representing.

      Author: We have included the expression key in Fig.1E,G and H and Suppl.Fig.1C and D in the revised version.

      Fig 2K, Fig S10A gene information cannot be deciphered.

      Author: We increased the font in previous Fig.2K and moved to supplement to keep larger fonts (current Suppl.Fig.2G).

      Fig S4A,B please add error bars.

      Author: These data are presented as Suppl.Fig.5 in the revised version. We performed one experiment to test the hypothesis. Because the data indicated no clear increase in transposon small RNAs in the sst1S macrophages, we did not pursue this hypothesis further, and therefore, the error bars were not included. However, we decided to include these negative data because it rejects a very attractive and plausible hypothesis.

      Please use gene names as per convention (e.g. Ifnb1) to distinguish gene expression from protein expression in figures and text.

      Author: We addressed the comment in the revised manuscript.

      Fig S8B. Contrary to the description of results, there seems to be minimal overlap between the signal for YFP and the Ifnb1 probe. Is the Ifnb1 reporter mouse a legacy reporter? If so, it is worth stating this and including such considerations in the data interpretation.

      Author: The YFP reporter expresses YFP protein under the control of the Ifnb1 promoter. The YFP protein accumulates within the cells and while Ifnb protein is rapidly secreted and does not accumulate in the producing cells in appreciable amounts. So YFP is not a lineage tracing reporter, but its accumulation marks the Ifnb1 promoter activity in cells, although the YFP protein half-life is longer than that of the Ifnb1 mRNA that is rapidly degraded (Witt et al., BioRxiv, 2024; doi:10.1101/2024.08.28.61018). Therefore, there is no precise spatiotemporal coincidence of these readouts.

      Please clarify what is meant by "normal interstitium" ? If the tissue is from uninfected mice, please state clearly.

      Author: In this context we refer to the uninvolved lung areas of the infected lungs. In every sample we compare uninvolved lung areas and TB lesions of the same animal. Also, we performed staining of lung of non-infected mice as additional controls.

      Rev1: If macrophage cultures underwent media changes every 48h, how was loss of liberated Mtb taken into account especially if differences in cell density/survival were noted? The assessment of M. tuberculosis load by qPCR is not well described. In particular, the method of normalization applied within the experiments (not within the qPCR) here remains unclear, even with reference to the authors' prior publication.

      Author: Our lab has many years of experience working with macrophage monolayers infected with virulent Mtb and uses optimized protocols to avoid cell losses and related artifacts. Recently we published a detailed protocol for this methodology in STAR Protocols (Yabaji et al., 2022; PMID 35310069). In brief, it includes preparation of single cell suspensions of Mtb by filtration to remove clumps, use of low multiplicity of infection, preparation of healthy confluent monolayers and use of nutrient rich culture medium and medium change every 2 days. We also rigorously control for cell loss using whole well imaging and quantification of cell numbers and live/dead staining.

      Please add citation for the limma package.

      Author: The references has been added (Ritchie et al, NAR 2015; PMID 25605792).

      The description of methodology relating to the "oncogene signatures" is unclear.

      Author: This signature was described in Bild etal, Nature, 2006 and McQuerry JA, et al, 2019 “Pathway activity profiling of growth factor receptor network and stemness pathways differentiates metaplastic breast cancer histological subtypes”. BMC Cancer 19: 881 and is cited in Methods section Oncogene signatures

      Please clearly state time points post infection for mouse analyses.

      Author: We collected lung samples from Mtb infected mice 12 – 20 weeks post infection. The lesions were heterogeneous and were individually classified using criteria described above.

      Reference is made to "a list of genes unique to type I [interferon] genes [....]" (p29). Can the authors indicate the source of the information used for compiling this list?

      Author: The lists were compiled from Reactome, EMBL's European Bioinformatics Institute and GSEA databases. The links for all datasets are provided in Suppl.Table 8 “Expression of IFN pathway genes in Iba1+ cells from pauci- and multi-bacillary lesions of Mtb infected B6.Sst1S mouse lungs” in the “Pool IFN I & II gene sets” worksheet.

      The discussion at present is very long, contains repetition of results and meanders on occasion.

      Author: Thank you for this suggestion, We critically revised the text for brevity and clarity.

      Reviewer #1 (Significance (Required)):

      Strengths and limitations

      Strengths: multi-pronged analysis approaches for delineating molecular mechanisms of macrophage responses that might underpin susceptibility to M. tuberculosis infection; integration of mouse tissues and human blood samples

      Weaknesses: not all conclusions supported by data presented; some concerns related to experimental design and controls; links between findings in human cohort and the mechanistic insights gained in mouse macrophage model uncertain

      Author: The revised manuscript addresses every major and minor comment of the reviewers, including isotype controls and naïve T cells, to provide additional support for our conclusions. Our study revealed causal links between Myc hyperactivity with the deficiency of anti-oxidant defense and type I interferon pathway hyperactivity. We have shown that Myc hyperactivity in TNF-stimulated macrophages compromises antioxidant defense leading to autocatalytic lipid peroxidation and interferon-beta superinduction that in turn amplifies lipid peroxidation, thus, forming a vicious cycle of destructive chronic inflammation. This mechanism offers a plausible mechanistic explanation of for the association of Myc hyperactivity with poorer treatment outcomes in TB patients and provide a novel target for host-directed TB therapy.

      Advance

      The study has the potential to advance molecular understanding of the TNF-driven state of oxidative stress previously observed in B6.Sst1S macrophages and possible implications for host control of M. tuberculosis in vivo.

      Audience

      Experts seeking understanding of host factors mediating M. tuberculosis control, or failure thereof, with appreciation for the utility of the featured mouse model in assessing TB diseases progression and severe manifestation. Interest is likely extended to audience more broadly interested in TNF-driven macrophage (dys)function in infectious, inflammatory, and autoimmune pathologies.

      Reviewer expertise

      In preparing this review, I am drawing on my expertise in assessing macrophage responses and host defense mechanisms in bacterial infections (incl. virulent M. tuberculosis) through in vitro and in vivo studies. This includes but is not limited to macrophage infection and stimulation assays, microscopy, intra-macrophage replication of M. tuberculosis, analyses of lung tissues using multi-plex IHC and spatial transcriptomics (e.g. GeoMx). I am familiar with the interpretation of RNAseq analyses in human and mouse cells/tissues, but can provide only limited assessment of appropriateness of algorithms and analysis frameworks.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Yabaji et al. investigated the effects of BMDMs stimulated with TNF from both WT and B6.Sst1S mice, which have previously been identified to contain the sst1 locus conferring susceptibility to Mycobacterium tuberculosis. They identified that B6.Sst1S macrophages show a superinduction of IFNß, which might be caused by increased c-Myc expression, expanding on the mechanistic insights made by the same group (Bhattacharya et al. 2021). Furthermore, prolonged TNF stimulation led to oxidative stress, which WT BMDMs could compensate for by the activation of the antioxidant defense via NRF2. On the other hand, B6.Sst1S BMDMs lack the expression of SP110 and SP140, co-activators of NRF2, and were therefore subjected to maintained oxidative stress. Yabaji et al. could link those findings to in vivo studies by correlating the presence of stressed and aberrantly activated macrophages within granulomas to the failure of Mtb control, as well as the progression towards necrosis. As the knowledge regarding Mtb progression and necrosis of granulomas is not yet well understood, findings that might help provide novel therapy options for TB are crucial. Overall, the manuscript has interesting findings with regard to macrophage responses in Mycobacteria tuberculosis infection.

      However, in its current form there are several shortcomings, both with respect to the precision of the experiments and conclusions drawn. In particular a) important controls are often missing, e.g. T-cells form non-immune mice in Fig. 6J, in F, effectivity of BCG in B6 mice in 6N; b) single experiments are shown throughout the manuscript, in particular western blots and histology without proper quantification and statistics, this is absolutely not acceptable; c) very few repetitions are shown in in vitro experiments, where there is no evidence for limitation in resources (usually not more than 3), it is not clear what "independent experiment means" - i.e. the robustness of the findings is questionable; d) data are often normalized multiple times, e.g. in the case of qPCR, and the methods of normalization are not clear (what house-keeping gene exactly?);

      Moreover, experiments regarding IFN I signaling (e.g. short term TNF treatment of BMDMs to analyze LPO, making sure that the reporter mouse for IFNß works in vivo) and c-Myc (e.g. the increase after M-CSF addition might impact on other analysis as well and the experiments should be adjusted to control for this effect; MYC expression in the human samples) should be carefully repeated and evaluated to draw correct conclusions.

      In addition, we would like to strongly encourage the authors to more precisely outline the experimental set-ups and figure legends, so that the reader can easily understand and follow them. In other words: The legends are - in part very - incomplete. In addition, the authors should be mindful of gene names vs. protein names and italicize where appropriate.

      Author: We appreciate a very thorough evaluation of our manuscript by this reviewer. Their insightful comments helped us improve the manuscript. As outlined below in point-by-point responses 1) we added important controls including isotype control antibodies in IFNAR blocking experiments and non-vaccinated T cells in T cell – macrophage interactions experiments; updated figure legends to indicate number of repeated experiment where a representative experiment is shown, numbers of mouse lungs and individual lesions, methods of data normalization, where it was missing. We also explained our in vitro experimental design and how we analyzed and excluded effects of media change and fresh CSF1 addition, by using a rest period before TNF stimulation and Mtb infection. The data shown in Suppl. Fig. 6C (previously Suppl. Fig. 5B) demonstrate that Myc levels induced by CSF1 return to the basal level at 12 h after media change. Our detailed in vitro protocol that contains these details has been published (Yabaji et al., STAR Protocols, 2022). We added new data demonstrating the ROS and LPO production at 6h of TNF stimulation, while the Ifnb1 mRNA super-induction occurred at 16 – 18 h, and edited the text to highlight these dynamics. The upregulation of Myc pathway in human samples does not necessarily mean the upregulation of Myc itself, it could be due to the dysregulation of downstream pathways. The upregulation of Myc pathway in the blood transcriptome associated with TB treatment failure most likely reflects greater proportion of immature cells in peripheral blood, possibly due to increased myelopoiesis. The detailed analysis of this cell populations in human patients is suggested by our findings but it is beyond the scope of this study.

      The reviewer’s comments also suggested that a summary of our findings was necessary. The main focus of our study was to untangle connections between oxidative stress and Ifnb1 superinduction. It revealed that Myc hyperactivity caused partial deficiency of anti-oxidant defense leading to type I interferon pathway hyperactivity that in turn amplifies lipid peroxidation, thus establishing a vicious cycle driving inflammatory tissue damage.

      Our laboratory worked on mechanisms of TB granuloma necrosis over more than two decades using genetic, molecular and immunological analyses in vitro and in vivo. It provided mechanistic basis for independent studies in other laboratories using our mouse model and further expanding our findings, thus supporting the reproducibility and robustness of our results and our lab’s expertise.

      Specific comments to the experiments and data:

      • Fig. 1E: Evaluation of differences in up- and downregulation between B6 and B6.Sst1S cells should highlight where these cells are within the heatmap, as it is only labelled with the clusters, or it should be depicted differently (in particular for cluster 1 and 2). Furthermore, a more simple labelling of the pathways would increase the readability of the data.

      Author: For our scRNAseq data presentation, we used formats accepted by computational community. To clarify Fig.1E, we added labels above B6 and B6.Sst1S-specific clusters.

      • Fig. 2D, E: The staining legend is missing. For the quantification it is not clear what % total means. Is this based on the intensity or area? What do the dots represent in the bar chart? Is one data point pooled from several pictures? If not, the experiments need to be repeated, as three pictures might not be representative for evaluation.

      • Fig. 2E: Statistics comparing B6/ B6,SsT1S with TNF (different) is required: Absence of induction is not a proof for a difference!

      Author: We included staining with NRF2-specific antibodies and performed area quantification per field using ImageJ to calculate the NRF2 total signal intensity per field. Each dot in the graph represents the average intensity of 3 fields in a representative experiment. The experiment was repeated 3 times. We included pairwise comparison of TNF-stimulated B6 and B6.Sst1S macrophages and updated the figure legend.

      • Fig. 3E: Positive and negative control need to be depicted in the figure (see legend).

      Author: We have added the positive and negative controls for the determination of labile iron pool to the data in Fig. 3E and related Suppl. Fig. 3B and to Fig. 5D that also demonstrates labile iron determination.

      • Fig. 3I: A quantification by flow cytometry or total cell counts are important, as 6% cell death in cell culture is a very modest observation. Otherwise, confocal images of the quantification would be a good addition to judge the specificity of the viability staining.

      Author: To validate the specificity of the viability staining method, we have provided fluorescent images as Suppl.Fig.3H. The main point of this experiment was to demonstrate a modest, but reproducible, increase in cell death in the sst1-mutant macrophages that suggested an IFN-dependent oxidative damage. In our study, we did not focus on mechanisms of cell death, but on a state of chronic oxidative stress in the sst1 mutant live cells during TNF stimulation.

      • Fig. 3I, J: What does one dot represent?

      Author: We performed this assay in 96 well format and each dot represent the % cell death in an individual well.

      • Fig. 3K,L: For the B6 BMDMs it seems that p-cJun is highly increased at 12h in (L), while it is not in (K). On the other hand, for the B6.Sst1S BMDMs it peaks at 24h in (K), while in (L) it seems to at 12h. According to the data in (L) it seems that p-cJun is rather earlier and stronger activated in B6 BMDMs and has a weakened but prolonged activation in the B6.Sst1S BMDMs, which would not fit with your statement in the text that B6.Sst1S BMDMs show an upregulation. !These experiments need repetitions and quantification and statistiscs!

      Fig. 3L: ASK1 seems to be higher at 12h for the B6 BMDMs and similar for both lines at 24h, which is not fitting to the statement in the text. ("Also, the ASK1 - JNK - cJun stress kinase axis was upregulated in B6.Sst1S macrophages, as compared to B6, after 12 - 36 h of TNF stimulation")

      Author: These experiments were repeated, and new data were added to highlight differences in ASK1 and c-Jun phosphorylation between B6 and B6.Sst1S at individual timepoints after TNF stimulation (presented in new Fig.3K). It demonstrated that after TNF stimulation the activation of stress kinases ASK1 and c-Jun initially increased in both genetic backgrounds. However, their upregulation was maintained exclusively in the sst1-susceptible macrophages from 24 to 36 h of TNF stimulation, while in the resistant macrophages their upregulation was transient. Thus, during prolonged TNF stimulation, B6.Sst1S macrophages experience stress that cannot be resolved, as evidenced by this kinetic analysis. The quantification of the band intensity was added to Western blot images above individual lanes.

      Reviewer 2 pointed to missing isotype control antibodies in Fig.3 and Fig.4:

      • Figure 3J: the isotype control for the IFNAR antibody is missing

      • Figure 4E: It seems the isotype control itself has already an effect in the reduction of IFNb.

      • Fig. 4H: It seems that the Isotype control antibody had an effect to increase 4-HNE (compared to TNF stimulated only).

      Author: We always include isotype control antibodies in our experiments because antibodies are known to modulate macrophage activation via binding to Fc receptor. To address the reviewer’s comments, we updated all panels that present the effects of IFNAR1 blockade with isotype-matched non-specific control antibodies in the revised manuscript. Specifically, we included isotype control in Fig. 3M (previously Fig.3J), Fig.4I, Suppl.4E – G, Fig.6L-M), Suppl.Fig.7I (previously Suppl.Fig.6F).

      • Fig.4A - C: "IFNAR1 blockade, however, did not increase either the NRF2 and FTL protein levels, or the Fth, Ftl and Gpx1 mRNA levels above those treated with isotype control antibodies"

      Maybe not above the isotype but it is higher than the TNF alone stimulation at least for NRF2 at 8h and for Ftl at both time points. Why does the isotype already cause stimulation/induction of the cells? !These experiments need repetitions and quantification and statistics!

      Author: To determine specific effects of IFNAR blockade we compared effects of non-specific isotype control and IFNAR1-specific antibodies. In our experiments, the isotype control antibody modestly increased of Nrf2 and Ftl protein levels and the Fth and Ftl mRNA levels, but their effects were similar to the effect of IFNAR-specific antibody. The non-IFN -specific effects of antibodies, although are of potential biological significance, are modest in our model and their analysis is beyond the scope of this study.

      • Fig.4H Was the AB added also at 12h post stimulation? Figure legend should be adjusted.

      Author: The IFNAR1 blocking antibodies and isotype control antibodies were added at 2 h after TNF stimulation in Fig.4H and 4I, as described in the corresponding figure legend. The data demonstrating effects of IFNAR blockade after 12, 24,and 33h of TNF stimulation are presented in Suppl.Fig.4 E - G.

      • Figure 4I: How was the data measured here, i.e. what is depicted? The isotype control is missing. It seems a two-way ANOVA was used, yet it is stated differently. The figure legend should be revised, as Dunnett's multiple comparison would only check for significances compared to the control.

      Author: The microscopy images and bar graphs were updated to include isotype control and presented in Suppl. Fig.4E - G of the revised version. We also revised the statistical analysis to include correction for multiple comparisons.

      Figure 4C and subsequent: How exactly was the experiment done (house-keeping gene)?

      Author: We included the details in the figure legends of revised version. We quantified the gene expression by DDCt method using b-actin (for Fig. 4C-E) and 18S (For Fig. 4F and G) as internal controls.

      • Figure 4D,E: Information on cells used is missing. Why the change in stimulation time? Did it not work after 12h? Then the experiments in A-C should be repeated for 16h.

      Author: The updated Fig. 4D and E present comparison of B6 and B6.Sst1S BMDMs clearly demonstrating significant difference between these macrophages in Ifnb1 mRNA expression 16 h after TNF stimulation, in agreement with our previous publication(Bhattacharya, et al., 2021). There we studied the time course of responses of B6 and B6.Sst1S macrophages to TNF at 2h intervals and demonstrated the divergence between their activation trajectories starting at 12 h of TNF stimulation Therefore, to reveal the underlying mechanisms we focus our analyses on this critical timepoint, i.e. as close to the divergence as possible. However, the difference between the strains in Ifnb1 mRNA expression achieved significance only by 16h of TNF stimulation. That is why we have used this timepoint for the Ifnb1 and Rsad2 analyses. It clearly shows that the superinduction was not driven by the positive feedback via IFNAR, as has been shown by the Ivashkiv lab for B6 wild type macrophages previously PMID 21220349.

      • Figure 4E: It would be helpful to see if these transcripts are actually translated into protein levels, e.g. perform an ELISA. Authors state that IFNAR blockages does not alter the expression but you statistic says otherwise.

      -The data for Ifnb expression (or better protein level) should be provided for B6 BMDMs as well.

      Author: We have previously reported the differences in Ifnb protein secretion (He et al., Plos Pathogens, 2013 and Bhattacharya et al., JCI 2021). We use mRNA quantification by qRT-PCR as a more sensitive and direct measurement of the sst1-mediated phenotype. The revised Fig.4D and E include responses of B6 in addition to the B6.Sst1S to demonstrate that the IFNAR blockade does not reduce the Ifnb1 mRNA levels in TNF-stimulated B6.Sst1S mutant to the B6 wild type levels. A slight reduction can be explained by a known positive feedback loop in the IFN-I pathway (see above). In this experiment we emphasized that the effect of the sst1 locus is substantially greater, as compared to the effect of the IFNAR blockade (Fig.4D), and updated the text accordingly.

      • Fig. 4F: To what does the fold induction refer to? If it is again to unstimulated cells, then why is the induction now so much higher than in (E) where it was only 50x (now to 100x).

      • Figure 4G: Again to what is the fold induction referring to? It seems your Fer-1 treatment only contains 2 data points. This needs to be fixed.

      Author: Yes, the fold induction was calculated by normalizing mRNA levels to untreated control incubated for the same time. Regarding the variation in Ifnb1 mRNA levels - a two-fold variation is not unusual in these experiments that may result in the Ifnb1 mRNA superinduction ranging from 50 -200-fold at this timepoint (16h). The graph in Fig.4G was modified to make all datapoints more visible.

      • "These data suggest that type I IFN signaling does not initiate LPO in our model but maintains and amplifies it during prolonged TNF stimulation that, eventually, may lead to cell death". Data for a short term TNF stimulation are not shown, however, so it might impact also on the initiation of LPO.

      • The overall conclusion drawn from Fig. 3 and 4 is not really clear with regard that IFN does not initiate LPO. Where is that shown? Data on earlier stimulation time points should be added to make this clear.

      Author: We demonstrated ROS production (new Suppl.Fig.3G) and the rate of LPO biosynthesis (new Suppl.Fig.4E-F) at 6 h post TNF stimulation, while the Ifnb1 superinduction occurs between 12-18 h post TNF stimulation. This temporal separation supports our conclusion that IFN-β superinduction does not initiate LPO. We clarified it in the text:

      “Thus, Ifnb1 super-induction and IFN-I pathway hyperactivity in B6.Sst1S macrophages follow the initial LPO production, and maintain and amplify it during prolonged TNF stimulation”. (Previously: These data suggest that type I IFN signaling does not initiate LPO in our model). We also edited the conclusion in this section to explain the hierarchy of the sst1-regulated AOD and IFN-I pathways better:

      “Taken together, the above experiments allowed us to reject the hypothesis that IFN-I hyperactivity caused the sst1-dependent AOD dysregulation. In contrast, they established that the hyperactivity of the IFN-I pathway in TNF-stimulated B6.Sst1S macrophages was itself driven by the initial dysregulation of AOD and iron-mediated lipid peroxidation. During prolonged TNF stimulation, however, the IFN-I pathway was upregulated, possibly via ROS/LPO-dependent JNK activation, and acted as a potent amplifier of lipid peroxidation”.

      We believe that these additional data and explanation strengthen our conclusions drawn from Figures 3 and 4.

      • "A select set of mouse LTR-containing endogenous retroviruses (ERV's) (Jayewickreme et al, 2021), and non-retroviral LINE L1 elements were expressed at a basal level before and after TNF stimulation, but their levels in the B6.Sst1S BMDMs were similar to or lower than those seen in B6". This sentence should be revised as the differences between B6 and B6.Sst1S BMDMs seem small and are not there after 48h anymore. Are these mild changes really caused by the mutation or could they result from different housing conditions and/or slowly diverging genetically lines. How many mice were used for the analysis? Is there already heterogeneity between mice from the same line?

      Author: We agree with the reviewer that the data presented in Suppl.Fig.4 (Suppl.Fig.5 in the revised version) indicated no increase in single- and double-stranded transposon RNAs in the B6.Sst1S macrophages. The purpose of these experiment was to test the hypothesis that increased transposon expression might be responsible for triggering the superinduction of type I interferon response in TNF-stimulated B6.Sst1S macrophages. In collaboration with a transposon expert Dr. Nelson Lau (co-author of this manuscript) we demonstrated that transposon expression was not increased above the B6 level and, thus, rejected this attractive hypothesis. We explained the purpose of this experiment in the text and adequately described our findings as “the levels in the B6.Sst1S BMDMs were similar to or lower than those seen in B6”…and concluded that ” the above analyses allowed us to exclude the overexpression of persistent viral or transposon RNAs as a primary mechanism of the IFN-I pathway hyperactivity” in the sst1-mutant macrophages.

      • Fig. 5A: Indeed, it even seems that Myc is upregulated for the mutant BMDMs. Yet, there are only 2 data points for B6 12h. !These experiments need repetitions and quantification and statistics!

      Author: We observed these differences in c-Myc mRNA levels by independent methods: RNAseq and qRT-PCR. The qRT-PCR experiments were repeated 3 times. A representative experiment in Fig.5A shows 3 data points for each condition. We reformatted the panel to make all data points clearly visible.

      • Fig. 5B: Why would the protein level decrease in the controls over 6h of additional cultivation? Is this caused by fresh M-CSF? In this case maybe cells should be left to settle for one day before stimulating them to properly compare c-Myc induction. Comment on two c-Myc bands is needed. At 12h only the upper one seems increased for TNF stimulated mutant BMDMs compared to B6 BMDMs.

      Author: We agree with the reviewer’s point that cells need to be rested after media change that contains fresh CSF-1. Indeed, in Suppl.Fig.6C, we show that after media change containing 10% L929 supernatant (a source of CSF1) there is an increase in c-Myc protein levels that takes approximately 12 hours to return to baseline.

      Our protocol includes resting period of 18 – 24 h after medium change before TNF stimulation. We updated Methods to highlight this detail. Thus, the increase in c-Myc levels we observe at 12 h of TNF stimulation (Fig.5B) is induced by TNF, not the addition of growth factors, as further discussed in the text.

      The two c-Myc bands observed in Fig.5B,I and J, are similar to patterns reported in previous studies that used the same commercial antibodies (PMIDs: 24395249, 24137534, 25351955). Whether they correspond to different c-Myc isoforms or post-translational modifications is unknown.

      • Fig. 5A,B: It seems that not all the RNA is translated into protein, as c-Myc at 12h in the mutant BMDMs seems to be lower than at 6h, while the gene expression implicates it vice versa.

      Author: In addition to Fig.5B, the time course of Myc protein expression up to 24 h is presented in new panels Fig. 5I-5J. It demonstrates the gradual decrease of Myc protein levels. The observed dissociation between the mRNA and protein levels in the sst1-mutant BMDMs at 12 and 24 h is most likely due to translation inhibition as a result of the development of the integrated stress response, ISR (as shown in our previous publication by Bhattacharya et al., JCI, 2021). Translation of Myc is known to be particularly sensitive to the ISR (PMID18551192, PMID25079319, PMID28490664). Perhaps, the IFN-driven ISR may serve as a backup mechanism for Myc downregulation. We are planning to investigate these regulatory mechanisms in greater detail in the future.

      • Fig. 5J: Indeed, the inhibitor seems to cause the downregulation of the proteins. Explanation?

      Author: This experiment was repeated twice and the average normalized densitometry values are presented in the updated Fig.5J. The main question addressed in this experiment was whether hyperactivity of JNK in TNF-stimulated sst1 mutant macrophages contributed to Myc upregulation, as had been previously shown in cancer. Comparing effects of JNK inhibition on phospho-cJun and c-Myc protein levels in TNF stimulated B6.Sst1S macrophages (updated Fig.5J), we rejected the hypotghesis that JNK activity might have a major role in c-Myc upregulation in sst1 mutant macrophages.

      • "TNF stimulation tended to reduce the LPO accumulation in the B6 macrophages and to increase it in the B6.Sst1S ones" However, this is not apparent in Sup. Fig. 6B. Here it seems that there might be a significant increase.

      Author: Suppl.Fig.6B (currently Suppl.Fig.7B) shows the 4-HNE accumulation at day 3 post infection. The data obtained after 5 days of Mtb infection are shown in Fig.6A. We clarified this in the text: “By day 5 post infection, TNF stimulation induced significant LPO accumulation only in the B6.Sst1S macrophages (Fig.6A)”.

      • Fig. 6B: Mtb and 4-HNE should be shown in two different channels in order to really assign each staining correctly.

      What time point is this? Are the mycobacteria cleared at MOI1, since it looks that there are fewer than that? How does this look like for the B6 BMDMs? Are there even less mycobacteria?

      Author: We included B6 infection data to the updated Fig.6B and added Suppl.Fig.7C and 7D that address this reviewer’s comment. The data represent day 5 of Mtb infection as indicated in the updated Fig.6B and Suppl.Fig.7C and 7D legends. New Suppl.Fig.7D shows quantification of replicating Mtb using Mtb replication reporter stain expressing single strand DNA binding protein GFP fusion, as described in Methods. We observed fewer Mtb and a lower percentage of replicating Mtb in B6 macrophages, but we did not observe a complete Mtb elimination in either background.

      We used red fluorescence for both Mtb::mCherry and 4-HNE staining to clearly visualize the SSB-GFP puncta in replicating Mtb DNA. In the revised manuscript, we have included the relevant channels in Suppl. Fig.7C and D to demonstrate clearly distinct patterns of Mtb::mCherry and 4-HNE signals. We did not aim to quantify the 4-HNE signal intensity in this experiment. For the 4-HNE quantification we use Mtb that expressed no reporter proteins (Fig.6A-B and Suppl.Fig.7A-B).

      • Fig 6E: In the context of survival a viability staining needs to be included, as well as the data from day 0. Then it needs to be analyzed whether cell numbers remain the same from D0 or if there is a change.

      Author: We updated Fig.6 legend to indicate that the cell number percentages were calculated based on the number of cells at Day 0 (immediately after Mtb infection). We routinely use fixable cell death staining to enumerate cell death to exclude artifacts due to cell loss. Brief protocol containing this information is included in Methods section. The detailed protocol including normalization using BCG spike has been published – Yabaji et al, STAR Protocols, 2022. Here we did not present dead cell percentage as it remained low and we did not observe damage to macrophage monolayers. The fold change of Mtb was calculated after normalization using Mtb load at Day 0 after infection and washes.

      "The 3D imaging demonstrated that YFP-positive cells were restricted to the lesions, but did not strictly co-localize with intracellular Mtb, i.e. the Ifnb promoter activity was triggered by inflammatory stimuli, but not by the direct recognition of intracellular bacteria. We validated the IFNb reporter findings using in situ hybridization with the Ifnb probe, as well as anti-GFP antibody staining (Suppl.Fig.8B - E)." The colocalization is not present within the tissue sections. It seems that the reporter line does not show the same staining pattern in vivo as the IFNß probe or the anti GFP antibody staining. The reporter line has to be tested for the specificity of the staining. Furthermore, to state that it was restricted to the lesions, an uninvolved tissue area needs to be depicted.

      Author: The Ifnb secreting cells are notoriously difficult to detect in vivo using direct staining of the protein. Therefore, lineage tracing of reporter expression are used as surrogates. The Ifnb reporter used in our study has been developed by the Locksley laboratory (Scheu et al., PNAS, 2008, PMID: 19088190) and has been validated in many independent studies. The reporter mice express the YFP protein under the control of the Ifnb1 promoter. The YFP protein accumulates within the cells, while Ifnb protein is rapidly secreted and does not accumulate in the producing cells in appreciable amounts. Also, the kinetics of YFP protein degradation is much slower as compared to the endogenous Ifnb1 mRNA that was detected using in situ hybridization. Thus, there is no precise spatiotemporal coincidence of these readouts in Ifnb expressing cells in vivo. However, this methodology more closely reflect the Ifnb expressing cells in vivo, as compared to a Cre-lox mediated lineage tracing approach. In the revised manuscript we demonstrate that both YFP and mRNA signals partially overlap (Suppl.Fig.12B). In Suppl.Fig.12B. we also included a new panel showing no YFP expression in the uninvolved area of the reporter mice infected with Mtb. The YFP expression by activated macrophages is demonstrated by co-staining with Iba1- and iNOS-specific antibodies (new Fig.7D and Suppl.Fig.13A). Our specificity control also included TB lesions in mice that do not carry the YFP reporter and did not express the YFP signal, as reported elsewhere (Yabaji et al., BioRxiv, https://doi.org/10.1101/2023.10.17.562695).

      • Are paucibacillary and multibacillary lesions different within the same animal or does one animal have one lesion phenotype? If that is the case, what is causing the differences between mice? Bacterial counts for the mice are required.

      Author: The heterogeneity of pulmonary TB lesions has been widely acknowledged in clinic and highlighted in recent experimental studies. In our model of chronic pulmonary TB (described in detail in Yabaji et al., https://doi.org/10.1101/2025.02.28.640830 and https://doi.org/10.1101/2023.10.17.562695) the development of pulmonary TB lesions is not synchronized, i.e. the lesions are heterogeneous between the animals and within individual animals at the same timepoint. Therefore, we performed a lesion stratification where individual lesions were classified by a certified veterinary pathologist in a blinded manner based on their morphology (H&E) and acid fast staining of the bacteria, as depicted in Suppl.Fig.8.

      • "Among the IFN-inducible genes upregulated in paucibacillary lesions were Ifi44l, a recently described negative regulator of IFN-I that enhances control of Mtb in human macrophages (DeDiego et al, 2019; Jiang et al, 2021) and Ciita, a regulator of MHC class II inducible by IFNy, but not IFN-I (Suppl.Table 8 and Suppl.Fig.10 D-E)." Why is Sup. Fig. 10 D, E referred to? The figure legend is also not clear, e.g. what means "upregulated in a subset of IFN-inducible genes"? Input for the hallmarks needs to be defined.

      Author: These data is now presented in Suppl.Fig.11 and following the reviewer’s comment, we moved reference to panels 11D – E up to previous paragraph in the main text, where it naturally belongs . We also edited the figure legend to refer to the list of IFN-inducible genes compiled from the literature that is discussed in the text. We appreciate the reviewer’s suggestion that helped us improve the text clarity. The inputs for the Hallmark pathway analysis are presented in Suppl.Tables 7 and 8, as described in the text.

      • Fig. 7C: Single channel pictures are required as it is hard to see the differences in staining with so many markers. Why is there no iNOS expression in the bottom row? What does the rectangle indicate on the bottom right? As black is chosen for DAPI, it is not visible at all. In case the signal is needed a visible a color should be chosen.

      Author: We thoroughly revised this figure to address the reviewer’s concern about the lack of clarity. We provide individual channels for each marker in Fig.7D – E and Suppl.Fig.13F. We have to use DAPI in these presentation in gray scale to better visualize other markers.

      • "In the advanced lesions these markers were primarily expressed by activated macrophages (Iba1+) expressing iNOS and/or Ifny (YFP+)(Fig.7D)" Iba1 is needed in the quantification. Based on the images, iNOS seems to be highly produced in Iba1 negative cells. Which cells do produce it then? Flow cytometry data for this quantification are required. This would allow you to specifically check which cells express the markers and allow for a more precise analysis of double positive cells.

      Author: Currently these data demonstrating the co-localization of stress markers phospho-c-Jun and Chac1 with YFP are presented in Fig.7E (images) and Suppl.Fig.13D (quantification). The co-localization of stress markers phospho-cJun and Chac1 with iNOS is presented in Suppl.Fig.13F (images) and Suppl.Fig.13E (quantification). We agree that some iNOS+ cells are Iba1-negative (Fig.7D). We manually quantified percentages of Iba1+iNOS+ double positive cells and demonstrated that they represent the majority of the iNOS+ population(Suppl.Fig.13A). Regarding the required FACS analysis, we focus on spatial approaches because of the heterogeneity of the lesions that would be lost if lungs are dissociated for FACS. We are working on spatial transcriptomics at a single cell resolution that preserves spatial organization of TB lesions to address the reviewer’s comment and will present our results in the future.

      • Results part 6: In general, can you please state for each experiment at what time point mice were analyzed? You should include an additional macrophage staining (e.g. MerTK, F4/80), as alveolar macrophages are not staining well for Iba1 and you might therefore miss them in your IF microscopy. It would be very nice if you could perform flow cytometry to really check on the macrophages during infection and distinguish subsets (e.g. alveolar macrophages, interstitial macrophages, monocytes).

      Author: We have included the details of time post infection in figure legends for Fig.7, Suppl.Figures 8, 9, 12B, 13, 14A of the revised manuscript. We have performed staining with CD11b, CD206 and CD163 to differentiate the recruited and lung resident macrophages and determined that in chronic pulmonary TB lesions in our model the vast majority of macrophages are recruited CD11b+, but not resident (CD206+ and CD163+) macrophages. These data is presented in another manuscript (Yabaji et al., BioRxiv https://doi.org/10.1101/2023.10.17.562695).

      • Spatial sequencing: The manuscript would highly profit from more data on that. It would be very interesting to check for the DEGs and show differential spatial distribution. Expression of marker genes should be inferred to further define macrophage subsets (e.g. alveolar macrophages, interstitial macrophages, recruited macrophages) and see if these subsets behave differently within the same lesion but also between the lesions. Additional bioinformatic approaches might allow you to investigate cell-cell interactions. There is a lot of potential with such a dataset, especially from TB lesions, that would elevate your findings and prove interesting to the TB field.

      • "Thus, progression from the Mtb-controlling paucibacillary to non-controlling multibacillary TB lesions in the lungs of TB susceptible mice was mechanistically linked with a pathological state of macrophage activation characterized by escalating stress (as evidenced by the upregulation phospho-cJUN, PKR and Chac1), the upregulation of IFNβ and the IFN-I pathway hyperactivity, with a concurrent reduction of IFNγ responses." To really show the upregulation within macrophages and their activation, a more detailed IF microscopy with the inclusion of additional macrophage markers needs to be provided. Flow cytometry would enable analysis for the differences between alveolar and interstitial macrophages, as well as for monocytes. As however, it seems that the majority of iNOS, as well as the stress associated markers are not produced by Iba1+ cells. Analyzing granulocytes and T lymphocytes should be considered.

      Author: We appreciate the reviewer’s suggestion. Indeed, our model provides an excellent opportunity to investigate macrophage heterogeneity and cell interactions within chronic TB lesions. We are working on spatial transcriptomics at a single cell resolution that would address the reviewer’s comment and will present our results in the future.

      In agreement with classical literature the overwhelming majority of myeloid cells in chronic pulmonary TB lesions is represented by macrophages. Neutrophils are detected at the necrotic stage, but our study is focused on pre-necrotic stages to reveal the earlier mechanisms pre-disposing to the necrotization. We never observed neutrophils or T cells expressing iNOS in our studies.

      • It's mentioned in the method section that controls in the IF staining were only fixed for 10min, while the infected cells were fixed for 30min. Consistency is important as the PFA fixation might impact on the fluorescence signal. Therefore, controls should be repeated with the same fixation time.

      Author: We have carefully considered the impact of fixation time on fluorescence and have separately analyzed the non-infected and infected samples to address this concern.

      For the non-infected samples, we examined the effect of TNF in both B6 and B6.Sst1S backgrounds, ensuring that a consistent fixation protocol (10 min) was applied across all experiments without Mtb infection.

      For the Mtb infection experiments, we employed an optimized fixation protocol (30 min) to ensure that Mtb was killed before handling the plates, which is critical for preserving the integrity of the samples. In this context, we compared B6 and B6.Sst1S samples to evaluate the effects of fixation and Mtb infection on lipid peroxidation (LPO) induction.

      We believe this approach balances the need for experimental consistency with the specific requirements for handling infected cells, and we have revised the manuscript to reflect this clarification.

      • Reactive oxygen species levels should be determined in B6 and B6.Sst1S BMDMs (stimulated and unstimulated), as they are very important for oxidative stress.

      Author: We have conducted experiments to measure ROS production in both B6 and B6.Sst1S BMDMs and demonstrated higher levels of ROS in the susceptible BMDMs after prolonged TNF stimulation (new Fig.3I – J and Suppl. Fig. 3G). Additionally, we have previously published a comparison of ROS production between B6 and B6.Sst1S by FACS (PMID: 33301427), which also supports the findings presented here.

      • Sup. Fig 2C: The inclusion of an unstimulated control would be advisable in order to evaluate if there are already difference in the beginning.

      Author: We have included the untreated control to the Suppl. Fig. 2C (currently Suppl. Fig. 2D) in the revised manuscript.

      • Sup. Fig. 3F: Why is the fold change now lower than in Fig. 4D (fold change of around 28 compared to 120 in 4D)?

      Author: The data in Fig.4D (Fig.4E in the revised manuscript) and Suppl.Fig.3F (currently Suppl.Fig.4C) represent separate experiments and this variation between experiments is commonly observed in qRT-PCR that is affected by slight variations in the expression in unsimulated controls used for the normalization and the kinetics of the response. This 2-4 fold difference between same treatments in separate experiments, as compared to 30 – 100 fold and higher induction by TNF does not affect the data interpretation.

      • Sup. Fig. 5C, D: The data seems very interesting as you even observe an increase in gene expression. Data for the B6 mice should be evaluated for increase to a similar level as the TNF treated mutants. Data on the viability of the cells are necessary, as they no longer receive M-CSF and might be dying at this point already.

      Author: To ensure that the observed effects were not confounded by cytotoxicity, we determined non-toxic concentrations of the CSF1R inhibitors during 48h of incubation and used them in our experiments that lasted for 24h. To address this valid comment, we have included cell viability data in the revised manuscript to confirm that the treatments did not result in cell death (Suppl. Fig. 6D). This experiment rejected our hypothesis that CSF1 driven Myc expression could be involved in the Ifnb superinduction. Other effects of CSF1R inhibitors on type I IFN pathway are intriguing but are beyond the scope of this study.

      • Sup. Fig 12: the phospho-c-Jun picture for (P) is not the same as in the merged one with Iba1. Double positive cells are mentioned to be analyzed, but from the staining it appears that P-c-Jun is expressed by other cells. You do not indicate how many replicates were counted and if the P and M lesions were evaluated within the same animal. What does the error bar indicate? It seems unlikely from the plots that the double positive cells are significant. Please provide the p values and statistical analysis.

      Author: We thank the reviewer for bringing this inadvertent field replacement in the single phospho-cJun channel to our attention. However, the quantification of Iba1+phospho-cJun+ double positive cells in Suppl.Fig.12 and our conclusions were not affected. In the revised manuscript, images and quantification of phospho-cJun and Iba1 co-expression are shown in new Suppl.Fig.13B and C, respectively. We have also updated the figure legends to denote the number of lesions analyzed and statistical tests. Specifically, lesions from 6–8 mice per group (paucibacillary and multibacillary) were evaluated. Each dot in panels Suppl.Fig.13 represent individual lesions.

      • Sup. Fig. 13D (suppl.Fig.15D now): What about the expression of MYC itself? Other parts of the signaling pathway should be analyzed(e.g. IFNb, JNK)?

      Author: The difference in MYC mRNA expression tended to be higher in TB patients with poor outcomes, but it was not statistically significant after correction for multiple testing. The upregulation of Myc pathway in the blood transcriptome associated with TB treatment failure most likely reflects greater proportion of immature cells in peripheral blood, possibly due to increased myelopoiesis. Pathway analysis of the differentially expressed genes revealed that treatment failures were associated with the following pathways relevant to this study: NF-kB Signaling, Flt3 Signaling in Hematopoietic Progenitor Cells (indicative of common myeloid progenitor cell proliferation), SAPK/JNK Signaling and Senescence (possibly indicative of oxidative stress). The upregulation of these pathways in human patients with poor TB treatment outcomes correlates with our findings in TB susceptible mice.

      • In the mfIHC you he usage of anti-mouse antibodies is mentioned. Pictures of sections incubated with the secondary antibody alone are required to exclude the possibility that the staining is not specific. Especially, as this data is essential to the manuscript and mouse-anti-mouse antibodies are notorious for background noise.

      Author: We are well aware of the technical difficulties associated with using mouse on mouse staining. In those cases, we use rabbit anti-mouse isotype specific antibodies specifically developed to avoid non-specific background (Abcam cat#ab133469). Each antibody panel for fluorescent multiplexed IHC is carefully optimized prior to studies. We did not use any primary mouse antibodies in the final version of the manuscript and, hence, removed this mention from the Methods.

      • In order to tie the story together, it would be interesting to treat infected mice with an INFAR antibody, as well as perform this experiment with a Myc antibody. According to your data, you might expect the survival of the mice to be increased or bacterial loads to be affected.

      Author: In collaboration with the Vance laboratory, we tested effects of type I IFN pathway inhibition in B6.Sst1S mice on TB susceptibility: either type I receptor knockout or blocking antibodies increased their resistance to virulent Mtb (published in Ji et al., 2019; PMID 31611644). Unfortunately, blocking Myc using neutralizing antibodies in vivo is not currently achievable. Specifically blocking Myc using small molecule inhibitors in vivo is notoriously difficult, as recognized in oncology literature. We consider using small molecule inhibitors of either Myc translation or specific pathways downstream of Myc in the future.

      • It is surprising that you not even once cite or mention your previous study on bioRxiv considering the similarity of the results and topic (https://doi.org/10.1101/2020.12.14.422743). Is not even your Figure 1I and Figure 2 J, K the same as in that study depicted in Figure 4?

      Author: The reviewer refers to the first version of this manuscript uploaded to BioRxiv, but it has never been published. We continued this work and greatly expanded our original observations, as presented in the current manuscript. Therefore, we do not consider the previous version as an independent manuscript and, therefore, do not cite it.

      • Please revise spelling of the manuscript and pay attention to write gene names in italics

      Author: Thank you, we corrected the gene and protein names according to current nomenclature.

      Minor points: - Fig. 1: Please provide some DEGs that explain why you used this resolution for the clustering of the scRNAseq data and that these clusters are truly distinct from each other.

      Author: Differential gene expression in clusters is presented in Suppl.Fig.1C (interferon response) and Suppl.Fig.1D (stress markers and interferon response previously established in our studies).

      • Fig. 1F: What do the two lines represent (magenta, green)?

      Author: The lines indicate pseudotime trajectories of B6 (magenta) and B6.Sst1S (green) BMDMs.

      • Fig. 1F, G: Why was cluster 6 excluded?

      Author: This cluster was not different between B6 and B6.Sst1S, so it was not useful for drawing the strain-specific trajectories.

      • Fig. 1E, G, H: The intensity scales are missing. They are vital to understand the data.

      Author: We have included the scale in revised manuscript (Fig.1E,G,H and Suppl.Fig.1C-D).

      • Fig. 2G-I: please revise order, as you first refer to Fig. 2H and I

      Author: We revised the panels’ order accordingly

      • Fig. 5: You say the data represents three samples but at least in D and E you have more. Please revise. Why do you only include at (G) the inhibitor only control?

      Author: We added the inhibitor only controls to Fig. 5D - H. We also indicated the number of replicates in the updated Fig.5 legend.

      • Figure 7A, Sup. Fig. 8: Are these maximum intensity projection? Or is one z-level from the 3D stack depicted?

      Author: The Fig. 7A shows 3D images with all the stacks combined.

      • Fig. 7B: What do the white boxes indicate?

      Author: We have removed this panel in the revised version and replaced it with better images.

      • Sup. Fig. 1A: The legend for the staining is missing

      Author: The Suppl. Fig.1A shows the relative proportions of either naïve (R and S) or TNF-stimulated (RT and ST) B6 or B6.Sst1S macrophages within individual single cell clusters depicted in Fig.1B. The color code is shown next to the graph on the right.

      • Sup. Fig. 1B: The feature plots are not clear: The legend for the expression levels is missing. What does the heading means?

      Author: We updated the headings, as in Fig.1C. The dots represent individual cells expressing Sp110 mRNA (upper panels) and Sp140 mRNA (lower panels).

      • Sup. Fig. 3C: The scale bar is barely visible.

      Author: We resized the scale bar to make it visible and presented in Suppl. Fig.3E (previously Suppl. Fig.3C).

      • Sup. Fig. 3D: There is not figure legend or the legend to C-E is wrong.

      • Sup. Fig. 3F, G: You do not state to what the data is relative to.

      Author: We identified an error in the Suppl.Fig.3 legend referring to specific panels. The Suppl.Fig.3 legend has been updated accordingly. New panels were added and Suppl.Fig.3-G panels are now Suppl.Fig.4C-D.

      • Sup. Fig. 3H: It seems you used a two-way ANOVA, yet state it differently. Please revise the figure legend, as Dunnett's multiple comparison would only check for significances compared to the control.

      Author: Following the reviewer’s comment, we repeated statistical analysis to include correction for multiple comparisons and revised the figure and legend accordingly.

      • Sup. Fig. 4A, B: It is not clear what the lines depict as the legend is not explained. Names that are not required should be changed to make it clear what is depicted (e.g. "TE@" what does this refer to?)

      Author: This previous Sup. Fig 4 is now Sup. Fig. 5. The “TE@” is a leftover label from the bioinformatics pipeline, referring to “Transposable Element”. We apologize for this confusion and have removed these extraneous labels. We have also added transposon names of the LTR (MMLV30 and RTLV4) and L1Md to Suppl.Fig.5A and 5B legend, respectively.

      • Sup. 4B: What does the y-scale on the right refer to?

      Author: We apologize for the missing label for the y-scale on the right which represents the mRNA expression level for the SetDB1 gene, which has a much lower steady state level than the LINE L1Md, so we plotted two Y-scales to allow both the gene and transposon to be visualized on this graph.

      • Sup. 4C: Interpretation of the data is highly hindered by the fact that the scales differ between the B6 and B6.Sst1. The scales are barely visible.

      Author: We apologize for the missing labels for the y-scales of these coverage plots, which were originally meant to just show a qualitative picture of the small RNA sequencing that was already quantitated by the total amounts in Sup. 4B. We have added thee auto-scaled Y-scales to Sup. 4C and improved the presentation of this figure.

      • Sup. Fig. 5A, B: Is the legend correct? Did you add the antibody for 2 days or is the quantification from day 3?

      Author: We recognize that the reviewer refers to Suppl.Fig.6A-B (Suppl.Fig.7A-B in the revised manuscript). We did not add antibodies to live cells. The figure legend describes staining with 4-HNE-specific antibodies 3 days post Mtb infection.

      • Sup. Fig. 8A: Are the "early" and "intermediate" lesions from the same time points? What are the definitions for these stages?

      Author: We discussed our lesion classification according to histopathology and bacterial loads above. Of note, in the revised manuscript we simplified our classification to denote paucibacillary and multibacillary lesions only. We agree with reviewers that designation lesions as early, intermediate and advanced lesions were based on our assumptions regarding the time course of their progression from low to high bacterial loads.

      • Sup. Fig. 8E: You should state that the bottom picture is an enlargement of an area in the top one. Scale bars are missing.

      Author: We replaced this panel with clearer images in Suppl.Fig.12B.

      • Sup. Fig. 11A: The IF staining is only visible for Iba and iNOS. Please provide single channels in order to make the other staining visible.

      Author: Suppl.Fig.11A (now Suppl.Fig.13B) shows the low-magnification images of TB lesions. In the Fig. 7 and Suppl. Fig. 13F of the revised manuscript we provided images for individual markers.

      • Sup. Fig. 13A (Suppl.Fig.15A now): Your axis label is not clear. What do the numbers behind the genes indicate? Why did you choose oncogene signatures and not inflammatory markers to check for a correlation with disease outcome?

      Author: X axis of Suppl.Fig.15A represent pre-defined molecular signature gene sets MSigDB) in Gene Set Enrichment Analysis (GSEA) database (https://www.gsea-msigdb.org/gsea/msigdb). On Y axis is area under curve (AUC) score for each gene set.

      • Sup. 13D(Suppl.Fig.15D now):: Maybe you could reorder the patients, so that the impression is clearer, as right now only the top genes seem to show a diverging gene signature, while the rest gives the impression of an equal distribution.

      Author: The Myc upregulated gene set myc_up was identified among top gene sets associated with treatment failure using unbiased ssGSEA algorithm. We agree with the reviewer that not every gene in the myc_up gene set correlates with the treatment outcome. But the association of the gene set is statistically significant, as presented in Suppl.Fig.15B – C.

      • The scale bars for many microscopy pictures are missing.

      Author: We have included clearly visible scale bars to all the microscopy images in the revised version.

      • The black bar plots should be changed (e.g. in color), since the single data points cannot be seen otherwise.
      • It would be advisable that a consistent color scheme would be used throughout the manuscript to make it easier to identify similar conditions, as otherwise many different colours are not required and lead right now rather to confusion (e.g. sometimes a black bar refers to BMDMs with and sometimes without TNF stimulation, or B6 BMDMs). Furthermore, plot sizes and fonts should be consistent within the manuscript (including the supplemental data)

      Author: We followed this useful suggestion and selected consistent color codes for B6 and B6.Sst1S groups to enhance clarity throughout the revised manuscript.

      Within the methods section: - At which concentration did you use the IFNAR antibody and the isotype?

      Author: We updated method section by including respective concentrations in the revised manuscript.

      • Were mice maintained under SPF conditions? At what age where they used?

      Author: Yes, the mice are specific pathogen free. We used 10 - 14 week old mice for Mtb infection.

      • The BMDM cultivation is not clear. According to your cited paper you use LCCM but can you provide how much M-CSF it contains? How do you make sure that amounts are the same between experiments and do not vary? You do not mention how you actually obtain this conditioned medium. Is there the possibility of contamination or transferred fibroblasts that would impact on the data analysis? Is LCCM also added during stimulation and inhibitor treatment?

      Author: We obtain LCCM by collecting the supernatant from L929 cell line that form confluent monolayer according to well-established protocols for LCCM collection. The supernatants are filtered through 0.22 micron filters to exclude contamination with L929 cells and bacteria. The medium is prepared in 500 ml batches that are sufficient for multiples experiments. Each batch of L929-conditioned medium is tested for biological activity using serial dilutions.

      • How was the BCG infection performed? How much bacteria did you use? Which BCG strain was used?

      Author: We infected mice with M. bovis BCG Pasteur subcutaneously in the hock using 106 CFU per mouse.

      • At what density did you seed the BMDMs for stimulation and inhibitor experiments?

      Author: In 96 well plates, we seed 12,000 cells per well and allow the cells to grow for 4 days to reach confluency (approximately 50,000 cells per well). For a 6-well plate, we seed 2.5 × 10^5 cells per well and culture them for 4 days to reach confluency. For a 24-well plate, we seed 50,000 cells per well and keep the cells in media for 4 days before starting any treatments. This ensures that the cells are in a proliferative or near-confluent state before beginning the stimulation or inhibitor treatments. Our detailed protocol is published in STAR Protocols (Yabaji et al., 2022; PMID 35310069).

      • What machine did you use to perform the bulk RNA sequencing? How many replicates did you include for the sequencing?

      Author: For bulk sequencing we used 3 RNA samples for each condition. The samples were sequenced at Boston University Microarray & Sequencing Resource service using Illumina NextSeq™ 2000 instrument.

      • How many replicates were used for the scRNA sequencing? Why is your threshold for the exclusion of mitochondrial DNA so high? A typical threshold of less than 5% has been reported to work well with mouse tissue.

      Author: We used one sample per condition. For the mitochondrial cutoff, we usually base it off of the total distribution. There is no "universal" threshold that can be applied to all datasets. Thresholds must be determined empirically.

      • You do not mention how many PCAs were considered for the scRNA sequencing analysis.

      Author: We considered 50 PCAs, this information was added to Methods

      • You should name all the package versions you used for the scRNA sequencing (e.g. for the slingshot, VAM package)

      Author: The following package versions were used: Seurat v4.0.4, VAM v1.0.0, Slingshot v2.3.0, SingleCellTK v2.4.1, Celda v1.10.0, we added this information to Methods.

      • You mention two batches for the human samples. Can you specify what the two batches are?

      Author: Human blood samples were collected at five sites, as described in the updated Methods section and two RNAseq batches were processed separately that required batch correction.

      • At which temperature was the IF staining performed?

      Author: We performed the IF at 4oC. We included the details in revised version.

      Reviewer #2 (Significance (Required)):

      Overall, the manuscript has interesting findings with regard to macrophage responses in Mycobacteria tuberculosis infection. However, in its current form there are several shortcomings, both with respect to the precision of the experiments and conclusions drawn.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary The authors use a mouse model designed to be more susceptible to M.tb (addition of sst1 locus) which has granulomatous lesions more similar to human granulomas, making this mouse highly relevant for M.tb pathogenesis studies. Using WT B6 macrophages or sst1B6 macrophages, the authors seek to understand the how the sst1 locus affects macrophage response to prolonged TNFa exposure, which can occur during a pro-inflammatory response in the lungs. Using single cell RNA-seq, revealed clusters of mutant macrophages with upregulated genes associated with oxidative stress responses and IFN-I signaling pathways when treated with TNF compared to WT macs. The authors go on to show that mutant macrophages have decreased NRF2, decreased antioxidant defense genes and less Sp110 and Sp140. Mutant macrophages are also more susceptible to lipid peroxidation and iron-mediated oxidative stress. The IFN-I pathway hyperactivity is caused by the dysregulation of iron storage and antioxidant defense. These mutant macrophages are more susceptible to M.tb infection, showing they are less able to control bacterial growth even in the presence of T cells from BCG vaccinated mice. The transcription factor Myc is more highly expressed in mutant macs during TNF treatment and inhibition Myc led to better control of M.tb growth. Myc is also more abundant in PBMCs from M.tb infected humans with poor outcomes, suggesting that Myc should be further investigated as a target for host-directed therapies for tuberculosis.

      Major Comments Isotypes for IF imaging and confocal IF imaging are not listed, or not performed. It is a concern that the microscopy images throughout the manuscript do not have isotype controls for the primary antibodies.

      Fig 4 (and later) the anti-IFNAR Ab is used along with the Isotype antibody, Fig 4I does not show the isotype. Use of the isotype antibody is also missing in later figures as well as Fig 3J. Why was this left off as the proper control for the Ab?

      Author: We addressed the comment in revised manuscript as described above in summary and responses to reviewers 1 and 2. Isotype controls for IFNAR1 blockade were included in Fig.3M (previously 3J), Fig. 4I, Suppl.Fig.4G (previously Fig.4I), and updated Fig.4C -E, Fig.6L-M, Suppl.Fig.4F -G, 7I.

      Conclusions drawn by the authors from some of the WB data are worded strongly, yet by eye the blots don't look as dramatically different as suggested. It would be very helpful to quantify the density of bands when making conclusions. (for example, Fig 4A).

      Author: We added the densitometry of Western blot values after normalization above each lane in Fig.2A – C, Fig.3C – D and 3K; Fig.4A – B, Fig5B,C,I,J.

      Fig 5A is not described clearly. If the gene expression is normalized to untreated B6 macs, then the level of untreated B6 macs should be 1. In the graph the blue bars are slightly below 1, which would not suggest that levels "initially increased and subsequently downregulated" as stated in the text. It seems like the text describes the protein expression but not the RNA expression. Please check this section and more clearly describe the results.

      Author: We appreciate the reviewer’s comment and modified the text to specify the mRNA and protein expression data, as follows:

      “We observed that Myc was regulated in an sst1-dependent manner: in TNF-stimulated B6 wild type BMDMs, c-Myc mRNA was downregulated, while in the susceptible macrophages c-Myc mRNA was upregulated (Fig.5A). The c-Myc protein levels were also higher in the B6.Sst1S cells in unstimulated BMDMs and 6 – 12 h of TNF stimulation (Fig.5B)”.

      Also, why look at RNA through 24h but protein only through 12h? If c-myc transcripts continue to increase through 24h, it would be interesting to see if protein levels also increase at this later time point.

      Author: The time-course of Myc expression up to 24 h is presented in new panels Fig. 5I-5J

      It demonstrates the decrease of Myc protein levels at 24 h. In the wild type B6 BMDMs the levels of Myc protein significantly decreased in parallel with the mRNA suppression presented in Fig.5A. In contrast , we observed the dissociation of the mRNA and protein levels in the sst1-mutant BMDMs at 12 and 24 h, most likely, because the mutant macrophages develop integrated stress response (as shown in our previous publication by Bhattacharya et al., JCI, 2021) that is known to inhibit Myc mRNA translation.

      Fig 5J the bands look smaller after D-JNK1 treatment at 6 and 12h though in the text is says no change. Quantifying the bands here would be helpful to see if there really is no difference.

      Author: This experiment was repeated twice, and the average normalized densitometry values are presented in the updated Fig.5J. The main question addressed in this experiment was whether the hyperactivity of JNK in TNF-stimulated sst1 mutant macrophages contributed to Myc upregulation, as was previously shown in cancer. Comparing effects of JNK inhibition on phospho-cJun and c-Myc protein levels in TNF stimulated B6.Sst1S macrophages (updated Fig.5J), we concluded that JNK did not have a major role in c-Myc upregulation in this context.

      Section 4, third paragraph, the conclusion that JNK activation in mutant macs drives pathways downstream of Myc are not supported here. Are there data or other literature from the lab that supports this claim?

      Author: This statement was based on evidence from available literature where JNK was shown to activate oncogens, including Myc. In addition, inhibition of Myc in our model upregulated ferritin (Fig.Fig.5C), reduced the labile iron pool, prevented the LPO accumulation (Fig.5D - G) and inhibited stress markers (Fig.5H). However, we do not have direct experimental evidence in our model that Myc inhibition reduces ASK1 and JNK activities. Hence, we removed this statement from the text and plan to investigate this in the future.

      Fig 6N Please provide further rationale for the BCG in vivo experiment. It is unclear what the hypothesis was for this experiment.

      Author: In the current version BCG vaccination data is presented in Suppl.Fig.14B. We demonstrate that stressed BMDMs do not respond to activation by BCG-specific T cells (Fig.6J) and their unresponsiveness is mediated by type I interferon (Fig.6L and 6M). The observed accumulation of the stressed macrophages in pulmonary TB lesions of the sst1-susceptible mice (Fig.7E, Suppl.Fig.13 and 14A) and the upregulation of type I interferon pathway (Fig.1E,1G, 7C), Suppl.Fig.1C and 11) suggested that the effect of further boosting T lymphocytes using BCG in Mtb-infected mice will be neutralized due to the macrophage unresponsiveness. This experiment provides a novel insight explaining why BCG vaccine may not be efficient against pulmonary TB in susceptible hosts.

      The in vitro work is all concerning treatment with TNFa and how this exposure modifies the responses in B6 vs sst1B6 macrophages; however, this is not explored in the in vivo studies. Are there differences in TNFa levels in the pauci- vs multi-bacillary lesions that lead to (or correlate with) the accumulation of peroxidation products in the intralesional macrophages. How to the experiments with TNFa in vitro relate back to how the macrophages are responding in vivo during infection?

      Author: Our investigation of mechanisms of necrosis of TB granulomas stems from and supported by in vivo studies as summarized below.

      This work started with the characterization necrotic TB granulomas in C3HeB/FeJ mice in vivo followed by a classical forward genetic analysis of susceptibility to virulent Mtb in vivo.

      That led to the discovery of the sst1 locus and demonstration that it plays a dominant role in the formation of necrotic TB granulomas in mouse lungs in vivo. Using genetic and immunological approaches we demonstrated that the sst1 susceptibility allele controls macrophage function in vivo (Yan, et al., J.Immunol. 2007) and an aberrant macrophage activation by TNF and increased production of Ifn-b in vitro (He et al. Plos Pathogens, 2013). In collaboration with the Vance lab we demonstrated that the type I IFN receptor inactivation reduced the susceptibility to intracellular bacteria of the sst1-susceptible mice in vivo (Ji et al., Nature Microbiology, 2019). Next, we demonstrated that the Ifnb1 mRNA superinduction results from combined effects of TNF and JNK leading to integrated stress response in vitro (Bhattacharya, JCI, 2021). Thus, our previous work started with extensive characterization of the in vivo phenotype that led to the identification of the underlying macrophage deficiency that allowed for the detailed characterization of the macrophage phenotype in vitro presented in this manuscript. In a separate study, the Sher lab confirmed our conclusions and their in vivo relevance using Bach1 knockout in the sst1-susceptible B6.Sst1S background, where boosting antioxidant defense by Bach1 inactivation resulted in decreased type I interferon pathway activity and reduced granuloma necrosis. We have chosen TNF stimulation for our in vitro studies because this cytokine is most relevant for the formation and maintenance of the integrity of TB granulomas in vivo as shown in mice, non-human primates and humans. Here we demonstrate that although TNF is necessary for host resistance to virulent Mtb, its activity is insufficient for full protection of the susceptible hosts, because of altered macrophages responsiveness to TNF. Thus, our exploration of the necrosis of TB granulomas encompass both in vitro and extensive in vivo studies.

      Minor comments Introduction, while well written, is longer than necessary. Consider shortening this section. Throughout figures, many graphs show a fold induction/accumulation/etc, but it is rarely specified what the internal control is for each graph. This needs to be added. Paragraph one, authors use the phrase "the entire IFN pathway was dramatically upregulated..." seems to be an exaggeration. How do you know the "entire" IFN pathway was upregulated in a dramatic fashion?

      Author: 1) We shortened the introduction and discussion; 2) verified that figure legends internal controls that were used to calculate fold induction; 3) removed the word “entire” to avoid overinterpretation.

      Figures 1E, G and H and supp fig 1C, the heat maps are missing an expression key Section 2 second paragraph refers to figs 2D, E as cytoplasmic in the text, but figure legend and y-axis of 2E show total protein.

      Author: The expression keys were added to Fig.1E,G,H, Fig.7C, Suppl.Fig.1C and 1D and Suppl.Fig.11A of the revised manuscript.

      Section 3 end of paragraph 1 refers to Fig 3h. Does this also refer to Supp Fig 3E?

      Author: Yes, Fig.3H shows microscopy of 4-HNE and Suppl.Fig.3H shows quantification of the image analysis. In the revised manuscript these data are presented in Fig.3H and Suppl.Fig.3F. The text was modified to reflect this change.

      Supplemental Fig 3 legend for C-E seems to incorrectly also reference F and G.

      Author: We corrected this error in the figure legend. New panels were added to Suppl.Fig.3 and previous Suppl.Fig.3F and G were moved to Suppl.Fig.4 panels C and D of the revise version.

      Fig 3K, the p-cJun was inhibited with the JNK inhibitor, however it’s unclear why this was done or the conclusion drawn from this experiment. Use of the JNK inhibitor is not discussed in the text.

      Author: The JNK inhibitor was used to confirm that c-Jun phosphorylation in our studies is mediated by JNK and to compare effects of JNK inhibition on phospho-cJun and Myc expression. This experiment demonstrated that the JNK inhibitor effectively inhibited c-Jun phosphorylation but not Myc upregulation, as shown in Fig.5I-J of the revised manuscript.

      Fig 4 I and Supp Fig 3 H seem to have been swapped? The graph in Fig 4I matches the images in Supp Fig 3I. Please check.

      Author: We reorganized the panels to provide microscopy images and corresponding quantification together in the revised the panels Fig. 4H and Fig. 4I, as well as in Suppl. Fig. 4F and Suppl. Fig. 4G.

      Fig 6, it is unclear what % cell number means. Also for bacterial growth, the data are fold change compared to what internal control?

      Author: We updated Fig.6 legend to indicate that the cell number percentages were calculated based on the number of cells at Day 0 (immediately after Mtb infection). We routinely use fixable cell death staining to enumerate cell death. Brief protocol containing this information is included in Methods section. The detailed protocol including normalization using BCG spike has been published – Yabaji et al, STAR Protocols, 2022. Here we did not present dead cell percentage as it remained low and we did not observe damage to macrophage monolayers. This allows us to exclude artifacts due to cell loss. The fold change of Mtb was calculated after normalization using Mtb load at Day 0 after infection and washes.

      Fig 7B needs an expression key

      Author: The expression keys was added to Fig.7C (previously Fig. 7B).

      Supp Fig 7 and Supp Fig 8A, what do the arrows indicate?

      Author: In Suppl.Fig.8 (previously Suppl.Fig.7) the arrows indicate acid fast bacilli (Mtb).

      In figures Fig.7A and Suppl.Fig.9A arrows indicate Mtb expressing fluorescent reporter mCherry. Corresponding figure legends were updated in the revised version.

      Supp Fig 9A, two ROI appear to be outlined in white, not just 1 as the legend says Methods:

      Author: we updated the figure legend.

      Certain items are listed in the Reagents section that are not used in the manuscript, such as necrostatin-1 or Z-VAD-FMK. Please carefully check the methods to ensure extra items or missing items does not occur.

      Author: These experiments were performed, but not included in the final manuscript. Hence, we removed the “necrostatin-1 or Z-VAD-FMK” from the reagents section in methods of revised version.

      Western blot, method of visualizing/imaging bands is not provided, method of quantifying density is not provided, though this was done for fig 5C and should be performed for the other WBs.

      Author: We used GE ImageQuant LAS4000 Multi-Mode Imager to acquire the Western blot images and the densitometric analyses were performed by area quantification using ImageJ. We included this information in the method section. We added the densitometry of Western blot values after normalization above each lane in Fig.2A – C, Fig.3C – D and 3K; Fig.4A – B, Fig5B,C,I,J.

      Reviewer #3 (Significance (Required)):

      The work of Yabaji et al is of high significance to the field of macrophage biology and M.tb pathogenesis in macrophages. This work builds from previously published work (Bhattacharya 2021) in which the authors first identified the aberrant response induced by TNF in sst1 mutant macrophages. Better understanding how macrophages with the sst1 locus respond not only to bacterial infection but stimulation with relevant ligands such as TNF will aid the field in identifying biomarkers for TB, biomarkers that can suggest a poor outcome vs. "cure" in response to antibiotic treatment or design of host-directed therapies. This work will be of interest to those who study macrophage biology and who study M.tb pathogenesis and tuberculosis in particular. This study expands the knowledge already gained on the sst1 locus to further determine how early macrophage responses are shaped that can ultimately determine disease progression. Strengths of the study include the methodologies, employing both bulk and single cell-RNA seq to answer specific questions. Data are analyze using automated methods (such as HALO) to eliminated bias. The experiments are well planned and designed to determine the mechanisms behind the increased iron-related oxidative stress found in the mutant macrophages following TNF treatment. Also, in vivo studies were performed to validate some of the in vitro work. Examining pauci-bacillary lesions vs multi-bacillary lesions and spatial transcriptomics is a significant strength of this work. The inclusion of human data is another strength of the study, showing increased Myc in humans with poor response to antibiotics for TB. Limitations include the fact that the work is all done with BMDMs. Use of alveolar macrophages from the mice would be a more relevant cell type for M.tb studies. AMs are less inflammatory, therefore treatment with TNF of AMs could result in different results compared to BMDMs. Reviewer's field of expertise: macrophage activation, M.tb pathogenesis in human and mouse models, cell signaling Limitations: not qualified to evaluate single cell or bulk RNA-seq technical analysis/methodology or spatial transcriptomics analysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The authors use a mouse model designed to be more susceptible to M.tb (addition of sst1 locus) which has granulomatous lesions more similar to human granulomas, making this mouse highly relevant for M.tb pathogenesis studies. Using WT B6 macrophages or sst1B6 macrophages, the authors seek to understand the how the sst1 locus affects macrophage response to prolonged TNFa exposure, which can occur during a pro-inflammatory response in the lungs. Using single cell RNA-seq, revealed clusters of mutant macrophages with upregulated genes associated with oxidative stress responses and IFN-I signaling pathways when treated with TNF compared to WT macs. The authors go on to show that mutant macrophages have decreased NRF2, decreased antioxidant defense genes and less Sp110 and Sp140. Mutant macrophages are also more susceptible to lipid peroxidation and iron-mediated oxidative stress. The IFN-I pathway hyperactivity is caused by the dysregulation of iron storage and antioxidant defense. These mutant macrophages are more susceptible to M.tb infection, showing they are less able to control bacterial growth even in the presence of T cells from BCG vaccinated mice. The transcription factor Myc is more highly expressed in mutant macs during TNF treatment and inhibition Myc led to better control of M.tb growth. Myc is also more abundant in PBMCs from M.tb infected humans with poor outcomes, suggesting that Myc should be further investigated as a target for host-directed therapies for tuberculosis.

      Major Comments

      Isotypes for IF imaging and confocal IF imaging are not listed, or not performed. It is a concern that the microscopy images throughout the manuscript do not have isotype controls for the primary antibodies. Fig 4 (and later) the anti-IFNAR Ab is used along with the Isotype antibody, Fig 4I does not show the isotype. Use of the isotype antibody is also missing in later figures as well as Fig 3J. Why was this left off as the proper control for the Ab? Conclusions drawn by the authors from some of the WB data are worded strongly, yet by eye the blots don't look as dramatically different as suggested. It would be very helpful to quantify the density of bands when making conclusions. (for example, Fig 4A) Fig 5A is not described clearly. If the gene expression is normalized to untreated B6 macs, then the level of untreated B6 macs should be 1. In the graph the blue bars are slightly below 1, which would not suggest that levels "initially increased and subsequently downregulated" as stated in the text. It seems like the text describes the protein expression but not the RNA expression. Please check this section and more clearly describe the results. Also, why look at RNA through 24h but protein only through 12h? If c-myc transcripts continue to increase through 24h, it would be interesting to see if protein levels also increase at this later time point. Fig 5J the bands look smaller after D-JNK1 treatment at 6 and 12h though in the text is says no change. Quantifying the bands here would be helpful to see if there really is no difference. Section 4, third paragraph, the conclusion that JNK activation in mutant macs drives pathways downstream of Myc are not supported here. Are there data or other literature from the lab that supports this claim? Fig 6N Please provide further rationale for the BCG in vivo experiment. It is unclear what the hypothesis was for this experiment. The in vitro work is all concerning treatment with TNFa and how this exposure modifies the responses in B6 vs sst1B6 macrophages; however, this is not explored in the in vivo studies. Are there differences in TNFa levels in the pauci- vs multi-bacillary lesions that lead to (or correlate with) the accumulation of peroxidation products in the intralesional macrophages. How to the experiments with TNFa in vitro relate back to how the macrophages are responding in vivo during infection?

      Minor comments

      Introduction, while well written, is longer than necessary. Consider shortening this section. Throughout figures, many graphs show a fold induction/accumulation/etc, but it is rarely specified what the internal control is for each graph. This needs to be added. Paragraph one, authors use the phrase "the entire IFN pathway was dramatically upregulated..." seems to be an exaggeration. How do you know the "entire" IFN pathway was upregulated in a dramatic fashion? Figures 1E, G and H and supp fig 1C, the heat maps are missing an expression key Section 2 second paragraph refers to figs 2D, E as cytoplasmic in the text, but figure legend and y-axis of 2E show total protein. Section 3 end of paragraph 1 refers to Fig 3h. Does this also refer to Supp Fig 3E? Supplemental Fig 3 legend for C-E seems to incorrectly also reference F and G. Fig 3K, the p-cJun was inhibited with the JNK inhibitor, however its unclear why this was done or the conclusion drawn from this experiment. Use of the JNK inhibitor is not discussed in the text. Fig 4 I and Supp Fig 3 H seem to have been swapped? The graph in Fig 4I matches the images in Supp Fig 3I. Please check.<br /> Fig 6, it is unclear what % cell number means. Also for bacterial growth, the data are fold change compared to what internal control? Fig 7B needs an expression key Supp Fig 7 and Supp Fig 8A, what do the arrows indicate? Supp Fig 9A, two ROI appear to be outlined in white, not just 1 as the legend says Methods: Certain items are listed in the Reagents section that are not used in the manuscript, such as necrostatin-1 or Z-VAD-FMK. Please carefully check the methods to ensure extra items or missing items does not occur. Western blot, method of visualizing/imaging bands is not provided, method of quantifying density is not provided, though this was done for fig 5C and should be performed for the other WBs.

      Significance

      The work of Yabaji et al is of high significance to the field of macrophage biology and M.tb pathogenesis in macrophages. This work builds from previously published work (Bhattacharya 2021) in which the authors first identified the aberrant response induced by TNF in sst1 mutant macrophages. Better understanding how macrophages with the sst1 locus respond not only to bacterial infection but stimulation with relevant ligands such as TNF will aid the field in identifying biomarkers for TB, biomarkers that can suggest a poor outcome vs. "cure" in response to antibiotic treatment or design of host-directed therapies. This work will be of interest to those who study macrophage biology and who study M.tb pathogenesis and tuberculosis in particular. This study expands the knowledge already gained on the sst1 locus to further determine how early macrophage responses are shaped that can ultimately determine disease progression. Strengths of the study include the methodologies, employing both bulk and single cell-RNA seq to answer specific questions. Data are analyze using automated methods (such as HALO) to eliminated bias. The experiments are well planned and designed to determine the mechanisms behind the increased iron-related oxidative stress found in the mutant macrophages following TNF treatment. Also, in vivo studies were performed to validate some of the in vitro work. Examining pauci-bacillary lesions vs multi-bacillary lesions and spatial transcriptomics is a significant strength of this work. The inclusion of human data is another strength of the study, showing increased Myc in humans with poor response to antibiotics for TB. Limitations include the fact that the work is all done with BMDMs. Use of alveolar macrophages from the mice would be a more relevant cell type for M.tb studies. AMs are less inflammatory, therefore treatment with TNF of AMs could result in different results compared to BMDMs.

      Reviewer's field of expertise: macrophage activation, M.tb pathogenesis in human and mouse models, cell signaling

      Limitations: not qualified to evaluate single cell or bulk RNA-seq technical analysis/methodology or spatial transcriptomics analysis

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Yabaji et al. investigated the effects of BMDMs stimulated with TNF from both WT and B6.Sst1S mice, which have previously been identified to contain the sst1 locus conferring susceptibility to Mycobacterium tuberculosis. They identified that B6.Sst1S macrophages show a superinduction of IFNß, which might be caused by increased c-Myc expression, expanding on the mechanistic insights made by the same group (Bhattacharya et al. 2021). Furthermore, prolonged TNF stimulation led to oxidative stress, which WT BMDMs could compensate for by the activation of the antioxidant defense via NRF2. On the other hand, B6.Sst1S BMDMs lack the expression of SP110 and SP140, co-activators of NRF2, and were therefore subjected to maintained oxidative stress. Yabaji et al. could link those findings to in vivo studies by correlating the presence of stressed and aberrantly activated macrophages within granulomas to the failure of Mtb control, as well as the progression towards necrosis. As the knowledge regarding Mtb progression and necrosis of granulomas is not yet well understood, findings that might help provide novel therapy options for TB are crucial.

      Overall, the manuscript has interesting findings with regard to macrophage responses in Mycobacteria tuberculosis infection.

      However, in its current form there are several shortcomings, both with respect to the precision of the experiments and conclusions drawn.

      In particular a) important controls are often missing, e.g. T-cells form non-immune mice in Fig. 6J, in F, effectivity of BCG in B6 mice in 6N; b) single experiments are shown throughout the manuscript, in particular western blots and histology without proper quantification and statistics, this is absolutely not acceptable; c) very few repetitions are shown in in vitro experiments, where there is no evidence for limitation in resources (usually not more than 3), it is not clear what "independent experiment means" - i.e. the robustness of the findings is questionable; d) data are often normalized multiple times, e.g. in the case of qPCR, and the methods of normalization are not clear (what house-keeping gene exactly?);

      Moreover, experiments regarding IFN I signaling (e.g. short term TNF treatment of BMDMs to analyze LPO, making sure that the reporter mouse for IFNß works in vivo) and c-Myc (e.g. the increase after M-CSF addition might impact on other analysis as well and the experiments should be adjusted to control for this effect; MYC expression in the human samples) should be carefully repeated and evaluated to draw correct conclusions.

      In addition, we would like to strongly encourage the authors to more precisely outline the experimental set-ups and figure legends, so that the reader can easily understand and follow them. In other words: The legends are - in part very - incomplete. In addition, the authors should be mindful of gene names vs. protein names and italicize where appropriate.

      Finally, it is necessary that the connection to several overlapping preprints by the same author group is outlined, e.g. to https://www.biorxiv.org/content/10.1101/2020.12.14.422743v1.full.

      part very - incomplete. In addition, the authors should be mindful of gene names vs. protein names and italicize where appropriate.

      Finally, it is necessary that the connection to several overlapping preprints by the same author group is outlined, e.g. to https://www.biorxiv.org/content/10.1101/2020.12.14.422743v1.full.

      Specific comments to the experiments and data:

      • Fig. 1E: Evaluation of differences in up- and downregulation between B6 and B6.Sst1S cells should highlight where these cells are within the heatmap, as it is only labelled with the clusters, or it should be depicted differently (in particular for cluster 1 and 2). Furthermore, a more simple labelling of the pathways would increase the readability of the data
      • Fig. 2D, E: The staining legend is missing. For the quantification it is not clear what % total means. Is this based on the intensity or area? What do the dots represent in the bar chart? Is one data point pooled from several pictures? If not, the experiments need to be repeated, as three pictures might not be representative for evaluation.
      • Fig. 2E: Statistics comparing B6/ B6,SsT1S with TNF (different) is required: Absence of induction is not a proof for a difference!
      • Fig. 3E: Positive and negative control need to be depicted in the figure (see legend).
      • Fig. 3I: A quantification by flow cytometry or total cell counts are important, as 6% cell death in cell culture is a very modest observation. Otherwise, confocal images of the quantification would be a good addition to judge the specificity of the viability staining.
      • Fig. 3I, J: What does one dot represent?
      • Fig. 3K,L: For the B6 BMDMs it seems that p-cJun is highly increased at 12h in (L), while it is not in (K). On the other hand, for the B6.Sst1S BMDMs it peaks at 24h in (K), while in (L) it seems to at 12h. According to the data in (L) it seems that p-cJun is rather earlier and stronger activated in B6 BMDMs and has a weakened but prolonged activation in the B6.Sst1S BMDMs, which would not fit with your statement in the text that B6.Sst1S BMDMs show an upregulation. !These experiments need repetitions and quantification and statistiscs!
      • Figure 3J: the isotype control for the IFNAR antibody is missing
      • Fig. 3L: ASK1 seems to be higher at 12h for the B6 BMDMs and similar for both lines at 24h, which is not fitting to the statement in the text. ("Also, the ASK1 - JNK - cJun stress kinase axis was upregulated in B6.Sst1S macrophages, as compared to B6, after 12 - 36 h of TNF stimulation")
      • Fig.4A - C: "IFNAR1 blockade, however, did not increase either the NRF2 and FTL protein levels, or the Fth, Ftl and Gpx1 mRNA levels above those treated with isotype control antibodies" Maybe not above the isotype but it is higher than the TNF alone stimulation at least for NRF2 at 8h and for Ftl at both time points. Why does the isotype already cause stimulation/induction of the cells? !These experiments need repetitions and quantification and statistics!
      • Figure 4C and subsequent: How exactly was the experiment done (house-keeping gene)?
      • Figure 4D,E: Information on cells used is missing. Why the change in stimulation time? Did it not work after 12h? Then the experiments in A-C should be repeated for 16h.
      • Figure 4E: It seems the isotype control itself has already an effect in the reduction of IFNb.
      • Figure 4E: It would be helpful to see if these transcripts are actually translated into protein levels, e.g. perform an ELISA. Authors state that IFNAR blockages does not alter the expression but you statistic says otherwise.
      • Fig. 4F: To what does the fold induction refer to? If it is again to unstimulated cells, then why is the induction now so much higher than in (E) where it was only 50x (now to 100x).
      • Figure 4G: Again to what is the fold induction referring to? It seems your Fer-1 treatment only contains 2 data points. This needs to be fixed.
      • Fig. 4H: It seems that the Isotype control antibody had an effect to increase 4-HNE (compared to TNF stimulated only). Was the AB added also at 12h post stimulation? Figure legend should be adjusted.
      • Figure 4I: How was the data measured here, i.e. what is depicted? The isotype control is missing. It seems a two-way ANOVA was used, yet it is stated differently. The figure legend should be revised, as Dunnett's multiple comparison would only check for significances compared to the control.
      • "These data suggest that type I IFN signaling does not initiate LPO in our model but maintains and amplifies it during prolonged TNF stimulation that, eventually, may lead to cell death". Data for a short term TNF stimulation are not shown, however, so it might impact also on the initiation of LPO.
      • The data for Ifnb expression (or better protein level) should be provided for B6 BMDMs as well.
      • "A select set of mouse LTR-containing endogenous retroviruses (ERV's) (Jayewickreme et al, 2021), and non-retroviral LINE L1 elements were expressed at a basal level before and after TNF stimulation, but their levels in the B6.Sst1S BMDMs were similar to or lower than those seen in B6". This sentence should be revised as the differences between B6 and B6.Sst1S BMDMs seem small and are not there after 48h anymore. Are these mild changes really caused by the mutation or could they result from different housing conditions and/or slowly diverging genetically lines. How many mice were used for the analysis? Is there already heterogeneity between mice from the same line?
      • The overall conclusion drawn from Fig. 3 and 4 is not really clear with regard that IFN does not initiate LPO. Where is that shown? Data on earlier stimulation time points should be added to make this clear.
      • Fig. 5A: Indeed, it even seems that Myc is upregulated for the mutant BMDMs. Yet, there are only 2 data points for B6 12h. !These experiments need repetitions and quantification and statistics!
      • Fig. 5B: Why would the protein level decrease in the controls over 6h of additional cultivation? Is this caused by fresh M-CSF? In this case maybe cells should be left to settle for one day before stimulating them to properly compare c-Myc induction. Comment on two c-Myc bands is needed. At 12h only the upper one seems increased for TNF stimulated mutant BMDMs compared to B6 BMDMs
      • Fig. 5A,B: It seems that not all the RNA is translated into protein, as c-Myc at 12h in the mutant BMDMs seems to be lower than at 6h, while the gene expression implicates it vice versa.
      • Fig. 5J: Indeed the inhibitor seems to cause the downregulation of the proteins. Explanation?
      • "TNF stimulation tended to reduce the LPO accumulation in the B6 macrophages and to increase it in the B6.Sst1S ones" However, this is not apparent in Sup. Fig. 6B. Here it seems that there might be a significant increase.
      • Fig. 6B: Mtb and 4-HNE should be shown in two different channels in order to really assign each staining correctly. What time point is this? Are the mycobacteria cleared at MOI1, since it looks that there are fewer than that? How does this look like for the B6 BMDMs? Are there even less mycobacteria?
      • Fig 6E: In the context of survival a viability staining needs to be included, as well as the data from day 0. Then it needs to be analyzed whether cell numbers remain the same from D0 or if there is a change.
      • "The 3D imaging demonstrated that YFP-positive cells were restricted to the lesions, but did not strictly co-localize with intracellular Mtb, i.e. the Ifnb promoter activity was triggered by inflammatory stimuli, but not by the direct recognition of intracellular bacteria. We validated the IFNb reporter findings using in situ hybridization with the Ifnb probe, as well as anti-GFP antibody staining (Suppl.Fig.8B - E)." The colocalization is not present within the tissue sections. It seems that the reporter line does not show the same staining pattern in vivo as the IFNß probe or the anti GFP antibody staining. The reporter line has to be tested for the specificity of the staining. Furthermore, to state that it was restricted to the lesions, an uninvolved tissue area needs to be depicted.
      • Are paucibacillary and multibacillary lesions different within the same animal or does one animal have one lesion phenotype? If that is the case, what is causing the differences between mice? Bacterial counts for the mice are required.
      • "Among the IFN-inducible genes upregulated in paucibacillary lesions were Ifi44l, a recently described negative regulator of IFN-I that enhances control of Mtb in human macrophages (DeDiego et al, 2019; Jiang et al, 2021) and Ciita, a regulator of MHC class II inducible by IFNy, but not IFN-I (Suppl.Table 8 and Suppl.Fig.10 D-E)." Why is Sup. Fig. 10 D, E referred to? The figure legend is also not clear, e.g. what means "upregulated in a subset of IFN-inducible genes"? Input for the hallmarks needs to be defined.
      • Fig. 7C: Single channel pictures are required as it is hard to see the differences in staining with so many markers. Why is there no iNOS expression in the bottom row? What does the rectangle indicate on the bottom right? As black is chosen for DAPI, it is not visible at all. In case the signal is needed a visible a color should be chosen.
      • "In the advanced lesions these markers were primarily expressed by activated macrophages (Iba1+) expressing iNOS and/or Ifny (YFP+)(Fig.7D)" Iba1 is needed in the quantification. Based on the images, iNOS seems to be highly produced in Iba1 negative cells. Which cells do produce it then? Flow cytometry data for this quantification are required This would allow you to specifically check which cells express the markers and allow for a more precise analysis of double positive cells.
      • Results part 6: In general, can you please state for each experiment at what time point mice were analyzed? You should include an additional macrophage staining (e.g. MerTK, F4/80), as alveolar macrophages are not staining well for Iba1 and you might therefore miss them in your IF microscopy. It would be very nice if you could perform flow cytometry to really check on the macrophages during infection and distinguish subsets (e.g. alveolar macrophages, interstitial macrophages, monocytes)
      • Spatial sequencing: The manuscript would highly profit from more data on that. It would be very interesting to check for the DEGs and show differential spatial distribution. Expression of marker genes should be inferred to further define macrophage subsets (e.g. alveolar macrophages, interstitial macrophages, recruited macrophages) and see if these subsets behave differently within the same lesion but also between the lesions. Additional bioinformatic approaches might allow you to investigate cell-cell interactions. There is a lot of potential with such a dataset, especially from TB lesions, that would elevate your findings and prove interesting to the TB field.
      • "Thus, progression from the Mtb-controlling paucibacillary to non-controlling multibacillary TB lesions in the lungs of TB susceptible mice was mechanistically linked with a pathological state of macrophage activation characterized by escalating stress (as evidenced by the upregulation phospho-cJUN, PKR and Chac1), the upregulation of IFNβ and the IFN-I pathway hyperactivity, with a concurrent reduction of IFNγ responses." To really show the upregulation within macrophages and their activation, a more detailed IF microscopy with the inclusion of additional macrophage markers needs to be provided. Flow cytometry would enable analysis for the differences between alveolar and interstitial macrophages, as well as for monocytes. As however, it seems that the majority of iNOS, as well as the stress associated markers are not produced by Iba1+ cells. Analyzing granulocytes and T lymphocytes should be considered.
      • It's mentioned in the method section that controls in the IF staining were only fixed for 10min, while the infected cells were fixed for 30min. Consistency is important as the PFA fixation might impact on the fluorescence signal. Therefore, controls should be repeated with the same fixation time.
      • Reactive oxygen species levels should be determined in B6 and B6.Sst1S BMDMs (stimulated and unstimulated), as they are very important for oxidative stress.
      • Sup. Fig 2C: The inclusion of an unstimulated control would be advisable in order to evaluate if there are already difference in the beginning.
      • Sup. Fig. 3F: Why is the fold change now lower than in Fig. 4D (fold change of around 28 compared to 120 in 4D)?
      • Sup. Fig. 5C, D: The data seems very interesting as you even observe an increase in gene expression. Data for the B6 mice should be evaluated for increase to a similar level as the TNF treated mutants. Data on the viability of the cells are necessary, as they no longer receive M-CSF and might be dying at this point already.
      • Sup. Fig 12: the P-c-Jun picture for (P) is not the same as in the merged one with Iba1. Double positive cells are mentioned to be analyzed, but from the staining it appears that P-c-Jun is expressed by other cells. You do not indicate how many replicates were counted and if the P and M lesions were evaluated within the same animal. What does the error bar indicate? It seems unlikely from the plots that the double positive cells are significant. Please provide the p values and statistical analysis.
      • Sup. Fig. 13D: What about the expression of MYC itself? Other parts of the signaling pathway should be analyzed(e.g. IFNb, JNK)?
      • In the mfIHC you he usage of anti-mouse antibodies is mentioned. Pictures of sections incubated with the secondary antibody alone are required to exclude the possibility that the staining is not specific. Especially, as this data is essential to the manuscript and mouse-anti-mouse antibodies are notorious for background noise.
      • In order to tie the story together, it would be interesting to treat infected mice with an INFAR antibody, as well as perform this experiment with a Myc antibody. According to your data, you might expect the survival of the mice to be increased or bacterial loads to be affected.
      • It is surprising that you not even once cite or mention your previous study on bioRxiv considering the similarity of the results and topic (https://doi.org/10.1101/2020.12.14.422743). Is not even your Figure 1I and Figure 2 J, K the same as in that study depicted in Figure 4?
      • Please revise spelling of the manuscript and pay attention to write gene names in italics

      Minor points:

      • Fig. 1: Please provide some DEGs that explain why you used this resolution for the clustering of the scRNAseq data and that these clusters are truly distinct from each other.
      • Fig. 1F: What do the two lines represent (magenta, green)?
      • Fig. 1F, G: Why was cluster 6 excluded?
      • Fig. 1E, G, H: The intensity scales are missing. They are vital to understand the data.
      • Fig. 2G-I: please revise order, as you first refer to Fig. 2H and I
      • Fig. 5: You say the data represents three samples but at least in D and E you have more. Please revise. Why do you only include at (G) the inhibitor only control?
      • Figure 7A, Sup. Fig. 8: Are these maximum intensity projection? Or is one z-level from the 3D stack depicted?
      • Fig. 7B: What do the white boxes indicate?
      • Sup. Fig. 1A: The legend for the staining is missing
      • Sup. Fig. 1B: The feature plots are not clear: The legend for the expression levels is missing. What does the heading means?
      • Sup. Fig. 3C: The scale bar is barely visible.
      • Sup. Fig. 3D: There is not figure legend or the legend to C-E is wrong.
      • Sup. Fig. 3F, G: You do not state to what the data is relative to.
      • Sup. Fig. 3H: It seems you used a two-way ANOVA, yet state it differently. Please revise the figure legend, as Dunnett's multiple comparison would only check for significances compared to the control.
      • Sup. Fig. 4A, B: It is not clear what the lines depict as the legend is not explained. Names that are not required should be changed to make it clear what is depicted (e.g. "TE@" what does this refer to?)
      • Sup. 4B: What does the y-scale on the right refer to?
      • Sup. 4C: Interpretation of the data is highly hindered by the fact that the scales differ between the B6 and B6.Sst1. The scales are barely visible.
      • Sup. Fig. 5A, B: Is the legend correct? Did you add the antibody for 2 days or is the quantification from day 3?
      • Sup. Fig. 8A: Are the "early" and "intermediate" lesions from the same time points? What are the definitions for these stages?
      • Sup. Fig. 8E: You should state that the bottom picture is an enlargement of an area in the top one. Scale bars are missing.
      • Sup. Fig. 11A: The IF staining is only visible for Iba and iNOS. Please provide single channels in order to make the other staining visible.
      • Sup. Fig. 13A: Your axis label is not clear. What do the numbers behind the genes indicate? Why did you chose oncogene signatures and not inflammatory markers to check for a correlation with disease outcome?
      • Sup. 13D: Maybe you could reorder the patients, so that the impression is clearer, as right now only the top genes seem to show a diverging gene signature, while the rest gives the impression of an equal distribution.

      • The scale bars for many microscopy pictures are missing.

      • The black bar plots should be changed (e.g. in color), since the single data points cannot be seen otherwise.
      • It would be advisable that a consistent color scheme would be used throughout the manuscript to make it easier to identify similar conditions, as otherwise many different colours are not required and lead right now rather to confusion (e.g. sometimes a black bar refers to BMDMs with and sometimes without TNF stimulation, or B6 BMDMs). Furthermore, plot sizes and fonts should be consistent within the manuscript (including the supplemental data)

      Within the methods section:

      • At which concentration did you use the IFNAR antibody and the isotype?
      • Were mice maintained under SPF conditions? At what age where they used?
      • The BMDM cultivation is not clear. According to your cited paper you use LCCM but can you provide how much M-CSF it contains? How do you make sure that amounts are the same between experiments and do not vary? You do not mention how you actually obtain this conditioned medium. Is there the possibility of contamination or transferred fibroblasts that would impact on the data analysis? Is LCCM also added during stimulation and inhibitor treatment?
      • How was the BCG infection performed? How much bacteria did you use? Which BCG strain was used?
      • At what density did you seed the BMDMs for stimulation and inhibitor experiments?
      • What machine did you use to perform the bulk RNA sequencing? How many replicates did you include for the sequencing?
      • How many replicates were used for the scRNA sequencing? Why is your threshold for the exclusion of mitochondrial DNA so high? A typical threshold of less than 5% has been reported to work well with mouse tissue.
      • You do not mention how many PCAs were considered for the scRNA sequencing analysis.
      • You should name all the package versions you used for the scRNA sequencing (e.g. for the slingshot, VAM package)
      • You mention two batches for the human samples. Can you specify what the two batches are?
      • At which temperature was the IF staining performed?

      Significance

      Overall, the manuscript has interesting findings with regard to macrophage responses in Mycobacteria tuberculosis infection.

      However, in its current form there are several shortcomings, both with respect to the precision of the experiments and conclusions drawn.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The study by Yabaji et al. examines macrophage phenotypes B6.Sst1S mice, a mouse strain with increased susceptibility to M. tuberculosis infection that develops necrotic lung lesions. Extending previous work, the authors specifically focus on delineating the molecular mechanisms driving aberrant oxidative stress in TNF-activated B6.Sst1S macrophages that has been associated with impaired control of M. tuberculosis. The authors use scRNAseq of bone marrow-derived macrophages to further characterize distinctions between B6.Sst1S and control macrophages and ascribe distinct trajectories upon TNF stimulation. Combined with results using inhibitory antibodies and small molecule inhibitors in in vitro experimentation, the authors propose that TNF-induced protracted c-Myc expression in B6.Sst1S macrophages disables the cellular defense against oxidative stress, which promotes intracellular accumulation of lipid peroxidation products, fueled at least in part by overexpression of type I IFNs by these cells. Using lung tissue sections from M. tuberculosis-infected B6.Sst1S mice, the authors suggest that the presence of a greater number of cells with lipid peroxidation products in lung lesions with high counts of stained M. tuberculosis are indicative of progressive loss of host control due to the TNF-induced dysregulation of macrophage responses to oxidative stress. In patients with active tuberculosis disease, the authors suggest that peripheral blood gene expression indicative of increased Myc activity was associated with treatment failure.

      Major comments

      The authors describe differences in protein expression, phosphorylation or binding when referring to Fig 2A-C, 2G, 3D, 5B, 5C. However, such differences are not easily apparent or very subtle and, in some cases, confounded by differences in resting cells (e.g. pASK1 Fig 3L; c-Myc Fig 5B) as well as analyses across separate gels/blots (e.g. Fig 3K, Fig 5B). Quantitative analyses across different independent experiments with adequate statistical analyses are required to strengthen the associated conclusions.

      The representative images of fluorescence microscopy in Fig 3H, 4H, 5H, S3C, S3I, S5A, S6A seem to suggest that under some conditions the fluorescence signal is located just around the nucleus rather than absent or diminished from the cytoplasm. It is unclear whether this reflects selective translocation of targets across the cell, morphological changes of macrophages in culture in response to the various treatments, or variations in focal point at which images were acquired. Control images (e.g. cellular actin, DIC) should be included for clarification. If cell morphology changes depending on treatments, how was this accounted for in the quantitative analyses? In addition, negative controls validating specificity of fluorescence signals would be warranted.

      To interpret the evaluation on the hierarchy of molecular mechanisms in B6.Sst1S macrophages, comparative analyses with B6 control cells should be included (e.g. Fig 4C-I, Fig 5, Fig 6B, E-M, S6C, S6E-F). This will provide weight to the conclusions that the dysregulated processes are specifically associated with the susceptibility of B6.Sst1S macrophages.

      All experiments using inhibitory antibodies require comparison to the effect of a matched isotype control in the same experiment (e.g. Fig 3J, 4F, G, I; 6L, 6M, S3G, S6F).

      Experiments using inhibitors require inclusion of an inhibitor-only control to assess inhibitor effects on unstimulated cells (e.g. Fig 4I, 5D-I)

      Fig 3K and Fig 5J appear to contain the same images for p-c-Jun and b-tubulin blots.

      Data of TNF-treated cells in Fig 3I appear to be replotted in Fig 3J.

      It is stated that lungs from 2 mice with paucibacillary and 2 mice with multi-bacillary lesions were analyses. There is contradicting information on whether these tissues were collected at the same time post infection (week 14?) or whether the pauci-bacillary lesions were in lungs collected at earlier time points post infection (see Fig S8A). If the former, how do the authors conclude that multi-bacillary lesions are a progression from paucibacillary lesions and indicative of loss of M. tuberculosis control, especially if only one lesion type is observed in an individual host? If the latter, comparison between lesions will likely be dominated by temporal differences in the immune response to infection.<br /> In either case, it is relevant to consider density, location, and cellular composition of lesions (see also comments on GeoMx spatial profiling). Is the macrophage number/density per tissue area comparable between pauci-bacillary and multi-bacillary lesions? Does 4HNE staining align with macrophages and if so, is it elevated compared to control mice and driven by TNF in the susceptible vs more resistant mice?

      It would be relevant to state how many independent lesions per host were sampled in both the multiplex IHC as well as the GeoMx data. Can the authors show the selected regions of interest in the tissue overview and in the analyses to appreciate within-host and across-host heterogeneity of lesions. The nature of the spatial transcriptomics platform used is such that the data are derived from tissue areas that contain more than just Iba1+ macrophages. At later stages of infection, the cellular composition of such macrophage-rich areas will be different when compared to lesions earlier in the infection process. Hence, gene expression profiles and differences between tissue regions cannot be attributed to macrophages in this tissue region but are more likely a reflection of a mix of cellular composition and per-cell gene expression.

      It is stated that loss of control of M. tuberculosis in multibacillary lesions was associated with "downregulation of IFNg-inducible genes". If the authors base this on the tissue expression of individual genes, this requires further investigation to support such conclusion (also see comment on GeoMx above). Furthermore, how might this conclusion be compatible with significantly elevated iNOS+ cells (Fig 7D) in multibacillary lesions?

      It is appreciated that the human blood signature analyses contain Myc-signatures but the association with treatment failure is not very strong based on the data in Fig 13B and C. The authors indicate that they have no information on disease severity, but it should perhaps not be assumed that treatment failure is indicative of poor host control of the infection. Perhaps independent analyses in separate cohort/data set can add strength and provide additional insights (e.g. PMID: 35841871; PMID: 32451443, PMID: 17205474, PMID: 22872737).

      In addition, the human data analyses could be strengthened by extension to additional signatures such as IFN, TNF, oxidative stress. Details of the human study design are not very clear and are lacking patient demographics, site of disease, time of blood collection relative to treatment onset, approving ethics committees.

      Other comments

      It is excellent that the authors provide individual data points. Choosing a colour other than black would increase clarity when black bars are used.

      Error bars are inconsistently depicted as either bi-directional or just unidirectional.

      Fig 1E, G, H- please include a scale to clarify what the heat map is representing.

      Fig 2K, Fig S10A gene information cannot be deciphered.

      Fig S4A,B please add error bars.

      Fig S4C labelling of the graphs is too small to appreciate and the axes between WT and mutant seem to vary.

      Please use gene names as per convention (e.g. Ifnb1) to distinguish gene expression from protein expression in figures and text.

      Fig S8B. Contrary to the description of results, there seems to be minimal overlap between the signal for YFP and the Ifnb1 probe.

      Please clarify what is meant by "normal interstitium" ? If the tissue is from uninfected mice, please state clearly.

      Is the Ifnb1 reporter mouse a legacy reporter? If so, it is worth stating this and including such considerations in the data interpretation.

      If macrophage cultures underwent media changes every 48h, how was loss of liberated Mtb taken into account especially if differences in cell density/survival were noted?

      The assessment of M. tuberculosis load by qPCR is not well described. In particular, the method of normalization applied within the experiments (not within the qPCR) here remains unclear, even with reference to the authors' prior publication.

      Please add citation for the limma package.

      The description of methodology relating to the "oncogene signatures" is unclear.

      Please clearly state time points post infection for mouse analyses.

      Reference is made to "a list of genes unique to type I [interferon] genes [....]" (p29). Can the authors indicate the source of the information used for compiling this list?

      The discussion at present is very long, contains repetition of results and meanders on occasion.

      Significance

      Strengths and limitations

      Strengths: multi-pronged analysis approaches for delineating molecular mechanisms of macrophage responses that might underpin susceptibility to M. tuberculosis infection; integration of mouse tissues and human blood samples

      Weaknesses: not all conclusions supported by data presented; some concerns related to experimental design and controls; links between findings in human cohort and the mechanistic insights gained in mouse macrophage model uncertain

      Advance

      The study has the potential to advance molecular understanding of the TNF-driven state of oxidative stress previously observed in B6.Sst1S macrophages and possible implications for host control of M. tuberculosis in vivo.

      Audience

      Experts seeking understanding of host factors mediating M. tuberculosis control, or failure thereof, with appreciation for the utility of the featured mouse model in assessing TB diseases progression and severe manifestation. Interest is likely extended to audience more broadly interested in TNF-driven macrophage (dys)function in infectious, inflammatory, and autoimmune pathologies.

      Reviewer expertise

      In preparing this review, I am drawing on my expertise in assessing macrophage responses and host defense mechanisms in bacterial infections (incl. virulent M. tuberculosis) through in vitro and in vivo studies. This includes but is not limited to macrophage infection and stimulation assays, microscopy, intra-macrophage replication of M. tuberculosis, analyses of lung tissues using multi-plex IHC and spatial transcriptomics (e.g. GeoMx). I am familiar with the interpretation of RNAseq analyses in human and mouse cells/tissues, but can provide only limited assessment of appropriateness of algorithms and analysis frameworks.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      Major Comments:

      1. The data in the paper strongly suggests that the new copper shuttles are selective for copper and have faster binding kinetics (Fig 1) than the previous one. However, the data regarding the copper shuttling from the copper(Aβ) peptides is not very convincing. It appears to be due to the Cu effect alone (Fig.3), as the reduction in viability with Cu(II)+ AscH- is almost the same as the Cu(II)(Aβ)+AscH-. To convincingly show that the peptide shuttle can strip copper from (Aβ) peptides, the authors need to show that the copper is bound to the (Aβ) peptide before it is used in the experiment. Rightfully so, the effect of the toxicity of Cu(II)+ AscH- is similar to that of Cu(II)(Aβ16)+AscH-. This is due to the fact that Aβ16 is not toxic to the cells, so therefore there is no compounded effect of Cu and Aβ16 as seen for Cu(II)(Aβ40). As for the toxicity of Cu(II)+ AscH-, it is be similar to Cu(II)(Aβ)+AscH- because Cu(II) will be bound to a weaker ligand in the medium and such loosely bound Cu is also able to produce ROS with AscH- with similar rates as Cu-Ab.

      Data from our lab and others have shown that in HEPES solution at pH 7.4, Aβ forms a complex with Cu. The present work is also in line with Cu-binding to Ab, as in Figure 1C (GSH), the rate of Cu withdrawal by the shuttle can only be explained by Cu bound to Ab, as Cu in the buffer binds to the shuttle much faster. Also, the AscH- consumption rate measured in Fig S5D-E are congruent of Cu bound to Ab, unbound Cu has a much faster rate of AscH- consumption (Santoro et al. 2018, doi.org/10.1039/C8CC06040A).

      The concentrations of Aβ and Cu used in our experimental condition were determined with a UV-Vis spectrophotometer.

      Minor comments:

      1. The paper does not cite Figure 1A and some supplementary figures, especially Supp. Fig. 1-2. All the figures and supplementary figures should be cited. This has been rectified for all the concerned figures.

      The data presentation in Figures 3B and S8 is confusing."-" signs indicate no addition or the blank box means no addition. Also, the AKH-αR5W4 has no "-" sign in the first bar. For clarity, please indicate the -, +, or no sign means in the figure legends. Also, what does "Batch A" refer to in Figure 3B?

      The figures have been modified as suggested by the reviewer.

      Page 7, correct (Error! Referencesource not found.Figure 1C).

      This has been rectified.

      The Giantin staining in Figure 2B is making it hard to visualize ATP7A trafficking. If the Giantin image overlay is removed, it may be easier to see the movement of ATP7A from the perinuclear region to the vesicles.

      The images have been modified to better appreciate the ATP7A change in distribution upon the increase in intracellular Cu level. We have reduced the number of conditions for which images are provided and provided individual staining for clarity. Zoomed images are also provided. The remainder of the conditions are in Figure S7B

      In the introduction, the authors mention, "These molecules have, however, a major pitfall as is seen for Elesclemol, a candidate for Menkes disease treatments 32. The authors cite reference " Tsvetkov, P. et al. Copper induces cell death by targeting lipoylated TCA cycle proteins." The paper showing elesclomol as a candidate for Menkes disease treatments is Guthrie L et al., Elesclomol alleviates Menkes pathology and mortality by escorting Cu to cuproenzymes in mice. Science. 2020.

      We thank the reviewer for pointing this out, which was apparently not clearly explained. Our intention here was to show that a major pitfall of shuttles like Elesclomol, as seen in the study by Tsvetkov, P. et al. Science (2022), is cuprotoxicity. The sentence has been clarified and the work of Guthrie L et al is cited for Elesclomol as a candidate for Menkes disease.

      Reviewer #2 :

      Major issues:

      1. This reviewer is not convinced that the authors' experimental system is well suited for studies of glia activation and protective effects. With the exception of a couple of panels it is very hard to see differences. The authors should significantly improve the quality of images in Figure 5 to make this set of data convincing. We thank the reviewer for his/her detailed evaluation and for bringing to light the quality of the image in Figure 5. We have therefore improved the quality of the images by improving the signal to noise ratio to better show the differences between conditions.

      Similarly, the quality of giantin staining is low and needs to be improved and more experimental details are needed (see details below).

      As stated in our answer to reviewer 1, the images have been modified to better appreciate ATP7A redistribution upon increase of intracellular Cu levels. We have reduced the number of conditions for which images are provided and provided individual staining for clarity. Zoomed images are also provided. The remainder of the conditions are in Figure S7B.

      Given that shuttles are found within vesicles, the authors should discuss the mechanism through which Cu is released into the cytosol to trigger ATP7B trafficking.

      The mechanism of Cu escape from endosomes remains poorly understood. However, supported by our recent observations that Cu quickly (within 10 min) dissociates from the Cu-shuttle AKH-αR5W4NBD in endosomes (Okafor et al., 2024, /doi.org/10.3389/fmolb.2024.1355963), we discuss the potential involvement CTR1/2 and DMT1 (page 16).

      There are numerous small writing issues that make paper difficult to read. The authors are encouraged to carefully edit their manuscript.

      We thank the reviewer for pointing this out and several errors have been corrected whereas various sentences have been clarified.

      Minor issues

      * „A solution of monomerized Aβ complex in 10% DMEM (diluted with DMEM salt solution) was prepared in microcentrifuge tubes" - here and further the description of media composition is confusing What is the rest 90%?

      This has been rectified. The composition of the salt solution that makes up the 90% has been provided (page 4).

      * „Afterwards, AscH- was added to the tubes and vortexed, the mixture was then added to PC12 cells" - concentration of ascorbate is mentioned only once (later in the figure legend) where it can be barely found, also without explaining the choice of concentration. Additionally, ascorbate's product code is not listed. Please, correct.

      These points have been rectified.

      * Description of the cell (PC12 line) handling conditions is absent (growth medium, passage number used etc) and should be included.

      This information is now provided.

      * ATP7A delocalization assay. Details for the secondary antibodies are absent (full name (e.g. AlexaFluor 488), manufacturer, code) and should be added.

      Missing information has been added.

      * page 6: „Next, we investigated the capacity of the shuttles to withdraw Cu(II) from cell culture media, DMEM 10% and DMEM/F12 1:1 (D/F)." Here and further explanation is needed why the mixture of DMEM/F12 is needed (F12 is also not listed in the materials list).

      DMEM/F12 is a media that is commercially available used for some cell types, and it has been added to the materials list (page 4).

      * Page 7. Legend to the figure 1B: „Conditions: Cu(II)=AKH-αR5W4NBD=DapHH-αR5W4NBD=HDapH-αR5W4NBD= 5 μM, DMEM 10%, D/F 100%, 25{degree sign}C, n=3." - „DMEM/F12" ratio equals to „100%" is confusing, please clarify

      This has been clarified.

      * Page 8-9. Legend to the Figure 2A. „Similar observations were obtained with 5 different cell cultures." Same remark goes to the legend to supplementary figure 7 ("Similar observations were obtained with at least 3 different cell cultures"). Do the authors mean independent experiments or different cell lines? Please clarify. If different cell lines, consider including these data into the supplement.

      Indeed we meant independent experimentations. This has been clarified.

      * Page 8-9, figure 2B. Giantin is a cis-golgi marker, which should localize perinuclearly. In the cells shown the signal is diffuse and appears non-specific. Please improve the quality.

      We have reduced the number of conditions for which images are provides and are providing individual staining for clarity. Zoomed images are also provided allowing visualization of the typical cis-Golgi distribution of Giantin.

      * Page 8-9, figure 2B. ATP7A is shown in green. The authors did not specify the secondary antibody has been used for it. If the secondary antibody used for labeling of ATP7A has green fluorescence then how does one distinguish between the transporter signal and signal of the green fluorescent shuttle? Please provide more details.

      We thank the reviewer for pointing this point as we missed to mention this technical issue in the original manuscript. The Cu-shuttles labeled with NBD indeed emit in the green signal, but they are not fixable under our conditions and are washed out during ICC procedure. Accordingly, they do generate any background signal and do not interfere with the ICC as shown by the controls and test conditions (Figure S7B and Figure 2B). This is now mentioned (page 11).

      * Page 9 and Figure 2B. Why did authors use Cu(II)EDTA for the experiment? What was the concentration? Please, add this information as well as Cu(II)GTSM treatment conditions to the experiment description in materials and methods.

      EDTA is a strong chelator of Cu(II), however due to its negative charge it cannot penetrate the plasma membrane thus importing Cu. It is therefore used as a negative control, to eliminate the speculation of Cu non-specifically crossing the plasma membrane or through a channel.

      * Figure 2 and supplementary figure 7. It would be beneficial to have higher magnification images. Please, add them, if possible.

      These higher magnification images have been provided.

      * Page 11. „In conclusion, the novel Cu(II)-selective peptide shuttles .... capable of instantly preventing ... toxicity on PC12 cells, whereas ... instantly rescue Cu(II)Aβ1-42 toxicity". Authors should be more careful with terminology. According to the materials and methods, the survival assay was carried out after 24h of cells' treatment with the reagents. Effect visible after 24h and „instant rescue" is not the same, Please clarify or modify the wording

      In principle, the peptides cannot reverse the production of ROS, however they prevent ROS production. Therefore, for the peptides to have an effect, they have to instantly halt ROS production. This is justified by the novel shuttles being more effective than AKH-αR5W4NBD in preventing toxicity, given we modified just the Cu binding sequence. We have however restricted the use of the term instantly to ROS production.

      * Page 13, figure 5, panels C and D. In both quantitations Cu(II) was used as one of the control conditions. Why in panel D the percentage of activated microglial cells (second graphs from right) is several fold higher (appr. 150% vs >500%)?

      This variability was observed throughout our set of experiments and could be linked to the quality of the hippocampal slices used. Slight variations in the age of the animals or in the traces of metals in the mediums are likely explanations. However, the different groups that are compared represent experiments performed simultaneously.

      * Supplementary Figure S3B. The lowest solid line does not correspond to any color in the legend (please, check and correct). However, by the method of exclusion, one may conclude that it refers to Cu(II)+HDapH-shuttle. What could be a potential explanation for stronger quenching of this shuttle by binding Cu(II) directly from the spiked media comparing to when it is pre-complexed with copper (also supported by the panel D)?

      The stronger quenching of this shuttle by binding Cu(II) directly from the spiked media comparing to when it is pre-complexed with copper is not significant.

      * In discussion the authors mention that the designed shuttles are prone to degradation in 48 hours. In the viability assays, they treat cells for 24 hours, in the fluorescent and confocal microscopy experiments for one hour or less. What is the lifetime of these shuttle peptides in the cells?

      The lifetime of the shuttle peptide in the cells is currently unknown. However, after 24h incubation of PC12 cells with the AKH-αR5W4NBD, DapHH-αR5W4NBD and HDapH-αR5W4NBD, the Cu shuttles lose their punctate distribution and appear diffuse inside the cells. We have recently shown that AKH-αR5W4NBD cycles through different endosomal compartments and eventually reaches the lysosomes where it could be degraded (Okafor et al., 2024, /doi.org/10.3389/fmolb.2024.1355963). Therefore, the diffuse distribution of the fluorescence signal could suggest degradation of the Cu-shuttles.

      * From the microscopy observations, the mechanism of entry of apo-shuttles (with no Cu(II) in the complex) and in complex with Cu(II) looks quite different. Namely, in figure S7 the fluorescent signal is very strong in the plasma membrane with significantly less vesicular pattern when compared to figure 2A. It is especially apparent for DapHH shuttle at 15 minutes of incubation. Can authors hypothesize/discuss the reason for these differences?

      The difference of the shuttle’s signal in the presence or absence of Cu binding, is due to fluorescence quenching by Cu bound and was at the heart of the design of these shuttles. Hence a strong signal at the plasma membrane is seen in the absence of Cu as these CPP-based shuttles interact strongly with the plasma membrane. However in presence of Cu, they become less visible due to quenching by Cu. Interestingly however, is that when Cu dissociates from the shuttle inside the cells (likely in acid endosomes), this quenching is suppressed and the fluorescence reappears. This is now better explained (page 10).

      * Please, show the figures in the supplementary file in the same order as you refer to them.

      This has been rectified.

      * Introduction. Description of the shuttle peptides: „(3) a cell penetrating peptide (CPP), αR5W4, with sequence RRWWRRRWWR, for cell entry35" - one R is the middle is extra.

      This has been rectified.

      *Kd units are missing (pages 2, 3 and 15) and should be added.

      This has been added.

      * Figure 1A is either not referred at all or mislabeled.

      * Page 7, Figure 1B: x axis on the second panel (+Mn+) misses a label.

      * Page 8. „Upon addition of DapHH-αR5W4NBD or HDapH-αR5W4NBD, an immediate slow-down in ROS production was observed (Figure 1D and S1E), ..." - mislabeled supplementary figure, please, correct.

      * Page 11. „...but not in the presence of AKH-αR5W4NBD which required pre-incubation to prevent toxicity (Figure 3AFigure)." Please, correct the reference to the figure.

      * Page 11. „This is in line with the faster retrieval ... previously demonstrated in vitro (Figure 1)" - please, specify the panel.

      * Supplementary materials and methods, subsection „Retrieval of Cu by peptide shuttles from Aβ", page 2: „The same was done for 10 μM Cu(II)...to give the estimated 100% saturated emission level." - check the spelling of the shuttle species.

      * Supplementary Figure S4. By the behavior of AKH-shuttle in the presence of copper and other metals, it looks that panels are shuffled, i.e. panel C looks corresponding to the panel B with DMEM/F12 conditions, whish is also supported by the values in the Table S1. Please, check and correct, if needed.

      * Supplementary figure S9, panel A. Apparently, mislabeled images with Abeta1-42 and Cu(II)Abeta1-42. Please, correct.

      We apologize for the different issues in referencing figures. This has been rectified.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Minor Concerns

      I think that authors can add some concepts of general interest on AD, as follows

      evidence showed that AD top-line disease-modifying drugs employing monoclonal antibodies (donanemab, lecanemab, and aducanumab) that tag Aβ, based on the 'Amyloid cascade hypothesis', are able to rid the brain of Aβ plaques, but the drug benefits consist in a reduction of 35% of cognitive decline. The remaining disease burden (more than 65%) has no disease-modifying therapeutic options, at the moment. Furthermore, monoclonal antibodies against Aβ have strong side- events (ARIA). On this basis, it could be suggested that removing Aβ plaque might not be sufficient to slow the 100% percentage of clinical decline in AD. This is why the Cu(II) shuttle invention presented by the candidate may represent a valid and concrete means to fight AD, since also meta-analyses demonstrate that Cu and more specifically non-Cp Cu is increased in AD (PMID: 34219710). The authors can add some of these clinical considerations in the Discussion.

      There is only a very brief description of the scenario of evidence of the involvement of copper in Alzheimer's, especially from a clinical point of view, I mean the scenario resulting from clinical studies carried out on AD patients. This would have highlighted the unmet medical need to which these new compounds (the Cu shuttles) can provide an answer. At least for a subpopulation of Alzheimer's patients, and we know that there are different subtypes of Alzheimer's disease (for example 10.1016/j.neurobiolaging.2004.04.001, but authors can find others), these Cu(II) selective shuttles could provide beneficial effects. Literature reports about a percentage of AD patients with increased levels of Cu (some papers on this topic e can be easily retrieved,), who may primarily benefit from these compounds. These can be easily identified as it is also characterized by a different biochemical, cognitive, and genetic profile. The current study is timely since AD patients with high Cu can be easily identified since they are characterized by a different biochemical, cognitive, and genetic profile as per recent findings (PMID: 37047347). This information can improve the quality of the manuscript by providing information about the unmet clinical need that this study can answer

      We thank the reviewer for his very positive evaluation and for his suggestion that gives more perspective to our work. Accordingly, we have added these parts to the introduction and discussion sections.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: The paper addresses the design and synthesis of novel copper (Cu)-selective peptide transporters to prevent Cu(Aβ)-induced toxicity and microglial activation in organotypic hippocampal slices.This is a very interesting study. I would define the study as pioneering and I hope that it is a seminal study, as it could be a study that opens the doors to future studies in the sector but above all applications in the clinical field. The methods are very complex and demonstrate an excellent knowledge of the biochemistry of beta-amyloid and copper. Methods are also clear and provide information for reproducibility

      Minor Concerns

      I think that authors can add some concepts of general interest on AD, as follows evidence showed that AD top-line disease-modifying drugs employing monoclonal antibodies (donanemab, lecanemab, and aducanumab) that tag Aβ, based on the 'Amyloid cascade hypothesis', are able to rid the brain of Aβ plaques, but the drug benefits consist in a reduction of 35% of cognitive decline. The remaining disease burden (more than 65%) has no disease-modifying therapeutic options, at the moment. Furthermore, monoclonal antibodies against Aβ have strong side- events (ARIA). On this basis, it could be suggested that removing Aβ plaque might not be sufficient to slow the 100% percentage of clinical decline in AD. This is why the Cu(II) shuttle invention presented by the candidate may represent a valid and concrete means to fight AD, since also meta-analyses demonstrate that Cu and more specifically non-Cp Cu is increased in AD (PMID: 34219710). The authors can add some of these clinical considerations in the Discussion

      there is only a very brief description of the scenario of evidence of the involvement of copper in Alzheimer's, especially from a clinical point of view, I mean the scenario resulting from clinical studies carried out on AD patients. This would have highlighted the unmet medical need to which these new compounds (the Cu shuttles) can provide an answer. At least for a subpopulation of Alzheimer's patients, and we know that there are different subtypes of Alzheimer's disease (for example 10.1016/j.neurobiolaging.2004.04.001, but authors can find others), these Cu(II) selective shuttles could provide beneficial effects. Literature reports about a percentage of AD patients with increased levels of Cu (some papers on this topic e can be easily retrieved,), who may primarily benefit from these compounds. These can be easily identified as it is also characterized by a different biochemical, cognitive, and genetic profile. The current study is timely since AD patients with high Cu can be easily identified since they are characterized by a different biochemical, cognitive, and genetic profile as per recent findings (PMID: 37047347). This information can improve the quality of the manuscript by providing information about the unmet clinical need that this study can answer

      Significance

      The significance of the study relies on that the Cu(II) selective shuttles can import Cu into cells and also release Cu once inside the cells, which was shown to be bioavailable, as revealed by the delocalization of ATP7A from the TGN to the sub-plasmalemma zone in PC12 cells. The novelty is well expressed by the implementation of Cu(II) selective shuttles that can release Cu inside the cells. Thus, they can restore Cu physiological levels in conditions of brain Cu deficiency that typify the neuronal cells in AD. Furthermore, this Cu trafficking can be finely tuned, thus preventing potential adverse drug reactions when transferred into human clinical phase I and II studies. This application may represent a step forward concerning previous copper attenuating compounds/Cu(II) ionophores such as Clioquinol and GTSM which mediated non-physiological Cu import into the cells that have likely contributed to neurotoxicity processes in previous unsuccessful phase II clinical trials.

      Furthermore, the originality of the current study relies on the fact that these shuttles can be tracked in real-time, once in the cell, since they employ a fluorophore moiety sensitive to Cu binding. Furthermore, DapHH-αR5W4NBD and HDapH-αR5W4NBD, can import bioavailable Cu(II) and can prevent ROS production by Cu(II)Aβ instantly, due to the much faster Cu-binding. Importantly, DapHH-αR5W4NBD and HDapH-αR5W4NBD shuttles have been also capable of preventing OHSC slices from Cu-induced neurotoxicity, microglial proliferation, and activation. Another important feature of the Cu shuttles is that they can be designed to control their site of cell delivery. In fact, previous ionophores had the tendency to accumulate in the mitochondria, and, in doing so, excess Cu in the mitochondria might have competed with other transitional metals (mainly Fe) and triggered mitochondrial dysfunction as well as cuproptosis. These are the main strengths of the study.

      The audience of this article is currently that of expert biochemists, but by adding aspects regarding the unmet clinical need relating to the large population of AD patients with high copper in the introduction and discussion, the article can capture the attention of clinicians.

      I am a neuroscientist working on biochemical pathways and metals in Alzheimer's disease.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is an interesting work characterizing a new generation of copper shuttles with an improved ability to retrieve copper intracellularly from amyloid beta (Ab). In the in-vitro experiments, the authors demonstrate that both DapHH-αR5W4NBD and HDapH-αR5W4NBD have faster Cu(II) retrieval kinetic than the previously characterized shuttle. The authors show the ability of on Cu(II)-DapHH-αR5W4NBD and Cu(II)-HDapH-αR5W4NBD to release copper intracellularly by monitoring changes in the intracellular pattern of the copper transporter ATP7A. Using PC12 cells, the author found that one of the shuttles, DapHH-αR5W4NBD can rescue Cu(II)Aβ1-42 toxicity, and this and other shuttles, show some protective effects in organotypic slices. Overall, the chemical and biochemical data are clear and the ability of new shuttles to deliver Cu to vesicles is convincingly demonstrated. Similarly, the protective effects on plasma membrane permeability in hippocampal staining are also apparent.

      Major issues:

      1. This reviewer is not convinced that the authors' experimental system is well suited for studies of glia activation and protective effects. With the exception of a couple of panels it is very hard to see differences. The authors should significantly improve the quality of images in Figure 5 to make this set of data convincing.
      2. Similarly, the quality of giantin staining is low and needs to be improved and more experimental details are needed (see details below)
      3. Given that shuttles are found within vesicles, the authors should discuss the mechanism through which Cu is released into the cytosol to trigger ATP7B trafficking.
      4. There are numerous small writing issues that make paper difficult to read. The authors are encouraged to carefully edit their manuscript

      Minor issues

      • „A solution of monomerized Aβ complex in 10% DMEM (diluted with DMEM salt solution) was prepared in microcentrifuge tubes" - here and further the description of media composition is confusing What is the rest 90%?
      • „Afterwards, AscH- was added to the tubes and vortexed, the mixture was then added to PC12 cells" - concentration of ascorbate is mentioned only once (later in the figure legend) where it can be barely found, also without explaining the choice of concentration. Additionally, ascorbate's product code is not listed. Please, correct.
      • Description of the cell (PC12 line) handling conditions is absent (growth medium, passage number used etc) and should be included.
      • ATP7A delocalization assay. Details for the secondary antibodies are absent (full name (e.g. AlexaFluor 488), manufacturer, code) and should be added
      • page 6: „Next, we investigated the capacity of the shuttles to withdraw Cu(II) from cell culture media, DMEM 10% and DMEM/F12 1:1 (D/F)." Here and further explanation is needed why the mixture of DMEM/F12 is needed (F12 is also not listed in the materials list).
      • Page 7. Legend to the figure 1B: „Conditions: Cu(II)=AKH-αR5W4NBD=DapHH-αR5W4NBD=HDapH-αR5W4NBD= 5 μM, DMEM 10%, D/F 100%, 25{degree sign}C, n=3." - „DMEM/F12" ratio equals to „100%" is confusing, please clarify
      • Page 8-9. Legend to the Figure 2A. „Similar observations were obtained with 5 different cell cultures." Same remark goes to the legend to supplementary figure 7 ("Similar observations were obtained with at least 3 different cell cultures"). Do the authors mean independent experiments or different cell lines? Please clarify. If different cell lines, consider including these data into the supplement
      • Page 8-9, figure 2B. Giantin is a cis-golgi marker, which should localize perinuclearly. In the cells shown the signal is diffuse and appears non-specific. Please improve the quality
      • Page 8-9, figure 2B. ATP7A is shown in green. The authors did not specify the secondary antibody has been used for it. If the secondary antibody used for labeling of ATP7A has green fluorescence then how does one distinguish between the transporter signal and signal of the green fluorescent shuttle? Please provide more details
      • Page 9 and Figure 2B. Why did authors use Cu(II)EDTA for the experiment? What was the concentration? Please, add this information as well as Cu(II)GTSM treatment conditions to the experiment description in materials and methods.
      • Figure 2 and supplementary fugure 7. It would be beneficial to have higher magnification images. Please, add them, if possible.
      • Page 11. „In conclusion, the novel Cu(II)-selective peptide shuttles .... capable of instantly preventing ... toxicity on PC12 cells, whereas ... instantly rescue Cu(II)Aβ1-42 toxicity". Authors should be more careful with terminology. According to the materials and methods, the survival assay was carried out after 24h of cells' treatment with the reagents. Effect visible after 24h and „instant rescue" is not the same, Please clarify or modify the wording
      • Page 13, figure 5, panels C and D. In both quantitations Cu(II) was used as one of the control conditions. Why in panel D the percentage of activated microglial cells (second graphs from right) is several fold higher (appr. 150% vs >500%)?
      • Supplementary Figure S3B. The lowest solid line does not correspond to any color in the legend (please, check and correct). However, by the method of exclusion, one may conclude that it refers to Cu(II)+HDapH-shuttle. What could be a potential explanation for stronger quenching of this shuttle by binding Cu(II) directly from the spiked media comparing to when it is pre-complexed with copper (also supported by the panel D)?
      • In discussion the authors mention that the designed shuttles are prone to degradation in 48 hours. In the viability assays, they treat cells for 24 hours, in the fluorescent and confocal microscopy experiments for one hour or less. What is the lifetime of these shuttle peptides in the cells?
      • From the microscopy observations, the mechanism of entry of apo-shuttles (with no Cu(II) in the complex) and in complex with Cu(II) looks quite different. Namely, in figure S7 the fluorescent signal is very strong in the plasma membrane with significantly less vesicular pattern when compared to figure 2A. It is especially apparent for DapHH shuttle at 15 minutes of incubation. Can authors hypothesize/discuss the reason for these differences?
      • Please, show the figures in the supplementary file in the same order as you refer to them.
      • Introduction. Description of the shuttle peptides: „(3) a cell penetrating peptide (CPP), αR5W4, with sequence RRWWRRRWWR, for cell entry35" - one R is the middle is extra.
      • Kd units are missing (pages 2, 3 and 15) and should be added
      • Figure 1A is either not referred at all or mislabeled
      • Page 7, Figure 1B: x axis on the second panel (+Mn+) misses a label
      • Page 8. „Upon addition of DapHH-αR5W4NBD or HDapH-αR5W4NBD, an immediate slow-down in ROS production was observed (Figure 1D and S1E), ..." - mislabeled supplementary figure, please, correct.
      • Page 11. „...but not in the presence of AKH-αR5W4NBD which required pre-incubation to prevent toxicity (Figure 3AFigure)." Please, correct the reference to the figure.
      • Page 11. „This is in line with the faster retrieval ... previously demonstrated in vitro (Figure 1)" - please, specify the panel.
      • Supplementary materials and methods, subsection „Retrieval of Cu by peptide shuttles from Aβ", page 2: „The same was done for 10 μM Cu(II)...to give the estimated 100% saturated emission level." - check the spelling of the shuttle species
      • Supplementary Figure S4. By the behavior of AKH-shuttle in the presence of copper and other metals, it looks that panels are shuffled, i.e. panel C looks corresponding to the panel B with DMEM/F12 conditions, whish is also supported by the values in the Table S1. Please, check and correct, if needed.
      • Supplementary figure S9, panel A. Apparently, mislabeled images with Abeta1-42 and Cu(II)Abeta1-42. Please, correct.

      Significance

      Delivering copper to various cells and tissue to improve cells function or removal excess copper to decrease pathology is an important and timely goal. This work describe new membrane-permeable reagents, "shuttles" with improved intracellular copper release and protective effects in PC12 cells. While, the results are overall interesting, the quality of writing and data presentation needs to be improved.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript titled "Next-generation Cu(II) selective peptide shuttles prevent Cu(Aβ)-induced toxicity and microglial activation in organotypic hippocampal slices" the authors have designed and synthesized two novel peptide shuttles that specifically bind to copper in the extracellular medium and transport them into the cells where copper is released and used for the copper-dependent function. The new copper shuttles are based on the previously published copper shuttle reported by the same group. Compared to the older peptide shuttle, which required pre-incubation for an hour in cellular media before adding AscH- to prevent copper(Aβ)-induced toxicity, the new copper shuttles reported in this article do not require pre-incubation. Overall, the manuscript is well written, experiments are controlled, and data are clear. The authors need to clarify some of the issues mentioned below:

      Major Comments:

      1. The data in the paper strongly suggests that the new copper shuttles are selective for copper and have faster binding kinetics (Fig 1) than the previous one. However, the data regarding the copper shuttling from the copper(Aβ) peptides is not very convincing. It appears to be due to the Cu effect alone (Fig.3), as the reduction in viability with Cu(II)+ AscH- is almost the same as the Cu(II)(Aβ)+AscH-. To convincingly show that the peptide shuttle can strip copper from (Aβ) peptides, the authors need to show that the copper is bound to the (Aβ) peptide before it is used in the experiment.

      Minor comments:

      1. The paper does not cite Figure 1A and some supplementary figures, especially Supp. Fig. 1, 2. All the figures and supplementary figures should be cited.
      2. The data presentation in Figures 3B and S8 is confusing."-" signs indicate no addition or the blank box means no addition. Also, the AKH-αR5W4 has no "-" sign in the first bar. For clarity, please indicate the -, +, or no sign means in the figure legends. Also, what does "Batch A" refer to in Figure 3B?
      3. Page 7, correct (Error! Referencesource not found.Figure 1C).
      4. The Giantin staining in Figure 2B is making it hard to visualize ATP7A trafficking. If the Giantin image overlay is removed, it may be easier to see the movement of ATP7A from the perinuclear region to the vesicles.
      5. In the introduction, the authors mention, "These molecules have, however, a major pitfall as is seen for Elesclemol, a candidate for Menkes disease treatments 32. The authors cite reference " Tsvetkov, P. et al. Copper induces cell death by targeting lipoylated TCA cycle proteins." The paper showing elesclomol as a candidate for Menkes disease treatments is Guthrie L et al., Elesclomol alleviates Menkes pathology and mortality by escorting Cu to cuproenzymes in mice. Science. 2020.

      Significance

      General Assessment: This well-written manuscript reports two novel peptide shuttles that specifically bind to copper in the extracellular medium and transport them into the cells where copper is released and available for the copper-dependent function. However, more convincing data is needed to show that the new peptide shuttles can pick copper from the copper bound to the (Aβ) peptides. In addition to their high specificity to copper, these copper shuttles can be tracked in real-time, making them well-suited for mechanistic studies to follow copper importation in cells, providing valuable new research tools to the copper community.

      Advance: The new copper shuttles in this manuscript are based on the previously published copper shuttle reported by the same group. Compared to the older peptide shuttle, which required pre-incubation for an hour in cellular media before adding AscH- to prevent copper(Aβ)-induced toxicity, the new copper shuttles reported in this article do not require pre-incubation and hence have faster binding kinetics.

      Audience: It will attract a broad audience, as the copper shuttles reported in this paper are promising drugs for Alzheimer's disease.

      My expertise: Mitochondria copper biology

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      • The authors investigate in this study the function of LIN-42 for the process of precise molting timing in C. elegans. To achieve this, they compare LIN-42 with its mammalian ortholog, Period. They found that similar to Period, LIN-42 interacted with the kinase KIN-20, a mammalian Casein kinase 1 (CK1) ortholog. Hence, two different proteins involved in rhythmic processes, LIN-42 and Period function in a conserved manner. *
      • First, they used mutants with specific deletions to untangle various phenotypes during C. elegans development. From this analysis they identify a specific region, corresponding to a CK1-binding region in mammals, to be mainly involved in the rhythmic molting phenotype. Next, they identify KIN-20, the CK1 ortholog as interaction partner of LIN-42. They even were able to demonstrate an interaction of CK1 with the region of LIN-42. Using CK1, they identified potential phosphorylation sites within LIN-42 and compared those with immunoprecipitated protein in vivo. There was a substantial overlap. While the C-terminal tail of LIN-42 was heavily phosphorylated, deletion of the C-terminal part resulted only in a minor phenotype for rhythmic molting. Last but not least, they demonstrated that point mutations that inactivate the catalytic function of KIN-20 produced a rhythmic molting phenotype. The interaction of LIN-42 with KIN-20 affected the localization of the kinase, similar to what was found to Period and CK1. *
      • Overall, the experiments are well done, well controlled and well described even for non-specialists. I guess it was not easy to kind of sort out the many overlapping phenotypes. It was certainly helpful just to focus on the clear rhythmic molting phenotype. *

      • I have no major or minor comments. *

      • Reviewer #1 (Significance (Required)): *

      • The manuscript is well written and can be followed by non-specialists of the field. The experiments are well performed. Even if some experiments did not yield the expected phenotype, e.g. deletion of the C-terminal tail of LIN-42 had only a minor phenotype inspire of heavy phosphorylation, these experiments are anyhow included and explained. *

      • Overall, the study is interesting for people in the C. elegans field and by similarity mammalian chronobiology. I would expect that most of the progress based on this study will be on the further elucidation of the molting phenotype and how the other phenotypes related to this. Then this could emerge as a blueprint for molting phenomena in other species as well. *
      • I am a mammalian chronobiologist working on Period proteins. *

      We thank the reviewer for their positive evaluation of our work.

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)): *

      • This study represents pioneering work on LIN-42, the C. elegans ortholog of PER, uncovering its role in molting rhythms and heterochronic timing. A key strength of this work lies in its integrative approach, combining genetic and developmental analyses in C. elegans with biochemical characterization of LIN-42 protein. *

      • At the organismal level, the authors take advantage of the power of C. elegans as a model system, employing precise genetic manipulations and high-resolution developmental assays to dissect the contributions of LIN-42 and its interaction partner KIN-20, the C. elegans ortholog of CK1, to molting rhythms. Their findings provide in vivo evidence that binding of LIN-42 with KIN-20 promotes the nuclear accumulation of KIN-20 and is crucial for molting rhythms, while its PAS domain appears dispensable for this function. This detailed phenotypic analysis of multiple LIN-42 and KIN-20 mutants represents a significant contribution to our understanding of the developmental clock. *

      • At the biochemical level, the study provides a detailed analysis of the mechanism underlying LIN-42's interaction with CK1, demonstrating that LIN-42 contains a functionally conserved CK1-binding domain (CK1BD). Through their in vitro kinase assays and structural insights, the authors identified distinct roles for CK1BD-A and CK1BD-B: the former in kinase inhibition and the latter in stable CK1 binding and phosphorylation. Importantly, their data align well with previous findings on PER-CK1 regulation in mammalian and Drosophila systems, reinforcing the evolutionary conservation of key clock components. *

      • Overall, this work stands out for its deep and important insights into how CK1-mediated regulation extends beyond the circadian clock to regulate the developmental clock. The combination of genetic approaches with biochemical analyses makes this an outstanding contribution to both chronobiology and nematode developmental biology. *

      We thank the reviewer for the strong endorsement for publication of our work

      *Major comment 1: * * In Figure 2D, I could not find a crucial control if the authors claim that KIN-20 binds to LIN-42. For example, a single mutant of LIN-42-3xFLAG could be used as a control for the double mutant. *

      We will do an appropriate control experiment.

      *Major comment 2: * * The sizes of the KIN20 bands were very diverged (~40 kDa and ~60 kDa), but the authors provide no explanation for this. *

      The worm produces several KIN-20 isoforms. We will state this in the revised manuscript.

      *Major comment 3: * * Regarding the MS study, the raw data are available, but the detailed supplemental Excel files would be more informative for readers. For example, are other interactors such as REV-ERB/NHR-85 detected in Figure 2A? Regarding Figure 4F, the list of phosphorylation sites and MS scores is also informative. *

      We apologize for our omission in stating clearly in the figure legend that the significantly enriched proteins were labeled with a red dot. These were only LIN-42 itself and KIN-20. NHR-85 was not enriched. We will state this explicitly in a revised version and provide all relevant information.

      *Major comment 4: * * It is an important finding that the PAS domain of LIN-42 is not essential for the molting rhythms. Is the PAS domain also dispensable for binding with KIN-20? *

      Although we have currently no reason to assume that the PAS domain would be required for KIN-20 binding, we will perform an in vitro experiment to test for binding.

      *Major comment 5 (Optional): * * In this study, the authors carefully performed in vitro kinase assays, and I strongly suggest that they investigate whether the CKI-mediated phosphorylation of LIN-42 is temperature-compensated and whether the CKI-BD-AB regions affect it. *

      Although this is an interesting question, addressing it appears outside the scope of the manuscript and a revision; please see section 4 below.

      *Major comment 6 (Optional): * * In Figure 6, the authors argue that the CKI-BD of LIN-42 is important for CK1 nuclear translocation. It would be better to show the effect of the nuclear accumulation of CKI on nuclear proteins, like the mammalian CKI-PER2-CLOCK story. Does CKI localization affect phosphorylation status of other clock-related proteins including REV-ERB/NHR-85? * * Phospho-proteome analysis would identify nuclear substrates of CK1. In addition, is phosphorylation of LIN-42 dispensable for the CK1 nuclear translocation? *

      This is another interesting question yet currently nothing is known about other CK1/KIN-20 targets, and we have no evidence for NHR-85 being one. Please see our detailed comments in the section 4 below.

      To address whether LIN-42 phosphorylation affects CK1/KIN-20 nuclear accumulation, we will seek to examine KIN-20 localization in LIN-42∆Tail animals.

      *Major comment 7 (Optional): * * LIN-42 rhythmic expression could drive rhythmic nuclear accumulation of KIN-20. It would be better to examine this possibility using kin-20::GFP in lin-42 mutants. *

      We agree that the mutant analysis is important for this and Fig. 6C shows reduced KIN-20 nuclear accumulation in LIN-42∆CK1BD.

      Minor 1: * * I could not find the full gel images of the Western blot analyses as supplemental materials.

      This data will be added.

      Minor 2: * * The authors discussed a conserved module in two different clocks. A statement regarding a recently published paper (Hiroki and Yoshitane, Commun Biol, 2024) would be informative for readers.

      We will add such a statement.

      ***Referee cross-commenting** *

      • I basically agree with reviewer 1 and hope that this paper will be published soon as it is very valuable for our field. I have constructively pointed out some parts that could be improved, but depending on the editor's judgement, I believe that even if not all of these are revised, it will be sufficient for publication. *

      • Reviewer #2 (Significance (Required)): *

      • This work stands out for its deep and important insights into how CK1-mediated regulation extends beyond the circadian clock to regulate the developmental clock. The combination of genetic approaches with biochemical analyses makes this an outstanding contribution to both chronobiology and nematode developmental biology. *

      • I strongly suggest editors to accept this study with minor modifications according to the following comments.*

      We thank the reviewer for their strong support and the clear indication of required vs. optional revisions.

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)): *

      • In their manuscript "A conserved chronobiological complex times C. elegans development", Spangler, Braun, Ashley et al. investigate the mechanisms through which the PERIOD orthologue, lin-42, regulates rhythmic molting in C. elegans. Through precise genetic manipulations, the authors identify a particular region of lin-42, the 'CK1BD', which regulates molting timing, with less effect on other lin-42 phenotypes (e.g. heterochrony). They show that LIN-42 and the casein kinase 1 (CK1) homologue KIN-20 interact in vivo, and identify phosphorylation sites of LIN-42. Using biochemical assays, they find that the CK1BD of LIN-42 is sufficient for interaction with the human homologue of KIN-20, CK1, in vitro. The LIN-42 CK1BD is also required for the proper nuclear accumulation of KIN-20 in vivo. Furthermore, a point mutation that should disrupt the catalytic activity of KIN-20 also shows an irregular molting phenotype, similar to the lin-42 CK1BD mutant. The manuscript is very well-written and the data and methods are well-presented and detailed. Overall this work makes a convincing case that the C. elegans lin-42:Kin-20 and mammalian period:Ck1 interactions have functionally conserved roles in the oscillatory developmental programs of each organism (molting timing and circadian rhythms, respectively), with a few caveats below that can be addressed.*

      We thank the reviewer for their positive evaluation of our work.

      *Major comments: *

        1. The authors have shown that LIN-42 is phosphorylated in vivo, but the dependence of this phosphorylation on KIN-20 is not fully addressed. In the discussion (lines 417-420), the authors mention that the unhealthy phenotype of the kin-20 mutant animals prevented them from assessing LIN-42 phosphorylation in this genetic background. To bolster their model and to circumvent this issue, it should be feasible to generate a kin-20 degron allele and to perform the LIN-42 phospho-proteomics upon inducible degradation. Alternatively, perhaps a phos-tag western blot for LIN-42 could be used to compare the kin-20 wild-type to kin-20 mutants.*

      We agree, and acknowledged in the discussion, that phoshorylation of LIN-42 by KIN-20 in vivo has not been demonstrated by us. However, as discussed in the section 4 below, we find that this costly, challenging and time-consuming experiment is not warranted by the expected gain.

      For technical reasons, the in vitro biochemistry was done using human CK1 protein. There are a few places (e.g. results, line 248 and discussion line 437), where the language, in my opinion, is extrapolating the CK1 results too strongly to KIN-20. The authors mention that feedback inhibition is a known property of human CK1. It is indeed quite striking that the LIN-42 CK1BD region interacts with and is phosphorylated by the human counterpart of KIN-20, and that feedback inhibition is also seen! However, the language about KIN-20 itself should be softened, since there does not appear to be clear evidence that KIN-20 exhibits the same properties as human CK1 (unless perhaps human CK1 can functionally replace KIN-20 in worms, or the proteins were extremely similar?)

      We will follow the reviewer’s advice and carefully examine the text for instances where we extrapolated too much and tone these down. (We note that this does not apply to the example of line 248 where we wrote “Collectively, our data establish that the LIN-42

      CK1BD is functionally conserved and mediates stable binding to the CK1 kinase domain.”, i.e., there was no mentioning of KIN-20.)

      The role of the three LIN-42 isoforms should be further clarified. Minimally, it should be explained why the alleles where both b and c isoforms should be flag-tagged seem to only produce detectable b isoform (e.g. Fig. 2C).

      We will clarify that the individual roles of the isoforms are largely unknown and that we can only speculate that the c-isoform may exhibit either generally low expression or expression in only few cells or tissues.

      4. Related to points 2 and 3 above, the authors have shown that the CKIBD mediates association with human CK1 in vitro, and is required for nuclear accumulation of KIN-20 in vivo, but not that the complex formation between LIN-42 and KIN-20 depends on the CK1BD. Given the reciprocal co-IP findings, it should be feasible to create tagged versions of lin-42(deltaCK1BD) and to determine the effect on LIN-42-KIN-20 complex formation. While there is already a b-isoform tag, an a-isoform tag would also help to address whether both the b and a isoforms interact with KIN-20 in a CK1BD-dependent manner in vivo. These strains would also allow the authors to determine how the CK1BD deletion affects overall levels/stability/rhythmic accumulation of LIN-42(a or b), which would potentially serve to strengthen their conclusions about the role of the lin-42 CK1BD.

      We will attempt to generate a FLAG-tagged LIN-42∆CK1BD to perform IP and check for binding of KIN-20.

      As detailed in section 4, we cannot tag LIN-42a individually due to the structure of the genomic locus, and its level appear very low to begin with.

      In the molting timing assay, there is an unexpected result where the delta-C-terminal-tail lin-42 allele resembles the n1089 (N-terminal deletion) (line 315). Could the authors more clearly explain this finding?

      As we point out in the manuscript, n1089 is a partial deletion with a breakpoint in a noncoding (intronic) region of lin-42. Accordingly, it is currently unknown, what mature transcripts and proteins are made in the mutant animals. This prevents us from making educated guesses as to why there is a phenotypic resemblance between these and lin-42∆tail mutant animals. We will clarify in the manuscript that this is an interesting, but currently unexplained observation.

      *Minor comments: *

        1. The correspondence between the LIN-42 "SYQ" and "LT" motifs and the motifs referred to as "A" and "B" should be clarified, and consistent names/labels used. Are these interchangeable names? If it is necessary to use both names, the differences between SYQ/LT and A/B should be made more clear.*

      We agree that the situation is not completely satisfactory but feel that we need to use both names since they have both been used in the literature. We will work to revise the text to reflect more clearly the correspondence.

      For data presented as "% of animals", please indicate the number of animals scored (e.g. egl, alae assays - ~ how many animals per replicate (dot)?).

      We will provide these numbers.

      Line 145-148 - Mentioning the relevant phenotype(s) of the lin-42 null allele from the cited paper would provide a good point of comparison here.

      We will mention the previously described phenotypes.

      Line 201 - the phrase "This is also true for the proteins:" is unclear, as the previous sentence states that both lin-42 and kin-20 mRNAs oscillate, while the next sentence says that only LIN-42 protein oscillates.

      We apologize for the confusion and will correct the text.

      Line 231 - please explain the significance of the 'lower response signal' in the BLI assay for the CKIBD(no tail).

      We will clarify that the lower response signal observed for the CK1BD compared to the CK1BD+Tail (residues 402-589; same construct used in Fig. 3B) reflects its smaller molecular weight, which reduces the overall mass contribution to the BLI sensor.

      Fig. 2 - C/D - the genotype lane labels should I think indicate an N-terminal rather

      We will fix this mistake.

      7. Fig. 6, line 367 - lin-42 is variably described as promoting increased KIN-20 'nuclear accumulation' or 'localization'. I think that 'accumulation' is more accurate, as it doesn't imply a specific mechanism for the difference (transport vs stabilization, etc.)

      We will revise the manuscript accordingly.

      *8. Fig 6B - an overlay of the panels or another way of quantifying the colocalization would make this result more clear. *

      We will supply the requested overlay.

      *Reviewer #3 (Significance (Required)): *

      • This work presents a major mechanistic and conceptual advance in our understanding of the role of lin-42/Period, a conserved key regulator of C. elegans development. Previously, it was not clear if the heterochronic and circadian functions of lin-42 were genetically separable, nor was it known how LIN-42 physically interacted with the CK1 homologue. This work addresses these questions using precise genome engineering and detailed phenotypic and biochemical approaches. The work also reveals the conservation of bi-directional/reciprocal regulation between lin-42 and kin-20. The main limitations of the study, which can potentially be addressed as outlined in the 'major points' above, are that evidence should be provided that lin-42 phosphorylation depends on kin-20 in vivo, and that the CK1BD mediates the interaction in vivo (since the in vitro work is with human CK1). As the authors indicate, this is the first 'conserved clock module' of this type, and this work will therefore be of significant interest to both the C. elegans developmental biology and the more general biological timing fields. *

      • Field of expertise of the reviewer- C. elegans genetics and development.*

      Description of the studies that the authors prefer not to carry out

      *Major comment 5 (Optional): * * In this study, the authors carefully performed in vitro kinase assays, and I strongly suggest that they investigate whether the CKI-mediated phosphorylation of LIN-42 is temperature-compensated and whether the CKI-BD-AB regions affect it. *

      Temperature compensation is of course one of the most striking features of circadian clocks, and CK1-mediated phosphorylation of PER appears a critical component. We agree that it would be interesting to examine whether or not this feature exists in an animal whose development is not or only partially temperature-compensated. However, these studies are not straightforward – we would first have to set up an assay and demonstrate temperature compensation for the mammalian PER – CK1 pair as a positive control. We were not able to purify KIN-20 so could only test whether the LIN-42 substrate promoted temperature compensation. Moreover, either result for LIN-42 – CK1 would immediately raise new questions that would deserve extensive follow-up: if there is temperature compensation, why is worm development not compensated? If there is none, where/how do the interactions between CK1 and LIN-42 differ from those between CK1 and PER? Hence, we propose that these studies are outside the scope of the current study.

      *Major comment 6 (Optional): * * In Figure 6, the authors argue that the CKI-BD of LIN-42 is important for CK1 nuclear translocation. It would be better to show the effect of the nuclear accumulation of CKI on nuclear proteins, like the mammalian CKI-PER2-CLOCK story. Does CKI localization affect phosphorylation status of other clock-related proteins including REV-ERB/NHR-85? * * Phospho-proteome analysis would identify nuclear substrates of CK1. In addition, is phosphorylation of LIN-42 dispensable for the CK1 nuclear translocation? *

      We agree that it will be important to identify relevant targets of KIN-20 in future work. Unfortunately, at this point, none are known, and we especially do not have any knowledge of the phosphorylation status of NHR-85. Indeed, in unrelated (and unpublished) work we have done a phosphoproteomics time course of wild-type animals. We have not detected any NHR-85-derived phosphopeptides in our analysis. Thus, this would establish a completely new line of research, incompatible with the timelines of a revision.

      @Ref. 3:

      1. *The authors have shown that LIN-42 is phosphorylated in vivo, but the dependence of this phosphorylation on KIN-20 is not fully addressed. In the discussion (lines 417-420), the authors mention that the unhealthy phenotype of the kin-20 mutant animals prevented them from assessing LIN-42 phosphorylation in this genetic background. To bolster their model and to circumvent this issue, it should be feasible to generate a kin-20 degron allele and to perform the LIN-42 phospho-proteomics upon inducible degradation. Alternatively, perhaps a phos-tag western blot for LIN-42 could be used to compare the kin-20 wild-type to kin-20 mutants. * We agree, and acknowledged in the discussion, that phoshorylation of LIN-42 by KIN-20 in vivo has not been demonstrated by us. However, since our data from the LIN-42∆Tail mutant also suggest that LIN-42 phosphorylation be functionally largely dispensable for KIN-20’s function in rhythmic molting, we consider further elucidation of this point a lower priority, especially considering the challenges involved. As we have seen for our unpublished work on wild-type animals, a phosphoproteomics experiments would be costly and time-consuming, with a non-trivial analysis (due to the underlying dynamics of protein level changes). A phos-tag gel would be subject to multiple confounders given the abundance of the phosphosites that we detected on immunoprecipitated LIN-42 – unlikely to stem only from KIN-20 activity – and an increase in total LIN-42 levels that we observe upon KIN-20 depletion, and thus appears unsuited to providing a meaningful answer.

      *Related to points 2 and 3 above, the authors have shown that the CKIBD mediates association with human CK1 in vitro, and is required for nuclear accumulation of KIN-20 in vivo, but not that the complex formation between LIN-42 and KIN-20 depends on the CK1BD. Given the reciprocal co-IP findings, it should be feasible to create tagged versions of lin-42(deltaCK1BD) and to determine the effect on LIN-42-KIN-20 complex formation. While there is already a b-isoform tag, an a-isoform tag would also help to address whether both the b and a isoforms interact with KIN-20 in a CK1BD-dependent manner in vivo. These strains would also allow the authors to determine how the CK1BD deletion affects overall levels/stability/rhythmic accumulation of LIN-42(a or b), which would potentially serve to strengthen their conclusions about the role of the lin-42 CK1BD. *

      As detailed in section 2, we will address the point concerning LIN-42∆CK1BD. However, due to the overlapping exons, we are unable to tag the a-isoform independently of the b-isoform. Moreover, in a western blot of a line where both a- and b-isoforms are tagged, we have observed only little or no LIN-42a signal, suggesting that, like the c-isoform, its expression may be more limited, making biochemical characterization difficult. Hence, these experiments are not feasible.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In their manuscript "A conserved chronobiological complex times C. elegans development", Spangler, Braun, Ashley et al. investigate the mechanisms through which the PERIOD orthologue, lin-42, regulates rhythmic molting in C. elegans. Through precise genetic manipulations, the authors identify a particular region of lin-42, the 'CK1BD', which regulates molting timing, with less effect on other lin-42 phenotypes (e.g. heterochrony). They show that LIN-42 and the casein kinase 1 (CK1) homologue KIN-20 interact in vivo, and identify phosphorylation sites of LIN-42. Using biochemical assays, they find that the CK1BD of LIN-42 is sufficient for interaction with the human homologue of KIN-20, CK1, in vitro. The LIN-42 CK1BD is also required for the proper nuclear accumulation of KIN-20 in vivo. Furthermore, a point mutation that should disrupt the catalytic activity of KIN-20 also shows an irregular molting phenotype, similar to the lin-42 CK1BD mutant. The manuscript is very well-written and the data and methods are well-presented and detailed. Overall this work makes a convincing case that the C. elegans lin-42:Kin-20 and mammalian period:Ck1 interactions have functionally conserved roles in the oscillatory developmental programs of each organism (molting timing and circadian rhythms, respectively), with a few caveats below that can be addressed.

      Major comments:

      1. The authors have shown that LIN-42 is phosphorylated in vivo, but the dependence of this phosphorylation on KIN-20 is not fully addressed. In the discussion (lines 417-420), the authors mention that the unhealthy phenotype of the kin-20 mutant animals prevented them from assessing LIN-42 phosphorylation in this genetic background. To bolster their model and to circumvent this issue, it should be feasible to generate a kin-20 degron allele and to perform the LIN-42 phospho-proteomics upon inducible degradation. Alternatively, perhaps a phos-tag western blot for LIN-42 could be used to compare the kin-20 wild-type to kin-20 mutants.
      2. For technical reasons, the in vitro biochemistry was done using human CK1 protein. There are a few places (e.g. results, line 248 and discussion line 437), where the language, in my opinion, is extrapolating the CK1 results too strongly to KIN-20. The authors mention that feedback inhibition is a known property of human CK1. It is indeed quite striking that the LIN-42 CK1BD region interacts with and is phosphorylated by the human counterpart of KIN-20, and that feedback inhibition is also seen! However, the language about KIN-20 itself should be softened, since there does not appear to be clear evidence that KIN-20 exhibits the same properties as human CK1 (unless perhaps human CK1 can functionally replace KIN-20 in worms, or the proteins were extremely similar?)
      3. The role of the three LIN-42 isoforms should be further clarified. Minimally, it should be explained why the alleles where both b and c isoforms should be flag-tagged seem to only produce detectable b isoform (e.g. Fig. 2C).
      4. Related to points 2 and 3 above, the authors have shown that the CKIBD mediates association with human CK1 in vitro, and is required for nuclear accumulation of KIN-20 in vivo, but not that the complex formation between LIN-42 and KIN-20 depends on the CK1BD. Given the reciprocal co-IP findings, it should be feasible to create tagged versions of lin-42(deltaCK1BD) and to determine the effect on LIN-42-KIN-20 complex formation. While there is already a b-isoform tag, an a-isoform tag would also help to address whether both the b and a isoforms interact with KIN-20 in a CK1BD-dependent manner in vivo. These strains would also allow the authors to determine how the CK1BD deletion affects overall levels/stability/rhythmic accumulation of LIN-42(a or b), which would potentially serve to strengthen their conclusions about the role of the lin-42 CK1BD.
      5. In the molting timing assay, there is an unexpected result where the delta-C-terminal-tail lin-42 allele resembles the n1089 (N-terminal deletion) (line 315). Could the authors more clearly explain this finding?

      Minor comments:

      1. The correspondence between the LIN-42 "SYQ" and "LT" motifs and the motifs referred to as "A" and "B" should be clarified, and consistent names/labels used. Are these interchangeable names? If it is necessary to use both names, the differences between SYQ/LT and A/B should be made more clear.
      2. For data presented as "% of animals", please indicate the number of animals scored (e.g. egl, alae assays - ~ how many animals per replicate (dot)?).
      3. Line 145-148 - Mentioning the relevant phenotype(s) of the lin-42 null allele from the cited paper would provide a good point of comparison here.
      4. Line 201 - the phrase "This is also true for the proteins:" is unclear, as the previous sentence states that both lin-42 and kin-20 mRNAs oscillate, while the next sentence says that only LIN-42 protein oscillates.
      5. Line 231 - please explain the significance of the 'lower response signal' in the BLI assay for the CKIBD(no tail).
      6. Fig. 2 - C/D - the genotype lane labels should I think indicate an N-terminal rather than C-terminal LIN-42 tag.
      7. Fig. 6, line 367 - lin-42 is variably described as promoting increased KIN-20 'nuclear accumulation' or 'localization'. I think that 'accumulation' is more accurate, as it doesn't imply a specific mechanism for the difference (transport vs stabilization, etc.)
      8. Fig 6B - an overlay of the panels or another way of quantifying the colocalization would make this result more clear.

      Significance

      This work presents a major mechanistic and conceptual advance in our understanding of the role of lin-42/Period, a conserved key regulator of C. elegans development. Previously, it was not clear if the heterochronic and circadian functions of lin-42 were genetically separable, nor was it known how LIN-42 physically interacted with the CK1 homologue. This work addresses these questions using precise genome engineering and detailed phenotypic and biochemical approaches. The work also reveals the conservation of bi-directional/reciprocal regulation between lin-42 and kin-20. The main limitations of the study, which can potentially be addressed as outlined in the 'major points' above, are that evidence should be provided that lin-42 phosphorylation depends on kin-20 in vivo, and that the CK1BD mediates the interaction in vivo (since the in vitro work is with human CK1). As the authors indicate, this is the first 'conserved clock module' of this type, and this work will therefore be of significant interest to both the C. elegans developmental biology and the more general biological timing fields.

      Field of expertise of the reviewer- C. elegans genetics and development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study represents pioneering work on LIN-42, the C. elegans ortholog of PER, uncovering its role in molting rhythms and heterochronic timing. A key strength of this work lies in its integrative approach, combining genetic and developmental analyses in C. elegans with biochemical characterization of LIN-42 protein.

      At the organismal level, the authors take advantage of the power of C. elegans as a model system, employing precise genetic manipulations and high-resolution developmental assays to dissect the contributions of LIN-42 and its interaction partner KIN-20, the C. elegans ortholog of CK1, to molting rhythms. Their findings provide in vivo evidence that binding of LIN-42 with KIN-20 promotes the nuclear accumulation of KIN-20 and is crucial for molting rhythms, while its PAS domain appears dispensable for this function. This detailed phenotypic analysis of multiple LIN-42 and KIN-20 mutants represents a significant contribution to our understanding of the developmental clock.

      At the biochemical level, the study provides a detailed analysis of the mechanism underlying LIN-42's interaction with CK1, demonstrating that LIN-42 contains a functionally conserved CK1-binding domain (CK1BD). Through their in vitro kinase assays and structural insights, the authors identified distinct roles for CK1BD-A and CK1BD-B: the former in kinase inhibition and the latter in stable CK1 binding and phosphorylation. Importantly, their data align well with previous findings on PER-CK1 regulation in mammalian and Drosophila systems, reinforcing the evolutionary conservation of key clock components.

      Overall, this work stands out for its deep and important insights into how CK1-mediated regulation extends beyond the circadian clock to regulate the developmental clock. The combination of genetic approaches with biochemical analyses makes this an outstanding contribution to both chronobiology and nematode developmental biology.

      Major comment 1:

      In Figure 2D, I could not find a crucial control if the authors claim that KIN-20 binds to LIN-42. For example, a single mutant of LIN-42-3xFLAG could be used as a control for the double mutant.

      Major comment 2:

      The sizes of the KIN20 bands were very diverged (~40 kDa and ~60 kDa), but the authors provide no explanation for this.

      Major comment 3:

      Regarding the MS study, the raw data are available, but the detailed supplemental Excel files would be more informative for readers. For example, are other interactors such as REV-ERB/NHR-85 detected in Figure 2A? Regarding Figure 4F, the list of phosphorylation sites and MS scores is also informative.

      Major comment 4:

      It is an important finding that the PAS domain of LIN-42 is not essential for the molting rhythms. Is the PAS domain also dispensable for binding with KIN-20?

      Major comment 5 (Optional):

      In this study, the authors carefully performed in vitro kinase assays, and I strongly suggest that they investigate whether the CKI-mediated phosphorylation of LIN-42 is temperature-compensated and whether the CKI-BD-AB regions affect it.

      Major comment 6 (Optional):

      In Figure 6, the authors argue that the CKI-BD of LIN-42 is important for CK1 nuclear translocation. It would be better to show the effect of the nuclear accumulation of CKI on nuclear proteins, like the mammalian CKI-PER2-CLOCK story. Does CKI localization affect phosphorylation status of other clock-related proteins including REV-ERB/NHR-85? Phospho-proteome analysis would identify nuclear substrates of CK1. In addition, is phosphorylation of LIN-42 dispensable for the CK1 nuclear translocation?

      Major comment 7 (Optional):

      LIN-42 rhythmic expression could drive rhythmic nuclear accumulation of KIN-20. It would be better to examine this possibility using kin-20::GFP in lin-42 mutants.

      Minor 1:

      I could not find the full gel images of the Western blot analyses as supplemental materials.

      Minor 2:

      The authors discussed a conserved module in two different clocks. A statement regarding a recently published paper (Hiroki and Yoshitane, Commun Biol, 2024) would be informative for readers.

      Referee cross-commenting

      I basically agree with reviewer 1 and hope that this paper will be published soon as it is very valuable for our field. I have constructively pointed out some parts that could be improved, but depending on the editor's judgement, I believe that even if not all of these are revised, it will be sufficient for publication.

      Significance

      This work stands out for its deep and important insights into how CK1-mediated regulation extends beyond the circadian clock to regulate the developmental clock. The combination of genetic approaches with biochemical analyses makes this an outstanding contribution to both chronobiology and nematode developmental biology.

      I strongly suggest editors to accept this study with minor modifications according to the following comments.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors investigate in this study the function of LIN-42 for the process of precise molting timing in C. elegans. To achieve this, they compare LIN-42 with its mammalian ortholog, Period. They found that similar to Period, LIN-42 interacted with the kinase KIN-20, a mammalian Casein kinase 1 (CK1) ortholog. Hence, two different proteins involved in rhythmic processes, LIN-42 and Period function in a conserved manner.

      First, they used mutants with specific deletions to untangle various phenotypes during C. elegans development. From this analysis they identify a specific region, corresponding to a CK1-binding region in mammals, to be mainly involved in the rhythmic molting phenotype. Next, they identify KIN-20, the CK1 ortholog as interaction partner of LIN-42. They even were able to demonstrate an interaction of CK1 with the region of LIN-42. Using CK1, they identified potential phosphorylation sites within LIN-42 and compared those with immunoprecipitated protein in vivo. There was a substantial overlap. While the C-terminal tail of LIN-42 was heavily phosphorylated, deletion of the C-terminal part resulted only in a minor phenotype for rhythmic molting. Last but not least, they demonstrated that point mutations that inactivate the catalytic function of KIN-20 produced a rhythmic molting phenotype. The interaction of LIN-42 with KIN-20 affected the localization of the kinase, similar to what was found to Period and CK1.

      Overall, the experiments are well done, well controlled and well described even for non-specialists. I guess it was not easy to kind of sort out the many overlapping phenotypes. It was certainly helpful just to focus on the clear rhythmic molting phenotype.

      I have no major or minor comments.

      Significance

      The manuscript is well written and can be followed by non-specialists of the field. The experiments are well performed. Even if some experiments did not yield the expected phenotype, e.g. deletion of the C-terminal tail of LIN-42 had only a minor phenotype inspire of heavy phosphorylation, these experiments are anyhow included and explained. Overall, the study is interesting for people in the C. elegans field and by similarity mammalian chronobiology. I would expect that most of the progress based on this study will be on the further elucidation of the molting phenotype and how the other phenotypes related to this. Then this could emerge as a blueprint for molting phenomena in other species as well.

      I am a mammalian chronobiologist working on Period proteins.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their comments and have included substantial new data to strengthen the work by specifically addressing questions regarding the molecular mechanisms driving the proteomic and phenotypic changes observed in these disease models. We have generated a new ganglioside disease model (GM1 gangliosidosis) and demonstrated that the lysosomal exocytosis mechanism identified for GM2 gangliosidosis is a conserved mechanism that alters the PM proteome (see new Figure 5).

      We have also carried out substantial additional experimental work to address the question of whether specific protein-lipid interactions drive some of these changes. We have preliminary data supporting this (included below) but we are not confident that these data are robust enough for inclusion in this manuscript. This work required substantial in vitro experiments including the expression and purification of several proteins for use in liposome binding assays. Although these data are promising, they have been challenging to reproduce and we would prefer to develop this work further for inclusion in a subsequent paper.

      Although not requested by any reviewers we have also included substantial additional multielectrode array (MEA) data in Figure 4 to further support the phenotypic changes to electrical signalling seen in the Tay Sachs disease model.

      We would like to note that even without these new data the reviewers highlighted that the “high-quality data presented significantly advance the field” and that the work “exposes key conceptual novelties” using “new insight” and “new tools” that shed “light on the complex pathophysiology that links lipid accumulation to neuronal dysfunction”. And that this highlights “an underappreciated dimension of these diseases” allowing them to be “understood better thanks to this study”. More generally the reviewers state that the work is of interest to both “clinicians and basic researchers” and is relevant to “broader fields in cellular and neurodegenerative biology”.

      Point-by-point description of the revisions

      • *

      Reviewer 1

      Confirmation of Neuronal Differentiation: To confirm neuronal differentiation in their i3N cell model, the authors show qPCR results indicating the expression of mature neuronal markers and the downregulation of stem cell markers by day 14. However, single-cell RNA sequencing (scRNA-seq) could provide a more detailed evaluation of the differentiation process, addressing the fine-grained cell-type composition within the cell population. Depending on the results, the authors might more precisely interpret functional data and assess the possible influence of increased GM2 levels on cell fate decisions.

      The accumulation of GM2 may not be identical across all neurons and so it is possible that, although the neuronal populations as a whole display mature differentiation, individual cells may respond differently to the amount of lipid debris. However, there are several technical reasons why obtaining samples for scRNAseq is extremely challenging. By 14 dpi the separation of individual neurons from each other is very difficult as they are in a densely grown and highly attached and interconnected network. Furthermore, the individual neurons have a highly polarized differentiated morphology with long delicate axonal and dendritic projections, that are readily cleaved and lysed in the process of harvesting and dissociation to obtain single cell suspensions for FACS sorting. In neurons, mRNAs are also abundantly localised along the length of their neuritic projections [1], thus these damaged preparations would provide unreliably meaningful data. Alternatively, sufficiently isolated individual neurons show poor survival and do not mature. If these technical difficulties could be overcome, in order to monitor altered differentiation, it would be necessary to determine which timepoint was most relevant to capture differences between day 0 stem cells and day 28 when they are synchronously firing glutamatergic neuron cultures. For this analysis to be robust it would require sample preparation and analysis of multiple stages of the differentiation process. For all the reasons above we cannot address this reviewer’s request.

      Mechanistic Links Between Lipid Accumulation and Proteomic Changes: The authors report specific proteome changes upon HEXA/B KO. What are the mechanistic links between lipid accumulation and proteomic changes? Is the overall degradative performance of lysosomes compromised? The authors note that certain proteins, such as TSPANs, can bind directly to GSL headgroups. Clarifying whether the observed proteomic changes result from specific, direct lipid-protein interactions versus indirect effects could strengthen the argument for targeted lipid-mediated proteomic shifts.

      In response to these questions, we have carried out substantial additional experimental work testing the lipid interactions of some of the proteins that are most altered in their abundance at the PM. We focussed on the top non-lysosomal proteins as we are proposing that the lysosomal ones are primarily changed due to lysosomal exocytosis, suggesting the non-lysosomal are the best candidates for direct GSL-binding. To robustly identify specific lipid-protein interactions is highly challenging but something we have demonstrated previously [2].

      In vitro lipid-binding assays require expression and purification of the proteins of interest to then be used in liposome pulldown experiments using liposomes of defined composition. As we are most interested in the specificity of the headgroup interaction we focussed on producing the extracellular portions of these proteins that would be predicted to bind these headgroups (again this is a strategy we have successfully used previously [2]). We expressed and purified the extracellular domains of three top non-lysosomal hits: CNTNAP4, CNTN5 and NTRK2 (Fig. R1A, provided in attached response document). These purified proteins were used in liposome-binding assays using liposomes composed of different sphingolipids and gangliosides (Fig. R1B). These data demonstrate that the GPI-anchored protein CNTN5 and its potential binding partner CNTNAP4 bind promiscuously to different headgroups. This may be consistent with their being incorporated into GSL-rich membrane microdomains via the GPI-anchor. Interestingly, in this assay NTRK2 demonstrates specific and substantial binding to GM2, with some weaker binding to GD3.

      These data support that the increased abundance of NTRK2 at the PM could be driven by direct interactions with the same lipid that is accumulating at the PM. As exciting and compelling as these data are, we have subsequently been unable to repeat this observation for NTRK2. We are unsure why and have tried several different strategies to test this interaction, but at this stage with only an N=1 for this observation we do not feel confident to include these data in the manuscript.

      We intend to pursue this further using a range of alternative techniques and protein constructs but this will take substantial additional time and effort that we feel go beyond the scope of this current manuscript.

      Additionally, does this phenomenon extend to other sphingolipidoses (e.g., Gaucher disease)? Comparing the proteomes of i3N cells across different sphingolipidoses could reveal whether the accumulation of distinct GSLs produces unique or shared proteomic profiles, highlighting similarities or specificities across lysosomal storage disorders.

      We agree with the reviewer that this is an interesting and important question and had intended to do this as follow-up work in a future publication. However, in the interests of addressing this point here, we are including additional data we have generated from a new i3N model of GM1 gangliosidosis. As for the GM2 gangliosidosis models, we used CRISPRi to knockdown GLB1 and have confirmed this KD by q-PCR. We have also profiled the GSL composition and quantified the increased GM1 abundance. We have followed this up with both whole-cell and PM proteomics. We have presented comparative proteomics of the two models and demonstrated that they both result in significant accumulation of lysosomal proteins both in cells and at the PM. This shared proteomic profile is consistent with lysosomal exocytosis being a conserved mechanism driving altered PM composition in these diseases. We have included this work as an additional results section and an additional figure (Figure 5) as well as expanding the discussion. For this analysis we collected mass spec data at 28 dpi based on our observations in the paper that electrical signalling was synchronised at this point (Fig 4). In the text we discuss additional changes in these new WCP data such as the appearance of other trafficking molecules such as Arl8a that further support a lysosomal exocytosis mechanism.

      In terms of the unique proteomic profiles of these diseases, the read depth of the PMP data in this case was not sufficient to confidently identify differences between the two gangliosidosis models and therefore we intend to pursue this work with additional LSDs in future studies to be included in a follow-up paper.

      In terms of mechanistic links between lipid accumulation and proteome changes, we feel these new data provide substantial additional support that the appearance of lysosomal proteins at the PM is driven by lysosomal exocytosis and have preliminary data supporting that some non-lysosomal protein changes may be driven by altered protein-lipid interactions.

      Impact of Increased PM GM2 Levels on Endocytic Pathways: Along similar lines, the authors show differences in the PM proteome and in the representation of specific PM lipid domain-associated proteins. As some of these proteins are turned over by mechanisms involving lipid domain-dependent endocytosis, the authors might want to examine the effect of increased PM GM2 levels on various endocytic pathways.

      We thank the reviewer for this suggestion and have attempted assays monitoring endocytosis using several approaches including the uptake of fluorescently labelled bovine serum albumin (DQ-BSA) [3–5]. These endocytosis assays are well established in standard cell lines such as HeLa cells. Despite several attempts by us to get this working in neurons using multiple alternative readouts (microscopy and plate-based fluorescence) we have been unable to measure changes in endocytosis. Exploration of alternative methods to probe Clathrin-independent/dynamin-independent endocytosis (CLIC/GEEC) suggests these pathways are difficult to observe by fluorescence microscopy as there is minimal concentration of cargo proteins during the formation of carriers before endocytosis [6]. As an alternative strategy to probe changes in lipid-domain dependent endocytosis we have analysed the proteomics data for changes in galectins but no changes were identified in the data. We also explored available tools for modulating lysosomal exocytosis and monitoring lysosomal movement including activating TRPML1 to trigger exocytosis and activating ABCA3 to drive more lipid accumulation [7–10]. Similarly to the endocytosis assays above, these were not translatable to neurons in our hands due to a range of challenges including increased toxicity of these drugs on this cell type. We have made a substantial effort to try and address these questions and have conferred with colleagues who have also reported difficulties in establishing these assays in neurons. We are keen to continue to pursue this question but due to the technical challenges we feel this work lies beyond the scope of the current manuscript.

      Multifaceted Nature of Gangliosidoses as PM Disorders: The manuscript presents an important perspective by reframing gangliosidoses as multifaceted PM disorders that disrupt neuronal function and membrane composition. By further elaborating on the connection between membrane lipid alterations, neuronal excitability, and synaptic composition, and by exploring the interplay with lysosomal dysfunction, the authors could provide a richer understanding of gangliosidoses and GSL function in general.

      We appreciate that the reviewer agrees with us that reframing gangliosidoses as more complex multifaceted diseases is important. We are not sure if there is a request here for more elaboration in the text but based on the new data included in the paper, we have expanded some of the discussion around these points. We are very enthusiastic to continue to probe the connections and interplay as described by the reviewer and this is the focus of our ongoing studies.

      Reviewer 2

      1. T-tests and one-way ANOVAs were used, but it is not clear if datasets were tested for normality and equal standard deviations. Please add these details. If data are not normal or standard deviations are unequal, other tests will have to be used.

      All graphs were checked for normality and variance in standard deviation and for figure 1F, where the data was not normally distributed, a Kruskal-Wallace test was used in place of a one-way ANOVA. All significantly different results are now labelled on graphs and the relevant tests described in the figure legends. This has also all been updated in the Supplementary data.

      1. It needs to be clearly explained how many data points were used for statistical analyses and what the data points were. E.g., N=3 independent experiments on 3 different days, each done in n=3 different wells, total n=9. Each well can be considered a biological replicate, but it's of lesser value than the "big Ns" done on different days. The authors can choose different ways of defining their N/n numbers, but it has to be transparent. The bar graphs would ideally display the data points.

      All figure legends now clearly explain N and n numbers used in experiments. Individual data points are displayed on qPCR graphs where N and n are mixed, with shapes denoting the biological repeat (N). In addition to clarification in figure legends, N and n numbers are described in the methods sections where appropriate.

      For completeness we also include here details of these N/n numbers.

      • For the q-PCR experiments, technical triplicates (repeats on the same day, n=3) were carried out for 3 separate biological replicates on different days (N=3). We have changed how these data are plotted to clarify this.
      • For the activity assays, N=3 biological replicates were carried out on cell lysates from cultures grown on different days.
      • For the microscopy analysis, coverslips from N=3 biological replicates on different days were used. n=2 coverslips per N were used to generate 15 images per N.
      • For the glycan analysis, N=3 independent cell pellets were prepared on different days.
      • For the proteomics experiments, these were done as N=3 independent cell cultures grown and prepared on different days. Specifically, one of each cell line SCRM, HEXA-1, HEXA-2, HEXB-1 and HEXB-2 were grown and harvested or biotinylated at a time (for WCP or PMP), with repeats on different days. These N=3 were then combined for the ΔHEX-A/B lines to provide N=12 biological repeats for disease cell lines to be compared to N=3 biological repeats for “SCRM” control cell lines.
      • For calcium imaging, n=4 wells for each of SCRM, ΔHEXA-1 and ΔHEXB1 were averaged and the mean from each was used to provide n=3 data points across two biological repeats of this experiment, N=2.
      • For the MEA data, we now include substantially more data than in the original manuscript (see comments at the top of this document). This is now N=3 biological replicates across n=52 wells over a time period from 38-45 dpi.
      • The N/n values and statistical tests have also all been updated in the Supplementary data.
        1. There should be a comment on how statistical power was calculated upfront and if not: how N/n numbers were chosen ("based on similar expts in the past").

      N/n numbers, as detailed above, were chosen based on previous experiments by ourselves and others, as well as recommended practice [2,11–15]. Typically, these papers do not describe the statistical power upfront. We have added statements to this effect and relevant references to the methods section of the manuscript.

      1. "This suggests that some of the proteins that are accumulating in these diseases are specifically products of lipid accumulation rather than a product of general lysosomal dysfunction. In further support of this, several lysosomal proteins including V-type ATPases (ATP6 family), mannose-6-phosphate receptor (M6PR) and biogenesis of lysosomal organelle complex subunits (BLOC1) are quantified in the WCP but are not increased in abundance." This part is confusing. It seems like the authors observe an accumulation of endolysosomes in general (page 6), but then only certain endolysosomal proteins accumulate - and the authors speculate that this is due to decreased degradation or enhanced translation (mRNA levels are unaffected). This question should be addressed better, ideally experimentally: are endolysosomes accumulating in general or not? And what defines the endolysosomal proteins that accumulate vs. those that don't? How is that regulated?

      Recently published work has identified that late endosomes/lysosomes do not possess one composition; they are dynamically remodelled and there is substantial heterogeneity in the composition of different lysosomes [16,17]. While some components, such as LAMP1 and Cathepsin D, are common across all lysosomal compartments there is considerable heterogeneity in the composition of these organelles. These studies also demonstrate that in disease-relevant conditions or upon drug treatment, lysosomes change their protein composition. For example, in a LIPL-4 KO mouse model they observe an increased abundance of Ragulator complex components, similarly to the increase in LAMTOR3 seen in our new 28 dpi WCP data for GM1 and GM2 gangliosidoses. Interestingly, in this study they demonstrate that lysosomal lipolysis leads to bigger changes in lysosomal protein composition than other pro-longevity mechanisms [17]. Another recent paper looking at a different lysosomal storage disease in microglia with accumulating GSLs and cholesterol has also identified abundance changes in a subset of lysosomal proteins including several we observe here including TTYH3, NPC1, PSAP and TSPAN7 [18]. Beyond proteomic analyses, the experimental tools for identifying these different populations are currently very limited, but these published studies support that it is possible to have accumulation of what we define as lysosomes by IF (using LAMP1 or lysotracker) but for the proteomic analysis to identify increased abundance of only a subset of lysosomal proteins.

      These papers do not identify or speculate on how these differences are regulated. Analysis of the changes in our WCP as well as the new data for GM1 gangliosidoses support that the proteins that are most changed in response to GSL accumulation are membrane proteins involved in lipid and cholesterol binding and transport (New Fig 2D and 5E and see response below). This specific enrichment suggests that the changes are directly linked to the lipid changes, thus our suggestion that these accumulate due to a need for the cell to process these lipids but also that they may get “trapped” in the membrane whorls such that they are not efficiently degraded.

      We have included the references above and a more detailed description of lysosomal heterogeneity into the main text to help address the reviewer’s questions.

      1. Fig. 1D: The GO terms are confusing. Why are there more proteins in the category lysosomal membrane than lysosome as a whole? Other categories seem to be overlapping as well.

      We apologize for the confusion; this graph does not display protein counts it is the adjusted P values for the enrichment of the term. To make this clearer, the DAVID analysis graphs are now presented in a new format. We present in this new graph the false discovery rate (FDR) (adjusted P value) which is a measure of the significance of whether that GO term is specifically enriched in the dataset. We have also expanded the GO term analysis to include molecular function and biological process descriptors in addition to the cellular component originally described. For full clarity, to the right of each term we include the number of significant hits that have this term, that being the number of proteins that are contributing to this GO term enrichment.

      1. Fig. 2C/3A: It'd be good to also show the hits that don't match the expectation/pathways of interest.

      We provide a full list in the Supplementary Information of all hits that are considered significant allowing the reader to access this information without having to download the datasets from PRIDE. We did not label all hits in these panels to avoid cluttering the image. In the main text we have focused on those that clearly fall within related categories or pathways as we feel that several “hits” in the same area represents a more compelling and confident assessment of the data. Several of the additional hits not mentioned in the main text do still match the expectations/pathways. For example, one of the top hits not labelled in the WCP is GPR155 (a cholesterol binding protein at the lysosomal membrane) and one of the top unlabelled hits in the PMP data is OPCML (a GPI-anchored protein that clusters in GSL-rich microdomains). There are some, such as KITLG (up in the PMP data), that we don’t currently have a hypothesis for why/how they change, but we are reluctant to describe and speculate upon additional isolated/orphan hits in the main text when these have not been further validated.

      1. Fig. 3: It is not intuitive that synaptic proteins in particular would accumulate at the plasma membrane due to the lipid storage defect. Are they mis-trafficked or are they at synaptic membranes? That could, e.g, be addressed by isolating synaptosomes. And why this selectivity for synaptic proteins? Neurons should have more plasma membrane that is not synaptic. And, e.g, the release of lysosomal material should not happen at synapses (and lysosomes should not deliver synaptic proteins to the PM, unless there is a failure to degrade them).

      We agree that synapses represent a relatively small proportion of the entire PM of neurons, but synapses are particularly enriched with glycosphingolipids where they affect synaptogenesis and synaptic transmission [19–22]. For these reasons we think that some synaptic proteins are particularly sensitive to these lipid changes as they are localised in GSL-rich membrane microdomains. We have now clarified this point in the text. We have also further clarified that we were not proposing that lysosomal proteins are present at the synapses. We observed that lysosomal proteins are enriched at the PM and this may be more generally across the whole PM, while the changes to synaptic proteins may or may not be localised at the synapse. We apologise for the confusion and have modified the text at the end of the PM proteomics results section to make this clearer.

      To try and address experimentally the question of whether these proteins are at synapses, we have attempted synaptosome enrichment. However, lysosomal compartments co-sedimented with synaptosomes during the preparation – LAMP1 staining was enriched in the synaptosome preparations of all samples including SCRM controls. Therefore, we cannot distinguish these compartments which is particularly problematic in this disease model.

      (7. Continued) Or is there an effect on synaptic vesicles? Are there more? Do they deliver their cargo more readily? Or is there a failure to do endocytosis of synaptic proteins, and that's why the accumulate? What is the connection between SVs and endolysosomes? More clarity would be good here.

      We do think that there is an effect on synaptic vesicles particularly as the SV proteins SYT1 and SV2b are significantly increased in abundance at the PM suggesting they are not being internalized normally. Furthermore, the new WCP data going out to 28 dpi for both GM1 and GM2 gangliosidoses have identified a significant increase in Arl8a which plays a shared role in lysosomal and SV anterograde trafficking [23,24]. Whilst previously thought of as discrete pathways, evidence now suggests that endolysosomal and SV recycling pathways form a continuum with several shared proteins involved in the fusion, trafficking and sorting in both pathways [25]. Arl8a provides a good example of an adaptor protein that functions in both pathways and also when overexpressed results in enhanced neurotransmission consistent with our studies [26]. We have adjusted the discussion text to include a description of the links between SVs and endolysosomal trafficking and the potential shared role Arl8a may be playing in both pathways.

      Regarding the question of whether there are more SVs or not, this is hard to determine directly as they are particularly small (~50 nm) and difficult to visualise or specifically stain for using microscopy. Not all SV-associated proteins are increased in the PMP data, for example SNAP25 and several other synaptotagmins are not changed in the 28 dpi data for both gangliosidosis models. We hope in the future to address SV changes more directly with higher resolution imaging such as electron microscopy or cryo-tomography but cannot currently confidently answer these specific questions.

      1. Fig. 4: The assumption that there is more synaptic activity because there are more synaptic proteins at the membrane seems to be plausible, but also speculative at this point.

      We have modified the text at the end of this results section to highlight that this is a speculative link.

      1. The possible contribution of glial cells should at least be discussed.

      We mention potential deleterious effects on bystander cells including other neurons, astrocytes and microglia in the second last paragraph of the discussion. In response to this request we have expanded and modified this text.

      Minor: there are some typos etc.

      Although no specific examples were listed, we have endeavored to find and correct typos, we have also checked for English spelling (not American) throughout.

      Reviewer 3

      1. Results section, 1st paragraph- to develop disease models- -- Please add cellular models as we already have KO mouse models.

      This has been added to the text.

      1. It was not clear what was the percentage of mutation success with their CRISPR technique.

      The CRISPR method employed here was CRISPRi so there is no mutation of the genome. Instead, inactive/dead-Cas9 is targeted to the promotor/early exon of the HEXA or HEXB gene to inhibit mRNA production. We have included qPCR data to demonstrate the extent of the KD for two different guides to each of these genes in Fig 1.

      1. Will the anti-GM2 antibody be available for other researchers? The researcher details needs to be clarified.

      The anti-GM2 antibody is not commercial available and was generated by one of the co-authors. We invite scientists with an interest in this antibody to contact the corresponding author for details.

      1. Hex activity assay was shown in 1C, but it was not clear that it is MUG or MUGS.

      We apologise for this and have relabelled these activity assay graphs and expanded the legend text to clarify how these two substrates were used to distinguish the two different KD lines. We also corrected a small mistake in the methods section.

      1. Is there a significance in Figure 2 B, 4A, 4B,4C and 4E?

      Based on additional requests from reviewer 2 we have added significance indicators and details of significance tests for several panels in Figures 1-5 including 2B and 4B. For 4A we do not state a significant difference, we use these data to select a timepoint (28 dpi) where all cell lines have synchronous (correlated) signal. The data in Figure 4C and D have been substantially updated and expanded. Analysis of the data in 4C is plotted in 4D where we show significance. For 4E we are stating that the applied stimulation (white triangles) stimulates the HEXA cells every time but the SCRM do not respond to each stimulation. It is not clear how we would quantify this difference and there is no precedent for doing this in the MEA literature or by the Axion company who provided the instrument. We have also included additional references for best practice when analysing MEA data.

      REFERENCES

      1. Mofatteh M. mRNA localization and local translation in neurons. AIMS Neurosci. 2020;7: 299–310. doi:10.3934/Neuroscience.2020016
      2. McKie SJ, Nicholson AS, Smith E, Fawke S, Caroe ER, Williamson JC, et al. Altered plasma membrane abundance of the sulfatide-binding protein NF155 links glycosphingolipid imbalances to demyelination. Proc Natl Acad Sci U S A. 2023;120: e2218823120. doi:10.1073/pnas.2218823120
      3. Marwaha R, Sharma M. DQ-Red BSA Trafficking Assay in Cultured Cells to Assess Cargo Delivery to Lysosomes. Bio Protoc. 2017;7: e2571. doi:10.21769/BioProtoc.2571
      4. gustavo.parfitt. Lysosome proteolysis analysis with DQ-BSA. 2022 [cited 13 Feb 2025]. Available: https://www.protocols.io/view/lysosome-proteolysis-analysis-with-dq-bsa-cgjxtupn
      5. Fernandez-Mosquera L, Yambire KF, Couto R, Pereyra L, Pabis K, Ponsford AH, et al. Mitochondrial respiratory chain deficiency inhibits lysosomal hydrolysis. Autophagy. 2019;15: 1572–1591. doi:10.1080/15548627.2019.1586256
      6. Rennick JJ, Johnston APR, Parton RG. Key principles and methods for studying the endocytosis of biological and nanoparticle therapeutics. Nat Nanotechnol. 2021;16: 266–276. doi:10.1038/s41565-021-00858-8
      7. Pastore N, Annunziata F, Colonna R, Maffia V, Giuliano T, Custode BM, et al. Increased expression or activation of TRPML1 reduces hepatic storage of toxic Z alpha-1 antitrypsin. Molecular Therapy. 2023;31: 2651–2661. doi:10.1016/j.ymthe.2023.06.018
      8. Zhang H, Wang Y, Wang R, Zhang X, Chen H. TRPML1 agonist ML-SA5 mitigates uranium-induced nephrotoxicity via promoting lysosomal exocytosis. Biomedicine & Pharmacotherapy. 2024;181: 117728. doi:10.1016/j.biopha.2024.117728
      9. Shen D, Wang X, Li X, Zhang X, Yao Z, Dibble S, et al. Lipid Storage Disorders Block Lysosomal Trafficking By Inhibiting TRP Channel and Calcium Release. Nat Commun. 2012;3: 731. doi:10.1038/ncomms1735
      10. Wünkhaus D, Tang R, Nyame K, Laqtom NN, Schweizer M, Scotto Rosato A, et al. TRPML1 activation ameliorates lysosomal phenotypes in CLN3 deficient retinal pigment epithelial cells. Sci Rep. 2024;14: 17469. doi:10.1038/s41598-024-67479-8
      11. Zlamalova E, Rodger C, Greco F, Cheers SR, Kleniuk J, Nadadhur AG, et al. Atlastin-1 regulates endosomal tubulation and lysosomal proteolysis in human cortical neurons. Neurobiol Dis. 2024;199: 106556. doi:10.1016/j.nbd.2024.106556
      12. Anderson GSF, Ballester-Beltran J, Giotopoulos G, Guerrero JA, Surget S, Williamson JC, et al. Unbiased cell surface proteomics identifies SEMA4A as an effective immunotherapy target for myeloma. Blood. 2022;139: 2471–2482. doi:10.1182/blood.2021015161
      13. Mossink B, Verboven AHA, Hugte EJH van, Gunnewiek TMK, Parodi G, Linda K, et al. Human neuronal networks on micro-electrode arrays are a highly robust tool to study disease-specific genotype-phenotype correlations in vitro. Stem Cell Reports. 2021;16: 2182–2196. doi:10.1016/j.stemcr.2021.07.001
      14. McCready FP, Gordillo-Sampedro S, Pradeepan K, Martinez-Trujillo J, Ellis J. Multielectrode Arrays for Functional Phenotyping of Neurons from Induced Pluripotent Stem Cell Models of Neurodevelopmental Disorders. Biology. 2022;11: 316. doi:10.3390/biology11020316
      15. Weaver S, Dube S, Mir A, Qin J, Sun G, Ramakrishnan R, et al. Taking qPCR to a higher level: Analysis of CNV reveals the power of high throughput qPCR to enhance quantitative resolution. Methods. 2010;50: 271–276. doi:10.1016/j.ymeth.2010.01.003
      16. Bond C, Hugelier S, Xing J, Sorokina EM, Lakadamyali M. Heterogeneity of late endosome/lysosomes shown by multiplexed DNA-PAINT imaging. J Cell Biol. 2025;224: e202403116. doi:10.1083/jcb.202403116
      17. Yu Y, Gao SM, Guan Y, Hu P-W, Zhang Q, Liu J, et al. Organelle proteomic profiling reveals lysosomal heterogeneity in association with longevity. Elife. 2024;13: e85214. doi:10.7554/eLife.85214
      18. Yasa S, Butz ES, Colombo A, Chandrachud U, Montore L, Tschirner S, et al. Loss of CLN3 in microglia leads to impaired lipid metabolism and myelin turnover. Commun Biol. 2024;7: 1373. doi:10.1038/s42003-024-07057-w
      19. Sipione S, Monyror J, Galleguillos D, Steinberg N, Kadam V. Gangliosides in the Brain: Physiology, Pathophysiology and Therapeutic Applications. Front Neurosci. 2020;14: 572965. doi:10.3389/fnins.2020.572965
      20. Svennerholm L. Gangliosides and Synaptic Transmission. In: Svennerholm L, Mandel P, Dreyfus H, Urban P-F, editors. Structure and Function of Gangliosides. Boston, MA: Springer US; 1980. pp. 533–544. doi:10.1007/978-1-4684-7844-0_46
      21. Palmano K, Rowan A, Guillermo R, Guan J, McJarrow P. The role of gangliosides in neurodevelopment. Nutrients. 2015;7: 3891–3913. doi:10.3390/nu7053891
      22. Hering H, Lin C-C, Sheng M. Lipid rafts in the maintenance of synapses, dendritic spines, and surface AMPA receptor stability. J Neurosci. 2003;23: 3262–3271. doi:10.1523/JNEUROSCI.23-08-03262.2003
      23. Rizalar FS, Lucht MT, Petzoldt A, Kong S, Sun J, Vines JH, et al. Phosphatidylinositol 3,5-bisphosphate facilitates axonal vesicle transport and presynapse assembly. Science. 2023;382: 223–230. doi:10.1126/science.adg1075
      24. Klassen MP, Wu YE, Maeder CI, Nakae I, Cueva JG, Lehrman EK, et al. An Arf-like small G protein, ARL-8, promotes the axonal transport of presynaptic cargoes by suppressing vesicle aggregation. Neuron. 2010;66: 710–723. doi:10.1016/j.neuron.2010.04.033
      25. Ivanova D, Cousin MA. Synaptic Vesicle Recycling and the Endolysosomal System: A Reappraisal of Form and Function. Front Synaptic Neurosci. 2022;14: 826098. doi:10.3389/fnsyn.2022.826098
      26. Vukoja A, Rey U, Petzoldt AG, Ott C, Vollweiter D, Quentin C, et al. Presynaptic Biogenesis Requires Axonal Transport of Lysosome-Related Vesicles. Neuron. 2018;99: 1216-1232.e7. doi:10.1016/j.neuron.2018.08.004
      27. Saheki Y, De Camilli P. Synaptic Vesicle Endocytosis. Cold Spring Harb Perspect Biol. 2012;4: a005645. doi:10.1101/cshperspect.a005645
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      I am quite impressed with the study. The use of i3N based cellular model was well established, characterized and produced some very interesting results.

      Authors have created a cellular model of iPSC cell line for TSD and SD. They confirmed the efficacy of new cell line and then did many assays including enzymatic assays, IHC, EM, gene expression, proteomics, electrophysiological studies. The information generated is very novel and will contribute in furthering the understanding of TSD and SD pathology.

      Use of triplicates, writing the possible conclusions are clear.

      Few minor concerns:

      1. Results section, 1st paragraph- to develop disease models- -- Please add cellular models as we already have KO mouse models.
      2. It was not clear what was the percentage of mutation success with their CRISPR technique.
      3. Will the anti-GM2 antibody be available for other researchers? The researcher details needs to be clarified.
      4. Hex activity assay was shown in 1C, but it was not clear that it is MUG or MUGS.
      5. Is there a significance in Figure 2 B, 4A, 4B,4C and 4E?

      Significance

      I consider this paper to be an advancement in the field and recommend acceptance after minor revisions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Nicholson et al. report interesting findings related to ganglioside biology. The ganglioside GM2 (a lipid with several sugar groups) is the substrate of the hydrolytic lysosomal β- hexosaminidase A (HexA) enzyme (cutting off sugar groups). When subunits of the enzyme are mutated and dysfunctional, GM2 lipids accumulate in cells (in lysosomes and in membranes). This leads to GM2 gangliosidoses, Tay-Sachs and Sandhoff diseases. The authors have generated i3Neuron-based models of Tay-Sachs and Sandhoff diseases by efficiently knocking down Hex enzymes. They observe storage of GM2, formation of "membrane whorls", and accumulation of endolysosomal proteins. The accumulating proteins seem to be largely related to lipid metabolism. Moreover, the composition of the plasma membrane is significantly impacted by both lipid and protein changes. In particular, synaptic proteins seem to accumulate at the plasma membrane.

      The following suggestions are made to improve the study:

      1. T-tests and one-way ANOVAs were used, but it is not clear if datasets were tested for normality and equal standard deviations. Please add these details. If data are not normal or standard deviations are unequal, other tests will have to be used.
      2. It needs to be clearly explained how many data points were used for statistical analyses and what the data points were. E.g., N=3 independent experiments on 3 different days, each done in n=3 different wells, total n=9. Each well can be considered a biological replicate, but it's of lesser value than the "big Ns" done on different days. The authors can choose different ways of defining their N/n numbers, but it has to be transparent. The bar graphs would ideally display the data points.
      3. There should be a comment on how statistical power was calculated upfront and if not: how N/n numbers were chosen ("based on similar expts in the past").
      4. "This suggests that some of the proteins that are accumulating in these diseases are specifically products of lipid accumulation rather than a product of general lysosomal dysfunction. In further support of this, several lysosomal proteins including V-type ATPases (ATP6 family), mannose-6-phosphate receptor (M6PR) and biogenesis of lysosomal organelle complex subunits (BLOC1) are quantified in the WCP but are not increased in abundance." This part is confusing. It seems like the authors observe an accumulation of endolysosomes in general (page 6), but then only certain endolysosomal proteins accumulate - and the authors speculate that this is due to decreased degradation or enhanced translation (mRNA levels are unaffected). This question should be addressed better, ideally experimentally: are endolysosomes accumulating in general or not? And what defines the endolysosomal proteins that accumulate vs. those that don't? HOw is that regulated?
      5. Fig. 1D: The GO terms are confusing. Why are there more proteins in the category lysosomal membrane than lysosome as a whole? Other categories seem to be overlapping as well.
      6. Fig. 2C/3A: It'd be good to also show the hits that don't match the expectation/pathways of interest.
      7. Fig. 3: It is not intuitive that synaptic proteins in particular would accumulate at the plasma membrane due to the lipid storage defect. Are they mis-trafficked or are they at synaptic membranes? That could, e.g, be addressed by isolating synaptosomes. And why this selectivity for synaptic proteins? Neurons should have more plasma membrane that is not synaptic. And, e.g, the release of lysosomal material should not happen at synapses (and lysosomes should not deliver synaptic proteins to the PM, unless there is a failure to degrade them). Or is there an effect on synaptic vesicles? Are there more? Do they deliver their cargo more readily? Or is there a failure to do endocytosis of synaptic proteins, and that's why the accumulate? What is the connection between SVs and endolysosomes? More clarity would be good here.
      8. Fig. 4: The assumption that there is more synaptic activity because there are more synaptic proteins at the membrane seems to be plausible, but also speculative at this point.
      9. The possible contribution of glial cells should at least be discussed.

      Minor: there are some typos etc.

      Significance

      General Assessment

      Strenghts:

      1. The data seem robust.
      2. From a descriptive point of you, there is new insight.
      3. New tools for the field are presented.
      4. Disease phenotypes are recapitulated.
      5. Several techniques are employed, protein and mRNA were studied.
      6. Protein and lipid changes are reported.

      Weaknesses:

      • see previous section for details
      • overall, the data are descriptive in nature and deeper insight into mechanisms would be desirable

      Advance:

      • New tools are presented that recapitulate diseases phenotypes
      • proteins, lipids and mRNAs are studied, and interesting effects are reported
      • GM2 lipid accumulation diseases will be understood better thanks to this study

      Audience:

      • Clinicians and basic researchers studying these diseases should be equally interested.
      • Clinicians and basic researchers studying neurodegenerative disease may also be interested (at least some)
      • lipid biologists will be interested

      About me:

      • cell biologist/protein biochemist studying Parkinson's disease
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study investigates the role of glycosphingolipids (GSLs), specifically gangliosides, in neurodegenerative diseases, focusing on GM2 gangliosidoses, which include Tay-Sachs and Sandhoff diseases. The authors employ advanced HEXA and HEXB KO i3Neuron-based models that successfully replicate key pathological features, such as GM2 accumulation, membrane whorl formation, and endolysosomal protein buildup, effectively mirroring the phenotypes of these conditions.

      Key findings include the impact of lysosomal dysfunction on plasma membrane (PM) composition, noting changes in both lipids and proteins. This effect is partially attributed to the exocytosis of lysosomal material, leading to an abnormal accumulation of GM2 and lysosomal proteins on the cell surface, reaching levels comparable to those of other neuronal gangliosides. Additionally, PM profiling reveals notable changes in synaptic proteins, contributing to neuronal hyperactivity, which may explain the functional deficits observed in GM2 gangliosidoses. This insight into neuronal dysfunction highlights the PM as a critical component of these disorders and extends its relevance to other lysosomal storage diseases and late-onset neurodegenerative diseases involving sphingolipid dysregulation. The manuscript is clear and engaging, and the high-quality data presented significantly advance the field. Below are some points the authors might want to address to further substantiate their conclusions:

      • Confirmation of Neuronal Differentiation: To confirm neuronal differentiation in their i3N cell model, the authors show qPCR results indicating the expression of mature neuronal markers and the downregulation of stem cell markers by day 14. However, single-cell RNA sequencing (scRNA-seq) could provide a more detailed evaluation of the differentiation process, addressing the fine-grained cell-type composition within the cell population. Depending on the results, the authors might more precisely interpret functional data and assess the possible influence of increased GM2 levels on cell fate decisions.
      • Mechanistic Links Between Lipid Accumulation and Proteomic Changes: The authors report specific proteome changes upon HEXA/B KO. What are the mechanistic links between lipid accumulation and proteomic changes? Is the overall degradative performance of lysosomes compromised? The authors note that certain proteins, such as TSPANs, can bind directly to GSL headgroups. Clarifying whether the observed proteomic changes result from specific, direct lipid-protein interactions versus indirect effects could strengthen the argument for targeted lipid-mediated proteomic shifts. Additionally, does this phenomenon extend to other sphingolipidoses (e.g., Gaucher disease)? Comparing the proteomes of i3N cells across different sphingolipidoses could reveal whether the accumulation of distinct GSLs produces unique or shared proteomic profiles, highlighting similarities or specificities across lysosomal storage disorders.
      • Impact of Increased PM GM2 Levels on Endocytic Pathways: Along similar lines, the authors show differences in the PM proteome and in the representation of specific PM lipid domain-associated proteins. As some of these proteins are turned over by mechanisms involving lipid domain-dependent endocytosis, the authors might want to examine the effect of increased PM GM2 levels on various endocytic pathways.
      • Multifaceted Nature of Gangliosidoses as PM Disorders: The manuscript presents an important perspective by reframing gangliosidoses as multifaceted PM disorders that disrupt neuronal function and membrane composition. By further elaborating on the connection between membrane lipid alterations, neuronal excitability, and synaptic composition, and by exploring the interplay with lysosomal dysfunction, the authors could provide a richer understanding of gangliosidoses and GSL function in general.

      Significance

      This study presents findings of considerable relevance not only to the sphingolipid research community but also to broader fields in cellular and neurodegenerative biology, as it exposes key conceptual novelties regarding the impact of GSL function and dysregulation. By identifying GM2 gangliosidoses as disorders affecting both lysosomal function and plasma membrane composition, the research sheds light on the complex pathophysiology that links lipid accumulation to neuronal dysfunction, highlighting an underappreciated dimension of these diseases.

      The study's main limitations lie in its incomplete exploration of the mechanisms by which GM2 accumulation in both lysosomes and the plasma membrane influences neuronal activity. Elucidating this connection more clearly would strengthen the mechanistic insight into how lipid dysregulation directly impacts neuronal excitability and synaptic composition, advancing the translational relevance of these findings.

      I am a lipid biologist, my expertise centers on the functional roles of complex lipids in cellular processes, membrane dynamics, and signaling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      A systemic analysis of the influence of these ego-1 alleles on fertility can provide valuable information on further studies on EGO-1's functions in fertility.

      We thank the reviewer for this insightful comment. We scored the brood size of all strains carrying a missense mutation at the ego-1 locus and added an extended figure showing their brood sizes as Fig. EV1A. Although the strain carrying gk721963, which was outcrossed six times with tmC18, showed a slightly reduced brood size, other strains showed no significant change in brood size compared to wild-type animals. The original strain carrying gk721963 has 24 homozygous mutations on chromosome I, where ego-1 is located. Of these, 15 mutations are in the region covered by tmC18, and 9 alleles are not covered. These background mutations may not be unremoved and affect fertility in concert with the ego-1 mutation. However, we believe that identifying the cause of this slight phenotype is very difficult and not essential to the overall analysis, so we have only presented the scored data for future studies on EGO-1's functions.

      The genotype of JMC231 is hrde-1(tor125[GFP::3xFLAG::hrde-1]) III. In line 245 and 551, HRDE-1::GFP is typed. typo?

      Thank you for pointing this out. We have corrected these for consistency.

      1. In Figure 4C, the fluorescence intensity in ego-1(S1198L) appears to be more than twice as high as the wild type animals, yet the mean intensity shows only mildly upregulated in Figure 4D. Is the images representative?

      Thank you for your comment. We agree that the fluorescence intensity in the original wild-type image may not have been representative. To address this concern, we have replaced the wild-type image in Fig. 3C (4C in the previous version) with an image that is more reflective of the average fluorescence intensity observed across the biological replicates.

      1. A brief introduction of tmC18 in the legend of Figure 6 would be friendly to readers.

      Thank you for your suggestion. We have added statements explaining tmC18 to the legend of Fig. 5 (Fig. 6 in the previous version) for clarity and to make the experiments more understandable.

      1. In the discussion section, a detailed summary of three recent published papers about the "phenotypic hangover" phenotype would help to understand how EGO-1 contribute to feeding RNAi. (Dodson & Kennedy, 2019; Lev et al., 2019; Ouyang et al., 2019).

      Thank you for the suggestion. We have incorporated a detailed summary of the "phenotypic hangover" phenotype in the discussion section.

      1. Has the authors examined the cellular localization of EGO-1(S1198L) ? Construction of gfp::ego-1(S1198L) animals would provide this information.

      We thank the reviewer for this insightful comment. We have generated the GFP::EGO-1(S1198L) strain and analyzed its subcellular localization and dynamics. These analysis revealed no abnormality in the expression, localization and dynamics of GFP::EGO-1(S1198L) compared to the wild type. The data are shown in Fig. EV3, and a section of the description about this is added to the third section of the Results.

      Reviewer #2

      Key conclusions are convincing, but data and stats need to be clarified in some cases (see below).

      Line 202-211: The found that znfx-1(-) partially restored sensitivity of S1198L mutants to pos-1 RNAi but did not significantly restore pop-1 RNAi. Later, section 228-243, they provide evidence that cde-1 and hrde-1 mutations partially restore sensitivity to pos-1, but not pop-1, RNAi. The authors should discuss what might be going on here.

      Thank you for your comment. We have added a discussion on the differential restoration of sensitivity to pos-1 and pop-1 RNAi in the presence of znfx-1, cde-1, and hrde-1 mutations, proposing that this variation may result from differences in the RNA metabolism of these target genes (Knudsen-Palmer et al., 2024). Additionally, we incorporated the results from the additional RNAi experiments targeting gld-1 and mpk-1 (as outlined in our response to Reviewer 3, Comment 3), which further support our proposed model. We hope this revision presents a more thorough analysis of the interplay between these mutations and RNAi sensitivity.

      Lines 276-279: Confusing as written. The authors do not show RNAi assays for germline genes with rrf-1(null) ego-1(S1198L) double mutants. They should show these data.

      Thank you for the feedback. We have added the RNAi assay data for germline genes with rrf-1(null) ego-1(S1198L) double mutants in Figure EV3C and D.

      For the wording, I suggest "RRF-1 compensates for partial loss of EGO-1 activity in S1198L with respect to 25{degree sign}C brood size (Fig. #), but not for germline exo-RNAi (Fig. #). Therefore, the defects..."

      Thank you for the suggestion. We have revised the wording as recommended.

      Minor comments Throughout, figure legends shown indicate the statistical test used, and the p value must be indicated (e.g., *** indicates p-value of #).

      The authors should use consistent nomenclature for the ego-1 null allele. In Fig. 5 it's listed as "" and elsewhere as tm521.

      Thank you for pointing this out. We corrected this in the revised manuscript.

      Line 90: Please include references for the ego-1 null germline phenotype.

      Thank you for your suggestion. We included two references demonstrating the ego-1 null germline phenotype in the revised manuscript.

      Line 107-109: Wording is confusing. I suggest "Disruption of the E granule, of which EGO-1 is a component, has recently been shown to upregulate sRNA targeting ..."

      Thank you for the suggestion. We have revised the wording as suggested.

      Line 118-120: Wording is unclear. I suggest "In addition we found that sid-1 and rde-11 transcripts in ego-1(S1198L) were downregulated, and this effect was suppressed in hrde-1, cde-1, and znfx-1 mutants."

      Thank you for the suggestion. We have revised the wording as suggested.

      Line 121-123: The meaning is unclear. Please clarify what "detached" means in this context.

      Thank you for the comment. We have revised the sentence to remove the term "detached" for clarity and have instead explicitly described the phenomenon, stating that the RNAi-defective (Rde) phenotype persists over generations in an RRF-1-dependent manner, even in the absence of the original ego-1(S1198L) mutation.

      Line 171-172: Substitute "in the genome" for "in terms of its genomic locus"

      Thank you for the suggestion. We have revised the wording as suggested.

      Line 207: Substitute "the pos-1 RNAi defect" for "the Rde phenotype of pos-1 RNAi"

      Thank you for pointing this out. We have revised the text as suggested.

      Line 269: Text says Fig 5A,B, shows restoration to "wt levels," but stats only show significant change from ego-1(S1198L). Stats showing comparison with wt should be shown, as well.

      Thank you for the comment. We have revised the text to clarify the expression levels and removed the statement about "restoration to wild-type levels" where statistical comparisons were not provided.

      The text refers to the wrong figure/panel in some places. Line310 references Fig. 6A-C as showing the phenotype of ego-1(+/-) heterozygotes and ego-1(+/+) homozygotes, but only the latter is shown in 6A-C. Heterozygotes are shown in Fig. 6D-F.

      Thank you for pointing this out. We have revised the statement accordingly.

      Line 350 should reference Fig. 7C, D (not Fig 3A).

      Thank you for your suggestion. We have corrected it to Fig. 6C, D (Fig. 7C, D in the previous version) as suggested.

      Line 380-381: Wording is awkward. I suggest "Additionally, this allele showed synthetic ts sterility with an rrf-1 deletion mutation."

      Thank you for pointing this out. We have revised the text as suggested.

      Figure 8: There is a typo in panel C: the allele shown is ego-1(null) not ego-1(S1198).

      Thank you for pointing this out. We have updated the allele to ego-1(null) in panel C.

      Reviewer #3

      1. The authors link the direct gene-silencing function of EGO-1 with temperature-sensitive sterility (Figure 8). However, the data in Figure 1 show that the RNAi resistance phenotype and ts-sterility are anti-correlated, the most RNAi-resistant ego-1 alleles are least ts-sensitive and vice versa. Therefore, motivating further experiments through the connection between exo-RNAi resistance and ts-sterility is not justified, e.g. "the temperature sensitive sterile phenotype is a hallmark of the mutator complex.... which is necessary for exo-RNAi-driven silencing". Also, the claim of the redundancy between ego-1 and rrf-1 in controlling ts-sterility is not justified. The ego-1(V1128E) and (C823Y) alleles show strong ts-sterility (Figure 1E), which is not compensated by RRF-1. Therefore, the specific nature of ego-1(S1198L) and (R539Q) mutations leads to a higher dependence of endogenous RNAi silencing processes on RRF-1. Remarkably, although the exo-RNAi resistance of these alleles is dominant (Figure EV2 A,B) and clearly distinct from ego-1 null heterozygous animals, the ts-sterility of ego-1 null heterozygouts and S1198L or R539Q heterozygouts is identical (Figure EV C).

      We thank the reviewer for the insightful comments. We have revised the second section of the Results to simplify the argument by removing descriptions related to WAGO 22G RNA and fertility. This revision ensures that our conclusions remain focused and directly address the observed genetic interactions. Additionally, we have expanded the Discussion to further clarify the specific nature of ego-1(S1198L) with respect to RRF-1.

      1. The experiments in Figures 6 and Figure 7C,D are the most important findings of this study, showing that EGO-1 has a role in the licensing of genes important for exo-RNAi in the germline (such as sid-1 and rde-11). The apparent persistence of RRF-1-dependent (and presumably HDRE-1-dependent) silencing of sid-1 and rde-11 in a genetically wild-type background that correlates with exo-RNAi resistance is remarkable, although not novel (it was shown for mutants defective in P-granules). The use of ego-1 missense viable background was instrumental in these experiments. However, it is not clear whether the specific nature of ego-1(S1198L) mutation also played a role, such as enhanced production of RRF-1-dependent endogenous silencing small RNAs. The ego-1(V1128E) allele is an apparent hypomorph, which is viable and exo-RNAi-resistant (Figure 1, EV2A). Performing an experiment shown in Figure 6 with this allele for five generations would be highly illuminating, and either outcome would be interesting.

      Thank you for this insightful comment. We agree that investigating whether the specific nature of the ego-1(S1198L) mutation contributes to the observed effects is essential. To address this, we performed the experiment shown in Figure 6 using the ego-1(V1128E) allele four generations and data is now shown in Fig. EV7.

      1. Conclusions from the experiments in Figures 3 and 4 are not convincing. The imaging data can be moved to supplemental materials. The suppression experiments shown in Figure 4A,B are weak. The effects of cde-1 mutation are hard to interpret, and these data can be omitted. The znfx-1 and hrde-1 loss does not affect resistance to pop-1. If the authors want to insist on their model, they should use several additional exo-RNAi target genes producing Emb (or other) phenotypes and repeat the experiments.

      Thank you for your valuable feedback. We agree with the concerns raised and have made the suggested changes, including moving the imaging data to Fig. EV4 and omitting the cde-1 data. Regarding the lack of suppression effects for pop-1, we acknowledge the need for further investigation and have performed additional exo-RNAi experiments with target genes gld-1 (Ste) and mpk-1 (Ste) to evaluate our model. Both znfx-1 and hrde-1 mutants significantly suppressed the Rde phenotype in ego-1(S1198L) when subjected to these RNAi, supporting our model. We have added these data in Fig. 3B and EV5A and moved the pop-1 RNAi data to Fig. EV5B.

      1. The exo-RNAi resistance and reduced sid-1 and rde-11 expression correlate. The reduction of these exo-RNAi factors is a plausible explanation for the epigenetic RNAi resistance shown in Figure 6. However, ego-1(S1198L); hrde-1(-) P0 is resistant to pop-1(RNAi) to a large extent (Figure 4B), while sid-1 and rde-11 expression is restored in this double compared to single ego-1(S1198L) (Figure 5B). Therefore, ego-1(S1198L) exo-RNAi resistance is not likely driven to any extent by the misregulation of other RNAi genes. The nature of the (S1198L) mutation is likely to play a major role. Also, surprisingly, rrf-1(-) addition to ego-1(S1198L) does not restore sid-1 and rde-11 expression. Why? The authors do not comment on this.

      Thank you for your detailed comment. To address your concerns, we will incorporate additional experimental data outlined in our response to Comment 3 and revised our description accordingly. Regarding the observation that rrf-1(-) addition to ego-1(S1198L) does not restore sid-1 and rde-11 expression, we hypothesize that this may result from the process by which the rrf-1 knockout was generated via CRISPR in an ego-1(S1198L) mutant background, where sid-1 and rde-11 expression was already reduced. This suggests that rrf-1 may not be required to maintain the reduced expression state once it is established. We will include these points in the revised manuscript.

      1. The discussion points about the nature of new EGO-1 missense mutations involving Alpha Fold predictions can be illustrated through Alpha Fold model figures.

      Thank you for your comment. We agree that illustrating the discussion points with Alpha Fold model figures would enhance clarity. We included an extended view figure based on Alpha Fold predictions to better visualize the structural implications of the EGO-1 mutations.

      1. The authors should consider a model where ego-1(S1198L) affects RRF-1 activity such that it is more active in the endogenous RNAi silencing processes at the expense of exo-RNAi. This could explain the reduced ts-sterility in ego-1(S1198L), which is RRF-1-dependent, similar to the better-investigated epigenetic inheritance of exo-RNAi resistance. However, the exact mechanism of ego-1(S1198L) cannot be explained by genetic methods and is beyond the scope of this study.

      Thank you for this insightful and critical comment. We agree that the interaction between ego-1(S1198L) and RRF-1 activity is an important aspect to consider. Based on the results from our additional experiments described above, we discussed about this possibility. We deeply appreciate your suggestion, as it provides valuable direction for interpreting our findings and developing a more comprehensive understanding of the mechanism.

      Minor comments:

      • Figure 8C typo: ego-(0) is meant to be shown.

      Thank you for pointing this out. We have updated the allele to ego-1(null) in panel C.

      • Pak and Fire, Science, 2007 should be cited in connection to secondary siRNA production. Ruby and Bartel, Cell, 2006 should be cited as the first study that identified 21U-RNAs.

      Thank you for pointing this out. We added citations to Pak and Fire (Science, 2007) in connection to secondary siRNA production and to Ruby and Bartel (Cell, 2006) as the first study identifying 21U-RNAs.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Mitani and colleagues' manuscript investigates the role of RNA-dependent RNA polymerase (RdRP) EGO-1 in regulating exogenous RNAi (induced by dsRNA delivery) efficiency in the germline of C. elegans. Since the null ego-1 mutation leads to sterility, the authors take advantage of several missense ego-1 mutant strains that are fertile but RNAi-resistant.

      Major comments:

      The authors recognize at least two distinct mechanisms of EGO-1 function in regulating exo-RNAi. The first is direct, since EGO-1 RdRP is required for the production of secondary small RNAs mediating exo-RNAi silencing (this mechanism has been studied for many years), and the second one is indirect, through the role of EGO-1 RdRP in the production of endogenous "licensing" small RNAs that allow germline gene expression, including expression of genes required for exo-RNAi response. In addition, the authors find that the chosen missense mutant strains show a dominant exo-RNAi resistance phenotype, unlike the recessive ego-1 null.

      Although the authors recognize the complex nature of ego-1 phenotypes and provide a helpful model in Figure 8, I find that not all conclusions are consistent with the presented data. A more rigorous data interpretation and presentation logic is required for publication. Also, some additional simple experiments can be done to enhance the rigor of conclusions.

      1. The authors link the direct gene-silencing function of EGO-1 with temperature-sensitive sterility (Figure 8). However, the data in Figure 1 show that the RNAi resistance phenotype and ts-sterility are anti-correlated, the most RNAi-resistant ego-1 alleles are least ts-sensitive and vice versa. Therefore, motivating further experiments through the connection between exo-RNAi resistance and ts-sterility is not justified, e.g. "the temperature sensitive sterile phenotype is a hallmark of the mutator complex.... which is necessary for exo-RNAi-driven silencing". Also, the claim of the redundancy between ego-1 and rrf-1 in controlling ts-sterility is not justified. The ego-1(V1128E) and (C823Y) alleles show strong ts-sterility (Figure 1E), which is not compensated by RRF-1. Therefore, the specific nature of ego-1(S1198L) and (R539Q) mutations leads to a higher dependence of endogenous RNAi silencing processes on RRF-1. Remarkably, although the exo-RNAi resistance of these alleles is dominant (Figure EV2 A,B) and clearly distinct from ego-1 null heterozygous animals, the ts-sterility of ego-1 null heterozygouts and S1198L or R539Q heterozygouts is identical (Figure EV C).
      2. The experiments in Figures 6 and Figure 7C,D are the most important findings of this study, showing that EGO-1 has a role in the licensing of genes important for exo-RNAi in the germline (such as sid-1 and rde-11). The apparent persistence of RRF-1-dependent (and presumably HDRE-1-dependent) silencing of sid-1 and rde-11 in a genetically wild-type background that correlates with exo-RNAi resistance is remarkable, although not novel (it was shown for mutants defective in P-granules). The use of ego-1 missense viable background was instrumental in these experiments. However, it is not clear whether the specific nature of ego-1(S1198L) mutation also played a role, such as enhanced production of RRF-1-dependent endogenous silencing small RNAs. The ego-1(V1128E) allele is an apparent hypomorph, which is viable and exo-RNAi-resistant (Figure 1, EV2A). Performing an experiment shown in Figure 6 with this allele for five generations would be highly illuminating, and either outcome would be interesting.
      3. Conclusions from the experiments in Figures 3 and 4 are not convincing. The imaging data can be moved to supplemental materials. The suppression experiments shown in Figure 4A,B are weak. The effects of cde-1 mutation are hard to interpret, and these data can be omitted. The znfx-1 and hrde-1 loss does not affect resistance to pop-1. If the authors want to insist on their model, they should use several additional exo-RNAi target genes producing Emb (or other) phenotypes and repeat the experiments.
      4. The exo-RNAi resistance and reduced sid-1 and rde-11 expression correlate. The reduction of these exo-RNAi factors is a plausible explanation for the epigenetic RNAi resistance shown in Figure 6. However, ego-1(S1198L); hrde-1(-) P0 is resistant to pop-1(RNAi) to a large extent (Figure 4B), while sid-1 and rde-11 expression is restored in this double compared to single ego-1(S1198L) (Figure 5B). Therefore, ego-1(S1198L) exo-RNAi resistance is not likely driven to any extent by the misregulation of other RNAi genes. The nature of the (S1198L) mutation is likely to play a major role. Also, surprisingly, rrf-1(-) addition to ego-1(S1198L) does not restore sid-1 and rde-11 expression. Why? The authors do not comment on this.
      5. The discussion points about the nature of new EGO-1 missense mutations involving Alpha Fold predictions can be illustrated through Alpha Fold model figures.
      6. The authors should consider a model where ego-1(S1198L) affects RRF-1 activity such that it is more active in the endogenous RNAi silencing processes at the expense of exo-RNAi. This could explain the reduced ts-sterility in ego-1(S1198L), which is RRF-1-dependent, similar to the better-investigated epigenetic inheritance of exo-RNAi resistance. However, the exact mechanism of ego-1(S1198L) cannot be explained by genetic methods and is beyond the scope of this study.

      7. Data and the methods are presented in such a way that they can be reproduced.

      8. Statistical analyses are adequate.

      Minor comments:

      • Figure 8C typo: ego-(0) is meant to be shown.
      • Pak and Fire, Science, 2007 should be cited in connection to secondary siRNA production. Ruby and Bartel, Cell, 2006 should be cited as the first study that identified 21U-RNAs.

      Significance

      General assessment:

      The strength of this study is in generating reagents suitable for performing experiments that were not feasible with the sterile null mutant. The major finding of the paper is the epigenetic inheritance of resistance to exo-RNAi by the wild-type descendants of ego-1 mutants, which is dependent on rrf-1. There are numerous weaknesses in the interpretation of other data, which are described in section 1. The study's limitation is the exclusive use of genetic approaches. The effect of the antimorphic point mutations on EGO-1 stability, localization, and interaction with other proteins could have provided more insight into the protein's function.

      • The most notable results presented in the paper are very similar to the findings of several groups published in 2019 (Lev et al., Ouyang et al, and Dodson and Kennedy) and, therefore, are not novel. The experimental setup is identical to Dodson and Kennedy; it just uses different mutants. The novel aspect is the opposite relationship between ego-1 and rrf-1, which has not been described before.
      • This research will be of interest to C. elegans researchers and those following epigenetic phenomena.
      • My expertise is in RNAi in C. elegans and epigenetics. I have sufficient expertise to evaluate all aspects of the paper.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      EGO-1 is a C. elegans RNA-directed RNA polymerase well known to amplify small-interfering (si) RNA in the germline and to be required for germline development. The authors screened several partial loss-of-function mutations in ego-1, identified in the million mutation project collection, and identified one that does not reduce brood size yet is RNAi defective (Rde). Null and most other ego-1 mutations are completely sterile and strongly Rde. The newly identified allele, which the authors call S1198L, does not disrupt fertility at moderate culture temperatures yet severely disrupts RNAi, indicating that sterility is separable from the Rde phenotype. S1198L mutants do have reduced fertility at elevated culture temperature; this phenotype is enhanced by a rrf-1 null mutation, suggesting these two RdRPs are redundantly required for fertility under conditions of temperature stress. Using S1198L, they explore the relationship between EGO-1 and expression or function of other components and regulators of the small RNA machinery as well as components of germ granules (RRF-1, HRDE-1, PGL-1, CDE-1/PUP-1, ZNFX-1). One very interesting characteristic of ego-1(S1198L) is that it has a dominant RNAi defect, unlike null alleles; therefore, the EGO-1(S1198L) protein may interfere with EGO-1 wt activity. It seems likely that this allele will be useful for exploring additional aspects of EGO-1 activity beyond those included in this report.

      Major comments

      Key conclusions are convincing, but data and stats need to be clarified in some cases (see below).

      Line 202-211: The found that znfx-1(-) partially restored sensitivity of S1198L mutants to pos-1 RNAi but did not significantly restore pop-1 RNAi. Later, section 228-243, they provide evidence that cde-1 and hrde-1 mutations partially restore sensitivity to pos-1, but not pop-1, RNAi. The authors should discuss what might be going on here.

      Lines 276-279: Confusing as written. The authors do not show RNAi assays for germline genes with rrf-1(null) ego-1(S1198L) double mutants. They should show these data. For the wording, I suggest "RRF-1 compensates for partial loss of EGO-1 activity in S1198L with respect to 25{degree sign}C brood size (Fig. #), but not for germline exo-RNAi (Fig. #). Therefore, the defects..."

      Minor comments

      Throughout, figure legends shown indicate the statistical test used, and the p value must be indicated (e.g., *** indicates p-value of #).

      The authors should use consistent nomenclature for the ego-1 null allele. In Fig. 5 it's listed as "" and elsewhere as tm521.

      Line 90: Please include references for the ego-1 null germline phenotype.

      Line 107-109: Wording is confusing. I suggest "Disruption of the E granule, of which EGO-1 is a component, has recently been shown to upregulate sRNA targeting ..."

      Line 118-120: Wording is unclear. I suggest "In addition we found that sid-1 and rde-11 transcripts in ego-1(S1198L) were downregulated, and this effect was suppressed in hrde-1, cde-1, and znfx-1 mutants."

      Line 121-123: The meaning is unclear. Please clarify what "detached" means in this context.

      Line 171-172: Substitute "in the genome" for "in terms of its genomic locus"

      Line 207: Substitute "the pos-1 RNAi defect" for "the Rde phenotype of pos-1 RNAi"

      Line 269: Text says Fig 5A,B, shows restoration to "wt levels," but stats only show significant change from ego-1(S1198L). Stats showing comparison with wt should be shown, as well.

      The text refers to the wrong figure/panel in some places.<br /> Line310 references Fig. 6A-C as showing the phenotype of ego-1(+/-) heterozygotes and ego-1(+/+) homozygotes, but only the latter is shown in 6A-C. Heterozygotes are shown in Fig. 6D-F.<br /> Line 350 should reference Fig. 7C, D (not Fig 3A).

      Line 380-381: Wording is awkward. I suggest "Additionally, this allele showed synthetic ts sterility with an rrf-1 deletion mutation."

      Figure 8: There is a typo in panel C: the allele shown is ego-1(null) not ego-1(S1198).

      Significance

      The paper addresses the mechanisms and activity of small RNA-mediated pathways, including in regulating gene expression and development. The work will be general interest to the large community studying small RNA-mediate gene expression and/or germline development in C. elegans and more broadly. The work is significant because it reveals distinct requirements for EGO-1 RdRP in exo-RNAi, germline development under conditions of temperature stress, and germline development more broadly.

      I am a C. elegans biologist with many decades of experience studying germline development and RNAi-related phenomena.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study conducted by Katsufumi Dejima and colleagues represents an advance in understanding the multiple roles of RdRPs in C. elegans germ cells. EGO-1 is an essential RdRP that is required for multiple aspects of C. elegans germline development and efficient RNAi of germline-expressed genes. Yet, currently there is a lack of sufficient genetic mutants to differentiate the multiple biological functions of EGO-1. In this study, the authors examined a large number of non-null alleles for ego-1 gene and identified four alleles that affect exogenous RNAi, while does not compromise fertility. The authors then focused on the allele ego-1(S1198L), examined its influence on germ granule compartments and investigated the molecular mechanism of EGO-1's involvement in feeding RNA interference. Together, their work reveal an extensive interdependent RdRP network that is responsible for regulating exo-RNAi in the germline.

      Overall, this is a well-executed study that uncovers the molecular mechanism of EGO-1' function in germline RNAi response and the multiple roles of EGO-1 and RRF-1 in regulating germline RNAi. The findings are poised to have an impact on RNAi research fields.

      I have a few comments below. While they are largely minor, addressing them would further enhance the manuscript's clarity and impact.

      1. A systemic analysis of the influence of these ego-1 alleles on fertility can provide valuable information on further studies on EGO-1's functions in fertility.
      2. The genotype of JMC231 is hrde-1(tor125[GFP::3xFLAG::hrde-1]) III. In line 245 and 551, HRDE-1::GFP is typed. typo?
      3. In Figure 4C, the fluorescence intensity in ego-1(S1198L) appears to be more than twice as high as the wild type animals, yet the mean intensity shows only mildly upregulated in Figure 4D. Is the images representative?
      4. A brief introduction of tmC18 in the legend of Figure 6 would be friendly to readers.
      5. In the discussion section, a detailed summary of three recent published papers about the "phenotypic hangover" phenotype would help to understand how EGO-1 contribute to feeding RNAi. (Dodson & Kennedy, 2019; Lev et al., 2019; Ouyang et al., 2019).
      6. Has the authors examined the cellular localization of EGO-1(S1198L) ? Construction of gfp::ego-1(S1198L) animals would provide this information.

      Significance

      Strength: Enough genetic alleles to differentiate the multiple biological functions of EGO-1.

      Limitations: Whether mutant alleles affect siRNA production is unknown.

      Advance: The multiple functions of RdRp protein were analyzed through genetic means.

      Audience: Basic research, small RNA community and C. elegans community

      My expertise: small RNA and germ granule.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      • We thank the reviewers for their useful suggestions regarding how to improve our manuscript.
      • Reviewer 3 declared that s/he did not find and evaluate the provided Supplementary Materials. As a result, many of her/his criticisms seem invalid: the requested data, validations etc. were already there in the Supplementary Figures and Tables.
      • To avoid confusion, we renamed the transgene that is commonly used as a readout for STAT-activated transcription from 10xStat92E-GFP to 10xStat92E DNA binding site-GFP (please see comments by Reviewer 2 that show how easily one can think that Stat92E protein levels go up because of the misleading name of this transgene).
      • One co-author, Martin Csordós was among the authors by mistake. Although first considered, his contribution was not included in either the original or the current manuscript version, so we removed his name from the revised version with his permission.
      • We prefer to use colour coding for Sections 2., 3. and 4. in our responses to Reviewer comments rather than splitting the responses to queries in separate sections, because many of our answers contain a mixture of planned experiments (labeled as bold), already available data (labeled as underlined), and *explanations why we think that no additional analyses are necessary* (between asterisks). Data already provided in the original submission but missed by Reviewers has white background in our responses. Reviewer comments

      Reviewer 1

      Major comments:

      R1/1. ”Figure 6E seems to indicate that a subset of Su(var)2-10/PIAS isoforms may bind to ATG8 (directly or indirectly). This leads to the straightforward prediction that this subset should be differentially affected by the selective autophagy at the center of the manuscript. That could be tested to strengthen that point. “

      Response:

      The Atg8a-binding subset of Su(var)2-10/PIAS isoforms could indeed be differentially affected by selective autophagy__. To test this, we will analyze in vivo Su(var)2-10 isoform abundance on western blots with an anti- Su(var)2-10 antibody in __Atg8aΔ12and ____Atg8aK48A/Y49A (Atg8aLDS) mutants.

      Minor comments:

      R1/2. “ in Fig S1B,C the colocalization between GFP reporters for STAT92E and AP-1 activity and glia marker does not seem convincing, indicating other cell types may be expressing them as well.”

      *Response: *

      *The overlap between glia labelling and STAT92E and AP-1 transcriptional readout reporter expression is indeed not complete. First of all, epithelial cells in the wing display both STAT92E and AP-1 activity even in uninjured conditions when glial expression of these reporters is not yet observed. Transcriptional reporter activity outside of the wing nerve was previously indicated in figures with arrowheads, now the epithelium is labeled and the regions containing nerve glia are outlined everywhere. *

      The fiber-like reporter expression after injury in the wing nerve could correspond to either glia or axons1–3. Glia in the wing nerve have a filament-like appearance resembling axons in confocal images, even glial nuclei are flat/elongated1. Importantly, STAT92E enhancer-driven GFP also labels the nucleus in expressing cells, as opposed to glially driven mtdTomato that is membrane-tethered (and thus excluded from the nucleus: see Fig. S1B, C). Of note, TRE-GFP and Stat-GFP are not expressed in neurons because the cell bodies and nuclei of wing vein neurons are never GFP-positive, see Fig. 2C, Figs. S1, S4 in Neukomm et al.1 and Figure 1 for Reviewers. We also explain this better now in the revised manuscript (please see the legend of Fig. S1).

      Nonetheless, we plan to analyze colocalization of mtdTomato-labeled neurons and TRE-GFP and Stat-GFP around the neuronal cell bodies to unequivocally show their different identities. Additionally, we will include transverse confocal sections of the genotypes in Fig. S1B, C that may better illustrate the colocalization.

      Fig. 1 for Reviewers. Neuronal (nSyb+) and Stat92E-GFP+ cell morphology in the L1 vein at the anterior wing margin around the neuronal cell bodies which occupy a stereotypical position at the sensilla1. The location and shape of neuronal nuclei (left panel) are different from Stat-GFP+ cell nuclei (right panel, please see also Fig. S1B, C) based on the circumferential GFP signal. Therefore, cells expressing TRE-GFP and Stat-GFP in injured wing nerves are glia and not neurons.

      R1/3. “p.7 Instead of "Su(var)2-10 is mainly nuclear due to its transcriptional repressor and chromatin organizer functions" It may be better to say" .. .consistent with its transcriptional repressor and chromatin organizer functions"”

      Response:

      We have modified the manuscript accordingly.

      R1/4. It is not clear whether the differences in Su(var)2-10/PIAS accumulation between Atg16 and Atg101 RNAi indicate functional differences of blocking autophagy at different stages or simply differences in RNAi efficiency (Atg16) versus the Atg101 mutant.”

      Response:

      We have added glial Atg1 (the catalytic subunit of the autophagy initiation complex that also includes Atg101) knockdown experiments that show the same lack of Su(var)2-10 accumulation in uninjured conditions as seen in the Atg101 null mutant (please see Fig. S6C). Please note that Atg16-Atg5-Atg12 dependent conjugation of LC3/Atg8a is involved in various vesicle trafficking pathways in addition to autophagy4–6, alterations of which may perturb baseline Su(var)2-10 levels in uninjured animals.

      Significance:

      R1/5. “STAT92E-dependent glial upregulation of vir-1, but not Draper, is shown, but consequences for glial functions in nerve injury are not tested.”

      Response:

      We will test antimicrobial peptide (AMP) expression in glia after nerve injury and whether this is affected by STAT92E and vir-1. Certain AMPs such as Attacin C are known to be regulated by both the Stat and NF-____κΒpathways7, and AMPs can be generally upregulated in response to brain injury8,9. This could serve pathogen clearance functions after defence lines such as the epithelium and blood-brain barrier are compromised. In addition, we will test the recruitment of glial processes into the antennal lobe after olfactory nerve injury in animals with glial STAT92E or vir-1 deficiency. Glial invasion is an adaptive response to axon injury and a first step towards debris clearance10.

      R1/6. “experiments indicate a role for Su(var)2-10/PIAS SUMOylation activity in tis autophagic degradation, but it is not clear whether the critical substrata Su(var)2-10/PIAS itself or another protein.”

      “binding of Su(var)2-10/PIAS to ATG8 is indicated, but no in vitro experiment performed to test whether this is direct and perhaps SUMOylation dependent.”

      Response:

      *We aimed to answer this question by using a point mutant form of Su(var)2-10: CTD2, which is unable to properly autoSUMOylate itself11, see Fig. 6D. CTD2 mutant Su(var)2-10 levels increased in S2 cells transfected with the mutant construct relative to the wild-type, similar to lysosome inhibition affecting the wild-type protein level but not the mutant variant. Importantly, wild-type Su(var)2-10 is present in CTD2 mutant Su(var)2-10-transfected cells, which can still SUMOylate other Su(var)2-10 targets. It is thus the intrinsic SUMOylation defect of the CTD2 mutant that results in its impaired degradation. It is firmly established that increased Su(var)2-10/PIAS levels repress STAT92E activity12, mammalian example: Liu et al., 199813, pointing to Su(var)2-10 as the critical substrate for autophagy during STAT92E derepression.*

      We will further address this point and investigate if Su(var)2-10 directly binds to Atg8a by in vitro SUMOylation of GST-Su(var)2-10 and subsequent GST pulldown assay with HA-Atg8a. In vitro SUMOylation reaction with purified GST-Su(var)2-10 and negative controls are available via in-house collaboration11. We will incubate the resulting proteins and non-SUMOylated counterparts with in vitro transcribed /translated HA-Atg8a, and interactions will be tested by anti-HA western blotting with quantitative fluorescent LICOR Odyssey CLX detection.

      Reviewer 2

      Major comments:

      R2/1. The working hypothesis is that upon injury, Su(var)2-10 is degraded by autophagy and, as a consequence, Stat92E induces vir-1 expression.

      Could the authors clarify why do Stat92E levels increase upon injury? Does Stat92E stability increase upon ATG mediated Su(var)2-10 degradation? Or does it expression/nuclear translocation change?“

      Response:

      We did not state that Stat92E levels increase during injury - we only used the 10xStat92E DNA binding site-GFP reporter (we have renamed it as such in our revised manuscript to avoid confusion) that is commonly referred to as 10xStat92E-GFP in the literature14, as a readout for Stat92E-dependent transcription.

      To address these questions, we will use an endogenous promoter-driven STAT92E::GFP::FLAG protein-protein fusion transgene (https://flybase.org/reports/FBti0147707.htm) to test if STAT92E stability/expression or translocation is altered during injury or upon disruption of selective autophagy. We have already tested this reporter and it is detected in the wing nerve nuclei after injury (Figure 2 for Reviewers, panel A).

      As the Atg8aLDS mutation specifically impairs selective autophagy, we will use this mutant and wild-type controls to assess STAT92E::GFP::FLAG abundance on western blots from fly lysates with anti-GFP antibody. To assess STAT92E::GFP::FLAG nuclear translocation as well as stability/expression, we will use independently Atg8aLDS and Su(var)2-10 RNAi in glia to perturb STAT92E -dependent transactivation and visualize glia cell membrane by membrane-tethered tdTomato, glial nuclei by DAPI/anti-Repo and STAT92E with the STAT92E::GFP::FLAG fusion transgene in dissected brains. We can also evaluate STAT92E nuclear translocation with the same genotypes in the injured wing nerve glia. Of note, studies in mammals failed to identify an obvious effect of PIAS1 on STAT1 abundance13, please see Figure 2B from this paper as Figure 2 for Reviewers, panel B. Rather, PIAS family proteins bind tyrosine-phosporylated STAT dimers and impair their DNA binding thereby their transcriptional activation function15.

      A.

      Proc. Natl. Acad. Sci. USA Vol. 95, pp. 10626–10631

      https://doi.org/10.1073/pnas.95.18.10626.

      Fig. 2 for Reviewers.

      1. Stat92E::GFP::FLAG expression and nuclear appearance in the wing nerve before and after injury
      2. Increasing PIAS1 (Su(var)2-10 ortholog) levels does not affect STAT1 abundance in mammalian cells R2/2. Also, since Su(var) levels increase upon ATG RNAi, independently of injury, do ATG levels increase upon injury? It does not seem to be the case from Fig 6D, but then, if the ATG levels do not increase, how to explain the injury mediated effects of Su(var)2-10? “

      Response:

      *We have not seen an effect of injury on the rate of autophagic degradation (flux) using the common flux reporter GFP-mCherry -Atg8a in glia after injury (shown in Fig. S2D – not 6D). Also, levels of the typical autophagic cargo p62/Ref(2)P and core autophagy proteins such as Atg12, Atg5, Atg16 do not change after nervous system injury16suggesting no change in general autophagic turnover. *

      *An increase in general autophagy would be one option to promote degradation of a given cargo. Just as for the ubiquitin-proteasome system, in selective autophagy the labelling of the cargo/substrate for degradation is a regulated process. Dynamic ubiquitylation of a cargo often promotes its autophagic degradation17. We hypothesize that SUMO may fulfil a similar role in labelling cargo for elimination and this may be promoted by injury in the case of Su(var)2-10, which warrants future studies. *

      R2/3. “Su(var)2-10 levels in control and injured wings are different between ATG18RNAi and ATG101 mutant (Fig 5). Could the authors explain the rational for using two ATG mutants? and the meaning of this difference? Also, why comparing data using the RNAi approach and a mutation?”

      Response:

      This issue was also raised in R1/4 and we refer the Reviewer/Editor to that section for our new Atg1 knockdown data and explanations.

      *There is a consensus in the autophagy community that mutants for multiple Atg genes should always be used to ensure that it is indeed canonical autophagy that is affected (because Atg proteins can have non-autophagic roles, as is the case for Atg16 in regulation of phagosome maturation - LAP). *

      R2/4. “Fig 6 What is the relevance of the Atg8, Sumo and Su(var)2-10 colocalization at puncta, since there is a lot of colocalization outside the puncta and also lots of Su(var)2-10 or Atg8 labeling that does not colocalize? “

      Response:

      *Su(var)2-10 orthologs PIAS1-4 localize to the nuclear matrix and certain foci in the chromatin and may play roles in heterochromatin formation, DNA repair, and repression of transposable elements in addition to transcriptional repression18–20. SUMO-modified proteins accumulate in response to PIAS activity in phase-separated foci also referred to as SUMO glue21. We show colocalization of Atg8a with similar Su(var)2-10 and SUMO double positive structures in foci. *

      *We do not expect a full overlap between Su(var)2-10 and Atg8a labeling for a number of reasons. First, Su(var)2-10 has many different roles that may not be regulated by autophagy. Second, Atg8a+ autophagosomes in the cytoplasm deliver not only indidivual proteins such as Su(var)2-10 for degradation but also many other cellular components. Third, nuclear Atg8a is implicated in the removal of the Sequoia transcriptional repressor from autophagy genes that is unlikely to involve Su(var)2-1022. Now we include these points in the Discussion section.*

      R2/5. “The statement made in the first sentence of the discussion is very strong: 'we have uncovered an activation mechanism for Stat92E', without sufficient supporting evidence.”

      Response:

      We have rephrased this section as follows:

      Here we have uncovered the autophagy-dependent clearance of a direct repressor of the Stat92E transcription factor. This, synergistically with injury-induced Stat92E phosphorylation, may ensure proper Stat92E-dependent responses in glia after nerve injury to promote glial reactivity.

      R2/6. “Could the authors validate (some) expression data by in situ hybridization experiments?”

      Response:

      *Our gene expression data were derived from wing nerve imaging or wing tissue. Unfortunately, in situ hybridization is not feasible in this organ because probes do not penetrate the thick chitin-based cuticule and wax cover of the wing (and the same is true for wing immunostaining).* We do provide independent evidence for vir-1 upregulation in the wing after injury via quantitative PCR (qPCR) in Fig. S5C. To corroborate reporter-based data, we will also analyze drpr in qPCR using wing material after injury at the same time points.

      R2/7. “Could the authors validate the RNAi lines molecularly (or refer to published data on these lines?”

      Response:

      *Almost all RNAi lines have already been validated by qPCR, western blot, or immunostaining in Szabo et al., 202316 and other publications23–25. The only exception is Su(var)2-10JF03384 and we show that it is indistinguishable from the validated Su(var)2-10HMS00750 RNAi line (which causes 95% transcript reduction): it also strongly derepresses STAT activity. These reagents have also been widely used in the community (e.g. https://flybase.org/reports/FBal0242556.htm, https://flybase.org/reports/FBal0233496.htm).*

      R2/8. „Clarifying the role of Su(var)2-10 on Stat92E would benefit to the presented work. Does Atg8-Su(var)2-10 binding affect Stat92E accumulation, expression, translocation to the nucleus? Some of these experiments could be obtained in S2 cell transfection assays, if too complex in vivo.”

      Response:

      As explained in R2/1, we will use an endogenous promoter-driven STAT92E::GFP::FLAG protein-protein fusion transgene to test if STAT92E stability/expression or translocation is altered upon disruption of selectiveautophagy (in Atg8aLDS mutant flies).

      R2/9. „Also, what happens to the axons in the mutant conditions described in the manuscript? This would higher the impact of the work, but would require in vivo work with fly stocks containing several transgenes.”

      Response:

      We have already published in our previous paper, Szabo et al., 202316 that the mutants used in the current study display normal axon morphology__. There are only two mutants that we did not test in that paper: Atg8aLDS and our new Atg8anull and we will examine these remaining two during the revision, __but we already published in the above paper that axons appear normal in Atg8aΔ4, a widely used Atg8a mutant allele.

      R2/10. „It has been published that Draper is involved in the response to injury in the adult wing nerve. See for example Neukomm et al (2014). The authors should discuss how this fits with their hypothesis and data. In this respect, Fig S4B, which should support the hypothesis, should be improved. It is rather hard to interpret it.”

      Response:

      Fig. S3 (draper protein trap-Gal4 driven GFP-RFP reporter expression) and S4B (intronic STAT92E binding site of the draper gene driven GFP-RFP reporter expression) show similar results: drpr is already expressed in wing nerve glia before injury, which is in line with Draper’s crucial role in the injury response because Draper-mediated glial signaling triggers glial reactivity. This has been added to the Discussion.

      Minor comments:

      R2/11. „Rubicon is also a negative regulator of autophagy (doi:10.1038/s41598-023-44203-6). in (Fig2 B, D) we have a higher GFP intensity in both uninjured and injured, and the difference between Injured/uninjured is less significant compared to control. It is possible that Rubicon KD causes more autophagy leading to a higher activation of Stat92E even in control. I wouldn't take the results as a proof of canonical autophagy implication and not LC3-associated phagocytosis”

      Response:

      Loss of Rubicon could indeed potentially remove more Su(var)2-10 via increased autophagy, leading to higher Stat92E activity. However, there is no statistically significant difference between injured and uninjured controls and injured and uninjured Rubicon knockdown, respectively, in Fig2 B, D (p=0.6975 and >0.9999 for each comparison). We are puzzled by the statement that the reviewer „wouldn't take the results as a proof of canonical autophagy implication and not LC3-associated phagocytosis”. We analyzed Rubicon as a factor critical for LAP and its deficiency does not prevent Stat transcriptional activity following injury unlike the loss of Atg8a, Atg16, Atg13 and Atg5. We will further support this result with a mutant of Atg16 with part of the WD40 domain deleted, because this region is critical for LAP but not for autophagy.16,26,27

      R2/12. „The rationale for using both repoGal4 and repoGS is unclear. If, as mentioned, the goal is to avoid developmental defects, repoGS should be consistently used. Especially I don't understand how both were utilized to knock down the same genes, such as Atg16”

      Response:

      *We had to use repoGS (a drug-inducible Gal4 active in glia) because knocking down Su(var)2-10 with repoGal4 resulted in no viable adult progeny. Su(var)2-10 is an essential gene as opposed to most autophagy genes and its absence results in embryonic lethality24. Thus all Su(var)2-10 silencing experiments were done with repoGS. Similarly, Stat92E is involved in various developmental processes and its loss is embryonic lethal. repoGal4 was used for genes generally not having an adverse effect when absent during development16 in the first two figures. In Fig. 4D, we silenced Atg16 by repoGS because it is one of the controls for testing a genetic epistasis between Su(var)2-10 and Atg16. Please note that we see exactly the same phenotype in case of Atg16 knockdown when using either Gal4 version.* This has been explained in the revised methods section.

      R2/13. „In the third paragraph of the introduction, I am confused whether Stat92E regulates drpr of the reverse”

      Response:

      Upon antennal injury, Drpr receptor binding to phagocytic cargo initiates a positive feedback loop in glial cells to promote its own transcription28. Drpr receptor in the plasma membrane regulates Stat92E and AP-1 activity via signal transduction. Stat92E and AP-1, in turn, increases drpr transcription10,28–30 that will result in more plasma membrane Drpr protein expression. We have explained this more clearly in the revised Introduction.

      R2/14. „I cannot find the evidence for vir-1 being expressed in glia and target of Gcm in the refences that have been cited.”

      Response:

      We apologize for not explaining this better: vir-1 is called CG5453 in Freeman et al., 200331. It is listed in Table 1 as a Gcm target since there is no detectable CG5453 expression in a Gcm null mutant, please see below. We have updated the manuscript with this gene name.

      .....

      .....

      Part of Table 1 from Freeman et al., 200331.

      R2/15. „The presence of a Stat92E binding site on the vir-1 promoter has already bene described in the paper from Imler and collaborators, Nature immunology 2005. Actually, if this site is present in their transgenic line, it would help the authors strengthen the argument that Stat92E has a direct role on vir1 (for which they make a very strong statement in the discussion, with no direct evidence).”

      Response:

      *The evidence that Stat92E may have a direct role in vir-1 transcription in glia comes exactly from the same reporter transgene described by Imler and collaborators in the mentioned paper32. We received this transgenic line from the Imler group and monitored its expression after injury upon depletion of Stat92E (Fig. 3B). It thus contains the studied Stat binding site. This was referenced in the Methods and in all relevant sections of the main text, and we now explicitly state this in the revised text.*

      R2/16. In the Fig S2D, I do not see a lot of GFP+ (Glia) cells. I see more Atg8a in injured 3 dpi regardless of colocalization with glia”

      Response:

      Fig S2D uses one of the standard assays for autophagic turnover, which we now explain in more detail in the Results section. Basically, the dual tagged GFP::mCherry::Atg8a transgene is expressed in glia, and GFP is quenched in lysosomes after delivery by autophagy while mCherry remains fluorescent. So, in addition to double positive dots (autophagosomes), there are mCherry dots lacking GFP (autolysosomes) if autophagy is functional. All of these dots are in glia but the cell boudaries are not visible.

      The images shown are single optical slices. The number of mCherry+ puncta are around 7-8 per field in both uninjured and injured (3 dpi) conditions, but puncta brightness is always variable. Since most mCherry+ puncta were rather bright in the original 3 dpi image, we changed it to a more representative image.

      R2/17. „The quantification of the signals is made in a specific region of the wing, I guess throughout the nerve thickness. This could be represented more carefully in a schematic and It would also help defining colocalization in the first figure, by using a transverse section.”

      Response:

      The quantification method is described in Materials and Methods and we have added that quantification was done on single optical slices. The imaged region is depicted in Fig. S1A, where we indicated the rectangular region used in Fiji for image quantification. We will add transverse sections of wings as suggested.

      R2/18. „A number of ATG genes are considered in the manuscript, but the rational for using them is not always clear. Showing a schematic would help clarify this. „

      Response:

      We have added a table showing the different steps of autophagy where the studied Atg genes/proteins function (now Supplementary Table 1). We also added whether the gene is considered specific for autophagy or can play a role in another process, e.g. LAP. We studied different autophagy genes in line with the assumption that disabling distinct autophagic complexes should produce the same phenotype if this process is indeed autophagy (and not LC3-associated phagocytosis for example).

      R2/19. „Fig 7 is not cited and its legend is very short.”

      Response:

      We have now cited Fig 7 and expanded its legend.

      R2/20. „Clarify the color coding in Fig S1E”

      Response:

      We added that red is injured, black is uninjured.

      R2/21. „What is the tandem tagged autophagic fly reporter in fig S2D?”

      Response:

      This is one of the most common tools to study autophagy, please see the updated explanation above at your first question regarding Fig. S2D.

      R2/22. „Add a schematic on the vir-1 isoforms.”

      Response:

      We have added a a schematic showing the vir-1 isoforms in Fig. S5B.

      R2/23. „Fig S6B and Fig 5 relate on the levels of Su(var)2-10 upon Atg16 RNAi, but the scale is not the same, why?”

      Response:

      *The scales are different because these two images measure different things. Fig. 5 indeed displays quantification of Su(var)2-10 levels in brain glia. However, Fig S6B shows quantification of Stat92E-induced GFP reporter levels (as a proxy of Stat92E transcriptional activity) in the wing nerve upon Atg16 knockdown. *

      Reviewer 3

      R3/1. „The claim that the negative regulator of Stat92E signaling is removed by selective autophagy, involving selective autophagy receptors different from/in addition to Ref(2)P/p62 is not convincingly shown. This claim probably needs to be softened.”

      Response:

      *We have rephrased this sentence as follows: *

      „These data suggest that selective autophagy is involved in Stat92E-dependent transcriptional activation in glia.”

      R3/2. „The reporter that was used (10xSTAT92E-eGFP) is not a dynamic reporter of STAT92E activity. It accumulates in glia and is highly stable. The appropriate reporter to look at dynamic changes would be 10XSTAT92E-dGFP, which has a degradable (unstable) GFP that is required to see dynamic changes even in the CNS. All of the claims about STAT92E regulation use this reporter, so they are questionable.”

      Response:

      10XSTAT92E-dGFP featuring destabilized GFP could be a more appropriate tool for monitoring dynamic changes in transcription when short term- e.g. few hours - changes are investigated. However, we did not see any expression of 10XSTAT92E-dGFP (we tried 2 different transgenic insertions) in the wing nerve, please see Figure 3 for Reviewers. In the brain, dGFP expression with this reporter is also several times lower than stable GFP, please compare Fig. 4A and B in Doherty et al28.

      The use of 10xSTAT92E-eGFP to follow dynamic expression changes is justified by many lines of evidence. First, there is no 10xSTAT92E-EGFP expression in uninjured wing nerves (Fig. S1D,E). Injury induces EGFP expression in the wing nerve with a sustained activation from 1 to 3 dpi (days post injury), and the EGFP expression returns to the baseline by 5 dpi (Fig. S1D, E). Second, the initial Stat-dependent upregulation of drpr and the 10XSTAT92E-dGFP signal in the brain both occur in the first 24 hours after injury and are sustained for 72 hours28 similar to our results with 10xSTAT92E-EGFP ((Fig. S1D,E). These results indicate that the dynamics of 10xSTAT92E-EGFP expression allows monitoring changes in Stat-dependent transcription occurring over days.

      Figure 3 for Reviewers. Lack of 10XSTAT92E-dGFP signal in the wing nerve from two independent insertions of the same transgene at the indicated time points after wing injury.

      R3/3. „The claim that glial drpr is not upregulated by wing injury and drpr accumulation is not apparently a prerequisite for efficient debris processing within the wing is weak. First, they did not stain for Draper using antibodies, rather they used expression constructs. Dee7 is a promoter that was found to be injury activated in the CNS (were they able to replicate that result? I did not receive the supplemental data), but it might not be the crucial regulator in the periphery. The MIMIC line that was converted is better, but might not represent the full spectrum of regulatory events at the draper locus. Finally, they never actually test for endogenous RNA changes, or use the antibody on westerns. Their lack of evidence is not as compelling as it could be.”

      Response:

      The__ original Supplemental Material already provides answers for this and subsequent questions of Reviewer 3__. We deposited the Supplemental Material to bioRxiv at the time of the first Review Commons submission and it was/is available at https://www.biorxiv.org/content/10.1101/2024.08.28.610109v2.supplementary-material.

      Figs. S3 and S4 show in the wing and the brain (using two different drpr reporters for its transcriptional regulation) that drpr expression does not change much in the wing after nerve injury, as opposed to the brain.

      *We did indeed replicate that dee7-Gal4 expression is induced in the brain after antennal injury using UAS- TransTimer (Fig. S4A). In contrast, wing cell nuclei already show expression of both fluorescent proteins in uninjured conditions, and RFP+ nucleus numbers do no change after wing injury (Fig. S4B, C). drpr-Gal4 was generated by conversion of a MiMIC gene trap element into a Gal4 that traps all transcripts. drprMI07659 is in an intron that is common in all drpr isoforms so it should capture the regulation of all transcript isoforms. *

      We will further analyze drpr expression via independent methods during the revision: qPCR amplification of a common region of drpr transcripts, and western blot with anti-Drpr antibody to compare injured and uninjured wing material. Of note, we see no upregulation of drpr 2 days after wing injury in our (unpublished) RNAseq results either.

      *Unfortunately, immunostaining of the adult wing is not feasible because antibodies do not penetrate the thick chitin-based cuticle and wax cover of the wing.*

      R3/4. „The authors claim autophagy contributes to glial reactive states in part by acting on JAK-STAT pathway via regulation of Stat92E. They did not investigate other potential STAT92E targets. Does Atg16 knockdown alter STAT92E expression? Apparently Vir1 is still upregulated in the absence of Atg16 following injury, but they don’t show STAT92E changes.”

      Response:

      We did investigate other potential STAT92E targets besides vir-1. This is referred to in the text as „*immunity-related gene reporters” and it again can be found in the Supplemental Material (____Supplementary Table 2). None of these genes showed glia-specific upregulation following injury. *

      We will investigate STAT92E expression with the STAT92E::GFP::FLAG protein-protein fusion transgene after disrupting autophagy as also suggested by Reviewer 2. Please see our detailed answer to the first comment of Reviewer 2.

      *We do not agree with the comment that „Vir1 is still upregulated in the absence of Atg16 following injury” because Fig. 3F,G show that lack of Atg16 abolishes the upregulation of the vir-1 reporter: the change from uninjured to injured becomes statistically not significant and the mean GFP intensities are practically identical. *

      R3/5. „The authors claim Su(var)2-10 is an autophagic cargo. They should better characterize Su(var)2-10 degradation and its regulation, and image quality needs to be improved (better images, merged examples, and clearer indication of what they are highlighting. There are many arrows in figures that I don't know what they are pointing to. Much of the labeling in Fig 1 (and others) looks like axons. Could TRE-GFP be turned on in neurons? How did they discriminate?”

      Response:

      As also explained to Reviewer 1’s last comment, we will carry out experiments to address whether SUMOylated Su(var)2-10 binds Atg8a, which can provide evidence for a direct SUMO-dependent autophagic elimination of Su(var)2-10. Please see our detailed response there.

      We will further improve image quality for brain images and we already incorporated new images in Fig. S6. *Merged images were missing only in Fig 5, which we have included in the current version. Arrows and arrowheads were used as described in Figure legends, but instead of those, we now clearly label the epithelium and we outlined the region of wing nerve glia in all images. *

      Please see our response to the first minor comment of Reviewer 1 regarding the expression of reporters in wing tissues.

      R3/6. „The authors claim interaction of Su(var)2-10 with Atg8a in the nucleus and cytoplasm can trigger autophagic breakdown, involving Su(var)2-10 SUMOylation. The paper would benefit from showing direct SUMOylation of Su(var)2-10 after injury. Is there any way to examine this in vivo?”

      Response:

      We will test direct SUMOylation of Su(var)2-10 using a recently described method by Andreev et al., 202233. FLAG-GFP-Smt3 (SUMO)____ is expressed under SUMO transcriptional regulation and we will immunoprecipitate FLAG-GFP-SUMO and GFP alone as negative control with GFPTrap beads from lysates of heads subjected to traumatic brain injury that results in glial reactivity16____, and also from uninjured head lysates. We will use anti-____Su(var)2-10 ____western blotting to visualize SUMOylated Su(var)2-10 and whether its levels are modulated by brain injury.

      R3/7. „The authors state in discussion "we find that draper is highly expressed in wing nerve glia already in uninjured conditions and it is not further induced by wing transection - indicating high phagocytic capacity in wing glia ... axon debris clearance takes substantially longer in the wing nerve than in antennal lobe glomeruli, thus draper levels may not readily predict actual phagocytic activity in glia". However, they never actually assess this in their experiments. All the conclusions about Draper are made from promoter fusions of integrated reporters, which are imperfect. This conclusion cannot be made.”

      Response:

      As described in our response to R3/3, we will further test drpr expression changes after wing injury using two independent methods: qPCR and western blot .

      We deleted this part from the Discussion that were criticized by the reviewer because these are not important for the main message of our manuscript.

      R3/8. „Both STAT92E and Jun are activated by a stress response. Could this be a stress response to disrupting autophagy that is somehow enhance by injury?”

      Response:

      *Stress responses are indeed relayed by AP-1 and Stat signaling, and impaired autophagy could be a source of stress. We would like to emphasize, though, that the main finding of our manuscript is that disrupting autophagy suppresses Stat-dependent transcription. Autophagy inhibition does not increase Stat signaling in uninjured wing nerves and while control flies upregulate Stat activity upon injury, autophagy-deficient animals fail to do so (Fig. 1). Thus, Stat signaling is not activated by loss of autophagy – it is activated by injury (that is the stress) and Stat activation requires autophagy in this setting.*

      R3/9. „Minor:

      I don't think that "glially" is a word.”

      Response:

      Online dictionaries such as Wiktionary list glially as a word, and many scientific articles use it: https://doi.org/10.1016/j.conb.2022.102653, https://doi.org/10.1016/j.yexcr.2013.08.016,https://doi.org/10.1016/j.jpain.2006.04.001*, to give some examples. *

      We nonetheless refrain from using it in the updated text.

      References

      1. Neukomm, L.J., Burdett, T.C., Gonzalez, M.A., Züchner, S., and Freeman, M.R. (2014). Rapid in vivo forward genetic approach for identifying axon death genes in Drosophila. Proc National Acad Sci 111, 9965–9970. https://doi.org/10.1073/pnas.1406230111.
      2. Giangrande, A., Murray, M.A., and Palka, J. (1993). Development and organization of glial cells in the peripheral nervous system of Drosophila melanogaster. Development 117, 895–904. https://doi.org/10.1242/dev.117.3.895.
      3. Stork, T., Engelen, D., Krudewig, A., Silies, M., Bainton, R.J., and Klämbt, C. (2008). Organization and Function of the Blood–Brain Barrier in Drosophila. J. Neurosci. 28, 587–597. https://doi.org/10.1523/jneurosci.4367-07.2008.
      4. Figueras-Novoa, C., Timimi, L., Marcassa, E., Ulferts, R., and Beale, R. (2024). Conjugation of ATG8s to single membranes at a glance. J. Cell Sci. 137, jcs261031. https://doi.org/10.1242/jcs.261031.
      5. Galluzzi, L., and Green, D.R. (2019). Autophagy-Independent Functions of the Autophagy Machinery. Cell 177, 1682–1699. https://doi.org/10.1016/j.cell.2019.05.026.
      6. Nieto-Torres, J.L., Leidal, A.M., Debnath, J., and Hansen, M. (2021). Beyond Autophagy: The Expanding Roles of ATG8 Proteins. Trends Biochem Sci 46, 673–686. https://doi.org/10.1016/j.tibs.2021.01.004.
      7. Huang, Z., Kingsolver, M.B., Avadhanula, V., and Hardy, R.W. (2013). An Antiviral Role for Antimicrobial Peptides during the Arthropod Response to Alphavirus Replication. J. Virol. 87, 4272–4280. https://doi.org/10.1128/jvi.03360-12.
      8. Purice, M.D., Ray, A., Münzel, E.J., Pope, B.J., Park, D.J., Speese, S.D., and Logan, M.A. (2017). A novel Drosophila injury model reveals severed axons are cleared through a Draper/MMP-1 signaling cascade. Elife 6, e23611. https://doi.org/10.7554/elife.23611.
      9. Alphen, B. van, Stewart, S., Iwanaszko, M., Xu, F., Li, K., Rozenfeld, S., Ramakrishnan, A., Itoh, T.Q., Sisobhan, S., Qin, Z., et al. (2022). Glial immune-related pathways mediate effects of closed head traumatic brain injury on behavior and lethality in Drosophila. Plos Biol 20, e3001456. https://doi.org/10.1371/journal.pbio.3001456.
      10. MacDonald, J.M., Beach, M.G., Porpiglia, E., Sheehan, A.E., Watts, R.J., and Freeman, M.R. (2006). The Drosophila Cell Corpse Engulfment Receptor Draper Mediates Glial Clearance of Severed Axons. Neuron 50, 869–881. https://doi.org/10.1016/j.neuron.2006.04.028.
      11. Bence, M., Jankovics, F., Kristó, I., Gyetvai, Á., Vértessy, B.G., and Erdélyi, M. (2024). Direct interaction of Su(var)2‐10 via the SIM‐binding site of the Piwi protein is required for transposon silencing in Drosophila melanogaster. FEBS J. 291, 1759–1779. https://doi.org/10.1111/febs.17073.
      12. Betz, A., Lampen, N., Martinek, S., Young, M.W., and Darnell, J.E. (2001). A Drosophila PIAS homologue negatively regulates stat92E. Proc. Natl. Acad. Sci. 98, 9563–9568. https://doi.org/10.1073/pnas.171302098.
      13. Liu, B., Liao, J., Rao, X., Kushner, S.A., Chung, C.D., Chang, D.D., and Shuai, K. (1998). Inhibition of Stat1-mediated gene activation by PIAS1. Proc. Natl. Acad. Sci. 95, 10626–10631. https://doi.org/10.1073/pnas.95.18.10626.
      14. Bach, E.A., Ekas, L.A., Ayala-Camargo, A., Flaherty, M.S., Lee, H., Perrimon, N., and Baeg, G.-H. (2007). GFP reporters detect the activation of the Drosophila JAK/STAT pathway in vivo. Gene Expr Patterns 7, 323–331. https://doi.org/10.1016/j.modgep.2006.08.003.
      15. Hu, X., li, J., Fu, M., Zhao, X., and Wang, W. (2021). The JAK/STAT signaling pathway: from bench to clinic. Signal Transduct. Target. Ther. 6, 402. https://doi.org/10.1038/s41392-021-00791-1.
      16. Szabó, Á., Vincze, V., Chhatre, A.S., Jipa, A., Bognár, S., Varga, K.E., Banik, P., Harmatos-Ürmösi, A., Neukomm, L.J., and Juhász, G. (2023). LC3-associated phagocytosis promotes glial degradation of axon debris after injury in Drosophila models. Nat. Commun. 14, 3077. https://doi.org/10.1038/s41467-023-38755-4.
      17. Goodall, E.A., Kraus, F., and Harper, J.W. (2022). Mechanisms underlying ubiquitin-driven selective mitochondrial and bacterial autophagy. Mol. Cell 82, 1501–1513. https://doi.org/10.1016/j.molcel.2022.03.012.
      18. Zhang, T., Yang, H., Zhou, Z., Bai, Y., Wang, J., and Wang, W. (2022). Crosstalk between SUMOylation and ubiquitylation controls DNA end resection by maintaining MRE11 homeostasis on chromatin. Nat. Commun. 13, 5133. https://doi.org/10.1038/s41467-022-32920-x.
      19. Chen, Z., Zhang, Y., Guan, Q., Zhang, H., Luo, J., Li, J., Wei, W., Xu, X., Liao, L., Wong, J., et al. (2021). Linking nuclear matrix–localized PIAS1 to chromatin SUMOylation via direct binding of histones H3 and H2A.Z. J. Biol. Chem. 297, 101200. https://doi.org/10.1016/j.jbc.2021.101200.
      20. Brown, J.R., Conn, K.L., Wasson, P., Charman, M., Tong, L., Grant, K., McFarlane, S., and Boutell, C. (2016). SUMO Ligase Protein Inhibitor of Activated STAT1 (PIAS1) Is a Constituent Promyelocytic Leukemia Nuclear Body Protein That Contributes to the Intrinsic Antiviral Immune Response to Herpes Simplex Virus 1. J. Virol. 90, 5939–5952. https://doi.org/10.1128/jvi.00426-16.
      21. Gutierrez-Morton, E., and Wang, Y. (2024). The role of SUMOylation in biomolecular condensate dynamics and protein localization. Cell Insight 3, 100199. https://doi.org/10.1016/j.cellin.2024.100199.
      22. Jacomin, A.-C., Petridi, S., Monaco, M.D., Bhujabal, Z., Jain, A., Mulakkal, N.C., Palara, A., Powell, E.L., Chung, B., Zampronio, C., et al. (2020). Regulation of Expression of Autophagy Genes by Atg8a-Interacting Partners Sequoia, YL-1, and Sir2 in Drosophila. Cell Reports 31, 107695. https://doi.org/10.1016/j.celrep.2020.107695.
      23. Maimon, I., Popliker, M., and Gilboa, L. (2014). Without children is required for Stat-mediated zfh1 transcription and for germline stem cell differentiation. Development 141, 2602–2610. https://doi.org/10.1242/dev.109611.
      24. Ninova, M., Chen, Y.-C.A., Godneeva, B., Rogers, A.K., Luo, Y., Tóth, K.F., and Aravin, A.A. (2020). Su(var)2-10 and the SUMO Pathway Link piRNA-Guided Target Recognition to Chromatin Silencing. Mol. Cell 77, 556-570.e6. https://doi.org/10.1016/j.molcel.2019.11.012.
      25. Pircs, K., Nagy, P., Varga, A., Venkei, Z., Erdi, B., Hegedus, K., and Juhasz, G. (2012). Advantages and Limitations of Different p62-Based Assays for Estimating Autophagic Activity in Drosophila. PLoS ONE 7, e44214. https://doi.org/10.1371/journal.pone.0044214.
      26. Fletcher, K., Ulferts, R., Jacquin, E., Veith, T., Gammoh, N., Arasteh, J.M., Mayer, U., Carding, S.R., Wileman, T., Beale, R., et al. (2018). The WD40 domain of ATG16L1 is required for its non‐canonical role in lipidation of LC3 at single membranes. EMBO J 37, e97840. https://doi.org/10.15252/embj.201797840.
      27. Rai, S., Arasteh, M., Jefferson, M., Pearson, T., Wang, Y., Zhang, W., Bicsak, B., Divekar, D., Powell, P.P., Nauman, R., et al. (2018). The ATG5-binding and coiled coil domains of ATG16L1 maintain autophagy and tissue homeostasis in mice independently of the WD domain required for LC3-associated phagocytosis. Autophagy 15, 1–14. https://doi.org/10.1080/15548627.2018.1534507.
      28. Doherty, J., Sheehan, A.E., Bradshaw, R., Fox, A.N., Lu, T.-Y., and Freeman, M.R. (2014). PI3K Signaling and Stat92E Converge to Modulate Glial Responsiveness to Axonal Injury. PLoS Biol 12, e1001985. https://doi.org/10.1371/journal.pbio.1001985.
      29. Logan, M.A., Hackett, R., Doherty, J., Sheehan, A., Speese, S.D., and Freeman, M.R. (2012). Negative regulation of glial engulfment activity by Draper terminates glial responses to axon injury. Nat. Neurosci. 15, 722–730. https://doi.org/10.1038/nn.3066.
      30. MacDonald, J.M., Doherty, J., Hackett, R., and Freeman, M.R. (2013). The c-Jun kinase signaling cascade promotes glial engulfment activity through activation of draper and phagocytic function. Cell Death Differ 20, 1140–1148. https://doi.org/10.1038/cdd.2013.30.
      31. Freeman, M.R., Delrow, J., Kim, J., Johnson, E., and Doe, C.Q. (2003). Unwrapping Glial Biology Gcm Target Genes Regulating Glial Development, Diversification, and Function. Neuron 38, 567–580. https://doi.org/10.1016/s0896-6273(03)00289-7.
      32. Dostert, C., Jouanguy, E., Irving, P., Troxler, L., Galiana-Arnoux, D., Hetru, C., Hoffmann, J.A., and Imler, J.-L. (2005). The Jak-STAT signaling pathway is required but not sufficient for the antiviral response of drosophila. Nat. Immunol. 6, 946–953. https://doi.org/10.1038/ni1237.
      33. Andreev, V.I., Yu, C., Wang, J., Schnabl, J., Tirian, L., Gehre, M., Handler, D., Duchek, P., Novatchkova, M., Baumgartner, L., et al. (2022). Panoramix SUMOylation on chromatin connects the piRNA pathway to the cellular heterochromatin machinery. Nat. Struct. Mol. Biol. 29, 130–142. https://doi.org/10.1038/s41594-022-00721-x.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study the authors explore a potential role for STAT92E and Su(var)2-10 in glial responses to injury in the adult Drosophila wing. The major claims are that canonical autophagy and not LAP sustains STAT92E signaling after in jury. The negative regulator STAT92E is removed by selective autophagy, but this is not ref(2)p/p62 (perhaps). Glial draper expression is not upregulated and Draper accumulation is not apparently a prerequisite for efficient debris clearance in the wing. Su(var)2-10 is an autophagic cargo, mediator of STAT92E-dependennt transcription; and interacts with Atg8a, perhaps sumoylating targets. In general, the model is reasonable, but the data do not support the conclusions, and the quality of the data needs improvement before firm conclusions can be reached. Concerns include:

      1. The claim that the negative regulator of Stat92E signaling is removed by selective autophagy, involving selective autophagy receptors different from/in addition to Ref(2)P/p62 is not convincingly shown. This claim probably needs to be softened.
      2. The reporter that was used (10xSTAT92E-eGFP) is not a dynamic reporter of STAT92E activity. It accumulates in glia and is highly stable. The appropriate reporter to look at dynamic changes would be 10XSTAT92E-dGFP, which has a degradable (unstable) GFP that is required to see dynamic changes even in the CNS. All of the claims about STAT92E regulation use this reporter, so they are questionable.
      3. The claim that glial drpr is not upregulated by wing injury and drpr accumulation is not apparently a prerequisite for efficient debris processing within the wing is weak. First, they did not stain for Draper using antibodies, rather they used expression constructs. Dee7 is a promoter that was found to be injury activated in the CNS (were they able to replicate that result? I did not receive the supplemental data), but it might not be the crucial regulator in the periphery. The MIMIC line that was converted is better, but might not represent the full spectrum of regulatory events at the draper locus. Finally, they never actually test for endogenous RNA changes, or use the antibody on westerns. Their lack of evidence is not as compelling as it could be.
      4. The authors claim autophagy contributes to glial reactive states in part by acting on JAK-STAT pathway via regulation of Stat92E. They did not investigate other potential STAT92E targets. Does Atg16 knockdown alter STAT92E expression? Apparently Vir1 is still upregulated in the absence of Atg16 following injury, but they don't show STAT92E changes.
      5. The authors claim Su(var)2-10 is an autophagic cargo. They should better characterize Su(var)2-10 degradation and its regulation, and image quality needs to be improved (better images, merged examples, and clearer indication of what they are highlighting. There are many arrows in figures that I don't know what they are pointing to. Much of the labeling in Fig 1 (and others) looks like axons. Could TRE-GFP be turned on in neurons? How did they discriminate?
      6. The authors claim interaction of Su(var)2-10 with Atg8a in the nucleus and cytoplasm can trigger autophagic breakdown, involving Su(var)2-10 SUMOylation. The paper would benefit from showing direct SUMOylation of Su(var)2-10 after injury. Is there any way to examine this in vivo? The authors state in discussion "we find that draper is highly expressed in wing nerve glia already in uninjured conditions and it is not further induced by wing transection - indicating high phagocytic capacity in wing glia ... axon debris clearance takes substantially longer in the wing nerve than in antennal lobe glomeruli, thus draper levels may not readily predict actual phagocytic activity in glia". However, they never actually assess this in their experiments. All the conclusions about Draper are made from promoter fusions of integrated reporters, which are imperfect. This conclusion cannot be made. Both STAT92E and Jun are activated by a stress response. Could this be a stress response to disrupting autophagy that is somehow enhance by injury?

      Minor:

      I don't think that "glially" is a word.

      Significance

      Based on the quality of the data, it is hard to consider this manuscript having made a major step forward. A significant amount of work needs to be done to firm up the conclusions. In its present form, the major contributions are the identification vir-1 as upregualted (maybe) and a potential role for autophagy.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Vincze et al. explores the regulatory mechanisms of Stat92E in glial reactivity following axonal injury. Utilizing a wing injury model in Drosophila, the study demonstrates the role of autophagy in regulating Stat92E expression in glia during injury. Through genetic and biochemical assays, the authors reveal that autophagy facilitates the degradation of Su(var)2-10, a negative regulator of Stat92E, thereby enabling the activation of this pathway. Overall, this study highlights a crucial role for autophagy in glial immunity during axonal injury.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      The working hypothesis is that upon injury, Su(var)2-10 is degraded by autophagy and, as a consequence, Stat92E induces vir-1 expression.<br /> Could the authors clarify why do Stat92E levels increase upon injury? Does Stat92E stability increase upon ATG mediated Su(var)2-10 degradation? Or does it expression/nuclear translocation change? Also, since Su(var) levels increase upon ATG RNAi, independently of injury, do ATG levels increase upon injury? It does not seem to be the case from Fig 6D, but then, if the ATG levels do not increase, how to explain the injury mediated effects of Su(var)2-10? Su(var)2-10 levels in control and injured wings are different between ATG18RNAi and ATG101 mutant (Fig 5). Could the authors explain the rational for using two ATG mutants? and the meaning of this difference? Also, why comparing data using the RNAi approach and a mutation? Fig 6 What is the relevance of the Atg8, Sumo and Su(var)2-10 colocalization at puncta, since there is a lot of colocalization outside the puncta and also lots of Su(var)2-10 or Atg8 labeling that does not colocalize? The statement made in the first sentence of the discussion is very strong: 'we have uncovered an activation mechanism for Stat92E', without sufficient supporting evidence. - Please request additional experiments only if they are essential for the conclusions. Alternatively, ask the authors to qualify their claims as preliminary or speculative, or to remove them altogether.

      Could the authors validate (some) expression data by in situ hybridization experiments? Could the authors validate the RNAi lines molecularly (or refer to published data on these lines? - If you have constructive further reaching suggestions that could significantly improve the study but would open new lines of investigations, please label them as "OPTIONAL". - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated time investment for substantial experiments.

      Clarifying the role of Su(var)2-10 on Stat92E would benefit to the presented work. Does Atg8-Su(var)2-10 binding affect Stat92E accumulation, expression, translocation to the nucleus? Some of these experiments could be obtained in S2 cell transfection assays, if too complex in vivo. Also, what happens to the axons in the mutant conditions described in the manuscript? This would higher the impact of the work, but would require in vivo work with fly stocks containing several transgenes. - Are the data and the methods presented in such a way that they can be reproduced?

      It has been published that Draper is involved in the response to injury in the adult wing nerve. See for example Neukomm et al (2014). The authors should discuss how this fits with their hypothesis and data. In this respect, Fig S4B, which should support the hypothesis, should be improved. It is rather hard to interpret it. - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Rubicon is also a negative regulator of autophagy (doi:10.1038/s41598-023-44203-6). in (Fig2 B, D) we have a higher GFP intensity in both uninjured and injured, and the difference between Injured/uninjured is less significant compared to control. It is possible that Rubicon KD causes more autophagy leading to a higher activation of Stat92E even in control. I wouldn't take the results as a proof of canonical autophagy implication and not LC3-associated phagocytosis The rationale for using both repoGal4 and repoGS is unclear. If, as mentioned, the goal is to avoid developmental defects, repoGS should be consistently used. Especially I don't understand how both were utilized to knock down the same genes, such as Atg16. In the third paragraph of the introduction, I am confused whether Stat92E regulates drpr of the reverse? - Are prior studies referenced appropriately?

      Published work should be acknowledged properly. I cannot find the evidence for vir-1 being expressed in glia and target of Gcm in the refences that have been cited.

      The presence of a Stat92E binding site on the vir-1 promoter has already bene described in the paper from Imler and collaborators, Nature immunology 2005. Actually, if this site is present in their transgenic line, it would help the authors strengthen the argument that Stat92E has a direct role on vir1 (for which they make a very strong statement in the discussion, with no direct evidence). - Are the text and figures clear and accurate? - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      In the Fig S2D, I do not see a lot of GFP+ (Glia) cells. I see more Atg8a in injured 3 dpi regardless of colocalization with glia. The quantification of the signals is made in a specific region of the wing, I guess throughout the nerve thickness. This could be represented more carefully in a schematic and It would also help defining colocalization in the first figure, by using a transverse section. A number of ATG genes are considered in the manuscript, but the rational for using them is not always clear. Showing a schematic would help clarify this. Fig 7 is not cited and its legend is very short.

      Clarify the color coding in Fig S1E

      What is the tandem tagged autophagic fly reporter in fig S2D?

      Add a schematic on the vir-1 isoforms.

      Fig S6B and Fig 5 relate on the levels of Su(var)2-10 upon Atg16 RNAi, but the scale is not the same, why?

      Significance

      • General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed? The manuscript by Vincze et al. investigates the regulatory mechanisms of Stat92E in glial reactivity following axonal injury. This research addresses a significant topic relevant to neuroinflammatory conditions in humans, such as neurodegenerative diseases. Utilizing a wing injury model in Drosophila, the study identifies a novel upstream regulatory mechanism of Stat92E. Specifically, after axonal injury, autophagy facilitates the degradation of Su(var)2-10, a negative regulator of Stat92E in glia. This process enables a non-canonical activation of the JAK-STAT pathway, leading to the induction of downstream target genes, such as Vir-1, highlighted in this study. Altogether, the manuscript advances our understanding of the glial response to neuronal damage, building on previous work by this group and others. Notably, it highlights progress in the role of both autophagy machinery and JAK-STAT pathway in this context.
      • Limitations and possible improvements: A more mechanistic analysis will higher the impact of the findings. Clarifying the role of Su(var)2-10 on STAT92E would benefit to the presented work. Does Atg8-Su(var)2-10 binding affect STAT92E accumulation, expression, translocation to the nucleus? Also, what happens to the axons in the mutant conditions described in the manuscript?
      • Advance: compare the study to the closest related results in the literature or highlight results reported for the first time to your knowledge; does the study extend the knowledge in the field and in which way? Describe the nature of the advance and the resulting insights (for example: conceptual, technical, clinical, mechanistic, functional,...). The work provides a conceptual advance in the field by assessing the role of ATG genes and a novel pathway linked to STAT and vir-1.
      • Audience: describe the type of audience ("specialized", "broad", "basic research", "translational/clinical", etc...) that will be interested or influenced by this research; how will this research be used by others; will it be of interest beyond the specific field? A broad audience working on neurodegeneration will be interested in the described work.
      • Please define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. Neural development and the transcriptional mechanisms involved to the process.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Regulation of immune pathway responses in glia is critical after nervous system injuries. The authors use the nerve fibers in the Drosophila wing as a model system to further analyze molecular mechanisms of glial responses with a focus on the regulation of the STAT92E protein, the single STAT family protein in Drosophila, which in glial injury responses has previously been shown to be independent of the canonical Domeless receptor / JAK kinase pathway.

      Here, the authors show convincingly that STAT92E activation depends on the selective autophagic degradation of the SUMOligase Su(var)2-10/PIAS in the absence of elevated bulk autophagy. IF and IP experiments indicated that direct or indirect interactions with Atg8 may drive this selective autophagy of Su(var)2-10/PIAS and that its SUMOylation activity appears to promote its degradation.

      Further observations show that STAT92E in this context does not result in elevated expression of the glial phagocytic Draper receptor and instead yields elevated vir-1 expression with unknown consequences for neuronal health.

      All key conclusions in the paper are well supported by experimental evidence and careful quantification.

      Major comments:

      Figure 6E seems to indicate that a subset of Su(var)2-10/PIAS isoforms may bind to ATG8 (directly or indirectly). This leads to the straightforward prediction that this subset should be differentially affected by the selective autophagy at the center of the manuscript. That could be tested to strengthen that point.

      Minor comments:

      • in Fig S1B,C the colocalization between GFP reporters for STAT92E and AP-1 activity and glia marker does not seem convincing, indicating other cell types may be expressing them as well.
      • p.7 Instead of "Su(var)2-10 is mainly nuclear due to its transcriptional repressor and chromatin organizer functions" It may be better to say" .. .consistent with its transcriptional repressor and chromatin organizer functions"
      • It is not clear whether the differences in Su(var)2-10/PIAS accumulation between Atg16 and Atg101 RNAi indicate functional differences of blocking autophagy at different stages or simply differences in RNAi efficiency (Atg16) versus the Atg101 mutant.

      Significance

      The manuscript convincingly shows that autophagic degradation is an important component of the regulation of STAT92E, an important transcriptions factor for glial responses to nerve injuries. That is a novel observation that will be of interest to experts in the field of autophagy and its roles in brain homeostasis.

      In addition. some other interesting initial observations are reported, but without much follow up that could have significantly strengthened the paper:

      • STAT92E-dependent glial upregulation of vir-1, but not Draper, is shown, but consequences for glial functions in nerve injury are not tested.
      • experiments indicate a role for Su(var)2-10/PIAS SUMOylation activity in tis autophagic degradation, but it is not clear whether the critical substrata Su(var)2-10/PIAS itself or another protein.
      • binding of Su(var)2-10/PIAS to ATG8 is indicated, but no in vitro experiment performed to test whether this is direct and perhaps SUMOylation dependent.
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer # 1: The study is well-executed, and the claims are supported by appropriate experiments. As introduced by the authors in their introduction, ubiquitin-dependent endocytosis of AA transporters has been previously shown in S. cerevisiae and TXNIP has previously been identified as a regulator of glucose uptake by promoting endocytosis of GLUT1 and GLUT4. Here, the authors identify the molecular mechanism by which TXNIP promotes the endocytosis, and degradation of amino acid transporters (SLC7A5-SLC7A3) through its interaction with HECT-type ubiquitin ligases. This is an advance in the field and will be of interest for researchers in the fields of quiescence, metabolism and cell biology. Experiments are well designed and important controls have been performed. Overall, the claims and the conclusions are supported by the data.* *

      Response: We thank the reviewer for the thorough evaluation of our manuscript and for the insightful, constructive comments. Reviewer 1 had five minor comments, and we have addressed them all.

      Minor comment 1: The authors should indicate how often western blot experiments were repeated with similar results. Ideally band quantification (as in Fig. 2b) for the most relevant proteins should be provided for all shown Western blots.* *

      Response: Each Western Blot (WB) experiment has been performed at least 3 times and each WB result for SLC7A5 is complemented by immunofluorescence and/or additionally by FACS analysis, across the manuscript.

      In the partially revised version of the manuscript, we already__ incorporated WB quantifications of SLC7A5 protein levels__ for Figures 1c, f, h, Figure 3b, Figure 4b, and Figure 5a, c in Supplementary Figure 1b, Supplementary Figure 2c, f, Supplementary Figure 4a, e, and in Supplementary Figure 5a, c, respectively.

      Minor comment 2: For confocal images no n number of experiments/analyzed cells is stated. Often only 2-3 cells are shown in these images. In some figures, conclusions from these confocal images are additionally supported by cell surface FACS.

      Response: Each immunofluorescence experiment has been performed at least 3 times.

      Minor ____comment 3: For panels with missing cell surface FACS quantifications, the authors should consider using the existing imaging data to perform quantifications of the membrane signal. In this way the reader can get the right impression of the reproducibility of the phenotype described.* *

      Response: Each immunofluorescence experiment has been performed at least 3 times. In the partially revised version of the manuscript, line-scan quantification of immunofluorescence (IF) of SLC3A2 at the plasma membrane (PM) is now provided for immunofluorescence experiments in Figure 1e, g, Figure 3c, e in Supplementary Figure 2b, e, Supplementary Figure 4b, c, and for SLC2A1 in Supplementary Figure 3i, were FACS data was missing. In addition, WB experiments complement the results of each IF experiment.

      Minor comment 4: I appreciate that the authors have also investigated SLC2A1 endocytosis in their experimental setup. Interestingly, they found that TXNIP mediated downregulation of SLC7A5-SLC3A2 was not linked to TXNIP mediated SLC2A1 endocytosis. Since the role of TXNIP in glucose metabolism has been studied in more detail in the past, it would be interesting if the authors could further comment on the differences/similarities in the molecular mechanism of glucose and AA transporter downregulation in the discussion.* *

      Response: Thank you for bringing up this point. We now have added the following paragraph to the discussion to speculate about the differences/similarities in the molecular mechanism of glucose and AA transporter downregulation in the discussion:

      ‘Moreover, in RPE1 cells entering quiescence, GLUT1/4 was not downregulated. Hence, it seems that TXNIP can discriminate, in a context dependent manner, between targeting SLC7A5-SLC3A2 or GLUT1/4 for endocytosis. Since AKT mediated phosphorylation invariably appeared to inactivate TXNIP, and dephosphorylation re-activated it, additional mechanism must confer TXNIP selectivity towards SLC7A5-SLC3A2 or GLUT1/4. We consider it likely, that the exposure of sorting motifs in cytosolic tails of SLC7A5 or GLUT1/4 could regulate the binding of activated TXNIP and thus controls selective endocytosis to adapt nutrient uptake. The exposure of these sorting motifs could be dependent on the metabolic context / state of the cell. Indeed, yeast a-arrestins can detect n- or c-terminal acidic sorting motifs in amino acid transporters, respectively, that are alternatively exposed in response to amino acid excess or starvation (Ivashov et al., 2020a) (Guiney et al, 2016). Inspection of the SLC7A5 sequence indicates a possible n-terminal acidic sorting motif (17EEKEEAREK25). Two lysine residues (K19, K25) in this sequence have been found to be ubiquitinated in an earlier study upon protein kinase C (PKC) activation and mTORC1 inhibition (Barthelemy & Andre, 2019; Rosario et al, 2016).’

      Minor ____comment 5: I would recommend a colour blind-friendly colour palette for the confocal images* *

      Response: Thank you for pointing this out – we have changed the color palette accordingly.

      Reviewer # 2: This study establishes TXNIP as a regulator of LAT1 endocytosis and metabolic homeostasis in quiescence. The integration of KO models and a TXNIP-deficient patient strengthens the findings, though clinical characterization remains underdeveloped relative to the mechanism reported, and biochemical interactions require endogenous validation. The work expands our understanding of TXNIP beyond association studies, positioning it as a key player in nutrient sensing and metabolic regulation. Addressing the concerns will enhance its relevance across fields - particularly metabolism, cell biology, and disease research. Overall, this is a very interesting study indeed. The use of TXNIP knockout models and a loss-of-function patient variant strengthens the conclusion that TXNIP is required for LAT1 degradation. The functional consequences of TXNIP deficiency (elevated intracellular aa, sustained mTORC1 activation, and accelerated quiescence exit) are well-supported by the data. The major concerns are as follows:

      Response: We thank the reviewer for the thorough evaluation of our manuscript and for the insightful, constructive comments. Reviewer 2 had three major concerns and one minor comment.

      Major concern 1. The identification of a biallelic TXNIP loss-of-function variant in a patient with metabolic disease and neurological dysfunction is highly significant. However, it is problematic that the manuscript effectively presents a case report but does not explicitly frame it as such, and the clinical details are very superficial (lack of pedigree, genetics, structured disease timeline, differential diagnosis, any histology/scans/photography and broader metabolic profiling - please see best practices for case reports). Although whole-exome sequencing identified the TXNIP variant, it remains unclear whether other genetic or metabolic contributors were systematically excluded. At first glance, the clinical discovery strengthens the physiological significance of the cell biology. However, a discrepancy remains between the clear neurological presentation of the patient (intellectual disability, autism and epilepsy) and the fibroblast-based TXNIP-LAT1 mechanism described in the study. Furthermore, the metabolic phenotype described in this manuscript is significantly more severe than that reported in a previous Swedish study of TXNIP deficiency in humans, where the clinical presentation was milder. This discrepancy suggests that different TXNIP mutations may lead to a spectrum of clinical outcomes, which is highly novel (i.e. metabolic and neurological in terms of loss of function, and carcinogenesis with respect to association studies, reviewed in PMID: 37794178). Of course, this could be influenced by mutation type, genetic background, compensatory mechanisms or environmental factors - it is noteworthy that the previous siblings had mitochondrial dysfunction, and this remains unknown in the present individual. Addressing this variability and discussing potential reasons for the pronounced phenotype observed in this patient would strengthen the manuscript overall. It is noteworthy that LAT1 is highly expressed in brain endothelial cells, which can also adopt a quiescent state (PMID: 33627876), and the authors should expand beyond the single sentence in their discussion. In the absence of the above details, the title and conclusions of Figure 3 and in the discussion greatly overstate causality, implying a direct relationship between TXNIP loss and metabolic dysfunction, despite data from only one patient. his may indeed be the case, but the claims should be carefully revised to reflect an association rather than definitive causation until additional patients are identified. Additionally, while it is assumed that the authors have obtained ethical approval and informed consent, this needs to be explicitly stated for transparency, with dedicated details in the methods sections. Addressing these issues will improve the rigor and mechanistic coherence of the study - otherwise it is quite disjointed.

      Response: We have addressed many these valid concerns and provide a detailed description of the patient in the partially revised manuscript (please see below).

      ‘The patient is a boy, born in 2014 as the first child of healthy, consanguineous parents of Turkish origin. During pregnancy, the mother was diagnosed with polyhydramnios. At 38 + 6 weeks of gestation, the baby was in a breech position, leading to a cesarean section. At birth, he weighed 3880 g (P90), measured 55 cm in length, and had a head circumference of 38 cm.

      On the seventh day of life, he exhibited floppiness, recurrent hypoglycaemia, and lactic acidosis, prompting his transfer from the birth hospital to a tertiary care centre. During the first three days there, his lowest recorded blood glucose level was 30 mg/dl, lactate levels were approximately 6.5 mmol/l, and pH was 7.11. Subsequently, he developed hypertriglyceridemia, with triglyceride levels reaching 364 mg/dl. Initially stable, he began experiencing elevated pCO2 levels (up to 70 mmHg due to bradypnea) and metabolic acidosis on day 10. A glucose infusion (10 mg/kg/min) stabilized his glucose and lactate concentrations, though lactate remained elevated at around 3-4 mmol/l. Regardless, his muscular hypotonia persisted. On day 12, a skin punch biopsy for a fibroblast culture was performed.

      By day 20, glucose and lactate levels had stabilized with regular feeding, allowing his transfer back to a peripheral hospital. During infancy, his blood glucose concentrations were within standard range (Supplementary Table 1), but the boy experienced recurrent hypoglycaemia in response to metabolic stress, e.g., infections. He exhibited psychomotor developmental delays and, from 18 months of age, experienced increasing epileptic seizures (up to 3-4 per month), which were managed with levetiracetam, topiramate, and lamotrigine. Currently, he remains metabolically stable but presents with significant developmental delay, muscular hypotonia, and autistic features.

      Whole-exome sequencing from peripheral blood of the patient detected a homozygous single nucleotide insertion c.642_643insT in exon 5 of 8 of the TXNIP gene. This variant was not recorded in the population genetic variant database gnomAD that lists TXNIP as likely haplosufficient (pLI = 0, LOEUF = 0,709: https://gnomad.broadinstitute.org accessed Sept. 10, 2024). No other (likely) pathogenic variant in any other gene, with known function in metabolism was identified as explanation of the clinical features in the child. Potential pathogenic variants in genes required for mitochondrial functions were also not detected, although they were initially expected to cause the phenotype of the boy.

      The TXNIP variant c.642_643insT caused a frameshift and a premature stop codon after 59 AA (denoted p.Ile215TyrfsTer59), likely causing nonsense-mediated decay (NMD) or the synthesis of a severely truncated TXNIP protein (Figure 3a). Both parents are healthy heterozygous carriers for the TXNIP variant. Serendipitously, this TXNIP variant was similar to the gene-edited version in the RPE1 TXNIPKO cells (p.I215TfsX11).

      The patient showed consistent metabolic alterations compatible with an AA transporter deficiency. Blood plasma concentrations of several large neutral amino acids (LNAAs, including L, I, V) were elevated throughout the years 2014 – 2022 (Supplementary Table 1). The increased molar ratio of the LNAAs (L, I, V) to aromatic AAs (F, Y), resulted in an elevated Fischer’s ratio (FR, 2014: FR = 4.46; 2016: FR = 5.38, 2018: FR = 5.90; 2021; FR= 6.98; 2022: FR = 4.23; FR reference range = 2.10 - 4). The methionine levels are not dramatically altered (Supplementary Table 1).’

      We also provide the following ethical statement:

      __‘Ethical statement __

      All patients’ data were extracted from the medical routine records. Written informed consent for molecular genetic studies and publication of data was obtained from the legal guardians of the patient. This approach was approved by the ethics committee of the Medical University of Innsbruck (UN4501-MUI). The study was conducted in accordance with the principles of the Declaration of Helsinki.’

      During the revision, we will additionally address how the other known TXNIP variant (TXNIP p.Gln58His; p.Gly59*; PMID: 30755400) affects nutrient transporter endocytosis. This TXNIP variant will be expressed in TXNIPKO RPE1 cells to analyze its effect on quiescence induced SLC7A5 downregulation. The results of this experiment will allow comparing directly the effect of both known TXNIP variants (p.Gln58His; p.Gly59* and p.Ile215TyrfsTer59) on SLC7A5 downregulation in an identical genetic background. In addition, we will compare how both TXNIP variants affect mitochondrial function (using Seahorse technology).

      Major concern 2. The authors report that TXNIP interacts with HECT E3 ligases to regulate substrate degradation, yet this conclusion is drawn from overexpression-based immunoprecipitation studies, which do not confirm interaction under endogenous conditions. Without direct evidence of TXNIP-HECT E3 binding at native expression levels, this mechanistic link remains unresolved. Given that the authors have already generated antibody-validated TXNIP KO models, endogenous validation should be feasible if the interactions are not super-transient.

      Response: While the manuscript was under review, we have improved the stringency of our TXNIP-HECT type ubiquitin ligase interaction experiments and developed additional biochemical experiments that strengthen our original conclusions. In the course of these experiments, we found that the interaction of TXNIP with NEDD4, WWP2 and HECW1/2 (but not with WWP1 or ITCH) were particularly dependent on the PPxY331 motif.

      During the revision, we will conduct additional experiments to substantiate these findings and to narrow down the list possible ubiquitin ligases that are required for the downregulation of SLC7A5. In particular, we will test if endogenous TXNIP co-immunoprecipitates (in a PPxY motif dependent manner) NEDD4, HECW1/2 or another HECT type ubiquitin ligase.

      Furthermore, we will include a newly developed ‘Bead-Immobilized Prey Assay (BIPA)’, were protein-protein interactions can be analyzed by microscopy in a fast in straight forward manner. In the BIPA, ALFA-TXNIP (or mutant variants) are first captured on ALFA-beads (Bead immobilized). These TXNIP beads are then incubated with cell lysates from HEK293 expressing GFP-tagged HECT type ubiquitin ligases (Prey). The binding of the GFP-tagged ubiquitin ligases to the TXNIP beads is analyzed by fluorescence microscopy and quantified (Figure 1b, a BIPA with YFP-NEDD4). This efficient assay will also be conducted with NEDD4, WWP1, WWP2, HECW2, and ITCH to analyze how they bind to TXNIP, TXNIP-PPxY331 and the PPxY double mutant.

      Together we are confident that our experiments establish that TXNIP must interact with a specific subset of HECT type ubiquitin ligase (our prime candidate are NEDD4 and HECW1/2) to trigger SLC7A5-SLC3A2 ubiquitination, endocytosis and lysosomal degradation.

      Major concern 3. What are the temporal dynamics of TXNIP-associated degradation, and is this process distinct from endosomal microautophagy (as reported in PMID: 30018090)? The authors present convincing, high-quality FACS-based data supporting TXNIP-mediated turnover. If this pathway is mechanistically separate from endosomal microautophagy, it suggests a hierarchy of degradation pathways leading to quiescence. Live cell imaging studies that define the temporal dynamics of this process using the tools the authors have created may reveal the relationship between these processes and refine the broader implications of TXNIP in homeostatic adaptation.

      Response: Thank you for this interesting suggestion.

      During the revision, we will first investigate a potential temporal correlation of endosomal micro-autophagy of p62/SQSTM1, NBR1, TAX1BP1, NDP52, and NCOA4 (PMID: 30018090) and the downregulation of SLC7A5 as cells enter quiescence. For these experiments, we will follow the turn-over of the above-mentioned autophagy adaptors and compare it to the turnover of SLC7A5, using either WB analysis, or microscopy or both.

      Next, we will test if SLC7A5-SLC3A2 endocytosis and lysosomal degradation is required to initiate endosomal micro-autophagy of p62/SQSTM1, NBR1, TAX1BP1, NDP52, and NCOA4 in TXNIPKO cells.

      Together, these experiments will address if the endosomal micro-autophagy and TXNIP mediated downregulation of SLC7A5 are mechanistically linked during entry into quiescence.

      Minor comment 1. In the discussion, the authors might briefly speculate on the implications of any functional redundancy that might exist between other arrestins.

      We will provide this information in the fully revised version of the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Cells entering quiescence must recalibrate metabolism to match lower energy demands, yet the role of endocytosis in this process remains poorly defined. In yeast, amino acid transporters undergo rapid endocytic degradation upon entry into quiescence, but whether a similar mechanism exists in human cells is unknown. Kahlhofer and colleagues demonstrate that human quiescent cells selectively degrade plasma membrane-resident amino acid (AA) transporters, particularly SLC7A5-SLC3A2 (LAT1-4F2hc) and SLC1A5 (ASCT2). TXNIP facilitates LAT1 endocytosis and lysosomal degradation, thereby limiting AA uptake and intracellular AA levels to attenuate mTORC1 signaling and protein translation. In TXNIP-deficient cells, LAT1 remains at the plasma membrane, leading to persistent AA uptake, sustained mTORC1 activation, and accelerated proliferation upon exiting quiescence. In proliferating cells, AKT phosphorylates TXNIP at Ser308, inactivating it and preventing LAT1 degradation, a process that is reversed upon entering quiescence. Notably, the authors identify a biallelic TXNIP loss-of-function variant in a patient with severe metabolic disease, recurrent hypoglycemia, and amino acid imbalances. Patient-derived fibroblasts exhibit defective LAT1 internalization, a phenotype that cannot be rescued by complementation with the pathogenic TXNIP variant, supporting an important role in disease pathology. Functionally, TXNIP-deficient cells have elevated AA levels that sustain mTORC1 activation, enhancing translation, and accelerate exit from quiescence. This study establishes TXNIP as a key regulator of amino acid transporter endocytosis in quiescent cells, linking metabolic adaptation, mTORC1 signaling, and cell cycle control through a previously unrecognized mechanism.

      Major comments

      Overall, this is a very interesting study indeed. The use of TXNIP knockout models and a loss-of-function patient variant strengthens the conclusion that TXNIP is required for LAT1 degradation. The functional consequences of TXNIP deficiency (elevated intracellular aa, sustained mTORC1 activation, and accelerated quiescence exit) are well-supported by the data. The major concerns are as follows:

      1. The identification of a biallelic TXNIP loss-of-function variant in a patient with metabolic disease and neurological dysfunction is highly significant. However, it is problematic that the manuscript effectively presents a case report but does not explicitly frame it as such, and the clinical details are very superficial (lack of pedigree, genetics, structured disease timeline, differential diagnosis, any histology/scans/photography and broader metabolic profiling - please see best practices for case reports). Although whole-exome sequencing identified the TXNIP variant, it remains unclear whether other genetic or metabolic contributors were systematically excluded. At first glance, the clinical discovery strengthens the physiological significance of the cell biology. However, a discrepancy remains between the clear neurological presentation of the patient (intellectual disability, autism and epilepsy) and the fibroblast-based TXNIP-LAT1 mechanism described in the study. Furthermore, the metabolic phenotype described in this manuscript is significantly more severe than that reported in a previous Swedish study of TXNIP deficiency in humans, where the clinical presentation was milder. This discrepancy suggests that different TXNIP mutations may lead to a spectrum of clinical outcomes, which is highly novel (i.e. metabolic and neurological in terms of loss of function, and carcinogenesis with respect to association studies, reviewed in PMID: 37794178). Of course, this could be influenced by mutation type, genetic background, compensatory mechanisms or environmental factors - it is noteworthy that the previous siblings had mitochondrial dysfunction, and this remains unknown in the present individual. Addressing this variability and discussing potential reasons for the pronounced phenotype observed in this patient would strengthen the manuscript overall. It is noteworthy that LAT1 is highly expressed in brain endothelial cells, which can also adopt a quiescent state (PMID: 33627876), and the authors should expand beyond the single sentence in their discussion. In the absence of the above details, the title and conclusions of Figure 3 and in the discussion greatly overstate causality, implying a direct relationship between TXNIP loss and metabolic dysfunction, despite data from only one patient. his may indeed be the case, but the claims should be carefully revised to reflect an association rather than definitive causation until additional patients are identified. Additionally, while it is assumed that the authors have obtained ethical approval and informed consent, this needs to be explicitly stated for transparency, with dedicated details in the methods sections. Addressing these issues will improve the rigor and mechanistic coherence of the study - otherwise it is quite disjointed.
      2. The authors report that TXNIP interacts with HECT E3 ligases to regulate substrate degradation, yet this conclusion is drawn from overexpression-based immunoprecipitation studies, which do not confirm interaction under endogenous conditions. Without direct evidence of TXNIP-HECT E3 binding at native expression levels, this mechanistic link remains unresolved. Given that the authors have already generated antibody-validated TXNIP KO models, endogenous validation should be feasible if the interactions are not super-transient.
      3. What are the temporal dynamics of TXNIP-associated degradation, and is this process distinct from endosomal microautophagy (as reported in PMID: 30018090)? The authors present convincing, high-quality FACS-based data supporting TXNIP-mediated turnover. If this pathway is mechanistically separate from endosomal microautophagy, it suggests a hierarchy of degradation pathways leading to quiescence. Live cell imaging studies that define the temporal dynamics of this process using the tools the authors have created may reveal the relationship between these processes and refine the broader implications of TXNIP in homeostatic adaptation.

      Minor comments

      In the discussion, the authors might briefly speculate on the implications of any functional redundancy that might exist between other arrestins.

      Significance

      This study establishes TXNIP as a regulator of LAT1 endocytosis and metabolic homeostasis in quiescence. The integration of KO models and a TXNIP-deficient patient strengthens the findings, though clinical characterization remains underdeveloped relative to the mechanism reported, and biochemical interactions require endogenous validation. The work expands our understanding of TXNIP beyond association studies, positioning it as a key player in nutrient sensing and metabolic regulation. Addressing the concerns will enhance its relevance across fields - particularly metabolism, cell biology, and disease research.

      Referees cross-commenting

      The comments raised by Reviewer #1 are reasonable, well-founded and align well with the concerns I have raised.

      Expertise: Organelle dynamics/degradation, metabolism, biochemistry, tissue homeostasis/disease.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In their study, Kahlhofer et al. investigate the mechanism by which cells downregulate amino acid (AA) uptake while entering quiescence. Using western blotting, immunohistochemistry and KO cell lines, the authors show that the α-arrestin family protein TXNIP acts as a regulator of specific membrane-resident AA transporters. They demonstrate that TXNIP promotes the endocytosis and degradation of SLC7A5-SLC7A3 in serum-starved cells as a result of reduced AKT signalling. They further show that the molecular mechanism involves a direct interaction between a PPCY motif in TXNIP and HECT-type ubiquitin ligases which promote AA transporter ubiquitination. Additionally, they identify a novel TXNIP loss-of-function in a patient and show that patient-derived fibroblasts fail to downregulate SLC7A5-SLC7A3 upon starvation. This dysregulation likely contributes to persistent alterations in serum AA levels observed in the patient.

      Experiments are well designed and important controls have been performed. Overall, the claims and the conclusions are supported by the data.

      Minor comments:

      Authors should indicate how often western blot experiments were repeated with similar results. Ideally band quantification (as in Fig. 2b) for the most relevant proteins should be provided for all shown Western blots.

      For confocal images no n number of experiments/analysed cells is stated. Often only 2-3 cells are shown in these images. In some figures, conclusions from these confocal images are additionally supported by cell surface FACS. For panels with missing cell surface FACS quantifications, the authors should consider using the existing imaging data to perform quantifications of the membrane signal. In this way the reader can get the right impression of the reproducibility of the phenotype described.

      I appreciate that the authors have also investigated SLC2A1 endocytosis in their experimental setup. Interestingly, they found that TXNIP mediated downregulation of SLC7A5-SLC3A2 was not linked to TXNIP mediated SLC2A1 endocytosis. Since the role of TXNIP in glucose metabolism has been studied in more detail in the past, it would be interesting if the authors could further comment on the differences/similarities in the molecular mechanism of glucose and AA transporter downregulation in the discussion.

      I would recommend a colour blind-friendly colour palette for the confocal images

      Significance

      The study is well-executed, and the claims are supported by appropriate experiments. As introduced by the authors in their introduction, ubiquitin-dependent endocytosis of AA transporters has been previously shown in S. cerevisiae and TXNIP has previously been identified as a regulator of glucose uptake by promoting endocytosis of GLUT1 and GLUT4. Here, the authors identify the molecular mechanism by which TXNIP promotes the endocytosis, and degradation of amino acid transporters (SLC7A5-SLC7A3) through its interaction with HECT-type ubiquitin ligases. This is an advance in the field and will be of interest for researchers in the fields of quiescence, metabolism and cell biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript authored by Kakui and colleagues aims to understand on how mitotic chromosomes get their characteristic, condensed X shape, which is functionally important to ensure faithful chromosome segregation and genome inheritance to both daughter cells. The authors focus on the condensin complex, a central player in chromosome condensation. They ask whether it condenses chromosomes through a now broadly popular "loop-extrusion" mechanism, in which a chromatin-bound condensin complex reels chromatin into loops until it dissociates or encounters a roadblock on the polymer (another condensin or some other protein complex), or through an alternative, "diffusion-capture" mechanism, in which a chromatin-bound condensin complex forms loops by encountering another chromatin-bound condensin until they dissociate from DNA (or from each other.) The authors measured the progressive changes in the shape of mitotic chromosomes by taking samples at given time points from synchronized and mitotically arrested cells and found that while all chromosomes become more condensed and shorter, their width correlated with the length of the chromosome arms. They also observed that chromosome compaction/shortening evolves on a time scale much longer than the interval between the onset of chromosome condensation and the start of chromosome segregation, suggesting that chromatin condensation does not reach its steady-state during an unperturbed mitosis. The observed width-length correlation could be described by a power law with an exponent that increases with the time (i.e. chromosome condensation). The authors also performed polymer simulations of the diffusion-capture mechanism and found that the simulations semi-quantitatively recapitulate their experimental observations. Major Comments My most substantial comments focus on somewhat technical details of the image analysis approaches taken and the polymer models employed. However, as all reported data are derived from those details, I feel it is crucial to address them. *

      We thank the reviewer for their suggestions on how to improve our image analysis and polymer modelling experiments. We are keen to develop both aspects of our manuscript with additional experiments as detailed below.

      1. * Definition/measurement of chromatin arms width and length. The approach taken to manually threshold an "arm" object and then fitting it with a same-area ellipse is not an ideal approach to gauge length and width of the arm, for the following reasons: (1) An ellipse appears to do a poor job approximating many of the objects that we see in the zoom-in insets of Fig.1. Importantly, for somewhat bent shapes we see in the insets it likely strongly underestimates the length of the arms; this approach also presents potential problems for measuring width as well (see 2 and 3 here). (2) One concern is that, due to the diffraction limit, a cylindrical fluorescent object could appear somewhat wider at the mid-length than the real underlying cylinder or the poles; this effect could become more pronounced as the object gets brighter and shorter. (3) Forcing the fit to an ellipse to objects that are not truly rod-shaped can drive an overestimation of the width of the object, and I suspect that this effect also might correlate with the length and brightness of the object. (4) Given 1-3 above, I think the approach the authors used for the first two time points, while not perfect, is better suited and likely more robust while avoiding these caveats. Moreover, why the authors cannot use this same approach (but just for each arm separately) for the later (30+ min) time points as they used for first two is unclear. This point is underscored by the observation that there is a drastic difference in the results between the first two and all subsequent points. When the authors compared the two approaches at the 30 min time point (where width-length dependence is still weak) in different cell lines they did indeed see different results (Fig. S2), although they concluded that the difference was acceptable. * While the manuscript was under review, we have developed an improved pipeline to measure chromosome widths. As suggested by the reviewer, this approach is based on the method used for the first two time points. An additional improvement allows us to take automated measurements along the entire chromosome arm length, instead of being restricted to straight segments. We propose to use the improved algorithm to repeat the measurements at later time points.

      * Along these lines, the difference between short and long arms for the chromosome in the insets of Fig.1 are quite subtle, except maybe at 180 and 240 min. On a related note, it might be informative to compare data for the two sister chromatid arms (as the underlying polymer has the same length) long vs long and short vs short and long vs short to help establish the robustness of the approach. *

      The chromosome arm width differences are clear and measurable. We will select insets that illustrate the arm width differences in a more representative way, and we will furthermore conduct the suggested analyses on subsets of chromosome arms to test the robustness of our approach.

      * Regarding the power-law distribution, it is hard to judge based on the presented data whether it is a really good description of the data or not. In Fig.1c, the points for a given time can barely be distinguished, while in Fig.1b the authors plot individual time points in the panels, but the fits and points are overlapping so much that it is challenging to the main trends described by the clouds. The most informative approach for the reader would be to provide confidence intervals of the best fit parameters for all parameters that were varied in the fit. As the authors make some conclusions based on the power-law exponent values they observed, it would be helpful to know how confident we are in those values. *

      Confidence intervals of the power law exponents will be provided.

      * The conclusion that short arms equilibrate faster based on Fig.3a is not fully convincing. For example, in a scenario where ~1.5 microns is the equilibrium length for all arms, and that the longest arms equilibrate the fastest - you would see the same qualitative pattern for quantiles, not much change in low percentiles, while you would observe a decrease in the values for the high percentiles. The authors might be right, but Fig. 3A does not unambiguously demonstrate that it is so based on this evidence alone. *

      Our reasoning is based on the observation that the shortest percentiles do not change or do not change rapidly after 30 minutes, while the longest percentiles are clearly still relaxing towards a steady state. We will repeat this analysis with the new measurements, obtained in response to point 1.

      * As for chromosome roundness, typically in image analysis, roundness is defined through the ratio of (perimeter)2/area; it might be better to use "aspect ratio" for the metrics used by the authors. And, perhaps, one should expect that shorter (measured, not necessarily by polymer contour length) arms should have a higher width/length ratio? If one selects for more round objects, there should be no surprise that the width and length get almost proportional. Given all of this, I am not sure whether width/aspect ratio serves as a good proxy for the chromatin condensation progression, which is how the authors are employing this data in the manuscript as written. *

      We thank the reviewer for alerting us to an alternatively used definition of ‘roundness’. We will consider this concern, with one solution being to use ‘width-length ratio’ in its place.

      * For the diffusion-capture model simulations, I think the results of the simulation would strongly depend on the assumptions of the probability to associate and the time scale of dissociation of the beads representing the condensin complex. For example, for a very strong association one might expect that all condensin will end up in one big condensate, even in the case of a long polymer. This is not explored/discussed at all. Did the authors optimize their model in any way? If not, how have they estimated the values they used? Moreover, perhaps this is an opportunity to learn/predict something about condensin properties, but the authors do not take advantage of this opportunity. *

      We in fact explored the consequences of altering diffusion capture on and off rates when we initially developed the loop capture simulations, and we will report on the robustness of our model to the probability of dissociation as part of our revisions.

      * In addition, the authors did some checks to show that the steady-state results of the simulations do not depend on the initial conditions. However, as some of the results reported concern the polymer evolution to the steady state (Fig.6b-c), they also need to examine whether these results depend on the chosen initial conditions (or not), and if they do, what is the rationale for the choices the authors have made? *

      The current manuscript contains a comparison of steady states reached after simulations were started from elongated or random walk initial states (see Supplementary Figure 4). We will provide better justification for the choice of a 4x elongated initial state, which approximates the initial state observed in vivo.

      * A more thorough discussion of other possible models, beyond diffusion-capture model considered here, would be beneficial to the reader. First, the authors practically discard the possibility of the loop-extrusion model to explain their observations (although they never explicitly state this in the abstract or discussion). However, they neither leveraged simulations to rigorously compare models nor included some other substantiated arguments to explain why they prefer their model. This is important, as one of the major findings here is that the chromatin never reaches steady state for condensation, making it challenging to intuit what one should expect in this very dynamic state. Second, the authors, while briefly mentioning that there might be some other mechanisms contributing to the mitotic chromosome reshaping, do not really discuss those possibilities in a scholarly way. For example, work by the Kleckner group has suggested an involvement of bridges between sister chromatids into their shortening dynamics (Chu et al. Mol Cell 2020). Third, the authors do not discuss how they envision the interplay between the different SMC complexes - cohesin, condensin I and condensin II - as they act on the same chromatin polymer, or at least acknowledge a possible role that this interplay might contribute to the observed time dependencies. The reviewer raises important points, which we are keen to explore by performing loop extrusion simulations, as well as in an expanded discussion section.

      Reviewer #1 (Significance (Required)):

      Significance: The question the authors are trying to address is fundamental and important. While loop extrusion-driven mitotic chromosome organization is a popular model, considering alternative models is always crucial, especially when one can find experimental observations that allow us to discriminate between possible models. The main limitations are: 1) the performance of the approach the authors take to measure chromosome shape is in question and 2) the main competitive model (loop extrusion) is not modeled. If all shortcomings are addressed this work may provide strong evidence for the diffusion-capture model and thus advance our mechanistic understanding of mitotic processes, which will be of broad interest to the fields of genome and chromosome biology. We are happy to hear that the reviewer agrees that our work ‘may provide strong evidence for the diffusion-capture model and thus advance our mechanistic understanding of mitotic processes’. See above for how we propose to address the two main limitations.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      SUMMARY The authors tracked the progression of mitotic chromosome compaction over time by imaging chromatin spreads from HeLa cells that were released from G2/M arrest. By measuring the mitotic chromosome arms' width and length at different times post-release, the authors demonstrated that the speed at which the chromosome arms reach an equilibrium state is dependent on their length. The authors were able to recapitulate this observation using polymer simulations that they previously developed, supporting the model of loop capture as the mechanism for mitotic chromosome compaction.

      MAIN COMMENTS This is a straightforward paper that supports an alternative mechanism (relative to the highly popular loop-extrusion) model for chromosome compaction. My comments are meant to help the manuscript reach a wider audience.

      I suggest that "equilibrium" be replaced with "equilibrium length" since it is the only equilibrium parameter of concern. *

      The reviewer is correct, and we will implement this change, also taking into account the reasoning of reviewer 3 that ‘steady state’ is a better term to describe a final shape that is maintained by an active process.*

      In the results, it may help to describe how loop capture and loop extrusion are incorporated into the simulations, using terminology that non-experts can understand. Such a description should be accompanied by figures that can be related to the other figures (color scheme, nomenclature if possible). *

      Following from the reviewer’s suggestion, we will provide schematics of the loop capture and loop extrusion mechanisms.*

      OTHER COMMENTS P5: Is it possible the chromosome-spread processing may distort the structures of the chromosomes? *

      We will compare chromosome dimension in live cells with those following spreading to investigate this possibility.*

      Please clarify whether mitosis can complete after drug removal at the various treatment intervals. *

      Drug treatment and removal is often used as an experimental tool. We will perform a control experiment to explore whether mitosis can indeed complete after drug removal under our experimental conditions.*

      P6: "Our records are not, therefore, meant as an accurate absolute measure of individual arms. Rather, fitting allows us to sample all chromosome arms and deduce overall trends of chromosome shape changes over time" It would be better to state this sentence earlier in this paragraph, or earlier in the section so that readers' expectations are curbed when they're reading the detailed analysis plan. *

      Note that we will employ an additional image analysis method, in response to comments from reviewer 1, which should lead to more reliable width measurements.*

      P6: "As soon as individual chromosome arms become discernible (30 minutes), longer chromosome arms were wider, a trend that became more pronounced as time progressed." Implies that at early time points, when the lengths of the arms were unknown, the longer arms were equal or narrower than the short arms. I think it's more accurate to say that as soon as the arms were resolved, the longer arms appeared wider. *

      We will adopt the reviewers’ more accurate wording.*

      P7: Is there a functional consequence to the long arms not equilibrating before anaphase onset? *

      The reviewer raises an interesting question, which we will explore in our revised discussion. One consequence of not reaching ‘steady state’ is that ‘time in mitosis’ becomes a key parameter that defines compaction at anaphase onset.*

      P13: "In a loop capture scenario, we can envision how condensin II sets up a coarse rosette architecture, with condensin I inserting a layer of finer-grained rosettes." This should be illustrated in a figure. *

      We will consider such a figure, though the roles of two condensin complexes is peripheral to our current study. Investigating the consequences of two distinct condensins for chromosome formation will provide fertile ground for future investigations. *

      FIGURES Fig. 1: "...while insets show chromosomes at increasing magnification over time" sounds like the microscope magnification is changing over time. Please change "magnification" to "enlargement". Alternatively, if the goal of the figure is to illustrate the shape/dimensions change of the chromosomes over time, wouldn't it be better to keep all the enlargements at the same scale? *

      During the revisions, we will explore whether to show the insets at the same magnification, or to adjust the wording as suggested by the reviewer.*

      Fig. 2a plot: Does the distribution of normalized intensities really justify a Gaussian fit? I see a double Gaussian. *

      The chosen example indeed resembles a double Gaussian. We will explore whether this is due to noise in the measurement and a poor choice of an example, or whether a double Gaussian fit is indeed merited.*

      Please label the structures that resemble "rosettes". Good idea, which we will implement.

      Lu Gan

      Reviewer #2 (Significance (Required)):

      General - This is a simulation-centric study of mammalian chromosome compaction that supports the loop-capture mechanism. It may be viewed as provocative by some readers because loop-extrusion has dominated the chromosome-compaction literature in the past decade. The only limitation, which is best addressed by future studies, is the absence of more direct molecular evidence of loop capture in situ. Though this same limitation applies to studies of the loop-extrusion mechanism.

      Advance - It is valuable for the field to consider alternative mechanisms. In my opinion, the dominant one has been studied to death by indirect methods without a direct molecular-resolution readout in situ. While the field awaits better experimental tools, more mechanisms should be explored.

      Audience - The chromosome-biology community (both bacterial and eukaryotic) will be interested.

      Expertise - My lab uses cryo-ET to study chromatin in situ.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Kakui et al. measured the length/width relationships of mitotic chromosomes in human cells that had entered mitosis for different durations. This simple measurement revealed very interesting behaviors of mitotic chromosomes. They found that the longer chromosome arms were wider than shorter ones. Mitotic chromosoms became progressively wider over time, with shorter ones reached the final state faster than the longer ones. They then built a loop-capture polymer model, which explained the time-dependent increase of width/length ration rather well, but did not quite explain the final roundness of chromosomes.

      I suggest the following points for the authors to consider.

      Major points (1) There is no experimental evidence that the loop capture mechanism is condensin-depdendent. Can the authors deplete condensin I or II or both and measure chromosome length and width in similar assays? This will link their models to molecular players. *

      Such analyses have been conducted by others, and we will provide a brief survey with relevant references to the literature in our revised introduction.*

      (2) It seems rather intuitive to me that if one defines the spacing the condensin-binding sites, then the loop sizes will be the same between shorter and longer chromosomes. It then follows that shorter chromosomes are rounder. Is it that simple? If not, can the authors provide a better explanation. *

      The reviewer makes an interesting point that roundness (width-length ratio), is greater for shorter chromosome arms, even if chromosome width is constant. We will make this clear in the revised manuscript.*

      (3) If the loop sizes are the same between shorter and longer chromosomes, why can't loop extrusion model explain this phenomenon? If one assumes that condensin is stopped by the same barrier element and has the same distrution at the loop base, this should produce the same outcome as loop capture. *

      The key feature of loop extrusion is the formation of a linear condensin backbone, resulting in a bottle brush-shaped chromosome. This arrangement prevents further equilibration of loops into a wider structure, as occurs in the loop capture mechanism by rosette rearrangements. These differences will be better explained, using a schematic, in the revised manuscript.*

      Minor points (1) "We are aware that this approximation underestimates the length of the longest chromosome arms and overestimates the length of the shortest arms." should be "We are aware that this approximation underestimates the length of the longer chromosome (q) arms and overestimates the length of the shorter (p) arms.". Right? *

      In fact, this comparison applies to all longer and shorter arms, not only pairs of p and q arms, which we will clarify.*

      (2) Some scientists argue that the final chromosome conformation might be kinetically driven. Even if the short chromosomes have reached the final roundness, this doesn't necessarily mean that they have reached equilibrium in cells. "Steady state" might be a better term to describe the chromosomes in vivo, as there are clearly energy-burning processes. *

      The reviewer is right that the term ‘equilibrium’ can be seen as misleading, which we will replace with ‘steady state’.*

      Reviewer #3 (Significance (Required)):

      I find the paper intellectually stimulating and a pleasure to read. It suggests a plausible explanation for mitotic chromosome formation. As such, it will be of great interest to scientists in the chromatin field.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      The take home message of this study is that chromosome structure can be attained through mechanisms of looping that do not require an explicit loop extrusion function. As the authors states, alternative models of loop capture have been proposed, dating from 2015-2016. THese models show DNA chains through simply Brownian diffusion can adopt a loop structure (citation 27, 28 and similarly Entropy gives rise to topologically associating domains Vasquez et al 2016 DOI: 10.1093/nar/gkw510).*

      The reviewer makes an excellent point in that entropy considerations, e.g. depletion attraction, likely contribute to the efficiency of loop capture. We will refer to this principle, including a citation to the Vasquez et al. study, in the revised manuscript.

      * In this study, the authors go through careful and well-documented chromosome length measurements through prophase and metaphase. The modeling studies clearly show that loop capture provides a tenable mechanism that accounts for the biological results. The results are clearly written and propose an important alternative narrative for the foundation of chromosome organization.

      Reviewer #4 (Significance (Required)):

      The study is important because it takes a reductionist approach using just Brownian motion and loop capture to ask how well the fundamental processes will recapitulate the biological outcome. The fact that loop capture can account for the arm length to width relationships on biological time scales is important to report to the community. The work is extremely well done and the analysis of chromosome features is thorough and well-documented.*

      • *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The take home message of this study is that chromosome structure can be attained through mechanisms of looping that do not require an explicit loop extrusion function. As the authors states, alternative models of loop capture have been proposed, dating from 2015-2016. THese models show DNA chains through simply Brownian diffusion can adopt a loop structure (citation 27, 28 and similarly Entropy gives rise to topologically associating domains Vasquez et al 2016 DOI: 10.1093/nar/gkw510).

      In this study, the authors go through careful and well-documented chromosome length measurements through prophase and metaphase. The modeling studies clearly show that loop capture provides a tenable mechanism that accounts for the biological results. The results are clearly written and propose an important alternative narrative for the foundation of chromosome organization.

      Significance

      The study is important because it takes a reductionist approach using just Brownian motion and loop capture to ask how well the fundamental processes will recapitulate the biological outcome. The fact that loop capture can account for the arm length to width relationships on biological time scales is important to report to the community.

      The work is extremely well done and the analysis of chromosome features is thorough and well-documented.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, Kakui et al. measured the length/width relationships of mitotic chromosomes in human cells that had entered mitosis for different durations. This simple measurement revealed very interesting behaviors of mitotic chromosomes. They found that the longer chromosome arms were wider than shorter ones. Mitotic chromosoms became progressively wider over time, with shorter ones reached the final state faster than the longer ones. They then built a loop-capture polymer model, which explained the time-dependent increase of width/length ration rather well, but did not quite explain the final roundness of chromosomes.

      I suggest the following points for the authors to consider.

      Major points

      1. There is no experimental evidence that the loop capture mechanism is condensin-depdendent. Can the authors deplete condensin I or II or both and measure chromosome length and width in similar assays? This will link their models to molecular players.
      2. It seems rather intuitive to me that if one defines the spacing the condensin-binding sites, then the loop sizes will be the same between shorter and longer chromosomes. It then follows that shorter chromosomes are rounder. Is it that simple? If not, can the authors provide a better explanation.
      3. If the loop sizes are the same between shorter and longer chromosomes, why can't loop extrusion model explain this phenomenon? If one assumes that condensin is stopped by the same barrier element and has the same distrution at the loop base, this should produce the same outcome as loop capture.

      Minor points

      1. "We are aware that this approximation underestimates the length of the longest chromosome arms and overestimates the length of the shortest arms." should be "We are aware that this approximation underestimates the length of the longer chromosome (q) arms and overestimates the length of the shorter (p) arms.". Right?
      2. Some scientists argue that the final chromosome conformation might be kinetically driven. Even if the short chromosomes have reached the final roundness, this doesn't necessarily mean that they have reached equilibrium in cells. "Steady state" might be a better term to describe the chromosomes in vivo, as there are clearly energy-burning processes.

      Significance

      I find the paper intellectually stimulating and a pleasure to read. It suggests a plausible explanation for mitotic chromosome formation. As such, it will be of great interest to scientists in the chromatin field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The authors tracked the progression of mitotic chromosome compaction over time by imaging chromatin spreads from HeLa cells that were released from G2/M arrest. By measuring the mitotic chromosome arms' width and length at different times post-release, the authors demonstrated that the speed at which the chromosome arms reach an equilibrium state is dependent on their length. The authors were able to recapitulate this observation using polymer simulations that they previously developed, supporting the model of loop capture as the mechanism for mitotic chromosome compaction.

      Main Comments

      This is a straightforward paper that supports an alternative mechanism (relative to the highly popular loop-extrusion) model for chromosome compaction. My comments are meant to help the manuscript reach a wider audience.

      I suggest that "equilibrium" be replaced with "equilibrium length" since it is the only equilibrium parameter of concern.

      In the results, it may help to describe how loop capture and loop extrusion are incorporated into the simulations, using terminology that non-experts can understand. Such a description should be accompanied by figures that can be related to the other figures (color scheme, nomenclature if possible).

      Other comments

      P5: Is it possible the chromosome-spread processing may distort the structures of the chromosomes?

      Please clarify whether mitosis can complete after drug removal at the various treatment intervals.

      P6: "Our records are not, therefore, meant as an accurate absolute measure of individual arms. Rather, fitting allows us to sample all chromosome arms and deduce overall trends of chromosome shape changes over time" It would be better to state this sentence earlier in this paragraph, or earlier in the section so that readers' expectations are curbed when they're reading the detailed analysis plan.

      P6: "As soon as individual chromosome arms become discernible (30 minutes), longer chromosome arms were wider, a trend that became more pronounced as time progressed." Implies that at early time points, when the lengths of the arms were unknown, the longer arms were equal or narrower than the short arms. I think it's more accurate to say that as soon as the arms were resolved, the longer arms appeared wider.

      P7: Is there a functional consequence to the long arms not equilibrating before anaphase onset?

      P13: "In a loop capture scenario, we can envision how condensin II sets up a coarse rosette architecture, with condensin I inserting a layer of finer-grained rosettes." This should be illustrated in a figure.

      Figures

      Fig. 1: "...while insets show chromosomes at increasing magnification over time" sounds like the microscope magnification is changing over time. Please change "magnification" to "enlargement". Alternatively, if the goal of the figure is to illustrate the shape/dimensions change of the chromosomes over time, wouldn't it be better to keep all the enlargements at the same scale?

      Fig. 2a plot: Does the distribution of normalized intensities really justify a Gaussian fit? I see a double Gaussian.

      Please label the structures that resemble "rosettes".

      Lu Gan

      Significance

      General This is a simulation-centric study of mammalian chromosome compaction that supports the loop-capture mechanism. It may be viewed as provocative by some readers because loop-extrusion has dominated the chromosome-compaction literature in the past decade. The only limitation, which is best addressed by future studies, is the absence of more direct molecular evidence of loop capture in situ. Though this same limitation applies to studies of the loop-extrusion mechanism.

      Advance It is valuable for the field to consider alternative mechanisms. In my opinion, the dominant one has been studied to death by indirect methods without a direct molecular-resolution readout in situ. While the field awaits better experimental tools, more mechanisms should be explored.

      Audience The chromosome-biology community (both bacterial and eukaryotic) will be interested.

      Expertise My lab uses cryo-ET to study chromatin in situ.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: This manuscript authored by Kakui and colleagues aims to understand on how mitotic chromosomes get their characteristic, condensed X shape, which is functionally important to ensure faithful chromosome segregation and genome inheritance to both daughter cells. The authors focus on the condensin complex, a central player in chromosome condensation. They ask whether it condenses chromosomes through a now broadly popular "loop-extrusion" mechanism, in which a chromatin-bound condensin complex reels chromatin into loops until it dissociates or encounters a roadblock on the polymer (another condensin or some other protein complex), or through an alternative, "diffusion-capture" mechanism, in which a chromatin-bound condensin complex forms loops by encountering another chromatin-bound condensin until they dissociate from DNA (or from each other.)

      The authors measured the progressive changes in the shape of mitotic chromosomes by taking samples at given time points from synchronized and mitotically arrested cells and found that while all chromosomes become more condensed and shorter, their width correlated with the length of the chromosome arms. They also observed that chromosome compaction/shortening evolves on a time scale much longer than the interval between the onset of chromosome condensation and the start of chromosome segregation, suggesting that chromatin condensation does not reach its steady-state during an unperturbed mitosis. The observed width-length correlation could be described by a power law with an exponent that increases with the time (i.e. chromosome condensation). The authors also performed polymer simulations of the diffusion-capture mechanism and found that the simulations semi-quantitatively recapitulate their experimental observations.

      Major Comments

      My most substantial comments focus on somewhat technical details of the image analysis approaches taken and the polymer models employed. However, as all reported data are derived from those details, I feel it is crucial to address them. 1. Definition/measurement of chromatin arms width and length. The approach taken to manually threshold an "arm" object and then fitting it with a same-area ellipse is not an ideal approach to gauge length and width of the arm, for the following reasons: (1) An ellipse appears to do a poor job approximating many of the objects that we see in the zoom-in insets of Fig.1. Importantly, for somewhat bent shapes we see in the insets it likely strongly underestimates the length of the arms; this approach also presents potential problems for measuring width as well (see 2 and 3 here). (2) One concern is that, due to the diffraction limit, a cylindrical fluorescent object could appear somewhat wider at the mid-length than the real underlying cylinder or the poles; this effect could become more pronounced as the object gets brighter and shorter. (3) Forcing the fit to an ellipse to objects that are not truly rod-shaped can drive an overestimation of the width of the object, and I suspect that this effect also might correlate with the length and brightness of the object. (4) Given 1-3 above, I think the approach the authors used for the first two time points, while not perfect, is better suited and likely more robust while avoiding these caveats. Moreover, why the authors cannot use this same approach (but just for each arm separately) for the later (30+ min) time points as they used for first two is unclear. This point is underscored by the observation that there is a drastic difference in the results between the first two and all subsequent points. When the authors compared the two approaches at the 30 min time point (where width-length dependence is still weak) in different cell lines they did indeed see different results (Fig. S2), although they concluded that the difference was acceptable. Along these lines, the difference between short and long arms for the chromosome in the insets of Fig.1 are quite subtle, except maybe at 180 and 240 min. On a related note, it might be informative to compare data for the two sister chromatid arms (as the underlying polymer has the same length) long vs long and short vs short and long vs short to help establish the robustness of the approach. 2. Regarding the power-law distribution, it is hard to judge based on the presented data whether it is a really good description of the data or not. In Fig.1c, the points for a given time can barely be distinguished, while in Fig.1b the authors plot individual time points in the panels, but the fits and points are overlapping so much that it is challenging to the main trends described by the clouds. The most informative approach for the reader would be to provide confidence intervals of the best fit parameters for all parameters that were varied in the fit. As the authors make some conclusions based on the power-law exponent values they observed, it would be helpful to know how confident we are in those values. 3. The conclusion that short arms equilibrate faster based on Fig.3a is not fully convincing. For example, in a scenario where ~1.5 microns is the equilibrium length for all arms, and that the longest arms equilibrate the fastest - you would see the same qualitative pattern for quantiles, not much change in low percentiles, while you would observe a decrease in the values for the high percentiles. The authors might be right, but Fig. 3A does not unambiguously demonstrate that it is so based on this evidence alone. 4. As for chromosome roundness, typically in image analysis, roundness is defined through the ratio of (perimeter)2/area; it might be better to use "aspect ratio" for the metrics used by the authors. And, perhaps, one should expect that shorter (measured, not necessarily by polymer contour length) arms should have a higher width/length ratio? If one selects for more round objects, there should be no surprise that the width and length get almost proportional. Given all of this, I am not sure whether width/aspect ratio serves as a good proxy for the chromatin condensation progression, which is how the authors are employing this data in the manuscript as written. 5. For the diffusion-capture model simulations, I think the results of the simulation would strongly depend on the assumptions of the probability to associate and the time scale of dissociation of the beads representing the condensin complex. For example, for a very strong association one might expect that all condensin will end up in one big condensate, even in the case of a long polymer. This is not explored/discussed at all. Did the authors optimize their model in any way? If not, how have they estimated the values they used? Moreover, perhaps this is an opportunity to learn/predict something about condensin properties, but the authors do not take advantage of this opportunity. In addition, the authors did some checks to show that the steady-state results of the simulations do not depend on the initial conditions. However, as some of the results reported concern the polymer evolution to the steady state (Fig.6b-c), they also need to examine whether these results depend on the chosen initial conditions (or not), and if they do, what is the rationale for the choices the authors have made? 6. A more thorough discussion of other possible models, beyond diffusion-capture model considered here, would be beneficial to the reader. First, the authors practically discard the possibility of the loop-extrusion model to explain their observations (although they never explicitly state this in the abstract or discussion). However, they neither leveraged simulations to rigorously compare models nor included some other substantiated arguments to explain why they prefer their model. This is important, as one of the major findings here is that the chromatin never reaches steady state for condensation, making it challenging to intuit what one should expect in this very dynamic state. Second, the authors, while briefly mentioning that there might be some other mechanisms contributing to the mitotic chromosome reshaping, do not really discuss those possibilities in a scholarly way. For example, work by the Kleckner group has suggested an involvement of bridges between sister chromatids into their shortening dynamics (Chu et al. Mol Cell 2020). Third, the authors do not discuss how they envision the interplay between the different SMC complexes - cohesin, condensin I and condensin II - as they act on the same chromatin polymer, or at least acknowledge a possible role that this interplay might contribute to the observed time dependencies.

      Significance

      The question the authors are trying to address is fundamental and important. While loop extrusion-driven mitotic chromosome organization is a popular model, considering alternative models is always crucial, especially when one can find experimental observations that allow us to discriminate between possible models. The main limitations are: 1) the performance of the approach the authors take to measure chromosome shape is in question and 2) the main competitive model (loop extrusion) is not modeled. If all shortcomings are addressed this work may provide strong evidence for the diffusion-capture model and thus advance our mechanistic understanding of mitotic processes, which will be of broad interest to the fields of genome and chromosome biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Our manuscript shows that, in cycling cells, the proneural master regulator transcription factor ASCL1 binds preferentially to pro-neurogenic enhancers in G1 phase of the cell cycle but this binding does not drive gene expression. As cells move to S/G2, ASCL1 binding is now enriched at promoters of pro-proliferative genes where it activates gene expression to maintain a pro-proliferative progenitor state. However, stalling of the cell cycle in G1 allows ASCL1 binding at enhancers to facilitate H3K27ac deposition and pro-neurogenic gene expression, driving the differentiation programme. We thus show hitherto unknown cell cycle dependency of distinct transcriptional programmes driven by the same transcription factor at different cell cycle stages and reveal why a lengthening specifically of G1 can allow engagement of a differentiation programme by turning unproductive factor binding into a productive interaction.

      • *

      We note, Reviewer 1:

      This is an interesting study and provides new insight into the dual mechanisms of proneural transcription factors in neuroblastoma proliferation and differentiation. Since ASCL1 has similar dual roles in proliferation and neural differentiation in normal CNS development, the results of this report will improve the understanding of this factor more generally.

      from Reviewer 2:

      This work addresses an important long-standing question: how can Ascl1 simultaneously promote cell cycle and neurogenesis? It will be of relevance for the fields of neurogenesis, stem cell biology, reprogramming, and cancer biology.

      We thank the reviewers for their very positive evaluations of the paper and its implications. Where questions and concerns were raised we have addressed them fully, below.

      1. Point-by-point description of the revisions

      Reviewer 1:

      “The authors have not done a motif analysis of the ASCL1-ChIPseq so it is not clear whether E-box motifs are enriched/dominate. This is an important control. Also, it would be very useful to compare the ASCL1-ChIP-seq with other published datasets in other neural tissues, as an additional control.”

      Prompted by this comment, we have performed motif analysis on the consensus set of ASCL1 ChIP-seq peaks in the DMSO control samples (i.e. freely cycling cells). This identified the canonical ASCL1 E-box motif as the most significantly enriched, occurring in the majority of peaks:

      We have now added this motif analysis output to Figure 1A.

      As requested, we downloaded a previously published ASCL1 ChIP-seq dataset (Păun et al. 2023) where human iPSCs were differentiated into cortical neurons. We find that ~25% of our consensus peakset intersects with binding sites detected in cortical neurons, representing just under 50% of this latter set. This is a large intersection of 25,000 peaks, especially considering the developmental differences between the two cell types (neuroblastic progenitors of the PNS versus more differentiated cortical neurons of the CNS). We have now added this figure to Supplementary Figure 1.

      “Most of the analysis is done on regions that are less than 50 kb from the nearest TSS. This restricts the analysis to about half the peaks. Since they observe a difference between the G2M peak and the G1 peaks in their distance from the TSSOur ChIP-seq protocol was very sensitive and detected even low levels/transient ASCL1 binding, giving a large number of ASCL1 peaks. Consequently, a significant fraction of the genes in the genome became associated with ASCL1 binding and so we used a stringent distance based cut-off based on the assumption that there is a higher likelihood of enhancers acting on nearby promoters, rather than those further away. When we link all peaks to their nearest TSS, irrespective of distance, we find a similar trend, namely G1 enriched ASCL1 binding is associated with neuronal developmental processes, whereas SG2M enriched binding is uniquely associated with mitotic and cell cycle processes, (although we do now see some axonal terms appear under these less stringent conditions). These two figures have now been added to Supplementary Figure 4.

      “The correlate the genes that decline with ASCL1 KO and the peaks from the ChIP-seq using GO terms, but would be very useful to determine how many of these genes are direct targets. This can bve done by showing the correlaiton between the RNAseq and the ChIP-seq on a gene-by-gene basis rather than using GO.”

      Thank you for this useful suggestion. To investigate any correlation between the ASCL1 ChIP-seq and ASCL1 KO RNA-seq, we quantified the log2 fold change in expression level (WT/KO) following ASCL1 KO for any gene that was associated with an ASCL1 binding site in asynchronous cycling cells. Plotting these fold changes as a histogram/density plot (left) reveals that these genes generally exhibited a positive fold change i.e. a decrease in expression level following ASCL1 KO (blue dotted line shows the mean log2 fold change for the ASCL1 bound genes, black dotted line is at 0). Looking specifically at the 1000 genes associated with the most significant ASCL1 ChIP-seq peaks confirms this (right), where more genes show large decreases in gene expression following KO, where the local polynomial regression (LOESS; locally estimated scatterplot smoothing, black line) is consistently higher than 1.

      Left plot: Log2 fold change in expression level for all ASCL1 bound genes, where positive fold change indicates a reduction in expression level following ASCL1 knockout, and a negative fold change indicates an increased expression following knockout. The mean value (blue dotted line), mode and median are all greater than 0 (black dotted line) indicating general reduction in expression level following ASCL1 knockout.

      Right plot: 1000 genes associated with the strongest ASCL1 peaks (normalised peak score from DiffBind) were plotted against their fold change in expression following ASCL1 knockout. There is a large amount of variability, but the local polynomial regression (LOESS, black line) is consistently greater than 1 (red dotted line; no fold change).

      We have now added the right figure to Supplementary Figure 2

      Reviewer 2 also raised similar concerns:

      “Other minor points: In figure 2, it would be interesting to display the overlap between bound and regulated genes.”

      As suggested, we looked at the overlap between genes bound by ASCL1 in DMSO treated, freely cycling cells and intersected them with genes that showed a significant change in expression level following ASCL1 KO. This reveals that the majority of bound genes are regulated by ASCL1. Put another way, the large majority of genes that exhibited differential expression following ASCL1 KO were bound by ASCL1 in WT cells.

      We have now added this Venn diagram to Figure 2.

      “The lack of ASCL1 dependence of the G1 neuronal genes (Fig 5B) is interesting, but may be confounded by the possibility that these sites are driven equally well by a redundant proneural trnascription factor, like NEUROD1 or NEUROG. This possibility should be addressed by carrying out ChIP for these factors at select sites (G2M vs G1). Alternatively ChIP-seq for these factors would be ideal. Without these experiments the conclusion is not supported: "This indicates that ASCL1 is capable of binding to neuronal targets in G1 phase of the cell cycle in neuroblastoma cells but is not supporting their expression under cycling conditions."

      The problem of redundant TFs is also an issue with the experiments to teat the effects of long G1 arrest.”

      Thank you for raising this possibility, which prompted us to look at expression of other proneural proteins in these neuroblastoma cells. Consistent with the important role for ASCL1 in neuroblastoma previously reported in contrast to lack of reports about prominent roles for other proneural transcription factors, we quantified the expression levels of other proneural proteins in parental SK-N-BE(2)-C cells and the ASCL1 KO clone. We found that the expression level of all other proneural factors was very low, especially when compared to ASCL1, and did not increase following ASCL1 KO, showing no signs of compensatory uplift. We therefore conclude that there is a very low likelihood of interference from these factors. Moreover, methodologies such as ChIP-seq for these other proneural proteins are unlikely to work given their extremely low expression levels. We now include these findings in Supplementary Figure 5.

      “The finding that G1 ASCL1 sites show less accessibility than G2M sites is interesting; is thre a reduction in ASCL1 ChIP-seq signal at these sites as well? Or is ASCL1 bound but not able to open the chromaitn at these sites?”

      We have shown in Supplementary Figure 3 of the original manuscript that there is a reduced level of ASCL1 binding at G1 enriched sites compared to SG2M enriched sites when looking at asynchronous, freely cycling cells SK-N-BE(2)-C, and two other neuroblastoma cell lines.

      To further investigate this, we performed this same analysis on the individual SK-N-BE(2)-C asynchronous replicates independently, which showed the same trend. These freely cycling cells comprise approximately 65% G0/G1 cells and 35% SG2M cells (Figure 3C). Despite more cells being in G1 in asynchronous freely cycling cells, the ASCL1 ChIP-seq signal is markedly reduced for sites which are preferentially bound by ASCL1 during G1 phase. Addressing the Reviewer’s question, this indicates that the lower levels of accessibility at G1 enriched sites versus G2M enriched sites are a result of reduced binding of ASCL1 in G1.

      We hypothesised that reduced binding in G1 could be a result of lower ASCL1 protein concentrations. To address this, we performed ASCL1 antibody-based staining and hoechst based cell cycle analysis in SK-N-BE(2)-C cells, followed by flow cytometry. This enabled us to individually quantify ASCL1 protein levels in specific cell cycle subpopulations. The relative cell size changes across the cell cycle, so to account for this we plotted the relative changes in ASCL1 protein levels with the relative changes in cell size. This revealed that ASCL1 protein levels in G2M were significantly higher than expected if solely due to changes in cell size (and the levels in S phase were lower than expected for the cell size). In contrast, when we performed the same analysis for the housekeeping gene, TBP, we observed more consistent protein levels that scaled proportionately with cell size. This reveals a degree of cell cycle-dependent regulation of ASCL1 protein levels, which may account for differences in overall binding between the two phases, and indicate that reduced ASCL1 binding in G1 may be due to a lower amount of ASCL1 protein compared to the level in other cell cycle stages (normalised for cell size).

      We have now moved the SK-N-BE(2)-C plot from original Supplementary Figure 3 to Figure 4, and added the results above to Figure 4.

      “The reduction in accessible sites in the ASCL1 KO for the G2M sites is consistent with the effects on proliferation, but the effect is very modest. Would this effect be greater if the analysis of the ATAC-seq data were confined to sites with E-boxes? it would be useful to know what percentage of the accessible sites have an E-box and what percent of these sites are lost in the ASCL1 KO. This might show the importance of redundant proneural TFs.”

      We now undertake additional analysis to address this important point directly. Of the 14,460 peaks that exhibit enriched ASCL1 binding during SG2M, 9,228 contain a canonical ASCL1 E-box motif (NNVVCAGCTGBN, taken from HOMER motif analysis above), as determined by FIMO, MEME suite (q-value We quantified the ATAC-seq signal at these peaks containing high confidence ASCL1 E-box motifs before and after ASCL1 KO and found that this extra filtering step had no impact on the magnitude of the change in accessibility following ASCL1 KO. This suggests that ASCL1 knockout has an equal effect on the accessibility of bound sites regardless of the underlying motif, and indirectly indicates that even the peaks showing degenerate ASCL1 motifs show a reduction in accessibility following ASCL1 knockout. This latter set could include sites where ASCL1 binding is mediated or enhanced by a cofactor.

      Reviewer 2:

      “There is however, one important concern to be clarified before strong conclusions can be extracted from the data: are palbociclib-treated cells comparable to control cells? 7 days of G1 arrest could have led to differentiation of at least a fraction of the NSCs and therefore the increased expression of neuronal genes (and chromatin changes) could reflect a higher percentage of differentiated cells (or higher degree of differentiation) in that sample rather than increased expression of neuronal genes in NSCs. A characterization of the cultures after the 7-day treatment is therefore necessary before drawing any conclusions. This could be done through immunohistochemistry to assess the presence of differentiated cells and control for the continuous and homogeneous expression of stemness markers (some useful markers include Nestin, Sox2, DCX, Tubb3 or GFAP). The reversibility assay, as shown in Figure S2 would also be very informative for the 7-day time point.”

      For ASCL1 ChIP-seq experiments on cell cycle synchronized cells, palbociclib treatment was for a short duration of 24 hours, to ensure that the cells are only stalled in G1, and not differentiating. Control cells were treated with DMSO for the same duration, and the confluency was not more than 80% to ensure that they are healthy, cycling cells.

      It was not experimentally possible to directly compare cells plated at the same density and then grown with or without PB for 7 days as extreme overgrowth and extensive cell death (rather that cell cycle arrest and differentiation) occurred in the cells without PB. When we performed 7 day palbociclib treatments, we plated control cells at half the density of treated cells so that by the 7 day time point, they were not overly confluent and were still cycling, allowing us to collect control cells for the RNA-seq analysis comparison. The morphology of the 7 day PB-treated cells were markedly different from control cells, showing extended neurites and overall lower confluency due to cell cycle exit and differentiation (see below).

      The morphological effects of PB treatment on neuroblastoma cells was covered in some detail in a previous publication, Ferguson et al, 2023, Dev Cell, 58:1967-1982 . In this previous study we have extensively characterised the morphology of SK-N-BE(2)-C cells plated under very similar conditions to those used here, DMSO treated (again plated on day 0 at a lower density that PB treated to limit control cell death) versus palbociclib treated, below,). These cells were stained for Tubb3 as suggested by the Reviewer. We saw extensive cell cycle inhibition morphological differentiation with PB accompanied by upregulation of Tubb3 and neurite extension. In contrast we saw very little Tubb3 upregulation or morphological change in the DMSO control cells, and cells maintain a largely uniform typical neuroblast morphology. We now describe this previous work that directly addresses the point raised more fully in the results and discussion of this manuscript.

      ­­­­Figure from Ferguson et al., 2023.

      To further address the point raised by Reviewer 2, we undertook more interrogation of our RNAseq data to confirm that 7 days of palbociclib treatment is inducing differentiation compared to the control cells. Taking suggestions from the Reviewer, we quantified the expression of several markers of stemness and neuronal differentiation from the RNA-seq data comparing treated and untreated cells. Indeed, the stemness markers SOX2, MYCN and HES1 all decrease following treatment, while the expression of key early neuronal genes (DCX, MAP2) increases.

      We have now added this plot to Supplementary Figure 4.

      “Other minor points: In figure 2, it would be interesting to display the overlap between bound and regulated genes.”

      As suggested, we looked at the overlap between genes bound by ASCL1 in DMSO treated, freely cycling cells and intersected them with genes that showed a significant change in expression level following ASCL1 KO. This reveals that the majority of bound genes are regulated by ASCL1. Put another way, the large majority of genes that exhibited differential expression following ASCL1 KO were bound by ASCL1 in WT cells:

      We have now added this Venn diagram to Figure 2.

      “Please clarify where does the number of 47,294 non-commonly regulated genes between G1 and S/G2/M come from. From the data in figure 3D the number should be roughly 30k.”

      Thank you for raising this. We agree that this is not clear and have changed the text and figure legend to better explain it. Prior to DiffBind analysis, the consensus peak sets for palbociclib-treated cells and thymidine-treated cells are shown in figure 3D. A consensus peak is one that appears in two out of the three replicates for that condition. DiffBind is then run using these consensus peak sets, which takes the magnitude of the peaks into account, identifying 47,294 differentially bound regions.

      “In figure 3F/G, it would be very informative to show also examples of cell cycle independent genes.”

      Recognising this was a minor point, we would suggest that this is largely a control for cell cycle-dependent expression that is extensively analysed in the rest of the paper. Unfortunately we do not have any remaining ChIP’ed DNA with which to show control regions. The samples were generated from approx. 1 million FACS sorted cells and so all ChIP’ed DNA was used for the qPCR reactions shown.

      “In graph 4B, please unify the way the legend is displayed (location of "count" and "p.adjust").”

      Corrected in the figure.

      “In figure 5A, could it be that the expression levels of neuronal genes are too low in control cells, so that it is difficult to see a difference in the cKO cells? Even if not significant, would be good to show the p value.”

      It is certainly possible that expression of neuronal genes is low in the WT cells and that this is why ASCL1 KO has no significant effect, but it still raises the question of how ASCL1 can bind and not drive the expression of these genes in this context. We would expect the statistical test to identify significant differences regardless of the expression level.

      Since there are multiple t tests performed in each of the right figure panels, we used the Bonferroni’s Correction for multiple testing which is equal to the p-value divided by the number of statistical tests performed (i.e. 0.05/7 = 0.0071). Thus, any test with an adjusted p-value higher than 0.0071 is considered non-statically significant.

      We have now updated the figure to show the p-values, and will modify the figure legend to explain the multiple testing correction. Additional information has also been added to the methods section.

      “And simply a style point: I found the color scheme for significance in the graphs confusing, as dark colors signify less significance and white/clear shades high significance.”

      For all other GO analyses figures, we have used a colour to represent high significance and black to represent lower significance, and it is for this reason that the GO analyses in Figures 1 and 2 use black to represent low significance. For consistency we feel it is best to keep it the same throughout the paper.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript by Beckman et al. the authors propose that different dynamics of Ascl1 binding to promoters of cell cycle and neuronal genes could explain the known association between cell cycle lengthening and differentiation. This stems from their observation that Ascl1 binds preferentially enhancers of neuronal genes in G1, although it does not drive their expression, while it binds the promoters and regulates the expression of cell cycle associated genes in G2/M. They also show that lengthening of G1 through pharmacological means increases chromatin accessibility (shown by ATAC-seq and H3K27ac) and allows Ascl1 to induce the expression of neuronal genes. They therefore propose a system where Ascl1 binds to primed neuronal enhancers in G1 but only drives their expression when a lengthened G1 phase has previously allowed chromatin changes involving histone modification. Their data is nicely controlled using Asc1cKO cells, allowing them to show specificity to the ability of Ascl1 to promote the expression of neuronal vs cell cycle genes. Overall, the work is nicely executed and clearly presented.

      There is however, one important concern to be clarified before strong conclusions can be extracted from the data: are palbociclib-treated cells comparable to control cells? 7 days of G1 arrest could have led to differentiation of at least a fraction of the NSCs and therefore the increased expression of neuronal genes (and chromatin changes) could reflect a higher percentage of differentiated cells (or higher degree of differentiation) in that sample rather than increased expression of neuronal genes in NSCs. A characterization of the cultures after the 7-day treatment is therefore necessary before drawing any conclusions. This could be done through immunohistochemistry to assess the presence of differentiated cells and control for the continuous and homogeneous expression of stemness markers (some useful markers include Nestin, Sox2, DCX, Tubb3 or GFAP). The reversibility assay, as shown in Figure S2 would also be very informative for the 7-day time point.

      Other minor points:

      • In figure 2, it would be interesting to display the overlap between bound and regulated genes.
      • Please clarify where does the number of 47,294 non-commonly regulated genes between G1 and S/G2/M come from. From the data in figure 3D the number should be roughly 30k.
      • In figure 3F/G, it would be very informative to show also examples of cell cycle independent genes.
      • In graph 4B, please unify the way the legend is displayed (location of "count" and "p.adjust").
      • In figure 5A, could it be that the expression levels of neuronal genes are too low in control cells, so that it is difficult to see a difference in the cKO cells? Even if not significant, would be good to show the p value.
      • And simply a style point: I found the color scheme for significance in the graphs confusing, as dark colors signify less significance and white/clear shades high significance.

      Significance

      This work addresses an important long-standing question: how can Ascl1 simultaneously promote cell cycle and neurogenesis? It will be of relevance for the fields of neurogenesis, stem cell biology, reprogramming, and cancer biology.

      Conceptually, it could be made clearer in the discussion that Ascl1 appears to be dispensable for the increased chromatin accessibility caused by G1 lengthening, and even for the expression of neuronal genes (as shown in figure 5B, where there is a similar increase in neuronal genes expression in the absence of Ascl1 than in control cells after 7 days of palbociclib). This won't compromise the significance of the work, which has the potential to explain the dual role of Ascl1 in NSCs. But will hopefully encourage the field to further investigate the mechanisms behind the effects of G1 lengthening on chromatin accessibility.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting study investigating the role of the proneural transcription factor ASCL1 in neuroblastoma. Previous work has shown that over-expression of ASCL1 can drive differentiation on neuroblastoma cells, but the gene also has roles in maintaining proliferation. The authors carry out a series of genomic studies including ChIP-seq and ATAC-seq to untangle these different roles of ASCL1. While most of the work presented is well-done and the analysis is straightforward, there are some concerns with the conclusions, since some key controls have not been done.

      1. The authors have not done a motif analysis of the ASCL1-ChIPseq so it is not clear whether E-box motifs are enriched/dominate. This is an important control. Also, it would be very useful to compare the ASCL1-ChIP-seq with other published datasets in other neural tissues, as an additional control.
      2. Most of the analysis is done on regions that are less than 50 kb from the nearest TSS. This restricts the analysis to about half the peaks. Since they observe a difference between the G2M peak and the G1 peaks in their distance from the TSS< it would be useful to show whether the same relationship holds when all peaks are included. This may stregthen the finding.
      3. The correlate the genes that decline with ASCL1 KO and the peaks from the ChIP-seq using GO terms, but would be very useful to determine how many of these genes are direct targets. This can bve done by showing the correlaiton between the RNAseq and the ChIP-seq on a gene-by-gene basis rather than using GO.
      4. The cell cycle synchronization experiments are a good confirmation of the unsynchronized experiments.
      5. The lack of ASCL1 dependence of the G1 neuronal genes (Fig 5B) is interesting, but may be confounded by the possibility that these sites are driven equally well by a redundant proneural trnascription factor, like NEUROD1 or NEUROG. This possibility should be addressed by carrying out ChIP for these factors at select sites (G2M vs G1). Alternatively ChIP-seq for these factors would be ideal. Without these experiments the conclusion is not supported: "This indicates that ASCL1 is capable of binding to neuronal targets in G1 phase of the cell cycle in neuroblastoma cells but is not supporting their expression under cycling conditions."
      6. The problem of redundant TFs is also an issue with the experiments to teat the effects of long G1 arrest.
      7. The finding that G1 ASCL1 sites show less accessibility than G2M sites is interesting; is thre a reduction in ASCL1 ChIP-seq signal at these sites as well? Or is ASCL1 bound but not able to open the chromaitn at these sites?
      8. The reduction in accessible sites in the ASCL1 KO for the G2M sites is consistent with the effects on proliferation, but the effect is very modest. Would this effect be greater if the analysis of the ATAC-seq data were confined to sites with E-boxes? it would be useful to know what percentage of the accessible sites have an E-box and what percent of these sites are lost in the ASCL1 KO. This might show the importance of redundant proneural TFs.

      Significance

      This is an interesting study and provides new insight into the dual mechanisms of proneural transcription factors in neuroblastoma proliferation and differentiation. Since ASCL1 has similar dual roles in proliferation and neural differentiation in normal CNS development, the results of this report will improve the understanding of this factor more generally.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to reviewers

      We sincerely thank all reviewers for taking the time to review our manuscript and for providing insightful comments and suggestions. Your feedback has been invaluable in improving the quality and clarity of our work.

      Reviewer #1

      Evidence, reproducibility and clarity

      This manuscript by Peterl and colleagues seeks to understand the long-standing observation that influenza A virus generally exhibits a filamentous phenotype in vivo which is lost upon serial passaging in vitro or in embryonated chicken eggs. In addressing this question, the authors perform a detailed quantitative comparison of how filamentous and spherical strains of influenza spread in cell culture in the presence or absence of perturbations including neutralizing antibodies, mucin, and disruption of cell-cell junctions.

      The manuscript reports several observations that will be of interest to researchers in the area of influenza virus morphology and spread. Using a combination of imaging modalities, the authors convincingly demonstrate that spherical strains of influenza virus produce larger plaques than filamentous strains that are isogenic except for mutations in M1. The authors show that this is at least partly attributable to differences in entry kinetics. The authors also recapitulate a prior finding that filamentous viruses are more resistant to neutralizing antibodies than spherical ones. In most cases, the authors' claims are supported by the data presented. A few partial exceptions are noted below.

      The paper would be strengthened by a clearer description of some of the experimental approaches which lack important details in some instances. The scope of the paper is also limited somewhat by the use of immortalized cell lines that lack physiological features of the airway epithelium. Although this limitation is understandable from a technical standpoint, a discussion of these limitations should be included. Specific comments are listed below.

      Major Points

      In Figure 4, it is not stated at what time the cell density is measured in panel B, and how this might change across the time points sampled in panel C. This would make the experiment difficult to reproduce. This could be a very important consideration if the cells reach confluency soon after the infection is initiated, since the plaque sizes seem statistically similar out to 24hpi in 4B.

      Thank you for your comment on cell densities in Figure 4 B. We agree that the quantification of cell confluency across the time points is crucial in this context. Furthermore, we recognize that counting the number of nuclei within a well is not the most accurate method for comparing the two cell lines. We now provide measurements of relative cell density based on plasma membrane staining for uninfected MDCK-WT and MDCK-α-Catenin-KO cells at 24h and 48h for three biological replicates (Figure 4 A and B). These data show that MDCK-α-Catenin-KO have lower confluency (area=229.69 µm2) at 48 h compared to MDCK-WT cells (area=361.24 µm2). While the confluency of MDCK-WT cells was > 95% at both time points, MDCK-a-Catenin-KO cells did not reach 70% confluency, which reflects the lack of adherens junctions in these cells.

      In Figure 4F, it appears that plaque sizes for M1Ud are less affected by mucin than M1WSN plaques at all concentrations tested. However, the authors conclude that "mucin did not show any IAV morphology-dependent inhibitory effect as indicated by the slopes of linear fits of the plaque diameters" (Line 265). I understand that the authors are looking for dose-dependent effects, but it is not clear to me why an analysis based on the slope is preferable, especially when the response to mucins may not be linear. How does the availability of IAV receptors in the porcine gastric mucin used here compare to human airway mucins? Finally, the authors should clarify the number of replicates for this experiment.

      Thank you for pointing out that the data representation of IAV WSN and WSN-M1Udorn plaque growth in the presence of mucin (Figure 4 C) lacked clarity. We agree and removed the regression fitting and, instead, show all individual plaque sizes (Extended Figure 4 B). We now provide relative reduction of plaque sizes compared between WSN and WSN-M1Udorn plaques at each mucin concentration using 3 or 4 independent experiments (Figure 4 E). This did not reveal that there was a significant reduction in plaque size change between WSN and WSN-M1Udorn in the absence or presence of mucin. We changed our conclusion: "mucin did not show an IAV morphology-dependent inhibitory effect as indicated by the relative plaque size decrease of WSN-M1Udorn compared to WSN across the mucin concentrations" (Line 278).

      We have included information on the mucus composition and receptor availability in the discussion: "Notably, we used porcine gastric mucin, which might differ structurally and in the sialic acid linkage types compared to human mucins (Nordman et al., 2002, doi: 10.1042/bj3640191; Zhang et al., 2021, doi: 10.1007/s10719-021-10014-y). However, both in the porcine stomach and human airway, MUC5AC molecules are the predominant gel-forming mucins." (Graigner et al., 2006, 10.1007/s11095-006-0255-0) (Line 436).

      One key difference between the cells used here and the airway epithelium is the presence of multiciliated cells that could alter viral transport in ways that depend on morphology and may be difficult to predict. I appreciate that this concept is outside the scope of the current work, but it is an important point that warrants mention.

      We have now included fluorescence microscopy data using anti-MUC5AC antibody to assess mucin production in Calu-3 cells. Importantly, we could demonstrate that Calu-3 cells used in our study express mucins (Figure 4 D). We acknowledge that the absence of multiciliated cells is a limitation and plan to address this in future studies by using air-liquid interface cultures and by incorporating primary human bronchial cells. We established a transwell Calu-3 cell culture under air-liquid interface (ALI) conditions, which allowed for cell polarization. The apical surface of Calu-3 cells grown in an ALI culture contains more mucin than in liquid-covered unpolarized cultures. We plan to adapt and further develop a correlative imaging workflow to be able to assess spread in transwells in a separate study, as this is technically more challenging. We have included this in the discussion (Line 440-444).

      Minor Points

      It is somewhat unclear what is being captured in the data in Figure 5D-I. I assume that the cell surfaces that are imaged here are from infected cells within the plaque. If this is the case, it is difficult to tell whether the particles that are being quantified are incoming viruses or viruses that are currently budding. MEDI8852 is a stalk-binding antibody which would not be expected to inhibit viral attachment. This is unlikely to change the interpretation since the data shows differences between spherical and filamentous strains. However, a clearer description of this data would be helpful.

      We appreciate your constructive feedback. Figure 5 captures the effect of HA-stalk-binding MEDI8852 antibodies on IAV spread and morphology. While this antibody does not prevent receptor binding, it blocks membrane fusion and exerts pressure on the viruses, which, based on our hypothesis, can be overcome by increasing the number of HA on the surface of filamentous viruses. This is now also confirmed in Figure 5B showing that entry of spherical viruses is more sensitive to MEDI8852 than entry of filamentous viruses above concentration of 5 nM.

      SEM images of IAV plaques in MDCK cells in the presence of 1 nM MEDI8852 antibody show that viral morphology is not altered by antibody pressure. We agree that this method provides information on IAV morphology but does not allow us to distinguish between incoming or budding viruses. However, virus entry is fast, and IAV release from plasma membrane is slow as obvious from transmission electron microscopy studies showing large quantities of budding virions connected to plasma membrane by budding neck (example: DOI: 10.1099/vir.0.036715-0). Hence, it can be assumed that the majority of viruses captured by SEM on the cell surface are budding viruses. We have included this in the discussion (Line 409-414).

      Nevertheless, to further address this limitation, we now provide a more robust analysis of IAV particle numbers and morphologies from supernatants of serial passaging in MDCK cells under MEDI8852 antibody pressure, using cryo-EM (Fig. 5 D, E). In accordance with the SEM data, we did not observe morphological changes of IAV in the presence of the antibody.

      For experiments in Calu-3 cells, is trypsin added to the culture media following infection? If not, what percentage of HA is proteolytically cleaved? I would expect these cells to express activating proteases, but if activation is less efficient, this could favor the filamentous strain (as discussed in ref 49).

      Thank you for this comment. Yes, trypsin was added to the medium of Calu-3 cells during infection. We included this in the methods section.

      The schematic in Figure 4D illustrates mucins as tethered to the cell surface. This does not reflect the experiments in Figure 4E and F, where secreted mucins are added to the overlay media.

      We agree, and we removed the schematic representation of mucins in Figure 4D, instead we show data on mucin production in Calu-3 cells (Figure 4 D).

      There are a few small typos. Line 61: "to results in" and Line 111: "neutralizing antibodies against hemagglutinin are more effectively blocking virions with spherical morphology."

      We corrected the typo in line 61 and changed the phrasing of lines 111-112 for more clarity.

      Significance

      A strength of this manuscript is the quantitative rigor of the approaches used, which reveal interesting differences in the spread of filamentous and spherical influenza. These differences are compelling, but are limited somewhat in their significance by the difficulty of evaluating whether or not some of the observations would be preserved in differentiated airway epithelial cells. The authors do not over-generalize their conclusions, but more detailed discussion of these potential limitations is warranted.

      As mentioned above, we agree that a differentiated airway is important; however, assessing determining factors responsible for inhibition might be difficult due to the high complexity of the culture composed of different cells. The presented methods allow quantitatively assessing individual factors, which provides benefits. Hence, both approaches are valid and important.

      Reviewer #2

      Evidence, reproducibility and clarity

      Summary: This manuscript by Peteryl and colleagues explores the question of why some influenza viruses (typically those that have been recently isolated from animals, though also the Udorn strain) produce filamentous particles, while influenza viruses that have been adapted to eggs or cell culture form spherical particles. This is a long standing question in the influenza field, and the authors have used a nice set of new tools and approaches to shed light on this question. They created mScarlet labelled viruses that produce spherical (WSN) or predominantly filamentous (WSN with an M segment from Udorn) virions, but share the same glycoproteins. While this approach is not novel (the fact that the segment 7 of Udorn drives a filamentous phenotype has been previously demonstrated), the authors used these viruses in an elegant series of experiments to look at the rate of cell to cell spread within a plaque to show that the spherical viruses spread more quickly. The authors then explored the effect of cell density, inhibitors designed to inhibit different routes of viral entry, and the presence of neutralizing antibody. The experiments are thoughtfully designed, and the electron microscopy in particular is beautifully done. In general, the conclusions are supported by the data, though the specific claim that filamentous viruses have an advantage in viral entry in the presence of neutralizing antibody would be significantly strengthened by performing the specific entry assay the authors employ earlier in the manuscript.

      Major comments: The key conclusions are largely convincing, though the authors should perform the entry assays they employ in figure 3 (measuring the kinetics of entry and the efficiency of entry) to determine whether the delay in cell to cell spread they observe for spherical viruses in the presence of neutralizing antibody is due specifically to the effect on entry. I also am concerned about the method used to determine that the antibody treatment in Fig 5D-H results in a difference in the number of virions produced. While I appreciate that SEM is time consuming and difficult to quantify, counting the number of virions seen in a single field of view from 7 or 12 cells does not provide a robust foundation to support the central claim of the paper, that the difference in speed of filamentous and spherical viral spread is due to a difference in their ability to support viral entry in the presence of neutralizing antibody . If the authors wish to count virions produced by the WSN/WSN M-Udorn viruses in the presence/absence of neutralizing antibody it would be sensible to perform a synchronized high MOI infection and measure infectious titer by plaque assay (as this would be able to quickly and easily measure millions of virions produced by hundreds of thousands of cells).

      Thank you very much for the suggestion to perform an entry assay in the presence of a neutralizing antibody to determine whether the antibody acts at the level of viral entry. We now provide data on the entry efficiency of WSN and WSN-M1Udorn in the presence of increasing MEDI8852 concentrations (Figure 5 B). The results show that entry of the WSN spherical viruses are more affected by MEDI8852 at 5 nM and 10 nM, compared to WSN-M1Udorn, suggesting that the reduced plaque growth presented in Figure 5 C reflects an inhibition of IAV entry.

      We agree that the quantification of virions at the surface of 7-12 cells in SEM images is not a robust method. Therefore, we removed the quantification as it is technically very time-consuming to obtain a large enough dataset or to perform statical power analysis on how many cells would need to be screened. We additionally performed a serial passaging experiment of WSN and WSN-M1Udorn under antibody pressure, providing a more robust analysis of IAV particle numbers and morphologies from supernatants using cryo-EM (Fig. 5 D, E). By quantifying the length/diameter ratio of at least 80 virions per condition, we observed that both IAV morphologies remained stable in the presence of the antibody after five passages.

      The two entry assays could be done in parallel, and I anticipate them to take ~3 days per replicate (a day to seed, a day to infect/add NH4Cl at the indicated time points and fix, a day to image and analyze data). Similarly, infected cells at high MOI in the presence/absence of nAb, collecting viral supernatants, and tittering by plaque assay should take ~one week. The reagents to perform these experiments are already in hand, and as the costs will be limited to standard tissue culture reagents, using a microscopy set up the authors already possess. The experiments throughout the paper are well described, with appropriate methodological detail and statistical analysis.

      Minor comments: • Viruses without the mScarlet spread faster, the WSN-Udorn has more viruses with mScarlet than the WSN does so how do we know that some of the difference isn't down to that?

      Thank you for this important question. It is correct that viruses without mScarlet spread faster. We used WSN mScarlet viruses for CLSEM and live cell imaging of Calu-3 cells. To ensure that the observed differences in viral spread kinetics were not attributable to the presence or absence of mScarlet but to viral morphology, we conducted additional immunofluorescence staining for viral nucleoprotein (NP) or matrix protein 2 (M2) (Extended Figure 1 H-I). This allowed us to account for all viral plaques, including those that were not mScarlet-positive. This way we obtained data for our experiments with MDCK-α-Catenin-KO cells, mucin, zanamivir and MEDI8852 (Figure 4 and 5).

      • While Calu3 cells are reported to make mucus the authors should verify the expression of relevant mucus proteins in their hands, and this phenotype can be variable depending on culture conditions.

      Thank you for highlighting this important point. We verified the expression of MUC5AC in Calu-3 cells grown on cover slips and observed MUC5AC expression in distinct puncta (Figure 5 D).

      • In 5F and I does 'mock' mean no antibody or no virus?

      We apologize for the imprecise nomenclature in Figure 5 F and changed the Figure description.

      • The authors should either include data to support the claim in line 410: "Our data provide further evidence that IAV filamentous morphology is lost to accelerate cell-to-cell spread by faster entry kinetics and to achieve higher entry efficiency" or reword this sentence, since at present this manuscript does not include experiments demonstrating the loss of filamentous morphology in tissue culture of the WSN-M1 Udorn virus.

      Thank you, we agree and modified the sentence.

      Significance

      The data and conclusions presented in this manuscript are exciting and novel, and should be of high interest to virologists and cell biologists. The work builds on (and appropriately references) prior work in the field of influenza particle shape by the Lamb, Barclay, Garcia-Sastre, Vahey, Fletcher and Ivanovic groups. It provides new information and techniques to show that spherical virions spread faster than filamentous virions within plaques, and this advantage is not negated by cell density, the presence of mucus, or different entry inhibitors but is significantly reduced in the presence of neutralizing antibodies. It also includes other useful observations to the field (the fact that infected Calu3 cells migrate to the center of infected plaques, the fact that the entry kinetics and success rate of filaments is lower compared to spheres). Expertise: virology, influenza, virion morphology, cell biology

      __Reviewer #3 __

      Evidence, reproducibility and clarity:

      The manuscript by Peterl et al. deals with the still interesting question of why influenza A viruses are filamentous in natural isolates but adopt a spherical phenotype in cell culture. The authors generated recombinant IAV reporter viruses that display identical antigenic (HA and NA) surfaces but differ in their morphology due to expression of an M1 protein that confers a spherical or filamentous phenotype. The data show that spherical viruses exhibit increased entry kinetics and spread faster in cell culture compared to filamentous viruses and that this is also the case in the presence of mucins and at a low cell density. Interestingly, the authors found that spherical viruses are more efficiently blocked by neutralizing HA antibodies than filamentous viruses, providing an interesting advantage for the filamentous phenotype of natural IAV isolates due to antibody pressure. The manuscript is of the usual excellent quality of the working group of Petr Chlanda and the data are very interesting. The experiments are well thought out and the results are comprehensible, convincing and visually very clear. The fact that a current preprint also describes that neutralizing antibodies drives filamentous virus formation (as mentioned by the authors in the discussion) does not diminish the message and quality of this work. There were a few minor open questions that came to mind that could be included in the discussion: The authors found that the filamentous morphology was stable throughout multiple rounds of infection during plaque formation. Is this still the case even with multiple passages (e.g 10x) in cell culture or does the number of spherical particles increase at some point?

      Thank you for your positive feedback and this suggestion. We performed serial passaging of WSN and WSN-M1Udorn in MDCK cells in the presence of 1 nM MEDI8852 antibody and harvested supernatants from passage 1 and 5. Supernatants were plunge-frozen, and virion counts and morphologies were determined by cryo-electron microscopy. Data from at least 80 analyzed virions per condition showed that the overall number of spherical and filamentous virions was reduced after passage 5 under antibody pressure (Fig 5 D). However, both morphologies remained stable throughout five passages in the presence of MEDI8852 (Fig. 5 E). We did not observe an increase in spherical particles after five passages.

      The filamentous virus spreads slower in cell culture. Does NA play a role here? NA is probably distributed differently on the surface of filamentous viruses (at the tips) than on spherical viruses?

      Thank you for this comment. As correctly pointed out, NA is enriched on one side/tip of filamentous (Calder et al., 2010, doi:10.1073/pnas.1002123107) or spherical IAV as now highlighted in Figure 1 D and E (white arrowheads). This asymmetric NA distribution and the HA-NA balance have been reported to be crucial for the release of newly formed virions and their spread through the mucus layer in the airway epithelium (De Vries et al., 2019, doi: 10.1016/j.tim.2019.08.010). Additionally, we compared the role of NA in the spread of spherical and filamentous IAV by performing fluorescent plaque assays in the presence of Zanamivir, a potent NA inhibitor. Analysis of plaque growth in the presence of increasing Zanamivir concentrations showed that the spread of both IAV morphologies was inhibited to a comparable extent (Figure 4 F and extended Figure 4 C). This result suggests that the inhibition of NA enzymatic activity does not influence the IAV morphology-dependent spread. We have included this information in the results (Line 281-285) and discussion (Line 465-468).

      Reviewer #3 (Significance (Required)):

      The manuscript is of the usual excellent quality of the working group of Petr Chlanda and the data are very interesting. The experiments are well thought out and the results are comprehensible, convincing and visually very clear.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Peterl et al. deals with the still interesting question of why influenza A viruses are filamentous in natural isolates but adopt a spherical phenotype in cell culture. The authors generated recombinant IAV reporter viruses that display identical antigenic (HA and NA) surfaces but differ in their morphology due to expression of an M1 protein that confers a spherical or filamentous phenotype. The data show that spherical viruses exhibit increased entry kinetics and spread faster in cell culture compared to filamentous viruses and that this is also the case in the presence of mucins and at a low cell density. Interestingly, the authors found that spherical viruses are more efficiently blocked by neutralizing HA antibodies than filamentous viruses, providing an interesting advantage for the filamentous phenotype of natural IAV isolates due to antibody pressure. The manuscript is of the usual excellent quality of the working group of Petr Chlanda and the data are very interesting. The experiments are well thought out and the results are comprehensible, convincing and visually very clear. The fact that a current preprint also describes that neutralizing antibodies drives filamentous virus formation (as mentioned by the authors in the discussion) does not diminish the message and quality of this work. There were a few minor open questions that came to mind that could be included in the discussion: The authors found that the filamentous morphology was stable throughout multiple rounds of infection during plaque formation. Is this still the case even with multiple passages (e.g 10x) in cell culture or does the number of spherical particles increase at some point? The filamentous virus spreads slower in cell culture. Does NA play a role here? NA is probably distributed differently on the surface of filamentous viruses (at the tips) than on spherical viruses?

      Significance

      The manuscript is of the usual excellent quality of the working group of Petr Chlanda and the data are very interesting. The experiments are well thought out and the results are comprehensible, convincing and visually very clear.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Peteryl and colleagues explores the question of why some influenza viruses (typically those that have been recently isolated from animals, though also the Udorn strain) produce filamentous particles, while influenza viruses that have been adapted to eggs or cell culture form spherical particles. This is a long standing question in the influenza field, and the authors have used a nice set of new tools and approaches to shed light on this question. They created mScarlet labelled viruses that produce spherical (WSN) or predominantly filamentous (WSN with an M segment from Udorn) virions, but share the same glycoproteins. While this approach is not novel (the fact that the segment 7 of Udorn drives a filamentous phenotype has been previously demonstrated), the authors used these viruses in an elegant series of experiments to look at the rate of cell to cell spread within a plaque to show that the spherical viruses spread more quickly. The authors then explored the effect of cell density, inhibitors designed to inhibit different routes of viral entry, and the presence of neutralizing antibody. The experiments are thoughtfully designed, and the electron microscopy in particular is beautifully done. In general, the conclusions are supported by the data, though the specific claim that filamentous viruses have an advantage in viral entry in the presence of neutralizing antibody would be significantly strengthened by performing the specific entry assay the authors employ earlier in the manuscript.

      Major comments:

      The key conclusions are largely convincing, though the authors should perform the entry assays they employ in figure 3 (measuring the kinetics of entry and the efficiency of entry) to determine whether the delay in cell to cell spread they observe for spherical viruses in the presence of neutralizing antibody is due specifically to the effect on entry. I also am concerned about the method used to determine that the antibody treatment in Fig 5D-H results in a difference in the number of virions produced. While I appreciate that SEM is time consuming and difficult to quantify, counting the number of virions seen in a single field of view from 7 or 12 cells does not provide a robust foundation to support the central claim of the paper, that the difference in speed of filamentous and spherical viral spread is due to a difference in their ability to support viral entry in the presence of neutralizing antibody . If the authors wish to count virions produced by the WSN/WSN M-Udorn viruses in the presence/absence of neutralizing antibody it would be sensible to perform a synchronized high MOI infection and measure infectious titer by plaque assay (as this would be able to quickly and easily measure millions of virions produced by hundreds of thousands of cells).

      The two entry assays could be done in parallel and I anticipate them to take ~3 days per replicate (a day to seed, a day to infect/add NH4Cl at the indicated time points and fix, a day to image and analyze data). Similarly, infected cells at high MOI in the presence/absence of nAb, collecting viral supernatants, and tittering by plaque assay should take ~one week. The reagents to perform these experiments are already in hand, and as the costs will be limited to standard tissue culture reagents, using a microscopy set up the authors already possess. The experiments throughout the paper are well described, with appropriate methodological detail and statistical analysis.

      Minor comments:

      • Viruses without the mScarlet spread faster, the WSN-Udorn has more viruses with mScarlet than the WSN does so how do we know that some of the difference isn't down to that?
      • While Calu3 cells are reported to make mucus the authors should verify the expression of relevant mucus proteins in their hands, and this phenotype can be variable depending on culture conditions.
      • In 5F and I does 'mock' mean no antibody or no virus?
      • The authors should either include data to support the claim in line 410: "Our data provide further evidence that IAV filamentous morphology is lost to accelerate cell-to-cell spread by faster entry kinetics and to achieve higher entry efficiency" or reword this sentence, since at present this manuscript does not include experiments demonstrating the loss of filamentous morphology in tissue culture of the WSN-M1 Udorn virus.

      Significance

      The data and conclusions presented in this manuscript are exciting and novel, and should be of high interest to virologists and cell biologists. The work builds on (and appropriately references) prior work in the field of influenza particle shape by the Lamb, Barclay, Garcia-Sastre, Vahey, Fletcher and Ivanovic groups. It provides new information and techniques to show that spherical virions spread faster than filamentous virions within plaques, and this advantage is not negated by cell density, the presence of mucus, or different entry inhibitors but is significantly reduced in the presence of neutralizing antibodies. It also includes other useful observations to the field (the fact that infected Calu3 cells migrate to the center of infected plaques, the fact that the entry kinetics and success rate of filaments is lower compared to spheres).

      Expertise: virology, influenza, virion morphology, cell biology

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by Peterl and colleagues seeks to understand the long-standing observation that influenza A virus generally exhibits a filamentous phenotype in vivo which is lost upon serial passaging in vitro or in embryonated chicken eggs. In addressing this question, the authors perform a detailed quantitative comparison of how filamentous and spherical strains of influenza spread in cell culture in the presence or absence of perturbations including neutralizing antibodies, mucin, and disruption of cell-cell junctions.

      The manuscript reports several observations that will be of interest to researchers in the area of influenza virus morphology and spread. Using a combination of imaging modalities, the authors convincingly demonstrate that spherical strains of influenza virus produce larger plaques than filamentous strains that are isogenic except for mutations in M1. The authors show that this is at least partly attributable to differences in entry kinetics. The authors also recapitulate a prior finding that filamentous viruses are more resistant to neutralizing antibodies than spherical ones. In most cases, the authors' claims are supported by the data presented. A few partial exceptions are noted below.

      The paper would be strengthened by a clearer description of some of the experimental approaches which lack important details in some instances. The scope of the paper is also limited somewhat by the use of immortalized cell lines that lack physiological features of the airway epithelium. Although this limitation is understandable from a technical standpoint, a discussion of these limitations should be included. Specific comments are listed below.

      Major Points

      In Figure 4, it is not stated at what time the cell density is measured in panel B, and how this might change across the time points sampled in panel C. This would make the experiment difficult to reproduce. This could be a very important consideration if the cells reach confluency soon after the infection is initiated, since the plaque sizes seem statistically similar out to 24hpi in 4B.

      In Figure 4F, it appears that plaque sizes for M1Ud are less affected by mucin than M1WSN plaques at all concentrations tested. However, the authors conclude that "mucin did not show any IAV morphology-dependent inhibitory effect as indicated by the slopes of linear fits of the plaque diameters" (Line 265). I understand that the authors are looking for dose-dependent effects, but it is not clear to me why an analysis based on the slope is preferable, especially when the response to mucins may not be linear. How does the availability of IAV receptors in the porcine gastric mucin used here compare to human airway mucins? Finally, the authors should clarify the number of replicates for this experiment.

      One key difference between the cells used here and the airway epithelium is the presence of multiciliated cells that could alter viral transport in ways that depend on morphology and may be difficult to predict. I appreciate that this concept is outside the scope of the current work, but it is an important point that warrants mention.

      Minor Points

      It is somewhat unclear what is being captured in the data in Figure 5D-I. I assume that the cell surfaces that are imaged here are from infected cells within the plaque. If this is the case, it is difficult to tell whether the particles that are being quantified are incoming viruses or viruses that are currently budding. MEDI8852 is a stalk-binding antibody which would not be expected to inhibit viral attachment. This is unlikely to change the interpretation since the data shows differences between spherical and filamentous strains. However, a clearer description of this data would be helpful.

      For experiments in Calu-3 cells, is trypsin added to the culture media following infection? If not, what percentage of HA is proteolytically cleaved? I would expect these cells to express activating proteases, but if activation is less efficient, this could favor the filamentous strain (as discussed in ref 49).

      The schematic in Figure 4D illustrates mucins as tethered to the cell surface. This does not reflect the experiments in Figure 4E and F, where secreted mucins are added to the overlay media.

      There are a few small typos. Line 61: "to results in" and Line 111: "neutralizing antibodies against hemagglutinin are more effectively blocking virions with spherical morphology."

      Significance

      A strength of this manuscript is the quantitative rigor of the approaches used, which reveal interesting differences in the spread of filamentous and spherical influenza. These differences are compelling, but are limited somewhat in their significance by the difficulty of evaluating whether or not some of the observations would be preserved in differentiated airway epithelial cells. The authors do not over-generalize their conclusions, but more detailed discussion of these potential limitations is warranted.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their thorough evaluation of this manuscript. We are pleased that overall, they found our work and results valuable for the scientific community. Based on their feedback, we performed additional experiments and made several changes to strengthen the manuscript and expand the target audience.

      *All three reviewers pointed out that the manuscript lacked demonstration of OneSABER method applicability across sample types (i.e., its claimed versatility) and other whole-mount systems beyond the Macrostomum lignano flatworm. *

      We now include an additional results section with accompanying figures (Figs. 6 and 7) that demonstrate the application of OneSABER in whole-mount samples of another flatworm, the planarian Schmidtea mediterranea (Fig. 6), which is much larger than M. lignano, and in formalin-fixed paraffin-embedded (FFPE) mouse small intestine tissue sections (Fig. 7). We believe that these additional experiments on different sample types demonstrate the versatility of the OneSABER approach.

      Please note that two more authors, Jan Freark de Boer and Folkert Kuipers, have been added for their contribution to mouse FFPE sections.

      Furthermore, two reviewers asked for an additional main figure with a comparison of the signal strengths between the different OneSABER methods.

      We have addressed this comment by including an additional results section and its adjacent figure (Fig. 5), where we provide a comparison of fluorescent signals from the same probes and gene but different OneSABER development methods.

      Additionally, to implement the revisions, we modified Fig. 1 and Supplementary Fig. 6 and broadened Supplementary Tables S1-S2, S4-S6.

      2. Point-by-point description of the revisions

      Reviewer #1

      1) “Fig.1 seems to suggest that the protocol for in vitro swapping of 3' concatemers happens in two consecutive PCR steps. I recommend indicating in the figure that the switching can be conducted in a single in vitro reaction.”

      We have changed Fig. 1 to make this clearer.

      2) “Is it possible to multiplex the switching in one single reaction? For example, perform p27 to p28 and p29 to p30 simultaneously? This will be crucial for the split-probe methodology.”

      We did not test it. This should be possible if there is no overlap between the 3’ initiator sequences. However, it seems counterproductive as the elongation efficiencies of switching reactions from the 3’ initiator sequences to another concatemer may vary (Supplementary Fig. S6). Running independent extension/switch reactions and performing equimolar mixing of purified extended probes could be a better solution.

      3) “Did the authors encounter any switching hairpins sequence that does not work? If not, can they postulate, what are the requirements for the design of switching sequences.”

      The design criteria followed the requirements postulated in the original SABER article and its Supplementary Materials (Kishi et al 2019). All switching hairpins we tested in the pairs of the 3 used 3’ initiator sequences (p27, p28 and p30) worked, but elongation efficiencies varied (see an example in Supplementary Fig. S6).

      4) “Is there cross hybridization between the switched and original hairpins? For example, can the authors show that the signals from p27 and p30 do not overlaps?”

      The in situ hybridization results with swapped primary probes are shown in Fig. 6B (multiplexed HCR in S. mediterranea). All probes were originally designed using a p27 PER initiator. We swapped Smed-vit-1 with p30 and Smedwi-1 with p28. We also updated Fig. S6, by adding the second section (B) showing the in vitro results after concatemer swapping, as well as hybridization specificity of the secondary imager probes.

      5) “Can the authors quantify results from the direct, AP, TSA, and HCR? What do you mean by 'narrow anatomical structures like neural chords (syt11) or muscles (tnnt2) seem less visible'?”

      *“I agree with reviewer #2 regarding the lack of comparison to standard SABER.” *

      A comparison of fluorescent signals from the same probes/genes but different OneSABER development methods is shown in Fig. 5.

      We have rephrased the sentence for clarity. From “As a result, despite higher intracellular resolution, some narrow anatomical structures like neural chords (syt11) or muscles (tnnt2) seem less visible for the human eye after SABER HCR (Figs. 3, 4).” to “As a result, despite higher intracellular resolution, some fine anatomical structures like neural chords (syt11) or muscles (tnnt2) are less resolved by widefield fluorescence microscopy after SABER HCR FISH compared to SABER TSA FISH”

      Reviewer #2

      1) “This work is building on standard SABER (a set of PER-extended primary probes that serve as landing pads for secondary fluorescently-labeled readout oligos) and pSABER (the readout oligo carries HRP instead of a dye for downstream TSA). The novelty of the work presented here is introducing additional variations of signal amplification, i.e. by using an hapten-labeled oligo to recruit a tertiary readout probe (antibodies conjugated with HRP or AP) or using SABER in combination with HCR. Since SABER can be seen as the underlying platform and pSABER was (arguably) also already introduced as a new platform by Attar et al. 2023, it seems difficult to introduce OneSABER as yet another new platform, of which standard SABER and pSABER are a part of. The reviewer encourages the authors to overthink the conceptual introduction, which in view of its certainly distinct novel features might allow a clearer distinction to previous work.”

      We agree with the reviewer’s comments. We have added additional information in the Introduction section to clarify the novelty and key distinct features of OneSABER that justify its separation from other SABER protocols.

      2) “Although the authors take care in tributing prior work, some of the studies are only mentioned in the results section, one of such cases is pSABER by Attar et al. 2023. The close relation between pSABER and SABER TSA (HRP on readout oligo vs. hapten on readout oligo + HRP-conjugated antibody) needs to be better positioned in the introduction, clearly framing earlier work, inspirations drawn etc.. This is in line with my previous point.”

      The pSABER preprint article by Attar et al. 2023 (now published in a peer-reviewed journal as Attar et al. 2025) is now mentioned in the Introduction, and its inspirational impact on our research is clearly stated.

      3) “Fig. 1 lists the individual modules of the OneSABER platform: i) standard SABER, ii) AP SABER, iii) SABER TSA, iv) pSABER (TSA FISH) (would recommend leaving it with original name when introducing it and include additional explanation in parentheses) and iv) SABER HCR. The main figures feature only AP SABER, SABER TSA and SABER HCR, for standard SABER and pSABER one must look up the SI. Since the authors describe the limited performance of standard SABER for one of their targets of interest (syt11) and since they have tested this target for all five conditions, it would be valuable to include a comparative view of all five platform modules in a single figure for syt11 or even also piwi, which also seems to have been tested for all five. Comparing the signal strength would be useful for the community, at least of each SABER variation compared to standard SABER.”

      We agree with the reviewer’s comments. Except for pSABER, a comparison of fluorescence signals from the same probes/genes but different OneSABER development methods is shown in Fig. 5. To make the comparison as objective as possible, all FISH developments were re-done using available “far red” fluorophores, except for pSABER. Unfortunately, our directly labeled HRP oligonucleotides for pSABER lost their activity after a year of storage at +4oC. These conjugated oligonucleotides are very expensive and, given their limited shelf life, we cannot justify ordering a new batch for this experiment. Therefore, we only have the data for pSABER syt11 with FITC green tyramide, which is not comparable to “far red” fluorophore signals. This issue has also been discussed in the main text.

      In addition, we have modified Fig. 1, as suggested.

      4) “The description of how the authors designed their probes is very detailed and they also provide a nice step-by-step protocol for their individual commands using Oligominer and BLAT software. This reviewer is wondering how the authors chose their PER sequences that they appended to their mined set of homologous in situ hybridization probes (p27,p28,p30). This is a general problem of multiplexed ISH approaches with single-stranded overhang, could the author's comment on potential self-interaction of the appended sequence with the homologous part, which might limit the PER efficiency, or elaborate on their choice?”

      As being ourselves novice to SABER when we started our work, we based our selection of the p27, p28, and p30 PER sequences on their multiple co-occurrences in previous publications (Amamoto et al. 2019, doi: 10.7554/eLife.51452; Saka et al. 2019, doi: 10.1038/s41587-019-0207-y; Wang et al. 2020, doi: 10.1016/j.omtm.2020.10.003; Salinas-Saavedra et al. 2023, doi: 10.1016/j.celrep.2023.112687; and Attar et al. 2023, doi: 10.1101/2023.01.30.526264). We did not consider the potential interference between PER concatemers and homologous primary probe-binding sequences. However, as all PER concatemers were specifically designed to lack G nucleotides to keep them from self-annealing (Kishi et al. 2019, doi: 10.1038/s41592-019-0404-0), we assumed that it would also reduce potential annealing to the homologous part of the probe.

      5) “Fig.1 and l. 125 describe straightforward in vitro switching of the concatemer sequence for an existing set of primary probes as a central feature of the OneSABER platform. However, the authors to my knowledge do not show such experiments themselves and only cite the original SABER paper by Kishi et al. 2019. This reviewer would be grateful to be pointed toward where in Kishi et al. 2019 this was demonstrated, however in view of this central part of the swopping scheme in the OneSABER platform an experiment showing this swopping is missing.”

      In the article by Kishi et al. 2019, concatemer switching/swapping is termed as “primer remapping”. We found this term confusing because it does not describe the essence of the reaction. The in situ hybridization results with swapped primary probes are shown in Fig. 6B (multiplexed HCR in S. mediterranea). All probes were originally designed using a p27 PER initiator. We swapped Smed-vit-1 with p30 and Smewi-1 with p28. We also updated Fig. S6, by adding the second section (B) showing the in vitro results after concatemer swapping, as well as hybridization specificity of the secondary imager probes.

      6) “the description of Table S6 could use additional information in the legend such that the reader does not have to scroll down to Section S1 to retrieve the information (PER reaction, gel conditions, ladder is dsDNA, what are the individual bands)”

      Probably, the reviewer meant Fig. S6. We now wrote a more detailed caption for the figure and extended it with a second panel (B) to illustrate the results of 3’ concatemer swapping.

      7) “the manuscript features an extensive set of resources in main body, supplementary materials and protocols. It is important and usually not merited sufficiently making the effort to compare orthogonal approaches for a given aim. This reviewer particularly appreciates the detailed strengths & weaknesses discussion in Table S6.”

      We thank the reviewer for the appreciation of our work.

      8) “Minor comments:

      -Definitions should be consistent, in Fig. 1 all approaches are defined with FISH added, but this definition is not followed consistently in the main text.”

      These definitions are now made consistent throughout the text.

      9) “Optional:

      -The authors describe several newly developed optimization steps during sample preparation for M. lignano ISH experiments compared to established ones. If the data exists, they include a supplementary figure showing improvements of optimized protocol steps”

      As almost every step and the buffer recipes were different from the original ISH protocol by Pfister et al. (2007) because of the use of liquid-exchange columns, different probes, and development chemistry, we believe that a comparison would be excessive. We think that the key difference points are already substantially highlighted in the results section.

      Reviewer #3

      1) “Despite including a whole figure (Figure 1) featuring the operation scheme of the OneSABER platform, the figure as well as the associated text fall short with respect to clearly stating the advantage of the different aspects of the platform. Consider a clearer and more thorough explanation of the different aspects of the platfrom.”

      Details on the advantages and disadvantages of using different OneSABER methods in terms of their experimental application and cost efficiency are described in Supplementary Tables S4-S6 of the submitted manuscript. However, we agree that the description in Fig. 1 was too concise and also did not refer to these tables. We have expanded the description in Fig. 1.

      2) “Related to the first comment: A more detailed description of the similarities and/or differences of this platform relative to similar applications such as the study by Hall et al, 2024”

      The mere point of mentioning the preprint of Hall et al. 2024 (now peer-reviewed, https://doi.org/10.1016/j.celrep.2024.114892) was to acknowledge that in M. lignano the HCR technology has been previously applied (although only once), while all other previously published works on M. lignano utilized canonical antisense RNA probes colorimetric in situ hybridization. We have extensively mentioned the HCR approach and its working principles throughout the submitted manuscript.

      3) “The authors describe the probes used as short, synthetic DNA probes targeting short RNA transcripts. Are these probes Oligopaints (Beliveau et al, 2015)? Why is that not more clearly stated in the text?”

      Oligopaints use oligo libraries as a renewable source of FISH probes, and these libraries are amplified with fluorophore-conjugated PCR primers. We used synthetic DNA probes directly. In this sense, our probe sets are not oligopaints. However, we used the OligoMiner pipeline of Oligopaints for the design of the probes, and thus used the same tiling strategy as oligopaints. We believe that this has been explained in the manuscript. Please refer to comment 4 of Reviewer 2.

      4) “Line 105, p5: The authors state that the number of probes depends on the target RNA length and its expression strength. This data should be in the main text and described in detail since it is a major aspect of the platform design.”

      We believe that this statement is common sense, as one cannot design more than 5x 30-50 bp probes for 200 nt transcripts, while for a 2000 bp mRNA, the theoretical limit is ~50 probes. Similarly, weakly expressed genes (regardless of their length) would require either more probes to reach the detection threshold or stronger amplification through choice of concatemer length and/or signal developing techniques. We have rephrased this sentence in the main text to reflect this.

      5) “Figure 2 showcases one of the most compelling data supporting the versatility of the platform. Can the signals in each panel be quantified and compared to 1. Published Ab staining? Is there a clear correlation in the intensity of the signals? 2. Between Vector Blue and NBT? 3. Chemical staining and FISH signals?”

      Since M. lignano is a relatively new model, there are no published antibody stainings for M. lignano genes used in this study. Furthermore, colorimetric precipitate methods are not quantitative but rather qualitative, because their signal strength is proportional to both the target RNA level and the development time; thus, signals from weakly expressed transcripts can be “boosted” simply by longer development. Therefore, a correct quantitative comparison with colorimetric methods, as requested by the reviewer, was not possible. However, with some corrections on fluorophore differences and animal-to-animal variability, it is possible to roughly compare peak saturation intensities for FISH methods if the experiments are designed for this aim. We performed these experiments, and a comparison of fluorescent signals from the same probes/genes but different OneSABER development methods is shown in Fig. 5.

      Minor comments:

      6) “The whole mount images and signals are often diffuse, can they be visualized using a DIC where the morphology of the organism is clearer?”

      We are unsure which images appear to be diffused to the reviewer. The other reviewers have not pointed out similar issues. Perhaps the question resolves once full-resolution uncompressed images are uploaded.

      7) “In order to support the claim that this is a universal approach for whole-mount staining, can the authors show an example of applicability to C. elegans?”

      This is now addressed. We included two additional results sections with two accompanying figures (Figs. 6 and 7) that demonstrate OneSABER’s application in whole-mount samples of a much larger than M. lignano model flatworm, the planarian Schmidtea mediterranea (Fig. 6), as well as in formalin-fixed paraffin-embedded (FFPE) small intestine tissue sections of a mouse model (Fig. 7).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors of this study feature a proof-of-concept implementation of OneSABER ISH platform, that combines single, and multiplex colorimetric and fluorescent approaches in whole-mount samples of M. lignano. This includes RNA ISH, multiplex TSA and HCR FISH. The approach is supposed to provide advantages that reduce sample loss and sample processing time and cost while being applicable to whole-mount samples of one organism, M. lignano, a powerful model that is used to study tissue regeneration. One of the more obvious advantages is the use of this tool as an alternative to antibody staining for specific proteins. However, despite claiming applicability of this approach to other whole-mount organisms, no evidence was shown to support that claim. In addition, the advantage of using this approach over other ISH protocols to study tissue regeneration in particular had not been shown.

      Major comments:

      • Despite including a whole figure (Figure 1) featuring the operation scheme of the OneSABER platform, the figure as well as the associated text fall short with respect to clearly stating the advantage of the different aspects of the platform. Consider a clearer and more thorough explanation of the different aspects of the platfrom.
      • Related to the first comment: A more detailed description of the similarities and/or differences of this platform relative to similar applications such as the study by Hall et al, 2024.
      • The authors describe the probes used as short, synthetic DNA probes targeting short RNA transcripts. Are these probes Oligopaints (Beliveau et al, 2015)? Why is that not more clearly stated in the text?
      • Line 105, p5: The authors state that the number of probes depends on the target RNA length and its expression strength. This data should be in the main text and described in detail since it is a major aspect of the platform design.
      • Figure 2 showcases one of the most compelling data supporting the versatility of the platform. Can the signals in each panel be quantified and compared to 1. Published Ab staining? Is there a clear correlation in the intensity of the signals? 2. Between Vector Blue and NBT? 3. Chemical staining and FISH signals?

      Minor comments:

      • The whole mount images and signals are often diffuse, can they be visualized using a DIC where the morphology of the organism is clearer?
      • In order to support the claim that this is a universal approach for whole-mount staining, can the authors show an example of applicability to C. elegans?

      Significance

      The work presented by the authors is promising in its versatility to single, and multiplex colorimetric and fluorescent approaches. In particular, multiplexing several targets showcases the strength of this approach. However, the versatility, applicability to other whole-mount studies and as a tool to study tissue regeneration in this model organism are not shown in the manuscript. Additional experiments will be necessary to support several of these claims.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their manuscript entitled "One probe fits all: a highly customizable modular RNA in situ hybridization platform expanding the application of SABER DNA probes" Ustyantsev et al. present combinations of the SABER (signal amplification by exchange reaction) method for RNA in situ hybridization (ISH) experiments with additional fluorescence amplification strategies such as alkaline phosphatase (AP) colorimetric-based, tyramide signal amplification-based (TSA) and hybridization chain reaction-based (HCR) ISH. All experiments are performed within whole-mount samples of M. lignano and single-plex data for a total of 7 genes and multiplexed data for up to three genes are shown. Based on an initial set of SABER probes, the OneSABER platform, standard SABER fluorescently-labeled readout oligos (imagers) can be easily replaced by oligos introducing the above mentioned alternative amplification strategies. Furthermore, the authors claim to have optimized existing sample protocols for in situ hybridization in M. lignano.

      Major comments:

      Overall, the study is carefully conducted and many of the author's claims are supported by data presented in their manuscript.

      Please find my comments below:

      • This work is building on standard SABER (a set of PER-extended primary probes that serve as landing pads for secondary fluorescently-labeled readout oligos) and pSABER (the readout oligo carries HRP instead of a dye for downstream TSA). The novelty of the work presented here is introducing additional variations of signal amplification, i.e. by using an hapten-labeled oligo to recruit a tertiary readout probe (antibodies conjugated with HRP or AP) or using SABER in combination with HCR. Since SABER can be seen as the underlying platform and pSABER was (arguably) also already introduced as a new platform by Attar et al. 2023, it seems difficult to introduce OneSABER as yet another new platform, of which standard SABER and pSABER are a part of. The reviewer encourages the authors to overthink the conceptual introduction, which in view of its certainly distinct novel features might allow a clearer distinction to previous work.
      • Although the authors take care in tributing prior work, some of the studies are only mentioned in the results section, one of such cases is pSABER by Attar et al. 2023. The close relation between pSABER and SABER TSA (HRP on readout oligo vs. hapten on readout oligo + HRP-conjugated antibody) needs to be better positioned in the introduction, clearly framing earlier work, inspirations drawn etc.. This is in line with my previous point.
      • Fig. 1 lists the individual modules of the OneSABER platform: i) standard SABER, ii) AP SABER, iii) SABER TSA, iv) pSABER (TSA FISH) (would recommend leaving it with original name when introducing it and include additional explanation in parentheses) and iv) SABER HCR. The main figures feature only AP SABER, SABER TSA and SABER HCR, for standard SABER and pSABER one must look up the SI. Since the authors describe the limited performance of standard SABER for one of their targets of interest (syt11) and since they have tested this target for all five conditions, it would be valuable to include a comparative view of all five platform modules in a single figure for syt11 or even also piwi, which also seems to have been tested for all five. Comparing the signal strength would be useful for the community, at least of each SABER variation compared to standard SABER.
      • The description of how the authors designed their probes is very detailed and they also provide a nice step-by-step protocol for their individual commands using Oligominer and BLAT software. This reviewer is wondering how the authors chose their PER sequences that they appended to their mined set of homologous in situ hybridization probes (p27,p28,p30). This is a general problem of multiplexed ISH approaches with single-stranded overhang, could the author's comment on potential self-interaction of the appended sequence with the homologous part, which might limit the PER efficiency, or elaborate on their choice?
      • Fig.1 and l. 125 describe straightforward in vitro switching of the concatemer sequence for an existing set of primary probes as a central feature of the OneSABER platform. However, the authors to my knowledge do not show such experiments themselves and only cite the original SABER paper by Kishi et al. 2019. This reviewer would be grateful to be pointed toward where in Kishi et al. 2019 this was demonstrated, however in view of this central part of the swopping scheme in the OneSABER platform an experiment showing this swopping is missing.
      • the description of Table S6 could use additional information in the legend such that the reader does not have to scroll down to Section S1 to retrieve the information (PER reaction, gel conditions, ladder is dsDNA, what are the individual bands)
      • The manuscript features an extensive set of resources in main body, supplementary materials and protocols. It is important and usually not merited sufficiently making the effort to compare orthogonal approaches for a given aim. This reviewer particularly appreciates the detailed strengths & weaknesses discussion in Table S6.

      Minor comments:

      • Definitions should be consistent, in Fig. 1 all approaches are defined with FISH added, but this definition is not followed consistently in the main text.

      Optional:

      • The authors describe several newly developed optimization steps during sample preparation for M. lignano ISH experiments compared to established ones. If the data exists, they include a supplementary figure showing improvements of optimized protocol steps

      Referees cross-commenting

      I agree with most points raised by the other reviewers, especially with the lacking demonstration and related questions regarding swapping also raised by reviewer 1 and the questioned claim of translatability of OneSABER to other whole mount systems.

      I do not question the value of this work in view of enabling new biological discovery, since it might accelerate/improve optimizations for RNA ISH experiments. In line with my comments, the manuscript would strongly benefit from a comparison to standard SABER demonstrating its insufficient signal for robust target detection.

      Significance

      Without a doubt this method-development focused study conducted by Ustyantsev et al. is a valuable resource featuring extensive sample optimization, protocols and guidelines for RNA in situ hybridization studies in M. lignano and as such deserves publication after the points raised were addressed. The manuscript is of high interest to the M. lignano community, to researchers conducting in situ hybridization experiments in larger/challenging-to-access samples and also to other methods developers.

      Field of expertise: DNA nanotechnology and DNA-based multiplexed fluorescence imaging in mammalian cell culture & tissues.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Authors developed a customizable PER reaction system that is able to switch between different imager probes, as well as imaging modalities (Hapten, HCR, etc). This work will be of interest to biologists looking to validate gene expression, as well as biotechnologist looking to advance imaging-based spatial transcriptomics. The paper is well written and easy to read. The protocol is also very clear and well written. However, it is unclear how the method can enable new biological discovery.

      Lack of demonstration of the applicability across sample types. Can the authors show some results in mammalian cells or tissues?

      Fig.1 seems to suggest that the protocol for in vitro swapping of 3' concatemers happens in two consecutive PCR steps. I recommend indicating in the figure that the switching can be conducted in a single in vitro reaction.

      Is it possible to multiplex the switching in one single reaction? For example, perform p27 to p28 and p29 to p30 simultaneously? This will be crucial for the split-probe methodology.

      Did the authors encounter any switching hairpins sequence that does not work? If not, can they postulate, what are the requirements for the design of switching sequences.

      Is there cross hybridization between the switched and original hairpins? For example, can the authors show that the signals from p27 and p30 do not overlaps?

      Can the authors quantify results from the direct, AP, TSA, and HCR? What do you mean by 'narrow anatomical structures like neural chords (syt11) or muscles (tnnt2) seem less visible'?

      Referees cross-commenting

      I agree with reviewer #2 regarding the lack of comparison to standard SABER.

      Significance

      Authors developed a customizable PER reaction system that is able to switch between different imager probes, as well as imaging modalities (Hapten, HCR, etc). This work will be of interest to biologists looking to validate gene expression, as well as biotechnologist looking to advance imaging-based spatial transcriptomics. The paper is well written and easy to read. The protocol is also very clear and well written. However, it is unclear how the method can enable new biological discovery.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors highlight the importance of the Golgi apparatus during SARS-CoV-2 infection. Specifically, using different compounds able to alter Golgi structure and function, the authors show a strong reduction in SARS-CoV-2 infection rate. In particular it is interesting to observe that treatments of 24 hrs with BFA strongly impair viral infection, highlithing the importance of Golgi function for this virus. Albeit the time of treatment is different. this observation is in contrast with previous studies on related coronaviruses (Ghosh et al., 2020) that did not observe any effect upon treatment with BFA. This might imply that SARS-CoV-2 relies more on conventional trafficking pathways respect to other coronaviruses which, under certain conditions, favour different trafficking routes.

      We thank the reviewer for the positive comments. Indeed, our results with BFA treatment for 24 hours are inconsistent with previous studies based on the prototype coronavirus MHV (Ghosh et al., 2020). To validate this observation, we have now performed new experiments with BFA treatment for 4, 6, and 8 hours, matching the time points used in the previous study (Ghosh et al, 2020). Our new results show that BFA treatment at these early time points significantly inhibits SARS-CoV-2 assembly and secretion, as measured by immunoblotting and TCID50 assays, without reducing intracellular viral RNA levels, which serve as a marker of genome replication. This implies that Golgi function and an intact ER-to-Golgi trafficking route are required for SARS-CoV-2 assembly and secretion. These new results are now presented as new Fig. 2C-H.

      The authors additionally observed that viral infection increases TGN46 levels while decreasing GRASP55 levels. To dissect the role of TGN46 and GRASPR55, the authors performed several infection studies in cells in which the levels of the two proteins were modulated either by overexpression (GRASP55) and/or siRNA-mediated knock-down (GRASP55 and TGN46). Those approaches suggest that GRASPR55 overexpression, a protein essential for Golgi stack formation, decelerates viral trafficking and inhibits viral assembly while its depletion reverses the effects. On the other hand, TGN46 knock-down impairs viral trafficking but not assembly. Overall the study clearly shows the importance of the Golgi during SARS-CoV-2 and also shows that modulation of those two factors affect viral infection.

      We appreciate the reviewer's accurate summary of our work and positive comments.

      However the claims that specifically the trafficking (TGN46) and trafficking and assembly (GRASP55) are not fully substantiated. Regarding GRASP55, the authors state that viral infection decreases GRASPR55 levels and this results in Golgi fragmentation. However GRASPR55 levels decrease is shown at 24 hrs post infection while Golgi fragmentation occurs as early as 5 hrs. Thus there might be no direct casual effect between the two effects.

      We agree with the reviewer that GRASP55 downregulation is unlikely to be the only reason for Golgi fragmentation in the infected cells. In our results, 5- or 8-hour post infection caused only mild Golgi fragmentation (Fig. S6D), while 24 hours post infection led to severe Golgi fragmentation. On the other hand, GRASP55 is likely to play a relevant role as SARS-CoV-2 induced Golgi fragmentation can be partially rescued by exogenous GRASP55 expression (Fig S6C). We have modified the text in lines 303-305 accordingly to acknowledge the possibility that other factors also contribute to Golgi fragmentation in infected cells.

      Additionally, the authors show that overexpression of GRASP55 rescue Golgi fragmentation, as observed by imaging, however is not clear if only infected cells where quantified and if they had the same level of infection.

      Yes, only infected cells with either GFP or GRASP55-GFP expression were quantified. The viral infection rate was significantly lower in GRASP55-GFP expressing cells compared to GFP expressing cells (Fig 5A-B).

      The authors exclude and effect on entry based on experiment on Spike expressing pseudovirus in 293-ACE2, however they also clearly observe reduction of ACE2 on the membrane of GRASPR55 expressing cells (Fig S6B). Thus how can they explain this discrepancy and how ca defect in entry can be fully marked out in these cell lines?

      We thank the reviewer for pointing this out. This discrepancy is likely due to the different systems used in the two experiments.

      In the pseudovirus entry assay, ACE2 was exogenously expressed in 293T cells and GRASP55 expression did not show any effect on the viral entry efficiency. In contrast, Huh7-ACE2 cells were selected for a high surface expression of ACE2. While GRASP55 expression reduces surface ACE2 levels as shown in our cell surface biotinylation assay, we believe that the surface ACE2 levels in GRASP55-expressing cells remain sufficient to support viral entry. To further investigate whether GRASP55 expression affects viral entry using authentic SARS-CoV-2, we performed RT-qPCR analysis of intracellular RNA level of the spike, N, and RdRp in both GFP and GRASP55-GFP expressing cells 4 hours post infection (new Fig 5D). Our results show that GRASP55 expression does not affect SARS-CoV-2 entry efficiency, even though it reduces ACE2 surface expression levels.

      It is not clear to which process the authors refer to when they write about "viral trafficking". Is it virion trafficking or viral proteins trafficking? The two process are linked but are not the same. This oversemplification can be misleading. For instance the authors show that overexpression of GRASP55 decreases Spike protein on the plasma membrane and its depletion increases S protein incorporation into psudoviruses. However it was shown that in infected cells S protein is mainly retained at the ERGIC by M and E (Boson et al., 2021) where viral assembly occurs. Thus an increase in S trafficking on the PM does not correlate with an increase in virion trafficking,

      We agree with the reviewer that our use of the term "viral trafficking" is imprecise and we have changed this throughout the manuscript to be more specific. S trafficking to the PM may not necessarily be equal to an increase in virion trafficking and thus have rephrased these terms in our writing accordingly.

      We acknowledge that our cell surface biotinylation assay results only demonstrate that GRASP55 overexpression slows down spike protein trafficking to the PM. We have accordingly also examined viral protein and infectious particle secretion into the culture medium as a more direct readout of virion trafficking (new Fig 2E, 2H, 6K, and 7P).

      Finally, we have removed all of the data describing spike incorporation into pseudoviruses as we acknowledge that plasma membrane assembly of lentiviruses is not a good model for SARS-CoV-2 assembly.

      ...and ultimately, the data provided do not fully support the authors claim on a modulation of "virion trafficking" in response to GRASP or TGN46 changes, since no experiments clearly show a change in virions secretion.

      In response to the above comment, we provide the following clarification: Our Western blotting, TCID50 assay, and plaque assay results collectively demonstrate that SARS-CoV-2 virion secretion is reduced in GRASP55 expressing cells (new Fig 5E-M) and in TGN46-depleted cells (new Fig 7F-H, 7L-N). Conversely, viral assembly and secretion appear to be increased in GRASP55-depleted cells (new Fig 6A, 6E-I) at 24 hpi. Furthermore, within a single viral secretion cycle (10 hpi), GRASP55 depletion increased viral secretion (new Fig 6K), while TGN46 depletion reduced viral secretion (new Fig 7P). These findings strongly support the conclusion that GRASP55 and TGN46 modulate viral secretion.

      Importantly, the authors do not rule out potential effects of their perturbations on genome replication. The only experiment that they perform in this direction is presented in Fig. S7B, where the authors show similar percentage of infected cells at early stage upon silecing of GRASPR55. The experiment suggests that productive entry is similar in these conditions, but quantification of intracellular viral genome could exclude a change in viral replication. If no changes in viral replication are observed, the authors could verify an increase in particles secretion by collecting supernatants from the early time points and performing plaque assays and quantification of viral genomes by qRT-PCR, to prove that modulation of GRASPR55 indeed promote SARS-CoV-2 trafficking.

      We thank the reviewer for the excellent suggestions. In response, we performed RT-qPCR analysis in GRASP55-expressing and TGN46-depleted cells at 4 hpi to compare the viral genome replication process. Additionally, we performed western blotting analysis and released viral titer assay of the culture media from both GRASP55-depleted and TGN46-depleted cells at 10 hpi to investigate virion release. Our new results show that GRASP55 depletion increases viral secretion (new Fig. 6K), while TGN46 depletion reduces viral secretion (new Fig. 7P). Furthermore, GRASP55 expression and TGN46 depletion do not perturb viral genome replication (new Fig. 5D and new Fig. 7R).

      Finally, whenever reduction of viral infection is observed upon cell partubation, a robust analysis of cell viability should be presented to exclude pleiotropic effects. Expecially in presence of multiple pertubation that might affect cell metabolism. The authors should carefully control cell viability and growth in response to depletion of TGN46 and GRASP55.

      We thank the reviewer for the excellent suggestions, which were also pointed out by reviewer #3. To address this, we performed the LDH cytotoxicity assay under SARS-CoV-2 infection conditions with TGN46 depletion and GRASP55 depletion/expression (new Fig. 5C, 6L, 7Q). Our new results show that no significant cell death was induced by TGN46 depletion, GRASP55 depletion/expression, or other perturbations.

      Minor: show data on viability of the drug and add the relative section in Material and Methods.

      We performed LDH assays of SARS-CoV-2 infected Huh7-ACE2 cells treated with 9 small molecules, and LDH release levels were similar across all treatments (new Fig. S3C). Additionally, a CellTiter Glo viability assay of 293T-ACE2 cells did not show any significant effect of cell viability with small molecule treatment (new Fig S3F). Detailed descriptions of these assays have been included in the Material and Methods section.

      Figure 3A: should read spike and not nucleocapsid eported for SARS-CoV-2

      Fig. 3A labeling is correct - cells were labeled with antibodies for GRASP65 (rabbit) and for nucleocapsid (mouse).

      Lack of inhibition with camostat correlates with lack of TMPRSS2 in the Huh7. The sentence seems to be too general while in this case the effect is clearly cell specific. Similarly, the importance of the lysosome in viral entry is restricted to cells lacking TMPRSS2 and cannot be generalized since CQ, for example, does not work in Calu-3 cells that express TMPRSS2 cells.

      We agree with the reviewer and have added one sentence: The relative smaller effect of camostat mesylate observed here, compared to previous studies (Hoffmann et al, 2021), might be due to the use of different cell lines across studies in lines 182-184. We also discussed the discrepancy of CQ treatment between our Huh7-ACE2 cells and Calu-3 cells (Hoffmann et al, 2020) in lines 466-473.

      Typo: Fig S3B - Y axis should reat viral not vrial

      Thank you - we have corrected this.

      S3C: concentrations of the compound used in the assay should be reported. Was a viability assay performed also in the 293T-ACE2 cell line?

      We thank the reviewer for the suggestion. We have added the concentration information to the legend in Fig. S3E "Cell entry assay of 293T or 293T-ACE2 cells by SARS-CoV-2 Spike pseudotyped lentivirus for 24h in the presence of indicated molecules at the same concentrations as in Fig. 2A." Additionally, we performed a CellTiter Glo assay to assess the viability of 293T-ACE2 cells treated with the 9 molecules. The results demonstrate that treatment with these 9 molecules does not alter cell viability (Fig. S3F).

      Significance

      Overall, the major strenght of the manuscript is that it has clarified the importance of the Golgi during SARS-CoV-2 infection. The drugs screening demonstrate that for SARS-CoV-2 the conventional secretion seems to have major role respect to other secretory routes observed for other coronaviruses. Also it is clear that the two factors identified by the authors have a role in viral infection, however the major limitation is that the authors failed to clearly highlight which step/s of the viral life cycle are modulated upon GRASP55 and TGN46 perturbatio. Expecially the claims on "trafficking" is not fully substantiated, since the only experiment in this direction is the transport of Spike protein on the plasma membrane upon GRASPR55 overexpression. It is risky to conclude that the trafficking of a single protein reflect the intracellular trafficking of the virions.

      Several of the finding presented in the first part of the manuscript have been already previously reported (for example the fragmentation of the Golgi upon SARS-CoV-2 infection), however the role of GRASP55 and TGN46 in SARS-CoV-2 infection has been reported here for the first time. This manuscript can be of interest for a broad audience considering the topic (cell biology, host-pathogen interactions and molecular virology)

      My expertise reside in the field of molecular virology, expecially in the contest of the mechanisms of viral replication and host-pathogen interactions.

      We thank the reviewer for the overall positive comments and excellent suggestions. We hope that our new results have convincingly demonstrated that viral trafficking is regulated by GRASP55 and TGN46.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In this study, Zhang and colleagues address the impact on SARS-CoV-2 infection on the morphology of the Golgi apparatus and convincingly demonstrate a fragmentation of this organelle in infected cells. Conversely, they show that the modulation of TGN46 or GRASP55 expressions, two components of this organelle impact SARS_CoV-2 replication. By monitoring the relative levels of viral Spike and nucleocapsid in the cell supernatants, they conclude that GRASP55 regulates particle assembly and trafficking while TGN46 controls only secretion. The study was generally well performed, and the quality of the microscopy and western blot data is good. It was appreciated that all the phenotypes were robustly quantified. I believe that this study is potentially interesting and relevant for the SARS-CoV-2 community since providing an extensive characterization of the interplay between SARS-CoV-2 and the Golgi apparatus.

      We thank the reviewer for the positive comments.

      However, as described below, I have some concerns regarding the interpretations of some of the key conclusions. Moreover, the fact that it was already described by several groups that Golgi is a key machinery used for SARS-CoV-2 virion assembly (ERGIC) and secretion dampens my enthusiasm about the study, especially without clear molecular mechanisms about the interplay between SARS-CoV-2 proteins and TNG46/GRASP55.

      We rephrased some sentences following the reviewer's suggestions. Although it was believed that SARS-CoV-2 is assembled at the ERGIC, there has been significant controversy surrounding the virion secretion pathway. Our results strongly support that SARS-CoV-2 virions traffic through the Golgi apparatus and that an intact ER-to-Golgi trafficking pathway is essential for SARS-CoV-2 assembly and secretion. Manipulation of two Golgi-resident proteins, GRASP55 and TGN46, significantly regulates SARS-CoV-2 secretion. Interestingly, GRASP55 regulates both assembly and secretion of SARS-CoV-2, while TGN46 exclusively modulates viral secretion. This is consistent with their subcellular localization, as GRASP55 is localized to the medial/trans Golgi, whereas TGN46 is localized to the TGN. We hope that our new experimental results (Figs. 2C-H, 5C-D, 6J-L, and 7O-R) have addressed all concerns from the reviewer. Identification of downstream protein targets involved in TGN46/GRASP55-mediated modulation of SARS-CoV-2 trafficking will be the focus of our future studies.

      Major comments: -All the assays have been performed in liver-derived Huh7 cells (overexpressing SARS-CoV-2 receptor) ACE2 (for infection) or kidney 293 cells (for pseudotyped HIV entry assays). However, no conclusion was validated in lung-derived cells (like A549-ACE2, Calu-3 or primary cells), which would be important since the respiratory tract is the main target of SARS-CoV-2

      In our study, Huh7-ACE2 cells are sorted for the high expression of endogenous ACE2 protein, and we did not overexpress ACE2 protein. Also, the liver has been reported to be a site of SARS-CoV-2 infection in humans (Barnes, 2022). We did use A549 and Calu-3 cells in pilot experiments; A549 cells displayed infection rates that were too low for our purposes, and Calu-3 cells showed both low infection rates and relatively disorganized Golgi in the absence of viral infection. We were able to add new IF results from Calu-3 cells. Consistent with our findings in Huh7-ACE2 cells, SARS-CoV-2 infection disrupts Golgi structure and alters protein levels of TGN46 and GRASP55 in Calu3 cells (new Fig. S5R-W). We also confirmed GRASP55 downregulation and TGN46 upregulation in VeroE6 cells (Fig S5K-N).

      -Fig2: The impact of the drugs on replication was assessed by measuring the % of infected cells. At 24 hpi, I am unsure about what this value is supposed to measure (the whole life cyle, intracellular replication or spread?), especially since it is not indicated when the drugs were added to the cells. Was it during, before or after the infection? This information should be provided.

      Fig. 2 refers to infection, not replication. We agree that infection encompasses multiple steps of the viral cycle. In our experiments, cells were treated with the drugs immediately before viral infection. We have added the information into the Fig. 2 legend.

      If the "Golgi" drugs impact egress only (as inferred by the genetic modulation phenotypes), I would expect that at this early time point, the % of infection would not drastically change (as well as intracellular RNA) but that the extracellular infectious titers would decrease. Plaque assays (or TCID50 assays) and RT-qPCR on intracellular viral RNA should be conducted to better understand the impact of drug treatments.

      This is a great suggestion! As the reviewer expected, our new BFA time-point assay shows that at early time points, the intracellular RNA levels for S, N and RdRp are not reduced. However, the extracellular N protein (measured by WB) and virial titer (measured by TCID50 assay), which serve as readouts for virion secretion, are significantly decreased (new Fig. 2C-H).

      On page 10, it is said that the virus makes three cycles of replication within 24 hours following infection. On what data is this based? This seems a lot. If this is true (and shown in Huh7-ACE2 cells), does the assay of figure 2 measure spread in general? More importantly, despite mentioned, the cell viability data are not provided. It is important to show them to ensure that these concentrations of drugs are not toxic at the tested concentrations.

      It has been reported that a single cycle of SARS-CoV-2 infection is approximately 8 hours (Eymieux et al, 2021). Therefore, Fig. 2 represents a multicycle infection, reflecting a composite measure of viral infection and spread. Under the microscope, we did not observe dramatic cell death at the tested concentration. To further assess cytotoxicity, we performed a cell toxicity assay for the 9 small molecules that inhibit viral infection of Huh7-ACE2 cells. The results show that no or minor cell death was observed with all these compounds (Fig. S3C).

      -I appreciated the extensive confocal microscopy analysis performed by the authors, which seems of high quality and overall, very convincing. They clearly show that SARS-CoV-2 infection induces the fragmentation of the Golgi apparatus although it was reported by others before as mentioned by the authors.

      We thank the reviewer for the positive comments. We agree that Golgi fragmentation was observed during SARS-CoV-2 infection, as we mentioned. However, our study provides a comprehensive and systematic analysis of the entire host cell endomembrane system in the response to viral infection.

      However, it was hard for me to make the functional link between these data and those related to GRASP55 and TGN46 overexpression/knockdown. First, the authors should assess the morphology of the Golgi apparatus in Huh7-ACE2 when GRASP55 is knocked down/out or when TGN46 is overexpressed. Second, in these 2 conditions that favor replication, it should be assessed whether this correlates with Golgi fragmentation. Even if this was probably shown before, it is relevant to show that these genetic modulations induce Golgi reshaping in this particular cell type by confocal microscopy (and ideally electron microscopy).

      Thank you for the suggestion. We performed IF analysis to assess Golgi morphology in Huh7-ACE2 cells under conditions of GRASP55 knockdown or TGN46 overexpression. Our results show that GRASP55 depletion disrupts Golgi structure (Fig. S7D), whereas TGN46 expression does not significantly alter the Golgi morphology (Fig. S8D).

      -The fact that GRASP55-GFP expression decreases in 293T the cell surface levels of ACE2, the receptor of Spike (Fig S6), raises concern that the effect of GRASP55 is not specific to the virus and suggests that the whole secretory pathway is altered, while an impairment of virus entry should be expected in this cell line. Is there a similar trend in Huh7-ACE2?

      Reviewer 1 raised a similar question regarding viral entry efficiency. Fig. S6B, performed in Huh7-ACE2 cells, shows that GRASP55-GFP expression also decreases ACE2 surface level in these cells. To further assess whether GRASP55 expression affects viral entry, we performed RT-qPCR analysis of viral RNA at early time points of infection. We found that authentic SARS-CoV-2 entry efficiency was not altered by GRASP55 expression (new Fig. 5D). Although GRASP55 overexpression does alter the secretory pathway, we want to point out that SARS-CoV-2 infection downregulates endogenous GRASP55 expression. We have used GRASP55 overexpression as a probe to assess the effects of GRASP55 on the secretory pathway and on SARS-CoV-2 virion trafficking, but this does not actually reflect what is observed in SARS-CoV-2 infection.

      In addition to addressing the functionality of the secretory machinery in Huh7-ACE2, it would be relevant to repeat the cell surface labelling in the context of pseudotyped virus production with other viral envelopes such as VSV G protein or HIV gp41/gp120. If the phenotype is specific to Spike trafficking, the cell surface abundance of these alternative viral proteins should not be impacted by GRASP55 overexpression. Otherwise, this would indicate a general effect of on the secretory pathway. Besides, since HIV Gag is directed directly to the plasma membrane during particle assembly without entering the secretory pathway, I am not convinced that upstream alteration on nucleocapsid assembly at the ERGIC should be excluded. Indeed, changes on the S/N ratios are generally mild and I feel that this cannot explain the phenotypes in the extracellular infectious titers.

      We have removed the original figure because we acknowledge that HIV Gag is directed directly to the plasma membrane, which is different from the trafficking of SARS-CoV-2 spike protein. We appreciate the reviewer's recognition of the difference in extracellular infectious titers between GFP and G55-GFP expressing cells. We hypothesize that GRASP55 expression not only reduces the number of spikes on each virion but also inhibits the secretion of SARS-CoV-2, resulting in a significantly lower extracellular infectious titer. We agree that it would be interesting to test whether GRASP55 expression affects viral production with other viral envelopes. However, this is beyond the scope of the current study and represents a promising direction for future research.

      More generally, the comparison between trafficking and assembly should be better assessed and not simply based on extracellular N and S levels. It was hard to see the differences between the two in terms of phenotypes. The authors should at least measure the intracellular infectivity upon TGN46 and GRASP55 knock/down and overexpression as well as intracellular vRNA abundance as a readout of RNA replication (which is anticipated to remain unchanged).

      We thank the reviewer for the valuable suggestions. We performed RT-qPCR analysis of Spike, N, and RdRp at early time points of infection. The new results show that neither GRASP55 expression (new Fig. 5D) nor TGN46 depletion (new Fig. 7R) affects viral RNA abundance at an early infection timepoint (4 hpi). Also, we found that GRASP55 depletion increased intracellular infectivity (new Fig. 6J) while TGN46 depletion did not affect intracellular infectivity (new Fig. 7O), suggesting that GRASP55 modulates viral assembly but TGN46 does not.

      -Finally, mechanistic insight about the viral determinants regulating the morphology of the Golgi would significantly strengthen the study.

      Fig S6 shows that S expression decreases ACE2 surface levels? If so, could some S mutants be tested? Does it correlate with Golgi fragmentation? Do other viral structural proteins contribute to Golgi morphological alterations?

      We thank the reviewer for the suggestions. These are indeed interesting experiments, but we believe that investigating viral determinants of Golgi fragmentation should be pursued by future studies.

      In the same line of idea, how GRASP55 and TGN46 regulate replication. The link with Golgi morphology is unclear. Are these proteins hijacked by SARS-COV-2?

      Our new data in this revised manuscript more clearly define the stages in the viral infection cycle that are modulated by GRASP55 and TGN46. New Fig. 5D and Fig. 7R show that neither GRASP55 nor TGN46 affects viral entry or early viral replication. However, GRASP55 perturbation modulates viral assembly and secretion, while TGN46 perturbation affects virion secretion but not assembly. Fig. S6C shows that GRASP55 overexpression in the presence of the virus partially rescues Golgi fragmentation. The mechanisms by which GRASP55 and TGN46 are hijacked by SARS-CoV-2 will be explored in the future studies.

      Page 13 mentions some relevant mutants that could be assessed in this context and provide mechanistic insights.

      It would be interesting to investigate the effects of GRASP55 mutants or specific domains on SARS-CoV-2 trafficking, which we plan to explore in future studies.

      Minor comments: -The signal of calreticulin in Fig. S1 is too low to appreciate it distribution.

      We have increased the intensity of calreticulin staining for both uninfected and infected cells in parallel in Fig. S1. Thank you.

      -Fig 4K, Q: The differences in LC3 forms levels are not convincing. These results do not allow to draw any conclusion about autophagy, especially considering that this was done at steady-state and that the autophagic flux was not measured. Indeed, a bafilomycin A treatment control would be required to measure the real induction of autophagosomes. Lysosomal degradation inhibition allows the detection of LC3 accumulation.

      We agree that additional experiments are needed to demonstrate autophagic flux alteration by SARS-CoV-2. We observed an increase in LC3II/LC3I ratio in infected cells at steady state and did not explore this further, since this is not our main focus of this study. Therefore, we have removed the LC3 blots and quantification from Figs. 4 and S5.

      -In the GRASP55 overexpression and TGN46 knockdown studies, associated cell viability should be measured to control that that these genetic manipulations do not induce any cytotoxicity which may impact viral replication.

      We appreciate the reviewer's suggestions. We performed the LDH cytotoxicity assay under SARS-CoV-2 infection with TGN46 depletion or GRASP55 expression. Our new results show that TGN46 depletion or GRASP55 depletion/expression did not induce significant cell death (Figs. 5C, 6L, and 7Q).

      -The authors should test the impact of GRASP55 and GRASP65 knock-out on SARS-CoV-2 replication

      Investigating the genetic GRASP55 knockout effect on SARS-CoV-2 replication would be valuable. However, ACE2 protein expression in our Huh7-ACE2 cells decreases with cell passages, making knockout construction on this background impractical due to low ACE2 levels and poor viral infection rates. We believe that both our GRASP55 overexpression and depletion assays sufficiently support its role in SARS-CoV-2 trafficking. Future studies will explore GRASP55 knockout in different cell lines.

      -The authors should provide more details about the USA-WA1/2020 isolate in the Methods section. Is it related to the "Wuhan" strain or the variant which spread globally in early 2020 (with D614G mutation in Spike).

      USA-WA1/2020 was isolated from an oropharyngeal swab from a patient who returned from China and developed COVID-19 on January 19, 2020, in Washington, USA. It is related to the "Wuhan" strain but does not have D614G mutation in spike. Additional details have been added to the Methods section.

      -Fig 8: The combined modulation of GRASP55 and TGN46 expressions does not really seem additive to me since a 70% decrease of either protein modulation is observed while the combined condition brings this value to 75% in TCID50 assays. This does not bring much insight to the study in my opinion. I would suggest that the authors consider removing this figure.

      We agree with the reviewer's recommendation and have removed Fig. 8.

      Reviewer #2 (Significance (Required)):

      General assessment and advance: The study was generally well performed, and the quality of the microscopy and western blot data is good. It was appreciated that all the phenotypes were quantified extensively. However, I have some concerns regarding the interpretations of some of the key conclusions. Moreover, the fact that it was already described by several groups that Golgi is a key machinery for SARS-CoV-2 virion assembly (ERGIC) and secretion dampens my enthusiasm about the study. In addition, the antiviral activity of several tested drugs was also reported elsewhere. A clear mechanism of how SARS-CoV-2 induces a fragmentation of the Golgi would strengthen the study. In the same line of idea, it is unclear how TGN46 and GRASP55 regulate the late steps of the life cycle. The link between SARS-CoV-2-induced Golgi fragmentation and TGN46/GRASP55 is unclear. In my opinion, the data did not allow to clearly discriminate between virion assembly and egress. I was not convinced that it was not simply due to a general disruption of the secretory pathway (as attested by ACE2 down regulation upon GRASP55 overexpression).

      Targeted audience: This study will be of high interest for molecular virologists (not only working on SARS-CoV-2) but could be very well fit into the scope of molecular/cell biology-focused generalist journals

      Reviewer expertise: Molecular virology, virus-host interactions (especially involving membranous organelles), SARS-CoV-2, RNA viruses

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Zhang et al. demonstrated in this study that the Golgi apparatus and many other organelles are disturbed by SARS-CoV-2 infection. They focused on the Golgi apparatus and especially on TGN46 and GRASP55 which are both affected differently in their level of expression by the SARS-CoV-2 infection. TGN46 is overexpressed while GRASP55 is decreased in expression. Through different methods overexpression or depletion, the authors nicely demonstrated that modulation of both proteins either increased or decreased particles production. They demonstrated that in absence of GRASP55, SARS-CoV-2 release is increased in the medium. On the contrary, depletion of TGN46 decreases the secretion of SARS-CoV-2 particles.

      We thank the reviewer for the accurate summary of our work.

      Major comments:

      Figure 1: The authors demonstrated that SARS-CoV-2 expression affected the morphology of multiple organelles. Although the results are clear, my concern was that the MOI=1 was really high which indeed would affect the whole cell. To have a less drastic effect on the cell, I would suggest realizing the visualization of some organelles (Golgi, EEA1, Rab7 for example) at a lower MOI=0.1. In addition, it would be nice to verify with a live-dead assay with the MOI=1 if after 24h the cells are still alive, which will confirm that these disturbances are not caused by cells in process of dying.

      We thank the reviewer for the excellent suggestions. Investigating how SARS-CoV-2 reshapes subcellular organelles at low MOI (e.g., 0.1) and at different time points would be interesting but is beyond the scope of our study. However, we have performed LDH assay at MOI=1, 2 and 3 for 24 hours to assess cell death. Our results show that LDH release was similar across these conditions (Fig. S5R). We also performed RT-qPCR analysis of Spike, N, and RdRp at early time points of infection. The new results show that neither GRASP55 expression (new Fig. 5D) nor TGN46 expression (Fig. 7R) affects viral RNA abundance at an early infection timepoint (4 hpi).

      Figure 2: The results indicated in that panel are really nice. However, the addition of a virus with drugs could increase the proportion of cell death. For the Figure 2C, I propose that the author use a LDH assay to prove that the decrease in infection is not caused by cell death. In addition, a RT-qPCR would be more appropriate to indicate the infection rate and support the microscopy data.

      We thank the reviewer for the positive feedback and suggestions. As recommended, we performed an LDH assay to assess cytotoxicity under 9 small molecules treatment of infected cells. Additionally, we performed RT-qPCR analysis for the BFA time-point treatment assay. No significant cell death was observed under these conditions (new Figs. 2D, and S3C).

      Figure 3: The authors should have been consistent and add spike instead of nucleocapsid for GalT. According to the figures, Spike seemed to co-localize more with GM130 than Golgin 245. Data analysis of colocalization between Spike and GM130 should be performed to complete the observation. Are no colocalizations of Spike observed with the other Golgi markers?

      We agree with the reviewer that it was ideal if spike and GalT were co-stained. Unfortunately, both our spike antibody and GalT antibody are from rabbit, so co-staining could not be done as GM130/spike. We performed colocalization analysis between Spike and GM130, and the results show that GRASP55 expression did enhance Spike and GM130 colocalization to some extent (new Fig. S6E-F). We only co-stained spike with GM130 and Golgin-245 due to the antibody availability.

      Figure 4K: While all the experiments were performed at MOI=1, why is the authors using MOI=2 for the immunoblots. Did they have a different result in protein expression for MOI=1 in HuH cells? if so they should show a blot indicating this result.

      We did not perform WB to assess protein expression at MOI=1, but our cell toxicity assay showed that there is no significant difference between MOI=2 and MOI=1.

      Figure 5: Viral infection should be indicated using RT-qPCR data analysis to support the microscopy observations.

      We performed RT-qPCR analysis (new Figs. 2F, 5D, and 7R) and found that BFA treatment did not reduce viral RNA levels at all three time points. Also, GRASP55 expression and TGN46 depletion did not inhibit viral genome RNA levels within one viral infection cycle. Additionally, our new TCID50 assay results support our microscope observation (new Fig. 7O-P). Thanks for the suggestion.

      Figure 6: The authors should look at the trafficking of ACE2 and TfR in case of GRASP55 depletion like they did in case of GRASP55 overexpression. It could demonstrate if the virus is using trafficking pathways that are common to the one used by some host receptors to reach the plasma membrane.

      Thanks for the excellent suggestion. We performed cell surface biotinylation assay of control and GRASP55-depleted cells. We found that ACE2 and TfR receptor displayed a similar reduction on the cell surface (Fig. S7C), consistent with previous findings that GRASP55 depletion induced Golgi fragmentation and accelerated global conventional protein secretion.

      Figure 7: Viral infection assay should also be performed by RT-qPCR. Figure 7H: The immunoblots conditions were performed at MOI=3 this time. The authors should indicate why they did not keep the same MOI conditions. In that case, they should use an intracellular marker for their medium experiment to prove that they isolated proteins that are secreted and not simply released from dead cells. I will also suggest to show LDH assay at MOI=2 and 3 to monitor cell death. Is the Golgi fragmented when GRASP 55 is overexpressed in presence of the virus? Microscopy observations should be performed to reply to this question as it will support their model. The authors suggest that GRASP55 overexpression decreases spike incorporation inside the virion. Can they observe if Spike still colocalizes with GM130 when GRASP55 is overexpressed?

      We showed that TGN46 depletion inhibits viral infection by both IF and WB. We further confirmed this through TCID50 assay for both cells and media (new Fig. 7O-P), strengthening our hypothesis.

      As we described above, we performed morphological analysis at MOI=1 so that we could observe a significant number of infected cells but minimize cell toxicity. We performed immunoblotting (in Fig. 7H) at MOI=3 to get a good viral infection rate.

      As suggested, we also performed LDH assay at MOI=2 and 3 to monitor cell death (new Fig. S2O). Fig. S6C shows that GRASP55 overexpression in the presence of the virus partially rescues Golgi fragmentation. GRASP55 expression did also enhance Spike and GM130 colocalization to some extent (new Fig. S6E-F).

      Minor comments:

      Figure 1P in the text: Considering that Rab7 up-regulation is equal to "growth of late endosome" is an overstatement. Rab7 is cytosolic at its inactive state and at the endosome at its active state. The authors would have to prove this statement by monitoring an increased quantity of Rab7 at the endosomes which is not enough by just monitoring protein intensity by microscopy. As Rab7 is also localized in lysosomes, and the authors used Lamp2 as a lysosomal marker, it is strange that the area of these structures is not increased. The authors should replace the term "growth" by "an increase in the area of their vesicles".

      We did observe less but larger LAMP2 puncta in the infected cells. We agree with the reviewer and rephrased "growth" by an increase in the area of their vesicles". Thank you for the excellent suggestions.

      Figure 1Q-T: The observations described in the text did not match the quantification, the area of lysosomes is not significantly different from the non-infected conditions.

      In Fig. 1Q-T, we did observe fewer but larger LAMP2 puncta in the infected cells, which was consistent with our quantification, i.e., fewer puncta (Fig. 1R), but each punctum was larger (Fig. 1S), and total area was similar.

      Figure 8: In the text, it is mentioned that there is "a dramatic reduction of spike and N in the lysate in GRASP55-expressing and TGN46 depleted cells". However, the quantification indicated that the decrease in N and S content is non-significant. Can the authors precise what was the sample of comparison in the text (siControl versus siTGN46 or siTGN46+GFP versus siTGN46+GFP-GRASP55)?

      The decrease in N and S content is significant with the lysate sample comparison (siControl versus siTGN46; siControl+GFP versus siTGN46+GFP; siTGN46+GFP versus siTGN46+GFP-GRASP55). We have now removed this Figure following Reviewer #2's suggestion, since the results are consistent with single protein manipulation and more experiments are needed to confirm whether there is an additive effect.

      **Referee cross-commenting**

      I agree with most of the concerns of the other reviewers. I do also consider that they should have done their study on cells expressing naturally ACE2. However, at this stage, it will be a lot of work to perform all of their study in a more relevant cell type. The authors should repeat some of their key experiments in lung-derived cell types, to determine if GRASP55 and TGN46 have the same effect on SARS-CoV-2 virion secretion/production.

      We thank the reviewer for the suggestions and understanding. As we mentioned before, our study utilizes Huh7-ACE2 cells, which are sorted for the high expression of endogenous ACE2 protein, without ACE2 overexpression. Actually, we also tested A549 and Calu-3 cells. While A549 cells displayed very low infection rate, Calu-3 cells displayed disorganized Golgi without viral infection. However, we did perform immunofluorescence assays in Calu-3 cells. Consistent with our findings in Huh7-ACE2 cells, SARS-CoV-2 infection disrupts Golgi structure and alters protein levels of TGN46 and GRASP55 in Calu3 cells (new Fig. S5R-W). Also, others have reported that liver can be a target for SARS-CoV-2 infection in humans. Furthermore, we confirmed GRASP55 downregulation and TGN46 upregulation in VeroE6 cells (Fig. S6K-N).

      Reviewer #3 (Significance (Required)):

      The study identified two Golgi proteins (TGN46 and GRASP55) that are involved in modulating the release of SARS-CoV-2 particles from the cells. As these proteins are also acting on general secretion of host proteins to the plasma membrane, the effect on SARS-CoV-2 release could just be indirect. However, it does not change the informative points of the study raised by Zhang et al. It highlights really well how the host trafficking pathway could be diverted for the purpose of the virus, which is to produce particles to maintain its survival.

      Strengths: The authors performed a precise and well quantified study. Observing how SARS-CoV-2 impacts host organelles morphology and uses host trafficking proteins to produce particles, brings more clarity on some unclear parts of the life cycle of the virus. In addition, it exposes new targets for therapeutic studies.

      We thank the reviewer for the positive comments.

      Weakness: The paper is mostly based on microscopy analysis and need some other methods to support their data. The paper lacks some molecular mechanisms explaining the clear role of GRASP55 and TGN46 in particle production or assembly.

      In the revised version, we incorporated RT-qPCR assay, cell cytotoxicity assay, and BFA time-point treatment assay. Notably, we added intracellular and extracellular viral titer assays to more precisely distinguish between effects on virion assembly and virion secretion. We also confirmed the key observation that SARS-CoV-2 infection modulates GRASP55 and TGN46 expression in the Calu-3 lung cell line. Additionally, our early time-point results clearly support the role of GRASP55 and TGN46 in viral trafficking.

      • Audience: The paper will be interesting for basic research for a virology and cell biology audience.
      • Field of expertise with a few keywords: Virology and host cell trafficking.

      References

      Barnes E (2022) Infection of liver hepatocytes with SARS-CoV-2. Nat Metab 4: 301-302

      Bekier ME, 2nd, Wang L, Li J, Huang H, Tang D, Zhang X, Wang Y (2017) Knockout of the Golgi stacking proteins GRASP55 and GRASP65 impairs Golgi structure and function. Mol Biol Cell 28: 2833-2842

      Eymieux S, Rouille Y, Terrier O, Seron K, Blanchard E, Rosa-Calatrava M, Dubuisson J, Belouzard S, Roingeard P (2021) Ultrastructural modifications induced by SARS-CoV-2 in Vero cells: a kinetic analysis of viral factory formation, viral particle morphogenesis and virion release. Cell Mol Life Sci 78: 3565-3576

      Ghosh S, Dellibovi-Ragheb TA, Kerviel A, Pak E, Qiu Q, Fisher M, Takvorian PM, Bleck C, Hsu VW, Fehr AR et al (2020) beta-Coronaviruses Use Lysosomes for Egress Instead of the Biosynthetic Secretory Pathway. Cell 183: 1520-1535 e1514

      Hoffmann M, Hofmann-Winkler H, Smith JC, Kruger N, Arora P, Sorensen LK, Sogaard OS, Hasselstrom JB, Winkler M, Hempel T et al (2021) Camostat mesylate inhibits SARS-CoV-2 activation by TMPRSS2-related proteases and its metabolite GBPA exerts antiviral activity. EBioMedicine 65: 103255

      Hoffmann M, Mosbauer K, Hofmann-Winkler H, Kaul A, Kleine-Weber H, Kruger N, Gassen NC, Muller MA, Drosten C, Pohlmann S (2020) Chloroquine does not inhibit infection of human lung cells with SARS-CoV-2. Nature 585: 588-590

      Xiang Y, Wang Y (2010) GRASP55 and GRASP65 play complementary and essential roles in Golgi cisternal stacking. J Cell Biol 188: 237-251

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Zhang et al. demonstrated in this study that the Golgi apparatus and many other organelles are disturbed by SARS-CoV-2 infection. They focused on the Golgi apparatus and especially on TGN46 and GRASP55 which are both affected differently in their level of expression by the SARS-CoV-2 infection. TGN46 is overexpressed while GRASP55 is decreased in expression. Through different methods overexpression or depletion, the authors nicely demonstrated that modulation of both proteins either increased or decreased particles production. They demonstrated that in absence of GRASP55, SARS-CoV-2 release is increased in the medium. On the contrary, depletion of TGN46 decreases the secretion of SARS-CoV-2 particles.

      Major comments:

      Figure 1: The authors demonstrated that SARS-CoV-2 expression affected the morphology of multiple organelles. Although the results are clear, my concern was that the MOI=1 was really high which indeed would affect the whole cell. To have a less drastic effect on the cell, I would suggest realizing the visualization of some organelles (Golgi, EEA1, Rab7 for example) at a lower MOI=0.1. In addition, it would be nice to verify with a live-dead assay with the MOI=1 if after 24h the cells are still alive, which will confirm that these disturbances are not caused by cells in process of dying.

      Figure 2: The results indicated in that panel are really nice. However, the addition of a virus with drugs could increase the proportion of cell death. For the Figure 2C, I propose that the author use a LDH assay to prove that the decrease in infection is not caused by cell death. In addition, a RT-qPCR would be more appropriate to indicate the infection rate and support the microscopy data.

      Figure 3: The authors should have been consistent and add spike instead of nucleocapsid for GalT. According to the figures, Spike seemed to co-localize more with GM130 than Golgin 245. Data analysis of colocalization between Spike and GM130 should be performed to complete the observation. Are no colocalizations of Spike observed with the other Golgi markers?

      Figure 4K: While all the experiments were performed at MOI=1, why is the authors using MOI=2 for the immunoblots. Did they have a different result in protein expression for MOI=1 in HuH cells? if so they should show a blot indicating this result.

      Figure 5: Viral infection should be indicated using RT-qPCR data analysis to support the microscopy observations.

      Figure 6: The authors should look at the trafficking of ACE2 and TfR in case of GRASP55 depletion like they did in case of GRASP55 overexpression. It could demonstrate if the virus is using trafficking pathways that are common to the one used by some host receptors to reach the plasma membrane.

      Figure 7: Viral infection assay should also be performed by RT-qPCR. Figure 7H: The immunoblots conditions were performed at MOI=3 this time. The authors should indicate why they did not keep the same MOI conditions. In that case, they should use an intracellular marker for their medium experiment to prove that they isolated proteins that are secreted and not simply released from dead cells. I will also suggest to show LDH assay at MOI=2 and 3 to monitor cell death. Is the Golgi fragmented when GRASP 55 is overexpressed in presence of the virus? Microscopy observations should be performed to reply to this question as it will support their model. The authors suggest that GRASP55 overexpression decreases spike incorporation inside the virion. Can they observe if Spike still colocalizes with GM130 when GRASP55 is overexpressed?

      Minor comments:

      Figure 1P in the text: Considering that Rab7 up-regulation is equal to "growth of late endosome" is an overstatement. Rab7 is cytosolic at its inactive state and at the endosome at its active state. The authors would have to prove this statement by monitoring an increased quantity of Rab7 at the endosomes which is not enough by just monitoring protein intensity by microscopy. As Rab7 is also localized in lysosomes, and the authors used Lamp2 as a lysosomal marker, it is strange that the area of these structures is not increased. The authors should replace the term "growth" by "an increase in the area of their vesicles".

      Figure 1Q-T: The observations described in the text did not match the quantification, the area of lysosomes is not significantly different from the non-infected conditions.

      Figure 8: In the text, it is mentioned that there is "a dramatic reduction of spike and N in the lysate in GRASP55-expressing and TGN46 depleted cells". However, the quantification indicated that the decrease in N and S content is non-significant. Can the authors precise what was the sample of comparison in the text (siControl versus siTGN46 or siTGN46+GFP versus siTGN46+GFP-GRASP55)?

      Referee cross-commenting

      I agree with most of the concerns of the other reviewers. I do also consider that they should have done their study on cells expressing naturally ACE2. However, at this stage, it will be a lot of work to perform all of their study in a more relevant cell type. The authors should repeat some of their key experiments in lung-derived cell types, to determine if GRASP55 and TGN46 have the same effect on SARS-CoV-2 virion secretion/production.

      Significance

      The study identified two Golgi proteins (TGN46 and GRASP55) that are involved in modulating the release of SARS-CoV-2 particles from the cells. As these proteins are also acting on general secretion of host proteins to the plasma membrane, the effect on SARS-CoV-2 release could just be indirect. However, it does not change the informative points of the study raised by Zhang et al. It highlights really well how the host trafficking pathway could be diverted for the purpose of the virus, which is to produce particles to maintain its survival.

      Strengths: The authors performed a precise and well quantified study. Observing how SARS-CoV-2 impacts host organelles morphology and uses host trafficking proteins to produce particles, brings more clarity on some unclear parts of the life cycle of the virus. In addition, it exposes new targets for therapeutic studies.

      Weakness: The paper is mostly based on microscopy analysis and need some other methods to support their data. The paper lacks some molecular mechanisms explaining the clear role of GRASP55 and TGN46 in particle production or assembly.

      Audience: The paper will be interesting for basic research for a virology and cell biology audience.

      Field of expertise with a few keywords: Virology and host cell trafficking.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Zhang and colleagues address the impact on SARS-CoV-2 infection on the morphology of the Golgi apparatus and convincingly demonstrate a fragmentation of this organelle in infected cells. Conversely, they show that the modulation of TGN46 or GRASP55 expressions, two components of this organelle impact SARS_CoV-2 replication. By monitoring the relative levels of viral Spike and nucleocapsid in the cell supernatants, they conclude that GRASP55 regulates particle assembly and trafficking while TGN46 controls only secretion. The study was generally well performed, and the quality of the microscopy and western blot data is good. It was appreciated that all the phenotypes were robustly quantified. I believe that this study is potentially interesting and relevant for the SARS-CoV-2 community since providing an extensive characterization of the interplay between SARS-CoV-2 and the Golgi apparatus. However, as described below, I have some concerns regarding the interpretations of some of the key conclusions. Moreover, the fact that it was already described by several groups that Golgi is a key machinery used for SARS-CoV-2 virion assembly (ERGIC) and secretion dampens my enthusiasm about the study, especially without clear molecular mechanisms about the interplay between SARS-CoV-2 proteins and TNG46/GRASP55.

      Major comments:

      • All the assays have been performed in liver-derived Huh7 cells (overexpressing SARS-CoV-2 receptor) ACE2 (for infection) or kidney 293 cells (for pseudotyped HIV entry assays). However, no conclusion was validated in lung-derived cells (like A549-ACE2, Calu-3 or primary cells), which would be important since the respiratory tract is the main target of SARS-CoV-2
      • Fig2: The impact of the drugs on replication was assessed by measuring the % of infected cells. At 24 hpi, I am unsure about what this value is supposed to measure (the whole life cyle, intracellular replication or spread?), especially since it is not indicated when the drugs were added to the cells. Was it during, before or after the infection? This information should be provided. If the "Golgi" drugs impact egress only (as inferred by the genetic modulation phenotypes), I would expect that at this early time point, the % of infection would not drastically change (as well as intracellular RNA) but that the extracellular infectious titers would decrease. Plaque assays (or TCID50 assays) and RT-qPCR on intracellular viral RNA should be conducted to better understand the impact of drug treatments. On page 10, it is said that the virus makes three cycles of replication within 24 hours following infection. On what data is this based? This seems a lot. If this is true (and shown in Huh7-ACE2 cells), does the assay of figure 2 measure spread in general? More importantly, despite mentioned, the cell viability data are not provided. It is important to show them to ensure that these concentrations of drugs are not toxic at the tested concentrations.
      • I appreciated the extensive confocal microscopy analysis performed by the authors, which seems of high quality and overall, very convincing. They clearly show that SARS-CoV-2 infection induces the fragmentation of the Golgi apparatus although it was reported by others before as mentioned by the authors. However, it was hard for me to make the functional link between these data and those related to GRASP55 and TGN46 overexpression/knockdown. First, the authors should assess the morphology of the Golgi apparatus in Huh7-ACE2 when GRASP55 is knocked down/out or when TGN46 is overexpressed. Second, in these 2 conditions that favor replication, it should be assessed whether this correlates with Golgi fragmentation. Even if this was probably shown before, it is relevant to show that these genetic modulations induce Golgi reshaping in this particular cell type by confocal microscopy (and ideally electron microscopy).
      • The fact that GRASP55-GFP expression decreases in 293T the cell surface levels of ACE2, the receptor of Spike (Fig S6), raises concern that the effect of GRASP55 is not specific to the virus and suggests that the whole secretory pathway is altered, while an impairment of virus entry should be expected in this cell line. Is there a similar trend in Huh7-ACE2? In addition to addressing the functionality of the secretory machinery in Huh7-ACE2, it would be relevant to repeat the cell surface labelling in the context of pseudotyped virus production with other viral envelopes such as VSV G protein or HIV gp41/gp120. If the phenotype is specific to Spike trafficking, the cell surface abundance of these alternative viral proteins should not be impacted by GRASP55 overexpression. Otherwise, this would indicate a general effect of on the secretory pathway. Besides, since HIV Gag is directed directly to the plasma membrane during particle assembly without entering the secretory pathway, I am not convinced that upstream alteration on nucleocapsid assembly at the ERGIC should be excluded. Indeed, changes on the S/N ratios are generally mild and I feel that this cannot explain the phenotypes in the extracellular infectious titers. More generally, the comparison between trafficking and assembly should be better assessed and not simply based on extracellular N and S levels. It was hard to see the differences between the two in terms of phenotypes. The authors should at least measure the intracellular infectivity upon TGN46 and GRASP55 knock/down and overexpression as well as intracellular vRNA abundance as a readout of RNA replication (which is anticipated to remain unchanged).
      • Finally, mechanistic insight about the viral determinants regulating the morphology of the Golgi would significantly strengthen the study. Fig S6 shows that S expression decreases ACE2 surface levels? If so, could some S mutants be tested? Does it correlate with Golgi fragmentation? Do other viral structural proteins contribute to Golgi morphological alterations? In the same line of idea, how GRASP55 and TGN46 regulate replication. The link with Golgi morphology is unclear. Are these proteins hijacked by SARS-COV-2? Page 13 mentions some relevant mutants that could be assessed in this context and provide mechanistic insights.

      Minor comments:

      • The signal of calreticulin in Fig. S1 is too low to appreciate it distribution.
      • Fig 4K, Q: The differences in LC3 forms levels are not convincing. These results do not allow to draw any conclusion about autophagy, especially considering that this was done at steady-state and that the autophagic flux was not measured. Indeed, a bafilomycin A treatment control would be required to measure the real induction of autophagosomes. Lysosomal degradation inhibition allows the detection of LC3 accumulation.
      • In the GRASP55 overexpression and TGN46 knockdown studies, associated cell viability should be measured to control that that these genetic manipulations do not induce any cytotoxicity which may impact viral replication.
      • The authors should test the impact of GRASP55 and GRASP65 knock-out on SARS-CoV-2 replication
      • The authors should provide more details about the USA-WA1/2020 isolate in the Methods section. Is it related to the "Wuhan" strain or the variant which spread globally in early 2020 (with D614G mutation in Spike).
      • Fig 8: The combined modulation of GRASP55 and TGN46 expressions does not really seem additive to me since a 70% decrease of either protein modulation is observed while the combined condition brings this value to 75% in TCID50 assays. This does not bring much insight to the study in my opinion. I would suggest that the authors consider removing this figure.

      Significance

      General assessment and advance: The study was generally well performed, and the quality of the microscopy and western blot data is good. It was appreciated that all the phenotypes were quantified extensively. However, I have some concerns regarding the interpretations of some of the key conclusions. Moreover, the fact that it was already described by several groups that Golgi is a key machinery for SARS-CoV-2 virion assembly (ERGIC) and secretion dampens my enthusiasm about the study. In addition, the antiviral activity of several tested drugs was also reported elsewhere. A clear mechanism of how SARS-CoV-2 induces a fragmentation of the Golgi would strengthen the study. In the same line of idea, it is unclear how TGN46 and GRASP55 regulate the late steps of the life cycle. The link between SARS-CoV-2-induced Golgi fragmentation and TGN46/GRASP55 is unclear. In my opinion, the data did not allow to clearly discriminate between virion assembly and egress. I was not convinced that it was not simply due to a general disruption of the secretory pathway (as attested by ACE2 down regulation upon GRASP55 overexpression).

      Targeted audience: This study will be of high interest for molecular virologists (not only working on SARS-CoV-2) but could be very well fit into the scope of molecular/cell biology-focused generalist journals

      Reviewer expertise: Molecular virology, virus-host interactions (especially involving membranous organelles), SARS-CoV-2, RNA viruses

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors highlight the importance of the Golgi apparatus during SARS-CoV-2 infection. Specifically, using different compounds able to alter Golgi structure and function, the authors show a strong reduction in SARS-CoV-2 infection rate. In particular it is interesting to observe that treatments of 24 hrs with BFA strongly impair viral infection, highlithing the importance of Golgi function for this virus. Albeit the time of treatment is different. this observation is in contrast with previous studies on related coronaviruses (Ghosh et al., 2020) that did not observe any effect upon treatment with BFA. This might imply that SARS-CoV-2 relies more on conventional trafficking pathways respect to other coronaviruses which, under certain conditions, favour different trafficking routes. The authors additionally observed that viral infection increases TGN46 levels while decreasing GRASP55 levels. To dissect the role of TGN46 and GRASPR55, the authors performed several infection studies in cells in which the levels of the two proteins were modulated either by overexpression (GRASP55) and/or siRNA-mediated knock-down (GRASP55 and TGN46). Those approaches suggest that GRASPR55 overexpression, a protein essential for Golgi stack formation, decelerates viral trafficking and inhibits viral assembly while its depletion reverses the effects. On the other hand, TGN46 knock-down impairs viral trafficking but not assembly.

      Overall the study clearly shows the importance of the Golgi during SARS-CoV-2 and also shows that modulation of those two factors affect viral infection. However the claims that specifically the trafficking (TGN46) and trafficking and assembly (GRASP55) are not fully substantiated.

      Regarding GRASP55, the authors state that viral infection decreases GRASPR55 levels and this results in Golgi fragmentation. However GRASPR55 levels decrease is shown at 24 hrs post infection while Golgi fragmentation occurs as early as 5 hrs. Thus there might be no direct casual effect between the two effects. Additionally, the authors show that overexpression of GRASP55 rescue Golgi fragmentation, as observed by imaging, however is not clear if only infected cells where quantified and if they had the same level of infection.

      The authors exclude and effect on entry based on experiment on Spike expressing pseudovirus in 293-ACE2, however they also clearly observe reduction of ACE2 on the membrane of GRASPR55 expressing cells (Fig S6B). Thus how can they explain this discrepancy and how ca defect in entry can be fully marked out in these cell lines? It is not clear to which process the authors refer to when they write about "viral trafficking". Is it virion trafficking or viral proteins trafficking? The two process are linked but are not the same. This oversemplification can be misleading. For instance the authors show that overexpression of GRASP55 decreases Spike protein on the plasma membrane and its depletion increases S protein incorporation into psudoviruses. However it was shown that in infected cells S protein is mainly retained at the ERGIC by M and E (Boson et al., 2021) where viral assembly occurs. Thus an increase in S trafficking on the PM does not correlate with an increase in virion trafficking, and ultimately, the data provided do not fully support the authors claim on a modulation of "virion trafficking" in response to GRASP or TGN46 changes, since no experiments clearly show a change in virions secretion. Importantly, the authors do not rule out potential effects of their perturbations on genome replication. The only experiment that they perform in this direction is presented in figure S7B, where the authors show similar percentage of infected cells at early stage upon silecing of GRASPR55. The experiment suggests that productive entry is similar in these conditions, but quantification of intracellular viral genome could exclude a change in viral replication. If no changes in viral replication are observed, the authors could verify an increase in particles secretion by collecting supernatants from the early time points and performing plaque assays and quantification of viral genomes by qRT-PCR, to prove that modulation of GRASPR55 indeed promote SARS-CoV-2 trafficking.

      Finally, whenever reduction of viral infection is observed upon cell partubation, a robust analysis of cell viability should be presented to exclude pleiotropic effects. Expecially in presence of multiple pertubation that might affect cell metabolism. The authors should carefully control cell viability and growth in response to depletion of TGN46 and GRASP55.

      Minor:

      show data on viability of the drug and add the relative section in Material and Methods

      Figure 3A: should read spike and not nucleocapsid eported for SARS-CoV-2 Lack of inhibition with camostat correlates with lack of TMPRSS2 in the Huh7. The sentence seems to be too general while in this case the effect is clearly cell specific. Similarly, the importance of the lysosome in viral entry is restricted to cells lacking TMPRSS2 and cannot be generalized since CQ, for example, does not work in Calu-3 cells that express TMPRSS2 cells. Typo: Fig S3B - Y axis should reat viral not vrial S3C: concentrations of the compound used in the assay should be reported. Was a viability assay performed also in the 293T-ACE2 cell line?

      Significance

      Overall, the major strenght of the manuscript is that it has clarified the importance of the Golgi during SARS-CoV-2 infection. The drugs screening demonstrate that for SARS-CoV-2 the conventional secretion seems to have major role respect to other secretory routes observed for other coronaviruses. Also it is clear that the two factors identified by the authors have a role in viral infection, however the major limitation is that the authors failed to clearly highlight which step/s of the viral life cycle are modulated upon GRASP55 and TGN46 perturbatio. Expecially the claims on "trafficking" is not fully substantiated, since the only experiment in this direction is the transport of Spike protein on the plasma membrane upon GRASPR55 overexpression. It is risky to conclude that the trafficking of a single protein reflect the intracellular trafficking of the virions.

      Several of the finding presented in the first part of the manuscript have been already previously reported (for example the fragmentation of the Golgi upon SARS-CoV-2 infection), however the role of GRASP55 and TGN46 in SARS-CoV-2 infection has been reported here for the first time. This manuscript can be of interest for a broad audience considering the topic (cell biology, host-pathogen interactions and molecular virology)

      My expertise reside in the field of molecular virology, expecially in the contest of the mechanisms of viral replication and host-pathogen interactions.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 Evidence, reproducibility and clarity Summary: Bhatt et al. seek to define factors that influence H3.3 incorporation in the embryo. They test various hypotheses, pinpointing the nuclear/cytoplasmic ratio and Chk1, which affects cell cycle state, as influencers. The authors use a variety of clever Drosophila genetic manipulations in this comprehensive study. The data are presented well and conclusions reasonably drawn and not overblown. I have only minor comments to improve readability and clarity. I suggest two OPTIONAL experiments below. We thank the reviewer for their positive and helpful comments. Major comments: We found this manuscript well written and experimentally thorough, and the data are meticulously presented. We have one modification that we feel is essential to reader understanding and one experimental concern: The authors provide the photobleaching details in the methodology, but given how integral this measurement is to the conclusions of the paper, we feel that this should be addressed in clear prose in the body of the text. The authors explain briefly how nuclear export is assayed, but not import (line 99). Would help tremendously to clarify the methods here. This is especially important as import is again measured in Fig 4. This should also be clarified (also in the main body and not solely in the methods). We have added the following sentences to the main body of the text to clarify how photobleaching and import were assayed. “We note that these differences are not due to photobleaching as our measurements on imaged and unimaged embryos indicate that photobleaching is negligible under our experimental conditions (see methods, Figure S1G-H)” lines 98-101 and “Since nuclear export is effectively zero, we attribute the increase in total H3.3 over time solely to import and therefore the slope of total H3.3 over time corresponds to the import rate.” lines 111-113 Revision Plan In addition we have clarified how import was calculated to figure legends in Figure 5D (formerly 4D) and S1F which now read: “Initial slopes of nuclear import curves (change in total nuclear intensity over time for the first 5 timepoints) …” We also added the following explanation of how nuclear import rates were calculated to the methods section: “Import rates were calculated by using a linear regression for the total nuclear intensity over time for the first 5 timepoints in the nuclear import curves.” lines 471-473, methods If the embryos appeared "reasonably healthy" (line 113) after slbp RNAi, how do the authors know that the RNAi was effective, especially in THESE embryos, given siblings had clear and drastic phenotype? This is especially critical given that the authors find no effect on H3.3 incorporation after slbp RNAi (and presumably H3 reduction), but this result would also be observed if the slbp RNAi was just not effective in these embryos. We apologize for the confusion caused by our word choice. The “healthy” slbp-RNAi embryos had measurable phenotypes consistent with histone depletion that we have reported previously (Chari et al, 2019) including cell cycle elongation and early cell cycle arrest (Figure S4D). However, they did not have the catastrophic mitosis observed in more severely affected embryos. We agree with the reviewer that a concern of this experiment is that the less severely affected embryos likely have more remaining RD histones including H3. To address this we also tested H3.3 incorporation in the embryos that fail to progress to later cell cycles in the cycles that we could measure. Even in these more severely affected embryos we were not able to detect a change in H3.3 incorporation relative to controls (lines 240-243 and Fig S4B). Unfortunately, it is impossible to conduct the ideal experiment, which would be a complete removal of H3 since this is incompatible with oogenesis and embryo survival. To address this confusion we have added supplemental videos of control, moderately affected and severely affected SLBP-RNAi embryos as movies 3-5 and modified the text to read: “All embryos that survive through at least NC12, had elongated cell cycles in NC12 and 60% arrested in NC13 as reported previously indicating the effectiveness of the knockdown (Figure S4C, Movie 3-5)39. In these embryos, H3.3 incorporation is largely unaffected by the reduction in RD H3 (Figure 6B).” lines 236-240 Finally, to characterize the range of SLBP knockdown in the RNAi embryos we propose to do single embryo RT-qPCRs for SLBP mRNA for multiple individual embryos. This will provide a measure of the range knockdown that we observed in our H3.3 movies. Minor comments: Introduction: Revision Plan Consider using "replication dependent" (RD) rather than "replication coupled." Both are used in the field, but RD parallels RI ("replication independent"). We thank the reviewer for this suggestion. We have made the text edits to change "replication coupled" (RC) to "replication dependent" (RD) throughout the manuscript. Would help for clarity if the authors noted that H3 is equivalent to H3.2 in Drosophila. Also it is relevant that there are two H3.3 loci as the authors knock mutations into the H3.3A locus, but leave the H3.3B locus intact. The authors should clarify that there are two H3.3 genes in the Drosophila genome. We have changed the text as follows to increase clarity as suggested: “Similarly, we have previously shown that RD H3.2 (hereafter referred to as H3) is replaced by RI H3.3 during these same cycles, though the cause remains unclear29” lines 52-54 “There are ~100 copies of H3 in the Drosophila genome, but only 2 of H3.3 (H3.3A and H3.3B)26. To determine which factor controls nuclear availability and chromatin incorporation, we genetically engineered flies to express Dendra2-tagged H3/H3.3 chimeras at the endogenous H3.3A locus, keeping the H3.3B locus intact.” lines 127-131 Please add information and citation (line 58): H3.3 is required to complete development when H3.2 copy number is reduced (PMID: 37279945, McPherson et al. 2023) We have added the suggested information. The text now reads “Nonetheless, H3.3 is required to complete development when H3.2 copy number is reduced54.” lines 61-62 Results: Embryo genotype is unclear (line 147): Hira[ssm] haploid embryos inherit the Hira mutation maternally? Are Hira homozygous mothers crossed to homozygous fathers to generate these embryos, or are mothers heterozygous? This detail should be in the main text for clarity. The Hira mutants are maternal effect. We crossed homozygous Hirassm females to their hemizygous Hirassm or FM7C brothers. However, the genotype of the male is irrelevant since the Hira phenotype prevents sperm pronuclear fusion and therefore there is no paternal contribution to the embryonic genotype. We have clarified this point in the text: “We generated embryos lacking functional maternal Hira using Hirassm-185b (hereafter Hirassm) homozygous mothers which have a point mutation in the Hira locus57.” lines 160-162 Revision Plan Line 161: Shkl affects nuclear density, but it also appears from Fig 3 to affect nuclear size? The authors do not address this, but it should at least be mentioned. We thank the reviewer for the astute observation. More dense regions of the Shkl embryos do in fact have smaller nuclei. We believe that this is a direct result of the increased N/C ratio since nuclear size also falls during normal development as the N/C ratio increases. We have added a new figure 1 in which we more carefully describe the events of early embryogenesis in flies including a quantification of nuclear size and number in the pre-ZGA cell cycles (Figure 1C). We also note the correlation of nuclear size with nuclear density in the text: “During the pre-ZGA cycles (NC10-13), the maximum volume that each nucleus attains decreases in response to the doubling number of nuclei with each division (Figure 1C).” lines 86-87 “To test this, we employed mutants in the gene Shackleton (shkl) whose embryos have non-uniform nuclear densities and therefore a gradient of nuclear sizes across the anterior/posterior axis (Figure 3A-B, Movie 1-2)58.” lines 180-183 The authors often describe nuclear H3/H3.3 as chromatin incorporated, but these image-based methods do not distinguish between chromatin-incorporated and nuclear protein. To distinguish between chromatin incorporated and nuclear free histone we have exploited the fact that histones that are not incorporated into DNA freely diffuse away from the chromatin mass during mitosis while those that are bound into nucleosomes remain on chromatin during this time. In our previous study we showed that H3-Dendra2 that is photoconverted during mitosis remains stably associated with the mitotic chromatin through multiple cell cycles (Shindo and Amodeo, 2019) strengthening our use of this metric. To help clarify this point as well as other methodological details we have added a new Figure 1B which documents the time points at which we make various measurements within the lifecycle of the nucleus. We also edited the text to read: “We have previously shown that with each NC, the pool of free H3 in the nucleus is depleted and its levels on chromatin during mitosis decrease (Figure 1D, S1C-D)29. In contrast, H3.3 mitotic chromatin levels increase during the same cycles (Figure 1D, S1C-D)29.” lines 89-92 I very much appreciate how the authors laid out their model in Fig 3 and then used the same figure to explain which part of the model they are testing in Figs 4 and 5. This is not a critique- we can complement too! Thank you! Revision Plan OPTIONAL experimental suggestion: The experiments in Figure 4 and 5 are clever. One would expect that H3 levels might exhaust faster in embryos lacking all H3.2 histone genes (Gunesdogan, 2010, PMID: 20814422), allowing a comparison testing the H3 availability > H3.3 incorporation portion of the hypothesis without manipulating the N/C ratio. This might also result in a more consistent system than slbp RNAi (below). We thank the reviewer for the experimental suggestion. We also considered this experimental manipulation to decrease RD histone H3.2. We chose not to do this experiment because in the Gunesdogan paper they show that the zygotic HisC nulls have normal development until after NC14 (unlike the maternal SLBP-RNAi that we used) suggesting that maternal H3.2 supplies do not become limiting until after the stages under consideration in our paper. Maternal HisC-nulls are, of course, impossible to generate since histones are essential. O'Haren 2024 (PMID: 39661467) did not find increased Pol II at the HLB after zelda RNAi (line 227). Might also want to mention here that zelda RNAi does not result in changes to H3 at the mRNA level (O'Haren 2024), as that would confound the model. We thank the reviewer for the suggestion. We have removed the discussion of Pol II localization and replaced it with the information about histone mRNA : “zelda controls the transcription of the majority of Pol II genes during ZGA but disruption of zelda does not change RD histone mRNA levels67–70”. lines 249-251 Discussion: Should discuss results in context of McPherson et al. 2023 (PMID: 37279945), who showed that decreasing H3.2 gene numbers does not increase H3.3 production at the mRNA or protein levels. We expanded our discussion to include the following: “Given the fact that H3.3 pool size does not respond to H3 copy number in other Drosophila tissues,54 our results suggest that H3.3 incorporation dynamics are likely independent of H3 availability.” lines 278-280 The Shackleton mutation is a clever way to alter N/C ratio, but the authors should point out that it is difficult (impossible?) to directly and cleanly manipulate the N/C ratio. For example, Shkl mutants seem to also have various nuclear sizes. As discussed above, we think that nuclear size is a direct response to the N/C ratio. We have added the following sentence to the discussion as well as a citation to a paper which discusses how the N/C ratio might contribute to nuclear import in early embryos to the discussion: “This may be due to N/C ratio-dependent changes in nuclear import dynamics which may also contribute to the observed changes in nuclear size across the shkl embryo75.” lines 307-309 Revision Plan How is H3.3 expression controlled? Is it possible that H3.3 biosynthesis is affected in Chk1 mutants? To address this question we propose to perform RT-qPCR for H3.3A and H3.3B as well as Hira in the Chk1 mutant. Unfortunately, we do not have antibodies that reliably distinguish between H3 and H3.3 in our hands (despite literature reports), but we will also perform a pan-H3 immunostaining in the Chk1 embryos to measure how the total H3-type histone pool changes as a result of the loss of Chk1. Figures: While I appreciate the statistical summaries in tables, it is still helpful to display standard significance on the figures themselves. We have added statistical comparisons in Figure 3 (formerly Figure 2). We do not feel that it is appropriate to directly compare the intensities of the H3-Dendra2 construct expressed from the pseudo-endogenous locus to the H3.3 and chimeric proteins expressed from the H3.3A locus as they were imaged using different settings. Although we plot H3 on the same graph as the other proteins to allow for ease of comparison of their trends over time it is not appropriate to directly compare their normalized intensities which including statistical tests would encourage. We have added a note to the legend of Figure 1 explaining this which reads: “Note that statistical comparisons between the two Dendra2 constructs have not been done as they were expressed from different loci and imaged under different experimental settings.” Fig 1: A: Is it possible to label panels with the nuclear cycle? We have done this. B: Statistics required - caption suggests statistics are in Table S2, but why not put on graph? Please see the explanation above for why we do not feel that it is appropriate to perform this comparison. C/D: Would be helpful if authors could plot H3/H3.3 on same graph because what we really need to compare is NC13 between H3/H3.3 (and statistics between these curves) Please see the explanation above for why we do not feel that it is appropriate to perform this comparison. These curves can be directly compared within a construct and we can evaluate their trends over time, but the normalized values should not be directly compared in the way that would be encouraged by plotting the data as suggested. E: The comparison in the text is between H3.3 and H3, but only H3.3 data is shown. I realize that it is published prior, but the comparison in figure would be helpful. We have added the previously published values to the text. Revision Plan “These changes in nuclear import and incorporation result in a less complete loss of the free nuclear H3.3 pool (~70% free in NC11 to ~30% in NC13) than previously seen for H3 (~55% free in NC11 to ~20% in NC13)” lines 116-119 Fig 2: A: A very helpful figure. Slightly unclear that the H3 that is not Dendra tagged is at the H3.3 locus. Also unclear that the H3.3A-Dendra2 line exists and used as control, as is not shown in figure. Should show H3 and H3.3 controls (Figure S2) We have edited the figure to add Dendra2 to all of the constructs and made clear the location of each construct including adding the landing site for H3-Dendra2. We have also cited Figure S1 in the legend which contains a more detailed diagram of the integration strategy. F/H- As the comparison is between H3 and ASVM, it would help to combine these data onto the same graph. As the color is currently used unnecessarily to represent nuclear cycle, the authors could use their purple/pink color coding to represent H3/ASVM. We have combined these data onto a single graph as requested and changed the colors appropriately. We have not added statistical comparisons to this graph as we again believe that they would be inappropriate. In the legend of Fig 2 the authors write "in the absence of Hira." Technically, there is only a point mutation in Hira. It is not absent. Good catch! We have changed this to “in Hirassm mutants”. Fig 3: G: Please show WT for comparison. Can use data in Fig 3A. We have added the color-coded number of neighbor embryo representations for WT and Shkl embryos underneath the example embryo images in 4A-B (formerly 3A-B,G). Model in H is very helpful (complement)! Thank you. Fig 4: B/C/F/G: The authors use a point size scale to represent the number of nuclei, but the graphs are so overlaid that it is not particularly useful. Is there a better way to display this dimension? We chose to represent the data in this way so that the visual impact of each line is representative of the amount of data (number of nuclei in each bin) that underlies it. This helps to prevent sparsely populated outlier bins at the edges of the distribution from dominating the interpretation of the data. If the reviewer has a suggestion for a better way to visualize this information we would welcome their suggestion, but we cannot think of a better way at this time. D/E/H/I: What does "min volume" mean on the X axis? Since the uneven N/C ratio in the shkl embryos results in a wavy cell cycle pattern there is no single time point where we can calculate the number of neighbors for the whole embryo (since Revision Plan not all nuclei are in the same cell cycle at a given point). Therefore, we had to choose a criterion for when we would calculate the number of neighbors for each nucleus. We chose nuclear size as a proxy for nuclear age since nuclear size increases throughout interphase (see new figure 1B). So, the minimum volume is the newly formed nucleus in a given cell cycle. We also tested other timepoints for the number of neighbors (maximum nuclear volume, just before nuclear envelope breakdown and midway between these two points) and found similar results. We chose to use minimum volume in this paper because this is the time point when the nucleus is growing most quickly and nuclear import is at its highest. We have added the following explanation to the methods: “For shkl embryos, as the nuclear cycles are asynchronous, nuclear divisions start at different timepoints within the same cell cycle and the nuclear density changes as the neighboring nuclei divide. Therefore, the total intensity traces were aligned to match their minimum volumes (as shown in Figure 1B) to T0.” lines 485-488, methods And the following detail to the figure legend: “...plotted by the number of nuclear neighbors at their minimum nuclear volume…” Figure 5 legend We also added a depiction of the lifecycle of the nucleus in which we marked the minimum volume as the new Figure 1B. Fig 5: F: OPTIONAL Experimental request: Here I would like to see H3 as a control. This is a very good suggestion, and we are currently imaging H3-Dendra2 in the Chk1 background. However, our preliminary results suggest that there may be some synthetic early lethality between the tagged H3-Dendra2 and Chk1 since these embryos are much less healthy than H3.3-Dendra2 Chk1 embryos or Chk1 with other reporters. In addition, we have observed a much higher level of background fluorescence in this cross than in the H3-Dendra2 control. We are uncertain if we will be able to obtain usable data from this experiment, but will continue to try to find conditions that allow us to analyze this data. As an orthogonal approach to answer the question, we will perform immunostaining with a pan-H3 antibody in Chk1 mutant embryos to measure total H3 levels under these conditions. Since the majority of H3-type histone is H3.2 and we know how H3.3 changes, this staining will give us insight into the dynamics of H3 in Chk1 mutant embryos. Significance General assessment: Many long-standing mysteries surround zygotic genome activation, and here the authors tackle one: what are the signals to remodel the zygotic chromatin around ZGA? This is a tricky question to answer, as basically all manipulations done to the embryo Revision Plan have widespread effects on gene expression in general, confounding any conclusions. The authors use clever novel techniques to address the question. Using photoconvertible H3 and H3.3, they can compare the nuclear dynamics of these proteins after embryo manipulation. Their model is thorough and they address most aspects of it. The hurdle this study struggles to overcome is the same that all ZGA studies have, which is that manipulation of the embryo causes cascading disasters (for example, one cannot manipulate the nuclear:cytoplasmic ratio without also altering cell cycle timing), so it's challenging to attribute molecular phenotypes to a single cause. This doesn't diminish the utility of the study. Advance: The conceptual advance of this study is that it implicates the nuclear:cytoplasmic ratio and Chk1 in H3.3 incorporation. The authors suggest these factors influence cell cycle closing, which then affects H3.3 incorporation, although directly testing the granularity of this model is beyond the scope of the study. The authors also provide technical advancement in their use of measuring histone dynamics and using changes in the dynamics upon treatment as a useful readout. I envision this strategy (and the dendra transgenes) to be broadly useful in the cell cycle and developmental fields. Audience: The basic research presented in this study will likely attract colleagues from the cell cycle and embryogenesis fields. It has broader implications beyond Drosophila and even zygotic genome activation. This reviewer's expertise: Chromatin, Drosophila, Gene Regulation Reviewer #2 (Evidence, reproducibility and clarity (Required)): This manuscript investigates the regulation of H3.3 incorporation during zygotic genome activation (ZGA) in Drosophila, proposing that the nuclear-to-cytoplasmic (N/C) ratio plays a central role in this process. While the study is conceptually interesting, several concerns arise regarding the lack of proper control experiments and the clarity of the writing. The manuscript is difficult to follow due to vague descriptions, insufficient distinctions between established knowledge and novel findings, and a lack of rigorous statistical analyses. These issues need to be addressed before the study can be considered for publication. We thank the reviewers for their careful reading of this manuscript. We have sought to clarify the concerns regarding clarity through numerous text edits detailed below. We did include ANOVA analysis for all of the relevant statistical comparisons in the supplemental table. However, to increase clarity we have also added some statistical comparisons in the main figures. We note that we do not feel that it is appropriate to directly compare the intensities of the H3-Dendra2 construct expressed from the pseudo-endogenous locus to the H3.3 and chimeric proteins expressed from the H3.3A locus as they were imaged using different settings. Although we plot H3 on the same graph as the other proteins to allow for ease of comparison of their trends over time it is not appropriate to directly compare their normalized intensities which including statistical tests would encourage. We have added a note to the legend of the new Figure 1 Revision Plan explaining this which reads: “Note that statistical comparisons between the two Dendra2 constructs have not been done as they were expressed from different loci and imaged under different experimental settings.” Major Concerns The manuscript would benefit from a clearer introduction that explicitly distinguishes between previously known mechanisms of histone regulation during ZGA and the novel contributions of this study. Currently, the introduction lacks sufficient background on early embryonic chromatin regulation, making it difficult for readers unfamiliar with the field to grasp the significance of the findings. The authors should also be more precise when discussing the timing of ZGA. While they state that ZGA occurs after 13 nuclear divisions, it is well established that a minor wave of ZGA begins at nuclear cycle 7-8, whereas the major wave occurs after cycle 13. Clarifying this distinction will improve the manuscript's accessibility to a broader audience. We have added a new figure 1 to make the timing and nuclear behaviors of the embryo during ZGA in Drosophila more clear. We have also added information about how the chromatin changes during Drosophila ZGA in the following sentence: “ In Drosophila, these changes include refinement of nucleosomal positioning, partitioning of euchromatin and heterochromatin and formation of topologically associated domains20–22,24.” lines 39-41 We have clarified the major and minor waves of ZGA in the introduction and results by adding the following sentences to the introduction and results respectively: “In most organisms ZGA happens in multiple waves but the chromatin undergoes extensive remodeling to facilitate bulk transcription during the major wave of ZGA (hereafter referred to as ZGA)18–20,22–25..” lines 36-39 “In Drosophila, ZGA occurs in 2 waves. The minor wave starts as early as the 7th cycle, while major ZGA occurs after 13 rapid syncytial nuclear cycles (NCs) and is accompanied by cell cycle slowing and cellularization (Figure 1A-B).” lines 83-85 We hope that these changes help to reduce confusion and make the paper more accessible. However, we are happy to add additional information if the reviewer can provide specific points which require further attention. One of the primary weaknesses of this study is the lack of adequate control experiments. In Figure 1, the authors suggest that the levels of H3 and H3.3 are influenced by the N/C ratio, but Revision Plan it is unclear whether transcription itself plays a role in these dynamics. To properly test this, RNA-seq or Western blot analyses should be performed at nuclear cycles 10 and 13-14 to compare the levels of newly transcribed H3 or H3.3 against maternally supplied histones. Without such data, the authors cannot rule out transcriptional regulation as a contributing factor. In the pre-ZGA cell cycles the vast majority of protein including histones is maternally loaded. Gunesdogan et al. (2010) showed that the zygotic RD histone cluster nulls survive past NC14 (well past ZGA) with no discernible defects indicating that maternal RD histone supplies are sufficient for normal development during the cell cycles under consideration. Therefore, new transcription of replication coupled histones is not needed for apparently normal development during this period. Moreover, we have done the western blot analysis using a Pan-H3 antibody as suggested by the reviewer in our previously published paper (Shindo and Amodeo, 2019 supplemental figure S3A-B) and found that total H3-type histone proteins only increase moderately during this period of development, nowhere near the rate of the nuclear doublings. We have added the following sentence to clarify this point. “These divisions are driven by maternally provided components and the total amount of H3 type histones do not keep up with the pace of new DNA produced29.” lines 88-89 We have also previously done RNA-seq on wild-type embryos (and those with altered maternal histone levels) (Chari et al 2019). In this RNA-seq (like most RNA-seq in flies) we used poly-A selection and therefore cannot detect the RD histone mRNAs (which have a stem-loop instead of a poly-A tail). We have plotted the mRNA concentrations for both H3.3 variants from that dataset below for the reviewers reference (we have not included this in the revised manuscript). The total H3.3 mRNA levels are nearly constant from egg laying (NC0- these are from unfertilized embryos) until after ZGA (NC14). These data combined with the westerns discussed above give us confidence that what we are observing is the partitioning of large pools of maternally provided histones with only a relatively small contribution of new histone synthesis. Revision Plan In Figure 2, the manuscript introduces chimeric embryos expressing modified histone variants, but their developmental viability is not addressed. It is essential to determine whether these embryos survive and whether they exhibit any phenotypic consequences such as altered hatching rates, defects in nuclear division, or developmental arrest. Tagging histones is often deleterious to organismal health. In Drosophila there are two H3.3 loci (H3.3A and H3.3B). In all of our chimera experiments we have left the H3.3B and one copy of the H3.3A locus unperturbed to provide a supply of untagged H3.3. This allows us to study H3.3 and chimera dynamics without compromising organism health. All of our chimeras are viable and fertile with no obvious morphological defects. We have added the following sentences to the text to clarify this point: “There are ~100 copies of H3 in the Drosophila genome, but only 2 of H3.3 (H3.3A and H3.3B)26. To determine which factor controls nuclear availability and chromatin incorporation, we genetically engineered flies to express Dendra2-tagged H3/H3.3 chimeras at the endogenous H3.3A locus, keeping the H3.3B locus intact….These chimeras were all viable and fertile. ” lines 127-131, 136 In addition we propose performing hatch rate assays for embryos from the chimeric embryos of S31A, SVM and ASVM to assess if there is any decrease in fecundity due to the presence of the chimeras. Moreover, given that H3.3 is associated with actively transcribed genes, an RNA-seq analysis of chimeric embryos should be included to assess transcriptional changes linked to H3.3 incorporation. This is an excellent suggestion and will definitely be a future project for the lab. However, to do this experiment correctly we will need to generate untagged chimeric lines that will (hopefully) allow for the full replacement of H3.3 with the chimeric histones instead of a single copy among 4. This is beyond the scope of this paper. Figures 3 and 4 raise additional concerns about whether histone cluster transcription is altered in shkl mutant embryos. The authors propose that the shkl mutation affects the N/C ratio, yet it remains unclear whether this leads to changes in the transcription of histone clusters. Furthermore, since HIRA is a key chaperone for H3.3, it would be important to assess whether its levels or function are compromised in shkl mutants. To address these gaps, RT-qPCR or RNA-seq should be performed to quantify histone cluster transcription, and Western blot analysis should be used to determine if HIRA protein levels are affected. The changes in the N/C ratio that are observed in the shkl mutant are within SINGLE embryo (differences in nuclear spacing). In these experiments we are comparing nuclei within a common cytoplasm that have different local nuclear densities (N/C ratios). Therefore, if Shkl Revision Plan were somehow affecting the transcription of histones or their chaperones we would expect all of the nuclei within the same mutant embryo to be equally affected since they are genetically identical and share a common cytoplasm. We do not directly compare the behavior of shkl embryos to wildtype except to demonstrate that there is no positional effect on the import of H3 and H3.3 across the length of the embryo in wildtype. To clarify our experimental system for these experiments we have added additional panels to Figure 4A and B that depict the number of neighbors for both control and Shkl embryos. Nonetheless, to address the reviewer’s concern that shkl may change the amount of H3 present in the embryo, we propose to conduct a western blot comparison of wildtype and shkl embryos using a pan-H3 antibody. There are no tools (antibodies or fluorescently tagged proteins) to assess HIRA protein levels in Drosophila. We therefore propose to perform RT-qPCR for HIRA in wildtype and shkl embryos. A similar issue arises in Figure 5, where the authors claim that H3.3 incorporation is dependent on cell cycle state but do not sufficiently test whether this is linked to changes in HIRA levels. Given the importance of HIRA in H3.3 deposition, its levels should be examined in Slbp, Zelda, and Chk1 RNAi embryos to verify whether changes in H3.3 incorporation correlate with HIRA function. Without this, it is difficult to conclude that the observed effects are strictly due to cell cycle regulation rather than histone chaperone dynamics. Since H3.3 incorporation is unaffected in the Slbp and Zelda-RNAi lines there is no reason to suspect a change in HIRA function. There are no available tools (antibodies or fluorescently tagged proteins) to directly measure HIRA protein in Drosophila. To test if changes in HIRA loading might contribute to the decreased H3.3 incorporation in the Chk1 mutant we propose to perform RT-qPCR for HIRA in wildtype and Chk1 embryos. Several figures require additional statistical analyses to support the claims made. In Figure 1B, statistical testing should be included to validate the reported differences. Figure 1C-D states that "H3.3 accumulation reduces more slowly than H3," yet there is no quantitative comparison to substantiate this claim. Similarly, Figure 1E presents the conclusion that "These changes in nuclear import and incorporation result in a less dramatic loss of the free nuclear H3.3 pool than previously seen for H3," despite the fact that H3 data are not included in this figure. The conclusions drawn from these data need to be supported with appropriate statistical comparisons and more precise descriptions of what is being measured. For Figure 1B (now 2B) we do not feel that it is appropriate to directly compare the intensities of the H3-Dendra2 construct expressed from the pseudo-endogenous locus to the H3.3 and chimeric proteins expressed from the H3.3A locus as they were imaged using different settings and therefore we do not feel that direct statistical tests are appropriate. Rather, we plot the two histones on the same graph normalized to their own NC10 values so that the trend in their decrease over time may be compared. The statistical tests for H3.3 compared to the chimeras which were originally in the supplemental table have been added to Figure 3 (formerly figure 2). Revision Plan It is important to note that in this directly comparable situation the ASVM mutant (whose trends closely mirror H3) is highly statistically distinct from H3.3. We have added a note to the legend of the new Figure 1 explaining this which reads: “Note that statistical comparisons between the two Dendra2 constructs have not been done as they were expressed from different loci and imaged under different experimental settings.” For Figure 1C-D (now 2C-D) we have removed this claim from the text. We were referring to the plateau in nuclear import for H3 that is less dramatic in H3.3, but this is more carefully discussed in the next paragraph and its addition at that point generated confusion. The text now reads: “To further assess how nuclear uptake dynamics changed during these cycles, we tracked total nuclear H3 and H3.3 in each cycle (Figure 2C-D). Since nuclear export is effectively zero, we attribute the increase in total H3.3 over time solely to import and therefore the slope of total H3.3 over time corresponds to the import rate. Though the change in initial import rates between NC10 and NC13 are similar between the two histones (Figure S1F), we observed a notable difference in their behavior in NC13. H3 nuclear accumulation plateaus ~5 minutes into NC13, whereas H3.3 nuclear accumulation merely slows (Figure 2C-D).” lines 109-116 For Figure 1E (now 2E), to address the difference between H3 and H3.3 free pools we have added the previously published values to the text and changed the phrasing from “less dramatic” to “less complete”. The sentence now reads: “These changes in nuclear import and incorporation result in a less complete loss of the free nuclear H3.3 pool (~70% free in NC11 to ~30% in NC13) than previously seen for H3 (~55% free in NC11 to ~20% in NC13)” lines 116-119 Figure 2 presents additional concerns regarding data interpretation. The comparisons between H3.3 and H3.3S31A to H3 and H3.3SVM/ASVM lack statistical analysis, making it difficult to determine the significance of the observed differences. As discussed above, it is not appropriate to directly compare H3 to H3.3 and the chimeras at the H3.3A locus since they are expressed from different promoters and imaged with different settings. The ANOVA comparisons between all of the constructs in the H3.3A locus can be found in the supplemental table. We have also added the statistical significance between each chimera and H3.3 within a cell cycle to the figure. Including the full set of comparisons for all genotypes and timepoints makes the figure nearly impossible to interpret, but they remain available in the supplemental table. Revision Plan The disappearance of H3.3 from mitotic chromosomes in Figure 2E is also not explained. If this phenomenon is functionally relevant, the authors should provide a mechanistic interpretation, or at the very least, discuss potential explanations in the text. In Figures 2F-H, the reasoning behind comparing the nuclear intensity of H3.3 to H3 in Hira mutants is unclear. To properly assess the role of HIRA in H3.3 chromatin accumulation, a more appropriate comparison would be between wild-type H3.3 and H3.3 levels in Hira knockdown embryos. As explained in the text and depicted in Figure 3D (formerly 2D), the HIRAssm mutant is a point mutation that prevents observable H3.3 chromatin incorporation, but not nuclear import. This is what is depicted in Figure 3E (formerly 2E). The loss of H3.3 from mitotic chromatin is due to the inability to incorporate H3.3 into chromatin as expected for a HIRA mutant. We have edited the figure 3 legend to make this more clear. It now reads: “Hirassm mutation nearly abolishes the observable H3.3 on mitotic chromatin (E).” In Figure 3F (formerly 2F-H) we ask what happens to H3 chromatin incorporation when there is almost no incorporation of H3.3 due to the HIRA mutation. In this mutant there is so little H3.3 incorporation that we cannot quantify H3.3 levels on mitotic chromatin (see the new Figure 1B for the stage where chromatin levels are quantified). This experiment was done to test if H3.3ASVM (expressed at the H3.3A locus) is incorporated into chromatin in embryos lacking the function of H3.3’s canonical chaperone. We have edited the text to make this more clear: “Since the chromatin incorporation of the H3/H3.3 chimeras appears to depend on their chaperone binding sites, we asked if impairing the canonical H3.3 chaperone, Hira, would affect the incorporation of H3.3ASVMexpressed from the H3.3A locus.”lines 158-160 A broader concern is that the authors only test HIRA as a histone chaperone but do not consider alternative chaperones that could influence H3.3 deposition. Since multiple chaperone systems regulate histone incorporation, it would strengthen the conclusions if additional chaperones were tested. Since HIRAssm reduced H3.3-Dendra2 incorporation to nearly undetectable levels (Figure 3E) we believe that it is the primary H3.3 incorporation pathway during this period of development. Therefore, we believe that removing HIRA function is a sufficient test of the dependance of H3.3ASVM on the major H3.3 chaperone at this time. Although it would be interesting to fully map how all H3 and H3.3 chimera constructs respond to all histone chaperone pathways, we believe that this is beyond the scope of this manuscript. Additionally, the manuscript does not include any validation of the RNAi knockdown efficiencies used throughout the study. This raises concerns about whether the observed phenotypes are truly due to target gene depletion or off-target effects. RT-qPCR or Western blot analyses should be performed to confirm knockdown efficiency. Revision Plan Both the Zelda and slbp-RNAi lines used for knockdowns have been used and validated in the early fly embryo in previously published works ((Yamada et al., 2019), (Duan et al., 2021), (O’Haren et al., 2025), (Chari et al, 2019)) and the phenotypes that we observe in our embryos are consistent with the published data including altered cell cycle durations (Figure S4C) and lack of cellularization/gastrulation. We note that the zelda RNAi phenotypes are also highly consistent with the effects of Zelda germline clones. To validate that slbp-RNAi knocks down histones we included a western blot for Pan-H3 in slbp-RNAi embryos that demonstrates a large effect on total H3 levels (Figure S4A). To further demonstrate the phenotypic effects of the slbp-RNAi we have added supplemental movies (Videos 4 and 5). To fully characterize the RNAi efficiency under our conditions we propose to perform RT-qPCR for slbp in slbp-RNAi and Zelda in Zelda-RNAi compared to control (w) RNAi embryos. Finally, the section discussing "H3.3 incorporation depends on cell cycle state, but not cell cycle duration" is unclear. The term "cell cycle state" is vague and should be explicitly defined. Does this refer to a specific phase of the cell cycle, changes in chromatin accessibility, or another regulatory mechanism? The term cell cycle state is deliberately vague. We know that Chk1 regulates many aspects of cell cycle progression and cannot determine from our data which aspect(s) of cell cycle regulation by Chk1 are important for H3.3 incorporation. Our data indicate that it is not simply interphase duration as we originally hypothesized. We have expanded our discussion section to underscore some aspects of Chk1 regulation that we speculate may be responsible for the change in H3.3 behavior. “Chk1 mutants decrease H3.3 incorporation even before the cell cycle is significantly slowed. Cell cycle slowing has been previously reported to regulate the incorporation of other histone variants in Drosophila15. However, our results indicate that cell cycle state and not duration per se, regulates H3.3 incorporation. In most cell types, the primary role of Chk1 is to stall the cell cycle to protect chromatin in response to DNA damage. Therefore, Chk1 activity directly or indirectly affects the chromatin state in a variety of ways. We speculate that Chk1’s role in regulating origin firing may be particularly important in this context73,74. Late replicating regions and heterochromatin first emerge during ZGA, and Chk1 mutants proceed into mitosis before the chromatin is fully replicated22,23,25,71. Since H3.3 is often associated with heterochromatin, the decreased H3.3 incorporation in Chk1 mutants may be an indirect result of increased origin firing and decreased heterochromatin formation73,74.” lines 287-298 Reviewer #2 (Significance (Required)): This manuscript investigates the regulation of H3.3 incorporation during zygotic genome Revision Plan activation (ZGA) in Drosophila, proposing that the nuclear-to-cytoplasmic (N/C) ratio plays a central role in this process. While the study is conceptually interesting, several concerns arise regarding the lack of proper control experiments and the clarity of the writing. The manuscript is difficult to follow due to vague descriptions, insufficient distinctions between established knowledge and novel findings, and a lack of rigorous statistical analyses. These issues need to be addressed before the study can be considered for publication. Reviewer #3 (Evidence, reproducibility and clarity (Required)): Summary: Based on previous findings of the changing ratios of histone H3 to its variant H3.3, the authors test how H3.3 incorporation into chromatin is regulated for ZGA. They demonstrate here that H3 nuclear availability drops and replacement by H3.3 relies on chaperone binding, though not on its typical chaperone Hira. Furthermore, they show that nuclear-cytoplasmic (N/C) ratios can influence this histone exchange likely by influencing cell cycle state. We thank the reviewer for their thoughtful comments. We note that our data ARE consistent with H3.3 incorporation depending on Hira through its chaperone binding site. Major comments: 1. The claims are largely supported by the data but I think a couple more experiments could help bolster the claims about cell cycle and chk1 regulation. a. Creating a phosphomimetic of the chk1 phosphorylation site on H3.3 to see if it can overcome the defects seen in chk1 mutants b. Assessing heterochromatin of embryos without chk1 (or ASVM mutants) for example, by looking at H3K9me3 levels The first experiments could take several months if the flies haven't already been generated by the authors but the second should be quicker. a. This is an excellent experimental suggestion which is bolstered by the fact that in frogs H3.3 S31A cannot rescue H3.3 morpholino during gastrulation, but H3.3S31D can (Sitbon et al, 2020). However, to correctly conduct this experiment would require generating and validating multiple additional endogenous H3.3 replacement lines, likely without a fluorescent tag as they can interfere with histone rescue constructs in most species. As the reviewer notes, this would take several months of work (we have not generated the critical flies yet) and may not yield a satisfying answer since there are reports that H3.3 may be dispensable in flies aside from as a source of H3-type histone outside of S-phase (Hödl and Bassler, 2012). While we hope to continue experiments along these lines in the future we feel that this is beyond the scope of the current manuscript. Revision Plan b. To address this we propose to stain for H3K9me3 in wildtype and Chk1-/- embryos. Since the ASVM line is not a full replacement of all H3.3 we think that staining for H3K9me3 in this line is unlikely to yield a detectable difference. 2. It would also be interesting to see what the health of the flies with some mutations in this paper are beyond the embryo stage if they are viable (e.g., development to adulthood, fertility etc.) a. the SVM, ASVM mutations b. the hira + ASVM mutations The authors might already have this data but if not they have the flies and it shouldn't take long to get these data. a. To address this concern we propose to conduct hatch rate assays for embryos from the Dendra tagged H3.3, S31A, SVM, ASVM flies. However, we do note that in our experiments only one copy of the H3.3A locus was mutated and tagged with Dendra2 leaving one copy of H3.3A and both copies of H3.3B untouched to ensure normal development as tagging all copies of histone genes can lead to lethality. b. All Hira mutants develop as haploids due to the inability to decondense the sperm chromatin (which is dependent on Hira). This leads to one extra division to restore the N/C ratio prior to cell cycle slowing and ZGA. These embryos go on to gastralate and die late in development after cuticle formation (presumably due to their decreased ploidy) (Loppin et al., 2000). The addition of ASVM into the Hira background does not appear to rescue the ploidy defect as these embryos also undergo the extra division (Figure 3H). We are therefore confident that these embryos will not hatch. We have added the information about the development of Hira mutant to the text as follow: “These embryos develop as haploids and undergo one additional syncytial division before ZGA (NC14). Hirassmembryos develop otherwise phenotypically normally through organogenesis and cuticle formation, but die before hatching57.” lines 164-167 3. In the discussion section, can the authors speculate on how they think H3.3 ASVM is getting incorporated if not through Hira. Are there other known H3 variant chaperones, or can the core histone chaperone substitute? We have expanded our discussion to include the the following: “In the case of the chimeric histone proteins the incorporation behavior was dependent on the chaperone binding site. For example, H3.3ASVM import and incorporation was similar to H3 in control embryos and H3.3ASVM was still incorporated in Hirassm mutants. This is consistent with the chaperone binding site determining the chromatin incorporation pathway and suggests that H3.3ASVM likely interacts with H3 chaperones such as Caf1.” lines 280-285 Revision Plan Minor comments: While the paper is well written, I found the figures very confusing and difficult to interpret. Comments here are meant to make it easier to interpret. 1. Fig 1 and most of the paper would benefit from a schematic of early embryo transitions labelled with time and stages of cell cycle to make interpreting data easier This is an excellent suggestion! We have added a new figure (Figure 1) to explain both the biological system and the way that we measured many properties in this paper. 2. Fig 1- same green color is used for nuclear cycle 12 and for H3.3 making it confusing when reading graphs. Please check other figures where there is a similar use of color for two different things We have changed the colors so that they are more distinct. 3. Fig 1C,D might benefit more from being split up into 3 graphs by cell cycle with H3 and H3.3 plotted on the same graphs rather than the way it is now We do not feel that it is appropriate to directly compare the intensities of the H3-Dendra2 construct expressed from the pseudo-endogenous locus to the H3.3 and chimeric proteins expressed from the H3.3A locus as they were imaged using different settings. These curves can be directly compared within a construct and we can evaluate their trends over time, but the normalized values should not be directly compared in the way that would be encouraged by plotting the data as suggested. 4. Line 130-133: can they also comment on the different between SVM and ASVM. It seems like SVM might be even worse than ASVM (Fig 2C). Is this related to chk1 phosphorylation? We think that this is a property of the mixed chimeras since S31A is also imported less efficiently than H3.3 (though we cannot be sure without further experiments). We have added this explanation to the text: “We speculate that chimeric histone proteins (H3.3S31A and H3.3SVM) are not as efficiently handled by the chaperone machinery as species that are normally found in the organism including H3.3ASVM which is protein-identical to H3.” lines 150-152 5. Fig 2F-G: It is very difficult to compare between histones when they are on different graphs, please consider putting H3, H3.3 and H3.3ASVM in a hirassm background on the same graph. We have done this in the new Figure 3F. Revision Plan 6. Fig 3- move G to become A and then have A and B. We have restructured this figure to include the nuclear density map of control in response to a comment from Reviewer 1. Although not exactly what the reviewer has envisioned, we hope that this adds clarity to the figure. 7. The initial slope graphs in 4D, E, H and I are not easy to understand and would benefit from an explanation in the legend. We have edited the legend of Figure 5D (formerly 4D) and S1F which now read: “Initial slopes of nuclear import curves (change in total nuclear intensity over time for the first 5 timepoints) …” In addition we have updated the methods to include: “Import rates were calculated by using a linear regression for the total nuclear intensity over time for the first 5 timepoints in the nuclear import curves.” lines 471-473, methods Reviewer #3 (Significance (Required)): This paper addresses an important and understudied question- how do histones and their variants mediate chromatin regulation in the early embryo before zygotic genome activation? The authors follow up on some previous findings and provide new insights using clever genetics and cell biology in Drosophila melanogaster. However, the authors do not directly look at chromatin structural changes using existing genomic tools. This may be beyond the scope of this work but would make for a nice addition to strengthen their claims if they can implement these chromatin accessibility techniques in the early embryo. Histones affect a majority of biological processes and understanding their role in the early embryo is key to understanding development. I believe this study applies to a broad audience interested in basic science. However, I do think the authors might benefit from a more broad discussion of their results to attract a broad readership.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Based on previous findings of the changing ratios of histone H3 to its variant H3.3, the authors test how H3.3 incorporation into chromatin is regulated for ZGA. They demonstrate here that H3 nuclear availability drops and replacement by H3.3 relies on chaperone binding, though not on its typical chaperone Hira. Furthermore, they show that nuclear-cytoplasmic (N/C) ratios can influence this histone exchange likely by influencing cell cycle state.

      Major comments:

      1. The claims are largely supported by the data but I think a couple more experiments could help bolster the claims about cell cycle and chk1 regulation.

      a. Creating a phosphomimetic of the chk1 phosphorylation site on H3.3 to see if it can overcome the defects seen in chk1 mutants

      b. Assessing heterochromatin of embryos without chk1 (or ASVM mutants) for example, by looking at H3K9me3 levels The first experiments could take several months if the flies haven't already been generated by the authors but the second should be quicker. 2. It would also be interesting to see what the health of the flies with some mutations in this paper are beyond the embryo stage if they are viable (e.g., development to adulthood, fertility etc.)

      a. the SVM, ASVM mutations

      b. the hira + ASVM mutations The authors might already have this data but if not they have the flies and it shouldn't take long to get these data. 3. In the discussion section, can the authors speculate on how they think H3.3 ASVM is getting incorporated if not through Hira. Are there other known H3 variant chaperones, or can the core histone chaperone substitute?

      Minor comments:

      While the paper is well written, I found the figures very confusing and difficult to interpret. Comments here are meant to make it easier to interpret.

      1. Fig 1 and most of the paper would benefit from a schematic of early embryo transitions labelled with time and stages of cell cycle to make interpreting data easier
      2. Fig 1- same green color is used for nuclear cycle 12 and for H3.3 making it confusing when reading graphs. Please check other figures where there is a similar use of color for two different things
      3. Fig 1C,D might benefit more from being split up into 3 graphs by cell cycle with H3 and H3.3 plotted on the same graphs rather than the way it is now
      4. Line 130-133: can they also comment on the different between SVM and ASVM. It seems like SVM might be even worse than ASVM (Fig 2C). Is this related to chk1 phosphorylation?
      5. Fig 2F-G: It is very difficult to compare between histones when they are on different graphs, please consider putting H3, H3.3 and H3.3ASVM in a hirassm background on the same graph.
      6. Fig 3- move G to become A and then have A and B.
      7. The initial slope graphs in 4D, E, H and I are not easy to understand and would benefit from an explanation in the legend.

      Significance

      This paper addresses an important and understudied question- how do histones and their variants mediate chromatin regulation in the early embryo before zygotic genome activation? The authors follow up on some previous findings and provide new insights using clever genetics and cell biology in Drosophila melanogaster. However, the authors do not directly look at chromatin structural changes using existing genomic tools. This may be beyond the scope of this work but would make for a nice addition to strengthen their claims if they can implement these chromatin accessibility techniques in the early embryo.

      Histones affect a majority of biological processes and understanding their role in the early embryo is key to understanding development. I believe this study applies to a broad audience interested in basic science. However, I do think the authors might benefit from a more broad discussion of their results to attract a broad readership.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript investigates the regulation of H3.3 incorporation during zygotic genome activation (ZGA) in Drosophila, proposing that the nuclear-to-cytoplasmic (N/C) ratio plays a central role in this process. While the study is conceptually interesting, several concerns arise regarding the lack of proper control experiments and the clarity of the writing. The manuscript is difficult to follow due to vague descriptions, insufficient distinctions between established knowledge and novel findings, and a lack of rigorous statistical analyses. These issues need to be addressed before the study can be considered for publication.

      Major Concerns

      The manuscript would benefit from a clearer introduction that explicitly distinguishes between previously known mechanisms of histone regulation during ZGA and the novel contributions of this study. Currently, the introduction lacks sufficient background on early embryonic chromatin regulation, making it difficult for readers unfamiliar with the field to grasp the significance of the findings. The authors should also be more precise when discussing the timing of ZGA. While they state that ZGA occurs after 13 nuclear divisions, it is well established that a minor wave of ZGA begins at nuclear cycle 7-8, whereas the major wave occurs after cycle 13. Clarifying this distinction will improve the manuscript's accessibility to a broader audience. One of the primary weaknesses of this study is the lack of adequate control experiments. In Figure 1, the authors suggest that the levels of H3 and H3.3 are influenced by the N/C ratio, but it is unclear whether transcription itself plays a role in these dynamics. To properly test this, RNA-seq or Western blot analyses should be performed at nuclear cycles 10 and 13-14 to compare the levels of newly transcribed H3 or H3.3 against maternally supplied histones. Without such data, the authors cannot rule out transcriptional regulation as a contributing factor. In Figure 2, the manuscript introduces chimeric embryos expressing modified histone variants, but their developmental viability is not addressed. It is essential to determine whether these embryos survive and whether they exhibit any phenotypic consequences such as altered hatching rates, defects in nuclear division, or developmental arrest. Moreover, given that H3.3 is associated with actively transcribed genes, an RNA-seq analysis of chimeric embryos should be included to assess transcriptional changes linked to H3.3 incorporation. Figures 3 and 4 raise additional concerns about whether histone cluster transcription is altered in shkl mutant embryos. The authors propose that the shkl mutation affects the N/C ratio, yet it remains unclear whether this leads to changes in the transcription of histone clusters. Furthermore, since HIRA is a key chaperone for H3.3, it would be important to assess whether its levels or function are compromised in shkl mutants. To address these gaps, RT-qPCR or RNA-seq should be performed to quantify histone cluster transcription, and Western blot analysis should be used to determine if HIRA protein levels are affected. A similar issue arises in Figure 5, where the authors claim that H3.3 incorporation is dependent on cell cycle state but do not sufficiently test whether this is linked to changes in HIRA levels. Given the importance of HIRA in H3.3 deposition, its levels should be examined in Slbp, Zelda, and Chk1 RNAi embryos to verify whether changes in H3.3 incorporation correlate with HIRA function. Without this, it is difficult to conclude that the observed effects are strictly due to cell cycle regulation rather than histone chaperone dynamics. Several figures require additional statistical analyses to support the claims made. In Figure 1B, statistical testing should be included to validate the reported differences. Figure 1C-D states that "H3.3 accumulation reduces more slowly than H3," yet there is no quantitative comparison to substantiate this claim. Similarly, Figure 1E presents the conclusion that "These changes in nuclear import and incorporation result in a less dramatic loss of the free nuclear H3.3 pool than previously seen for H3," despite the fact that H3 data are not included in this figure. The conclusions drawn from these data need to be supported with appropriate statistical comparisons and more precise descriptions of what is being measured.

      Figure 2 presents additional concerns regarding data interpretation. The comparisons between H3.3 and H3.3S31A to H3 and H3.3SVM/ASVM lack statistical analysis, making it difficult to determine the significance of the observed differences. The disappearance of H3.3 from mitotic chromosomes in Figure 2E is also not explained. If this phenomenon is functionally relevant, the authors should provide a mechanistic interpretation, or at the very least, discuss potential explanations in the text. In Figures 2F-H, the reasoning behind comparing the nuclear intensity of H3.3 to H3 in Hira mutants is unclear. To properly assess the role of HIRA in H3.3 chromatin accumulation, a more appropriate comparison would be between wild-type H3.3 and H3.3 levels in Hira knockdown embryos. A broader concern is that the authors only test HIRA as a histone chaperone but do not consider alternative chaperones that could influence H3.3 deposition. Since multiple chaperone systems regulate histone incorporation, it would strengthen the conclusions if additional chaperones were tested. Additionally, the manuscript does not include any validation of the RNAi knockdown efficiencies used throughout the study. This raises concerns about whether the observed phenotypes are truly due to target gene depletion or off-target effects. RT-qPCR or Western blot analyses should be performed to confirm knockdown efficiency. Finally, the section discussing "H3.3 incorporation depends on cell cycle state, but not cell cycle duration" is unclear. The term "cell cycle state" is vague and should be explicitly defined. Does this refer to a specific phase of the cell cycle, changes in chromatin accessibility, or another regulatory mechanism?

      Significance

      This manuscript investigates the regulation of H3.3 incorporation during zygotic genome activation (ZGA) in Drosophila, proposing that the nuclear-to-cytoplasmic (N/C) ratio plays a central role in this process. While the study is conceptually interesting, several concerns arise regarding the lack of proper control experiments and the clarity of the writing. The manuscript is difficult to follow due to vague descriptions, insufficient distinctions between established knowledge and novel findings, and a lack of rigorous statistical analyses. These issues need to be addressed before the study can be considered for publication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Bhatt et al. seek to define factors that influence H3.3 incorporation in the embryo. They test various hypotheses, pinpointing the nuclear/cytoplasmic ratio and Chk1, which affects cell cycle state, as influencers. The authors use a variety of clever Drosophila genetic manipulations in this comprehensive study. The data are presented well and conclusions reasonably drawn and not overblown. I have only minor comments to improve readability and clarity. I suggest two OPTIONAL experiments below.

      Major comments:

      We found this manuscript well written and experimentally thorough, and the data are meticulously presented. We have one modification that we feel is essential to reader understanding and one experimental concern: The authors provide the photobleaching details in the methodology, but given how integral this measurement is to the conclusions of the paper, we feel that this should be addressed in clear prose in the body of the text. The authors explain briefly how nuclear export is assayed, but not import (line 99). Would help tremendously to clarify the methods here. This is especially important as import is again measured in Fig 4. This should also be clarified (also in the main body and not solely in the methods).

      If the embryos appeared "reasonably healthy" (line 113) after slbp RNAi, how do the authors know that the RNAi was effective, especially in THESE embryos, given siblings had clear and drastic phenotype? This is especially critical given that the authors find no effect on H3.3 incorporation after slbp RNAi (and presumably H3 reduction), but this result would also be observed if the slbp RNAi was just not effective in these embryos.

      Minor comments:

      Introduction:

      Consider using "replication dependent" (RD) rather than "replication coupled." Both are used in the field, but RD parallels RI ("replication independent"). Would help for clarity if the authors noted that H3 is equivalent to H3.2 in Drosophila. Also it is relevant that there are two H3.3 loci as the authors knock mutations into the H3.3A locus, but leave the H3.3B locus intact. The authors should clarify that there are two H3.3 genes in the Drosophila genome. Please add information and citation (line 58): H3.3 is required to complete development when H3.2 copy number is reduced (PMID: 37279945, McPherson et al. 2023)

      Results:

      Embryo genotype is unclear (line 147): Hira[ssm] haploid embryos inherit the Hira mutation maternally? Are Hira homozygous mothers crossed to homozygous fathers to generate these embryos, or are mothers heterozygous? This detail should be in the main text for clarity. Line 161: Shkl affects nuclear density, but it also appears from Fig 3 to affect nuclear size? The authors do not address this, but it should at least be mentioned. The authors often describe nuclear H3/H3.3 as chromatin incorporated, but these image-based methods do not distinguish between chromatin-incorporated and nuclear protein. I very much appreciate how the authors laid out their model in Fig 3 and then used the same figure to explain which part of the model they are testing in Figs 4 and 5. This is not a critique- we can complement too! OPTIONAL experimental suggestion: The experiments in Figure 4 and 5 are clever. One would expect that H3 levels might exhaust faster in embryos lacking all H3.2 histone genes (Gunesdogan, 2010, PMID: 20814422), allowing a comparison testing the H3 availability > H3.3 incorporation portion of the hypothesis without manipulating the N/C ratio. This might also result in a more consistent system than slbp RNAi (below). O'Haren 2024 (PMID: 39661467) did not find increased Pol II at the HLB after zelda RNAi (line 227). Might also want to mention here that zelda RNAi does not result in changes to H3 at the mRNA level (O'Haren 2024), as that would confound the model.

      Discussion:

      Should discuss results in context of McPherson et al. 2023 (PMID: 37279945), who showed that decreasing H3.2 gene numbers does not increase H3.3 production at the mRNA or protein levels. The Shackleton mutation is a clever way to alter N/C ratio, but the authors should point out that it is difficult (impossible?) to directly and cleanly manipulate the N/C ratio. For example, Shkl mutants seem to also have various nuclear sizes. How is H3.3 expression controlled? Is it possible that H3.3 biosynthesis is affected in Chk1 mutants? Figures:

      While I appreciate the statistical summaries in tables, it is still helpful to display standard significance on the figures themselves.

      Fig 1:

      A: Is it possible to label panels with the nuclear cycle? B: Statistics required - caption suggests statistics are in Table S2, but why not put on graph? C/D: Would be helpful if authors could plot H3/H3.3 on same graph because what we really need to compare is NC13 between H3/H3.3 (and statistics between these curves) E: The comparison in the text is between H3.3 and H3, but only H3.3 data is shown. I realize that it is published prior, but the comparison in figure would be helpful.

      Fig 2:

      A: A very helpful figure. Slightly unclear that the H3 that is not Dendra tagged is at the H3.3 locus. Also unclear that the H3.3A-Dendra2 line exists and used as control, as is not shown in figure. Should show H3 and H3.3 controls (Figure S2) F/H- As the comparison is between H3 and ASVM, it would help to combine these data onto the same graph. As the color is currently used unnecessarily to represent nuclear cycle, the authors could use their purple/pink color coding to represent H3/ASVM. In the legend of Fig 2 the authors write "in the absence of Hira." Technically, there is only a point mutation in Hira. It is not absent.

      Fig 3:

      G: Please show WT for comparison. Can use data in Fig 3A. Model in H is very helpful (complement)!

      Fig 4:

      B/C/F/G: The authors use a point size scale to represent the number of nuclei, but the graphs are so overlaid that it is not particularly useful. Is there a better way to display this dimension? D/E/H/I: What does "min volume" mean on the X axis?

      Fig 5:

      F: OPTIONAL Experimental request: Here I would like to see H3 as a control.

      Significance

      General assessment: Many long-standing mysteries surround zygotic genome activation, and here the authors tackle one: what are the signals to remodel the zygotic chromatin around ZGA? This is a tricky question to answer, as basically all manipulations done to the embryo have widespread effects on gene expression in general, confounding any conclusions. The authors use clever novel techniques to address the question. Using photoconvertible H3 and H3.3, they can compare the nuclear dynamics of these proteins after embryo manipulation. Their model is thorough and they address most aspects of it. The hurdle this study struggles to overcome is the same that all ZGA studies have, which is that manipulation of the embryo causes cascading disasters (for example, one cannot manipulate the nuclear:cytoplasmic ratio without also altering cell cycle timing), so it's challenging to attribute molecular phenotypes to a single cause. This doesn't diminish the utility of the study.

      Advance: The conceptual advance of this study is that it implicates the nuclear:cytoplasmic ratio and Chk1 in H3.3 incorporation. The authors suggest these factors influence cell cycle closing, which then affects H3.3 incorporation, although directly testing the granularity of this model is beyond the scope of the study. The authors also provide technical advancement in their use of measuring histone dynamics and using changes in the dynamics upon treatment as a useful readout. I envision this strategy (and the dendra transgenes) to be broadly useful in the cell cycle and developmental fields.

      Audience: The basic research presented in this study will likely attract colleagues from the cell cycle and embryogenesis fields. It has broader implications beyond Drosophila and even zygotic genome activation.

      This reviewer's expertise: Chromatin, Drosophila, Gene Regulation

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer 1

      Major issue #1. Regarding the conclusions on IRE1 signaling, both yeast species have different IRE1 activities (https://elifesciences.org/articles/00048), the total deletion of IRE1 in S pombe appears to indicate that expansion of perinuclear ER is independent of IRE1, however since IRE1 signaling has exclusively a negative impact on mRNA expression, it might be relevant to identify mRNA whose expression is stabilized under those circumstances and evaluate whether those could confer a mechanism which would also yield perinuclear ER expansion (eg differential deregulation of ER stress controlled lipid biosynthesis required for lipid membrane synthesis). In S. cerevisiae, do the authors observe HAC1 mRNA splicing?

      We have not tested whether HAC1 mRNA is processed in S. cerevisiae.

      In addition, as requested by the reviewers, we reassessed our RNA-seq data and compared it with data from (Kimmig et al., 2012) (UPR activation in S. pombe), which added a new layer of data that reinforces the differences between the transcriptomic responses induced by HU and DIA and the canonical UPR. The following information is now included in the paper (page 26, highlighted in blue):

      “We further compared our transcriptomic data with that obtained by Kimmig et al. from DTT- treated S. pombe cells. When we compared the genes that were downregulated in our conditions with the ones described by Kimmig et al. (FC≤-1), we found no similarities between HU treatment (75 mM HU for 150 minutes) and UPR-induced downregulation, and only three genes ( ist2, efn1 and xpa1) all of them encode for transmembrane proteins, were common with DIA treatment (3 mM DIA for 60 minutes). Additionally, ist2 and xpa1, but not efn1, are considered Ire1-dependent downregulated genes and are located in the ER. These results show that HU- or DIA- induced transcriptomic programs are different from UPR, as they do not heavily rely on mRNA decay and favor gene overexpression. Interestingly, we found similarities between genes showed to be upregulated more that twofold by DTT in Kimmig et al., and HU and DIA conditions. When the two N-Cap-inducing conditions were compared with DTT, we found eight common upregulated genes (frp1, plr1, SPCC663.08c, srx1, gst2, str3, caf5 and hsp16) mostly involved in reduction processes and the chaperone Hsp16 which suggests folding stress”.

      Major issue #2. The authors indicate that HU and DIA lead to thiol stress, it might be relevant to evaluate the thiol-redox status of major secretory proteins in S. pombe (or even cargo reporters if necessary) to fully document the stress impact on global protein redox status.

      We agree with the reviewer that it is important to determine the redox and the functional state of the secretory pathway in our conditions to fully understand the cellular consequences of these treatments, especially in the case of HU, as it is routinely used in clinics. In this context, we have already included new data showing that HU or DIA treatment leads to alterations in the Golgi apparatus and in the distribution of secretory proteins (Figures 3A-B). In addition, we are currently performing mass spectrometry experiment to detect protein glutathionylation in our conditions, as it has been previously shown that DIA treatment leads to glutathionylation of key ER proteins such as Bip1, Pdi or Ero1 (Lind et al., 2002; Wang & Sevier, 2016), which might by reproduced upon HU treatment. Finally, we plan to test the folding and processing of specific secretory cargoes by western blot in our experimental conditions (See below, Reviewer 2, Major issue #1).

      What happens if HU-treated yeast cells are grown in the presence of n-acetyl cysteine?

      We have tested whether the addition of this antioxidant could prevent and/or revert the N-Cap phenotype. We found that NAC in combination with HU increased N-Cap incidence (Figure 5H). As NAC is a GSH precursor and we find that GSH is required to develop the phenotype of N-Cap (Figure 5A-B, D, G), this result further supports that the HU-induced cellular damage might involve ectopic glutathionylation of proteins.

      Unfortunately, we have not tested NAC in combination with DIA, as NAC seems to reduce DIA as soon as they get in contact, as judged by the change in the characteristic orange color of DIA, the same that happens when we combine GSH and DIA (Supplementary Figure 5A-B).

      In this regard, the following information has been added to the manuscript (page 30, highlighted in blue):

      “We also tested GSH addition to the medium in combination with either HU or DIA. When mixed with DIA, we noticed that the color of the culture changed after GSH addition (Figure S5A), which suggests that GSH and DIA can interact extracellularly, thus preventing us from being able to draw conclusions from those experiments. On the other hand, combining GSH with HU increased N-Cap incidence (Figure 5G), as expected based on our previous observations. Additionally, we checked whether the addition of the antioxidant N-acetyl cysteine (NAC), a GSH precursor, impacted upon the N-Cap phenotype. The results were the same as with GSH addition: when combined with HU, NAC increased N-Cap incidence (Figure 5H), whereas in combination, the two compounds interacted extracellularly (Figure S5B). These data align with NAC being a precursor of GSH, as incrementing GSH levels augments the penetrance of the HU-induced phenotype”.

      Major issue #3. The appearance of cytosolic aggregates is intriguing, do the authors have any idea on the nature of the protein aggregates?

      DIA is a strong oxidant, and HU treatment results in the production of reactive oxygen species (ROS). Therefore, one hypothesis would be that cytoplasmic chaperone foci represent oxidized and/or misfolded soluble proteins. Indeed, in this revised version of the manuscript we have included data showing that guk1-9-GFP and Rho1.C17R-GFP soluble reporters of misfolding accumulate in cytoplasmic foci upon HU or DIA treatment that colocalize with Hsp104 (Figure 4I-J, pages 23-24 and 29), which demonstrate that cytoplasmic chaperone foci contain misfolded proteins. We have also tested if they contain Vgl1, which is one of the main components of heat shock induced stress granules in S. pombe (Wen et al., 2010). However, we found that HU or DIA-induced foci lacked this stress granule marker, and indeed Vgl1 did not form any foci in response to these treatments. Therefore, our aggregates differ from the canonical stress-induced granules.

      Are those resulting from proficient retrotranslocation or reflux of misfolded proteins from the ER?

      To test whether these cytosolic aggregates result from retrotranslocation from the ER, we plan to use the vacuolar Carboxipeptidase Y mutant reporter CPY*, which is misfolded. This misfolded protein is imported into the ER lumen but does not reach the vacuole. Instead, it is retrotranslocated to the cytoplasm, where it is ubiquitinated and degraded by the proteasome (Mukaiyama et al., 2012). We will analyze by fluorescence microscopy the localization of CPY*´-GFP and Hsp104-containing aggregates upon HU or DIA treatment and with or without proteasome inhibitors. We can also test the levels, processing and ubiquitination of CPY*-GFP by western blot, as ubiquitination of retrotranslocated proteins occurs once they are in the cytoplasm.

      Are those aggregates membrane bound or do they correspond to aggresomes as initially defined? The Walter lab has demonstrated a tight balance between ER phagy and ER membrane expansion (https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.0040423), which could also impact on the presence of protein aggregates in the cytosol.

      Our results suggest that these aggregates are not bound to ER membranes, as they do not appear in close proximity to the ER area marked by mCherry-AHDL in fluorescence microscopy images.

      To fully rule out this possibility, we have tested whether these Hsp104-aggregates colocalized with ER transmembrane proteins Rtn1 and Yop1, and with Gma12-GFP that marks the Golgi apparatus. In none of the cases the Hsp104-containing aggregates colocalized or were surrounded by membranes. This information will be added to the final version of the manuscript.

      With respect to autophagy, we have tested whether deletion of key genes involved in autophagy affected the N-Cap phenotype. To this end, we used deletions of vac8 and atg8 in strains expressing Cut11-GFP and/or mCherry-AHDL and found that none of them affected N-Cap formation. These data suggest that the core machinery of autophagy is not critical for HU/DIA-induced ER expansion. We plan to include this data in the final version of the manuscript along with the rest of experiments proposed.

      To get deeper insights and to fully rule out a possible contribution of macro-autophagy to the HU- and DIA-induced phenotypes, we plan to analyze by western blot whether GFP-Atg8 is induced and cleaved upon HU or DIA treatments which would be indicative of macroautophagy activation.

      To test whether the cytoplasmic aggregates are the result of an imbalance between ER-expansion and ER-phagy we plan to analyze the localization of GFP-Atg8 and Hsp104-RFP in the atg7Δ mutant, impaired in the core macro-autophagy machinery. In these conditions, the number or size of the cytoplasmic aggregates might be impacted.

      On the other hand, it has been recently shown that an ER-selective microautophagy occurs in yeasts upon ER stress (Schäfer et al., 2020; Schuck et al., 2014). This micro-ER-phagy involves the direct uptake of ER membranes into lysosomes, is independent of the core autophagy machinery and depends on the ESCRT system and is influenced by the Nem1-Spo7 phosphatase. ESCRT directly functions in scission of the lysosomal membrane to complete the uptake of the ER membrane. Interestingly, N-Caps are fragmented in the absence of cmp7 and specially in the absence of vps4 or lem2, the nuclear adaptor of the ESCRT (Figure 3E), We had initially interpreted these results as the need to maintain nuclear membrane identity during the process of ER expansion (Kume et al., 2019); however, the appearance of fragmented ER upon HU treatment in the absence of ESCRT might also be due to an inability to complete microautophagic uptake of ER membranes. To test this hypothesis, we plan to analyze whether the fragmented ER in these conditions co-localize with lysosome/vacuole markers.

      Major issue #4. Nucleotide depletion was previously shown to lead to HSP16 expression through activation of the spc1 MAPK pathway (https://academic.oup.com/nar/article/29/14/3030/2383924), one might think that HU (or diamide) could lead to this through a nucleotide dependent mechanism and not necessary through a thiol-redox protein misfolding stress. This issue has to be sorted out to ensure that the HSP effect is independent of nucleotide depletion.

      As stated in (Taricani et al., 2001), hsp16 expression is strongly induced in a cdc22-M45 mutant background. We performed experiments in this mutant that were included in the original version of the manuscript and remain in the current version (Sup. Fig. 2C) and, under restrictive conditions, we do not see spontaneous N-Cap formation. If Hsp16 overexpression and nucleotide depletion were key to the mechanism triggering N-Cap appearance, we would expect this mutant to eventually form N-Caps when placed at restrictive temperature. Furthermore, Taricani et al. show that Hsp16 expression was abolished in a Δatf1 mutant background in the presence of HU, and we found that this mutant is still able to produce N-Caps in HU; therefore, our results strongly suggest that the phenotype of N-cap is independent on the MAPK pathway and on the expression of hsp16.

      Minor issues

      1. __P1 - UPR = Unfolded Protein Response: __Corrected in the manuscript
      2. 2__. P22 - HSP upregulation "might" be indicative of a folding stress:__ Corrected in the manuscript
      3. __ The abstract does not reflect the findings presented in the manuscript. In addition, I would recommend the authors revise the storytelling in their manuscript to push forward the message on either the specific phenotype associated with perinuclear ER or on the characterization of protein misfolding stress.__ We have modified the abstract to better reflect our findings and will further revise our arguments in the final version of the manuscript once we have the results of the experiments proposed

      Reviewer 2

      Major issue #1. The authors state the cytoplasmic and ER folding are both disrupted. The impact on ER protein biogenesis would be bolstered with some biochemical data focused on the folding of one or more nascent secretory proteins. Is disulfide bond formation and/or protein folding indeed disrupted?

      We have addressed the status of secretion in cells treated with HU or DIA by assessing the morphology of the Golgi apparatus and the localization of several secretory proteins by fluorescence microscopy and found that both HU and DIA treatments impact the secretion system. In addition, we plan on addressing the redox status of ER proteins (Bip1, Pdi or Ero1) by biochemical approaches. Please see the answer to major issue #2 from reviewer 1.

      We will also analyze by western blot the biogenesis and processing of the wildtype vacuolar Carboxypeptidase Y (Cpy1-GFP) and/or alkaline phosphase (Pho8-GFP), two widely used markers to test the functionality of the ER/endomembrane system.

      Major issue #2. Increased signal of Bip1 in the expanded perinuclear ER is shown and is suggested as consistent with immobilization of BiP upon binding of misfolded proteins. The authors suggest that this increased signal must reflect Bip1 redistribution because "Bip1 levels are constant". Yet, the western image (Figure 4B) looks to show increased level of Bip1 protein up HU treatment. Given the abundance of Bip1 in cells, it seems possible that a two-fold increase in newly synthesized proteins in the perinuclear region may account for the increased signal. These original data cited by the authors uses photobleaching (not just fluorescence intensity) to show a change in crowding / mobility, which the authors should consider to support their conclusion. Alternatively, a detected increased engagement of Bip1 with substrates (e.g. pulldown experiment) would be similarly strengthening.

      This same issue arose with reviewer 3, so we decided to change the image of the western blot showing another one with less exposure and added a quantification showing that Bip1-GFP levels remain mostly constant between control conditions and treatments with HU and DIA.

      We have also performed the suggested photobleaching experiment to analyze potential changes in crowding and mobility in Bip1-GFP upon HU treatment. We found that Bip1-GFP signal recovers after photobleaching the perinuclear ER in HU-treated cells that had not yet expanded the ER, showing that Bip1-GFP is dynamic in these conditions. However, Bip1-GFP signal did not recover after photobleaching the whole N-Cap in cells that had fully developed the expanded perinuclear ER phenotype, whereas it did recover when only half of the N-Cap region was bleached. This suggests that Bip1-GFP is mobile within the expanded perinuclear ER but cannot freely diffuse between the cortical and the perinuclear ER once the N-Cap is formed.

      These data have been included in the revised version of the manuscript, in figure 4B, sup. figures 4A-B, and in page 22.

      Major issue #3. It is curious that cycloheximide (CHX) has a distinct impact on HU versus DIA treatment. Blocking protein synthesis with CHX exacerbates the phenotype with DIA, but not HU. The authors use the data with CHX to argue that their drug treatments are interfering with folding during synthesis and translation into the ER. If so, what is the rationale as to why CHX treatment decreases expansion upon HU treatment? Relatedly, is protein synthesis and/or ER import impacted upon treatment with HU and/or DIA?

      As all three reviewers had comments about the CHX and Pm-related data, we revised those experiments and noticed a phenotype occurring upon HU+CHX treatment that had gone unnoticed previously and that changed our understanding about the effect of these drugs on the ER. Briefly, we noticed that, although CHX treatment decreases the HU-induced expansion of the perinuclear ER, it indeed induced expansion but in this case in the cortical area of the ER. This means that the phenotype of ER expansion in HU is not being suppressed by addition of CHX, but rather taking place in another area of the ER (cortical ER). We do not understand why this happens; however, these results show that ER expansion is exacerbated both in DIA and HU when combined with CHX. We have included this data in Figures 3C-D and in page 21.

      We also examined the trafficking of secretory proteins that go from the ER to the cell tips and noticed that this transit was affected under both drugs (Figures 3A-B). This suggests that, although there is still protein synthesis when cells are exposed to the drugs (as can be seen by the higher levels of chaperones induced by both stresses (Figure 4C-E)), their protein synthesis capacity is possibly impinged on to certain degree. All this information is now included in the manuscript (page 18).

      Major issue #4. While the authors suggest that there is disulfide stress in the ER / nucleus, the redox environment in these compartments is not tested directly (only cytoplasmic probes).

      Although we have only included experiments using one redox sensor in the manuscript, we had tested the oxidation of several biosensors during HU and DIA exposure monitoring cytoplasmic, mitochondrial and glutathione-specific probes. We have tried to use ER directed probes however, we have not been successful due to oversaturation of the probe in the highly oxidative environment of the ER lumen.

      Although so far we have not been able to directly test the redox status of the ER with optical probes, we plan to test the folding and redox status of several ER proteins and secretory markers by biochemical approaches, so hopefully these experiments will give us more information on this question (See answer to Reviewer 1, Main Issue #2 and Reviewer 2, Main issue #1).

      Major Issue #5. What do the authors envision is the role of the cytoplasmic chaperone foci? Do CHX / Pm treatment with HU/DIA reverse the chaperone foci?

      Pm causes premature termination of translation, leading to the release of truncated, misfolded, or incomplete polypeptides into the cytosol and the re-engagement of ribosomes in a new cycle of unproductive translation, as puromycin does not block ribosomes (Aviner, 2020; Azzam & Algranati, 1973). This likely decreases the number of peptides entering the ER that can be targeted by either HU or DIA, decreasing in turn ER expansion. Indeed, we have found that Pm treatment alone results in the formation of multiple cytoplasmic protein aggregates marked by Hsp104-GFP (Figure 4K), consistent with a continuous release of incomplete and misfolded nascent peptides to the cytoplasm. This would explain why Pm treatment suppresses N-Cap formation when cells are treated with either HU or DIA.

      To further test this idea, we analyzed the number and size of Hsp104-containing cytoplasmic aggregates in cells treated with HU or DIA and Pm, where N-Caps are suppressed. As expected, we found an increase in the accumulation of proteotoxicity in the cytoplasm in these conditions. This information has now been added to the paper (Figure 4K, pages 23-24 and 29).

      On the other hand, CHX inhibits translation elongation by stalling ribosomes on mRNAs, preventing further peptide elongation but leaving incomplete polypeptides tethered to the blocked ribosomes. This reduces overall protein load entering the ER by blocking new protein synthesis and stabilizes misfolded proteins bound to ribosomes. Accordingly, it has been shown previously that blocking translation with CHX abolishes cytoplasmic protein aggregation (Cabrera et al., 2020; Zhou et al., 2014). Similarly, we have found that Hsp104 foci are not observed when we add CHX alone or in combination with HU or DIA (Figures 4K-L). These results suggest that cytoplasmic foci that we observe upon HU or DIA treatment likely contain misfolded proteins derived from ongoing translation.

      As this question had also been raised by reviewer 1, we further explored the nature of these cytoplasmic foci (please see answer to Reviewer1, Issue 3). Briefly:

      • We tested whether they colocalize with the foci of Guk1-9-GFP and Rho1.C17R-GFP reporters of misfolding that appear upon HU or DIA treatments and, indeed, Hsp104-containing aggregates colocalize with Guk1-9-GFP and Rho1.C17R-GFP. This information has now been added to the paper (Figure 4I-J, pages 23-24 and 29).
      • We tested whether these foci were membrane bound with several ER transmembrane proteins (Tts1, Yop1, Rtn1) and integral membrane protein Ish1, and in none of the cases we detected membranes surrounding the aggregates. This information will be included in the final version of the paper.
      • We plan to test whether the cytoplasmic foci represent proteins retro-translocated from the ER.
      • We will also test whether autophagy or an imbalance between ER expansion and ER-phagy might contribute to the accumulation of cytoplasmic protein foci. The new data regarding the suppression of cytoplasmic foci by CHX treatment has already been included in the current version of the manuscript in Figure 4K and in the text (page 29).

      The authors argue that cytoplasmic foci are "independent" from ER expansion and are "not a direct consequence of thiol stress" based on the observation that DTT does not reverse these foci. This seems like a strong statement based on the limited analysis of these foci.

      We agree with the reviewer. We have toned down our statements about the relationship between thiol stress, the cytoplasmic chaperone foci and their relationship with ER expansion. We have removed from the text the statement that cytoplasmic foci are independent from ER expansion and thiol stress and have further revised our claims about CHX and Pm in the main text and the discussion to address these and the other reviewers’ concerns.

      Major Issue #6. Based on the transcriptional data, the authors speculate a potential role on role on iron-sulfur cluster protein biogenesis. This would seem to be rather straightforward to test.

      To address this issue, we plan to analyze the localization of proteins involved in iron-sulfur cluster assembly and/or containing iron-sulfur clusters by in vivo fluorescence microscopy, such as DNA polymerase Dna2 or Grx5, during HU or DIA treatments.

      Related to this, we have found that a subunit of the ribonucleotide reductase (RNR) aggregated in the cytoplasm upon HU exposure (Figure S2B). It is worth noting that RNR is an iron-containing protein whose maturation needs cytosolic Grxs (Cotruvo & Stubbe, 2011; Mühlenhoff et al., 2020). The catalytic site, the activity site (which governs overall RNR activity through interactions with ATP) and the specificity site (which determines substrate choice) are located in the R1 (Cdc22) subunits, which are the ones that aggregate, while the R2 subunits (Suc22) contain the di-nuclear iron center and a tyrosyl radical that can be transferred to the catalytic site during RNR activity (Aye et al., 2015). The fact that a subunit of RNR aggregates could be related to an impingement on its synthesis and/or maturation due to defects in iron-sulfur cluster formation, as it has been recently published that RNR cofactor biosynthesis shares components with cytosolic iron-sulfur protein biogenesis and that the iron-sulfur cluster assembly machinery is essential for iron loading and cofactor assembly in RNR in yeast (Li et al., 2017). This information has been added to the discussion.

      Major Issue #7. The authors suggest that "pre-treatment" with DTT before HU addition suppresses formation of the N-Caps. However, these samples (Figure 2J) contain DTT coincident with the treatment as well. To say it is the effect of pre-treatment, the DTT should be added and then washed out prior to HU or DIA addition. Alternatively, the language used to describe these experiments and their outcomes could be revised.

      We modified the language used to describe the experiment in the manuscript, as suggested by the reviewer, to clarify that while DTT is kept in the medium, N-Caps never form. In addition, we have also performed a pre-treatment with DTT; adding 1 mM DTT one hour before, washing the reducing agent out and adding HU to the medium then. The result indicates that pre-treating cells with DTT significantly reduces N-Cap formation after a 4-hour incubation with HU, which suggests that triggering reducing stress “protects” cells from the oxidative damage induced by HU and DIA. This information has been also added to the manuscript (Figure 2J).

      Major Issue #8. For a manuscript with 128 references there is rather limited discussion of the data in the context of the wider literature. The discussion primarily focuses on a recap of the results. The authors do cite several prior works focused on redox-dependent nuclear expansion. However, while cited, there is no real discussion of the relationship between this work in the context of that previously published (including several known disulfide bonded proteins that are involved in nuclear/ER architecture).

      We have revised and expanded our discussion. In addition, in the final revision of our work we will increase the discussion in the context of the new results obtained.

      Minor points

      1. __ Figure numbering goes from figure 4 to S6 to 5.__ We have updated the numbering of the figures after merging several supplementary figures, so now this issue is fixed.

      __ It would be helpful to the reader to explain what some of the reporters are in brief. For example, Guk1-9-GFP and Rho1.C17R-GFP reporters__.

      Both the Guk1-9-GFP and Rho1.C17R-GFP are two thermosensitive mutants in guanylate kinase and Rho1 GTPase respectively, that have been previously used in S. pombe as soluble reporters of misfolding in conditions of heat stress. During mild heat shock, both mutants aggregate into reversible protein aggregate centers (Cabrera et al., 2020). This information has now been added to the manuscript.

      __ Supplementary Figure 3. The main text suggests panel 3A is focused on diamide treatment. The figure legend discusses this in terms of HU treatment. Which is correct?__

      We thank the reviewer for pointing out this mistake. The experiment was performed in 75 mM HU, the legend was correct. It has now been corrected in the manuscript.

      __ The authors use ref 110 and 111 to suggest the importance of UPR-independent signaling. However, they do not point out that this UPR-independent signaling referred to in these papers is dependent on the UPR transmembrane kinase IRE1.__

      We have included pertinent clarification in the new discussion.

      Reviewer 3

      Major issue #1. It is hard to see how the claim of ER stress can be supported if BiP levels do not change (Fig. 4B). Also, this figure is overexposed. The RNA-seq data should be able to establish ER stress as well, but no rigorous analysis of ER stress markers is presented.

      Regarding the levels of Bip1, we now show in Figure 4 a less exposed image of the western blot, and a quantification of Bip1-GFP intensity from three independent experiments. We find that, in our experimental conditions, neither HU nor DIA treatments significantly altered Bip1 levels.

      With respect to the RNA-Seq, as we mentioned in the major issue 1 from reviewer 1, we reassessed our data to further clarify and add information about ER stress markers induced or repressed by HU and DIA.

      Major issue #2. The interpretation of the CHX and puromycin experiments of Figure 3A-B is hard to follow. My best guess is that the authors argue that CHX decreases misfolded protein load and that puromycin increases misfolded protein load, and that since DIA is a stronger oxidative stress than HU hence CHX is only protective under HU and not DIA. However, while CHX decreases misfolded protein load, puromycin hasn't been show directly to increase it and I don't see how this explains puromycin being protective at all.

      We have found that puromycin treatment alone results in the formation of cytoplasmic foci containing Hsp104, suggesting that puromycin indeed increases folding stress in the cytoplasm. We have now included this data in Figure 4K (please see Main Issue #5 from Reviewer 2). Pm suppresses the formation of N-caps induced by HU or DIA; however, we have not addressed cell survival or fitness in these conditions and therefore we cannot conclude about being protective.

      In addition, upon the reevaluation of our data, we have realized that CHX treatment suppresses HU-induced perinuclear expansion, although it does not suppress but instead enhances ER expansion in the cortical region. This data has been added to the present version of the manuscript in Figure 3C-D (pages 20-21).

      Furthermore, puromycin causes Ca leakage from the ER (which can be recapitulated with thapsigargin and blocked with anisomycin; easy experiments), which could be responsible for the differences from CHX, and the model does not address the effects on downstream stress signaling. The authors should be much more clear regarding their argument, since this data is used to support the argument of disrupted ER proteostasis.

      Thapsigargin has been described to be ineffective in yeasts as they lack a (SERCA)‐type Ca2+ pump which is the target of this drug (Strayle et al., 1999). However, deletion of the P5A-type ATPase Cta4, which is required for calcium transport into ER membranes (Lustoza et al., 2011), reduced but did not abolish ER expansion. We also tested the effect of anisomycin. We found that anisomycin in combination with HU or DIA mimicked CHX behavior (ER expansion occurrs in both conditions, exacerbating perinuclear ER expansion in combination with DIA and cortical ER expansion when combined with HU). It is difficult to correlate this result with a role of Ca leakage in ER expansion, as there is no recent information regarding CHX and Ca leakage, although it has been indicated that CHX treatment does not increase cytoplasmic Ca levels (Moses & Kline, 1995). As anisomycin, like CHX, blocks protein synthesis and stabilizes polysomes, what we can conclude from this information is that nascent peptides attached to ribosomes during protein synthesis do promote ER expansion when combined with HU or DIA. This information will be added to the final version of the paper.

      Regarding the downstream effects of HU or DIA treatment on ER proteostasis, we plan to further explore the effect of these drugs on the secretory system (please see major issue #2 from Reviewer 1) and to evaluate the redox state and processing of several key ER and secretory proteins. We have also further explored the nature of the aggregates that appear in the cytoplasm in our experimental conditions, which also shed light into the downstream effects of these drugs in cytoplasmic proteostasis (please see answer to issue #5 from Reviewer 2).

      Major issue #3. The claim that a canonical UPR is not induced is weak. First, the transcriptional program of S. cerevisiae from Travers et al is used as the canonical UPR, and compared to HU/DIA induced stress in S. pombe. These organisms may not be similar enough to assume that they have transcriptionally identical UPRs. Second, no consideration is given to the mechanism by which the different transcripts are modulated between "canonical" and HU/DIA induced UPR. Is it solely through RIDD, or does it point to differences in sensing or signaling transduction?

      We readdressed this topic by analyzing the genes that have been described to be differentially expressed during UPR activation in S. pombe and comparing them with our data by reevaluating our transcriptomic data.. The re-analysis of our RNA-Seq data have allowed us to infer the mechanisms that modulate the ER response to HU or DIA treatment and further separate them from UPR. This information has been added to the paper (page 26). As an alternative approach, we will also analyse the levels of UPR targets by western blot upon HU or DIA treatment

      Finally, the p-values used are unadjusted (e.g. by Bonferroni's method or by ANOVA or at least controlled by an FDR approach) and unmodulated (extremely important when n = 3 and variance is poorly sampled), which makes them not dependable. It looks like HSF1 targets are induced, which should be addressed.

      We thank the reviewer for pointing this out. We forgot to include this information which now appears in the M&M section as follows:

      “A gene was considered as differentially expressed when it showed an absolute value of log2FC(LFC)≥1 and an adjusted p-valueIn this regard, we are currently performing proteome-wide mass spectrometry experiments to detect protein glutathionylation in our conditions, as it has been previously shown that DIA treatment leads to glutathionylation of key ER proteins such as Bip1, Pdi or Ero1 (Lind et al., 2002; Wang & Sevier, 2016), which might by reproduced upon HU treatment. We also plan to test the folding and processing of specific secretory cargoes by western blot in our experimental conditions (see below, and Reviewer 2, Major issue #1).

      We have already tested whether mutant strains with deletions of key enzymes in both cytoplasmic and ER redox systems are able to expand the ER upon HU or DIA treatment. We have found that only pgr1Δ (glutathione reductase), gsa1Δ (glutathione synthetase) and gcs1Δ (glutamate-cysteine ligase) mutants fully suppressed N-Cap formation, which suggests that glutathione has an important role in the phenotype of ER expansion. We have now added the pgr1Δ mutant strain to the main text of the manuscript (Figure 5C, page 30).

      Major issue #5. Figure S5 presents weak ER expansion in fibrosarcoma cells in response to HU (at very low concentrations and DIA is not included). The lack of any other phenotypes being presented could suggest that such experiments were done but didn't show any effect. The authors should straightforwardly discuss whether they performed experiments looking for perinuclear ER expansion or NPC clustering, and if not, what challenges precluded such experiments. Given how important this line of experimentation is for establishing generality, much more discussion is needed here.

      We not only investigated the effects of HU on the ER in mammalian cells, but also of DIA. The results from this experiment mimicked the effect of HU (an increase in ER-ID fluorescence intensity in DIA). We merely excluded this information from the manuscript because we were focusing on HU at that point due to its importance as it is used currently in clinics. In this new version of the manuscript, we have included an extra panel in supplementary figure 5 to show the results from DIA in mammalian cells.

      Minor concerns

      1) Figure 1A should show individual data points (i.e. 3 averages of independent experiments) in the bar graph.

      Although we initially changed the graph, we believe the bar plot disposition facilitates its comprehension and went back to the initial one. Also, as the rest of the graphs similar to 1A are all expressed as bar plots. Therefore, we preferred keeping the figure as it was in the original version. However, we include here the graph with each of the averages of the independent experiments.

      2) It is argued that Figure 1B demonstrates that the SPB is clustered with the NPC cluster. However, a single image is not enough to support this claim, as the association could be coincidental.

      We have changed the image to show a whole population of cells, with several of them having NPC clusters, and we have indicated the position of SPB in each of them (all colocalizing with the N-Cap).

      3) Figures 1B through 1D do not indicate the HU concentration.

      We thank the reviewer for pointing out this mistake. Figures 1B and 1C represent cells exposed to 15 mM HU for 4 hours, while the graph in 1D shows the results from cells exposed to 75 mM HU over a 4-hour period. This information has been now added to the corresponding figure legend.

      4) I was confused by the photobleaching experiments of Figure S1. How do the authors know that there is complete photobleaching of the cytoplasm or nucleus in the absence of a positive control? If photobleaching is incomplete, they could be measuring motility without compartments rather than transport between compartments, and hence the conclusion that trafficking is unaffected could be wrong.

      Our control is the background of each microscopy image; we make sure that after the laser bleaches a cell, the bleached area coincides with the background noise. That way, we make sure that fluorescence from any remaining GFP is completely removed from the bleached area.

      5) On page 8, they say "exposure to DIA" when they intend HU.

      This has been corrected in the manuscript.

      6) In Figure S3A, the colocalization of INM proteins with the ER are presented. It is not clearly explained what conclusions are meant to be drawn from this figure, but it seems it would have been more useful to compare INM and Cut11, to see whether the NPCs are localizing at the INM or ONM.

      We have added an explanation in the main text to clarify the main conclusions derived from this figure. We think that NPCs localize in a section of the nucleus where the two membranes (INM and ONM) are still bound together.

      7) I had to read Figure 2C's description and caption several times to understand the experiment. A schematic would be helpful. 20 mM HU is low compared to most conditions used. Does repositioning eventually take place for 75 mM HU or 3 mM DIA treatment, or do the cells just die before they get a chance?

      20 mM HU was used in this experiment to provide a time frame suitable for analysis after HU addition, as a higher HU concentration increases the repositioning time. We found that both HU (75mM 4h) and DIA (3mM 4h)-induced ER expansions are reversible upon drug washout. If HU is kept in the media, ER expansions are eventually resolved. However, DIA is a strong oxidant and if it is kept in the media ER expansions are not resolved and cells do not survive.

      8) Figure 2D shows little oxidative consequence from 75 mM HU treatment until 40 min., the same time that phenotypes are observed (Figure 1D). Is this relationship consistent with the kinetics of other concentrations of HU, or of DIA? Seems like a pretty important mechanistic consideration that can rationalize the effects of the two oxidants.

      Thanks to this comment we realized that the numbering underneath Figure 1D (1E in the new version of the manuscript) was wrongly annotated. The original timings shown in the figure were “random”, meaning that the time stablished as 40 minutes was not measuring the passing of 40 minutes since the beginning of the experiment. We have now corrected this panel: the timings are now normalized to the moment when NPCs cluster. The fact that, before, that moment coincided with “40 minutes” does not mean N-Caps appear at that time point in HU (they indeed appear after a >2 hour incubation).

      9) Figure S4 is missing the asterisk on the lower left cell.

      Fixed in the corresponding figure.

      10) How is roundness determined in Figure S4B?

      Roundness in Figure S4B (now S2E) is determined the same way as in Figure 1D, and as is described in the Method section (copied below). A clarification has been added to the legend to address that.

      The ‘roundness’ parameter in the ‘Shape Descriptors’ plugin of Fiji/ImageJ was used after applying a threshold to the image in order to select only the more intense regions and subtract background noise (Schindelin et al., 2012). Roundness descriptor follows the function:

      where [Area] constitutes the area of an ellipse fitted to the selected region in the image and [Major axis] is the diameter of the round shape that in this case would fit the perimeter of the nucleus.

      11) What threshold is used to determine whether cells analyzed in Figures S4C have "small ER" or "large ER"?

      Large ER are considered when their area along the projection of a 3-Z section is over 4 μm2 (more than twice the mean area of the ER in cells with N-Caps in milder conditions). This has now been clarified in the legend of the corresponding figure.

      __12) The authors interpret Figure 4K as indicating that ER expansion is not involved in the generation of punctal misfolded protein aggregates. However, the washout occurs only after the proteins have already aggregated. The proper interpretation is that the aggregates are not reversible by resolution of the stress, and hence are not physically reliant on disulfide bonds. __

      We agree with the reviewer and have modified the interpretation of the indicated figure accordingly (page 29).


      The speculation that these proteins are iron dependent is a stretch; there is no reason to believe that losses of iron metabolism are the most important stress in these cells. It seems at least as likely that oxidizing cysteine-containing proteins in the cytosol or messing with the GSH/GSSG ratio in the cytosol would make plenty of proteins misfold; oxidative stress in budding yeast does activate hsf1. However, this point could be addresses by centrifugation and mass spectrometry to identify the aggregated proteome. It is also surprising that the authors did not investigate ER protein aggregation, perhaps by looking at puncta formation of chaperones beyond BiP. By contrast, the fact that gcs1 deletion prevents ER expansion but does not prevent Hsp104 puncta does support the idea that cytoplasmic aggregation is not dependent on ER expansion.

      To address this suggestion, we plan to analyze the localization of other chaperones and components of the protein quality control such as the ER Hsp40 Scj1 or the ribosome-associated Hsp70 Sks2.

      13) Figure 4L is cited on page 28 when Figure 4K is intended.

      This has been corrected in the text, although new panels have been added and now it is 4N.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This article makes the following claims, using S. pombe as their model system. Hydroxyurea (HU) and diamide (DIA) induce ER stress, an atypical UPR, and cytoplasmic protein aggregation. HU and DIA induce IRE1-independent and GSH-dependent reversible ER perinuclear expansion which causes nuclear pore clustering with no effect on protein trafficking, and can be reversed by DTT.

      Major concerns:

      1. It is hard to see how the claim of ER stress can be supported if BiP levels do not change (Fig. 4B). Also, this figure is overexposed. The RNA-seq data should be able to establish ER stress as well, but no rigorous analysis of ER stress markers is presented.
      2. The interpretation of the CHX and puromycin experiments of Figure 3A-B is hard to follow. My best guess is that the authors argue that CHX decreases misfolded protein load and that puromycin increases misfolded protein load, and that since DIA is a stronger oxidative stress than HU hence CHX is only protective under HU and not DIA. However, while CHX decreases misfolded protein load, puromycin hasn't been show directly to increase it and I don't see how this explains puromycin being protective at all. Furthermore, puromycin causes Ca leakage from the ER (which can be recapitulated with thapsigargin and blocked with anisomycin; easy experiments), which could be responsible for the differences from CHX, and the model does not address the effects on downstream stress signaling. The authors should be much more clear regarding their argument, since this data is used to support the argument of disrupted ER proteostasis.
      3. The claim that a canonical UPR is not induced is weak. First, the transcriptional program of S. cerevisiae from Travers et al is used as the canonical UPR, and compared to HU/DIA induced stress in S. pombe. These organisms may not be similar enough to assume that they have transcriptionally identical UPRs. Second, no consideration is given to the mechanism by which the different transcripts are modulated between "canonical" and HU/DIA induced UPR. Is it solely through RIDD, or does it point to differences in sensing or signaling transduction? Finally, the p-values used are unadjusted (e.g. by Bonferroni's method or by ANOVA or at least controlled by an FDR approach) and unmodulated (extremely important when n = 3 and variance is poorly sampled), which makes them not dependable. It looks like HSF1 targets are induced, which should be addressed.
      4. Mechanistically, one would expect effects to be mediated by PDIs and oxidoreductases. No effort is made to characterize the redox state of these molecules, nor how that relates to the kinetics of ER expansion and resolution under HU/DIA treatment. No discussion is made of the existing literature on oxidants and ER stress. A few papers: PMID: 29504610, PMID: 31595201.
      5. Figure S5 presents weak ER expansion in fribrosarcoma cells in response to HU (at very low concentrations and DIA is not included). The lack of any other phenotypes being presented could suggest that such experiments were done but didn't show any effect. The authors should straightforwardly discuss whether they performed experiments looking for perinuclear ER expansion or NPC clustering, and if not, what challenges precluded such experiments. Given how important this line of experimentation is for establishing generality, much more discussion is needed here.

      Minor concerns:

      1. Figure 1A should show individual data points (i.e. 3 averages of independent experiments) in the bar graph.
      2. It is argued that Figure 1B demonstrates that the SPB is clustered with the NPC cluster. However, a single image is not enough to support this claim, as the association could be coincidental.
      3. Figures 1B through 1D do not indicate the HU concentration.
      4. I was confused by the photobleaching experiments of Figure S1. How do the authors know that there is complete photobleaching of the cytoplasm or nucleus in the absence of a positive control? If photobleaching is incomplete, they could be measuring motility without compartments rather than transport between compartments, and hence the conclusion that trafficking is unaffected could be wrong.
      5. On page 8, they say "exposure to DIA" when they intend HU.
      6. In Figure S3A, the colocalization of INM proteins with the ER are presented. It is not clearly explained what conclusions are meant to be drawn from this figure, but it seems it would have been more useful to compare INM and Cut11, to see whether the NPCs are localizing at the INM or ONM.
      7. I had to read Figure 2C's description and caption several times to understand the experiment. A schematic would be helpful. 20 mM HU is low compared to most conditions used. Does repositioning eventually take place for 75 mM HU or 3 mM DIA treatment, or do the cells just die before they get a chance?
      8. Figure 2D shows little oxidative consequence from 75 mM HU treatment until 40 min., the same time that phenotypes are observed (Figure 1D). Is this relationship consistent with the kinetics of other concentrations of HU, or of DIA? Seems like a pretty important mechanistic consideration that can rationalize the effects of the two oxidants.
      9. Figure S4 is missing the asterisk on the lower left cell.
      10. How is roundness determine in Figure S4B?
      11. What threshold is used to determine whether cells analyzed in Figures S4C have "small ER" or "large ER"?
      12. The authors interpret Figure 4K as indicating that ER expansion is not involved in the generation of punctal misfolded protein aggregates. However, the washout occurs only after the proteins have already aggregated. The proper interpretation is that the aggregates are not reversible by resolution of the stress, and hence are not physically reliant on disulfide bonds. The speculation that these proteins are iron dependent is a stretch; there is no reason to believe that losses of iron metabolism are the most important stress in these cells. It seems at least as likely that oxidizing cysteine-containing proteins in the cytosol or messing with the GSH/GSSG ratio in the cytosol would make plenty of proteins misfold; oxidative stress in budding yeast does activate hsf1. However, this point could be addresses by centrifugation and mass spectrometry to identify the aggregated proteome. It is also surprising that the authors did not investigate ER protein aggregation, perhaps by looking at puncta formation of chaperones beyond BiP. By contrast, the fact that gcs1 deletion prevents ER expansion but does not prevent Hsp104 puncta does support the idea that cytoplasmic aggregation is not dependent on ER expansion.
      13. Figure 4L is cited on page 28 when Figure 4K is intended.

      Significance

      This paper is for the most part well-written, presenting a logical chain of experiments that fully support the most important claims that have been made. Specifically, they show that HU and DIA induce reversible perinuclear expansion and nuclear pore clustering in an IRE1-independent and GSH-dependent manner, and that DTT can prevent and accelerate recovery of this phenotype. Both oxidants clearly induce protein aggregation in the cytosol. The evidence that perinuclear expansion is responsible for nuclear pore clustering is compelling, with strong support from the kinetics and the nup120 deletion experiments. Some conclusions are not supported, including the claim of an atypical UPR and of ER stress, but the validity of these claims does not substantively affect the overall importance of the paper and could be handled by withdrawal or tempering of the claims. The lack of a molecular mechanism connecting oxidation with ER expansion moderately detracts from the potential impact. Adequate experimental detail is provided unless otherwise noted

      This paper is likely to be important for cell biologists interested in interorganelle communication and how the cell responds to oxidative stress. Modulating ER oxidoreductase activity has been shown to be a powerful way to regulate ER stress and proteostasis, and this paper shows how specific oxidative stresses that have not widely been investigated in this context, as opposed to the more commonly studied reductive and electrophilic stresses, can remodel the ER with cell-wide consequences. More specifically, the nuclear pore and nuclear morphology phenotypes, while not yet functionally significant in yeast, could be significant in other unexplored ways identified in the future. Towards that end, it would be valuable to see if these gross phenotypes reproduce in any metazoan cell or tissue, rather than just looking at ER expansion as in the current manuscript. My expertise is centered around ER proteostasis and chaperones, and as such I consider this paper important to my field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Sánchez-Molina et al describes a striking time and dose-dependent clustering of nuclear pores and perinuclear ER expansion in response to hydroxyurea (HU) or diamide (DIA) treatment in S. pombe. Using microscopy, the authors establish clustering is reversible upon drug washout or extended drug treatment. Pretreatment or post-treatment with the reductant DTT prevents or reverses the clustering and expansion effects, as does the release of translating polypeptides from ribosomes (with puromycin). The phenotypes were established to occur independent of the established impact of HU on RNR activity and the cell cycle. The authors suggest instead that the phenotypes (referred to as nuclear-cap (N-Cap) formation) are associated with disulfide-based folding stress. Overlapping transcriptional responses for HU and DIA treatment suggest that cells are experiencing folding stress (based on chaperone induction) and/or a disruption in iron homeostasis (induction of genes involved in iron homeostasis). The observed clustering, ER expansion, and transcriptional profiles are independent of the well-established ER stress response pathway: the UPR.

      The manuscript outlines several interesting phenotypic observations, and they establish the potential for conserved of this ER expansion and nuclear pore clustering from yeast (S. cerevisiae) and mammals (HT1080 fibrosarcoma cells). Data clearly establish the time and dose-dependent formation of these interesting structures. Additional experiments with combined drug treatments points towards a role for changes in the redox environment in cells, an impact on cytoplasmic protein aggregation, and a potential impact on the ER folding environment / ER redox environment.

      Data obtained with thiol oxidants and reductants, alongside translation inhibitors, suggest a potential connection between the N-Cap phenotype and oxidative folding within the ER. Yet, this latter observation remains a suggestive link with less clear mechanistic connections. Some experiments that would more directly assess the suggested changes within the nuclear/ER region are outlined below.

      1. The authors state the cytoplasmic and ER folding are both disrupted. The impact on ER protein biogenesis would be bolstered with some biochemical data focused on the folding of one or more nascent secretory proteins. Is disulfide bond formation and/or protein folding indeed disrupted?
      2. Increased signal of Bip1 in the expanded perinuclear ER is shown and is suggested as consistent with immobilization of BiP upon binding of misfolded proteins. The authors suggest that this increased signal must reflect Bip1 redistribution because "Bip1 levels are constant". Yet, the western image (Figure 4B) looks to show increased level of Bip1 protein up HU treatment. Given the abundance of Bip1 in cells, it seems possible that a two-fold increase in newly synthesized proteins in the perinuclear region may account for the increased signal. These original data cited by the authors uses photobleaching (not just fluorescence intensity) to show a change in crowding / mobility, which the authors should consider to support their conclusion. Alternatively, a detected increased engagement of Bip1 with substrates (e.g. pulldown experiment) would be similarly strengthening.
      3. It is curious that cycloheximide (CHX) has a distinct impact on HU versus DIA treatment. Blocking protein synthesis with CHX exacerbates the phenotype with DIA, but not HU. The authors use the data with CHX to argue that their drug treatments are interfering with folding during synthesis and translation into the ER. If so, what is the rationale as to why CHX treatment decreases expansion upon HU treatment? Relatedly, is protein synthesis and/or ER import impacted upon treatment with HU and/or DIA?
      4. While the authors suggest that there is disulfide stress in the ER / nucleus, the redox environment in these compartments is not tested directly (only cytoplasmic probes).

      Addition suggestions / comments:

      1. What do the authors envision is the role of the cytoplasmic chaperone foci? Do CHX / Pm treatment with HU/DIA reverse the chaperone foci? The authors argue that cytoplasmic foci are "independent" from ER expansion and are "not a direct consequence of thiol stress" based on the observation that DTT does not reverse these foci. This seems like a strong statement based on the limited analysis of these foci.
      2. Based on the transcriptional data, the authors speculate a potential role on role on iron-sulfur cluster protein biogenesis. This would seem to be rather straightforward to test.
      3. The authors suggest that "pre-treatment" with DTT before HU addition suppresses formation of the N-Caps. However, these samples (Figure 2J) contain DTT coincident with the treatment as well. To say it is the effect of pre-treatment, the DTT should be added and then washed out prior to HU or DIA addition. Alternatively, the language used to describe these experiments and their outcomes could be revised.
      4. For a manuscript with 128 references there is rather limited discussion of the data in the context of the wider literature. The discussion primarily focuses on a recap of the results. The authors do cite several prior works focused on redox-dependent nuclear expansion. However, while cited, there is no real discussion of the relationship between this work in the context of that previously published (including several known disulfide bonded proteins that are involved in nuclear/ER architecture).

      Minor points

      1. Figure numbering goes from figure 4 to S6 to 5.
      2. It would be helpful to the reader to explain what some of the reporters are in brief. For example, Guk1-9-GFP and Rho1.C17R-GFP reporters.
      3. Supplementary Figure 3. The main text suggests panel 3A is focused on diamide treatment. The figure legend discusses this in terms of HU treatment. Which is correct?
      4. The authors use ref 110 and 111 to suggest the importance of UPR-independent signaling. However, they do not point out that this UPR-independent signalling referred to in these papers is dependent on the UPR transmembrane kinase IRE1.

      Significance

      An interesting finding that is well-supported as a phenotype. What would raise the impact would be data that connect these observations more directly with a mechanism. In particular, there are suggestions of a disruption in ER folding and/or the ER redox environment that are logical but not directly tested. How one viewed these additional experiments will depend on what journal is considering the manuscript.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Sanchez-Molina describe the impact of hydroxyurea on the remodeling of the nuclear pore complex (clustering) and the expansion of both cortical and perinuclear ER. The study is carried out in S. pombe, and the observations confirmed in S. cerevisiae. Results are clear and analyzed properly, however considering the differences in UPR signaling in both yeast strains the conclusions raised may remain to be fully documented.

      Major issues

      Regarding the conclusions on IRE1 signaling, both yeast species have different IRE1 activities https://elifesciences.org/articles/00048), the total deletion of IRE1 in S pombe appears to indicate that expansion of perinuclear ER is independent of IRE1, however since IRE1 signaling has exclusively a negative impact on mRNA expression, it might be relevant to identify mRNA whose expression is stabilized under those circumstances and evaluate whether those could confer a mechanism which would also yield perinuclear ER expansion (eg differential deregulation of ER stress controlled lipid biosynthesis required for lipid membrane synthesis). In S cerevisiae, do the authors observe HAC1 mRNA splicing?

      The authors indicate that HU and DIA lead to thiol stress, it might be relevant to evaluate the thiol-redox status of major secretory proteins in S pombe (or even cargo reporters if necessary) to fully document the stress impact on global protein redox status. What happens if HU-treated yeast cells are grown in the presence of n-acetyl cysteine?

      The appearance of cytosolic aggregates is intriguing, do the authors have any idea on the nature of the protein aggregates? Are those resulting from proficient retrotranslocation (or reflux of misfolded proteins from the ER? Are those aggregates membrane bound or do they correspond to aggresomes as initially defined?

      The Walter lab has demonstrated a tight balance between ER phagy and ER membrane expansion (https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.0040423), which could also impact on the presence of protein aggregates in the cytosol. Does HU impact on the regulation of autophagy?

      Nucleotide depletion was previously shown to lead to HSP16 expression through activation of the spc1 MAPK pathway (10.1093/nar/29.14.3030), one might think that HU (or diamide) could lead to this through a nucleotide dependent mechanism and not necessary through a thiol-redox protein misfolding stress. This issue has to be sorted out to ensure that the HSP effect is independent of nucleotide depletion.

      Minor issues

      P1 - UPR = Unfolded Protein Response

      P22 - HSP upregulation "might" be indicative of a folding stress

      The abstract does not reflect the findings presented in the manuscript. In addition, I would recommend the authors to revise the story telling in their manuscript to push forward the message on either the specific phenotype associated with perinuclear ER or on the characterization of protein misfolding stress.

      Significance

      This is a nice manuscript describing the likely effects of HU on protein misfolding and several consequences including the remodeling of the nuclear pore complex, the expansion of both cortical and perinuclear ER. The underlying mechanisms remain however unclear (for each parameter evaluated) and the manuscript would definitely benefit from the elucidation of one of those (if not more).

      The work in yeast is novel and might bring light on mechanisms existing in mammalian systems. Since HU is used as a therapeutic, the characterization of the molecular mechanisms associated with its mode(s) of action will definitely be useful for better (targeted) efficiency.

      The audience for this work is more targeted towards people working on yeast cell biology, however, the authors could expand the discussion section to make it of a broader scope.

      I am expert on ER stress signaling

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02605

      Corresponding author: Woo Jae, Kim

      1. ____Point-by-point description of the revisions

      Reviewer #1

      General Comment: This study investigates the role of the foraging gene in modulating interval timing behaviors in flies, with a particular focus on mating duration. Using single-cell RNA sequencing and gene knockdown experiments, the research demonstrates the crucial role of foraging gene expression in Pdfr-positive cells for achieving longer mating duration (LMD). The study further identifies key neurons in the ellipsoid body (EB) as essential when the foraging gene is overexpressed, highlighting its specific influence on LMD. The findings suggest that a small subset of EB neurons must express the foraging gene to modulate LMD effectively.

      __Answer:____ __We would like to express our gratitude to the reviewer for their insightful comments and positive feedback on our manuscript. During the revision process, we serendipitously discovered that the heart-specific expression of the foraging gene plays a crucial role in regulating LMD behavior. We have elaborated on the significance of this finding in the revised manuscript and have addressed the reviewer's comments accordingly.

      Comment 1. *(optional) Integration of Neuronal Subsets into a Pathway: The knockdown experiments indicate that a small subset of neurons must express the foraging gene to influence LMD. Could these neurons be integrated into a potential signaling pathway, or being treated as separate components within the brain circuit? How might this integration provide a more cohesive understanding of their role in LMD? *

      Answer: We sincerely thank the reviewer for her/his insightful comments regarding the integration of neuronal subsets into a signaling pathway and their potential role in modulating LMD behavior. During the revision process, we conducted further experiments to address this question. While we were unable to identify a specific small subset of EB neurons expressing foraging, we utilized the recently developed EB-split GAL4 driver line (SS00096), which is restricted to the EB region of the brain, to confirm that foraging expression in the EB is indeed crucial for generating LMD behavior (Fig. 4L-M). This finding underscores the importance of foraging in specific neural circuits within the EB for interval timing.

      Additionally, we discovered that foraging expression in Hand-GAL4-labeled pericardial cells (PCs) of the heart is essential for LMD behavior. These PCs are also partially labeled by fru-GAL4 and 30y-GAL4 drivers, indicating that foraging functions in both neuronal and non-neuronal tissues to regulate interval timing. Importantly, we observed that group-reared males exhibit higher calcium activity in PCs compared to socially isolated males, suggesting that social context-dependent calcium dynamics in the heart play a critical role in modulating LMD behavior.

      These findings highlight a novel integration of neuronal and cardiac mechanisms, where foraging expression in both the EB and heart coordinates calcium dynamics to regulate interval timing. This dual-tissue involvement provides a more cohesive understanding of how foraging integrates social cues with internal physiological states to modulate complex behaviors like LMD. We believe this integration of neuronal and cardiac pathways offers a comprehensive framework for understanding the gene’s pleiotropic roles in behavior. We have included these new findings in the revised manuscript to better address the reviewer’s question and to strengthen the discussion of how foraging functions across tissues to regulate interval timing behaviors.

      Comment 2. Genetic Considerations in Gal4 System Usage (Fig. 1D): In the study, the elavc155-Gal4 transgene, located on chromosome I, produces hemizygous males after crossing, while the repo-Gal4 transgene, located on chromosome III, results in heterozygous males. Is there any evidence suggesting that this genetic configuration could impact the experimental outcomes? If so, what steps could be taken to address potential issues?

      Answer: We appreciate the reviewer’s thoughtful consideration of potential genetic confounds related to the chromosomal locations of the elavc155 and repo-GAL4 transgenes. To address this concern, we conducted additional experiments using the nSyb-GAL4 driver, which is located on the third chromosome, and observed that knockdown of foraging with this driver also disrupts LMD behavior (Fig. S1G). This result aligns with our findings using elavc155 (chromosome I) and repo-GAL4 (chromosome III), indicating that the chromosomal location of the GAL4 transgene does not significantly impact the experimental outcomes.

      Furthermore, our extensive tissue-specific GAL4 screening, which included drivers on different chromosomes, consistently demonstrated that foraging knockdown effects on LMD are robust and reproducible across various genetic configurations. These results suggest that the observed behavioral deficits are due to the loss of foraging function rather than positional effects of the GAL4 transgenes. We thank the Reviewer for raising this important point and have taken care to address it thoroughly in our revised manuscript.

      Comment 3. Discrepancies in lacZ Signal Intensity (Fig. 5A): The observed discrepancies in lacZ signal intensity on the surface of the male brain have been attributed to the dissection procedure. Is it feasible to replace the current data with a new, more consistent dataset? How might improved dissection techniques mitigate these discrepancies?

      Answer____: We thank the reviewer for her/his observation regarding the discrepancies in lacZ signal intensity on the surface of the male brain, which we attributed to variations in the dissection procedure. While replacing the current dataset with a new one is feasible, we have instead shifted our focus to address this concern by leveraging more reliable and validated tissue-specific GAL4 drivers combined with foraging-RNAi.

      During the revision process, we extensively examined multiple foraging-GAL4 lines and found that foraging expression in the brain is limited and often inconsistent, despite scRNA-seq data from flySCope indicating broader expression across tissues, including the brain. This discrepancy suggests that many foraging-GAL4 lines may not accurately reflect endogenous foraging expression patterns. To circumvent this issue, we utilized well-characterized tissue-GAL4 drivers to systematically identify tissues where foraging plays a critical role in modulating LMD behavior.

      Our findings revealed that foraging expression in the heart, particularly in fru-positive heart cells, is essential for LMD. This discovery aligns with previous knowledge that foraging is highly enriched in glial cells in the brain, but our new data highlight a previously unrecognized role for cardiac foraging in regulating interval timing behaviors. Furthermore, we demonstrated that calcium activity in these heart cells is dynamically regulated by social context, suggesting that these cells play a crucial role in modulating male mating investment.

      We believe this new analysis addresses the reviewer’s concerns by providing a more robust and consistent approach to studying foraging function, focusing on its role in the heart rather than relying on potentially unreliable brain expression data. We hope these findings meet the reviewer’s expectations and provide a clearer understanding of foraging’s role in mating duration.

      Comment ____4. Rescue Experiment Data (Fig. S2L): Could additional data be provided to demonstrate the rescue effect using the c61-Gal4 driver, similar to what was observed with the 30y-Gal4 driver? How would such data enhance the study's conclusions regarding the specificity and robustness of the foraging gene's role in LMD?

      Answer: We appreciate the reviewer’s suggestion to provide additional rescue experiment data using the c61-GAL4 driver, similar to the results obtained with the 30y-GAL4 driver. While we do not currently have a UAS-for line to perform direct rescue experiments with c61-GAL4, we have conducted extensive follow-up experiments using both 30y-GAL4 driver to further validate the role of foraging in LMD behavior. These experiments consistently demonstrated that foraging knockdown in cells targeted by these drivers disrupts LMD, reinforcing the specificity and robustness of foraging’s role in interval timing.

      Additionally, our revised manuscript includes new findings that highlight the critical role of foraging expression in fru-positive heart neurons for generating male-specific mating investment. These heart neurons exhibit dynamic calcium activity changes in response to social context, further supporting the idea that foraging modulates LMD through both neuronal and non-neuronal mechanisms. While we acknowledge that direct rescue data with c61-GAL4 would strengthen the study, we believe the combination of 30y-GAL4 and c61-GAL4 knockdown results, along with the newly identified role of heart neurons, provides compelling evidence for foraging’s role in LMD.

      In addition, we have confirmed that the 30y-GAL4 driver labels fru-positive heart cells, further supporting the critical role of foraging expression in these cells for generating male-specific mating investment. This finding aligns with our broader results, demonstrating that foraging function in fru-positive heart neurons is essential for modulating interval timing behaviors, particularly LMD. We hope these additional analyses address the reviewer’s concerns and enhance the study’s conclusions regarding the specificity and robustness of foraging function in interval timing behaviors. We have incorporated the following findings into the main text:

      “Therefore, we conclude that the knockdown and genetic rescue effects observed with the Pdfr3A-GAL4 driver (Fig. 3J and 3N) and the 30y-GAL4 driver (Fig. 4A, S2A, and S2L) are attributable to their expression in the heart. In summary, our findings demonstrate that fru-positive heart cells expressing foraging and Pdfr play a critical role in mediating LMD behavior.”


      Reviewer #2

      General Comment: The authors nicely demonstrated that the Drosophila for gene is involved in the plastic LMD behavior that serves as a model for interval timing. For is widely expressed in the body, they have tentatively localized the LMD-relevant for functioning to the ellipsoid body of the central complex.

      Answer: We sincerely thank the reviewer for their positive feedback on our manuscript and their recognition of our findings regarding the role of the foraging gene in modulating plastic LMD behavior as a model for interval timing. In addition to its function in the ellipsoid body (EB) of the central complex, we have identified a novel and critical role for foraging in fru-positive heart neurons. These neurons are essential for regulating male-specific mating investment, as demonstrated by dynamic calcium activity changes in response to social context. This discovery expands our understanding of foraging’s pleiotropic roles, highlighting its function not only in neural circuits but also in non-neuronal tissues, particularly the heart, to modulate interval timing behaviors. We believe these findings provide a more comprehensive view of how *foraging* integrates genetic, neural, and physiological mechanisms to regulate complex behaviors. We hope this additional insight into the role of fru-positive heart neurons further strengthens the manuscript and aligns with the reviewer’s interest in the broader implications of foraging function.


      Major concerns: __ Comment 1.__ Please clarify how a loss-of-function forS allele can be dominant in the presence of overactive forR allele? In the same vein, please clarify how does the forR/forS transgeterozygote supports your hypothesis that high levels of PKG activity disrupt SMD and low levels of it disrupt LMD?

      Answer: We thank the reviewer for her/his insightful questions regarding the dominance of the forS allele in the presence of the overactive forR allele and the implications of the forR/forS transheterozygote phenotype. As the Reviewer noted, the forR allele is associated with higher PKG activity, while the forS allele exhibits lower PKG activity. The disruption of SMD in the presence of a single forR allele can be explained by the excessive PKG activity, which may hyperactivate or desensitize neural circuits required for SMD. Conversely, the forS homozygote disrupts LMD, suggesting that a minimum threshold of PKG activity is necessary for LMD generation.

      The forR/forS transheterozygote, which disrupts both LMD and SMD, presents an intriguing case. Unlike forR/+ or forS/+ heterozygotes, which show intact behaviors due to intermediate PKG activity levels, the forR/forS combination results in conflicting PKG activity levels that likely destabilize shared pathways required for both behaviors. We propose two hypotheses to explain this phenomenon:

      1. Metabolic Disruption: The foraginggene mediates adult plasticity and gene-environment interactions, particularly under conditions of food deprivation (Kent 2009). It influences body fat, carbohydrate metabolism, and gene expression levels, leading to metabolic and behavioral gene-environment interactions (GEI). In forR/forStransheterozygotes, the metabolic changes induced by each allele may accumulate without proper regulatory mechanisms, disrupting the male’s internal metabolic state and impairing the ability to accurately measure interval timing.

      Neuronal Polymorphism: The foraginggene regulates neuronal excitability, synaptic transmission, and nerve connectivity (Renger 1999). The forRand forS alleles may induce distinct neuronal polymorphisms, such as altered synaptic terminal morphology, which could lead to conflicting circuit dynamics in transheterozygotes. This neuronal mismatch may explain why forR/forS flies exhibit disrupted behaviors, unlike heterozygotes with a wild-type allele.

      These findings align with prior studies showing that PKG activity must be tightly regulated within context-dependent ranges for optimal behavior. The foraging gene’s pleiotropic roles, including its influence on metabolic and neural pathways, highlight the importance of allelic balance in maintaining behavioral robustness. The forR/forS transheterozygote phenotype underscores the complexity of foraging’s role in interval timing, where extreme or mismatched PKG activity levels disrupt circuit-specific thresholds critical for distinct behaviors. We hope this explanation clarifies the dominance effects and the role of PKG activity in LMD and SMD, and we have incorporated these insights into the revised manuscript to strengthen our discussion of foraging’s pleiotropic functions.

      We provide a concise explanation of this hypothesis in the Discussion section, as outlined below:

      “The foraging gene plays a critical role in regulating interval timing behaviors, with its allelic variants, rover and sitter, exhibiting distinct effects on LMD and SMD. These differences are primarily driven by their opposing impacts on cGMP-dependent protein kinase (PKG) activity. The forR allele, associated with higher PKG activity, disrupts SMD while maintaining normal LMD (Fig. 1A), suggesting that elevated PKG levels may hyperactivate or desensitize neural circuits specific to SMD processes. Conversely, the forS allele, characterized by lower PKG activity, impairs LMD but not SMD (Fig. 1B), indicating that reduced PKG activity fails to meet the neuromodulatory thresholds required for LMD coordination. The forR/forS transheterozygotes, which disrupt both LMD and SMD (Fig. 1C), reveal a complex interaction between these alleles, likely due to conflicting PKG activity levels or metabolic and neuronal polymorphisms that destabilize shared pathways. This phenomenon underscores the foraging gene’s pleiotropic roles, where allelic balance fine-tunes PKG activity to maintain behavioral robustness, while extreme or mismatched levels disrupt circuit-specific thresholds critical for distinct memory processes [6,10] .

      The foraging gene’s influence on interval timing behaviors extends beyond neural circuits to include metabolic and synaptic regulation. The intact behaviors observed in forR/+ or forS/+ heterozygotes suggest that intermediate PKG activity levels balance circuit dynamics, allowing for normal LMD and SMD. However, the dual deficits in forR/forS transheterozygotes highlight the importance of allelic balance, as conflicting PKG levels may lead to systemic disruptions in both metabolic and neural pathways. This aligns with previous studies showing that foraging mediates adult plasticity and gene-environment interactions, particularly under stress conditions, and regulates synaptic terminal morphology and neuronal excitability [29,77]. The gene’s role in integrating genetic and environmental cues further emphasizes its central role in adaptive behaviors. Collectively, these findings illustrate the complex interplay between PKG activity, neural circuits, and metabolic regulation in shaping interval timing behaviors, highlighting the foraging gene as a key modulator of behavioral plasticity in Drosophila [3,6,77].”

      Comment 2. Please consider removing lines 193-201 & Fig 3G,H, since abruptly and briefly returning to SMD could distract the reader and hinder the flow.

      Answer: We sincerely thank the reviewer for her/his suggestion to improve the flow of the manuscript. In response to reviewer’s feedback, we have removed Figure 3G-H and the related text (lines 193-201) from the main text. While the data on SMD behavior provided additional insights into the role of foraging in gustatory modulation via sNPF-expressing peptidergic neurons, we agree that its inclusion at this point in the manuscript could distract from the primary focus on LMD behavior and interval timing.

      Comment 3. Please use more specific Gal4 drivers to identify the exact subset of the EB-RNs where for function is necessary for LMD. Please note that Taghert lab already identified Pdfr+ EB-RN subset, and in contradiction to your findings, demonstrated that Cry is expressed in these Pdfr+ EB neurons

      Answer: We thank the reviewer for their suggestion to use more specific GAL4 drivers to identify the exact subset of EB ring neurons (EB-RNs) where foraging function is necessary for LMD. In response, we utilized the EB-split-GAL4 driver SS00096, which has been previously employed to map the neuroanatomical ultrastructure of the EB (Turner-Evans 2020). Knockdown of foraging using this refined EB driver disrupted LMD behavior, confirming that foraging function in the EB is indeed crucial for interval timing.

      Regarding the reviewer’s observation about the Taghert lab’s findings on Pdfr+ EB-RNs and the expression of Cry in these neurons, we acknowledge this discrepancy. However, during the revision process, we discovered that foraging and Pdfr are co-expressed not only in EB neurons but also in fru-positive heart neurons, which play a complementary role in modulating LMD behavior. This finding suggests that the apparent contradiction may arise from the dual-tissue involvement of foraging in both EB neurons and heart cells. While foraging function in the EB is critical, its role in heart neurons may provide an additional layer of regulation for interval timing behaviors, potentially compensating for or interacting with EB-related mechanisms.

      We have incorporated these insights into the revised manuscript, emphasizing the importance of both EB and heart neurons in mediating LMD behavior. This dual-tissue perspective offers a more comprehensive understanding of foraging’s role in interval timing and addresses the potential discrepancies highlighted by the reviewer. We hope this clarification resolves the reviewer’s concerns and strengthens the manuscript’s conclusions regarding the neural and non-neural mechanisms underlying foraging function.

      Comment 4. Please clarify how do you think for and Pdfr signaling molecularly interact in these neurons? Since your work doesn't implicate the for+ AL neurons, please remove lines 260-269.Please clarify if the Pdfr+ for+ EB neurons are also fru+.The lacZ staining in Fig5A-B is atypical in having a mosaic-like pattern. Please replace the image.

      Answer: We thank the reviewer for her/his thoughtful questions regarding the molecular interaction between foraging and Pdfr signaling, as well as their observations on the atypical lacZ staining pattern. Below, we address each point in detail:

      1. Molecular Interaction Between foragingand PdfrSignaling: Our tissue-specific driver screening indicates that Pdfr and foraging do not co-express in the same neurons within the brain. Instead, we found that Pdfr and foraging are co-expressed in fru-positive heart cells, suggesting that PDF-Pdfr signaling in these cells modulates calcium activity in pericardial cells (PCs) in a social context-dependent manner. This finding aligns with our previous work showing that PDF signaling is crucial for LMD behavior (Kim 2013). We propose that PDF-Pdfr signaling operates not only through the brain’s sLNv to LNd neuronal circuit but also through a brain-to-heart signaling axis, influencing behaviors and physiological processes across multiple tissues.

      Removal of Lines 260-269: As suggested, we have removed lines 260-269, which discussed for+ AL neurons, as our findings do not implicate these neurons in LMD regulation. This revision helps streamline the manuscript and maintain focus on the relevant neural and cardiac mechanisms.

      Clarification on Pdfr+for+EB Neurons and fru Expression: While our data do not directly address whether Pdfr+ for+ EB neurons are also fru+, we have confirmed that foraging and Pdfr co-express in fru-positive heart cells. This suggests that fru may play a role in integrating foraging and Pdfr signaling in non-neuronal tissues, particularly in the heart, to regulate LMD behavior.

      Replacement of lacZ Staining Images: During the revision process, we extensively examined multiple foraging-GAL4lines and found that foragingexpression in the brain is limited and often inconsistent, despite scRNA-seq data from flySCope indicating broader expression across tissues, including the brain. This discrepancy suggests that many foraging-GAL4 lines may not accurately reflect endogenous foraging expression patterns. To circumvent this issue, we utilized well-characterized tissue-GAL4 drivers to systematically identify tissues where foraging plays a critical role in modulating LMD behavior. Our findings revealed that foraging expression in the heart, particularly in fru-positive heart cells, is essential for LMD. This discovery aligns with previous knowledge that foraging is highly enriched in glial cells in the brain, but our new data highlight a previously unrecognized role for cardiac foraging in regulating interval timing behaviors. Furthermore, we demonstrated that calcium activity in these heart cells is dynamically regulated by social context, suggesting that these cells play a crucial role in modulating male mating investment. We believe this new analysis addresses the reviewer’s concerns by providing a more robust and consistent approach to studying foraging function, focusing on its role in the heart rather than relying on potentially unreliable brain expression data. We hope these findings meet the reviewer’s expectations and provide a clearer understanding of foraging’s role in mating duration.

      We hope these revisions meet the Reviewer’s expectations and provide a clearer understanding of the interplay between foraging and Pdfr signaling in interval timing behaviors.

      Comment 5. Please consider removing lines 303-312, since this negative result may dilute your final conclusions without adding strong factual value.

      Answer: We appreciate the reviewer's suggestion regarding lines 303-312. Upon careful consideration, we believe this paragraph provides important context about the roles of dsx-positive and fru-positive cells in foraging behavior. Specifically, it highlights that the foraging function is associated with fru-positive cells rather than dsx-positive cells, which is a key distinction in our study. This information is relevant to understanding the broader implications of our findings, as it underscores the functional specificity of these genes in regulating behavior. However, to address the reviewer's concern, we have revised the paragraph to ensure it is more concise and directly tied to the study's conclusions. We have also integrated additional data from the new manuscript to further strengthen the factual value of this section. We hope this adjustment strikes the right balance between maintaining necessary context and avoiding any dilution of the final conclusions. Thank you for this thoughtful feedback.

      __Minor concerns: __

      __Comment 6. __Minor points: In the intro please mention other interval timing mechanisms and their underlying molecular mechanisms (e.g., CREB work of Crickmore lab). Please provide a better rationale for why you thought for is a good candidate for LMD? In line 124, when you start to talk about larval neurons - please specify which neurons you are referring to. In Fig 2E,G,H - 'glia' should be replaced with 'neurons'.

      Answer: We appreciate the reviewer’s insightful comments regarding our conclusion linking LMD to interval timing behavior. Current research by Crickmore et al. has shed light on how mating duration in Drosophila serves as a powerful model for exploring changes in motivation over time as behavioral goals are achieved. For instance, at approximately six minutes into mating, sperm transfer occurs, leading to a significant shift in the male's nervous system: he no longer prioritizes sustaining the mating at the expense of his own survival. This change is driven by the output of four male-specific neurons that produce the neuropeptide Corazonin (Crz). When these Crz neurons are inhibited, sperm transfer does not occur, and the male fails to downregulate his motivation, resulting in matings that can last for hours instead of the typical ~23 minutes (Thornquist 2020).

      Recent research by Crickmore et al. has received NIH R01 funding (Mechanisms of Interval Timing, 1R01GM134222-01) to explore mating duration in Drosophila as a genetic model for interval timing. Their work highlights how changes in motivation over time can influence mating behavior, particularly noting that significant behavioral shifts occur during mating, such as the transfer of sperm at approximately six minutes, which correlates with a decrease in the male's motivation to continue mating (Thornquist 2020). These findings suggest that mating duration is not only a behavioral endpoint but may also reflect underlying mechanisms related to interval timing.

      In addition to the efforts of Crickmore's group to connect mating duration with a straightforward genetic model for interval timing, we have previously published several papers demonstrating that LMD and SMD can serve as effective genetic models for interval timing within the fly research community. For instance, we have successfully connected SMD to an interval timing model in a recently published paper (Lee 2023), as detailed below:

      "We hypothesize that SMD can serve as a straightforward genetic model system through which we can investigate "interval timing," the capacity of animals to distinguish between periods ranging from minutes to hours in duration.....

      In summary, we report a novel sensory pathway that controls mating investment related to sexual experiences in Drosophila. Since both LMD and SMD behaviors are involved in controlling male investment by varying the interval of mating, these two behavioral paradigms will provide a new avenue to study how the brain computes the ‘interval timing’ that allows an animal to subjectively experience the passage of physical time (Buhusi & Meck, 2005; Merchant et al, 2012; Allman et al, 2013; Rammsayer & Troche, 2014; Golombek et al, 2014; Jazayeri & Shadlen, 2015)."

      Lee, S. G., Sun, D., Miao, H., Wu, Z., Kang, C., Saad, B., ... & Kim, W. J. (2023). Taste and pheromonal inputs govern the regulation of time investment for mating by sexual experience in male Drosophila melanogaster. PLoS Genetics, 19(5), e1010753.

      We have also successfully linked LMD behavior to an interval timing model and have published several papers on this topic recently (Huang 2024,Zhang 2024,Sun 2024).

      Sun, Y., Zhang, X., Wu, Z., Li, W., & Kim, W. J. (2024). Genetic Screening Reveals Cone Cell-Specific Factors as Common Genetic Targets Modulating Rival-Induced Prolonged Mating in male Drosophila melanogaster. G3: Genes, Genomes, Genetics, jkae255.

      Zhang, T., Zhang, X., Sun, D., & Kim, W. J. (2024). Exploring the Asymmetric Body’s Influence on Interval Timing Behaviors of Drosophila melanogaster. Behavior Genetics, 54(5), 416-425.

      Huang, Y., Kwan, A., & Kim, W. J. (2024). Y chromosome genes interplay with interval timing in regulating mating duration of male Drosophila melanogaster. Gene Reports, 36, 101999.

      Finally, in this context, we have outlined in our INTRODUCTION section below how our LMD and SMD models are related to interval timing, aiming to persuade readers of their relevance. We hope that the reviewer and readers are convinced that mating duration and its associated motivational changes such as LMD and SMD provide a compelling model for studying the genetic basis of interval timing in Drosophila.

      “The mating duration (MD) of male fruit flies, Drosophila melanogaster, serves as an excellent model for studying interval timing behaviors. In Drosophila, two notable interval timing behaviors related to mating duration have been identified: Longer-Mating-Duration (LMD), which is observed when males are in the presence of competitors and extends their mating duration [15–17] and Shorter-Mating-Duration (SMD), which is characterized by a reduction in mating time and is exhibited by sexually experienced males [18,19]. The MD of male fruit flies serves as an excellent model for studying interval timing, a process that can be modulated by internal states and environmental contexts. Previous studies by our group (Kim 2013,Kim 2012,Zhang 2024,Lee 2023,Huang 2024) and others (Thornquist 2020,Crickmore 2013,Zhang 2019,Zhang 2021) have established robust frameworks for investigating MD using advanced genetic tools, enabling the dissection of neural circuits and molecular mechanisms that govern interval timing.

      The foraging gene emerged as a strong candidate for regulating LMD due to its well-documented role in behavioral plasticity and decision-making processes (Kent 2009,Alwash 2021,Anreiter 2019). The foraging gene encodes a cGMP-dependent protein kinase (PKG), which has been implicated in modulating foraging behavior, aggression, and other context-dependent behaviors in Drosophila. Its involvement in these processes suggests a potential role in integrating environmental cues and internal states to regulate interval timing, such as LMD. Furthermore, the molecular mechanisms underlying interval timing have been explored in other contexts, such as the work of the Crickmore et al., which has demonstrated the critical role of CREB (cAMP response element-binding protein) in regulating behavioral timing and plasticity. CREB-dependent signaling pathways, along with other molecular players like PKG, provide a broader framework for understanding how interval timing is orchestrated at the neural and molecular levels (Thornquist 2020,Zhang 2016,Zhang 2021,Zhang 2019,Crickmore 2013,Zhang 2023). By investigating foraging in the context of LMD, we aim to uncover how specific genetic and neural mechanisms fine-tune interval timing in response to social and environmental cues, contributing to a deeper understanding of the principles governing behavioral adaptation.”

      When describing larval neurons, we provide specific references to ensure clarity and accuracy, as outlined below:

      “Moreover, the cultured giant neural characteristics of these phenotypes are distinctly different [29].”

      We thank the reviewer for catching this error. We have corrected the incorrect label "Glia" to "Neuron" in Figures 2E, 2G, and 2H.

      Reviewer #3

      General Comment: This manuscript explores the foraging gene's role in mediating interval timing behaviors, particularly mating duration, in Drosophila melanogaster. The two distinct alleles of the foraging gene-rover and sitter-demonstrate differential impacts on mating behaviors. Rovers show deficiencies in shorter mating duration (SMD), while sitters are impaired in longer mating duration (LMD). The gene's expression in specific neuronal populations, particularly those expressing Pdfr (a critical regulator of circadian rhythms), is crucial for LMD. The study further identifies sexually dimorphic patterns of foraging gene expression, with male-biased expression possibly in the ellipsoid body (EB) being responsible for regulating LMD behavior. The findings suggest that the foraging gene operates through a complex neural circuitry that integrates genetic and environmental factors to influence mating behaviors in a time-dependent manner. Additionally, restoring foraging expression in Pdfr-positive cells rescues LMD behavior, confirming its central role in interval timing related to mating.

      Answer: We sincerely thank the reviewer for her/his thoughtful and comprehensive synthesis of our work, as well as their recognition of its key contributions. We are grateful that the reviewer highlighted the central findings of our study, including the allele-specific roles of forR (rover) and forS (sitter) in regulating distinct interval timing behaviors—specifically, the deficiencies of rovers in SMD and sitters in LMD. We also appreciate the reviewer’s emphasis on the sexually dimorphic expression of the *foraging* gene, particularly its male-biased expression in the ellipsoid body (EB), and its critical role in Pdfr-positive neurons for mediating LMD.

      We agree with the reviewer that the interplay between genetic factors (e.g., allelic variation in foraging) and environmental cues (e.g., circadian rhythms via Pdfr pathways) underscores the complexity of interval timing regulation. The rescue of LMD behavior by restoring foraging expression in Pdfr cells further supports our hypothesis that foraging operates through specialized neural circuits to integrate temporal and environmental inputs. This finding aligns with broader studies on interval timing mechanisms, such as the work of the Crickmore lab on CREB-dependent pathways, which have demonstrated how molecular and neural mechanisms converge to regulate behavioral plasticity and timing.

      In the revised manuscript, we will expand on these points to strengthen the discussion of foraging’s pleiotropic roles in time-dependent mating strategies and its potential links to evolutionary fitness. Specifically, we will incorporate additional insights from the new manuscript, including further evidence of how foraging balances behavioral plasticity with metabolic and neural demands, and how its expression in specific neuronal populations, such as the EB, contributes to adaptive behaviors. These updates will provide a more comprehensive understanding of the gene’s role in interval timing and its broader implications for behavioral adaptation. Once again, we thank the Reviewer for their valuable feedback, which has helped us refine and enhance the presentation of our findings.

      __Major concerns: __

      Comment 1. The sexually dimorphic expression of the foraging gene is not convincing. Specifically, the lacZ signal in the male brain is not representative.

      __Answer:____ __We sincerely thank the reviewer for her/his insightful comment regarding the sexually dimorphic expression of the foraging gene. We agree that the lacZ signal in the male brain, as presented, may not be fully representative, and we appreciate the reviewer’s observation regarding the discrepancies in signal intensity, which we attribute to variations in dissection procedures. While replacing the current dataset with a new one is feasible, we have chosen to address this concern by shifting our focus to a more reliable and validated approach using tissue-specific GAL4 drivers combined with foraging-RNAi.

      During the revision process, we conducted an extensive examination of multiple foraging-GAL4 lines and found that foraging expression in the brain is often limited and inconsistent, despite scRNA-seq data from flySCope indicating broader expression across tissues, including the brain. This discrepancy suggests that many foraging-GAL4 lines may not accurately reflect endogenous foraging expression patterns. To overcome this limitation, we employed well-characterized tissue-specific GAL4 drivers to systematically identify tissues where foraging plays a critical role in modulating LMD behavior.

      Our findings revealed that foraging expression in the heart, particularly in fru-positive heart cells, is essential for LMD. This discovery aligns with previous knowledge that foraging is highly enriched in glial cells in the brain, but our new data highlight a previously unrecognized role for cardiac foraging in regulating interval timing behaviors. Furthermore, we demonstrated that calcium activity in these heart cells is dynamically regulated by social context, suggesting that these cells play a crucial role in modulating male mating investment.

      By focusing on the heart and leveraging more reliable genetic tools, we believe this new analysis addresses the Reviewer’s concerns and provides a more robust and consistent approach to studying foraging function. We hope these findings meet the reviewer’s expectations and offer a clearer understanding of foraging’s role in mating duration. We are grateful for the Reviewer’s constructive feedback, which has significantly strengthened our study.

      Comment 2____. Key control genotypes are missing.

      Answer: We thank the Reviewer for raising this important point regarding control genotypes. We would like to clarify that all necessary control experiments have indeed been conducted, and the results are included in the manuscript. Detailed descriptions of these controls, including the specific genotypes and experimental conditions, are provided in the Methods section. For example, control experiments were performed to account for genetic background effects, GAL4 driver activity, and RNAi efficiency, ensuring the reliability and specificity of our findings. In the revised manuscript, we have further emphasized these control experiments and their outcomes to ensure transparency and reproducibility. We have also included additional details in the Results section to highlight how these controls validate our key findings. For instance, control genotypes lacking the foraging-RNAi or GAL4 drivers were used to confirm that the observed phenotypes are specifically due to the manipulation of foraging expression.

      We appreciate the Reviewer’s attention to this critical aspect of our study and hope that the additional clarification and emphasis on control experiments in the revised manuscript address their concerns. If there are specific control genotypes or experiments the reviewer would like us to include or elaborate on further, we would be happy to do so. Thank you for this valuable feedback.

      Comment 3____.fru is not expressed in the EB, so the authors may need to reconcile their model in figure 5G.

      Answer: We thank the reviewer for her/his insightful comment regarding the expression of fru in the ellipsoid body (EB) and its relevance to our model in Figure 5G. We agree that fru is not expressed in the EB, and we acknowledge the need to reconcile this aspect of our model. While initial evidence suggested a potential role for the EB in regulating foraging-dependent LMD behavior, further investigation has revealed that neurons outside the EB are more likely to be involved in this process.

      During our revision, we identified fru-positive heart neurons that coexpress Pdfr and foraging, which appear to play a critical role in modulating LMD behavior. These findings suggest that the heart, rather than the EB, may be a key site for foraging function in the context of interval timing and mating duration. Specifically, we demonstrated that calcium activity in these fru+ heart cells is dynamically regulated by social context, further supporting their role in modulating male mating investment.

      In light of these new findings, we revised Figure 5G as new Figure 6H and the accompanying model to reflect the updated understanding that fru+ heart neurons, rather than EB neurons, are central to the regulation of LMD behavior. This adjustment aligns with our broader goal of accurately representing the neural and molecular mechanisms underlying foraging’s role in interval timing. We appreciate the Reviewer’s feedback, which has helped us refine our model and strengthen the manuscript. We hope these revisions address their concerns and provide a clearer and more accurate representation of our findings. Thank you for this valuable input.

      Minor concerns: Comment 4____.

      Line 32, what do you mean by "overall success of the collective"

      Line 124-126: I suggest not using "sitter neurons" or "rover neurons". Line 301, typo with "male-specific".

      Answer: We thank the Reviewer for their careful reading and constructive feedback. We have addressed each of their comments as follows:

      1. Line 32: We agree with the reviewer that the phrase "overall success of the collective" was unclear and have completely revised the Abstract to remove this expression. The updated Abstract now provides a clearer and more concise summary of our findings.

      Lines 124-126: We appreciate the reviewer’s suggestion to avoid using the terms "sitter neurons" or "rover neurons," as they could be misleading. We have revised this phrasing to "neurons of sitter/rover allele" to more accurately reflect the genetic context of our study.

      Line 301: We have corrected the typo with "male-specific" to ensure accuracy and clarity in the text.

      We hope these revisions address the Reviewer’s concerns and improve the overall quality of the manuscript. Thank you for your valuable input, which has helped us refine our work.

      __Strengths and limitations of the study:______ This study presents a significant advancement in understanding the foraging gene's role in regulating mating behaviors through interval timing, and identifies the critical role of Pdfr-expressing neurons in the ellipsoid body for LMD. However, it does not fully explain how these neurons specifically modulate timing mechanisms. The lack of in-depth mechanistic exploration of how these neurons interact with other circuits involved in memory and decision-making leaves gaps in the understanding of the exact pathways influencing interval timing. Also, the study focuses more on LMD behaviors and the neural circuits involved, leaving the mechanisms underlying SMD comparatively underexplored.

      __Answer:____ __We thank the reviewer for her/his thoughtful assessment of the strengths and limitations of our study. We agree that our work represents a significant advancement in understanding the role of the foraging gene in regulating mating behaviors through interval timing, particularly in identifying the critical role of Pdfr-expressing neurons in the ellipsoid body (EB) for long mating duration (LMD). However, we acknowledge that the initial manuscript did not fully elucidate how these neurons specifically modulate timing mechanisms or interact with other neural circuits involved in memory and decision-making.

      In response to this feedback, we have conducted additional experiments and analyses, which are now included in the revised manuscript. Specifically, we identified fru-positive heart neurons that coexpress Pdfr and foraging, and we demonstrated their essential role in LMD using calcium imaging (CaLexA). These findings provide a more comprehensive mechanistic understanding of how foraging influences interval timing through cardiac activity, which is dynamically regulated by social context. This new evidence addresses the reviewer’s concern by offering a clearer picture of the neural and molecular pathways underlying LMD.

      Regarding SMD behavior, we agree that it was comparatively underexplored in the initial manuscript. However, we have extensively studied SMD in other contexts, as highlighted in several of our previously published papers. These studies have investigated the sensory mechanisms, memory processes, peptidergic signaling, and clock gene functions associated with SMD (Zhang 2024,Zhang 2024,Sun 2024,Wong 2019,Kim 2024,Lee 2023). While the current manuscript focuses primarily on LMD, we will include a discussion of these findings to provide a more balanced perspective on the mechanisms underlying both LMD and SMD.

      We believe these revisions address the Reviewer’s concerns and significantly strengthen the manuscript by providing a more detailed mechanistic understanding of foraging’s role in interval timing and mating behaviors. We are grateful for the Reviewer’s constructive feedback, which has helped us improve the depth and clarity of our study. Thank you for your valuable input.

      __Advance:______ This study brings a novel perspective to the foraging gene, previously known for its role in regulating food-search behavior. It demonstrates that foraging is also involved in interval timing, a cognitive process integral to mating behaviors in Drosophila. This discovery challenges the assumption that foraging is solely related to foraging strategies, revealing a broader function in time-based decision-making processes.

      Answer: We sincerely thank the reviewer for her/his insightful comments and for recognizing the novel contributions of our study. We are pleased that the reviewer highlighted how our work expands the understanding of the foraging gene, which was previously primarily associated with food-search behavior. By demonstrating its role in interval timing—a cognitive process critical to mating behaviors in Drosophila—we challenge the conventional assumption that foraging is solely related to foraging strategies. Instead, our findings reveal its broader function in time-based decision-making processes, particularly in the context of mating duration.

      This discovery not only advances our understanding of the pleiotropic roles of foraging but also opens new avenues for exploring how genetic and neural mechanisms integrate temporal and environmental cues to regulate complex behaviors. We are grateful for the reviewer’s support and acknowledgment of the significance of our findings. Thank you for this valuable feedback.

      __Audience:______ The study offers significant value to several specialized research communities, including behavioral genetics and evolutionary biology, especially those using the Drosophila model. This could inform future research on other behaviors that depend on precise timing and decision-making.

      Answer: We sincerely thank the reviewer for her/his thoughtful comment and for recognizing the broad relevance of our study. We are pleased that the reviewer highlighted the significant value our work offers to be specialized research communities, particularly in behavioral genetics and evolutionary biology, as well as to researchers using the Drosophila model. By elucidating the role of the foraging gene in interval timing and its impact on mating behaviors, our findings provide a foundation for future research on other behaviors that rely on precise timing and decision-making. This study not only advances our understanding of the genetic and neural mechanisms underlying interval timing but also opens new avenues for exploring how similar processes may operate in other species or contexts. We hope our work will inspire further investigations into the interplay between genetic variation, neural circuits, and environmental cues in shaping adaptive behaviors. Thank you for your valuable feedback and for acknowledging the potential impact of our research.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This manuscript explores the foraging gene's role in mediating interval timing behaviors, particularly mating duration, in Drosophila melanogaster. The two distinct alleles of the foraging gene-rover and sitter-demonstrate differential impacts on mating behaviors. Rovers show deficiencies in shorter mating duration (SMD), while sitters are impaired in longer mating duration (LMD). The gene's expression in specific neuronal populations, particularly those expressing Pdfr (a critical regulator of circadian rhythms), is crucial for LMD. The study further identifies sexually dimorphic patterns of foraging gene expression, with male-biased expression possibly in the ellipsoid body (EB) being responsible for regulating LMD behavior. The findings suggest that the foraging gene operates through a complex neural circuitry that integrates genetic and environmental factors to influence mating behaviors in a time-dependent manner. Additionally, restoring foraging expression in Pdfr-positive cells rescues LMD behavior, confirming its central role in interval timing related to mating.

      Major comments

      1. The sexually dimorphic expression of the foraging gene is not convincing. Specifically, the lacZ signal in the male brain is not representative.
      2. Key control genotypes are missing.
      3. fru is not expressed in the EB, so the authors may need to reconcile their model in figure 5G.

      Minor comments:

      1. Line 32, what do you mean by "overall success of the collective"
      2. line 124-126: I suggest not using "sitter neurons" or "rover neurons".
      3. line 301, typo with "male-specific".

      Significance

      Strengths and limitations of the study:

      This study presents a significant advancement in understanding the foraging gene's role in regulating mating behaviors through interval timing, and identifies the critical role of Pdfr-expressing neurons in the ellipsoid body for LMD. However, it does not fully explain how these neurons specifically modulate timing mechanisms. The lack of in-depth mechanistic exploration of how these neurons interact with other circuits involved in memory and decision-making leaves gaps in the understanding of the exact pathways influencing interval timing. Also, the study focuses more on LMD behaviors and the neural circuits involved, leaving the mechanisms underlying SMD comparatively underexplored.

      Advance:

      This study brings a novel perspective to the foraging gene, previously known for its role in regulating food-search behavior. It demonstrates that foraging is also involved in interval timing, a cognitive process integral to mating behaviors in Drosophila. This discovery challenges the assumption that foraging is solely related to foraging strategies, revealing a broader function in time-based decision-making processes.

      Audience:

      The study offers significant value to several specialized research communities, including behavioral genetics and evolutionary biology, especially those using the Drosophila model. This could inform future research on other behaviors that depend on precise timing and decision-making.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors nicely demonstrated that the Drosophila for gene is involved in the plastic LMD behavior that serves as a model for interval timing. For is widely expressed in the body, they have tentatively localized the LMD-relevant for functioning to the ellipsoid body of the central complex.

      Major points:

      Please clarify how a loss-of-function forS allele can be dominant in the presence of overactive forR allele? In the same vein, please clarify how does the forR/forS transgeterozygote supports your hypothesis that high levels of PKG activity disrupt SMD and low levels of it disrupt LMD?

      Please consider removing lines 193-201 & Fig 3G,H, since abruptly and briefly returning to SMD could distract the reader and hinder the flow.

      Please use more specific Gal4 drivers to identify the exact subset of the EB-RNs where for function is necessary for LMD. Please note that Taghert lab already identified Pdfr+ EB-RN subset, and in contradiction to your findings, demonstrated that Cry is expressed in these Pdfr+ EB neurons.

      Please clarify how do you think for and Pdfr signaling molecularly interact in these neurons? Since your work doesn't implicate the for+ AL neurons, please remove lines 260-269.

      Please clarify if the Pdfr+ for+ EB neurons are also fru+. The lacZ staining in Fig5A-B is atypical in having a mosaic-like pattern. Please replace the image.

      Please consider removing lines 303-312, since this negative result may dilute your final conclusions without adding strong factual value.

      Minor points: In the intro please mention other interval timing mechanisms and their underlying molecular mechanisms (e.g., CREB work of Crickmore lab). Please provide a better rationale for why you thought for is a good candidate for LMD? In line 124, when you start to talk about larval neurons - please specify which neurons you are referring to. In Fig 2E,G,H - 'glia' should be replaced with 'neurons'.

      Referees cross-commenting

      I find the two other reviewers' comments constructive & insightful. All the reviewers are unanimously urging the authors to change the lacZ figure (5) panel, and to provide a more integrated, coherent model indicating how For, Pdfr, and Fru genes are interacting through a specific neural circuit.

      Significance

      The comprehensive work is based on many behavioral assays, combined with genetics. It shows a novel function of the for gene. However, how for contributes to interval timing is not studied. Nevertheless, their work represents an incremental conceptual advance to the genetic underpinnings of interval timing. The work is primarily targeted to Drosophila neurogeneticists.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study investigates the role of the foraging gene in modulating interval timing behaviors in flies, with a particular focus on mating duration. Using single-cell RNA sequencing and gene knockdown experiments, the research demonstrates the crucial role of foraging gene expression in Pdfr-positive cells for achieving longer mating duration (LMD). The study further identifies key neurons in the ellipsoid body (EB) as essential when the foraging gene is overexpressed, highlighting its specific influence on LMD. The findings suggest that a small subset of EB neurons must express the foraging gene to modulate LMD effectively.

      Questions for Further Exploration:

      (optional) Integration of Neuronal Subsets into a Pathway: The knockdown experiments indicate that a small subset of neurons must express the foraging gene to influence LMD. Could these neurons be integrated into a potential signaling pathway, or being treated as separate components within the brain circuit? How might this integration provide a more cohesive understanding of their role in LMD?

      Genetic Considerations in Gal4 System Usage (Fig. 1D): In the study, the elavc155-Gal4 transgene, located on chromosome I, produces hemizygous males after crossing, while the repo-Gal4 transgene, located on chromosome III, results in heterozygous males. Is there any evidence suggesting that this genetic configuration could impact the experimental outcomes? If so, what steps could be taken to address potential issues?

      Discrepancies in lacZ Signal Intensity (Fig. 5A): The observed discrepancies in lacZ signal intensity on the surface of the male brain have been attributed to the dissection procedure. Is it feasible to replace the current data with a new, more consistent dataset? How might improved dissection techniques mitigate these discrepancies?

      Rescue Experiment Data (Fig. S2L): Could additional data be provided to demonstrate the rescue effect using the c61-Gal4 driver, similar to what was observed with the 30y-Gal4 driver? How would such data enhance the study's conclusions regarding the specificity and robustness of the foraging gene's role in LMD?

      Referees cross-commenting

      The two other reviewers provided important and valuable feedback on this topic. The LacZ figure (5) panel should be replaced as a control. The interaction between For, Pdfr, and Fru could form a circuit involved in fly mating behaviors, even as a hypothesis.

      Significance

      The research highlights the pivotal role of the foragine gene in regulating complex interval timing behaviors in Drosophila. This finding offers valuable insights into the interplay between genetics, environment, and behavior across species. Additionally, it suggests potential multifunctional implications for the ellipsoid body.

  3. Feb 2025
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Wasilewska and colleagues generated tmbim5-/- zebrafish line and demonstrated that tmbim5 loss of function leads to decrease in zebrafish size and induces muscle atrophy. Authors used immunohistochemistry to suggest that tmbim5-/- zebrafish shows reduced glycogen levels in muscle and liver. However, most of the immunohistochemistry is not quantitated and only qualitative differences are shown. Next, the authors measured mitochondrial calcium levels in the brain of tmbim5-/- zebrafish but there was no behavioral phenotype in the fish. It would have be better to measure mitochondrial calcium levels in the muscles of tmbim5-/- zebrafish as phenotype is muscle atrophy. Further, it is reported that the mitochondrial membrane potential and glycogen levels were perturbed in tmbim5-/- zebrafish.

      Next, the authors generated a scl8b1-/- (a probable NCLX ortholog in zebrafish) zebrafish, which did not show any drastic phenotype. However, neither slc8b1 function nor the phenotype of scl8b1-/- zebrafish was well characterized. Further, authors created two double knockout zebrafish lines i.e. tmbim5-/-/mcu-/- and tmbim5-/-/slc8b1-/-. Interestingly, both these lines were viable and do not show any drastic phenotypes. The authors concluded that in these transgenic fishes compensatory and/or alternative mitochondrial Ca2+ mobilization pathways counterbalance the effects of silencing of these proteins.

      Although it is an interesting study, the conclusions are not well supported with the data. At several places only qualitative images are shown and quantitative data is missing. Similarly, Ca2+ imaging in muscles of tmbim5-/- zebrafish is not performed. Finally, no molecular mechanism or molecular details are provided. Though Tmbim5's potential role in EMRE degradation is discussed, it is not experimentally investigated. The quality of the manuscript would significantly enhance if authors perform the suggested experiments.

      Major Comments:

      1. As a potential mechanism, Tmbim5's potential role in EMRE degradation is discussed but it is not experimentally investigated. It is very easy to test this hypothesis. If this is the case, it would be a very good contribution to the field.
      2. On Page 16, authors state that slc8b1 does not constitutes the major mitochondrial Ca2+ efflux transport system. Authors should do calcium imaging experiments just like they did with tmbim5 and mcu double knockouts (data presented in Figure 4C) to make any comments on functioning of slc8b1 in mitochondrial Ca2+ transport. This is important because slc8b1 is only a predictive ortholog of human NCLX and it is not experimentally examined yet.
      3. The data presented in Fig. 4C is very important but it is not fully explained and discussed in the results. Please discuss all the data sets presented in Fig4C in detail. As such, it is very difficult to follow and interpret the data.
      4. In tmbim5-/- zebrafish, what happens to mitochondrial Ca2+ signaling in muscle as phenotype is muscle atrophy only?
      5. Please validate the observation of decreased glycogen levels in tmbim5-/- fish by one more way. Only immunohistochemistry that too without quantitation is not convincing (Fig. 2E-H).

      Minor Comments:

      1. Authors state that tmbim5 loss of function leads to metabolic changes but the only data provided is decrease in glycogen levels. It would be helpful for the authors to focus comments specifically on the data presented in the manuscript to avoid potential over-interpretation.
      2. While discussing Fig4., authors mention that Tmbim5 may act as a MCU independent Ca2+ uptake mechanism and therefore they crossed tmbim5 mutants with mcu KO fish. But from the data presented in Fig.3 and as concluded by the authors themselves tmbim5 mutants do not show changes in the mitochondrial Ca2+ levels. Authors may clarify this point.
      3. Does tmbim5 contributes to mitochondrial Ca2+ uptake in presence or along with MCU. Further analysis of Fig4C may shed some light on this. Authors should test significance between tmbim5-/- and WT as well as between tmbim5-/- and tmbim5+/+ in mcu-/- background.
      4. Please check the labeling on traces in Fig3D.
      5. Please include quantitation of data presented in EV2E-F.
      6. Please include quantitation of immunohistochemistry data presented in 2E-H.

      Referee cross-commenting

      Several comments are common between the reviewers highlighting that those experiments are critical. Secondly, I agree with the concerns raised by other two reviewers.

      Significance

      In this study, authors report couple of new transgenic zebrafish lines. However, further characterization of slc8b1-/- is required. This study reinforces the existing idea that there are very robust compensatory mechanisms that maintain mitochondrial Ca2+ homeostasis. While the work provides useful insights, it could benefit from a broader scope to provide substantial advancement to existing knowledge.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: The work of Wasilewska et al. focusses on the MCU independent basal Ca2+ uptake mechanisms and the effects of MCU, NCLX, and TMBIM5 KO on Zebrafish Ca2+ homeostasis, mortality, anatomy and metabolism. The authors found evidence that tmbim5 potentially has a bidirectional mode of operation and is able to extrude Ca2+ from the matrix as well as transfer Ca2+ into mitochondria. Further, a reduced membrane potential in tmbim5-/- fish and altered metabolism was found. While the conclusion drawn are well argumented, a few points have to be addressed.

      Major Points:

      1. While all mitochondrial genes seem collectively reduced compared to control, it would be interesting to assess the mitochondrial mass and/or mitochondrial turnover rate in regard to e.g. mitophagy. The reduced membrane potential could lead to PINK1 accumulation on the outer mitochondrial membrane to mediate mitophagy leading overall to reduced mitochondrial count and mass.
      2. The characterization of slc8b1-KO fish needs some improvement to facilitate a better understanding of the molecular interactions of slc8b1 and tmbim5. This would also greatly improve the understanding of the phenotypical characterization and behavioral response to CGP.
      3. Functional Ca2+ measurements of the activity of slc8b1 gene product have to be done to ensure a KO phenotype. Especially in light of the surprising results presented in Figure 6A showing an effect of CGP on slc8b1-KO fish but not on tmbim5-KO fish I advise mitochondrial isolation to conduct mitochondrial basal and extrusion Ca2+experiments of slc8b1-KO fish, tmbim5-KO fish, and double KO-fish.

      Minor Points:

      The authors claim that mRNA levels of mitochondrial proteins involved in Ca2+ transport in tmbim5-/- are unaffected (Figure EV3). While the T-tests show no significant alteration, what happens if a 2-way ANOVA shows a more general effect revealed between WT and TMBIM5-/-?

      Significance

      This is a well-designed and carefully executed piece of work. The experimental design is thoughtfully elaborated, and the topic is worthy of investigation. The strengths of this study lie in translating our knowledge of TMBIN5 from single cells to organism and organ function. Moreover, the work provides important new information that will help the scientific community working on mitochondrial regulation AND muscle diseases to understand how ions coordinately regulate mitochondrial function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Although the experimental approach is promising (see below), the results do not significantly expand our current understanding. This is partly due to the challenges of interpreting negative results, which are nonetheless worth reporting. Some of the conclusions and interpretations of the results could benefit from further clarification and contextualization to enhance their impact:

      • Figure 1D: The distribution of fiber size in wt vs. Tmbim5-ko fish shows a notable difference limited to one size range. Can the authors clarify this observation? Could this indicate a switch in fiber type? Is there a correlation between this finding and the differential PAS staining?
      • Figure 3: one of the advantages of the zebrafish model is its transparency, allowing for fluorescence imaging. Unfortunately, this proves to be impossible in the case of cepia2mt. The data provided by the authors show that the fluorescence of this probe does not vary following physiological stimuli. The only change is that induced by CCCP (Fig 3C-D), which according to the authors causes a discharge of mitochondrial calcium. However, the use of CCCP with GFP-based probes should be avoided, as the acidification caused by CCCP treatment leads to quenching of the fluorophore, resulting in a fluorescence decrease which is independent of Ca2+ levels. Although the experimental approach aims to detect dynamic changes in mitochondrial Ca2+ levels, the presented results in Figure 3 do not provide conclusive evidence to support this capability. While significant experimental effort is evident, these findings may require further validation or additional data to strengthen their impact. Alternatively, the authors could remove this Figure 3 and relevant text from the manuscript.
      • Figure 6A: In my opinion, this dataset is impossible to understand. To my knowledge, the precise molecular target of CGP-37157 remains elusive. While CGP is often considered an NCLX inhibitor, this classification lacks definitive experimental support. Although CGP is known to inhibit mitochondrial Na+-dependent Ca2+ extrusion, direct binding of CGP to NCLX has yet to be conclusively demonstrated. With this in mind, the authors show that pharmacological intervention with CGP elicits a distinct phenotype in the fish model. While this effect appears to persist in SLC8B1-KO fish, it is absent in Tmbim5-KO fish, suggesting Tmbim5 as a potential molecular target for CGP. However, this interpretation is inconsistent with the following observations: i) CGP remains effective in Tmbim5/Slc8b1 double-KO fish and ii) Tmbim5-KO fish exhibit no discernible phenotype. A comprehensive explanation that reconciles these findings is sought.
      • Figure 6B: according to the authors, the phenotype induced by CGP treatment is specific because a different substance with a completely different effect, CCCP, causes the same phenotype in both wt and Tmbim5-KO fish. Also in this case, the rationale and reasoning behind this experiment in not very evident. As I see it, CCCP blocks zebrafish motility because it is a metabolic poison, and its effect does not depend on any transporter.

      Significance

      The manuscript submitted by Wasilewska et al investigates the functional relationship between different mitochondrial calcium transporters using zebrafish as a model. The topic is of great interest. In the last 15 years, many mitochondrial calcium transporters have been identified. In some cases, their mechanism is not fully understood, such as in the case of TMBIM5, recently described by some as an H/Ca exchanger, or as a Ca channel by others. Furthermore, the functional relationship between different transporters has so far been studied in a partial and superficial way. I believe that this work is therefore of great interest because it aims to contribute to a fundamental problem that is still poorly studied. The idea of using zebrafish is interesting, as it is an organism that is easy to manipulate and phenotype, and because it is transparent, making it possible to use specific biosensors to characterize mitochondrial calcium dynamics, at least in principle. The paper therefore deserves attention.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 comments

      • *

      Major comments:

      • *

      - Neither data nor code was made available for review. There's only a mention of them being in Figshare with no link. As a consequence and a matter of principle, this study is not publishable without both public data and code. I would recommend using adequate repositories for data and code. Image data can be deposited in a public image data repository such as the BioImage Archive which would ensure that minimal metadata are provided and code could go to a public code repository (e.g. GitLab...) so that it is discoverable and eventual changes can be tracked and visible (for example should any bug be fixed after publication). Also consider depositing the models into the BioImage Model Zoo (https://bioimage.io).

      • *

      We will upload all the code used in the article in GitHub while image data will be deposited in BioImage Archive as suggested by the referee. Method section will be also rewritten.

      • *

      - The use of the term morphology is misleading. Like I expect most readers would, I understand morphology in this context as being related to shape. However, there is no indication that any specific type of information (like shape, texture, size/scale...) is used or learned by the described method. To understand what information the classifiers rely on, it would be interesting to compare with human engineered features extracted from the same ROIs.

      All references to morphology in the text must be removed unless indication can be provided as to what type of information is used by the models.

      • *

      We understand the concern regarding the use of "morphology" and will revise the manuscript to be more precise. Instead of referring broadly to "morphology," we will specify "image-derived features" or "texture and structural features" where applicable.


      Additionally, to address this concern directly, we have performed an analysis comparing our learned features to classical human-engineered features (such as texture and shape descriptors) to better understand what type of information is utilized by the model. These results will be incorporated into the revised manuscript.

      • *

      - The method should be described with more details:

      - How are the window sizes to use determined? Are the two sizes listed in the methods section used simultaneously? What is the effect of this parameter on the performance?

      - How are the ROIs determined? In a grid pattern? Do they overlap? i.e. how does the windowing function work?

      - Predictions seem to be made at the ROI level but it isn't clear if this is always the case. Can inference be made at the level of individual cells?

      • *

      Window Sizes: We will clarify that the two window sizes were chosen based on empirical performance assessments. We will include a specific figure evaluating the impact of window size on classification performance, by expanding the analysis to multiple window sizes and number of training regions.

      ROI Determination: We will describe thoroughly the ROI selection in the Method Section. We will include a comparison between overlapped and non-overlapping grid selection.

      Inference at the Cell Level: While predictions are made at the ROI level (we will clarify the text), we will discuss an additional approach that aggregates ROI-level predictions into a final cell-level classification, which we will add as an optional post-processing step.

      • *

      - What would be the advantages of the proposed subcellular approach compared to learning to classify whole images?

      • *

      We will detail a comparison between subcellular and cellular or whole image classification; the main advantage of this subcellular technique (that will be remarked in the text) is the reduction in the number of images that are required to learn to classify cell types. Nevertheless, other advantages are the robustness to confluency variations (whole-image classification can be biased by confluency differences, while subcellular regions focus on individual cell features) and a fine-grained feature learning.

      • *

      - When fluorescent markers are used, the text isn't clear on what measures have been taken to prevent these markers from bleeding through into the brightfield image. To rule out the possibility that the models learn from bleed-through of the marker into the brightfield image, the staining should be performed after the brightfield image acquisition. Without this, conclusions of the related experiments are fatally flawed.

      • *

      __We appreciate this important point and confirm that all fluorescent staining was performed after brightfield image acquisition, ensuring that no fluorescence contamination influenced model training. We will have explicitly stated this in the Methods section. __

      • *

      - How robust are the models e.g. with respect to culture age and batch effects? Use of a different microscope is mentioned in the methods section. This should be shown, i.e. can a model trained on one microscope accurately predict on data acquired from a different microscope? Does mixing images from different sources for training improve robustness?

      • *

      We have used different cellular batches without any effect on accuracy. We will also include the experiment using another microscope, and we will add new data with/without combination of mixed images from different figures. In summary, we include a new supplementary figure that address the use of distinct and mixed cellular batches and microscopies in terms of accuracy and trained models.

      • *

      - Why not use the Mahalanobis distance in feature space? This would be the natural choice given that PCA has been selected for visualization and would allow to show uncertainty regions in the PCA plots. Could other dimensionality reduction methods show better separation of the groups? Why not train the network for further dimensionality reduction if the goal is to learn a useful feature space?

      We appreciate this suggestion and will include a comparison of Mahalanobis distance-based classification with our existing approach. Regarding dimensionality reduction, we will test additional methods including t-SNE and UMAP as supplementary figures. Finally, while training a network specifically for dimensionality reduction is an interesting alternative, our current pipeline was focused on simplicity and the ample range of techniques that allow to address. However, we include include a discussion on potential future directions where such an approach could be explored.

      Minor comments:

      • *

      - Make sure the language used is clear, e.g. The text describes the method as involving a transformation to black and white followed by thresholding. This doesn't make sense. What is meant by "the set of 300 genes was subjected to Gene Ontology"? Use percent instead of permille in the text for easier reading.

      These minor changes will be addressed in the text, including the percent instead of permille as it was a common point suggested by the referees.

      • *

      - To provide more context, cite previous work that indicates that brightfield images contain exploitable information, e.g.

      - Cross-Zamirski, J.O., Mouchet, E., Williams, G. et al. Label-free prediction of cell painting from brightfield images. Sci Rep 12, 10001 (2022). https://doi.org/10.1038/s41598-022-12914-x

      - Harrison PJ, Gupta A, Rietdijk J, Wieslander H, Carreras-Puigvert J, et al. (2023) Evaluating the utility of brightfield image data for mechanism of action prediction. PLOS Computational Biology 19(7): e1011323. https://doi.org/10.1371/journal.pcbi.1011323

      • *

      We will cite these references in the introduction of the paper.


      Reviewer #2

      • *

      Major comments:

      • *

        • Place this study in context of previous studies that classify cell types. Here are two relevant recent papers, which could provide a good start for properly crediting previous work and placing your contribution in context: PMID: 39819559 (note the "Nucleocentric" approach) and PMID: 38837346. Please seek for papers that use label free for similar applications (which is the main contribution of the current manuscript).*
      • *

      We appreciate this suggestion (shared by reviewer #1) and we will include references to these and other relevant studies on label-free cell classification. We specifically discuss how our approach differs from the "nucleocentric" method in PMID: 39819559 and how our method complements existing work in label-free imaging. We will update both the Introduction and Discussion sections to reflect this improved contextualization.

      • *

      • Many experiments were performed, but we found it hard to follow them and the logic behind each experiment. Please include a Table summarizing the experiments and their full statistics (see below) and also please provide more comprehensive justifications for designing these specific experiments and regarding the experimental details. This will make the reading more fluent.*

      • *

      We will include a summary table in the Methods section that provides an overview of all experiments, detailing:


      -The purpose of each experiment

      -The dataset used

      -The number of images/cells

      -Objective used

      -Cellular confluence

      -Reference to BioImage Archive

      -Model used (reference to Github)

      -Technical / Biological replicates

      -The main conclusions drawn

      -Figure that presents the data


      Additionally, we will revise the Results section to provide clearer justifications for each experiment, improving the logical flow of the manuscript.

      • *

      • The experiments, data acquisition and data reporting details are lacking. 10x objective is reported in the Results and 20x in the Methods. Please explain how the co-culturing (mixed) experiments were performed including co-culturing experiments with varying fractions of each cell type and on what data were the models trained on (Fig. 2F). Differential confluency experiments are not described in the Methods (and not on what confluency levels were the models trained on), this is also true for the detachment experiment. How many cells were acquired in each experiment (it says "20 and 40 images per cell line" but this is a wide range + it is not clear how many cells appear in each image)? How many biological/technical replicates were performed for each experiment? Please report these for each experiment in the corresponding figure legend and show the results on replicates (can be included as Supplementary). "Using a different microscope with the same objective produced similar results (data not shown)" (lines #370-371), please report these results (including what is the "different microscope") in the SI.*

      • *

      We will carefully review and expand the Methods section to provide complete details, as with the Table that we will prepare to address the previous comment and this one. In addition, the co-culturing experiments will explicitly describe how cell fractions were varied and how training data were generated for Fig. 2F. The differential confluency and detachment experiments will be fully described, including confluency levels used during model training. The secondary microscopy data will be added as part of a new figure that was commented for reviewer #1.

      • *

      • The machine learning details are lacking. The train-validation-test strategy is not described, which could be critical in excluding concerns for data leakage (e.g., batch effects) which could be a major concern in this study. It is not always clear what network architecture was used. What were the parameters used for training? Accuracy is reported in % (and sometimes in an awkward representation, 990‰). Proper evaluation will use measurements that are not sensitive to unbalanced data (e.g., ROC-AUC). What are the controls (i.e., could the accuracy reported be by chance?). Reporting accuracy at the pixel/patch level and not at the cell level is a weakness. Estimation of cell numbers (in methods) is helpful but I did not see when it was used in the Results - a better alternative is using fluorescent nuclear markers to move to a cell level (not necessary to implement if it was not imaged).*

      • *

      We will significantly expand the Machine Learning method and result sections, providing:


      -A detailed description of the train-validation-test split strategy, (that explicitly rules out batch effects as a confounding factor). A clarification of the network architecture used for different tasks and their parameters (always the same one).

      -We will expand the evaluation metrics, including ROC-AUC scores to account for class imbalances, and baseline models as controls, ensuring that model performance is not due to chance as a new supplementary figure.

      - Accuracy will be reported to use percentage instead of permille as suggested by other referees.

      - We will clarify the use of cell number estimation in the specific figures in which we use it, including new data in the first figure for the generalization of patch-to-cell estimation.

      • *

      • Downstream analyses lacking sufficient information to enable us to follow and interpret the results, please provide more information.*

      • The PCA ellipses visualizations reference to previous papers. Please explain what was done, how the ellipses were calculated and from how much data? If they are computed from a small number of data points - please show the actual data. It would also be useful to briefly include the information regarding the representation and dimensionality reduction in the Results and not only in the Methods. No biologically-meaningful interpretation is provided - perhaps providing cell images along the PCs projections can help interpret what are the features that distinguish between different experimental conditions.*

      • *

      We will include a clearer explanation in methods as well as results for PCA and dimensionality reduction, as well as the use of Mahalanobis distance as another metric, another visualization for improved interpretation, and a supplementary figure related to tSNE reduction. We will update the figure for inclusion of real subcelullar images that help the biological interpretation of the results.

      • *

        • How were the pairwise accuracies calculated? How did the authors avoid potential batch effects driving classification.*
      • *

      We have used different cellular batches without any effect on accuracy. In the new revised manuscript, we will clarify batch normalization techniques used in training and include additional control analyses ensuring that batch effects are not driving classification results (new figure as suggested by reviewer #1 with mixed and separate cellular batches).

      • *

        • "suggesting that the current workflow can handle four cell lines simultaneously" (lines #126-127) - how were the cell lines determined for each analysis? We assume that the performance will depend on the cell types (e.g., two similar morphology cell types will be hard to distinguish). Fig. 2F is not clear: the legend should report a mixture of four cell types, and this should be translated to clear visualization in the figure panel itself: what do the data points mean? Where are the different cell types?*
      • *

      We will include additional experiments with other cell lines, and we will explicitly describe the rationale for cell line pairings, considering morphological similarities. Fig. 2F will be redesigned for clarity, ensuring data points are clearly labeled by cell type.

      • *

        • Lines 232 and onwards use #pixels as a subcellular size measurement when referring to cell nucleus, cytoplasm and membrane, please report the actual physical size and show specific examples of these patches. This visualization and analysis of patch sizes should appear much earlier in the manuscript because it relates to the method's robustness and interpretability.*
      • *

      We will explicitly report patch sizes in microns and include a supplementary figure illustrating different subcellular regions to enhance interpretability.

      • *

        • Analysis of co-cultured (mixed) experiments is not clear. Was the fluorescent marker used to define ground truth? Was the model trained and evaluated on co-cultures or trained on cultures of a single cell type and evaluated on mixed cultures? We assume that the models were still evaluated on the label-free data? "...obtain subcellular ROIs only from regions positive in the red channel. Using these labeled ROIs,.." (138-139) - shouldn't both positive and negative ROIs be used to have both cell types? What are the two quantifications in the bottom of Fig. 1E? Did the "labeled cells" trained another classifier for the fluorescent labels?*
      • *

      We will clarify both the method and results section regarding the co-culture experiment from the first figure. In that specific case, the model learned from positive ROIs in order to demonstrate that this approach can also be used from a mixed culture. In order to become clearer, we will transfer this experiment to a supplementary figure.

      • *

        • Please interpret the results from Fig. 3C-D - should we expect to see passage-related changes in cells (that lead to deterioration in classification) or is it a limitation of the current study?*
      • *

      We will explicitly discuss whether passage-related changes affect cell morphology. In addition, we will include novel RNA-seq data comparing passage and batch effects, in order to correlate them to the image-based deterioration as part of the figure.

      • *

        • In general, as we mentioned a couple of times. It would be useful to visualize different predictions (or use explainability methods such as GradCam) to try to interpret what the model has learned.*
      • *

      We will perform a GradCAM analysis, highlighting which subcellular regions contribute most to classification, improving interpretability.

      • *

        • The correlation analysis between transcriptional profiles and morphological profiles is not clear. There are not sufficient details to follow the genetic algorithm (and its justification). What was the control for this analysis? Would shuffling the cells' labels (identities) and repeating the analysis will not yield a correlation?*
      • *

      We agree with the concern of the reviewer. We will expand the Methods section to clarify how the correlation was calculated, as well as the genetic algorithm. We will perform a control analysis using shuffled cell identities, trying to demonstrate that correlations do not arise by chance.

      • *

      • Please use proper scientific terms. For example, "white-light microscopy" and "live cell red marker".*

      • *

      We will change the text accordingly, making a global review of the manuscript.

      • *

      • This is a "Methods" manuscript and thus should open the source code and data, along with some examples on how to use it in order to enable others to replicate the results and to enable others to use it.*

      • *

      We acknowledge that our manuscript is more a ‘Methods’ manuscript instead of a general article (that it was conceptualized by us). Probably most of the critical points arose by the referees at the end are explained by this reason. We will deposit image data in the BioImage Archive with proper metadata, and we will published our code in GitHub as well as the models.

      • *

      • Please improve the figures. Fonts are tiny and in some places even clipped (e.g., Fig. 1D,E, Fig.2 E, E', and many more), some labels are missing (e.g., units of the color bar in Fig. 1B).*

      • *

      Figures will be redesigned accordingly.

      • *

      • Discussion. Please place this work in context of other studies that tackled a similar challenge of classifying cell types and discuss cons and pros of the different measurements. For example, there are clear benefits of using label-free data to reduce the number of fluorescent labels and enable long-term live cell imaging following a process without photobleaching and phototoxicity (Fig. 2G) but it is more difficult to interpret these differences in label-free image patches rather than fluorescently labeled single cells. One solution to bridge this gap that could be discussed is using silico labeling (PMID: 38838549).*

      • *

      The Discussion will be significantly expanded to compare our work with other methods, including in silico labeling (PMID: 38838549).

      • *

      • The idea of using the pairwise correlation distance of different cell types to model unseen cell types is interesting and promising. Why did these specific pairwise networks were used? How robust is this representation to inclusion of other/additional models?*

      • *

      As the referees are very interested in pairwise correlation distance, we will include a sensitivity analysis, testing alternative model selections to assess robustness.



      Reviewer #3

      • *

      ## General

      • *

      - It is often unclear if a sample in the particular experiment is a patch or a pixel. Please be more specific on this in the text.

      • *

      Manuscript will be rewritten for clarification of pixel/patch.

      • *

      - It is unclear which patch size was used and if it was consistent throughout the experiments. Please add this information.

      • *

      We will include a new figure with comparison between different patch sizes, as suggested by reviewer #1.

      • *

      - It is often unclear which data was used for training/validation and final readout. Did you do train/val splits? Did you predict on the same data or new samples? This should be stated more specifically.

      • *

      We will clarify in the Methods section the strategy of training/testing (90% - 10%, same data) with new samples used for final readout. All reported classification results come from that set, ensuring that the model was evaluated on unseen data.

      • *

      - Also, it is a little bit unclear what you mean by patch or by ROI or by region, please be more consistent and explain what you mean by adding definitions.

      • *

      We will standardize the use of these terms, leaving only ROI.

      • *

      - Please compare your method to other approaches and to baselines (see also our comment above).

      • *

      We will compare our approach with whole-image classification, showing that our subcellular approach provides better generalization. A new supplementary analysis will explore the feasibility of alternative feature extraction techniques and their relative performance. Several baselines will be incorporated in order to assess random accuracies (following the suggestions of other reviewers).

      • *

      - In general, if possible, please add more concrete examples of how you envision your method to be used in practice. There are general ideas presented in the discussion section, but we feel those could be substantiated by more concrete implementation suggestions.

      • *

      We will provide three specific case studies in the Discussion section, demonstrating how our approach can be applied in real-world scenarios:


      -Drug Screening: Identifying cellular responses to drug treatments in high-throughput screening pipelines.

      -Stem Cell Differentiation Monitoring: Tracking changes in subcellular morphology during differentiation to assess developmental stages.

      -Cancer Cell Classification: Distinguishing between different subtypes of cancer cells in heterogeneous populations.

      • *

      Minor comments (grouped and summarized for clarification):

      • *

      General Clarifications & Wording Improvements

      • *

      Line 18: Clarify if the study is based on morphological features and specify the novelty (e.g., subcellular features).

      Lines 25 & 29: The wording suggests that the workflow was extended before being validated. Improve clarity.

      Line 92: Add a brief explanation of "subcellular region."

      • *

      We will clarify in the Introduction that our study is based on morphological features but specifically focuses on subcellular features, which distinguishes it from whole-cell analysis. We will rephrase the relevant sentences to make it clear that the workflow was first validated and then extended. We will provide a brief definition of "subcellular region" and ensure consistency throughout the manuscript.

      • *

      Experimental Setup & Methodological Details

      • *

      Lines 100-141: Clarify the use of validation and test sets, and discuss potential batch effects.

      Line 113: Missing training details (loss function, data volume, epochs).

      Line 117: Clarify if "pairwise classification" is meant.

      Line 119: Accuracy should be reported in percent instead of permille.

      Lines 136-141: Justify why two cell lines were mixed but only one was analyzed.

      • *

      We will add a clear explanation of the train-validation-test split, ensuring reproducibility and ruling out batch effects. Additional batch effect control experiments will be performed and included in Supplementary Figures as suggested by other reviewers.

      We will include training details (e.g., loss function, number of epochs, data volume) in the Methods section and referenced it in the Results section for clarity. The terminology will be updated to "pairwise classification" where appropriate. We will report accuracy in percent (%) as suggested by other reviewers. The rationale for mixing two cell lines but analyzing only one is now explicitly stated: we used a mixed culture to simulate realistic conditions but focused on one cell type to test classification specificity. Nevertheless, following other reviewer suggestion this experiment will be placed in a supplementary figure in order to become clearer.

      • *

      Technical & Experimental Design Clarifications

      • *

      Line 105: Replace "white light microscopy" with "brightfield microscopy."

      Line 107: Be specific about "transformation to black and white" and "contrast thresholding algorithm."

      Line 125: Explain why performance dropped—did you try a larger network?

      Line 133: Clarify how confluency was estimated.

      • *

      "White light microscopy" will be replaced with "brightfield microscopy." The thresholding method will be explicitly described, with a reference to the Methods section where details are provided. We will discuss the possible reasons for performance drop. Confluency estimation will be described, explaining that it was calculated using automated image segmentation and validated manually.

      • *

      Data Representation & Interpretation

      • *

      Line 143-158: Clarify the ground truth—was it based on dye labeling, thresholding, or human annotation?

      Line 156: What is meant by "magnification"? Higher resolution? Different microscope? Crops?

      Lines 163-166: Sudden switch to pixels instead of ROIs—explain why.

      Line 191 & 192: If a strong correlation is claimed, include a statistical test.

      Lines 211-214: If differences are claimed, add a quantitative analysis.

      Lines 396-404: Clarify how the test set was chosen and what "in situ prediction" means.

      Lines 407-409: What do you mean by "binarizing the image"? What threshold was used?

      • *

      We will clearly explain terms like “ground truth”, "magnification", “in situ prediction” and ‘binarization”. Consistent terminology will be ensured, regarding ROIs throughout the text. Statistical analyses will be added to correlation results and morphological feature comparisons to support claims.

      • *

      Biological Interpretation & Feature Space Analysis

      • *

      Line 226-228: You show classification in feature space but not whether distances in feature space correlate with real-world differences between cell types.

      Line 234-236: What do you mean by "detect potentially more informative subcellular regions"?

      Line 302-303: The claimed application (estimating cell types in an unseen culture) was not shown—please add an experiment.

      • *

      We now include an experiment comparing three cell types, where two are closely related and one is more distinct, to test if feature space distance corresponds to real-world differences. The concept of "informative subcellular regions" will be rephrased. We will add an experiment demonstrating the ability of our model to estimate the number of cell types in an unseen culture, as suggested.

      • *

      Figure & Visualization Improvements

      • *

      • Improve figure readability (tiny fonts, clipped text).*

      Line 653-655: Show actual data points in PCA ellipses, not just ellipses.

      Line 672-677: Add a quantification of performance differences between different categories.

      • *

      All figures will be revised for better readability, ensuring that text is legible, axes are labeled, and color bars are clear. We will overlay data points onto PCA ellipses for better visualization of feature distribution, as suggested by other reviewers. Performance differences between different experimental conditions will be quantified, with statistical comparisons provided.

      • *

      Model Training & Data Reproducibility

      • *

      Lines 386-392: Add exact details on model architecture, loss function, number of images used per experiment.

      • *

      A complete breakdown of model architecture, loss function, training set size, and validation details will be included in the Methods section, ensuring full reproducibility.


      Dimensionality Reduction & Feature Space Interpretation

      • *

      Line 438-439: Consider using UMAP or t-SNE in addition to PCA. Report variance explained by PCA components.

      Line 439-440: Provide more details on how eigenvectors were used to calculate ellipses.

      Line 442-443: Clarify which correlation method was used.

      • *

      We will include t-SNE visualizations in Supplementary Figures and report the variance explained by PCA components, as well as Mahalanobis distance, as suggested by other reviewers. The eigenvector-based ellipse calculation will be described in more detail in the Methods section, and the specific correlation metric used will be explicitly stated.

      • *

      Code & Data Accessibility

      • *

      Line 491: Provide a direct URL to the code and data. Consider using GitHub for code and BioImage Archive for data.

      • *

      We will include the code to GitHub and image data to the BioImage Archive, following the reviewers recommendation, with direct URLs.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The authors present a computational workflow that automatically classifies patches of transmission microscopy images of cultured cells into different cell types.

      Comments to the Manuscript

      General

      • It is often unclear if a sample in the particular experiment is a patch or a pixel. Please be more specific on this in the text.
      • It is unclear which patch size was used and if it was consistent throughout the experiments. Please add this information.
      • It is often unclear which data was used for training/validation and final readout. Did you do train/val splits? Did you predict on the same data or new samples? This should be stated more specifically.
      • Also, it is a little bit unclear what you mean by patch or by ROI or by region, please be more consistent and explain what you mean by adding definitions.
      • Please compare your method to other approaches and to baselines (see also our comment above).
      • In general, if possible, please add more concrete examples of how you envision your method to be used in practice. There are general ideas presented in the discussion section, but we feel those could be substantiated by more concrete implementation suggestions.

      Specific

      Line 18

      • Isn't this study also based on morphological features? Eventually, you could be more specific what the novelty is, it might be the fact that your features are subcellular?

      Lines 25 & 29

      • In general one would expect a workflow to be validated first and extended afterwards. You could improve the wording here to make this clear for the reader.

      Line 92

      • Please add a short explanation of what is meant by "subcellular region".

      Lines 100-141

      • Did you validate the classification results with a validation and test set? Maybe with cross validation? Please add more details on how this was done.
      • It could be that the model exploits batch effects of different imaging runs (e.g. different overall intensity in patches). It would be nice if this could be checked by an additional experiment.

      Line 105

      • "white light microscopy" is an unusual term, can you be more specific, e.g. bright-field?

      Line 107

      • It is unclear what a "transformation to black and white" and a "contrast thresholding algorithm" are, please be more specific (and potentially point the reader to a corresponding Methods section).

      Line 113

      • How does training work? Which loss is used? How much data? How many epochs? ... All of this information is missing which makes the study non-reproducible. Please add this here or point to an appropriate method section.

      Line 117

      • Do you mean pairwise classification?

      Line 119

      • It is unusual to use permille as a unit to report, percent is more common
      • Also, it is unclear if accuracy is the correct read-out here, are all the data sets balanced?
      • more information about the data sets could be added in a methods section and the decision to use accuracy as a measure could be explained

      Line 121

      • this hypothesis was never stated before, please explain this to the reader first and then check your hypothesis by experiments

      Line 125

      • do you have a hypothesis why the performance dropped? Did you for example try a larger network?

      Line 133

      • how is the confluence estimated?

      Lines 136-141

      • It is unclear why two cell lines were mixed when only data of one of them is used for analysis afterwards. Could you explain this in more detail or specify why this approach is used?

      Lines 143-158

      • We think you are trying to establish a ground truth here. Unfortunately, there are two things mixed here, the labeling with an additional dye combined with thresholding and human annotation. It is unclear which is considered the ground truth or if both are considered true. Could you explain this in more detail or be more specific?

      Line 156

      • What do you mean by magnification? Images with a higher resolution (different microscope with higher magnification)? Crops of the same data? Something else? Could you explain this in more detail or be more specific?

      Lines 163-166

      • Suddenly you talk about pixels instead of ROIs, where are they coming from? Maybe point the reader to a method section and explain the switch here.
      • Also, why is the pixel size cell line dependent, didn't you use the same microscope for all of them? Could you define what you mean by pixel size?
      • You say you compared different cell lines, how is this summarized in one plot? Please explain in more detail.

      Lines 177-214

      • Again, it is unclear which data was used for training, validation and analysis in the end. Please add this.

      Lines 191 & 192

      • If you claim a strong correlation please add a statistical analysis that shows this.

      Lines 211-214

      • If you claim these differences you should add a quantitative read-out with a statistical analysis. You could use distances in your representation space as a basis for this.

      Lines 226-228

      • What is shown here is that the morphological features can be used to classify cell types. You show that these classes are distant in feature space. But you don't show any correlation between the distance in feature space and the distance in real space (a.k.a how different the cell types are). It would be nice to have an experiment with at least three classes where 2 are closer to each other than to the third one. This would be a stronger claim that your features actually capture meaningful distances/differences.

      Lines 234-236

      • What do you mean by "detect potentially more informative subcellular regions within the cell"? Please describe in more detail what the training task was for the model and how you interpret the results.

      Lines 296-298

      • It is a little bit confusing what you mean here since you do train a network for each pair of cell lines. What you are describing is a foundation model. Please explain in more detail what you mean.

      Lines 302-303

      • The application you are claiming here was never shown in the experiments. Could you please add this experiment where a model estimates the number of cell types in an unseen culture.

      Line 323

      • Could you please elaborate how you would identify "specific cellular compartments"?

      Lines 323-326

      • Are there other studies that suggest that such malignant cells show features that are recognizable by your approach?

      Lines 342-365

      • Did you use biological replicates? This would be interesting and also a nice way to validate your models.

      Lines 369-373

      • Why do you claim that a similar microscope produces similar images? Can you give more details why this is relevant. And if that is the case it would be nice to show them. Maybe in some supplementary material.
      • How big is one image? How many cells can you see in one image? What is the resolution? What is the pixel size? ... Also, for which experiment did you use how many images? Please add all these details.
      • Also, please show some example images to make it clear for the reader what the data looks like. Could be done in supplementary material.

      Lines 386-392

      • Again, please add details. As it is right now the study is not reproducible. How many images were used for each experiment? How many for training, validation, analysis? Give the exact architecture of the model used. Which loss was used for training?

      Lines 396-404

      • Please add more details and clarify. How was the test set chosen? What do you mean by "in situ prediction"? What do you mean by "running ROIs"? What do you mean by "if the cell type was predicted to be more than 50 % of the times"? Was the human annotation or the life cell marker used for the final accuracy? Humans are never unbiased.

      Lines 404-407

      • This sounds like the ground truth for a segmentation task - is this what you mean? Since you are solving a classification task this is confusing. Please clarify.

      Lines 407-409

      • This sentence is confusing and it is unclear what was done. Please clarify. Do you mean the image was binarized? If yes, which threshold was used? What do you mean by "accuracy was estimated as with the prediction"? The accuracy should be estimated by comparing the prediction to the ground truth.

      Lines 413-422

      • Please give more details. What are these specific numbers? What do you mean by "pixel size of each cell type"? The pixel size is metadata given by the microscope/image and should not be cell type specific. We also did not understand what is meant by "fitting the percentages" and what the aim of this is. Please consider rewriting this to make it more clear.

      Lines 426-430

      • Please provide the oligo sequence.

      Line 435

      • Please consider rephrasing to: "the output of the last max pooling layer"

      Lines 438-439

      • It would be interesting to visualize the data based on a different dimensionality reduction algorithm that is non-linear like UMAP or t-SNE. If you use PCA, could you give a measure on how much of the variance is captured in the first two PCs.

      Lines 439-440

      • Please give some more details on how you use eigenvectors to calculate ellipses.

      Lines 442-443

      • Please give more details on which correlation you calculated.

      Lines 447-457

      • It would be nice if you could rephrase this a little bit to make clear that the preprocessing itself stays the same but you basically establish different data sets by separating ROIs based on their distance to the closest nucleus.

      Lines 455-457

      • Please be more precise here. The networks still learn to classify patches and are not aware of the fact that these ROIs fall in a certain category. You exploit this fact afterwards for your analysis.

      Lines 464-474

      • Please add more details why this experiment is done. Why is a genetic algorithm needed? Could not the same analysis be done on the original transcriptomics data?

      Line 486

      • Do you mean technical or biological replicates? If that is the case, could you please clearly state that you report mean values and also give the standard deviation.
      • "test" should be experiment

      Line 491

      • Could you please provide a URL to the code and the data.
      • Also, it is common practice to upload code to GitHub and image data to the Bioimage Archive. Please consider doing this.

      Lines 627-633

      • Panel A could be improved by making the ROIs larger since it is hard to see them.
      • Also, please make sure that it is clear that one ROI at a time is given to the model.

      Line 638

      • What does "magnification" mean here - see above.
      • Why do you not show the same region?

      Line 640-642

      • This basically shows that your approach is as good as simple thresholding. What do you want to show with this?

      Lines 643-644

      • Please clarify. It is unclear what percentage you present here.

      Lines 652-653 (Fig. 3C)

      • Please clarify. It is unclear what statistical analysis was performed here and to what end.

      Lines 653-655

      • It would be interesting to see not only the ellipses but also the actual data points plotted.

      LInes 658-661

      • Please add a statistical analysis of what you want to show here.
      • It is clear that the correlation is not as clear for higher values on the x-axis, why is this?

      Lines 661-662

      • Please clarify. It is unclear what statistical analysis was performed here.

      Lines 662-664

      • Please add a statistical analysis of what you want to show here.

      Lines 672-677

      • Please also plot the actual data points
      • Also, if possible it would be nice to quantify the differences in performance between the different categories.

      Code and data availability

      We could not see how to access example image data. To our best knowledge it is current best practice to upload image data to the Bioimage Archive: https://www.ebi.ac.uk/bioimage-archive/

      Specifically for this kind of study the reader should have access to the training and test data that was used to train the classifier.

      We also could not see how to reproduce the analysis. To our best knowledge it is current best practice to make all code publicly accessible, e.g. in a GitHub repository.

      Please see https://www.nature.com/articles/s41592-023-01987-9 for general guidelines of publishing bioimage data and analysis.

      Significance

      The ability to use label free microscopy for extracting biologically meaningful information is very valuable and it is very interesting to learn that simple transmission microscopy contains enough information to reveal cell types. In this study the authors trained a neural network for this task and demonstrated that it works with rather high accuracy.

      In its current form, we could not access the data nor the code. We could thus not fully judge the quality of the presented work. For a future revision, access to data and code will be essential.

      We also found it difficult to judge how difficult the classification task is, because the size of the cells in the current figures does not allow one to see texture detail in the images. Since we did not manage to access the image data, we could not assess whether the classification task is very hard (and indeed requires an AI approach) or whether the differences are rather obvious and could be quantified with classical image analysis. To enable the interested reader to better assess this important information we would like to recommend to (a) add figures that allow one to better see the cells and their texture, at least for some of the cell types, and (b) provide easy download access to the raw image data.

      Along those lines, we think it would be very interesting to actually test whether training a neural network is required or whether other methods would yield similar results. For instance, we would recommend to simply compute the mean and variance of the intensities in each patch and check whether this information also can perform some of the classification tasks. Depending on the outcome of this analysis this could be either added to some of the main figures of the article or to the supplemental material.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Automatic classification of single cell types and cell states in heterogeneous mixed cell populations has many applications in cell biology and screening. The authors present a machine learning workflow to distinguish between different cell types or cell states from label-free microscopy image patches of subcellular size. The authors evaluate their ability to identify different cell types and molecular profiles on many applications.

      Major comments:

      The application of classifying cell type and states from label-free data is promising and useful, but this manuscript requires major rewriting to enable us comprehensive assessment. Specifically, provide all technical details necessary for its evaluation, improve clarity and justification for the methodology used and the results obtained, and to better place this study in context of other studies in the field. Two crucial points are excluding the concern of the possibility that batch effects are contributing to the classification results and providing stronger evidence for a link between transcriptional and morphological profiles. Some efforts to interpret the classification decision making could help understand what morphological information was used for classification and reduce the concerns for the model using non-biologically meaningful information for the classification (e.g., illumination changes due to batch effects). Finally, making the source code and data publicly available would be important to enable others to apply the method (code) and to benchmark other methods (data).

      1. Place this study in context of previous studies that classify cell types. Here are two relevant recent papers, which could provide a good start for properly crediting previous work and placing your contribution in context: PMID: 39819559 (note the "Nucleocentric" approach) and PMID: 38837346. Please seek for papers that use label free for similar applications (which is the main contribution of the current manuscript).
      2. Many experiments were performed, but we found it hard to follow them and the logic behind each experiment. Please include a Table summarizing the experiments and their full statistics (see below) and also please provide more comprehensive justifications for designing these specific experiments and regarding the experimental details. This will make the reading more fluent.
      3. The experiments, data acquisition and data reporting details are lacking. 10x objective is reported in the Results and 20x in the Methods. Please explain how the co-culturing (mixed) experiments were performed including co-culturing experiments with varying fractions of each cell type and on what data were the models trained on (Fig. 2F). Differential confluency experiments are not described in the Methods (and not on what confluency levels were the models trained on), this is also true for the detachment experiment. How many cells were acquired in each experiment (it says "20 and 40 images per cell line" but this is a wide range + it is not clear how many cells appear in each image)? How many biological/technical replicates were performed for each experiment? Please report these for each experiment in the corresponding figure legend and show the results on replicates (can be included as Supplementary). "Using a different microscope with the same objective produced similar results (data not shown)" (lines #370-371), please report these results (including what is the "different microscope") in the SI.
      4. The machine learning details are lacking. The train-validation-test strategy is not described, which could be critical in excluding concerns for data leakage (e.g., batch effects) which could be a major concern in this study. It is not always clear what network architecture was used. What were the parameters used for training? Accuracy is reported in % (and sometimes in an awkward representation, 990‰). Proper evaluation will use measurements that are not sensitive to unbalanced data (e.g., ROC-AUC). What are the controls (i.e., could the accuracy reported be by chance?). Reporting accuracy at the pixel/patch level and not at the cell level is a weakness. Estimation of cell numbers (in methods) is helpful but I did not see when it was used in the Results - a better alternative is using fluorescent nuclear markers to move to a cell level (not necessary to implement if it was not imaged).
      5. Downstream analyses lacking sufficient information to enable us to follow and interpret the results, please provide more information.

      a. The PCA ellipses visualizations reference to previous papers. Please explain what was done, how the ellipses were calculated and from how much data? If they are computed from a small number of data points - please show the actual data. It would also be useful to briefly include the information regarding the representation and dimensionality reduction in the Results and not only in the Methods. No biologically-meaningful interpretation is provided - perhaps providing cell images along the PCs projections can help interpret what are the features that distinguish between different experimental conditions.

      b. How were the pairwise accuracies calculated? How did the authors avoid potential batch effects driving classification.

      c. "suggesting that the current workflow can handle four cell lines simultaneously" (lines #126-127) - how were the cell lines determined for each analysis? We assume that the performance will depend on the cell types (e.g., two similar morphology cell types will be hard to distinguish). Fig. 2F is not clear: the legend should report a mixture of four cell types, and this should be translated to clear visualization in the figure panel itself: what do the data points mean? Where are the different cell types?

      d. Lines 232 and onwards use #pixels as a subcellular size measurement when referring to cell nucleus, cytoplasm and membrane, please report the actual physical size and show specific examples of these patches. This visualization and analysis of patch sizes should appear much earlier in the manuscript because it relates to the method's robustness and interpretability.

      e. Analysis of co-cultured (mixed) experiments is not clear. Was the fluorescent marker used to define ground truth? Was the model trained and evaluated on co-cultures or trained on cultures of a single cell type and evaluated on mixed cultures? We assume that the models were still evaluated on the label-free data? "...obtain subcellular ROIs only from regions positive in the red channel. Using these labeled ROIs,.." (138-139) - shouldn't both positive and negative ROIs be used to have both cell types? What are the two quantifications in the bottom of Fig. 1E? Did the "labeled cells" trained another classifier for the fluorescent labels?

      f. Please interpret the results from Fig. 3C-D - should we expect to see passage-related changes in cells (that lead to deterioration in classification) or is it a limitation of the current study?

      g. In general, as we mentioned a couple of times. It would be useful to visualize different predictions (or use explainability methods such as GradCam) to try to interpret what the model has learned.

      h. The correlation analysis between transcriptional profiles and morphological profiles is not clear. There are not sufficient details to follow the genetic algorithm (and its justification). What was the control for this analysis? Would shuffling the cells' labels (identities) and repeating the analysis will not yield a correlation? 6. Please use proper scientific terms. For example, "white-light microscopy" and "live cell red marker". 7. This is a "Methods" manuscript and thus should open the source code and data, along with some examples on how to use it in order to enable others to replicate the results and to enable others to use it. 8. Please improve the figures. Fonts are tiny and in some places even clipped (e.g., Fig. 1D,E, Fig.2 E, E', and many more), some labels are missing (e.g., units of the color bar in Fig. 1B). 9. Discussion. Please place this work in context of other studies that tackled a similar challenge of classifying cell types and discuss cons and pros of the different measurements. For example, there are clear benefits of using label-free data to reduce the number of fluorescent labels and enable long-term live cell imaging following a process without photobleaching and phototoxicity (Fig. 2G) but it is more difficult to interpret these differences in label-free image patches rather than fluorescently labeled single cells. One solution to bridge this gap that could be discussed is using silico labeling (PMID: 38838549).<br /> 10. The idea of using the pairwise correlation distance of different cell types to model unseen cell types is interesting and promising. Why did these specific pairwise networks were used? How robust is this representation to inclusion of other/additional models?

      Significance

      Automated classification of cell types and cell states in mixed cell populations using label-free images has important applications in academic research and in industry (e.g., cell profiling). This paper applies standard machine learning toward this technical goal, and demonstrates it on many different experimental systems, exceeding the common standard in terms of quantity and variability, and with the potential of being a nice contribution to the field. However, we were not able to properly evaluate these results due to lacking experimental and methodological details as detailed above and thus can not make a strong point regarding validity and significance before a major revision. Our expertise is in computational biology, and specifically applications of machine learning in microscopy. We are not familiar with the specific cell types, states and perturbations used in this manuscript.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This paper presents a method to classify cells in brightfield images using information from subcellular regions. The approach consists in first thresholding a brightfield image then splitting the resulting binary image into small ROIs which are then fed to a CNN-based classifier. The authors demonstrate application to the identification of cell types in pure cultures and in cultures with mixed types. They then show that features learned by the classifier correlate with expression of cell type-specific genes and explore what information can be learned from networks trained on subcellular regions selected based on distance from the nucleus. The authors conclude that subcellular ROIs extracted from brightfield images contain useful information about the identity and state of the cells in the image.

      Major comments:

      • Neither data nor code was made available for review. There's only a mention of them being in Figshare with no link. As a consequence and a matter of principle, this study is not publishable without both public data and code.
      • I would recommend using adequate repositories for data and code. Image data can be deposited in a public image data repository such as the BioImage Archive which would ensure that minimal metadata are provided and code could go to a public code repository (e.g. GitLab...) so that it is discoverable and eventual changes can be tracked and visible (for example should any bug be fixed after publication). Also consider depositing the models into the BioImage Model Zoo (https://bioimage.io).
      • The use of the term morphology is misleading. Like I expect most readers would, I understand morphology in this context as being related to shape. However, there is no indication that any specific type of information (like shape, texture, size/scale...) is used or learned by the described method. To understand what information the classifiers rely on, it would be interesting to compare with human engineered features extracted from the same ROIs. All references to morphology in the text must be removed unless indication can be provided as to what type of information is used by the models.
      • The method should be described with more details:
      • How are the window sizes to use determined? Are the two sizes listed in the methods section used simultaneously? What is the effect of this parameter on the performance?
      • How are the ROIs determined? In a grid pattern? Do they overlap? i.e. how does the windowing function work?
      • Predictions seem to be made at the ROI level but it isn't clear if this is always the case. Can inference be made at the level of individual cells?
      • What would be the advantages of the proposed subcellular approach compared to learning to classify whole images?
      • When fluorescent markers are used, the text isn't clear on what measures have been taken to prevent these markers from bleeding through into the brightfield image. To rule out the possibility that the models learn from bleed-through of the marker into the brightfield image, the staining should be performed after the brightfield image acquisition. Without this, conclusions of the related experiments are fatally flawed.
      • How robust are the models e.g. with respect to culture age and batch effects? Use of a different microscope is mentioned in the methods section. This should be shown, i.e. can a model trained on one microscope accurately predict on data acquired from a different microscope? Does mixing images from different sources for training improve robustness?
      • Why not use the Mahalanobis distance in feature space? This would be the natural choice given that PCA has been selected for visualization and would allow to show uncertainty regions in the PCA plots. Could other dimensionality reduction methods show better separation of the groups? Why not train the network for further dimensionality reduction if the goal is to learn a useful feature space?

      Minor comments:

      • Make sure the language used is clear, e.g.
      • The text describes the method as involving a transformation to black and white followed by thresholding. This doesn't make sense.
      • What is meant by "the set of 300 genes was subjected to Gene Ontology"?
      • Use percent instead of permille in the text for easier reading.
      • To provide more context, cite previous work that indicates that brightfield images contain exploitable information, e.g.
      • Cross-Zamirski, J.O., Mouchet, E., Williams, G. et al. Label-free prediction of cell painting from brightfield images. Sci Rep 12, 10001 (2022). https://doi.org/10.1038/s41598-022-12914-x
      • Harrison PJ, Gupta A, Rietdijk J, Wieslander H, Carreras-Puigvert J, et al. (2023) Evaluating the utility of brightfield image data for mechanism of action prediction. PLOS Computational Biology 19(7): e1011323. https://doi.org/10.1371/journal.pcbi.1011323

      Referees cross-commenting

      I support comments from reviewers 2 and 3 around the lack of sufficient details fro interpretability and reproducibility. Some of the necessary information could be communicated through well documented re-usable code and computational workflows as well as properly documented data sets.

      Jean-Karim Hériché (heriche@embl.de)

      Significance

      This is an interesting study that adds to a growing body of evidence showing that information contained in brightfield images can be usefully exploited, potentially replacing the expensive and time-consuming use of fluorescent markers and is therefore of interest to a broad audience of cell biologists.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Critique

      In this manuscript, the authors examine the biochemistry of two protein domains that are, on the basis of sequence similarity, predicted to function autonomously as binders of histone H3 tails or methylated DNA. They present solid data to suggest that neither domain in fact has this function, but that they act as protein interaction domains that form a heterodimer mediated by the presence of a zinc ion (two ligands from each protein).

      In the first part of the Results, the authors note that ASXL PHD doesn't contain aromatics that are characteristic of methylated lysine binding. I would just note that they don't mention at this point that some PHDs bind unmethylated H3 - and that aromatics are not required for that binding activity. The lack of H3K4me3-binding aromatics doesn't at all make a case the domain doesn't bind histones. The lack of the Ala1 binding residues does make this case, but that's separate...

      Anyway, they then go on to show convincingly by ITC that ASXL doesn't bind the N-terminal H3 tail - unmodified or methylated. They also show modified-H3 ELISA data that make the same point (though it would be nice to know what the points were on the single ELISA that exceeded 2 SDs, even if they weren't reproduced - especially given there is a lot of scatter in the ELISA). I note in passing that I don't think I could find a Supp table 1).

      The authors then use AF3 to show that what would typically be the N-terminal zinc-binding site is not well predicted by the software (and the site ends up being square planar), suggesting that something might be amiss. (They were also unable to obtain an experimental structure.) It would have been helpful to gain more insight into what led them to the conclusion that the protein forms a weak homodimer based on the NMR data. Typically, it can be challenging to determine by NMR whether a dimer is forming or if non-specific soluble aggregates or other factors are contributing to line broadening.

      Next, the authors show nicely that MBD5/6 - two proteins shown in a previous paper to form a complex with ASXL - are predicted by AF3 to dimerize with ASXL - and form an intermolecular zinc-binding module in doing so. This is a nice result and there are very few examples of this in the literature (eg the zinc hook formed by Rad50 proteins). They confirm the zinc-binding prediction biochemically. They also show an HSQC of the complex (both subunits 15N labelled) and they count what they say is roughly the right number of peaks. To me, the lineshapes in the HSQC look good and, as the authors say, there are no clearly disordered resigies. I do make some additional comments below about the NMR data - suggesting what I think would be some valuable follow-up experiments. Overall, this study is a nice piece of biochemistry that recognizes an anomaly in the classification of examples of not one, but two, domain types well-known in the field of epigenetics. Going further than that, they not only show that the domains are mis-annotated but also demonstrate what their real function is and put forward a very likely model for their structure.

      The work is a good combination of AF based computational prediction with corroborating biochemistry and the experiments look technically well done to me. It is definitely of publishable quality and represents an advance in our understanding both of the particular proteins that they have studied and of the quirkiness of protein structure in general - there is always a new wrinkle to be discovered. I would make a couple of comments and suggestions that I think could improve the manuscript. I also have a number of minor comments below.

      Regarding the NMR data, the HSQC of the heterodimer that they show has nice lineshapes, as I mentioned above. However, the spectrum looks a little curious and closer inspection makes me wonder whether we are actually looking at two or more species with related structures. Many of the peaks appear to have a second peak nearby and it looks to me as if there is a consistent intensity ratio between the two forms (maybe 3:1 or 4:1?). It would be beneficial to explore this further, as understanding this aspect more clearly could have important implications for their analysis. I think the overall conclusions would probably still hold, but there would be far fewer signals than expected, suggesting likely some sort of slow-intermediate conformational exchange process that is giving two signals for a chunk of the residues and giving no signals for some of the others. Some comparison with the HSQC of the PHD domain alone might be helpful here.

      Some simple backbone triple resonance experiments would also be very helpful. Not only would they allow assignments to be made - and therefore a comparison of predicted secondary structure with the AF3-predicted fold - but also would help confirm whether there are two conformers. Often in these cases, the Ca and Cb chemical shifts for an exchanging system are much more similar than the HN and 15N signals, and it is therefore often clear that two peaks are actually the same residue in two different conformations. ZZ exchange experiments could help too, though these can sometimes be challenging.

      Finally, it would be reassuring to see SEC-MALS data for the heterodimer. Given that the interaction is mediated by covalent bonds, I'd expect to see a dimer molecular weight. It would also be reassuring to see a nice-looking SEC peak - and it would be useful data to have as part of the interrogation of possible chemical exchange mentioned above.

      Specific points

      • Intro: A nucleosome wraps less than two turns of DNA
      • I'm not a fan of this sentence: "The quaternary structure of the nucleosome forces the N- and C-terminal tails from histone proteins to protrude for covalent chemical modification". Not clear to me that the nucleosome 'forces' the tails to protrude...
      • The authors state that "Attachment of ubiquitin to histone H2A at K119 limits gene expression" - but they don't give any context. Which genes are limited in their expression? Nearby ones? Ones on the same chromosome? Just the gene that has an H2A-Ub in a specific position?
      • No need for capital Z in zinc.
      • "After purification, the protein solution was concentrated to 42.5 uM". The authors would not know the protein concentration to three significant figures. They would be unlikely to know it to 2 figures, given the inherent uncertainty in protein concentration measurement.
      • I like that they show purification gels for their proteins - almost no one does...
      • The authors state that "The domain, however, proved too small and flexible to produce crystals". However, the authors don't (as far as I can see) have any data to support the notion that either of these was the reason that no suitable crystals were obtained. I bet there are plenty of large, well-ordered proteins that haven't been able to have their crystal structures determined...
      • Supp fig 3 - the authors could label N and C termini.
      • "The 1H15N HSQC spectra revealed the presence of about 95 backbone amide peaks, which is in agreement with the overall protein complex." The authors could tell us how many peaks are expected, to make the comparison more useful! (and it should be spectrum).
      • "and form a tight, stable protein complex". Too many adjectives... The data don't show that the complex is tight, nor really say anything about its stability (is the Tm 35 degrees or 95 degrees - can't really say). The data do show that the two proteins form a complex.
      • I'd say that 633 A2 buried surface area isn't 'large'. It's small by protein complex standards, I think. But still perfectly reasonable.
      • Figure S6 - would be good to label N and C termini.

      Significance

      In this manuscript, the authors examine the biochemistry of two protein domains that are, on the basis of sequence similarity, predicted to function autonomously as binders of histone H3 tails or methylated DNA. They present solid data to suggest that neither domain in fact has this function, but that they act as protein interaction domains that form a heterodimer mediated by the presence of a zinc ion (two ligands from each protein).

      I am a structural biologist and biochemist who has worked on zinc-binding domains - including PHD domains - on and off over 30 years.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The Polycomb Repressive-Deubiquitinase (PR-DUB) complex catalyzes histone H2AK119Ub deubiquitinylation, regulating gene expression and chromatin dynamics. It comprises the BAP1 deubiquitinylase and one of three ASXL proteins (ASXL1-3). ASXL proteins contain a highly conserved C-terminal Plant Homeodomain (PHD), which was proposed to recognize epigenetic marks on histone H3 tail and other proteins. Mutations and truncations in the PHD domain are frequently observed in cancer. The authors propose that the ASXL PHD domain does not target histone H3 PTM marks. They model the PHD domain using AlphaFold3 and identified a non-canonical fold that can apparently chelate one Zinc ion only in vitro, instead of the two ions typically bound by PHD domains. They also investigated the methyl CpG-binding domain proteins MBD5 and MBD6, known to interact with the ASXL PHDs and found that the complexes contain a composite Zinc-binding site at the interface between the two proteins. While the overall concept is interesting, the data do not justify conclusions. The authors should also reefer properly to the citations

      Major comments:

      • Are the key conclusions convincing?

      No. The final model is not substantiated by a robust experimental system The conclusions about Zinc binding and that PHD of ASXLs does not bind histone tails are based on a rather weak experimental system. There is a need for structural evidence and validation with mutagenesis. Also, comparing the sequence of the ASXL PHD to ING2 is insufficient and the PhD might bind other known or unknown peptide sequences on histones. The authors can not state or imply, based on their data, that the ASXL PHD does not recognize histone H3 epigenetic modifications. The methods are not sensitive enough and other peptides with an apparent fold enrichment have not been considered. It is not adequate to compare the Zinc-binding assay ASXL2 (residues 1375-1435) and Asx (residues 1610-1668) PHD domains with the RING domain of cIAP1 (residues 551-618) and a GST-only control. Why not other PHD domains? - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Yes - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Need solid evidence through experimental structure validation The ASXL PHD forms a composite Zinc-binding site with MBD5 and MBD6 is not well developed. There is a need structural validation - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes - Are the data and the methods presented in such a way that they can be reproduced?

      Need to provide more technical details - Are the experiments adequately replicated and statistical analysis adequate?

      It is unclear whether the data are mostly technical replicates in the same experiment as opposed to independent experiments

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Need to validate Zinc chelation and composite interface for Zinc binding with other methods - Are prior studies referenced appropriately?

      Largely No Examples: - Attachment of ubiquitin to histone H2A at K119 limits gene expression (Cao & Yan, 2012) - Ubiquitin is attached to H2AK119 by the Really Interesting New Gene (RING) E3 ubiquitin ligase Polycomb Repressive Complex 1 (PRC1, Cohen et al., 2020) and is removed by the PR-DUB (Reddington et al., 2020; Scheuermann et al., 2010) - The PR-DUB has regulatory functions in the cell cycle, cellular development and DNA damageresponse, and determines short-term changes to gene expression (reviewed in Di Croce &Helin, 2013; Mozgova & Hennig, 2015; Parreno & Martinez, 2022; Schuettengruber et al., 2017). - Are the text and figures clear and accurate?

      Yes - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Validate the conclusions with robust methods.

      Other issues

      • In the Abstract: Need to include MBD5, MBD6 in the initial statement A Plant Homeodomain (PHD) at the C-terminus of ASXL proteins is recurrently truncated in cancer, and was previously proposed to recognise epigenetic modifications on the N-terminal tail of histone H3.

      Referees cross-commenting

      This session contains comments from both Reviewers

      Rev 1

      I believe that the manuscript needs substantial improvement. This involves experiments and this would require at least 6 months.

      Rev 2

      I don't agree that the authors need to determine an experimental structure for this work to be publishable. I think that the methods used, as is, are sufficient to draw a conclusion about the likely zinc ligation geometry. A structure would of course be great, but is a 'next level' experiment.

      Rev 1

      Ok, for not doing the structure, but this is just part of several comments. They have to address very carefully the comments and better control the study overall.

      Significance

      Could be significant and new if adequately demonstrated. The study is preliminary Could be significant in the filed of biochemistry and epigenetics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Uttley et al., describes the identification of a candidate sequence for enhancing craniofacial sox9 expression in Neanderthals and offers functional genomics evidence towards identification of candidate sequence variants in a cis regulatory element (CRE) responsible for jaw morphology variation in hominin evolution. They generated a transgenic zebrafish model for testing the activity of a previously characterised regulatory element in human, which when mutated causes Pierre Robin developmental disorder and its neanderthal counterpart which has been identified as a candidate enhancer by sequence similarity and by being a DMR in the Neanderthal genome. They show that the Neanderthal CRE is active similarly in distribution to its human counterpart but with elevated activity in anatomically loosely or unspecified cell types in zebrafish cartilaginous neural crest candidates, which they argue are matching the cells where the same enhancer is active in mammalian development. They then show by single cell transcriptomics the cell distribution for the enhancer activity in relation to neural crest subpopulations and trasncription factors involved in craniofacial development. Finally they carry out overexpression of SOX9 with the human enhancer variant in zebrafish and demonstrate morphology changes which they interpret as evidence towards the capacity of the enhancer to broaden mesenchymal condensations leading to change in jaw morphology.<br /> Taken together, the paper provides evidence for a predicted neanderthal regulatory element candidate to function as enhancer in a zebrafish model and evidence for this enhancer to carry sequence variation which can lead to overactivation in craniofacial cell types relevant to jaw morphology, which the authors interpret as the source of the cis regulatory mechanism for jaw morphology evolution in hominin evolution.

      Main comments:

      I found the conclusion on the functional divergence of sequence variants of Neanderthal v human enhancer convincing as they were provided by an elegant double reporter approach which offers internal control for variant comparison. However, i found the argument about the role of the sequence variant in craniofacial development less convincing

      1. Setting the aims I found the introduction to the topic and the setting of aims somewhat sketchy. It is not clear from the introduction, why the Neanderthal element was chosen for further study and why the SNVs in this one element were worth pursuing in the lack of broader understanding of the potentially complex regulatory element complexity at the Neanderthal Sox9 locus. While it is a very reasonable assumption, that a key CRE found and well characterised in human (by the authors in their seminal paper) is a worthy candidate for functional assessment, without better understanding of the overall locus conservation between human and Neanderthal this element may be one of many functionally redundant elements.
      2. Justification of the fish model in hominin gene regulation

      2.1. For the neanderthal element function to be compared to human in a valuable and informative fashion, one would expect that the host system i.e. the zebrafish is sufficiently conserved by offering a similar developmental context both in terms of gene regulation and in terms of anatomy. From the gene regulation perspective, i would expect that the analysis of the EC1.45 is based on expectation of similar regulatory information content to that in the fish homolog thus one can expect similar TF network activities on them and as a result one an test sequence variation effects relevant to endogenous regulatory interactions both in fish and hominins. However, there is no data shown for the relevance of fish regulatory background as a test system. No information is provided on the fish sox9 locus and its activity, or whether the fish homolog enhancer (or any sox9 enhancer that is expressed in the expected domains of craniofacial lineages and structures) has been identified and how it compares to the hominins. One expects that the hominin enhancers are active in domains of the zebrafish sox9 for the anatomical structures to give relevant readout. I would expect a comparison and match of the EC1.45 activity to ether endogenous sox9 by WISH or (although less accurate) a cross to one of the several sox9 reporter transgenic lines available on ZFIN.

      2.2. There is an argument about the regulatory networks being conserved (without references), this would need more arguments particularly in the context of Sox9/SOX9 regulation. 3. Further to the justification of the fish model, from the anatomical perspective, the assessment of the parallels of zebrafish and mammalian craniofacial development need strengthening.

      3.1. While indeed transparency and external development helps the reporter transgenesis and argues for the fish model, but the generation time is actually comparable to mouse (in contrast to the statement in the introduction), however the understanding of zebrafish craniofacial development and its similarity to human is not well argued, and indeed very superficially compared in the manuscript. I found the anatomical analyses to be rather imprecise and difficult to compare. In the lack of direct comparisons and diagrams comparing mammalian and fish developmental structures and their origins, the statement of 'EC1.45 activity matches expression domains from mammalian development' or 'broadly recapitulate' to be an oversimplification and overstatement. The lineage tracing is an important evidence but again the anatomical homologies need to be more clearly visualized and the lineage history better explained.

      3.2. In a similar vein, direct comparison of human and Neanderthal adult morphologies (Figure 1B) would be very helpful.

      3.3. I was also confused why the sox10 reporter is used as reference (with no direct overlap of activity to the SOX9 associated EC1.45 reporter) rather than or alongside a sox9a reporter line or even comparison to endogenous sox9a activity by WISH (Figure 2). The anatomical details in Figure 2 would need to be extended with more precisely describing the cell types, where the transgene is active and how the homology to mammalian anatomies are established.

      3.4. Overall, the use of the fluorescence reporter is helpful for initial assessments but accurate enhancer activity profiling and comparison should be done by WISH, as mRNA is far more likely to follow the temporal activation dynamics and may explain fluorescence signal intensity differences, the latter important for correct interpretation of sequence variant effects (e.g. is the perceived higher expression by the Ne element is perhaps due to longer expression or earlier activation). 4. Single cell transcriptomics This experiment was not only used to characterise transgenic reporter active cell types, but to establish transcription factor candidates relevant to neural crest differentiation regulated by EC1.45. What is somewhat confusing, is that the EC1.45 element activity domain is only partially and not predominantly overlapping with the twist1a expressing cells. The authors previously established Twist1 as key regulator of EC1.45 in craniofacial development. How do the authors explain the apparent little relevance of twist1a in regulating the enhancer in fish? Overall the lack of any attempt to link the SNVs to TFBS (including, if available that of the fish homolog sequences) is making the interpretation of the sequence variation harder. BTW, even of the fish elements are not directly identifiable by direct sequence alignment it may be possible to identify the fish homolog through phylogenetic footprinting with stepping stone species such as the non-duplicated paddlefish. 5. Sox9 overexpression This experiment seems not to add too much to the main claim of the paper. While not essential, for this data to add more value, a comparison to that using the Neanderthal element would be more interesting and not a difficult experiment to carry out. 6. Throughout the paper there is a lack of data on reproducibility of reporter activities. As random integration often leads to position effects, it is expected that more than one lines showing the same patterns is used to identify cell type and tissue specificities. This is lacking in the paper and is a concern, as for example, the human element activity in Fig. 1 appears to be different from that by in the dual reporter shown in Fig. 3.

      Minor points

      A request to the editor as much as the authors: please make sure that legends are on the same page with figures, it is very hard to follow manuscripts when one needs to scroll between 3 pages at the same time (text, figure, legend). This archaic separation inherited from decades ago when physical prints used to be submitted has no justification in the digital era but continues to make reviewer's life difficult. Similarly, there should be no limit, and it should be encouraged to label anatomical structures directly on panels to point out expression domains, highlight expression variation, or to make a panel more self-explanatory, while making sure that clarity is not lost.

      Figure 1A does not support the statement it is referenced to

      Figure 1B should include human anatomy in comparison and perhaps a schematic diagram of the hypothesized developmental morphogenesis divergence modelled in this paper

      Figure 1D should show why the authors argue the neanderthal is not the ancestral state (BTW, what does the fish homolog look like?)

      Figure 4A,B are better suited in Supplemental

      Significance

      Conceptual: identifying sequence variants in Neanderthal cis-regulatory element as potential source of evolutionary change in morphology.

      Technologically mostly following prior art, use of single cell in reporter analysis is technologically improvement on current standards, albeit somewhat rudimentary.

      The use of a tractable embryo model to explore a regulatory sequence change leading to morphology change has often been applied for carious aspects of evolutionary changes during development pioneering examples include the shh ZRA enhancer in fin/limb morphogenesis, or balean fin evolution (PMID: 9860988) or human versus ape hand evolution (PMID: 18772437), but this is the first for applying it to hominin evolution. This will be of interest to human geneticists, evolutionary geneticists and developmental geneticists.

      My expertise is in developmental gene regulation with the zebrafish model.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors provide evidence that nucleotide sequence variants in a remote enhancer, E1.45, which is located 1.45 Mb upstream of the Sox9 promoter, probably contributed to subtle morphological differences in the lower jaws of Neanderthals and modern humans. The study employs the use of a cleverly-designed dual reporter gene for directly comparing the activities of the Neanderthal and modern human enhancers in transgenic zebrafish. The results are clear and convincing: the Neanderthal enhancer is significantly more active than the modern human enhancer.

      Here are a few minor recommendations that might help clarify aspects of the study:

      1. Is it possible to quantify the different enhancer activities in the zebrafish assays? Is it strictly a question of levels or are there also subtle differences in the timing and/or sites of expression during development?
      2. Is the Neanderthal form of the E1.45 enhancer ancestral for the hominids? If so, then reduced expression in modern humans is a derived trait. This could be stated more clearly.
      3. Are there potential transcription factor binding motifs associated with the SNVs?

      Significance

      The authors address one of the most compelling problems in biology: the evolutionary origins of modern humans. This study addresses the role of regulatory DNAs in the divergence of Neanderthals and modern humans. Sox9 is a good focus of study since it has been implicated in the development of craniofacial features in humans. The authors identified three SNVs (single nucleotide variants) in Neanderthal vs. modern human E1.45 enhancer sequences. Direct comparison of these enhancers provide compelling evidence that these SNVs cause upregulation of the Sox9 in Neanderthals. I think this is a very interesting finding and strongly endorse publication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting paper that is logical continuation of authors previous work characterizing a human enhancer mutation implicated in Pierre Robin malformations that alters Sox9 expression. Here using zebrafish as a convenient model organism, the authors test the activity of the human enhancer compared to its Neanderthal ortholog. The results show that both enhancers drive reporter expression in the vicinity of forming cartilage condensations of the jaw. While both enhancers mediate reporter expression in neural crest derived cells, the Neanderthal sequence drives quantitatively higher expression than the orthologous human enhancer. Consistent with this, overexpression of Sox9 using the human enhancer caused an increase in cartilage volume. Altogether, this is a nicely done study that would be appropriate for publication after some revisions as detailed below.

      Major Revisions:

      1. The introduction seems overly long and a bit rambling so diminishes from the excitement of the work. It should be half the length and focus on the novelty of this question and findings.
      2. The authors should demonstrate that that human EC1.45 activity overlaps with Sox9 expression. This should be included in Figure 2.
      3. There are differences in level of enhancer activity signal between figures (e.g. seems lower in Fig. 3 than Fig. 2). Does enhancer activity vary between embryos or was the imaging protocol different?
      4. Some co-staining should be performed to show whether or not the enhancers are active in the same cells but at different levels or if they are actually in different cells.
      5. There is an important issue with the single cell RNA seq. Given that the cells were FACS sorted for +GFP and +Cherry, there seem to be many negative cells in their scRNAseq data. Perhaps the FACS gates (figure 4B) were not conservative enough? Did negative cells get included? Authors should verify that their clusters express both GFP and Cherry transcripts.
      6. From their scRNAseq data, they talk about enhancer activity in PA1, but this isn't discussed/shown in the enhancer reporter embryos. It would be appropriate to annotate PA1 in figures 2 and 3.
      7. Authors should quantify how many Sox9+ cells also have enhancer activity. Looking at the UMAPs in figure 4E and 4F, it actually looks like there is less enhancer activity in the Sox9 dense regions of the clusters.
      8. For the over-expression of Sox9 driven by EC1.45, it is important to first establish that EC1.45 activity does indeed overlap with Sox9 gene expression. Does Sox9 itself drive EC1.45?
      9. Importantly the authors do not discuss if the Neanderthal SNVs lie in TF binding sites? Which TF motifs? Are they conserved? Are those TF's expressed in the same cells as both enhancers?
      10. If you introduce the Neanderthal SNVs into the human sequence, do you gain enhancer activity?
      11. The over-expression experiments are tricky as they cause major developmental defects. Would it be possible to drive Sox9 expression at levels that better reflect those driven endogenously by the human versus Neanderthal enhancer?

      Minor Revisions:

      1. Figure 1 - authors should highlight that panel C is a zoom in of panel A.
      2. Figure 3 - Why does Human EC1.45 activity looks weaker here than it does in Figure 2.
      3. The first sentence of the last paragraph in the Introduction is unclear: "spatiotemporal developmental expression patterns for the human EC1.45 cluster during zebrafish development". Instead should read "reporter expression driven by the human EC1.45 enhancer over developmental time"

      Significance

      This is a nice paper that advances understanding of jaw development and has disease relevance as well as some evolutionary implications. Thus it is novel and would appeal to developmental biologist, the craniofacial community, and to some extent to evolutionary biologists.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02588

      Corresponding author(s): Frederic SALTEL

      __1. __Point-by-point description of the revisions

      Reviewer #1:

      Invadosomes are dynamic, actin-based structures that enable cells to interact with and remodel the extracellular matrix (ECM), playing a crucial role in tumor cell invasion and metastasis. Prior studies by the authors and other groups have established the formation, activation, and appearance of invadosomes. This study demonstrates the following:

      1. Key elements of the translation machinery and endoplasmic reticulum (ER) proteins are constituents of the invadosome structure.
      2. Specific proteins are associated with distinct invadosome structures.

      The researchers utilized two cellular models (NIH3T3-Src and A431 melanoma cell line) and Tks5, a specific invadosome marker, for immunoprecipitation and mass spectrometry, validating the results through fluorescent images, electron microscopy, and time-lapse live imaging.

      Major Comments

      The manuscript is well-written, with a clear and detailed experimental workflow. Compared to their previous seminal work that first demonstrated invadosomes concentrate mRNA and exhibit translational activity using NIH3T3-Src cells, this study adds details about the specific enrichment of translation proteins for each type of invadosome and the presence of ribosomal and ER proteins. However, the experiments do not further enhance our understanding of the intricate mechanisms linking invadosome structures, function, and translation factors.

      Further experiments are needed to better demonstrate the hypothesis of active translation within these structures, including the use of additional cellular models.

      To demonstrate the hypothesis of active translation within these structures, we performed the same translation inhibition experiments, using CHX in additional cellular models. Indeed, these experiments were performed on MDA-MB-231 breast cancer cell lines, as well as on Huh6 liver cancer cell lines. Degradation experiments showed the same results as for NIH-3T3-Tks5-GFP and A431-Tks5-GFP, since we were able to observe a significant decrease in the degradation capacities of cells in the absence of translation (see graphs below).

      Left: Quantification and representative images of ECM degradation properties of Huh6 cells on gelatin treated (CHX) or not (DMSO) with cycloheximide. Gelatin is stained in green and nuclei in blue. Values represent the mean +/- SEM of n=4 independent experiments (15 images per condition and per replicate) and were analyzed using student t-test.

      Right: Left: Quantification and representative images of ECM degradation properties of MDA-MB-231 cells on gelatin treated (CHX) or not (DMSO) with cycloheximide. Gelatin is stained in green and nuclei in blue. Values represent the mean +/- SEM of n=4 independent experiments (15 images per condition and per replicate) and were analyzed using student t-test.

      The authors should also investigate the effects of Tks5 silencing on ER-associated translational machinery.

      The effects of Tks5 silencing on the ER-associated translation machinery were investigated using a SunSET experiment. We were able to demonstrate that Tks5 silencing had no significant impact on translation in both cellular models since no translation modification was observed between control and siTks5 conditions.

      Quantification and relative western blot analysis of the effect of Tks5-targeting siRNA treatment on A431 and NIH-3T3-Src cells by using puromycin quantification. Values represent the mean +/- SEM of n=4 independent experiments and were analyzed using Anova.


      How do the authors propose Tks5 is linked to these proteins? Directly or indirectly? Focusing on specific proteins night provide an opportunity to study the molecular mechanisms in greater depth.

      Tks5 is a scaffold protein, a multi-domain “bridging molecule” that serve as regulators by simultnneously binding multipe molecular partners. TKs5 contain a PX domain and 5 SAH Domains. Consequently, Tks5 can bind different partners. Moreover, as focal adhesion, invadosome are large macromolecular assemblies. Here, in this study, Tks5 serve as a specific molecular hook, to precipitate partners. At this step, there is no evidence of a direct or indirect link of the translational machineray with Tks5. Even if we can hypothetize un indirect connection. In this version we focused more precisely on a specific and common Tks5 partners, such as EIF4B.

      They used chemical inhibitors and siRNA approaches to assess the role of specific players, such as EIF4B, in the proteolytic activity of invadosomes, which can be considered proof of concept. Additional experiments aligning the results with the involved pathways would add molecular details and enhance the manuscript's significance. Resolving these issues is crucial for the manuscript to meet the publication standards for contributing novel and impactful insights to the field.

      To better understand the variation of the pathways involved, we first wanted to observe the impact of Eif4b silencing on active translation in both cellular models. To do this, we performed SunSET experiments in both cell models. An experiment was performed for the A431 cell line and the results seem to show little difference between control conditions and conditions in the presence of siEIF4B. Conversely, SunSET experiments in the NIH 3T3 Src cell line show an increase in translation in the presence of siEIF4B.

      __ __

      Quantification of the effect of cycloheximide (CHX) and EIF4B-targeting siRNA (siEIF4B #1 and #2) treatment on A431 and NIH-3T3-Src cells by using puromycin quantification. Values represent the mean +/- SEM of n=1 independent experiment for A431 or n=2 independent experiments for NIH-3T3-Src.

      In order to better understand the variation of the signaling pathways involved, spectrometry experiments were performed to compare the variation of the pathways in control conditions and in the presence of siRNA against EIF4B. These results allowed us to provide a better understanding of the variability of the pathways and therefore of the mechanism of action.

      Volcano plot of overexpressed and underexpressed proteins after silencing of the EIF4B protein identified by mass spectrometry analysis.

      These mass spectrometry experiments allowed us to highlight that the pathway mainly impacted during Eif4b depletion was the Hras pathway. However, this information is given for information purposes only. It would be necessary to look more closely at the Hras pathway to understand what the link with EIF4B and therefore the link with the formation of invadosomes could be.

      Table of translation-related proteins or proteins involved in the formation or function of invadosomes that are overexpressed or underexpressed in at least one siRNA of EIF4B.

      These experiments also allowed us to highlight that the depletion of EIF4B directly impacts the translation pathway by modulating translation initiation factors as well as ribosomal proteins but also proteins involved in the formation and function of invadosomes such as ADAM17, ACTR5, IGFBP6 RPL22 and RPS6KA5 proteins (see table below). It will be necessary to validate these data and determine their specificity due to the fact that some other proteins appear under-expressed like IGFBP3 and ADAM19. To conclude, to fully understand the exact impact of EIF4B into this process, additional investigations are necessary.

      __ __Minor Comments :

      A more detailed discussion of the implications of their findings within the broader context of cancer cell signaling and the potential impact on related cancer research areas would further advance our understanding in this area.

      This part was added in the new version of the discussion. Indeed, deregulation of the translation is now a hallmark of cancer. This notion is now present in the manuscript and concluded the discussion (see page 12).

      Reviewer #1 (Significance (Required)):

      General Assessment:

      This study offers novel insights into a new function of the invadosome-specific player Tks5 as a molecular crossroad between ER-related translation proteins and invadosomes. The authors suggest that Tks5 could act as a scaffold, supporting the rapid clustering of translation-related proteins during invadosome formation or proteolytic activity. However, a major limitation is the lack of mechanistic exploration. The results do not elucidate how Tks5 mediates the recruitment of these proteins or the specific molecular mechanisms involved.

      Advances: The study extends knowledge in the field by confirming the presence of specific markers linked to different invadosome structures and demonstrating the Tks5 interactome's association with translation machinery.

      Audience: This study will primarily interest specialists working on invadosomes and, secondarily, those interested in cancer cell signaling, invasion, and metastasis.

      Field of Expertise: Invadosome and related signaling pathways in cancer.

      __ __


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary In this work, Normand and her colleagues analyze and compare the interactome of the key invadopodia component, TKS5 (overexpressed as a GFP-tagged protein), in two transformed cell models cultured on different substrates. Potential TKS5 interacting partners are identified including previously known and validated TKS5 interactors, some known to contribute to the mechanism of invadopodia formation and function. Bioinformatic (GSEA) analysis reveals a specific enrichment for proteins related to protein translation and interaction with ER-associated ribosome machinery. Evidence is presented that some of these proteins (RPS6, a component of the 40S ribosomal subunit, and translation factor, EIF4B) localize to TKS5-positive invadopodia in Src-transformed cells. Experiments based on translation inhibitor, cycloheximide, and silencing of EIF4B factor could demonstrate a link between overall protein translation and invadosome formation. Live cell imaging and microscopy analysis of fixed samples could document some proximity between the endoplasmic reticulum network and invadosome rosettes.

      Major comments

      __ __1- In the Results Section, the IP/proteomics-based pipeline used by Normand and colleagues to identify TKS5 partners is not clearly described and is confusing. Cut-off used to select the proteins in the different classes summarized in Table S1 should be better described. In addition, the nomenclature of the different protein subgroups used in Table S1 is confusing (see minor point#5).

      Details have been added in the results section regarding the IP/proteomics section to complete the materials and methods section. As described in the materials and methods section, control versus IP data were quantified by an enrichment ratio ≥ 2. These criteria are the most classically used in the practices analyzed.

      For clarity, additional tables have been added for each category (A431/NIH plastic or collagen) and gene names, protein descriptions and abundance ratios have been indicated (Supp table 2, 3, 4 and 5).

      2- The effects of cycloheximide treatment or EIF4B silencing on gelatin degradation are clear and convincing. However, these are correlative evidence, and they may reflect a general implication of protein translation in the control of invadopodia function. A direct link between the observed interactions of TKS5 with the protein translation machinery and the formation and/or function of invadopodia is missing.

      To demonstrate the direct links between Tks5 and the translation machinery, a fluorophore was used to visualize active translation within invadopodia. We were able to highlight an active translation localized in the rosettes (see figure below). Indeed, we can observe a localized translation within the rosettes. However, these same results were not observed in linear invadosomes where we could not observe any localized translation. We can however hypothesize that it is more difficult to observe a localized translation in linear invadosomes which are much smaller structures than rosettes.

      Confocal microscopy images of NIH-3T3-Src cells. The cells were stained for B-actin RNA in green, B-actin in red, nuclei in blue and actin in grey. Scale bar: 20µm, zoom: 5µm.

      In order to provide additional elements to show the link between Tks5 and the translation machinery, we performed immunofluorescence experiments by labeling the Sec61 protein. Sec61 is a well-described ER marker that allows the insertion of proteins into the ER but is also a key player in the docking of ribosomes to the ER. We were able to highlight the colocalization between Tks5 and Sec61 in all types of invadosomes, allowing to show the link between the Tks5 protein and the translation machinery. These images were inserted in the manuscript (see Figure 6b).

      Confocal microscopy images of NIH-3T3-Src and A431 cells. The cells were seeded on gelatin or type I collagen and stained for Sec61 in red, nuclei in blue and Actin in grey. Scale bar: 20µm, zoom: 5µm.

      __ __3- Images showing the interrelations between the ER and the adhesive podosome rosettes are striking (Figure 5). Src-transformed cells forming invadosome rosettes when in contact with the collagen substratum change shape and produce adhesive protrusions towards the substratum. As the ER is a huge compartment that fills the entire cytoplasm, it is maybe not so surprising to observe the ER filling the protrusions and getting close to the rosettes at the tip of these membrane extensions. Again, these observations are essentially correlative and there is no prove of some direct contact between some ER regions and the invadosomes.

      For clarity, the contrast of the images has been improved. Thus, time-lapse imaging clearly demonstrate that the ER is not present in all the cytoplasm but is enriched in the destination of the rosettes as well as in the rosettes. Moreover, this is not systematic with all invadosome rosettes (see video 1)

      4- Overall, this report is lacking a clear hypothesis or model of what could be the consequence of the interaction of TKS5 and the translation machinery on the formation and/or the activity of the invadosomes in transformed cells.

      We performed a sunset experiment to analyze the impact of Tks5 depletion into translation. No variation of global translation was observable in the absence of Tks5 (see results below). Tks5 depletion block invadosome formation. So, the impact on total translation activity cannot be measurable at the cell level, suggesting that invadosome recruit a specific translation machinery. Indeed, even if we obtained a good percentage of Tks5 depletion, around 90%, the impact in total translation activity is not quantifiable. However, we noticed that some specific translation actors are modulated and specifically localized into invadosome structures suggesting that it is more a question of localization and local translation of specific mRNAs, and not a global modification. This is consistent with the fact that Tks5 expression is not altered during tumor cell invasion, and it is just recruited and activated at specific sites to form these invasive structures.

      Thus, in this paper, Tks5 only served as an anchor point in order to be able to extract the specific molecular machinery and specific translational actors.

      Quantification and relative western blot analysis of the effect of Tks5-targeting siRNA treatment on A431 and NIH-3T3-Src cells by using puromycin quantification. Values represent the mean +/- SEM of n=4 independent experiments and were analyzed using Anova.

      Minor comments

      1- Discussion Section (page 2). The statement that TKS4 is involved in ECM degradation in podosomes only and not in invadopodia is not correct. TKS4 knock down has been shown to interfere with ECM degradation in Human DLD1 colon cancer cells (Gianni et al. SCIENCESIGNALING Vol 2 Issue 88, 2009) and in in mouse and human melanoma cell lines (Iizuka et al. Oncotarget, Vol. 7, 2016). In addition, an unphosphorylable mutant form of Tks4 blocked invadopodia formation and ECM degradation in Src-transformed DLD1 cells (Gianni et al. Molecular Biology of the Cell Vol. 21, 4287- 4298, 2010). We (this reviewer's team) reported that TKS4 was associated with cortactin-positive invadopodia in MDA-MB-231 and Hs578T triple-negative breast cancer cell lines (Zagryazhskaya-Masson et al. J. Cell Biol. 219, 2020).

      The involvement of TKS4 protein in extracellular matrix degradation has been changed in the text (page 2).

      2- Discussion Section (page 3). A431 is wrongly referred to as a melanoma cell line; it is a human epidermoid carcinoma cell line.

      The text has been modified according to the recommendations, the A431 cell line has been designated as a human epidermoid carcinoma cell line.

      3- Results Section (page 4 & 5). The authors compare the proteins they identified as potential TKS5 partners to previously published data by Stilly et al. (based on TKS5 IP like in the present study) and Thuault et al. (TKS5 bioIB). Additionally, authors should mention and discuss previously published data based on TKS5 coIP experiment and Mass Spec analysis similar to the present study, identifying potential TKS5 partners; some of which were similarly found in the present study including proteins involved in translation and ribosome function although these were not the focus of this work (several 40S and 60S ribosomal proteins, see Zagryazhskaya-Masson et al. J. Cell Biol. 219, 2020).

      This comparison is now present int the text of the manuscript (page 10).

      4- Figure 1b. Matrix degradation is not visible in association with the invadopodia in selected high magnification images in Figure 1a and 1b.

      Matrix degradation is indeed not visible in association with invadopodia in the selected high magnification images. Indeed, the imaging techniques used, Interference Refection Microscopy (IRM) do not allow us to observe matrix degradation at the invadosomes, since the reflection also highlights the cells. The aim here was to show only the presence collagen fibers that correspond to inducer of linear invadosome reorganization. It is widely accepted that all these structures are capable of degrading the extracellular matrix.

      5- Supplemental table 1. The names of the different lists of proteins in the summary table is not clear and is rather confusing.

      For clarity, additional tables have been added for each category (A431/NIH plastic or collagen) and gene names, protein descriptions and abundance ratios have been indicated (Supp table 2, 3, 4 and 5).

      6- Supp Figure 1. Please define what is the sample named 'D' (Delta).

      The Delta sample corresponds to the material that was not attached to the bead.

      7- Results Section (page 5). 'These experiments confirm the correct co-localization between Tks5 and the proteins identified in Tks5 interactome by mass spectrometry analysis.' This statement is too general; in fact, data validate only colocalization between TKS5 and some identified partners, namely CD44 and MAP4.

      To be less general, this statement has been modified in the text to show that the data only validate colocalization between TKS5 and certain identified partners, namely CD44 and MAP4.

      8- Figure 2e and Figure 3. It would have been nice to show the colocalization of selected proteins and TKS5 in association with collagen fibers to validate that enrichment occurs at matrix/cell contact sites and corresponds to bona fide invadopodia.

      As commented above, the reflection highlights the collagen fibers but also the cells. Thus, it is complex in this case to show the colocalization of the selected proteins in association with the collagen fibers with this approach. The other possibility is to stain collagen fibrils, however this kind of approach reduce the quality of interaction between fibers and associated receptors inducing a decrease of linear invadosome formation.

      9- Figure 3c (high mag insets). TKS5 and EIF4b do not seem particularly enriched in invadopodia rosettes as compared to the rest of the cytoplasm.

      Indeed, we can observe on this image a colocalization of Tks5 and EIF4B in the rosettes without showing an enrichment.

      However, the enrichment of EIF4B remains clearly visible in the linear invadosomes and the dots.

      10- Figure 4c-f. Treatments (i.e. CHX, siEIF4b) affect gelatin degradation. It would be interesting to assess the capacity of cells to form invadopodia under these conditions.

      As demonstrated in this study, the CHX treatment and EIF4B depletion affect the degradation of gelatin. In addition, we were able to show that CHX only impacts the formation of rosettes on gelatin (Figure 4a, 4b and Supp 3).

      Moreover, we added in the manuscript the impact of siEIF4B on invadosome formation (Supp Figure 3g). We show that it affects the formation of rosettes as CHX, but also affects the formation of linear invadosomes on collagen by A431 cells.

      Quantification of the numbers of invadosomes per cell on gelatin and collagen silencing (siEIF4B) or not (DMSO) for EIF4B in A431-Tks5-GFP and NIH3T3-Src-Tks5-GFP cells. Values represent the mean +/- SEM of n=4 independent experiments (10 images per condition and per replicate) and were analyzed using student t-test.

      Reviewer #2 (Significance (Required)):

      This study confirms and adds to a previously published report by this research group based on invadosome laser capture microdissection and proteomics revealing that invadosomes contain specific components of the translational machinery, and that protein translation activity is required to maintain invadosome structure and activity (Ezzoukhry et al. Nat Commun 2018). It also adds to a recent study that established a crucial role for ribosome biogenesis in promoting cell invasion in the C. elegans anchor cell invasion model (Development. 2023).

      The experimentation presented in this paper is of good quality and convincingly support the authors conclusions of a link between the ER-associated translation machinery and invadosome function in transformed cells. Overall, although this study adds to the emerging idea of an evolutionary-conserved translational control of cell invasion through the extracellular matrix it is mostly correlative and lacking a direct prove that the interaction of TKS5 with components of the translation machinery has a direct contribution to invadopodia function.

      __ __


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: To invade the surrounding extracellular matrix (ECM), cells organize actin-rich cellular membrane structures capable of ECM degradation, called invadosomes. Depending on the composition and organization of the ECM, cells organize their invadosomes differently. The authors aimed to identify specific and common components of different types of invadosomes: rosettes formed by NIH3T3-Src cells seeded on gelatin, dots formed by A431 cells seeded on gelatin, and linear invadosomes formed by NIH3T3-Src and A431 cells when seeded on fibrillar collagen I. For this, they generated cells stably expressing GFP-Tks5, a ubiquitous constituent of invadosomes, and determined its interactome. They identified 88 common proteins, among which the protein translation machinery was enriched. Whereas general protein inhibition impaired only rosette formation and impaired every type of invadosome-associated degradation, EIF4B inhibition inhibited the formation of every type of invadosomes. They then analyzed the impact of the ER on invadosome formation and degradation activity. First, they documented the presence of the ER in the center of the NIH3T3-Src rosettes and correlated ER presence with rosette initiation and persistence. They then demonstrated that chemical inhibition of Sec61 translocon decreased formation of invadosomes in general.

      Major comments:

      1- The authors use cells overexpressing GFP-Tks5 for their analysis of Tks5 interactome in the different invadosomes (Fig. 2). The impact of GFP-Tks5 overexpression on invadosome formation and degradation activity should be mentioned.

      Depending the cell type the TKS5-GFP overexpression do not increase the number of invadosomes but increase the matrix degradation activity (Di Martino et al 2014); or could impact the number of invadosomes as in B16 cell line (Shinji Iizuka et al, 2016). This point was added in the introduction.

      However, the Tks5 overexpression was used fo immunoprecipitation and mass spectrometry analysis. The rest of the study and targets validation are done on wild type cells.

      2- Concerning the analysis of the mass spectrometry (MS) data, clarifications would be appreciated:

      a. The authors first "determined the specific molecular signature associated with each invadosome organization" (p.4). As I understand it, the proteins in each of these signatures correspond to proteins identified only in a particular type of invadosomes, not in the others. Could the authors indicate the percentage of the total proteins identified for each type of invadosomes that corresponds to the specific molecular signature?

      The meaning of the sentence has been changed in the paper to provide more understanding. The term "molecular signature" has been replaced by "specific proteins". Percentages have been added to the tables in Figure 1 Supp.

      1. __ __ The GSEA pathways related to each of the specific molecular signature were then analyzed and the authors "commonly identified an enrichment in mitochondrial, ER and Golgi proteins" (page 4) (Supp Fig 1c,e,g). Could the authors provide numbers/percentage/statistics? It is not clear to me whether the biological processes (Supp Fig 1b,d,f) are derived from the analysis of the specific molecular signature or of the total proteins identified for each type of invadosomes. Could the authors clarify this point? The percentages of each specific protein category have been added in Figure 1 Supp.

      The biological processes (Supp Fig 1b, d, f) arise from the analysis of the molecular signature common to the 4 invadosomes conditions, namely the dots, rosettes and linear invadosomes of A431 and NIH-3T3-Src. Thus, the biological processes arise here from the 88 proteins commonly identified for all types of invadosomes.

      1. The authors also identified "translation proteins" enriched in the specific molecular signature of each type of invadosomes (p.4). They commented on this category, indicating that each type of invadosome contains a specific set of translation-related proteins. This is true, but according to my analysis of the provided tables, the same applies to the other categories as well. Could the authors comment this point? Indeed, some proteins involved in translation can appear specific or common depending the type of invadosome. Our comment is at this step, only suggest that some of this protein should be specific for invadosome and some could be associated to only one organization. Of course, the role of each protein needs to be investigated.

      2. Would similar categories of proteins (translation, ER, Golgi, mitochondrial) appear as enriched if the Tks5 interactome was analyzed as a whole for each type of invadosomes? (the authors may disregard this comment if comment a. is inaccurate). Protein pathways enriched in the different type of invadosome differ, for example, Protein activity GTPase activity, vs cell adhesion molecule binding or hydrolase activity acting on Acid Anyhdrides. This analysis demonstrates and highlights differences between the different invadosome organization. However, we focus on translational proteins, ER proteins for example and calculated the percentage of protein identified and associated with this different structure. We can notice important difference as 3% of translation proteins for rosette vs 9 % for dots in A431 cells. This point suggests that the part of each element can differ.

      3. __ __ The authors identified that "cell adhesion proteins" are specifically enriched in linear invadosomes (page 4) (Supp Fig 1f). This conclusion appears to be based on the analysis of NIH3T3-Src and A431 cells. Could the authors provide more details on how this analysis was performed? Specifically, was the analysis conducted on a mixture of the specific signatures of each of the 2 cell models, or on their shared proteins? Additionally, is this category still enriched if each linear invadosome model is analyzed separately? The analysis was performed on common proteins of linear invadosomes, grouping the two cellular models. The category "cell adhesion protein" is not specifically enriched in linear invadosomes because adhesion proteins are also found in the other groups. However, this category represents a larger percentage in linear invadosomes, thus justifying our choice to highlight it for this category.

      4. __ __ The authors identified 88 proteins common to all types of invadosomes (Fig. 2b) and classified them as validated or not in invadosomes. Could the authors give details on the criteria used for this classification? References for the already validated proteins should also be provided. RTN4 has been described as partially localized at invadopodia formed by MDA-MB-231 cells in Thuault et al., yet the authors classified it as not validated in invadosomes. The RTN4 protein has been moved to the category of proteins identified as localized in at least one invadosomes organization, thank you for this precision.

      Please find below the list of papers having among the proteins classification as identified in at least one invadosomes organization, based on literature searches.

      ADAM15 : Aspartate β-hydroxylase promotes pancreatic ductal adenocarcinoma metastasis through activation of SRC signaling pathway - Ogawa et al 2019

      ADAM19 : The Adaptor Protein Fish Associates with Members of the ADAMs Family and Localizes to Podosomes of Src-transformed Cells - Abram et al 2003

      ASPH : Aspartate β-hydroxylase promotes pancreatic ductal adenocarcinoma metastasis through activation of SRC signaling pathway - Ogawa et al, 2019

      BAG3 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      CALD1 :

      • Caldesmon is an integral component of podosomes in smooth muscle cells - Eves et al, 2006
      • Caldesmon is an integral component of podosomes in smooth muscle cells, Gu et al 2007
      • Changes in the balance between caldesmon regulated by p21‐activated kinases and the Arp2/3 complex govern podosome formation, Morita et al 2007 CD44 :

      • The CD44s splice isoform is a central mediator for invadopodia activity, Zhao et al

      • CD147, CD44, and the Epidermal Growth Factor Receptor (EGFR) Signaling Pathway Cooperate to Regulate Breast Epithelial Cell Invasiveness, Grass et al, 2013
      • CD44 and beta3 integrin organize two functionally distinct actin-based domains in osteoclasts, Chabadel et al, 2007
      • Macrophages podosomes go 3, Goethem et al 2011 CTTN : ERβ promoted invadopodia formation-mediated non-small cell lung cancer metastasis via the ICAM1/p-Src/p-Cortactin signaling pathway - Wang et al, 2023

      EIF4B : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      FNBP1L : Transducer of Cdc42-dependent actin assembly promotes breast cancer invasion and metastasis - Chander et al, 2013

      FXR1 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      G3BP1 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      HNRNPA1 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      IGF2BP2 : IMP2 and IMP3 cooperate to promote the metastasis of triple-negative breast cancer through destabilization of progesterone receptor - Kim et al, 2018

      ITGA5 : Membrane Proteome Analysis of Glioblastoma Cell Invasion, Mallawaaratchy et al, 2015

      LAMP1 : Lysosomal cathepsin B participates in the podosome-mediated extracellular matrix degradation and invasion via secreted lysosomes in v-Src fibroblasts - Chun Tu et al, 2008

      MAP4 : A proximity-labeling proteomic approach to investigate invadopodia molecular landscape in breast cancer cells, Thuault et al, 2020

      MMP14 :

      • Receptor-type protein tyrosine phosphatase alpha (PTPα) mediates MMP14 localization and facilitates triple-negative breast cancer cell invasion - Decotret 2021
      • Deciphering the involvement of the Hippo pathway co-regulators, YAP/TAZ in invadopodia formation and matrix degradation - Venghateri 2023 MYH9 :

      • TRPM7, a novel regulator of actomyosin contractility and cell adhesion 6 Clarck et al, 2006

      • Bradykinin promotes migration and invasion of hepatocellular carcinoma cells through TRPM7 and MMP2, Chen et al, 2016 NONO : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      NPM1 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      PABPC1 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      PPP1CA : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      PRKAA1 : A proximity-labeling proteomic approach to investigate invadopodia molecular landscape in breast cancer cells, Thuault et al, 2020

      PTBP1 : The lncRNA MIR99AHG directs alternative splicing of SMARCA1 by PTBP1 to enable invadopodia formation in colorectal cancer cells - Li et al, 2023

      RPL10A : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      RPL34 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      RPS4X : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      RRBP1 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      RTN4 : A proximity-labeling proteomic approach to investigate invadopodia molecular landscape in breast cancer cells, Thuault et al, 2020

      SSB : The PDGFRα-laminin B1-keratin 19 cascade drives tumor progression at the invasive front of human hepatocellular carcinoma - Govaere 2017

      STX7 : Syntaxin 7 contributes to breast cancer cell invasion by promoting invadopodia formation, Parveen et al, 2022

      SYNCRIP : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      THBD : VEGF-Induced Endothelial Podosomes via ROCK2-Dependent Thrombomodulin Expression Initiate Sprouting Angiogenesis - Cheng-Hsiang Kuo - 2021

      YBX3 : Combining laser capture microdissection and proteomics reveals an active translation machinery controlling invadosome formation, Ezzoukhry et al, 2018

      1. __ __ Page 7, "In addition to translation proteins, the MS analysis highlighted the presence of ER-related proteins such as RTN4, LRRC59 or RRBP1 in all invadosomes linked with Tks5 (Figure 2c)". Is the "ER proteins" category enriched among the 88 common proteins? GSEA analysis on the 88 proteins showed an enrichment in proteins related to ribosomes and mRNA binding.

      2. __ __ The comparative analysis of the TKS5 interactome from NIH3T3-Src-GFP-TKS5 on gelatin (this study) with the proteome of NIH3T3-Src rosettes from Ezzoukhry et al. (Fig 5a and Supp Table 2) should be included in the analysis of the MS data obtained in this study (Fig 2), rather than in the paragraph "Recruitment of ER into invadosome rosettes". Are "ER proteins" enriched? Comparative analysis of the TKS5 interactome of NIH3T3-Src-GFP-TKS5 on gelatin (this study) with the proteome of NIH3T3-Src rosettes from Ezzoukhry et al. was included in Supp Figure 2.

      The proteins related to translation are enriched, but not those of the ER.__ __3- Was the localization of the newly identified Tks5 partners, such as RPS6 and EIF4B, but also MAP4 and CD44, to invadosomes analyzed in cells expressing endogenous levels of Tks5? If not, this should be addressed to rule out the possibility that their localization in invadosomes is linked to Tks5 overexpression. Through the figures, it is important to indicate whether cells overexpressing or not Tks5 were used.

      The precision on the overexpression of Tks5 has been added in the figures.

      The experiments were also carried out on cells not overexpressing Tks5 (see results below). Clarifications have been added in the article to specify that these experiments were carried out on cell lines overexpressing Tks5 but also on WT cell lines not overexpressing Tks5 (data not shown in the paper).

      Confocal microscopy images of A431 and NIH-3T36Src cells. The cells were seeded on gelatin or type I collagen and stained for Tks5 in green, actin in red, nuclei in blue and Eif4b in grey. Scale bar: 40µm, zoom: 10µm.

      Confocal microscopy images of A431 and NIH-3T3-Src cells. The cells were seeded on gelatin or type I collagen and stained for Tks5 in green, actin in red, nuclei in blue and RPS6 in grey. Scale bar: 40µm, zoom: 10µm.

      Confocal microscopy images of A431 and NIH-3T3-Src cells. The cells were seeded on gelatin or type I collagen and stained for Tks5 in green, actin in red, nuclei in blue and MAP4 in grey. Scale bar: 40µm, zoom: 10µm.

      4- EIF4B depletion inhibits ECM degradation (Fig 4e-f). The authors should address the impact of EIF4B depletion on invadosome formation. In other words, does EIF4B depletion corroborate the results obtained with CHX treatment, where only rosette formation is inhibited (Fig. 4a and Supp Fig. 3d).

      The impact of EIF4B depletion on invadosome formation was studied. We were able to show that EIF4B depletion partly corroborates with the results obtained with CHX treatment, since rosette formation is also inhibited by EIF4B depletion but linear invadosomes formed on collagen by A431 are also inhibited by EIF4B depletion.

      These results have been added to the paper (see Figure 3g).

      Quantification of the numbers of invadosomes per cell on gelatin and collagen silencing (siEIF4B) or not (DMSO) for EIF4B in A431-Tks5-GFP and NIH3T3-Src-Tks5-GFP cells. Values represent the mean +/- SEM of n=4 independent experiments (10 images per condition and per replicate) and were analyzed using student t-test.

      __ __5- The authors treated NIH3T3-Src-KDEL-GFP and LifeAct-Ruby cells with CHX and conclude that "translation inhibition led to the collapse of the rosette structure (Fig 6a, Video 4)" (page 8): could extra time points be added before T300 to appreciate the collapse of actin before the retraction of ER from the center of the rosette. No video 4 is provided. A video 5 is provided but does not correspond to a rosette collapse. The lifetime/dissociation rate of rosettes with and without CHX treatment should be determined.

      Live cell imaging has been performed by recording one image every 2 minutes as described in methods. Graphs represent all recorded points along the experiment however we modified scale of original graph included into the manuscript to better appreciate the dissociation of fluorescence intensity curves revealing the collapse of actin before the retractation of ER. We also added a second graph which confirmed our first interpretation.

      For video 4, we submitted the videos to make sure there were no errors. So, we can now clearly see the collapse of the rosette in video 4.

      Lifeact-mRuby and KDEL-GFP signals were recorded in NIH-3T3-Src cells treated with cycloheximide (CHX; 35µM)

      __ __6- Sec61 translocon inhibition by the chemical inhibitor ES1 decreases formation of dots by A431 and rosettes and linear invadosomes by NIH3T3-Src (Fig. 6b). Sec61 siRNA should be analyzed. Does Sec61 localize at invadosomes?

      Immunofluorescence on NIH-3T3-Src and A431 WT cell lines were performed and added in the paper showing the localization of Sec61 in invadosomes (Figure 6b). Currently, we did not test siRNA targeting Sec61.

      Confocal microscopy images of NIH-3T3-Src and A431 cells. The cells were seeded on gelatin or type I collagen and stained for Sec61 in red, nuclei in blue and Actin in grey. Scale bar: 20µm, zoom: 5µm.

      __ __Minor comments:

      1- The data of Figure 1 is not totally new, at least plasticity of NIH3T3-Src invadosomes has already been described in Juin A., MBoC, 2012. References to original work should be mentioned.

      Indeed, the reference has been added to the text at Figure 1.

      2- Page 4 "We realized immunoprecipitation against GFP in both cell lines on plastic and type I collagen conditions": the authors should show/mention that on plastic, cells behave has on gelatin coating.

      A sentence has been added to the text to mention this: "Indeed, on plastic, the cells behave as on a gelatin coating and thus form the same types of invadosomes, i.e. dots for A431 cells and rosettes for NIH-3T3-Src cells." (see page 4).

      3- The authors compared their MS data to previously published Tks5 interactomes (page 4) (Supp Fig 2a). A study from Zagryakhskaya-Masson et al (PMID: 32673397) identified Tks5 interactome of MDA-MB-231 cells generating linear invadosomes. Could the authors comment this study?

      This study shows that FGD1, a guanine nucleotide exchange factor for the Rho-GTPase CDC42 interacts with Tks5 and plays a role in the formation of linear invadosomes. We have added this reference in the manuscript, but we have not found FGD1 in our data. It is possible that the GEF of Cdc42 varies from one cell type to another. This study has been added to the discussion.

      4- The comparison of translation proteins found in this study with the ones found in other studies (Supp. Fig. 3 a) should be combined with the paragraph commenting the 88 common proteins (Fig. 2c-d).

      For clarity, we decided to separate these two parts. There is indeed a lot of information, so it seemed clearer to us to keep the structure of the figures in this sense.

      5- The table Supp Fig 2c listing the proteins present in each of the functional categories enriched among the 88 common Tks5 partners should be included as main figure or a color code representing the different biological processes should be included in Fig 2c.

      A color code has been added between the two tables. A sentence has been added in the legends for clarity: "Color codes are according to Table Supp Figure 2c: orange: translation, green: actin cytoskeleton, and blue: adhesion."

      __ __6- The SUnSET assay is not correctly untitled and described in the Material and Methods. Indeed, the paragraph refering to it is entitled "Inhibition of translation machinery present in invadosomes" and is a mixture of immunofluorescence and SUnSET protocols.

      The SunSET assay materials and methods were modified in the paper in the "Sunset Assay" section as described below:

      Sunset assay

      Cells were treated with puromycin (10mg/ml) during 10min at 37°C then washed twice in ice-cold PBS for protein extraction as described above in Western Blot section. For negative control we pre-treated cells with the translation inhibitor cycloheximide (35mM) during 10min at 37°C.


      7- Figure 4, the decrease in ECM degradation of A431 (GFP-Tks5) cells seeded on gelatin by CHX is not statistically different. The affirmation that "CHX treatment limited degradation activity by A431 and NIH3T3-Src cells on gelatin and collagen matrices" (page 6) should be modulated.

      Indeed, thank you for your observation. We realized that incorrect values had been reported. Statistical tests (t-tests) were redone for each CHX condition, and significant results were found for each condition.

      8- Page 8, "These results therefore confirm the presence but also the involvement of the ER in the rosette formation and maintenance over time". At this point in the study, there is a correlation between the presence of the ER and rosette persistence but no direct evidence of ER involvement is provided. The authors should moderate their conclusion.

      That's absolutely right, the sentence has been modified accordingly (page 8).

      9- Fig 5d: the authors should specify in the figure legend what are the red head arrows.

      The red arrows show membranes of the endoplasmic reticulum, present at the level of the invadosome rosette. This point was added in the figure legend.

      10- Some references are not correct. For example p.10, "MAP4 and LAMP1 were described in podosomes": ref 23 and 26 are studies on invadopodia, not on podosomes.

      Corrections have been made to the text, the term podosomes has been replaced by invadopodia (see section references).

      11- The authors indicate p.10, "Thanks to mass spectrometry experiments, we were able to show for the first time the presence of translation proteins in linear invadosomes". In their previous study Ezzoukry et al, they showed the localization of overexpressed Caprin1, eEF2 and eEF1A1 translation machinery components in linear invadosomes formed by NIH3T3-Src seeded on fibrillar collagen I. The authors should modulate their affirmations.

      Indeed, this sentence has been modulated in the text (see page 10).

      12- Could the authors refer to figures in the Discussion.

      References to figures were added in the discussion.

      Reviewer #3 (Significance (Required)):

      This work extends their previous work, Ezzoukhry et al, in which the proteome of rosettes of NIH3T3-Src was identified after laser microdissection. In this work, they had identified protein translation machinery as components of rosettes and its implication in the degradation activity and/or the formation of rosettes and linear invadosomes.

      The present study extends the presence of protein translation machinery to other types of invadosomes and the implication of protein translation in invadosome activity and/or formation. It also confirms the presence of ER in the center of rosettes. It suggests that ER-associated translation is required for invadosomes formation and activity. This knowledge will be of interest for the invadosome researcher community.

      My expertise is in: cellular biology, invadopodia, ECM degradation, cancer. I do not have sufficient expertise to evaluate the accuracy of the analysis of mass spectrometry data and the quantification of videomicroscopy experiments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      To invade the surrounding extracellular matrix (ECM), cells organize actin-rich cellular membrane structures capable of ECM degradation, called invadosomes. Depending on the composition and organization of the ECM, cells organize their invadosomes differently. The authors aimed to identify specific and common components of different types of invadosomes: rosettes formed by NIH3T3-Src cells seeded on gelatin, dots formed by A431 cells seeded on gelatin, and linear invadosomes formed by NIH3T3-Src and A431 cells when seeded on fibrillar collagen I. For this, they generated cells stably expressing GFP-Tks5, a ubiquitous constituent of invadosomes, and determined its interactome. They identified 88 common proteins, among which the protein translation machinery was enriched. Whereas general protein inhibition impaired only rosette formation and impaired every type of invadosome-associated degradation, EIF4B inhibition inhibited the formation of every type of invadosomes. They then analyzed the impact of the ER on invadosome formation and degradation activity. First, they documented the presence of the ER in the center of the NIH3T3-Src rosettes and correlated ER presence with rosette initiation and persistence. They then demonstrated that chemical inhibition of Sec61 translocon decreased formation of invadosomes in general.

      Major comments:

      1- The authors use cells overexpressing GFP-Tks5 for their analysis of Tks5 interactome in the different invadosomes (Fig. 2). The impact of GFP-Tks5 overexpression on invadosome formation and degradation activity should be mentioned.

      2- Concerning the analysis of the mass spectrometry (MS) data, clarifications would be appreciated:

      a. The authors first "determined the specific molecular signature associated with each invadosome organization" (p.4). As I understand it, the proteins in each of these signatures correspond to proteins identified only in a particular type of invadosomes, not in the others. Could the authors indicate the percentage of the total proteins identified for each type of invadosomes that corresponds to the specific molecular signature?

      b. The GSEA pathways related to each of the specific molecular signature were then analyzed and the authors "commonly identified an enrichment in mitochondrial, ER and Golgi proteins" (page 4) (Supp Fig 1c,e,g). Could the authors provide numbers/percentage/statistics? It is not clear to me whether the biological processes (Supp Fig 1b,d,f) are derived from the analysis of the specific molecular signature or of the total proteins identified for each type of invadosomes. Could the authors clarify this point?

      c. The authors also identified "translation proteins" enriched in the specific molecular signature of each type of invadosomes (p.4). They commented on this category, indicating that each type of invadosome contains a specific set of translation-related proteins. This is true, but according to my analysis of the provided tables, the same applies to the other categories as well. Could the authors comment this point?

      d. Would similar categories of proteins (translation, ER, Golgi, mitochondrial) appear as enriched if the Tks5 interactome was analyzed as a whole for each type of invadosomes? (the authors may disregard this comment if comment a. is inaccurate)

      e. The authors identified that "cell adhesion proteins" are specifically enriched in linear invadosomes (page 4) (Supp Fig 1f). This conclusion appears to be based on the analysis of NIH3T3-Src and A431 cells. Could the authors provide more details on how this analysis was performed? Specifically, was the analysis conducted on a mixture of the specific signatures of each of the 2 cell models, or on their shared proteins? Additionally, is this category still enriched if each linear invadosome model is analyzed separately?

      f. The authors identified 88 proteins common to all types of invadosomes (Fig. 2b) and classified them as validated or not in invadosomes. Could the authors give details on the criteria used for this classification? References for the already validated proteins should also be provided. RTN4 has been described as partially localized at invadopodia formed by MDA-MB-231 cells in Thuault et al., yet the authors classified it as not validated in invadosomes.

      g. Page 7, "In addition to translation proteins, the MS analysis highlighted the presence of ER-related proteins such as RTN4, LRRC59 or RRBP1 in all invadosomes linked with Tks5 (Figure 2c)". Is the "ER proteins" category enriched among the 88 common proteins?

      h. The comparative analysis of the TKS5 interactome from NIH3T3-Src-GFP-TKS5 on gelatin (this study) with the proteome of NIH3T3-Src rosettes from Ezzoukhry et al. (Fig 5a and Supp Table 2) should be included in the analysis of the MS data obtained in this study (Fig 2), rather than in the paragraph "Recruitment of ER into invadosome rosettes". Are "ER proteins" enriched?

      3- Was the localization of the newly identified Tks5 partners, such as RPS6 and EIF4B, but also MAP4 and CD44, to invadosomes analyzed in cells expressing endogenous levels of Tks5? If not, this should be addressed to rule out the possibility that their localization in invadosomes is linked to Tks5 overexpression. Through the figures, it is important to indicate whether cells overexpressing or not Tks5 were used.

      4- EIF4B depletion inhibits ECM degradation (Fig 4e-f). The authors should address the impact of EIF4B depletion on invadosome formation. In other words, does EIF4B depletion corroborate the results obtained with CHX treatment, where only rosette formation is inhibited (Fig. 4a and Supp Fig. 3d).

      5- The authors treated NIH3T3-Src-KDEL-GFP and LifeAct-Ruby cells with CHX and conclude that "translation inhibition led to the collapse of the rosette structure (Fig 6a, Video 4)" (page 8): could extra time points be added before T300 to appreciate the collapse of actin before the retraction of ER from the center of the rosette. No video 4 is provided. A video 5 is provided but does not correspond to a rosette collapse. The lifetime/dissociation rate of rosettes with and without CHX treatment should be determined.

      6- Sec61 translocon inhibition by the chemical inhibitor ES1 decreases formation of dots by A431 and rosettes and linear invadosomes by NIH3T3-Src (Fig. 6b). Sec61 siRNA should be analyzed. Does Sec61 localize at invadosomes?

      Minor comments:

      1- The data of Figure 1 is not totally new, at least plasticity of NIH3T3-Src invadosomes has already been described in Juin A., MBoC, 2012. References to original work should be mentioned.

      2- Page 4 "We realized immunoprecipitation against GFP in both cell lines on plastic and type I collagen conditions": the authors should show/mention that on plastic, cells behave has on gelatin coating.

      3- The authors compared their MS data to previously published Tks5 interactomes (page 4) (Supp Fig 2a). A study from Zagryakhskaya-Masson et al (PMID: 32673397) identified Tks5 interactome of MDA-MB-231 cells generating linear invadosomes. Could the authors comment this study?

      4- The comparison of translation proteins found in this study with the ones found in other studies (Supp. Fig. 3 a) should be combined with the paragraph commenting the 88 common proteins (Fig. 2c-d).

      5- The table Supp Fig 2c listing the proteins present in each of the functional categories enriched among the 88 common Tks5 partners should be included as main figure or a color code representing the different biological processes should be included in Fig 2c.

      6- The SUnSET assay is not correctly untitled and described in the Material and Methods. Indeed, the paragraph refering to it is entitled "Inhibition of translation machinery present in invadosomes" and is a mixture of immunofluorescence and SUnSET protocols.

      7- Figure 4, the decrease in ECM degradation of A431 (GFP-Tks5) cells seeded on gelatin by CHX is not statistically different. The affirmation that "CHX treatment limited degradation activity by A431 and NIH3T3-Src cells on gelatin and collagen matrices" (page 6) should be modulated.

      8- Page 8, "These results therefore confirm the presence but also the involvement of the ER in the rosette formation and maintenance over time". At this point in the study, there is a correlation between the presence of the ER and rosette persistence but no direct evidence of ER involvement is provided. The authors should moderate their conclusion.

      9- Fig 5d: the authors should specify in the figure legend what are the red head arrows.

      10- Some references are not correct. For example p.10, "MAP4 and LAMP1 were described in podosomes": ref 23 and 26 are studies on invadopodia, not on podosomes.

      11- The authors indicate p.10, "Thanks to mass spectrometry experiments, we were able to show for the first time the presence of translation proteins in linear invadosomes". In their previous study Ezzoukry et al, they showed the localization of overexpressed Caprin1, eEF2 and eEF1A1 translation machinery components in linear invadosomes formed by NIH3T3-Src seeded on fibrillar collagen I. The authors should modulate their affirmations.

      12- Could the authors refer to figures in the Discussion.

      Significance

      This work extends their previous work, Ezzoukhry et al, in which the proteome of rosettes of NIH3T3-Src was identified after laser microdissection. In this work, they had identified protein translation machinery as components of rosettes and its implication in the degradation activity and/or the formation of rosettes and linear invadosomes.

      The present study extends the presence of protein translation machinery to other types of invadosomes and the implication of protein translation in invadosome activity and/or formation. It also confirms the presence of ER in the center of rosettes. It suggests that ER-associated translation is required for invadosomes formation and activity. This knowledge will be of interest for the invadosome researcher community.

      My expertise is in: cellular biology, invadopodia, ECM degradation, cancer. I do not have sufficient expertise to evaluate the accuracy of the analysis of mass spectrometry data and the quantification of videomicroscopy experiments.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this work, Normand and her colleagues analyze and compare the interactome of the key invadopodia component, TKS5 (overexpressed as a GFP-tagged protein), in two transformed cell models cultured on different substrates. Potential TKS5 interacting partners are identified including previously known and validated TKS5 interactors, some known to contribute to the mechanism of invadopodia formation and function. Bioinformatic (GSEA) analysis reveals a specific enrichment for proteins related to protein translation and interaction with ER-associated ribosome machinery. Evidence is presented that some of these proteins (RPS6, a component of the 40S ribosomal subunit, and translation factor, EIF4B) localize to TKS5-positive invadopodia in Src-transformed cells. Experiments based on translation inhibitor, cycloheximide, and silencing of EIF4B factor could demonstrate a link between overall protein translation and invadosome formation. Live cell imaging and microscopy analysis of fixed samples could document some proximity between the endoplasmic reticulum network and invadosome rosettes.

      Major comments:

      1- In the Results Section, the IP/proteomics-based pipeline used by Normand and colleagues to identify TKS5 partners is not clearly described and is confusing. Cut-off used to select te proteins in the different classes summarized in Table S1 should be better described. In addition, the nomenclature of the different protein subgroups used in Table S1 is confusing (see minor point#5).

      2- The effects of cycloheximide treatment or EIF4B silencing on gelatin degradation are clear and convincing. However, these are correlative evidence, and they may reflect a general implication of protein translation in the control of invadopodia function. A direct link between the observed interactions of TKS5 with the protein translation machinery and the formation and/or function of invadopodia is missing.

      3- Images showing the interrelations between the ER and the adhesive podosome rosettes are striking (Figure 5). Src-transformed cells forming invadosome rosettes when in contact with the collagen substratum change shape and produce adhesive protrusions towards the substratum. As the ER is a huge compartment that fills the entire cytoplasm, it is maybe not so surprising to observe the ER filling the protrusions and getting close to the rosettes at the tip of these membrane extensions. Again, these observations are essentially correlative and there is no prove of some direct contact between some ER regions and the invadosomes.

      4- Overall, this report is lacking a clear hypothesis or model of what could be the consequence of the interaction of TKS5 and the translation machinery on the formation and/or the activity of the invadosomes in transformed cells.

      Minor comments:

      1- Discussion Section (page 2). The statement that TKS4 is involved in ECM degradation in podosomes only and not in invadopodia is not correct. TKS4 knock down has been shown to interfere with ECM degradation in Human DLD1 colon cancer cells (Gianni et al. SCIENCESIGNALING Vol 2 Issue 88, 2009) and in in mouse and human melanoma cell lines (Iizuka et al. Oncotarget, Vol. 7, 2016). In addition, an unphosphorylable mutant form of Tks4 blocked invadopodia formation and ECM degradation in Src-transformed DLD1 cells (Gianni et al. Molecular Biology of the Cell Vol. 21, 4287- 4298, 2010). We (this reviewer's team) reported that TKS4 was associated with cortactin-positive invadopodia in MDA-MB-231 and Hs578T triple-negative breast cancer cell lines (Zagryazhskaya-Masson et al. J. Cell Biol. 219, 2020).

      2- Discussion Section (page 3). A431 is wrongly referred as to a melanoma cell line; it is a human epidermoid carcinoma cell line.

      3- Results Section (page 4 & 5). The authors compare the proteins they identified as potential TKS5 partners to previously published data by Stilly et al. (based on TKS5 IP like in the present study) and Thuault et al. (TKS5 bioIB). Additionally, authors should mention and discuss previously published data based on TKS5 coIP experiment and Mass Spec analysis similar to the present study, identifying potential TKS5 partners; some of which were similarly found in the present study including proteins involved in translation and ribosome function although these were not the focus of this work (several 40S and 60S ribosomal proteins, see Zagryazhskaya-Masson et al. J. Cell Biol. 219, 2020).

      4- Figure 1b. Matrix degradation is not visible in association with the invadopodia in selected high magnification images in Figure 1a and 1b.

      5- Supplemental table 1. The names of the different lists of proteins in the summary table is not clear and is rather confusing.

      6- Supp Figure 1. Please define what is the sample named '' (Delta).

      7- Results Section (page 5). 'These experiments confirm the correct co-localization between Tks5 and the proteins identified in Tks5 interactome by mass spectrometry analysis.' This statement is too general; in fact, data validate only colocalization between TKS5 and some identified partners, namely CD44 and MAP4.

      8- Figure 2e and Figure 3. It would have been nice to show the colocalization of selected proteins and TKS5 in association with collagen fibers to validate that enrichment occurs at matrix/cell contact sites and corresponds to bona fide invadopodia.

      9- Figure 3c (high mag insets). TKS5 and EIF4b do not seem particularly enriched in invadopodia rosettes as compared to the rest of the cytoplasm.

      10- Figure 4c-f. Treatments (i.e. CHX, siEIF4b) affect gelatin degradation. It would be interesting to assess the capacity of cells to form invadopodia under these conditions.

      Significance

      This study confirms and adds to a previously published report by this research group based on invadosome laser capture microdissection and proteomics revealing that invadosomes contain specific components of the translational machinery, and that protein translation activity is required to maintain invadosome structure and activity (Ezzoukhry et al. Nat Commun 2018). It also adds to a recent study that established a crucial role for ribosome biogenesis in promoting cell invasion in the C. elegans anchor cell invasion model (Development. 2023).

      The experimentation presented in this paper is of good quality and convincingly support the authors conclusions of a link between the ER-associated translation machinery and invadosome function in transformed cells. Overall, although this study adds to the emerging idea of an evolutionary-conserved translational control of cell invasion through the extracellular matrix it is mostly correlative and lacking a direct prove that the interaction of TKS5 with components of the translation machinery has a direct contribution to invadopodia function.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Invadosomes are dynamic, actin-based structures that enable cells to interact with and remodel the extracellular matrix, playing a crucial role in tumor cell invasion and metastasis. Prior studies by the authors and other groups have established the formation, activation, and appearance of invadosomes. This study demonstrates the following:

      1. Key elements of the translation machinery and endoplasmic reticulum (ER) proteins are constituents of the invadosome structure.

      2. Specific proteins are associated with distinct invadosome structures. The researchers utilized two cellular models (NIH3T3-Src and A431 melanoma cell line) and Tks5, a specific invadosome marker, for immunoprecipitation and mass spectrometry, validating the results through fluorescent images, electron microscopy, and time-lapse live imaging.

      Major Comments:

      • The manuscript is well-written, with a clear and detailed experimental workflow. Compared to their previous seminal work that first demonstrated invadosomes concentrate mRNA and exhibit translational activity using NIH3T3-Src cells, this study adds details about the specific enrichment of translation proteins for each type of invadosome and the presence of ribosomal and ER proteins. However, the experiments do not further enhance our understanding of the intricate mechanisms linking invadosome structures, function, and translation factors.

      • Further experiments are needed to better demonstrate the hypothesis of active translation within these structures, including the use of additional cellular models. The authors should also investigate the effects of Tks5 silencing on ER-associated translational machinery.

      • How do the authors propose Tks5 is linked to these proteins? Directly or indirectly? Focusing on specific proteins might provide an opportunity to study the molecular mechanisms in greater depth.

      • They used chemical inhibitors and siRNA approaches to assess the role of specific players, such as EIF4B, in the proteolytic activity of invadosomes, which can be considered proof of concept. Additional experiments aligning the results with the involved pathways would add molecular details and enhance the manuscript's significance. Resolving these issues is crucial for the manuscript to meet the publication standards for contributing novel and impactful insights to the field.

      Minor Comments:

      • A more detailed discussion of the implications of their findings within the broader context of cancer cell signaling and the potential impact on related cancer research areas would further advance our understanding in this area.

      Significance

      General Assessment:

      This study offers novel insights into a new function of the invadosome-specific player Tks5 as a molecular crossroad between ER-related translation proteins and invadosomes. The authors suggest that Tks5 could act as a scaffold, supporting the rapid clustering of translation-related proteins during invadosome formation or proteolytic activity. However, a major limitation is the lack of mechanistic exploration. The results do not elucidate how Tks5 mediates the recruitment of these proteins or the specific molecular mechanisms involved.

      Advances:

      The study extends knowledge in the field by confirming the presence of specific markers linked to different invadosome structures and demonstrating the Tks5 interactome's association with translation machinery.

      Audience:

      This study will primarily interest specialists working on invadosomes and, secondarily, those interested in cancer cell signaling, invasion, and metastasis.

      Field of Expertise:

      Invadosome and related signaling pathways in cancer.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Below is a point-by-point response to reviewers comments. We appreciate the reviewers' thoughtful consideration of the manuscript and __suggestions

      Reviewer #1

      Evidence, reproducibility and clarity

      In this study, Parkinson et al. investigated lung extracellular matrix using imaging mass cytometry (IMC) in mouse models. Overall, the paper is well-written, and the data are clear, although major points outlined below need to be addressed.

      In its current form, the paper appears more like a methods-focused study since, to my understanding, no new biological responses are described. The methods employed are very interesting, particularly the extracellular matrix analysis.

      As the reviewer points out a large part of this manuscript is the development of a novel methodology for analyzing the spatial ECM changes in a model of allergic airway inflammation. However, there are several novel responses described in the manuscript. Firstly, differing spatial organisation of immune cells across different mouse strains has not been shown before, particularly in a model of chronic allergic pathology that shares features of severe steroid-resistant asthma in people. Secondly, we show that specific macrophage-fibroblast interactions are occurring in the subepithelial region during DRA-induced allergic airway inflammation. Finally, we integrate all these established and novel findings with detailed spatial analysis of the cellular ECM environment, something which is sorely needed in the field.

      However, the scope of the study is quite limited, as all the experiments were performed with mouse samples, which are relatively easy to work with, and the cell organisation is simple compared to humans.

      Whilst we appreciate that the dataset in this study is limited, imaging mass cytometry studies, especially when optimizing reagents, are costly, time consuming, and have limited throughput, not to mention the time required to develop new computational tools for data analysis. Investigating cell-matrix changes in mouse data is vitally important for understanding the mechanistic role of pathways and interactions during disease processes. Whilst we have not provided human datasets in this study, staining, data acquisition and analysis has been performed on FFPE samples, making our pipelines applicable to archival tissue banks. Regardless, we are currently preparing a publication showing the applicability of this technique to human samples. Many ECM components are well conserved between humans and mice and the cellular structure and architecture of the lung shares a lot of similarities. Many papers (PMID: 39437149, 38758780, 38581685, and 38142637) have used this imaging technology in the analysis of human cancer, which shows an even more complicated and dense cellular organisation.

      The authors do not discuss how this analysis pipeline could be applied to human samples. Furthermore, the entire paper relies on imaging mass cytometry, and additional techniques could have been used to confirm some of the observations, especially given the availability of mouse samples.

      As mentioned above, we have taken steps to show that this technology is applicable to humans, though this is outside the scope of this already lengthy manuscript. Additionally, Steinbock, the main analysis pipeline, is well published in human datasets (PMID: 38758780, 39905080, 39759522, and 39761010) and the homology between ECM components is strong between mouse and human. The technology itself is completely species agnostic, so there is no reason to think that there would be issues when applying to humans, other than some differences in the marker expression of certain populations, which is well characterised in many cases.

      The reviewer’s comment regarding the use of additional techniques is valid. Firstly, these murine lung pathology samples are derived from the same mouse experiments used in our previous publication (PMID: 33587776), where we have analysed histology, immune mediators and cells using a variety of techniques including flow cytometry and ELISA. We will ensure this point is made clearer in the manuscript. In addition, for revision we plan to compliment IMC data presented with fluorescent immuno-staining to characterize cell populations in greater resolution and also using 3D precision cut lung slices to better characterize and visualize cell populations of interest in greater depth, directly addressing the reviewer’s concerns.

      The introduction mentions the DRA model without providing an explanation of what it involves. Non-specialist readers may not be familiar with this abbreviation, and further clarification should be provided.

      As the DRA model has been characterized previously, we provided references in the text in order to save space. However, we agree with the reviewer and will provide this information up front in the introduction to make the manuscript more approachable for a non-specialist.

      In the methods section, it is not mentioned whether the lungs were inflated before tissue collection, which is crucial for preserving normal cellular organization. The authors should clarify whether this was performed.

      Lungs were inflated prior to tissue collection. We agree that this is important information to include in the methods and we will update the manuscript to reflect this.

      Figure 1 provides a brief summary of the methods employed in the study but could be enriched with additional information. In its current state, it does not provide meaningful insights beyond what is described in the methods section. It would be helpful if the authors clarified whether the mice used were adults and whether both male and female animals were included.

      We agree with the reviewer. The idea behind this figure was to have an approachable introduction to the manuscript. However, in line with the reviewer’s previous comments about focusing more on the biology we will move this to supplementary to keep the importance focused on the biological results. Mouse age and gender were included in the methods of the paper, aligning to the ARRIVE guidelines for reporting animal research. We will additionally clarify that these are adult mice

      Additionally, they could present examples of the cell segmentation approach with zoomed-in images at the cellular level to illustrate the analysis.

      This is a great idea and appreciate the reviewer’s suggestion. We will provide maps (with zoomed inserts) of the cell segmentation and cell classification across representative ROIs to show not only the segmentation but to provide an overview of how the cell types localise across the lung. This addition will also highlight the caveat of IMC around image resolution of 1μm2 which limits the sensitivity of cell segmentation. We will discuss such limitations of the technique in general in the manuscript in response to this and later reviewer comments.

      The first set of data in Figure 2 suggests that C57Bl/6 mice did not respond to allergen treatment, as shown by the non-significant increase in cell numbers. The authors should provide evidence that their model induced inflammation through alternative methods, such as assessing eosinophil counts or pathology.

      We know that these exact animals are allergic as their immunological responses were characterized in a previous publication (PMID: 33587776) demonstrating eosinophil counts and cytokine responses measured by flow cytometry. However, in light of the reviewer’s comment, we will add histological images of the lung to this current manuscript. Such data, together with enhanced expression of RELMα and Ym2 from airway epithelial cells (Sup Fig 6) and the shift from ATI to ATII cells in both C57BL/6 and BALB/c mice after DRA treatment (Fig 5 g) will provide thorough evidence that the DRA model induces allergic airway inflammation and pathology in both mouse strains.

      The UMAP representation indicates significant overlap between cell clusters, which raises concerns about the accuracy of cell segmentation. For example, the heatmap in Supplementary Figure 1 shows endothelial cells expressing markers such as VWF, aSMA, Vimentin, and PDGFRα, suggesting that the cell cluster may contain a mixture of endothelial cells, vascular smooth muscle cells, and fibroblasts.

      UMAP reductions of IMC do not separate as clearly as those from single cell RNAseq or flow cytometry. This is because the staining intensity from IMC is much lower. Rather than being on a log scale, as for single cell or flow cytometry, the values are much closer to linear. Additionally, due to the limitations in IMC resolution and the fact that we did not have distinct membrane markers in our panel, cell mask generation is often non-optimal. This is particularly evident in regions where cells are in close proximity and where the limitations of, an effectively, two-dimensional 5-micron thick tissue section mean that there can be overlap between one cell and another. Whilst we acknowledge that some populations will be a mix of cell types we are limited by the number of markers we can use in IMC, as well as the limitations mentioned above. We have accounted for this by using methodologies to identify and focus on tissue regions (lisaClust) and correlate changes to differences in these regions rather than single cells per se.

      Examples of segmented cells should be shown to validate this approach.

      As per the reviewers comment above, we will provide maps (with zoomed inserts) of the cell segmentation and cell classification across representative ROIs to show not only the segmentation but to provide an overview of how the cell types localised across the lung.

      It is unclear what Figure 2e represents. If it is simply to show that certain clusters can be grouped together, such as AEC, AT1, and AT2 as epithelial cells, this could be conveyed in a simpler way.

      We apologise that the reviewer found Figure 2e confusing. The aim of this figure was to provide a simple diagram to highlight how different classifications of cell types aligned. This was required because there were variations in the specificity of some clusters and to address specific questions it made more sense to analyse cells at a broader level. i.e. merging resting and activated ATI/II cells or grouping specific immune cell clusters into larger groups. We did consider a table, but we did not feel this was a “simpler” way to do it. As it is simply for reference, we will move Figure 2e to supplemental.

      The analysis of extracellular matrix components presented in Figure 3 provides a novel method for studying these acellular structures, which is a challenge in the field. The authors should be commended for their efforts in this area.

      We thank the reviewer for their comment here. We agree that this is a vital area that needs to be addressed as the immunomatrix becomes ever more important in understanding disease pathogenesis. We developed this novel method to begin to understand key spatial interactions between cells and ECM molecules, something missing from the majority of high-dimensional imaging datasets.

      However, the parameters investigated in Figures 4-6 do not report any novel findings. While IMC appears to work effectively to analyse these parameters simultaneously, the induction of immune foci and changes in tissue organisation following allergen challenges are already well-documented in both mouse models and human samples.

      We disagree with the reviewer on this point. Figure 4 shows that immune cell infiltration in the adventitial cuff is different between BALB/c and C57BL/6 mice. This is a new discovery and provides nuance to our previously published data (PMID: 33587776), which showed that in the bronchoalveolar lavage from these same mice there were no differences in immune cell populations at these chronic time points. Therefore, analysis of lavage cells or lung histology in isolation does not provide a full picture of allergic immune responses.

      Figure 5 shows neutrophils localised with alveolar macrophages in the alveolar parenchyma in this chronic DRA model completely distinct from the spatial advential cuff region occupied by other CD11b+ cells. In addition, we show that we can identify perturbations in the alveolar parenchyma by IMC and these correlate with known differences in allergy and asthma such as alterations in ATI/ATII balance, which has also not been shown in this model.

      Figure 6 demonstrates that we can identify a tissue region termed “subepithelial cells” which is the site of where remodelling events are known to occur in asthma and allergic pathology. This ECM-rich region is strongly associated with fibroblasts and immune cells which leads in to figure 7 showing that these cells are interacting.

      In addition to all of this the main focus of this manuscript is to link these analysis parameters to changes in the ECM environment and we have included this in each of these figures showing how these correlates with allergic changes and how they may be important in understanding these processes. In response to this reviewer’s point, we will highlight and make these novel findings clearer within the text of the manuscript.

      In Figure 5, the authors show a decrease in neutrophil numbers in challenged mice. This is unexpected, as this model is widely known to induce strong neutrophil recruitment. The authors should clarify this finding and investigate whether neutrophil chemoattractants are increased in these samples.

      This is a keen observation by the reviewer. We were interested in this finding however as it was not the focus of the paper we did not investigate further. In our previous publication we show that there are increased neutrophil numbers in the BAL of these animals (PMID: 33587776) and as mentioned above, we show in figure 5 that neutrophils are found mainly in the alveolar parenchyma. This perhaps means that they are more sensitive to being washed out in the BAL and perhaps there are differences in their “stickiness” in BALB/c and C57BL/6 animals or during DRA-induced allergy. This is in contrast to eosinophils (likely within our CD11b+ cells) which are found in the adventitial cuff, a region is not likely to be captured by BAL wash, though we know that these cells are actively present in the BAL. Overall, though this is an interesting result it was not the focus of this already lengthy paper and is best investigated in another project.

      When analysing epithelial cells, the authors separate AT1 and AT2 cells based on podoplanin expression. However, data in Supplementary Figure 4b suggest that both cell types express similar levels of podoplanin. The authors do not provide statistical validation for the claim that AT1 cells express higher levels. Additionally, as podoplanin is expressed by various cell types, such as lymphatic endothelial cells, additional markers are required to confirm the identity of AT1 cells.

      Again, the reviewer is entirely correct here. The cells we have identified are labelled as ATI as a best guess and correlate with ATII cells based on anatomical location – though this is likely shared by some of the populations mentioned by the reviewer. The majority of cells in this population are likely ATIs, as they are localized in the alveolar parenchyma and are cells that are not SPC+, though we cannot say for sure without more markers and we were already at the limit of the number of markers that we can run with one IMC panel. It is likely that there are contaminating lymphatic endothelial cells in this cluster. However, these will be a relatively minor population and do not change the main findings presented in the paper. To address this and other comments by the reviewer we plan to include a limitation section to the discussion that highlights exactly these points for future studies.

      The last set of data in Figure 7 is interesting and shows that immune cells interact with a population of S100a4 fibroblasts. This finding could be expanded further, as CD11b and Ly6C are expressed by a variety of immune cells. The authors should include additional staining to identify the specific cell types involved, such as monocytes, eosinophils, or airway macrophages. Furthermore, the authors should speculate on why these fibroblast regions attract immune cells. Are these regions enriched in chemokines or other factors?

      We thank them for this suggestion. To answer this point, we will conduct immunofluorescent imaging to provide further characterization of these cells in greater depth, as we agree, this will be important to consider. To best visualize cells and their interactions in this adventitial region, we plan to use 3D precision cut lung slices from PBS versus DRA mice in combination with confocal imaging. This method will allow us to utilize antibodies and markers that do not work in the FFPE sections such as SiglecF (eosinophils), CD11c (DCs, macrophages), CD64 and CD169 (macrophages).

      The discussion is engaging but focuses more on methodological aspects than new biological insights. Without mechanistic links, it is challenging to draw meaningful biological conclusions.

      We agree that the discussion could be used to reinforce the importance of the biological discoveries we have made (listed previously) in the discussion. However, we also believe that it is important to discuss the methodology as this is a novel way to explore ECM-cell interactions in the tissue as highlighted by the reviewer. There are many limitations to using IMC and similar techniques that should be highlighted for future studies so that we can develop better ways of quantifying the ECM environment during disease.

      Significance

      The study of Parkinson et al. provides interesting methodological insights into the use of imaging mass cytometry (IMC) to analyse lung architecture following inflammation. The application of multiplex antibody staining will leverage important information related to how tissues are adapting to chronic immune response. Here the authors rely entirely on mouse models for their studies and compared two lines of WT animals and the same allergen model. This limits the scope of the study, additional timepoints, sex or age would have improved the manuscript.

      Whilst we appreciate the reviewers points here, we would like to highlight the time involved in generating such datasets, with a lot of careful optimization and experimental design aspects going into each study. Whilst we have also performed staining and analysis using our described method in human FFPE tissue, we are currently looking to further develop analysis tools to assess ECM-cell interactions. Additionally, data acquisition using IMC takes considerable time, and hence it is not feasible run and analysis the number of samples required to address some of the questions proposed by the reviewer.

      We believe our manuscript provides novel methodology to analyse ECM environments within spatial datasets, something that no other spatial datasets have explored to date. Furthermore, we provide numerous new biological findings in relation to how cells are organized within the tissue during allergic pathology and propose immune-fibroblast interactions that may be key for driving ECM remodelling in the lung. Integrating these analyses will be key for further understanding the role of the ECM in disease pathogenesis.

      The applicability of this analysis pipeline to human tissue samples is not discussed, which would significantly enhance the impact of the study. Additionally, complementary techniques, such as flow cytometry or immunohistochemistry, could be used to validate the findings and improve reproducibility. A specialised audience of immunology researchers would be interested by the image analysis approach.

      As mentioned above, this analysis pipeline is easily applied to human samples or any other species as ECM molecule organization is largely conserved across species. Moreover, we have already explored this using human samples. However, adding human data to this manuscript is beyond the scope of this manuscript which was aiming to build one of the first methodologies for incorporating the ECM into this kind spatial analysis from the start in order to make biological discoveries. Regardless, we will add a discussion point on utilizing these pipelines to other species within the discussion of the manuscript.

      Flow cytometry has been published on this model and the exact samples used within this study as mentioned previously (PMID: 33587776), validating some of these findings – we will make this point more clearly in the manuscript. We do appreciate that it would be good to further expand on some findings presented in the manuscript. As such we will expand our immunostaining (as mentioned above) to give more detail on the infiltrating immune cell populations and their interactions with fibroblasts.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required): __

      Summary Parkinson and colleagues provided a highly intriguing manuscript on spatial resolution of cell-ECM interaction in mouse models of allergic airway inflammation. They used IMH to analyse two common mouse strains for allergic airway inflammation with a human relevant allergen mix. The study implements a novel technique to better segment tissue stainings (DeepThresh) and modifies existing tools to assess non-cellular seqmentation, ECM or fibrous structures that is. The study identifies region-specific ECM and confirms cellular proximity with canonical cell markers. Furthermore, clear ECM and cellular differences between the two mouse strains are found. The study concludes that this IMH approach is superior to existing methods as it provides a high spatial resolution of ECM protein - cell interaction.

      Major: ECM Isoform Annotation - The manuscript lacks precise annotation of ECM isoforms, particularly for Collagen I, IV, and VI. This impacts the accuracy of reported associations between ECM environments and cellular interactions.

      We thank the reviewer for this excellent comment and pointing this out. We agree that this is very important and will add this data to the manuscript. All information was included by reference of the antibody clones. However, it is an important point to make and we will account for this during interpretation of the results.

      Spatial Annotation Consistency- The manuscript inconsistently defines and annotates ECM environments (e.g., adventitial collagen, subepithelial & vascular ECM), leading to confusion in spatial correlation analyses.

      We are unsure what the reviewer is exactly referring to here. We have maintained a consistent nomenclature for these annotations throughout the manuscript. If the reviewer has an issue with the names we have provided for the regions; names were chosen these to be more informative than just naming them “region 1, 2, 3…”. Names in the manuscript were based on taking the lung tissue region and the prominent ECM molecules present. Whilst some level of detail will naturally be lost, we considered this the best way to keep data clear and consistent throughout the manuscript. For example, adventitial collagen describes the region predominantly around the adventitial cuff (fig 3c and d; shown in dark blue) that has high levels of Collagen I, III and VI. Yes, HA, laminin and fibronectin are also expressed, but at much lower levels. Regardless, all the information is present within the figures with readers to observe and make their own interpretations. We are happy to consider alternative names if the reviewer were to provide some guidance on what they thought was more appropriate.

      Lack of Supplemental Data- Activated cell types and regions are not clearly defined, and no supplemental data is provided to verify classifications. DeepThresh Validation - The method for removing staining artifacts via DeepThresh lacks clear validation. Complexity - Overlapping marker definitions (e.g., CD11b+ cells and infiltrating cells) need clarification for accurate immune cell characterization.

      We provide heatmaps in the supplementary data which shows the exact marker expression pattern for all of the clusters we define (Sup Fig 1a). Additionally, we provide graphs showing the cellular contribution and spatial distribution of all the regions we defined with lisaClust (Fig 2h & I; Sup Fig 1d). Most activated cells are a feature of a specific clustered cell type only being present in either PBS or DRA treated animals. However, the features which have led to separation these cell types are available in the heatmaps as mentioned (Sup Fig 1a).

      We believe the reviewer may be confused about the purpose of DeepThresh. This algorithm is not for removing staining artifacts. Instead it uses expert annotation of a small training set to generate a method of accurately thresholding images for positive staining in relatively small ROIs which may have diverse structural features with different staining properties. We did not have space in the manuscript to go into this in more detail. However, we appreciate this may not be as clear as needed for readers, and hence, will provide supplementary data showing some example thresholding alongside the original staining in a new edit of the manuscript.

      CD11b+ and infiltrating cells are not an overlapping population, they were separately clustered by the algorithm, but we take the reviewers point that further characterisation could be done. As mentioned in comments from reviewer 1, there is a limitation in the number of markers we can use in IMC, especially with the number of ECM markers we included. Additionally, there are limitations in the appropriate antibodies (carrier-free) that work in FFPE mouse tissue with the antigen retrieval that we use for good, reliable staining of ECM components. As such, we will perform additional immunofluorescence staining in 3D precision cut lung slices to better characterize the CD11b+ population to address comments by both reviewers.

      Minor: Terminology Inconsistency- The manuscript uses inconsistent terminology for ECM components and anatomical regions (e.g., adventitial collagen, immune foci, inflammatory zone).

      This point was directly addressed above in “Major” points and appears to be a duplicate comment.

      ROI Mask Coverage - Statistical insignificance in C57BL/6 ROI mask coverage is not addressed.

      The increase in C57BL/6 mice upon DRA treatment in panel A is not “significant” in the modern sense of the word. However, we would argue that stating it is “not significant” would also be a mistake. We prefer to use p values as an inferential measure of significance in combination with measures such as effect size and variance (PMID: 8465801). We find this more useful considering the vast number of mistakes made when interpreting p values (PMID: 18582619). The importance of not purely relying on p values for clinical research has been reviewed recently here (PMID: 39909800).

      Whilst we appreciate the reviewer’s requirement for significance, we do not want to make sweeping statements based off of a p value of 0.07, especially in only one experiment. Many papers have been published on the pitfalls of stringently adhering to p

      Spelling Error - "Immunte foci" in Figure 4h.

      We thank the reviewer for pointing this out and will correct this.

      Figure 6g Correlation Issue- The matrix environment correlation plot does not align with expected cell-ECM interactions.

      We find it hard to comment on this without more detail of the cell-ECM interactions that the reviewer believes should be occurring. We analysed this in an unbiased way, so we have not forced interactions to appear based on our preconceptions. The regions being analysed in Fig 6g are the resting (PBS) and activated (DRA) airways that contain expected cell populations of airway epithelial cells and a low level of fibroblasts, likely from just under the airway epithelial cells. These cell populations align with AEC-associated matrix, laminin and hyaluronan, and adventitial collagen regions. Perhaps the reviewer is questioning why the airways are associated with adventitial collagens? The reason behind this, is due to adventitial cuff residing adjacent to a proportion of all airways, and hence any ECM associated with the adventitial cuff will likely be included in an airway region. However, as mentioned previously there are limitations to this analysis and we are very likely missing finer details due to issues such as resolution which we have discussed within the point-by-point on numerous occasions, and something we will directly address by adding a limitations section to the discussion of the revised manuscript.

      Color Issues in Figures - ColI and ColIII have the same color in Fig. 3a, making interpretation difficult.

      We agree with the reviewer on this point. The issue we had here was that Col-I and Col-III strongly overlap in these images, whilst one was green and one yellow the effect was to make them look the same in the final images. We will remake these images with clearer colours that better illustrate differences in Col-I and Col-III expression.

      Patch Annotation (Fig. 4i) - The method for defining immune cell patches is unclear.

      Patches refers to an approach that is used to identify interconnected groups of similar cell types and is a method that is based off published data (PMID: 35363540). We will add further method details that explains this process to the revised manuscript.

      Detailed review: Methods: Animal model is suitable for differential analysis of various mouse strain responses to allergic airway inflammation.

      We thank the reviewer for this comment and also agree that the mouse models presented in the manuscript can provide insightful and mechanistic data for investigating human disease.

      Deepthresh matrix thresholding: IMCDenoise is sensitive to clusters of staining artefacts (specks). Please explain how DeepThresh via manual thresholding enables staining artefacts removal/detection. Manual ground truth mapping is common however it is not clear how your approach is performing against another tool. How was manual thresholding controlled (several analysts thresholded same image)?

      As described in a previous comment this is not the function of DeepThresh. Manual annotation for training data was performed by consensus agreement of four independent researchers. In terms of performance against another tool, we are not aware of another tool which performs this function and hence cannot compare. However, we will add additional data showing the validation metrics for the pipeline to make future comparisons easier.

      Antibodies Collagen IV, stains col4a1 - please correct, as isoforms vary throughout tissue. Collagen VI, stains col6a1 - isoforms vary in lung tissue, please state correct isoform throughout the document. Heparan sulfate: Molecular weight? Collagen I - isoform not defined, please state in methods.

      Figure 3 d As a resultant of the choice of antibodies against some particular isoforms of ECM molecules associations of cells, compartments are correct yet do not comply with all isoforms. Col4a1 is a basal membrane collagen from blood vessels; the adventitial area and vascular area are high in Col4a1. Other Col4 isoforms are found more frequently in the alveolar regions (col4a5,a6) and the subepithelial membrane. It is of utmost importance to clearly label the correct isoforms throughout the document.

      This relates to the comment above made by reviewer 2. As mentioned, we agree with this key point and will provide this information from the respective antibody clones.

      However, we are unable to provide details on the molecular weight of heparan sulfate as this will vary depending on location/tissue/condition etc. The antibody recognises 10E4 epitope on HS which is found across a wide variety of tissues and species and will recognise many different sizes of HS and even porcine Heparan. Importantly it is relatively specific, not cross reacting with hyaluronan, keratan sulphate, chondroitin sulphate, or dermatan sulphate which is an issue for certain clones. Whilst the size of the HS is an interesting facet, consideration of changes in sulphation patterns would also be of interest, though these currently cannot be accurately assessed via purely immunostaining-based methodologies and would require more targeted biochemical techniques. In addition to this there are multiple nuances in 10E4 antibody binding (PMID: 15044385 and 11278655) which are interesting, but far beyond the scope of this study. Although captured in the antibody clone information, we will also ensure this is clear in the methods.

      In relation to Col4 isoforms specifically, often antibodies for the ECM are limited because of their repeating structures it is hard to generate specific antibodies. For collagen IV there many clones for Col4a1, but no specific clones for Col4a3/col4a5 etc, suitable for use in FFPE tissues and metal conjugation required for IMC. Therefore, we were very limited in what was available to detect them at all. We will bring this up in the discussion as this is an important point, not just for our data, but also for people attempting to replicate this kind of analysis.

      Figure 2i: The cell-specific marker expression is in part already confounded by region. So vasculature or resting airways show no "resting" fibroblasts as their annotation is linked to activation (indicated by S100A4 expression). Anatomic locations such as airways with remodelling are termed "activated" to explain morphological differences which is acceptable given the model chosen. However, some cell type are not given an anatomical or morpholocial "resting" nomenclature. Only during activation and through location a cell type may aquire e.g. a nomenclature such as "alveolar fibroblast". The correlation blot 2i should provide this basic information. Please explain.

      Our staining approach and analysis have only identified certain activated populations as pointed out by the reviewer. Most of the populations that we have identified as “activated” have been identified primarily only in mice administered DRA. The reason that we have not included “resting” and “activated” populations for all cell types is that these clusters were generated using a clustering algorithm based on the cellular markers used within the study. Each cluster was then simply labelled as best we could, using information from marker expression, published biological data, anatomical location, and sample identity (e.g. PBS or DRA).

      A caveat to using IMC and other similar imaging techniques is that we will miss certain “flavours” of cell populations because we simply do not have the markers, or scope to include markers, with which to identify these cells. This is partly a problem of appropriate antibody availability, but also for many populations there are no specific markers identified in the literature/databases. Single cell RNAseq has provided deep segmentation of some of these populations, but we (and others) have found that often these make poor antibody choices at the protein immunostaining level.

      We are unsure what the reviewer wants adding to plot 2i. This plot shows the cell cluster contribution to different lisaClust defined tissue regions. Hence the presence of alveolar fibroblasts in the resting and activate alveoli region. However, we will include more discussion on the limitations of markers and identification of specific cell populations in the discussion.

      Figure 2h: How do you explain subepithelia to "leak" luminally in C57BL/6 DRA animal?

      We assume the reviewer is referring to the overlap of some grey circles though/over the red airway epithelial cells in the C57BL/6 DRA panel of figure 2h. This figure represents individual cells as circles with the centroid of the circle at the centroid of the cell. Cells are rarely perfect circles and, in this case, it has made it seem like the cell is coming through the airway epithelium, when likely it is a longer cell that sits directly under it. In addition to this, these are effectively 2-dimensional section (5um thick) that capture as small portion of the lung anatomy, hence occasionally this can result in unusual tissue structures that make no sense in the confines of a 2D section, but instead correlate with the larger 3D structure.

      How is an activated airway possible in a Balb/c PBS animal (same for inflammatory adventitia, alveoli)?

      Activated airway simply describes a region that is showing some evidence of activation markers such as RELMα and/or Ym2 etc. PBS itself, as with any other liquid administered into the lungs, will drive a very low level of inflammation, which is why it is used as a control in the animal model. Therefore, it is not surprising that we see a low number of these “activated” cells in PBS animals vice versa for their activated counterparts in DRA treated animals. This is similar for the other regions mentioned.

      How is subepithelia adjacent to immune foci and inflammatory adventitia (Balb/c DRA).

      We are somewhat confused by this question. We have termed the region “subepithelia” because it is mostly found under the airway epithelial cells. We found that this region expands during DRA treatment and covers areas close to the immune foci and inflammatory adventitia, hence they are next to each other.

      As described above, the names of these regions were chosen for simplicity and to communicate its general features. These, regions were identified by detection of nearby regions of cells with similar cellular compositions and the names we a “best fit”.

      Text for fig 3c: Here it should be mentioned that a cell is used as a proxy locator to the ECM region.

      We apologise that this was unclear for the reviewer. Rather than describing it as using the cell as a proxy locator to the ECM region we find it more accurate to think of it as we are characterizing the matrix environment of the cell i.e. what is the cell close to and what is it far away from. We will make this clearer in the results by changing the name to cellular matrix environment, rather than matrix environment.

      Again, in UMAP3b location and ECM molecule a mixed a priori which only can be achieved through proxy loction as in fig 3c or correlation matrix analysis. The UMAP shows ECM molecules in various combinations. Fig3c analysis of anatomic location from images with cell proxies would validate morpho-spatial UMAP annotation. Please make this clear in the manuscript or specify why your approach is superior in its presented format.

      We struggled to ascertain what the reviewer was referring to here and what edits they were suggesting to the revised manuscript. However, this comment seems to assume that we have used cellular location as an input to the UMAP in figure 3b, which is untrue. This UMAP (and associated clustering) shows each cell as a dot which is organised based on its distance to the different matrix components. Effectively showing us how different cells cluster based on their cellular matrix environment, with no input of cellular based markers. We are unsure what the reviewer is referring to on line 486 – as they seem to be describing exactly what figure 3c already is (a spatial map of the UMAP clusters on representative images, which shows that a cells matrix environment does seem to show patterns that align with the general lung anatomy).

      Finally, the reviewer asks us to specify why our approach is superior, but we are unclear what the alternative approach is.

      This methodology is effectively a repurposing of the traditional UMAP and clustering methodology used in many single cell techniques, but instead of applying this to cellular markers we are applying it to a cells matrix environment as quantified by the matrix distances. If the reviewer could clarify this comment we would be happy to revisit it. As mentioned in the previous comment, we will more clearly describe cellular matrix environments in the revised manuscript and this may also help with the confusion.

      Fig 3d: The Matrix Cluster names are in part not correct. Subepithelial & Vascular ECM does not correlate with Vasculature in LisaClust Regions. Also ColIV is not AEC associated, yet subepithelial.

      Respectfully, we completely disagree with the reviewer on this point. In the heatmap (Fig 3d) the Subepithelial & Vascular matrix environment correlates most strongly with the Vasculature and Subepithelial cells as shown by the stronger green-yellow colour in the corresponding cell of the heatmap.

      As mentioned previously in response to another comment by reviewer 2, there could be many reasons that we are not detecting collagen-IV in the AEC associate cell matrix environment. One likely explanation is that this is too fine for the resolution of IMC (1-micron2) or it could be that certain subchains are utilised here that are not recognized by the antibody we managed to optimize. Additionally, AEC-associated matrix environment is comprised of both mouse strains and includes higher representation from DRA treated animals. From our previous work (PMID: 33587776), we have shown that Col-IV expression around the AEC is reduced in DRA versus PBS -treated animals.

      No ECM molecule is inflammatory zone associated. Does this indicate cellular density does not allow to distinguish ECM?

      This is a great point from the reviewer and their explanation is entirely possibly. As mentioned there are huge limitations in the resolution of IMC and so we are likely missing finer matrix structures. There is a huge recruitment of cells within this environment so it could be that we cannot clearly visualise fine ECM structure through this considering we are also looking at a 5-micron thick 2D tissue section. Additionally, cells maybe degrading the ECM in order to infiltrate into the tissue. This is definitely an interesting point to examine in further detail, but would need to be done with a different methodology. We will aim to look at an ECM molecules and its distribution within the inflammatory zone using 3D precision cut lung slices and also immune-staining of tissue sections to see whether we can better resolve this in a revised manuscript.

      Also the term "adventitial collagen" is locating to LisaClust Regions Vasculature, Subepithelial Cells, Resting Airways, Infiltrating Cells, Activated Airways. Adventitial per definition of fig. 2g is around blood vessels extending to airways and around it. The adventitial regions are the ECM rich areas after the fibroblasts (as for blood vessels, PMID: 31522963). The definition used in this study therefore generates morphological overlaps between airways and their basolateral regions and blood vessels. Whilst both morphological regions have an adventitia the Matrix cluster assumes from areas to close by this terminology. As a sensitivity analysis I would suggest to reduce the perimeter around blood vessels to the same borderline as seen in airways. If composition remains similar "adventitial collagen" could be a broader term. Alternatively, if adventitia from airway and blood vessel differ these should be separate terms.

      Whilst the adventitial cuff does refer to the region immediately around a blood vessel in the lung, these structures are slightly more nuanced as blood vessels normally travel through the lung in close association with an airway. This is true all the way down to the close association with the capillaries and the alveolar spaces where gas exchange occurs. Indeed, previous publications (PMID: 30824323) have shown that these adventitial cuffs extend out from around the contiguous area around the blood vessel and associated airway and these can expand during inflammation (PMID: 24631179). This region is rich in Collagen-I and Collagen-III, as we have shown in this manuscript and previously (PMID: 33587776).

      Whilst we agree that there are likely microanatomical niches within this larger structure, our dataset lacks the resolution to study this in more detail. However, as mentioned above we can include matrix markers in our future IF staining to examine this region in more detail. The adventitial collagen environment described in this manuscript and beyond, are vital “meet and greet” spots for immune cell infiltrating into the lungs (PMID: 30824323) as well as being sites of iBALT formation (PMID: 24631179)

      We are unsure what the reviewer means by “…reduce the perimeter around blood vessels to the same borderline as seen in airways.” We have not defined a manual threshold for the border of the airways. These regions were all defined by SNN clustering and not manual segmentation. Whilst this methodology could be developed we do not believe that this dataset has the resolution to answer this question, as mentioned previously.

      Fig 4c: Balb/c and C57bl/6 labels are incorrect (see a,b)

      We thank the reviewer for noticing this incorrect labelling and will update this.

      Fig 4h: Cell type "other is highly present in immune foci and inflammatory adventitia but not further classified and not myeloid. This seems either a difficult definition for myeloid or a significant immune population wasn't stained. How is myeloid defined?

      We define myeloid broadly as CD11b+ or alveolar macs. There were certain populations that were not stained, notably T cells. We were unable to have suitable or reliable staining in FFPE tissue with CD90, TCRa/b, CD3e antibodies via IMC. The same was true for Eosinophil markers (SiglecF, Ccr3, EPO, MBP). The additional experiments we will perform for a revised manuscript (using 3D precision cut lung slices and/or IF staining) should shed further light on these cells. Additionally, as we are not limited by the processing requirements of IMC, we can use a wider range of markers.

      Fig 4l has a vast variety of marker combinations some being very specific within the staining panel, others subsummarise entire groups of cells. It would be very helpful to know if the lables are specific and exclusive or if larger clusters exist, that they then subdivide into specific groups (e.g. Infiltrating cells: any of CD11b, CD44, Ly6C vs. B-cells or CD11b+Lys6C). This graph would profit also from either using markers or cell types only. Your marker set is very distinct and limited so per definition it is either a neutrophil or a Lys6C+. Please decide, explain and provide the other graph as supplement.

      We apologise that this was not clear to the reviewer. Labels are exclusive and represent the clusters that were identified in figure 2 and are at the finest level of detail that we felt we were able to biologically infer from the data. In terms of the reviewer’s first point about infiltrating cells, these are completely separate from the other cell types mentioned. As mentioned in the previous comment line 570, we were simply unable to find working antibodies for some of the common lung populations (a common problem for FFPE sections where antigens are often masked or lost due to fixation and processing) and so are limited to general annotations for these. For the reviewer’s second example of Neutrophils vs Ly6C+ cells, neutrophils were classified by expression of Ly6G, CD11b+, and Ym1 whereas there are many other cell types that express Ly6C, including, but not limited to, dendritic cells, monocytes, eosinophils, and even some T cells.

      We believe that the graph in combination with data in Fig 1c and supplementary Fig 1a, already shows what the reviewer is asking for.

      Fig 5l and sup Fig4i: There is no graph confirming the statement that Ym1 is produced by macrophages. From the graphs in either of the two panels, The AEC are highly associated with Ym1/2 expression or the activated alveoli. Please explain ad amend.

      We assume the reviewer means Fig 5l and sup Fig 5i (as there is no figure sup Fig 4i). Whilst we did not include a graph to show that alveolar macrophages produce Ym1, we did include two references in the text and this has been widely shown in the literature for many years (PMID: 11141507 and 15148607). We are somewhat unclear on the reviewers second point. AEC (airway epithelial cells) can definitely also produce Ym1, though this can often be contentious because of issues with cross-reactivity with its highly homologous sister protein Ym2, which is also produced from airway epithelial cells under Type-2 settings. If the reviewer is referring to AEC (alveolar epithelial cells) then this is true. Activated alveoli are lisaClust regions with lots of alveolar macrophages which was the original statement we made and aligns with sup Fig 5i. Activated alveoli II have less alveolar macrophages and also have less Ym1, which would correlate though there are other cell types which can make Ym1 as well.

      Fig 6g: The correlation plots again show that the matrix environment labels are somewhat confounded. Whilst AEC associated makes perfect sense, adventitial collagen only weakly correlates, yet was part of the adventitial mapping. Cell types like AEC are expected however fibroblasts, especially in resting airways as large constituent cell populations. There are not other, myeloid or lymphoid cells associated with these airways, which under activated conditions seems rather odd. From fig6a it is appearant that the lisaClust has ascribed subepithelial regions to distal parts of the airway separated by blood vessel or parenchyma (C57BL/6 and Balb/c DRA). Also blood vessels are in part other cell types or epithelium (B6 PBS). Is the annotation here the reason for this rather confusing result? Please explain and/or amend.

      We are again somewhat confused by this comment. Adventitial collagen only weakly correlates because it is not within the airway epithelial cells, instead it is adjacent in the subepithelial region which is shown in Fig 6j. We are unsure exactly what the reviewer is referring to in terms of “adventitial mapping” but are happy to comment on this if the reviewer can clarify what they mean.

      We agree with the reviewer that it is somewhat surprising to see so many fibroblasts in the resting and activated airway regions. There is a level of ambiguity here in what lisaClust decides to include in one region vs another. However, what it does show is that there are a large population of fibroblasts around the airway, possibly correlating with peribronchial fibroblasts. We did not observe immune cells in between the airway cells or immediately underneath it. We do not believe this is odd, as from our data it appears that these cells are more likely to be found in the adventitial (including peribronchial as mentioned previously) cuff. Cell are most certainly moving into the airways as shown from the BAL in our previous publication (PMID: 33587776). However, we are unlikely to capture this process in the snapshot of our histology across a relatively small section of the airways covered in our 2D sections.

      In regards to the reviewers comment about figure 6a we agree that some of the regions between the airways and blood vessels have been characterised as subepithelia. As mentioned previously we are happy to consider alternative names but have been unable to come up with an alternative that encompasses the cells and spatial region more accurately and clearly., Regardless, the main purpose of these names is to provide simple nomenclature to follow throughout the manuscript and make these types of analyses accessible to all readers. We believe that these are accurately labelled and have provided information about the constituent cell populations that are present within them, making the data and subsequent analysis transparent for others to view and explore. Our data suggests that the adventitial cuff may fulfil multiple roles during DRA-induced inflammation, some of which are more focused on immune cell recruitment and others which may correlate more with the fibroblast rich subepithelial region.

      The reviewer is entirely correct to point out that some blood vessels were not entirely annotated. We used vWF to manually separate blood vessels from the adjacent smooth muscle layers, which were not separated by the clustering originally. Notably it appears that veins seem to not separate as well as arteries suggesting another marker (e.g. CD31) may help with this, though we were limited in what we could include as mentioned previously. As this is only a small effect, which we do not have a way to correct, and blood vessels were not the focus of this manuscript, we have left the annotation as it is with raw data included.

      __Significance __

      Strength Innovative ECM-Immune Interaction Approach- The study integrates extracellular matrix (ECM) phenotyping with immune cell spatial mapping, providing novel insights into allergic airway inflammation Multiplex Imaging Technology - The use of Imaging Mass Cytometry (IMC) allows high-resolution spatial characterization of both cellular and ECM components. Strain Analysis - The inclusion of BALB/c and C57BL/6 mice enables differentiation of strain-specific ECM and immune responses. Deep-Learning-Based ECM Quantification - DeepThresh offers an advanced computational approach for ECM analysis, enhancing accuracy in defining ECM-cell associations. Comprehensive Tissue Classification- LisaClust clustering facilitates detailed segmentation of lung microenvironments, improving understanding of localized tissue remodeling.

      Limitations ECM Isoform Inconsistencies - The study lacks precise annotation of ECM isoforms, which affects the accuracy of reported ECM-cell interactions. Ambiguous Spatial Correlations- Some ECM clusters, such as "adventitial collagen," overlap inconsistently with anatomical regions, making interpretation challenging. Unvalidated DeepThresh Method - The manuscript does not provide sufficient validation of DeepThresh's ability to remove staining artifacts. Lack of Supplemental Data- Key activated cell types and regions lack supporting data for classification.

      __Advance, gap filled __ Clearly the next step to improve organ compendia such as the lung cell atlas, spatial protein analysis is warranted. scRNA-Seq in particular for ECM molecules is challenging as these molecules are produced in small quantities or have a very slow turn-over. This study has the potential to provide novel deep learning algorithms to include not only cellular markers but consider larger panels of ECM molecules and their spatial orientation in the lung.

      __Audience __ The manuscript is interdisciplinary located between advanced image analysis with deep learning methods, fundamental lung biology and single cell analysis. The readership would entice molecular biologists, bioinformaticians and basic disease model scientists. The manuscript would appeal to clinician scientists and a broader audience if human tissue pendants could be provided validating the methods and outcomes.

      __Own Expertise __ Translational scientist in the field of chronic lung disease, highly familiar with epithelial cells, mouse models, human cohorts and next generation sequencing and imaging of live single cells.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Parkinson and colleagues provided a highly intriguing manuscript on spatial resolution of cell-ECM interaction in mouse models of allergic airway inflammation. They used IMH to analyse two common mouse strains for allergic airway inflammation with a human relevant allergen mix. The study implements a novel technique to better segment tissue stainings (DeepThresh) and modifies existing tools to assess non-cellular seqmentation, ECM or fibrous structures that is. The study identifies region-specific ECM and confirms cellular proximity with canonical cell markers. Furthermore, clear ECM and cellular differences between the two mouse strains are found. The study concludes that this IMH approach is superior to existing methods as it provides a high spatial resolution of ECM protein - cell interaction.

      Major:

      ECM Isoform Annotation - The manuscript lacks precise annotation of ECM isoforms, particularly for Collagen I, IV, and VI. This impacts the accuracy of reported associations between ECM environments and cellular interactions. Spatial Annotation Consistency- The manuscript inconsistently defines and annotates ECM environments (e.g., adventitial collagen, subepithelial & vascular ECM), leading to confusion in spatial correlation analyses. Lack of Supplemental Data- Activated cell types and regions are not clearly defined, and no supplemental data is provided to verify classifications. DeepThresh Validation - The method for removing staining artifacts via DeepThresh lacks clear validation. Complexity - Overlapping marker definitions (e.g., CD11b+ cells and infiltrating cells) need clarification for accurate immune cell characterization.

      Minor:

      Terminology Inconsistency- The manuscript uses inconsistent terminology for ECM components and anatomical regions (e.g., adventitial collagen, immune foci, inflammatory zone). ROI Mask Coverage - Statistical insignificance in C57BL/6 ROI mask coverage is not addressed. Figure 3d Labeling- Matrix cluster names do not always match tissue localization. Spelling Error - "Immunte foci" in Figure 4h. Figure 6g Correlation Issue- The matrix environment correlation plot does not align with expected cell-ECM interactions. Color Issues in Figures - ColI and ColIII have the same color in Fig. 3a, making interpretation difficult. Patch Annotation (Fig. 4i) - The method for defining immune cell patches is unclear.

      Detailed review:

      Methods: Animal model is suitable for differential analysis of various mouse strain responses to allergic airway inflammation.

      Deepthresh matrix thresholding: IMCDenoise is sensitive to clusters of staining artefacts (specks). Please explain how DeepThresh via manual thresholding enables staining artefacts removal/detection. Manual ground truth mapping is common however it is not clear how your approach is performing against another tool. How was manual thresholding controlled (several analysts thresholded same image)?

      Antibodies Collagen IV, stains col4a1 - please correct, as isoforms vary throughout tissue. Collagen VI, stains col6a1 - isoforms vary in lung tissue, please state correct isoform throughout the document. Heparan sulfate: Molecular weight? Collagen I - isoform not defined, please state in methods.

      Figure 3 d As a resultant of the choice of antibodies against some particular isoforms of ECM molecules associations of cells, compartments are correct yet do not comply with all isoforms. Col4a1 is a basal membrane collagen from blood vessels; the adventitial area and vascular area are high in Col4a1. Other Col4 isoforms are found more frequently in the alveolar regions (col4a5,a6) and the subepithelial membrane. It is of utmost importance to clearly label the correct isoforms throughout the document.

      Spelling error in figure 4 h (immunte foci)

      ROI mask coverage in C57/6 not significant

      Activated cell types/region: This definition is not specified and no supplemental data is provided to see which markers classify such areas/cells. Please provide.

      Figure 2i: The cell-specific marker expression is in part already confounded by region. So vasculature or resting airways show no "resting" fibroblasts as their annotation is linked to activation (indicated by S100A4 expression). Anatomic locations such as airways with remodelling are termed "activated" to explain morphological differences which is acceptable given the model chosen. However, some cell type are not given an anatomical or morpholocial "resting" nomenclature. Only during activation and through location a cell type may aquire e.g. a nomenclature such as "alveolar fibroblast". The correlation blot 2i should provide this basic information. Please explain.

      Figure 2h: How do you explain subepithelia to "leak" luminally in C57BL/6 DRA animal? How is an activated airway possible in a Balb/c PBS animal (same for inflammatory adventitia, alveoli)? How is subepithelia adjacent to immune foci and inflammatory adventitia (Balb/c DRA).

      Fig 3a: ColI and ColIII have same colour, this makes images not easy to understand please change. Text for fig 3c: Here it should be mentioned that a cell is used as a proxy locator to the ECM region. Again, in UMAP3b location and ECM molecule a mixed a priori which only can be achieved through proxy loction as in fig 3c or correlation matrix analysis. The UMAP shows ECM molecules in various combinations. Fig3c analysis of anatomic location from images with cell proxies would validate morpho-spatial UMAP annotation. Please make this clear in the manuscript or specify why your approach is superior in its presented format.

      Fig 3d: The Matrix Cluster names are in part not correct. Subepithelial & Vascular ECM does not correlate with Vasculature in LisaClust Regions. Also ColIV is not AEC associated, yet subepithelial. No ECM molecule is inflammatory zone associated. Does this indicate cellular density does not allow to distinguish ECM? Also the term "adventitial collagen" is locating to LisaClust Regions Vasculature, Subepithelial Cells, Resting Airways, Infiltrating Cells, Activated Airways. Adventitial per definition of fig. 2g is around blood vessels extending to airways and around it. The adventitial regions are the ECM rich areas after the fibroblasts (as for blood vessels, PMID: 31522963). The definition used in this study therefore generates morphological overlaps between airways and their basolateral regions and blood vessels. Whilst both morphological regions have an adventitia the Matrix cluster assumes from areas to close by this terminology. As a sensitivity analysis I would suggest to reduce the perimeter around blood vessels to the same borderline as seen in airways. If composition remains similar "adventitial collagen" could be a broader term. Alternatively, if adventitia from airway and blood vessel differ these should be separate terms.

      Fig 4c: Balb/c and C57bl/6 labels are incorrect (see a,b) Fig 4h: Cell type "other is highly present in immune foci and inflammatory adventitia but not further classified and not myeloid. This seems either a difficult definition for myeloid or a significant immune population wasn't stained. How is myeloid defined?

      Fig 4l has a vast variety of marker combinations some being very specific within the staining panel, others subsummarise entire groups of cells. It would be very helpful to know if the lables are specific and exclusive or if larger clusters exist, that they then subdivide into specific groups (e.g. Infiltrating cells: any of CD11b, CD44, Ly6C vs. B-cells or CD11b+Lys6C). This graph would profit also from either using markers or cell types only. Your marker set is very distinct and limited so per definition it is either a neutrophil or a Lys6C+. Please decide, explain and provide the other graph as supplement.

      Fig 5l and sup Fig4i: There is no graph confirming the statement that Ym1 is produced by macrophages. From the graphs in either of the two panels, The AEC are highly associated with Ym1/2 expression or the activated alveoli. Please explain ad amend.

      Fig 6g: The correlation plots again show that the matrix environment labels are somewhat confounded. Whilst AEC associated makes perfect sense, adventitial collagen only weakly correlates, yet was part of the adventitial mapping. Cell types like AEC are expected however fibroblasts, especially in resting airways as large constituent cell populations. There are not other, myeloid or lymphoid cells associated with these airways, which under activated conditions seems rather odd. From fig6a it is appearant that the lisaClust has ascribed subepithelial regions to distal parts of the airway separated by blood vessel or parenchyma (C57BL/6 and Balb/c DRA). Also blood vessels are in part other cell types or epithelium (B6 PBS). Is the annotation here the reason for this rather confusing result? Please explain and/or amend.

      Significance

      Strength

      Innovative ECM-Immune Interaction Approach- The study integrates extracellular matrix (ECM) phenotyping with immune cell spatial mapping, providing novel insights into allergic airway inflammation Multiplex Imaging Technology - The use of Imaging Mass Cytometry (IMC) allows high-resolution spatial characterization of both cellular and ECM components.<br /> Strain Analysis - The inclusion of BALB/c and C57BL/6 mice enables differentiation of strain-specific ECM and immune responses.<br /> Deep-Learning-Based ECM Quantification - DeepThresh offers an advanced computational approach for ECM analysis, enhancing accuracy in defining ECM-cell associations.<br /> Comprehensive Tissue Classification- LisaClust clustering facilitates detailed segmentation of lung microenvironments, improving understanding of localized tissue remodeling.

      Limitations

      ECM Isoform Inconsistencies - The study lacks precise annotation of ECM isoforms, which affects the accuracy of reported ECM-cell interactions.<br /> Ambiguous Spatial Correlations- Some ECM clusters, such as "adventitial collagen," overlap inconsistently with anatomical regions, making interpretation challenging.<br /> Unvalidated DeepThresh Method - The manuscript does not provide sufficient validation of DeepThresh's ability to remove staining artifacts.<br /> Lack of Supplemental Data- Key activated cell types and regions lack supporting data for classification.

      Advance, gap filled

      Clearly the next step to improve organ compendia such as the lung cell atlas, spatial protein analysis is warranted. scRNA-Seq in particular for ECM molecules is challenging as these molecules are produced in small quantities or have a very slow turn-over. This study has the potential to provide novel deep learning algorithms to include not only cellular markers but consider larger panels of ECM molecules and their spatial orientation in the lung.

      Audience

      The manuscript is interdisciplinary located between advanced image analysis with deep learning methods, fundamental lung biology and single cell analysis. The readership would entice molecular biologists, bioinformaticians and basic disease model scientists. The manuscript would appeal to clinician scientists and a broader audience if human tissue pendants could be provided validating the methods and outcomes.

      Own Expertise

      Translational scientist in the field of chronic lung disease, highly familiar with epithelial cells, mouse models, human cohorts and next generation sequencing and imaging of live single cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, Parkinson et al. investigated lung extracellular matrix using imaging mass cytometry (IMC) in mouse models. Overall, the paper is well-written, and the data are clear, although major points outlined below need to be addressed. In its current form, the paper appears more like a methods-focused study since, to my understanding, no new biological responses are described. The methods employed are very interesting, particularly the extracellular matrix analysis. However, the scope of the study is quite limited, as all the experiments were performed with mouse samples, which are relatively easy to work with, and the cell organisation is simple compared to humans. The authors do not discuss how this analysis pipeline could be applied to human samples. Furthermore, the entire paper relies on imaging mass cytometry, and additional techniques could have been used to confirm some of the observations, especially given the availability of mouse samples. The introduction mentions the DRA model without providing an explanation of what it involves. Non-specialist readers may not be familiar with this abbreviation, and further clarification should be provided. In the methods section, it is not mentioned whether the lungs were inflated before tissue collection, which is crucial for preserving normal cellular organization. The authors should clarify whether this was performed. Figure 1 provides a brief summary of the methods employed in the study but could be enriched with additional information. In its current state, it does not provide meaningful insights beyond what is described in the methods section. It would be helpful if the authors clarified whether the mice used were adults and whether both male and female animals were included. Additionally, they could present examples of the cell segmentation approach with zoomed-in images at the cellular level to illustrate the analysis. The first set of data in Figure 2 suggests that C57Bl/6 mice did not respond to allergen treatment, as shown by the non-significant increase in cell numbers. The authors should provide evidence that their model induced inflammation through alternative methods, such as assessing eosinophil counts or pathology. The UMAP representation indicates significant overlap between cell clusters, which raises concerns about the accuracy of cell segmentation. For example, the heatmap in Supplementary Figure 1 shows endothelial cells expressing markers such as VWF, aSMA, Vimentin, and PDGFRα, suggesting that the cell cluster may contain a mixture of endothelial cells, vascular smooth muscle cells, and fibroblasts. Examples of segmented cells should be shown to validate this approach. It is unclear what Figure 2e represents. If it is simply to show that certain clusters can be grouped together, such as AEC, AT1, and AT2 as epithelial cells, this could be conveyed in a simpler way. The analysis of extracellular matrix components presented in Figure 3 provides a novel method for studying these acellular structures, which is a challenge in the field. The authors should be commended for their efforts in this area. However, the parameters investigated in Figures 4-6 do not report any novel findings. While IMC appears to work effectively to analyse these parameters simultaneously, the induction of immune foci and changes in tissue organisation following allergen challenges are already well-documented in both mouse models and human samples.

      In Figure 5, the authors show a decrease in neutrophil numbers in challenged mice. This is unexpected, as this model is widely known to induce strong neutrophil recruitment. The authors should clarify this finding and investigate whether neutrophil chemoattractants are increased in these samples. When analysing epithelial cells, the authors separate AT1 and AT2 cells based on podoplanin expression. However, data in Supplementary Figure 4b suggest that both cell types express similar levels of podoplanin. The authors do not provide statistical validation for the claim that AT1 cells express higher levels. Additionally, as podoplanin is expressed by various cell types, such as lymphatic endothelial cells, additional markers are required to confirm the identity of AT1 cells. The last set of data in Figure 7 is interesting and shows that immune cells interact with a population of S100a4 fibroblasts. This finding could be expanded further, as CD11b and Ly6C are expressed by a variety of immune cells. The authors should include additional staining to identify the specific cell types involved, such as monocytes, eosinophils, or airway macrophages. Furthermore, the authors should speculate on why these fibroblast regions attract immune cells. Are these regions enriched in chemokines or other factors? The discussion is engaging but focuses more on methodological aspects than new biological insights. Without mechanistic links, it is challenging to draw meaningful biological conclusions.

      Significance

      The study of Parkinson et al. provides interesting methodological insights into the use of imaging mass cytometry (IMC) to analyse lung architecture following inflammation. The application of multiplex antibody staining will leverage important information related to how tissues are adapting to chronic immune response. Here the authors rely entirely on mouse models for their studies and compared two lines of WT animals and the same allergen model. This limits the scope of the study, additional timepoints, sex or age would have improved the manuscript.

      The applicability of this analysis pipeline to human tissue samples is not discussed, which would significantly enhance the impact of the study. Additionally, complementary techniques, such as flow cytometry or immunohistochemistry, could be used to validate the findings and improve reproducibility. A specialised audience of immunology researchers would be interested by the image analysis approach.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We are very grateful to the reviewers for their time and care in reviewing our manuscript. We have tried to incorporate all of their feedback to the best of our ability, and we feel that this has greatly improved the manuscript.

      Reviewer #1

      This study provides a strong support for the relationship between replication starting point competition and initial factor concentration. However, some predictive conclusions, such as "the origin of high efficiency may not be activated earlier", are still preliminary. Can the author further clarify the scope of these predictions and any potential mechanism in the discussion part to improve the rigor of this study?

      __Response: __In the discussion, we now emphasize the complexity of predicting origin firing time distributions, which are influenced by multiple interrelated factors beyond efficiency alone.

      The resolution and accuracy of the model prediction are obvious to all, but the specific generalization ability is still unknown, which makes the further promotion slightly insufficient. Does the author consider conducting additional experiments? To detect the replication time and efficiency in yeast cells with changed levels of key initiation factors (such as Cdc45 or Dpb11). The empirical data can be compared with the model prediction by editing CRISPR gene or manipulating the initial factor abundance through overexpression vector.

      __Response: __We fully agree that this would be a very interesting direction, but as this is a theoretical study focused on mathematical modelling, conducting further wet lab experiments would be beyond the scope of this work.

      The model currently uses single values for the initiation factor number and recycling rate, though these parameters may vary across cell cycles or under different growth conditions. It is suggested that sensitivity analysis should be added to supplementary materials to explore how the changes of these parameters affect the model output, such as replication time distribution and origin efficiency.

      __Response: __Sensitivity analysis of how the model fit and validation is affected by using different recycling rates and initial firing factor counts will be conducted.

      While the authors use mean absolute error (MAE) to assess model fit, it is suggested to add other statistical methods, such as root mean square error or correlation analysis, to further evaluate the model's accuracy and robustness. In addition, this model lacks comparison with other studies on fitting yeast replication time, and it is difficult to evaluate the effect of this model compared with other models from the specific performance.

      __Response: __We have now included the root mean squared error (RMSE) alongside the mean absolute error (MAE) and R-squared value to compare the simulated replication timing profiles with the experimental data. We agree that we could have been more detailed in comparing our model to other approaches. We have now added a lengthened discussion of this. In some cases, a direct comparison of performance is difficult due to fundamental differences between the approaches, but we have highlighted why this is the case.

      Although the code is open, it is suggested to provide specific instructions or examples of the running code in supplementary materials, so as to facilitate reproduction and application by other researchers.

      __Response: __The GitHub repository will be updated to enable the running of the entire pipeline. This update will include code for processing replication timing data from Müller et al. (2014) and extracting origin positions from the OriDB. Code will also be provided for writing Beacon Calculus scripts with different parameters and origin firing rates. Instructions on the recommended sequence in which scripts should be executed will also be provided. To enable users to run the model locally on their own computers, a smaller version focused on chromosome 2 will be included in the supplementary information and GitHub repository, along with example input data and expected outputs.

      In Figure 2(a), compared with other chromosomes, the fitting effect of chromosome 1 seems to be not good. Has the author ever thought about the reason? In addition, what is the guiding significance of this model in practical applications, such as online services, forecasting tools, or experiments? Can the author give relevant application examples in this regard?

      __Response: __Potential explanations for the poorer fit of the replication timing profile for chromosome 1 are now discussed. The y-axis range has also now been set as the same for all subplots in Figure 2a to make the replication timing profiles for each chromosome more easily comparable. In the discussion, we highlight how the intuitive and flexible nature of the model places it as a valuable tool which could be adapted to predict the effect of different perturbations on DNA replication dynamics.

      Reviewer #2

      In figure 5, the authors demonstrate that replication dynamics are robust to an increase in the number of available firing factors. However, experimental data from strains in which these limiting factors are overexpressed indicate that replication dynamics are substantially altered (e.g. PMID 22081107 and 23562327) since dNTPs become limiting. So the conclusions of the analysis in figure 5 are at best an oversimplification and at worst rather misleading. If adding dNTPs as a factor that becomes limiting only at higher firing factor concentrations is not technically feasible, the authors should be more circumspect in their description and discussion of the results in figure 5.

      __Response: __We now discuss the interpretation of the effect of increasing the number of firing factors, given that factors such as dNTP availability are not included in the model.

      The analysis of replication dynamics appears to exclude origins within the rDNA, which in the average strain account for ~20-25% of all replication origins in S. cerevisiae depending on the origin list chosen. Ignoring this large number of origins likely has a substantial impact on the model: if rDNA origins are intentionally ignored due to the difficulty of modeling repetitive regions or of having multiple identical origins in the competition model, this should be explicitly addressed in the text.

      __Response: __We now emphasize that our model restricts initiation to specific sites and note that some low-efficiency origins, such as those in rDNA, have not been included.

      Reviewer #3

      Can the authors provide some insight into the model's dependency on the Müller, 2014 replication data set? They initialize and converge to this dataset so this paper's findings are highly contingent on treating this data set as ground truth.

      __Response: __In the discussion, we now highlight that, despite the model's reliance on the Müller, 2014 replication data set for fitting, its ability to reproduce other features of DNA replication demonstrates its ability to reflect DNA replication dynamics more broadly.

      The authors describe their model as one that simplifies the origin firing mechanisms compared to more complex models. Is there a direct comparison available that can quantify this advantage? Likewise, how does their model compare to a naive discriminative model, such as one that performs peak finding on the replication timing data. For example, the replication fork directionality can be estimated, naively, using a peak finding algorithm. This type of analysis will provide a stronger argument for the usage of their model.

      __Response: __Quantitative comparisons between our model and other published models are challenging due to differences in underlying assumptions and metrics used to assess goodness of fit. However, we have now added a discussion addressing these challenges and highlighting how our model's design contrasts with that of other models.

      Currently the code is available as supplemental data. Ideally, the code should be available and provided to run the entire pipeline beginning with the initialization of the origin firing program from the Müller, 2014 data set.

      __Response: __The GitHub repository will be updated to enable the running of the entire pipeline. This update will include code for processing replication timing data from Müller et al. (2014) and extracting origin positions from the OriDB.

      The authors mention that origin firing factors and their recycling time to be the basis of how this model is constructed. While also describing the recycle time as a general timing delay that is dependent on a number of reasons such as diffusion and replisome complex formation. Can the authors discuss the limitations of their model towards this simplification?

      __Response: __Limitations of our model's assumptions of constant recycling rates of firing factors are now discussed, as well as our assumption that the firing rates of origins and the maximum number of available firing factors remain constant between simulations.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this paper, the authors create a model of origin replication in yeast using Beacon calculus and a small set of parameters. The model is described as the relationship between origin firing rate and the abundance and recycling of origin firing factors. Using the (Müller, 2014) replication timing data to initialize and fit their model, the authors show that their model recapitulates known replication-related work such as inter-origin distances, replication fork directionality, and origin efficiency. Next, they utilize their model to make predictions that characterize the broader replication program, such as in the quantification of active replication forks, replicons, and replication timing.

      Major comments:

      Can the authors provide some insight into the model's dependency on the Müller, 2014 replication data set? They initialize and converge to this dataset so this paper's findings are highly contingent on treating this data set as ground truth.

      The authors describe their model as one that simplifies the origin firing mechanisms compared to more complex models. Is there a direct comparison available that can quantify this advantage?

      Likewise, how does their model compare to a naive discriminative model, such as one that performs peak finding on the replication timing data. For example, the replication fork directionality can be estimated, naively, using a peak finding algorithm. This type of analysis will provide a stronger argument for the usage of their model.

      Currently the code is available as supplemental data. Ideally, the code should be available and provided to run the entire pipeline beginning with the initialization of the origin firing program from the Müller, 2014 data set.

      The authors mention that origin firing factors and their recycling time to be the basis of how this model is constructed. While also describing the recycle time as a general timing delay that is dependent on a number of reasons such as diffusion and replisome complex formation. Can the authors discuss the limitations of their model towards this simplification?

      Minor comments:

      The author describes the prediction of 200 active replication forks 22 minutes into S phase. Please discuss why this peak number of active replication forks may have been reached. Is this related to the model configured for the number of firing factors F = 200?

      The recycling parameter appears to be very important for this model. A sensitivity analysis of the value of 0.05 would be helpful to understand why this value was chosen.

      It would be helpful to understand the convergence of the model better. Can the authors provide insight or a plot to better understand why the convergence parameter alpha was chosen as 1.2?

      The authors comment that simulated origin efficiencies were estimated close to zero (6.2%{plus minus}22%). Can the authors comment on the large variability in this estimation (the {plus minus}22%)?

      Significance

      General Assessment

      The strength of the model is in summarizing the origin efficiency firing mechanism into a small set of parameters. This also relates to its limitations. The model asserts that the origin firing depends solely on the abundance and recycling of origin firing factors. This limits the scope of the interpretation of the mechanisms of origin firing compared to more complex models.

      Additionally, the model is fit to, and thus, highly dependent on the quality of the Müller, 2014 dataset.

      Improvements:

      This work can be improved by comparing and contrasting their results to existing models where they argue the advantages of employing a simpler model for origin firing compared to more complex ones they cite (Arbona, 2018; de Moura, 2010; Retkute, 2014; Brümmer, 2010).

      While their modeling and dependency on the Müller, 2024 replication timing data may be sufficient, some of the findings can be naively characterized from this data set, such as in replication fork direction and origin firing times. Thus, the authors can argue the strengths of their model by contrasting theirs to more simpler and naive quantifications.

      Currently the paper is very descriptive. A nice addition would be to model the effects of Rpd3 deletion which is thought to either have a direct effect on late origins (advancing their time of replication) or an indirect effect via the rDNA locus which may, in the absence of rpd3) act as sink for limiting replication factors. (Vogelauer et al., Mol Cell, 2002; Yoshida et al.,Mol Cell 2014, He et al., PNAS 2022). Specifically, how does titrating the number of active rDNA origins out of the ~150 available rDNA origins impact global origin usage under this model?

      Scope:

      Audience: Specialized towards groups modeling and studying replication.

      Reviewer's field of expertise: Computer science, computational biology, bioinformatics, and general computational modeling

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Berners-Lee et al extend the beacon calculus approach previously developed by the Boemo lab to model the dynamics of Saccharomyces cerevisiae genome duplication at high resolution, based on competition for limiting origin firing factors. The simulations converge to produce a timing profile that closely matches experimentally determined replication dynamics through the genome. In an extension, the authors model how an increase in firing factor availability (assuming abundant dNTPs) would affect replication dynamics and conclude that overall timing would be robust.

      Major comments

      In figure 5, the authors demonstrate that replication dynamics are robust to an increase in the number of available firing factors. However, experimental data from strains in which these limiting factors are overexpressed indicate that replication dynamics are substantially altered (e.g. PMID 22081107 and 23562327) since dNTPs become limiting. So the conclusions of the analysis in figure 5 are at best an oversimplification and at worst rather misleading. If adding dNTPs as a factor that becomes limiting only at higher firing factor concentrations is not technically feasible, the authors should be more circumspect in their description and discussion of the results in figure 5.

      The analysis of replication dynamics appears to exclude origins within the rDNA, which in the average strain account for ~20-25% of all replication origins in S. cerevisiae depending on the origin list chosen. Ignoring this large number of origins likely has a substantial impact on the model: if rDNA origins are intentionally ignored due to the difficulty of modeling repetitive regions or of having multiple identical origins in the competition model, this should be explicitly addressed in the text

      Minor comment

      Sekedat et al (2010, PMID PMID: 20212525) demonstrated convincingly that replication-fork movement is uniform throughout the genome but are not cited in favor of more recent work.

      Significance

      This manuscript will be of interest to researchers working on DNA replication dynamics, since the methodology and conclusions could be extended to other genomes for which high-quality replication timing data are available. The technical advance of including limiting firing factor availability is interesting, although the overall utility of these models is perhaps somewhat limited by the need for experimental data on which the model can converge. Extending the model to include known additional factors affecting replication-fork movement and replication timing as outlined above would extend the significance, especially since variations in replication-fork speed are associated with genome instability (e.g. PMID 29950726), differentiation (e.g PMID 35256805) and other biologically important phenomena.

      Expertise: molecular biology, high-throughput analysis of DNA replication. I do not have sufficient expertise to evaluate the mathematical model itself.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This study develops a high-resolution stochastic model to explore DNA replication timing regulation in Saccharomyces cerevisiae, specifically focusing on competition between replication origins for limited initiation factors. The model, based on "Beacon Calculus," utilizes an iterative optimization process to fit experimental data, successfully reproducing timing, efficiency, and directionality features of genome replication origins. Additionally, the authors use the model to make predictions on replication dynamics under varying initiation factor concentrations, providing new insights into DNA replication processes that have not yet been observed empirically or experimentally.

      Major Comments:

      1. This study provides a strong support for the relationship between replication starting point competition and initial factor concentration. However, some predictive conclusions, such as "the origin of high efficiency may not be activated earlier", are still preliminary. Can the author further clarify the scope of these predictions and any potential mechanism in the discussion part to improve the rigor of this study?
      2. The resolution and accuracy of the model prediction are obvious to all, but the specific generalization ability is still unknown, which makes the further promotion slightly insufficient. Does the author consider conducting additional experiments? To detect the replication time and efficiency in yeast cells with changed levels of key initiation factors (such as Cdc45 or Dpb11). The empirical data can be compared with the model prediction by editing CRISPR gene or manipulating the initial factor abundance through overexpression vector.
      3. The model currently uses single values for the initiation factor number and recycling rate, though these parameters may vary across cell cycles or under different growth conditions. It is suggested that sensitivity analysis should be added to supplementary materials to explore how the changes of these parameters affect the model output, such as replication time distribution and origin efficiency.
      4. While the authors use mean absolute error (MAE) to assess model fit, it is suggested to add other statistical methods, such as root mean square error or correlation analysis, to further evaluate the model's accuracy and robustness. In addition, this model lacks comparison with other studies on fitting yeast replication time, and it is difficult to evaluate the effect of this model compared with other models from the specific performance.
      5. Although the code is open, it is suggested to provide specific instructions or examples of the running code in supplementary materials, so as to facilitate reproduction and application by other researchers.
      6. In Figure 2(a), compared with other chromosomes, the fitting effect of chromosome 1 seems to be not good. Has the author ever thought about the reason? In addition, what is the guiding significance of this model in practical applications, such as online services, forecasting tools, or experiments? Can the author give relevant application examples in this regard?

      Minor Comments:

      1. Suggestions for Improving Figures: Figures 2 and 3: It is suggested that the differences between experimental data and model fitting data should be clearly marked by using more distinctive colors or symbols with different shapes in these figures, so as to help readers quickly distinguish between simulation results and experimental observation results. Density Plot in Figure 4: The current color gradient is dense, making it difficult to differentiate activation distributions for different origins. Consider using a broader color gradient or adding a slight separation between each origin's curve to improve readability.
      2. Model Parameter Table: Adding a table in the Methods section or supplementary materials that summarizes the main model parameters (e.g., number of initiation factors, recycling rate, replication speed) and the basis for each parameter's setting would be helpful. This will allow readers to quickly understand the model setup and provide a reference for future researchers who may wish to use or adjust this model.
      3. Citation and Description of Experimental Data: Clarify the origin and characteristics of the experimental data used, such as the specific details of the replication timing dataset applied for model fitting, and indicate whether the data represents single-cell or population-averaged measurements. This information will help readers better understand the comparison between the model and actual data.
      4. Background and References: In the Introduction, consider adding a brief explanation of "Beacon Calculus" to aid non-specialist readers in understanding the novelty and applicability of this method. Adding foundational references for Beacon Calculus would further help readers appreciate the advantages of this approach. Additionally, in the discussion of the model's suitability for other biological systems, citing some reviews on high-efficiency replication origin analyses would help demonstrate the model's broader applicability.

      Significance

      1. Significance of the Research:

      This study advances our understanding of DNA replication timing regulation in S. cerevisiae and presents a mathematical modeling approach with theoretical importance. By reconstructing a DNA replication timing framework for yeast, the model also provides a foundation that could be adapted for other systems, potentially advancing modeling techniques in genome replication research. 2. Relation to Existing Literature:

      This study builds upon prior research on S. cerevisiae DNA replication initiation and proposes a simplified, reproducible model. Compared to more complex mathematical models or large-scale data analyses, this approach is more interpretable and easier to reproduce. The study's predictions on initiation factor concentration effects provide another perspective for future experimental work. 3. Target Audience:

      This work will influence researchers studying DNA replication regulation, yeast genomics, and bioinformatics modeling. Additionally, scholars in microbiology and genetics may also benefit from the innovative modeling methods introduced. 4. Reviewer Expertise:

      My expertise includes computational biology and bioinformatics, with a professional knowledge in DNA replication origins and bioinformatics modeling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02824

      Corresponding author(s): Rita tewari

      1. General Statements [optional]

      We wish to thank the reviewers and the Editor for their constructive comments and valuable suggestions to improve our manuscript. We have addressed as far as possible all comments and concerns and we hope that this revised manuscript, with additional new data, will be acceptable for publication. Please find below detailed responses (red text) to all specific points raised by the reviewers

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      We would like to thank all the reviewers for using their valuable time to review our manuscript and to provide constructive comments and suggestions. We have now revised the manuscript taking their comments into consideration; our responses to these comments are detailed below (in red).

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Minor comments: In the results section (lines 498-499), the authors describe free kinetochores in many cells without associated spindle microtubules. However, some nuclei appear to have kinetochores, as presented in Figure 6. Could the authors clarify how this conclusion was derived using transmission electron microscopy (TEM) without serial sectioning, as this is not explicitly mentioned in the materials and methods?

      We observed free kinetochores in the ALLAN-KO parasites with no associated spindle microtubules (see Fig. 6Gh), while kinetochores are attached to spindle microtubules in WT-GFP cells (see Fig. 6Gc). To provide further evidence we analysed additional images and found that ALLAN-KO cells have free kinetochores in the centre of nucleus, unattached to spindle microtubules. We provide some more images clearly showing free kinetochores in these cells (new supplementary Fig. S11).

      However, in the ALLAN mutant, this difference is not absolute: in a search of over 50 cells, one example of a cell with a "normal" nuclear spindle and attached kinetochores was observed.

      The use of serial sectioning has limitations for examining small structures like kinetochores in whole cells. The limitations of the various techniques (for example, SBF-SEM vs tomography) are highlighted in our previous study (Hair et al 2022; PMID: 38092766), and we consider that examining a population of randomly sectioned cells provides a better understanding of the overall incidence of specific features.

      Discussion Section:

      Could the authors expand on why SUN1 and ALLAN are not required during asexual replication, even though they play essential roles during male gametogenesis?

      We observed no phenotype in asexual blood stage parasites associated with the sun1 and allan gene deletions. Several other Plasmodium berghei gene knockout parasites with a phenotype in sexual stages, for example CDPK4 (PMID: 15137943), SRPK (PMID: 20951971), PPKL (PMID: 23028336) and kinesin-5 (PMID: 33154955) have no phenotype in blood stages, so perhaps this is not surprising. One explanation may be the substantial differences in the mode of cell division between these two stages. Asexual blood stages produce new progeny (merozoites) over 24 hours with closed mitosis and asynchronous karyokinesis during schizogony, while male gametogenesis is a rapid process, completed within 15 min to produce eight flagellated gametes. During male gametogenesis the nuclear envelope must expand to accommodate the increased DNA content (from 1N to 8N) before cytokinesis. Furthermore, male gametogenesis is the only stage of the life cycle to make flagella, and axonemes must be assembled in the cytoplasm to produce the flagellated motile male gametes at the end of the process. Thus, these two stages of parasite development have some very different and specific features.

      Lines 611-613 states: "These loops serve as structural hubs for spindle assembly and kinetochore attachment at the nuclear MTOC, separating nuclear and cytoplasmic compartments." Could the authors elaborate on the evidence supporting this statement?

      We observed the loops/folds in the nuclear envelope (NE) as revealed by SUN1-GFP and 3D TEM images during male gametogenesis. These folds/loops occur mainly in the vicinity of the nuclear MTOC where the spindles are assembled (as visualised by EB1 fluorescence) and attached to kinetochores (as visualised by NDC80 fluorescence). These loops/folds may form due to the contraction of the spindle pole back to the nuclear periphery, inducing distortion of the NE. Since there is no physical segregation of chromosomes during the three rounds of mitosis (DNA increasing from 1N to 8N), we suggest that these folds provide additional space for spindle and kinetochore dynamics within an intact NE to maintain separation from the cytoplasm (as shown by location of kinesin-8B).

      In lines 621-622, the authors suggest that ALLAN may have a broader role in NE remodelling across the parasite's lifecycle. Could they reflect on or remind readers of the finding that ALLAN is not essential during the asexual stage?

      ALLAN-GFP is expressed throughout the parasite life cycle but as the reviewer points out, a functional role is more pronounced during male gametogenesis. This does not mean that it has no role at other stages of the life cycle even if there is no obvious phenotype following deletion of the gene during the asexual blood stage. The fact that ALLAN is not essential during the asexual blood stage is noted in lines 628-29.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Introduction Line 63: The authors stat: "NE is integral to mitosis, supporting spindle formation, kinetochore attachment, and chromosome segregation..". Seemingly at odds, they also say (Line 69) that 'open' "mitosis is "characterized by complete NE disassembly". The authors could explain better the ideas presented in their quoted review from Dey and Baum, which points out that truly 'open' and 'closed' topologies may not exist and that even in 'open' mitosis, remnants of the NE may help support the mitotic spindle.

      We have modified the sentence in which we discuss current opinions about 'open' and 'closed' mitosis. It is believed that there is no complete disassembly of the NE during open mitosis and no completely intact NE during closed mitosis, respectively. In fact, the NE plays a critical role in the different modes of mitosis during MTOC organisation and spindle dynamics. Please see the modified lines 64-71.

      Results

      Fig 7 is the final figure; but would be more useful upfront.

      We have provided a new introductory figure (Fig 1) showing a schematic of conventional /canonical LINC complexes and evidence of SUN protein functions in model eukaryotes and compare them to what is known in apicomplexans.

      Fig 1D. The authors generated a C-terminal GFP-tagged SUN1 transfectants and used ultrastructure expansion microscopy (U-ExM) and structured illumination microscopy (SIM) to examine SUN1-GFP in male gametocytes post-activation. The immuno-labelling of SUN1-GFP in these fixed cells appears very different to the live cell images of SUN1-GFP. The labelling profile comprises distinct punctate structures (particularly in the U-ExM images), suggesting that paraformaldehyde fixation process, followed by the addition of the primary and secondary antibodies has caused coalescing of the SUN1-GFP signal into particular regions within the NE.

      We agree with the reviewer. Fixation with paraformaldehyde (PFA) results in a coalescence of the SUN1-GFP signal. We have also tried methanol fixation (see below, new Fig. S2), but a similar problem was encountered.

      Given these fixation issues, the suggestion that the SUN1-GFP signal is concentrated at the BB/ nuclear MTOC and "enriched near spindle poles" needs further support.

      These statements seem at odd with the data for live cell imaging where the SUN1-GFP seems evenly distributed around the nuclear periphery. Can the observation be quantitated by calculating the percentage of BB/ nuclear MTOC structures with associated SUN1-GFP puncta? If not, I am not convinced these data help understand the molecular events.

      We agree with the reviewer that whilst the live cell imaging showed an even distribution of SUN1-GFP signal, after fixation with either PFA or methanol, then SUN1-GFP puncta are observed in addition to the peripheral location around the stained DNA (Hoechst) (See the above figure; puncta are indicated by arrows). These SUN1-GFP labelled puncta were observed at the junction of the nuclear MTOC and the basal body (Fig. 2F). Quantification of the distribution showed that these SUN1-GFP puncta are associated with nuclear MTOC in more than 90 % of cells (18 cells examined). Live cell imaging of the dual labelled parasites; SUN1xkinesin-8B (Fig. 2H) and SUN1x EB1 (Fig. 2I) provides further support for the association of SUN1-GFP puncta with BB (kinesin-8B) /nuclear MTOC (EB1).

      The authors then generated dual transfectants and examined the relative locations of different markers in live cells. These data are more informative.

      The authors state; " ..SUN1-GFP marked the NE with strong signals located near the nuclear MTOCs situated between the BB tetrads". The nuclear MTOCs are not labelled in this experiment. The SUN1-GFP signal between the kinesin-8B puncta is evident as small puncta on regions of NE distortion. I would prefer to not describe this signal as "strong". The signal is stronger in other regions of the NE.

      We have modified the sentence on line 213 to accommodate this suggestion.

      Line 219. The authors state; "..SUN1-GFP is partially colocalized with spindle poles as indicated by EB1,.. it shows no overlap with kinetochores (NDC80)." The authors should provide an analysis of the level of overlap at a pixel by pixel level to support this statement.

      We now provide the overlap at a pixel-by-pixel level for representative images, and we have quantified more cells (n>30), as documented in the new Fig. S4A, which is displayed below. We have also modified the sentence on line 219 to reflect these additions.

      The SUN1 construct is C-terminally GFP-tagged. By analogy with human SUN1, the C-terminal SUN domain is expected to be in the NE lumen. That is in a different compartment to EB1, which is located in the nuclear lumen (on the spindle). Thus, the overlap of signal is expected to be minimal.

      We agree with the reviewer that the overlap between EB1 and Sun1 signals is expected to be minimal. We have quantified the data and included it in Supplementary Fig. S4A.

      Similarly, given that EB1 and NDC80 are known to occupy overlapping locations on the spindle, it seems unlikely that SUN1 can overlap with one and not the other.

      We agree with the reviewer's analysis that EB1 and NDC80 occupy overlapping locations on the spindle, although the length of NDC80 is less at the ends of spindles (see below Fig A) as shown in our previous study where we compared the locations of two spindle proteins, ARK2 and EB1, with that of NDC80 (Zeeshan et al, 2022; PMID: 37704606). In the present study we observed that Sun1-GFP partially overlaps with EB1 at the ends of the spindle, but not with NDC80. Please see Fig. B, below.

      I note on Line 609, the authors state "Our study demonstrates that SUN1 is primarily localized to the nuclear side of the NE.." As per Fig 7D, and as discussed above, the bulk of the protein, including the SUN1 domain, is located in the space between the INM and the ONM.

      We appreciate the reviewer's correction; we have now modified the sentence to indicate that the protein is largely localized in the space between the INM and the ONM on line 617.

      Interestingly, as the authors point out, nuclear membrane loops are evident around EB1 and NDC80 focal regions. The data suggests that the contraction of the spindle pole back to the nuclear periphery induces distortion of the NE.

      We agree with the reviewer's suggestion that the data indicate that contraction of spindle poles back to the nuclear periphery may induce distortion of the NE.

      The author should discuss further the overlap of findings of this study with that from a recent manuscript (https://doi.org/10.1016/j.cels.2024.10.008). That Sayers et al. study identified a complex of SUN1 and ALLC1 as essential for male fertility in P. berghei. Sayers et al. also provide evidence that this complex particulate in the linkage of the MTOC to the NE and is needed for correct mitotic spindle formation during male gametogenesis.

      We thank the reviewer for this suggestion. The study by Sayers et al, (2024) was published while our manuscript was under preparation. It was interesting to see that these complementary studies have similar findings about the role of SUN1 and the novel complex of SUN1-ALLAN. Our study contains a more detailed, in-depth analysis both by Expansion and TEM of SUN1. We include additional studies on the role of ALLAN. We discuss the overlap in the findings of the two studies in lines 590-605.

      While the work is interesting, the conclusions may need to be tempered. The authors suggestion that in the absence of KASH-domain proteins, the SUN1-ALLAN complex forms a non-canonical LINC complex (that is, a connection across the NE), that "achieves precise nuclear and cytoskeletal coordination".

      We have toned down the wording of this conclusion in lines 665-677.

      In other organisms, KASH interacts with the C-terminal domain on SUN1, which as mentioned above is located between the INM and ONM. By contrast, ALLAN interacts with the N-terminal domain of SUN1, which is located in the nuclear lumen. The SUN1-ALLAN interaction is clearly of interest, and ALLAN might replace some of the roles of lamins. However, the protein that functionally replaces KASH (i.e. links SUN1 to the ONM) remains unidentified.

      We agree with reviewer, and future studies will need to focus on identifying the KASH replacement that links SUN1 to the ONM.

      It may also be premature to suggest that the SUN1-ALLAN complex is promising target for blocking malaria transmission. How would it be targeted?

      We have deleted the sentence that raised this suggestion.

      While the above datasets are interesting and internally consistent, there are two other aspects of the manuscript that need further development before they can usefully contribute to the molecular story.

      The authors undertook a transcriptomic analysis of Δsun1 and WT gametocytes, at 8 and 30 min post-activation, revealing moderate changes (~2-fold change) in different genes. GO-based analysis suggested up-regulation of genes involved in lipid metabolism. Given the modest changes, it may not be correct to conclude that "lipid metabolism and microtubule function may be critical functions for gametogenesis that can be perturbed by sun1 deletion." These changes may simply be a consequence of the stalled male gametocyte development.

      Following the reviewer's suggestion we have moved these data to the supplementary information (Fig. S5D-I) and toned down their discussion in the results and discussion sections.

      The authors have then undertaken a detailed lipid analysis of the Δsun1 and WT gametocytes, before and after activation. Substantial changes in lipid metabolites might not be expected in such a short period of time. And indeed, the changes appear minimal. Similarly, there are only minor changes in a few lipid sub-classes between Δsun1 and WT gametocytes. In my opinion, the data are not sufficient to support the authors conclusion that "SUN1 plays a crucial role, linking lipid metabolism to NE remodelling and gamete formation."

      In agreement with the reviewer's comments we have moved these data to supplementary information (Fig. S6) and substantially toned down the conclusions based on these findings.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Major comments: My main concern with this manuscript is that the authors do conclude not only that SUN1 is important for spindle formation and basal body segregation, but also that it influences for lipid metabolism and NE dynamics. I don't think the data supports this conclusion, for several reasons listed below. I would suggest to remove this claim from the manuscript or at least tone it down unless more supporting data are provided, in particular showing any change in NE dynamics in the SUN1-KO. Instead I would recommend to focus on the more interesting role of SUN1-ALLAN in bipartite MTOC organisation, which likely explains all observed phenotypes (including those in later stages of the parasite life cycle). In addition, some aspects of the knockout phenotype should be quantified to a bit deeper level.

      In more detail:

      • The lipidomics analysis is clearly the weakest point of the manuscript: The authors state that there are significant changes in some lipid populations between WT and sun1-KO, and between activated and non-activated cells, yet no statistical analysis is shown and the error bars are quite high compared to only minor changes in the means. For some discussed lipids, the result text does not match the graphs, e.g. PA, where the increase upon activation is more pronounced in the SUN1-KO vs WT (contrary to the text), or MAG, which is reduced in the SUN1-KO vs WT (contrary to the text). I don't see the discussed changes in arachidonic acid levels and myristic acid levels in the data either. Even if the authors find after analysis some statistically significant differences between some groups, they should carefully discuss the biological significance of these differences. As it is, I do not think the presented data warrants the conclusion that deletion of SUN1 changes lipid homeostasis, but rather shows that overall lipid homeostasis is not majorly affected by gametogenesis or SUN1 deletion. As a minor comment, if you decide to keep the lipidomics analysis in the manuscript, please state how many replicates were done.

      As detailed above we have moved the lipidomics data to supplementary information (Fig. S6) and substantially toned down the discussion of these data in the results and discussion sections.

      • I can't quite follow the logic why the authors performed transcriptomic analysis of the SUN1 and how they chose their time points. Their data up to this point indicate that SUN1 has a structural or coordinating role in the bipartite MTOC during male gametogenesis. Based on that it is rather unlikely that SUN1 KO directly leads to transcriptional changes within the 8 min of exflagellation. Isn't it more likely that transcriptional differences are purely a downstream effect of incomplete/failed gametogenesis? This is particularly true for the comparison at 30 min, which compares a mixture of exflagellated/emerged gametes and zygotes in WT to a mixture of aberrant, arrested gametes in the knockout, which will likely not give any meaningful insight. The by far most significant GO-term is then also nuclear-transcribed mRNA catabolic process, which is likely not related at all to SUN1 function (and the authors do not even comment on this in the main text). I would therefore suggest removing the 30 min data set from this manuscript. As a minor point, I would suggest highlighting some of the top de-regulated gene IDs in the volcano plots and stating their function. Also, please state how you prepared the cells for the transcriptomes and in how many replicates this was done.

      As suggested by the reviewer we have removed the 30 min post activation data from the manuscript. We have also moved the rest of the transcriptomics data to supplementary information (Fig. S5) and toned down the presentation of this aspect of the work in the results and discussion sections.

      • Live-cell imaging of SUN1-GFP does nicely visualise the NE during gametogenesis, showing a highly dynamic NE forming loops and folds, which is very exciting to see. It would be beneficial to also show a video from the life-cell imaging.

      We have now added videos to the manuscript as suggested by the reviewer. Please see the supplementary Videos S1 and S2.

      In their discussion, the authors state multiple times that NE dynamics are changed upon SUN1 KO. Yet, they do not provide data supporting this claim, i.e. that the extended loops and folds found in the nuclear envelope during gametogenesis are affected in any way by the knockout of SUN1 or ALLAN. What happens to the NE in absence of SUN1? Are there less loops and folds? In absence of a reliable NE marker this may not be entirely easy to address, but at least some SBF-SEM images of the sun1-KO gametocytes could provide insight.

      It was difficult to provide SBF-SEM images as that work is beyond the scope of this manuscript. We will consider this approach in our future work. We re-examined many of our TEM images of SUN1-KO and ALLAN-KO parasites and did find some micrographs showing aberrant nuclear membrane folding ( - I think the exciting part of the manuscript is the cell biological role of SUN1 on male gametogenesis, which could be carved out a bit more by a more detailed phenotyping. Specifically it would be good to quantify

      1) if DNA replication to an octoploid state still occurs in SUN1-KO and ALLAN-KO,

      DNA replication is not affected in the SUN1-KO and ALLAN-KO mutants: DNA content increases to 8N (data added in Fig. 3J and Fig. S10F).

      2) the proportion of anucleated gametes in WT and the KO lines

      We have added these data in Fig. 3K and Fig. S10G

      3) a quantification of the BB clustering phenotype (in which proportion of cells do the authors see this phenotype). This could be addressed by simple fixed immunofluorescence images of the respective WT/KO lines at various time points after activation (or possibly by reanalysis of the already obtained images) and would really improve the manuscript.

      We have reanalysed the BB clustering phenotype and added the quantitative data in Fig. 4E and Fig. S7.

      Especially the claim that emerged SUN1-KO gametes lack a nucleus is currently only based on single slices of few TEM cells and would benefit from a more thorough quantification in both SUN1- and ALLAN-Kos

      We have examined many microgametes (100+ sections). In WT parasites a small proportion of gametes can appear to lack a nucleus if it does not extend all the way to the apical and basal ends (Hair et al. 2022). However, the proportion of microgametes that appear to lack a nucleus (no nucleus seen in any section) was much higher in the SUN1 mutant. In contrast, this difference was not as clear cut in the ALLAN mutant with a small proportion of intact (with axoneme and nucleus) microgametes being observed.

      We have done additional analysis of male gametes, looking for the presence of the nucleus by live cell imaging after DNA staining with Hoechst. Please see the figure below. These data are added in Fig. 3K (for Sun1-KO) and S10G (for Allan-KO).

      • The TEM suggests that in the SUN1-KO, kinetochores are free in the nucleus. Are all kinetochores free or do some still associate to a (minor/incorrectly formed) spindle? The authors could address this by tagging NDC80 in the KO lines.

      Our observation and quantification of the data indicated that 100% of kinetochores were attached to spindle microtubules and that 0% were unattached kinetochores in the WT parasites. However, the exact opposite was found for the SUN1 mutant with 100% unattached kinetochores and 0% attached. The result was not quite as clear cut in the ALLAN mutant, with 98% unattached and 2% attached. An important observation was the lack of separation of the nuclear poles and any spindle formation. Spindle formation was never or very rarely observed in the mutants.

      • Finally, I think it is curious that in contrast to SUN1, ALLAN seems to be less important, with some KO parasite completing the life cycle. Maybe a more detailed phenotyping as above gives some more hints to where the phenotypic difference between the two proteins lies. I would assume some ALLAN-KO cells can still segregate the basal body. Can the authors speculate/discuss in more detail why these two proteins seems to have slightly different phenotypes?

      We agree with the reviewer. Overall, the ALLAN-KO has a less prominent phenotype than that of the Sun1-KO. The main difference is that in the ALLAN-KO mutant some basal body segregation can occur, leading to the production of some fertile microgametocytes, and ookinetes, and oocyst formation (Fig. 8). Approximately 5% of oocysts sporulated to release infective sporozoites that could infect mice in bite back experiments and complete the life cycle. In contrast the Sun1-KO mutant made no healthy oocysts, or infective sporozoites, and could not complete the life cycle in bite back experiments. We have analysed the phenotype in detail and provide quantitative data for gametocyte stages by EM and ExM in Figs. 4 and S8 (SUN1) and Figs. 7 and S11 (ALLAN). We have also performed detailed analysis of oocyst and sporozoite stages and included the data in Fig. 3 (SUN1) and S10 (ALLAN).

      Based on the location, and functional and interactome data, we think that SUN1 plays a central role in coordinating nucleoplasm and cytoplasmic events as a key component of the nuclear membrane lumen, whereas ALLAN is located in the nucleoplasm. Deleting the SUN1 gene may disrupt the connection between INM and ONM whereas the deletion of ALLAN may affect only the INM.

      . Some additional points where the data is not entirely sound yet or could be improved:

      • Localisation of SUN1: There seems to be a discrepancy between SUN1-GFP location as observed by live cell microscopy, and by Expansion Microscopy (ExM), similar for ALLAN-GFP. By live-cell microscopy, the SUN1 localisation is much more evenly distributed around the NE, while the localisation in ExM is much more punctuated, and e.g. in Figure 1E seems to be within the nucleus. Do the authors have an explanation for this? Also, in Fig. 1D there are two GFP foci at the cell periphery (bottom left of the image), which I would think are not SUN1-Foci, as they seem to be outside of the cell. Is the antibody specific? Was there a negative control done for the antibody (WT cells stained with GFP antibodies after ExM)?

      High resolution SIM and expansion microscopy showed that the SUN1-GFP molecules coalesce to form puncta, in contrast to the more uniform distribution observed by live cell imaging. This apparent difference may be due to a better resolution that could not be achieved by live cell imaging. We agree with the reviewer that the two green foci are outside of the cell. As a negative control we have used WT-ANKA cells (which contain no GFP) and the anti-GFP antibody, which gave no signal. This confirms the specificity of the antibody (please see the new Fig. S3).

      • The authors argue that SIM gave unexpected results due to PFA fixation leading to collapse of the NE loops. However, they also fix their ExM cells and their EM cells with PFA and do not observe a collapse, at least from what I see in the two presented images and in the 3D reconstruction. Is there something else different in the sample preparation?

      There was no difference in the fixation process for samples examined by SIM and ExM, but we used an anti-GFP antibody in ExM to visualise the SUN1-GFP, while in SIM the images of GFP signal were collected directly after fixation. We used both PFA and methanol as fixative, and both methods showed a coalescing of the SUN1-GFP signal (please see the new Fig. S2 and S3).

      Can the authors trace their NE in ExM according to the NHS-Ester signal?

      We could trace the NE in the ExM by the NHS-ester signal and observed that the SUN1-GFP signal was largely coincident with the NE (Please see the new Fig. S3B below).

      • Fig 2D: It would be good to not just show images of oocysts but actually quantify their size from images. Also, have the authors determined the sporozoite numbers in SUN1-KO?

      We have measured oocyst size (data added in new Fig. 3) and added the sporozoite quantification data in Fig. 3D.

      • Line 481-483: the authors state that oocyst size is reduced in ALLAN-KO but do not show the data. Please quantify oocyst size or at least show representative images. Also the drastic decrease in sporozoite numbers (Fig. 6D, E) is not mentioned in the text. Please add reference to Fig S7D when talking about the bite back data.

      We have added the oocyst size data in Fig. S10. We mention the changes in sporozoite numbers (now shown in Fig. 7D, E), and refer to the bite back data shown in current Fig. 7E.

      • Fig S1C, 6C: Both WB images are stitched, but this is not clearly indicated e.g. by leaving a small gap between the lanes. Also please show a loading control along with the western blots. Also there seems to be a (unspecific?) band in the control, running at the same height as Allan-GFP WB. What exactly is the control?

      We have provided the original blot showing the bands of ALLAN-GFP and SUN1-GFP. As a positive control, we used an RNA associated protein (RAP-GFP) that is highly expressed in Plasmodium and regularly used in our lab for this purpose.

      • Regarding the crossing experiment: The authors conclude from this cross that SUN1 is only needed in males, yet for this conclusion they would need to also show that a cross with a female line does not rescue the phenotype. The authors should repeat the cross with a male-deficient line to really test if the phenotype is an exclusively male phenotype. In addition, line 270-272 states that no oocysts/sporozoites were detected in sun1-ko and nek4-ko parasites. However, the figure 2E shows only oocysts, not sporozoites, and shows also that sun1-ko does form oocysts, albeit dead ones.

      We have now performed the experiment of crossing the Sun1-KO parasite line with a male deficient line (Hap2-KO) and added the data in Fig. 3I. We have added images showing sporozoites in oocysts.

      • In Fig S1 the authors show that they also generated a SUN1-mCherry line, yet they do not use it in any of the presented experiments (unless I missed it). Would it be beneficial to cross the SUN1-mCherry line with the Allan1-GFP line to test colocalisation (possibly also by expansion microscopy)?

      We did generate a SUN1-mCherry line, with the intent to cross ALLAN-GFP and SUN1-mCherry lines and observe the co-location of the proteins. Despite multiple attempts this cross was unsuccessful. This may have been due to their close proximity such that the addition of both GFP and mCherry was difficult to facilitate a proper protein-protein interaction between either of the proteins.

      • Line 498: "In a significant proportion of cells" - What was the proportion of cells, and what does significant mean in this context?

      Approximately 67% of cells showed the clumping of BBs. We have now added the numbers in Figs. 6H and S11I.

      • The authors should discuss a bit more how their work relates to the work of Sayers et al. 2024, which also identified the SUN1-ALLAN complex. The paper is cited, but only very briefly commented on.

      We have extended this discussion now in lines 590-605.

      Suggestions how to improve the writing and data presentation.

      • General presentation of microscopy images: Considering that large parts of the manuscript are based on microscopy data, their presentation could be improved. Single-channel microscopy images would benefit from being depicted in gray scale instead of color, which would make it easier to see the structures and intensities (especially for blue channels).

      Whilst we agree with the reviewer, sometimes it is difficult to see the features in the merged images. Therefore, we would like to request to be allowed to retain the colours, which can be easily followed in both individual and merged images.

      Also, it would be good to harmonize in which panels arrows are shown (e.g. Fig 1G, where some white arrows are in the SUN1-GFP panel, while others are in the merge panel, but they presumably indicate the same thing.). At the same time, Fig 1H doesn't have any with arrows, even though the figure legend states so.

      We apologise for this lack of consistency, and we have now added arrows wherever they are missing to harmonise in the presentations.

      Fig 3A and S4 show the same experiment but are coloured in different colours (NHS-Eester in green vs grey scale).

      • Are the scale bars of all expansion microscopy images adjusted for the expansion factor?

      Yes, the scale bars are adjusted accordingly.

      • The figure legends would benefit from streamlining, as they have very different style between figures (eg Fig. 6 which has a concise figure legend vs microscopy figures where figure legends are very long and describe not only the figure but the results)

      The figure legends have been streamlined, with removal of the description of results.

      • Line 155-156: The text makes it sound like the expression only happens after activation. is that the case? Are these images activated or non-activated gametocytes?

      They are expressed before activation, but the signal intensifies after activation. Images from before and after activation of gametocytes have been added in Fig. S1F.

      • Line 267: Reference to the original nek4-KO paper missing

      This reference is now included.

      • Line 301: The reference to Figure 2J seems to be a bit arbitrarily placed. Also, this schematic of lipid metabolism is never discussed in relation to the transcriptomic or lipidomic data.

      We have moved these data to supplementary information and modified the text.

      • Line 347-349 states that gametes emerged, but the referenced figure shows activated gametocytes before exflagellation.

      We have corrected the text to the start of exflagellation.

      • Line 588: Spelling mistake in SUN1-domain

      Corrected.

      • Line 726/731: i missing in anti-GFP

      Corrected.

      • Line 787-789: statement of scale bar and number of cells imaged is not at the right position in the figure legend.

      Moved to right place

      • Line 779, 783: "shades of green" should be just "green". Same goes for line 986, 989 with "shades of grey"

      Changed.

      • Line 974, 976: please correct to WT-GFP and dsun1

      Corrected.

      • Line 1041, 1044: WT-GFP instead of WTGFP.

      Corrected to WT-GFP.

      • Fig 1B, D, E, Fig S1G, H: What are the time points of imaging?

      We have added the time points to the images in these figures.

      • Fig 1D/Line 727: the scale of the scale bar on the inset is missing.

      We have added the scale bar.

      • Fig 3 E-G and 6H-J: Please indicate total number of cells/images analysed per quantification, either in the graphs themselves or in the figure legend.

      We indicate now the number of cells analysed in individual figures and also in Fig. S5C and S8C, respectively.

      • Fig 5B: What is NP

      Nuclear Pole (NP), also known as the nuclear/acentriolar MTOC (Zeeshan et al 2022; PMID: 35550346).

      • Fig S1B/D: The legend states that there is an arrow indicating the band, but there is none.

      We have added the arrow.

      • Fig S2C: Is the scale bar really the same for the zygote and the ookinete?

      We have checked this and used the same for both zygote and ookinete.

      • Fig S3C, S7C: which stages was qRT-PCR done on?

      Gametocytes activated for 8 min.

      • Fig. S3D, S7D: According to the figure legend, three independent experiments were performed. How many mice were used per experiment? It would be good to depict the individual data points instead of the bar graph. For S7D, 3 data points are depicted (one in WT, two in allan-KO), what do they mean?

      The bite back experiment was performed using 15-20 mosquitoes infected with WT-GFP and gene knockout lines to feed on one naïve mouse each, in three different experiments. We have now included the data points in the bar diagrams.

      • Fig S3: Panel letters E and G are missing

      We have updated the lettering in current Fig. S5

      • Fig 3D: Please indicate what those boxes are. I presume that these are the insets show in b, e and j, but it is never mentioned. J is not even larger than i. Also, f is quite cropped, it would be good to see the large-scale image it comes from to see where in the nucleus these kinetochores are placed. Were there unbound kinetochores found in WT?

      We mention the boxes in the figure legends. It is rare to find unbound kinetochores in WT parasite. We provide large scale and zoomed-in images of free kinetochores in Fig. S8.

      • Fig S4: Insets are not mentioned in the figure legend. Please add scale bar to zoom-ins

      We now describe the insets in the figure legends and have added scale bars to the zoomed-in images.

      • Fig S5A, B: Please indicate which inset belongs to which sub-panel. Where does Ac stem from?

      We have now included the full image showing the inset (new Fig. S8).

      • Fig S5C and S8C: Change "DNA" to "Nucleus".

      We have changed "DNA" to "Nucleus". Now they are Fig. S8K and S11I.

      Reviewer #3 (Significance (Required)):

      Yet, the statement that SUN1 is also important for lipid homoeostasis and NE dynamics is currently not backed up by sufficient data. I believe that the manuscript would benefit from removing the less convincing transcriptomic and lipidomic datasets and rather focus on more deeply characterising the cell biology of the knockouts. This way, the results would be interesting not only for parasitologists, but also for more general cell biologists.

      We have moved the lipidomics and transcriptomics data to supplementary information and toned down the emphasis on these data to make the manuscript more focused on the cell biology and analysis of the genetic KO data.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors investigate the function of the protein SUN1, a proposed nuclear envelope protein linking nuclear and cytoplasmic cytoskeleton, during the rapid male gametogenesis of the rodent malaria parasite Plasmodium berghei. They reveal that SUN1 localises to the nuclear envelope (NE) in male and female gametes, and show that the male NE has unexpectedly high dynamics during the rapid process of gametogenesis. Using expansion microscopy, the authors find that SUN1 is enriched at the neck of the bipartite MTOC that links the intranuclear spindle to the basal bodies of the cytoplasmic axonemes. They further show that upon deletion of SUN1, the basal bodies of the eight axonemes fail to segregate, no spindle is formed, and emerging gametes are anucleated, leading to a complete block in transmission. By interactomics they identify a divergent allantoicase-like protein, ALLAN, as a main interaction partner of SUN1 and further show that ALLAN deletion largely phenocopies the effect of SUN1. Overall, the work here reveals a new protein complex important for maintaining the structural integrity of the bipartite MTOC during the rapid rounds of endomitosis in male gametogenesis. In addition, the authors use transcriptomics and lipidomics to further characterise the effects of SUN1 deletion on gametogenesis and conclude that SUN1 is also required for lipid homeostasis and NE dynamics.

      Major comments:

      My main concern with this manuscript is that the authors do conclude not only that SUN1 is important for spindle formation and basal body segregation, but also that it influences for lipid metabolism and NE dynamics. I don't think the data supports this conclusion, for several reasons listed below. I would suggest to remove this claim from the manuscript or at least tone it down unless more supporting data are provided, in particular showing any change in NE dynamics in the SUN1-KO. Instead I would recommend to focus on the more interesting role of SUN1-ALLAN in bipartite MTOC organisation, which likely explains all observed phenotypes (including those in later stages of the parasite life cycle). In addition, some aspects of the knockout phenotype should be quantified to a bit deeper level.

      In more detail:

      • The lipidomics analysis is clearly the weakest point of the manuscript: The authors state that there are significant changes in some lipid populations between WT and sun1-KO, and between activated and non-activated cells, yet no statistical analysis is shown and the error bars are quite high compared to only minor changes in the means. For some discussed lipids, the result text does not match the graphs, e.g. PA, where the increase upon activation is more pronounced in the SUN1-KO vs WT (contrary to the text), or MAG, which is reduced in the SUN1-KO vs WT (contrary to the text). I don't see the discussed changes in arachidonic acid levels and myristic acid levels in the data either. Even if the authors find after analysis some statistically significant differences between some groups, they should carefully discuss the biological significance of these differences. As it is, I do not think the presented data warrants the conclusion that deletion of SUN1 changes lipid homeostasis, but rather shows that overall lipid homeostasis is not majorly affected by gametogenesis or SUN1 deletion. As a minor comment, if you decide to keep the lipidomics analysis in the manuscript, please state how many replicates were done.
      • I can't quite follow the logic why the authors performed transcriptomic analysis of the SUN1 and how they chose their time points. Their data up to this point indicate that SUN1 has a structural or coordinating role in the bipartite MTOC during male gametogenesis. Based on that it is rather unlikely that SUN1 KO directly leads to transcriptional changes within the 8 min of exflagellation. Isn't it more likely that transcriptional differences are purely a downstream effect of incomplete/failed gametogenesis? This is particularly true for the comparison at 30 min, which compares a mixture of exflagellated/emerged gametes and zygotes in WT to a mixture of aberrant, arrested gametes in the knockout, which will likely not give any meaningful insight. The by far most significant GO-term is then also nuclear-transcribed mRNA catabolic process, which is likely not related at all to SUN1 function (and the authors do not even comment on this in the main text). I would therefore suggest removing the 30 min data set from this manuscript. As a minor point, I would suggest highlighting some of the top de-regulated gene IDs in the volcano plots and stating their function. Also, please state how you prepared the cells for the transcriptomes and in how many replicates this was done.
      • Live-cell imaging of SUN1-GFP does nicely visualise the NE during gametogenesis, showing a highly dynamic NE forming loops and folds, which is very exciting to see. It would be beneficial to also show a video from the life-cell imaging. In their discussion, the authors state multiple times that NE dynamics are changed upon SUN1 KO. Yet, they do not provide data supporting this claim, i.e. that the extended loops and folds found in the nuclear envelope during gametogenesis are affected in any way by the knockout of SUN1 or ALLAN. What happens to the NE in absence of SUN1? Are there less loops and folds? In absence of a reliable NE marker this may not be entirely easy to address, but at least some SBF-SEM images of the sun1-KO gametocytes could provide insight.
      • I think the exciting part of the manuscript is the cell biological role of SUN1 on male gametogenesis, which could be carved out a bit more by a more detailed phenotyping. Specifically it would be good to quantify 1) if DNA replication to an octoploid state still occurs in SUN1-KO and ALLAN-KO, 2) the proportion of anucleated gametes in WT and the KO lines and 3) a quantification of the BB clustering phenotype (in which proportion of cells do the authors see this phenotype). This could be addressed by simple fixed immunofluorescence images of the respective WT/KO lines at various time points after activation (or possibly by reanalysis of the already obtained images) and would really improve the manuscript. Especially the claim that emerged SUN1-KO gametes lack a nucleus is currently only based on single slices of few TEM cells and would benefit from a more thorough quantification in both SUN1- and ALLAN-KOs
      • The TEM suggests that in the SUN1-KO, kinetochores are free in the nucleus. Are all kinetochores free or do some still associate to a (minor/incorrectly formed) spindle? The authors could address this by tagging NDC80 in the KO lines.
      • Finally, I think it is curious that in contrast to SUN1, ALLAN seems to be less important, with some KO parasite completing the life cycle. Maybe a more detailed phenotyping as above gives some more hints to where the phenotypic difference between the two proteins lies. I would assume some ALLAN-KO cells can still segregate the basal body. Can the authors speculate/discuss in more detail why these two proteins seems to have slightly different phenotypes?

      Minor comments:

      Some additional points where the data is not entirely sound yet or could be improved:

      • Localisation of SUN1: There seems to be a discrepancy between SUN1-GFP location as observed by live cell microscopy, and by Expansion Microscopy (ExM), similar for ALLAN-GFP. By live-cell microscopy, the SUN1 localisation is much more evenly distributed around the NE, while the localisation in ExM is much more punctuated, and e.g. in Figure 1E seems to be within the nucleus. Do the authors have an explanation for this? Also, in Fig. 1D there are two GFP foci at the cell periphery (bottom left of the image), which I would think are not SUN1-Foci, as they seem to be outside of the cell. Is the antibody specific? Was there a negative control done for the antibody (WT cells stained with GFP antibodies after ExM)? - The authors argue that SIM gave unexpected results due to PFA fixation leading to collapse of the NE loops. However, they also fix their ExM cells and their EM cells with PFA and do not observe a collapse, at least from what I see in the two presented images and in the 3D reconstruction. Is there something else different in the sample preparation? Can the authors trace their NE in ExM according to the NHS-Ester signal?
      • Fig 2D: It would be good to not just show images of oocysts but actually quantify their size from images. Also, have the authors determined the sporozoite numbers in SUN1-KO?
      • Line 481-483: the authors state that oocyst size is reduced in ALLAN-KO but do not show the data. Please quantify oocyst size or at least show representative images. Also the drastic decrease in sporozoite numbers (Fig. 6D, E) is not mentioned in the text. Please add reference to Fig S7D when talking about the bite back data.
      • Fig S1C, 6C: Both WB images are stitched, but this is not clearly indicated e.g. by leaving a small gap between the lanes. Also please show a loading control along with the western blots. Also there seems to be a (unspecific?) band in the control, running at the same height as Allan-GFP WB. What exactly is the control?
      • Regarding the crossing experiment: The authors conclude from this cross that SUN1 is only needed in males, yet for this conclusion they would need to also show that a cross with a female line does not rescue the phenotype. The authors should repeat the cross with a male-deficient line to really test if the phenotype is an exclusively male phenotype. In addition, line 270-272 states that no oocysts/sporozoites were detected in sun1-ko and nek4-ko parasites. However, the figure 2E shows only oocysts, not sporozoites, and shows also that sun1-ko does form oocysts, albeit dead ones.
      • In Fig S1 the authors show that they also generated a SUN1-mCherry line, yet they do not use it in any of the presented experiments (unless I missed it). Would it be beneficial to cross the SUN1-mCherry line with the Allan1-GFP line to test colocalisation (possibly also by expansion microscopy)?
      • Line 498: "In a significant proportion of cells" - What was the proportion of cells, and what does significant mean in this context?
      • The authors should discuss a bit more how their work relates to the work of Sayers et al. 2024, which also identified the SUN1-ALLAN complex. The paper is cited, but only very briefly commented on.

      Suggestions how to improve the writing and data presentation.

      • General presentation of microscopy images: Considering that large parts of the manuscript are based on microscopy data, their presentation could be improved. Single-channel microscopy images would benefit from being depicted in gray scale instead of color, which would make it easier to see the structures and intensities (especially for blue channels). Also, it would be good to harmonize in which panels arrows are shown (e.g. Fig 1G, where some white arrows are in the SUN1-GFP panel, while others are in the merge panel, but they presumably indicate the same thing.). At the same time, Fig 1H doesn't have any with arrows, even though the figure legend states so. Fig 3A and S4 show the same experiment but are coloured in different colours (NHS-Eester in green vs grey scale).
      • Are the scale bars of all expansion microscopy images adjusted for the expansion factor?
      • The figure legends would benefit from streamlining, as they have very different style between figures (eg Fig. 6 which has a concise figure legend vs microscopy figures where figure legends are very long and describe not only the figure but the results)
      • Line 155-156: The text makes it sound like the expression only happens after activation. is that the case? Are these images activated or non-activated gametocytes?
      • Line 267: Reference to the original nek4-KO paper missing
      • Line 301: The reference to Figure 2J seems to be a bit arbitrarily placed. Also, this schematic of lipid metabolism is never discussed in relation to the transcriptomic or lipidomic data.
      • Line 347-349 states that gametes emerged, but the referenced figure shows activated gametocytes before exflagellation.
      • Line 588: Spelling mistake in SUN1-domain
      • Line 726/731: i missing in anti-GFP
      • Line 787-789: statement of scale bar and number of cells imaged is not at the right position in the figure legend.
      • Line 779, 783: "shades of green" should be just "green". Same goes for line 986, 989 with "shades of grey"
      • Line 974, 976: please correct to WT-GFP and dsun1
      • Line 1041, 1044: WT-GFP instead of WTGFP.
      • Fig 1B, D, E, Fig S1G, H: What are the time points of imaging?
      • Fig 1D/Line 727: the scale of the scale bar on the inset is missing.
      • Fig 3 E-G and 6H-J: Please indicate total number of cells/images analysed per quantification, either in the graphs themselves or in the figure legend.
      • Fig 5B: What is NP?
      • Fig S1B/D: The legend states that there is an arrow indicating the band, but there is none.
      • Fig S2C: Is the scale bar really the same for the zygote and the ookinete?
      • Fig S3C, S7C: which stages was qRT-PCR done on?
      • Fig. S3D, S7D: According to the figure legend, three independent experiments were performed. How many mice were used per experiment? It would be good to depict the individual data points instead of the bar graph. For S7D, 3 data points are depicted (one in WT, two in allan-KO), what do they mean?
      • Fig S3: Panel letters E and G are missing
      • Fig 3D: Please indicate what those boxes are. I presume that these are the insets show in b, e and j, but it is never mentioned. J is not even larger than i. Also, f is quite cropped, it would be good to see the large-scale image it comes from to see where in the nucleus these kinetochores are placed. Were there unbound kinetochores found in WT?
      • Fig S4: Insets are not mentioned in the figure legend. Please add scale bar to zoom-ins
      • Fig S5A, B: Please indicate which inset belongs to which sub-panel. Where does Ac stem from?
      • Fig S5C and S8C: Change "DNA" to "Nucleus".

      Significance

      This study uses extensive microscopy and genetics to characterise an unusual SUN1-ALLAN complex and provides new insights into the molecular events during Plasmodium male gametogenesis, especially how the intranuclear events (spindle formation and mitosis) are linked to the extranuclear, cytoplasmic formation of the axonemes. While it could be more extensive, the phenotypic characterisation of the mutants reveals an interesting phenotype, showing that SUN1 and ALLAN are localised to and maintain the neck region of the bipartite MTOC. The authors here confirm and expand the previous knowledge about SUN1 in P. berghei (as published by Sayers et al., 2024), adding more detail to its localisation and dynamics, and further characterise the interaction partner ALLAN. Yet, the statement that SUN1 is also important for lipid homoeostasis and NE dynamics is currently not backed up by sufficient data. I believe that the manuscript would benefit from removing the less convincing transcriptomic and lipidomic datasets and rather focus on more deeply characterising the cell biology of the knockouts. This way, the results would be interesting not only for parasitologists, but also for more general cell biologists.

      My expertise lies within the cell biology of malaria parasites, especially during early transmission stages.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript by Zeeshan et al describes the organisation of SUN1 during the rapid closed mitosis of male Plasmodium gametocytes and the consequences of knockout of the SUN1 gene for male gamete formation and oocyst development.

      SUN (Sad1, UNC84-domain) proteins have been shown, in other studies, in other organisms, to be part of a bridging complex (LINC) that links cytoplasm-located structural elements with nuclear structures. They are anchored in the inner nuclear envelope and present a C-terminal SUN domain into the space between nuclear envelope (NE) inner and outer membranes. In humans, the SUN domain interacts with the outer NE-embedded KASH (Klarsicht, ANC-1, Syne Homology)-protein, which in turn binds to the cytoskeletal components, including the centrosome.

      Introduction

      Line 63: The authors stat: "NE is integral to mitosis, supporting spindle formation, kinetochore attachment, and chromosome segregation..". Seemingly at odds, they also say (Line 69) that 'open' "mitosis is "characterized by complete NE disassembly". The authors could explain better the ideas presented in their quoted review from Dey and Baum, which points out that truly 'open' and 'closed' topologies may not exist and that even in 'open' mitosis, remnants of the NE may help support the mitotic spindle.

      Results

      Fig 7 is the final figure; but would be more useful upfront. The authors compared the sequence of SUN1, ALLAN, KASH proteins and lamins across apicomplexans, and Arabidopsis and humans. They note that plasmodium has two SUN domain proteins. Plasmodium SUN1 has the same orientation as in human SUN1 with the C-terminal SUN domain into the space between nuclear envelope (NE) inner and outer membranes. In agreement with previous reports, no KASH-like or lamin proteins were identified.

      Fig 1D. The authors generated a C-terminal GFP-tagged SUN1 transfectants and used ultrastructure expansion microscopy (U-ExM) and structured illumination microscopy (SIM) to examine SUN1-GFP in male gametocytes post-activation. The immuno-labelling of SUN1-GFP in these fixed cells appears very different to the live cell images of SUN1-GFP. The labelling profile comprises distinct punctate structures (particularly in the U-ExM images), suggesting that paraformaldehyde fixation process, followed by the addition of the primary and secondary antibodies has caused coalescing of the SUN1-GFP signal into particular regions within the NE.

      Given these fixation issues, the suggestion that the SUN1-GFP signal is concentrated at the BB/ nuclear MTOC and "enriched near spindle poles" needs further support. These statements seem at odd with the data for live cell imaging where the SUN1-GFP seems evenly distributed around the nuclear periphery. Can the observation be quantitated by calculating the percentage of BB/ nuclear MTOC structures with associated SUN1-GFP puncta? If not, I am not convinced these data help understand the molecular events.

      The authors then generated dual transfectants and examined the relative locations of different markers in live cells. These data are more informative.

      The authors state; " ..SUN1-GFP marked the NE with strong signals located near the nuclear MTOCs situated between the BB tetrads". The nuclear MTOCs are not labelled in this experiment. The SUN1-GFP signal between the kinesin-8B puncta is evident as small puncta on regions of NE distortion. I would prefer to not describe this signal as "strong". The signal is stronger in other regions of the NE.

      Line 219. The authors state; "..SUN1-GFP is partially colocalized with spindle poles as indicated by EB1,.. it shows no overlap with kinetochores (NDC80)." The authors should provide an analysis of the level of overlap at a pixel by pixel level to support this statement.

      The SUN1 construct is C-terminally GFP-tagged. By analogy with human SUN1, the C-terminal SUN domain is expected to be in the NE lumen. That is in a different compartment to EB1, which is located in the nuclear lumen (on the spindle). Thus, the overlap of signal is expected to be minimal. Similarly, given that EB1 and NDC80 are known to occupy overlapping locations on the spindle, it seems unlikely that SUN1 can overlap with one and not the other.

      I note on Line 609, the authors state "Our study demonstrates that SUN1 is primarily localized to the nuclear side of the NE.." As per Fig 7D, and as discussed above, the bulk of the protein, including the SUN1 domain, is located in the space between the INM and the ONM.

      Interestingly, as the authors point out, nuclear membrane loops are evident around EB1 and NDC80 focal regions. The data suggests that the contraction of the spindle pole back to the nuclear periphery induces distortion of the NE.

      The authors generate Δsun1 parasites and showed that a functional sun1 gene is required for male gamete formation and subsequent oocyst development.

      In a very impressive set of micrographs (Fig 3), the authors used U-ExM and TEM to show that spindle formation is severely disrupted, and BB fail to segregate in Δsun1 gametocytes. Axoneme elongation occurs but the axenomes are inconnected to BBs and nuclear spindles.

      The authors undertook immunoprecipitation (IP) experiment using a nanobody that recognises SUN1-GFP in lysates of purified activated gametocytes.

      They identified several nuclear pore proteins, as well as the allantoicase-like protein (ALCC1/ ALLAN). They reverse-immunoprecipitated ALLAN-GFP from lysates of activated gametocytes and identified SUN1 and its interactors, DDRGK-domain containing protein and kinesin-15. This is an important finding.

      The authors used AlphaFold to predict potential complexes of SUN1 and ALLAN. A complex is predicted between the plasmodium-specific N-terminal domain of SUN1. The authors conclude that ALLAN is located in the nuclear lumen and is involved in linking SUN1 to nuclear components.

      The authors generated a line expressing ALLAN-GFP. In activated male gametocytes, ALLAN-GFP rapidly relocates to focal points at the nuclear periphery that correlated with the nuclear MTOCs (spindle poles). This is another important finding.

      Δallan mutants exhibit a very similar phenotype to the Δsun1 parasites. Activated male gametocyte exhibited clustered BB, with incomplete segregation and misalignment relative to the nuclear MTOCs. TEM data is consistent with the author's conclusion that "ALLAN is critical for the alignment of spindle microtubules with kinetochores and BB segregation."

      Taken together these results are consistent with the suggestion that SUN1 and ALLN proteins play an important structural role in linking the nuclear spindle of P. berghei male gametocytes to the BB and axonemes.

      These are important findings. The author should discuss further the overlap of findings of this study with that from a recent manuscript (https://doi.org/10.1016/j.cels.2024.10.008). That Sayers et al. study identified a complex of SUN1 and ALLC1 as essential for male fertility in P. berghei. Sayers et al. also provide evidence that this complex particulate in the linkage of the MTOC to the NE and is needed for correct mitotic spindle formation during male gametogenesis.

      While the work is interesting, the conclusions may need to be tempered. The authors suggestion that in the absence of KASH-domain proteins, the SUN1-ALLAN complex forms a non-canonical LINC complex (that is, a connection across the NE), that "achieves precise nuclear and cytoskeletal coordination".

      In other organisms, KASH interacts with the C-terminal domain on SUN1, which as mentioned above is located between the INM and ONM. By contrast, ALLAN interacts with the N-terminal domain of SUN1, which is located in the nuclear lumen. The SUN1-ALLAN interaction is clearly of interest, and ALLAN might replace some of the roles of lamins. However, the protein that functionally replaces KASH (i.e. links SUN1 to the ONM) remains unidentified.

      It may also be premature to suggest that the SUN1-ALLAN complex is promising target for blocking malaria transmission. How would it be targeted?

      While the above datasets are interesting and internally consistent, there are two other aspects of the manuscript that need further development before they can usefully contribute to the molecular story.

      The authors undertook a transcriptomic analysis of Δsun1 and WT gametocytes, at 8 and 30 min post-activation, revealing moderate changes (~2-fold change) in different genes. GO-based analysis suggested up-regulation of genes involved in lipid metabolism. Given the modest changes, it may not be correct to conclude that "lipid metabolism and microtubule function may be critical functions for gametogenesis that can be perturbed by sun1 deletion." These changes may simply be a consequence of the stalled male gametocyte development.

      The authors have then undertaken a detailed lipid analysis of the Δsun1 and WT gametocytes, before and after activation. Substantial changes in lipid metabolites might not be expected in such a short period of time. And indeed, the changes appear minimal. Similarly, there are only minor changes in a few lipid sub-classes between Δsun1 and WT gametocytes. In my opinion, the data are not sufficient to support the authors conclusion that "SUN1 plays a crucial role, linking lipid metabolism to NE remodelling and gamete formation."

      Significance

      While the work is interesting, the conclusions may need to be tempered. Datasets are interesting and internally consistent. The aspects of manuscript and conclusion derived from transcriptomic and the lipidomic analysis, however, need further development before they can usefully contribute to the molecular story.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The study explores the role of the SUN1-ALLAN complex in Plasmodium berghei, identifying it as a unique mediator of nuclear envelope (NE) remodeling and microtubule-organizing center (MTOC) coordination during the rapid closed mitosis of male gametogenesis. The authors demonstrate that SUN1, a nuclear envelope protein, and ALLAN, a novel allantoicase-like protein, form a non-canonical complex. This complex bridges chromatin and cytoskeletal interactions, compensating for the lack of canonical LINC components like KASH-domain proteins and lamins in Plasmodium. Using lipidomics, mass spectrometry, RNA-seq, and advanced imaging methods like ultrastructure expansion microscopy (U-ExM), they reveal that disruption of this complex results in impaired spindle assembly, basal body segregation, and kinetochore attachment. This leads to defective, anuclear flagellated gametes incapable of fertilization. Furthermore, SUN1 deletion affects lipid metabolism, emphasizing its role in maintaining NE homeostasis. The study sheds light on a highly specialized adaptation for rapid mitotic division in Plasmodium, providing insights into NE and MTOC evolution and identifying potential targets for malaria transmission-blocking strategies.

      The authors have utilized an impressive array of techniques, including lipidomics, mass spectrometry, RNA sequencing, and diverse microscopy approaches, to characterize the role of SUN1 deletion during male gametogenesis in Plasmodium.

      Minor comments:

      In the results section (lines 498-499), the authors describe free kinetochores in many cells without associated spindle microtubules. However, some nuclei appear to have kinetochores, as presented in Figure 6. Could the authors clarify how this conclusion was derived using transmission electron microscopy (TEM) without serial sectioning, as this is not explicitly mentioned in the materials and methods?

      Discussion Section:

      Could the authors expand on why SUN1 and ALLAN are not required during asexual replication, even though they play essential roles during male gametogenesis? Lines 611-613 states: "These loops serve as structural hubs for spindle assembly and kinetochore attachment at the nuclear MTOC, separating nuclear and cytoplasmic compartments." Could the authors elaborate on the evidence supporting this statement? In lines 621-622, the authors suggest that ALLAN may have a broader role in NE remodeling across the parasite's lifecycle. Could they reflect on or remind readers of the finding that ALLAN is not essential during the asexual stage?

      Significance

      General assessment:

      The introduction is well-constructed, providing a clear and comprehensive overview of the current understanding of closed mitosis in protozoa and how it differs in Plasmodium parasites. The results are presented clearly and without overstatement, allowing readers to follow the logical progression of the study.

      The knockout (KO) and rescue experiment for Neck4 was particularly innovative, effectively demonstrating the absence of male gametocytes in the SUN1 KO line.

      Impact: This study uncovers how malaria parasites orchestrate one of the fastest cell division processes in biology during male gametogenesis, a critical step for disease transmission. By identifying a novel protein complex, the SUN1-ALLAN axis, that links the nuclear envelope to the machinery organizing cell division, we reveal a unique solution evolved by the parasite to achieve rapid and precise chromosome segregation. This discovery sheds light on how these parasites overcome the lack of proteins commonly found in other organisms, using an entirely distinct strategy to sustain their lifecycle. The findings not only deepen our understanding of the cellular innovations in malaria parasites but also open new avenues for interventions targeting the processes essential for parasite survival and transmission. These insights could contribute to the development of next-generation strategies to combat a disease that continues to impact millions worldwide.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      Summary:

      • In this study, authors investigate the impact of pre-membane (prM) and envelope (E) proteins of tick-borne encephalitis virus (TBEV) on viral distribution and tropism, mostly in the brain.*
      • To do so, authors use high resolution imaging of whole mouse brain after infection by either LGTV, a low pathogenic orthoflavivirus also transmitted by ticks, TBEV, or TBEV/LGTV chimeric virus where prM and E of TBEV are inserted in a LGTV background.*
      • Structural and antigenic characterization of the chimeric virus reveal that it remains a low pathogenic virus exhibiting TBEV structural and antigenic features.*
      • Those viruses are then used to infect wt or mavs -/- mice and viral propagation / tropism is explored, revealing that LGTV and LGTVT:prM predominantly infect cerebral cortex while TBEV infects cerebellum.*
      • Authors work at characterizing their viruses is nicely done and convincing, showing that LGTVT:prM replicated just like LGTV, and exhibited increased viral spread in cellulo.*
      • However LGTVT:prM appear to be less pathogenic in vivo and its brain tropism in mavs -/- mice seems to be similar to wt LGTV virus, stressing the fact that the role of structural proteins prM/E is only modest in TBEV specific tropism to cerebellum.*

      Major comments:

      • It is stated in the introduction that prior work on LGTV/TBEV chimera have already been done, and that both LGTV and LGTV/TBEV are neuroinvasive and neurovirulent in animal models. In this study, both LGTV and LGTVT:prM fails to establish infection in wt mouse model. Were previous published data on LGTV and derivatives also only performed in mavs, or ifnar deficient mice? The previous studies referred to in the manuscript (ref 21 and 23) are both using wt mice of younger age, 3.5 and 3 weeks respectively. It is known that age influences immune status, and some of the experiments in these previous studies are performed in even younger animals (3 to 8 days suckling mice) likely for this specific reason. The different mice strains in these studies may also influence their susceptibility to infection.

      • *While LGTV and LGTVT:prME fails to result in symptomatic infection in wt mice in our study, a certain level of localized infection is likely taking place and the outcome will depend on the immune status of the animals (age/immune deficiencies). What we tried to highlight in the manuscript was that the relative pathogenicity (TBEV/LGTV The fact that the whole "tropism" part of the study is performed in mavs -/- mice limits the impact of the study as escape from innate immune response is central in shaping viral tropism. Authors should advertise more this fact (absent from the abstract) and discuss more the links between LGTV / TBEV and innate immune response (escape mechanisms and NS proteins, implication of prM in controlling MDA5, MAVS)

      Thank you for pointing out the lack of clarity. All the tropism studies, figure 4 and 5, were done in adult WT mice infected i.c. to allow the virus to surpass the initial barrier of peripheral immune response and establish infection in the brain. We have now stressed this in the result section and in the relevant figure legends.

      Minor comments:

      • Figures need some re-working:*

      • Figure 1 :

      • 1D : only the difference between TBEV and LGTVT:prME is shown. Plotting the difference LGTV / LGTVT:prM would be a nice upgrade.* Thank you for this suggestion. However, as there is no statistical difference between LGTV and ChLGTV in Fig 1D we have maintained the figure as originally made.

      • Figure 2 : Numbering in the panels is wrong (2j in the text is 2K, 2H is 2I, ...) and should be corrected. Thank you, this has been corrected in the figure.

      • Figure 3 : Route of infection could be added to figure labels for more clarity. Thank you, we have added this to the figure.

      • Figure 4A : Labelling the Mock panel with areas of concern in the brain(Cerebrum, Cerebellum, ...) would help a lot readers not familiar with brain anatomy. We agree that adding these labels improves the clarity and accessibility of the figure and have added this to 4A.

      • Figure 4 E : images are too small to be convincing. What is staining Iba-1 is not mentioned in the figure legend. Thank you, we have added the explanation that microglia were stained by Iba1 and increased the size of the images in Figure 4. Additionally, co-staining of viral antigens and the neuronal marker UCHL-1 has been added as the new Figure 4E and Iba-1 staining moved to 4F.


      Significance

      Prior studies already described the generation and characterization of TBEV/LGTV chimeric viruses. * The main addition of this paper to the field is the use of impressive high-resolution imaging of whole mouse brains, to explore viral infection and tropism in the brain. * However, presented data remain mostly descriptive, and experiments are performed in a model that may not be optimal to study tropism. As the ability of the virus to escape type I interferon participates to tropism, the fact that infections are only performed in mavs -/- mice limits the relevance of those findings.

      We agree that studying tropism in MAVS-/- mice might be misleading and that is why the whole tropism study was performed in adult WT mice, we have clarified in the text that these data are from WT mice. In addition to the significance of this study in highlighting the respective contribution of structural proteins and the immune response in shaping tropism, this study also provides a __well-characterized chimeric virus __with a safety profile comparable to LGTV while retaining key structural and antigenic features of TBEV, model that has already helped advance studies on flavivirus receptor interactions and structural dynamics.

      Reviewer #2

      Evidence, reproducibility and clarity

      In the manuscript entitled "The influence of the pre-membrane and envelope proteins on structure, * pathogenicity and tropism of tick-borne encephalitis virus" Ebba Rosendal and colleagues present a wealth of data regarding generation and characterisation of a chimeric LGTV virus with TBEV structural proteins, comparing this virus to both LGTV and TBEV across a number of different basic and advanced readouts. They present interesting data regarding the ability of the LGTV-TBEV chimera to spread cell-cell, and the prolonged survival of immunocompromised mice compared with LGTV, which the authors associate with reduced replication in the periphery. As well as an overall increased ability of TBEV to replicate in vitro, and lead to mortality in WT mice in vivo, TBEV was found to be able to infect the cerebellum, whilst this region was rarely infected by LGTV and the chimera. The authors also demonstrate the cross-reactivity of these three viruses via neutralization using serum of TBEV vaccinated individuals.*

      General comment: * In general, I am impressed by the amount of work and breadth of techniques included in this manuscript, which I think speaks to the benefit of multidisciplinary collaboration. However, in my opinion, some points are lacking. My primary concerns lie with the in vivo experiments. The comparison of LGTV and the chimera at the same timepoints isn't ideal as the shift in mortality means these animals are at a different stage of disease at different time points. Whilst this is interesting in itself, it leaves questions about viral titres and tropism of i.p. inoculated animals at end points, in addition to the exclusion of serum titre analysis, the strength of discussion regarding peripheral replication and its potential impact on neuroinvasion/virulence is weakened. Further, claims of neuronal infection are made in figure 4 in total absence of a neuron marker. If the authors wish to claim cell-specific tropism, the cell-specific markers must be included. For figures dependent upon fluorescent imaging, further clarification as to what the AU axes indicate would aid in better interpretation of the data, especially regarding comparison of cerebellar layers for TBEV infection (described in more detail in my specific comments). Finally, In general, I think some opportunities are missed to describe the big picture of potential applicability/impact/translatability of the results obtained, especially the conclusions can be expanded to better highlight this.*

      Thank you for these very relevant comments and suggestions. In line with these, we have now added a later timepoint (8 days) for LGTV:prME in IPS1-/- mice to better understand the kinetics of the chimeric virus at later time points (Figure 3). Additionally, we have added a neuronal marker in figure 4. The explanation of quantification of the fluorescence data is described in detail in the material and method, where the concept of this arbitrary unit (AU) used for quantification is described.

      Specific points: * • Line 67: "It" is a bit of a shaky antecedent - assumedly the authors are referring to tropism, but would be good to state this, as they could also be referring to the underlying mechanisms of pathology. i.e. Tropism is determined by....*

      We agree here and have specified this accordingly.

      • Line 70 - Low pathogenicity in which species? All? Humans? The sentence refers to mice as there has not been any human clinical case with LGTV. We have added that to the text.

      • Line 79 - Strange wording - "and which viral factors influence tropism" is sufficient Corrected accordingly.

      • Line 82 - What does "low pathogenic" mean in this context? Good survivability? No clinical signs? We have clarified in the text that this is referring to similarity to the pathogenicity of LGTV.

      • Line 95: Good to mention in the text the cell type in which the foci are seen We agree, this information has been added to the main text in addition to the figure legend.

      • Line 133 - What is the rationale for the different TBEV strains used? (Kuutsalo-14 here but 93/783 before) We compare the structure of our chimeric virus with the previously published Kuutsalo-14 strain (ref 25). The use of 93/783 in this study is to ensure the same strain of TBEV is used as was used to generate LGTV:prME and to compare the chimeric virus to infectious clones of the parental viruses rescued and passaged in the same way as the chimeric virus itself to ensure differences observed is indeed due to the genetic factors.

      • Line 175/Figure 3 - Why these time points and not later ones for the LGTV chimera? I understand the early time points for replication in the periphery, but would also be good to see brain titres around day 14 when the survival of the chimera inoculated mice decreases quite rapidly. Further, imaging at timepoints at which mortality is somewhat comparable (meaning that virus is likely in the brain) would enable additional readouts to characterise neurovirulence such as cell death markers etc. and allow for a more solid comparative characterisation. Thank you for bringing this to our attention. The figure 3E is displaying data for MAVS-/- mice infected with 10^5 FFU, where the some animals meet end-point criteria already around day 7-9. To address this comment, we have added an additional timepoint at day 8 (seven animals) to explore the trend in viral loads in the brain. However, we refrain from analyzing later time points as this would require a high number of starting animals to ensure adequate numbers surviving to e.g. the suggested day 14, which is not in line with RRR.

      • Interestingly, there is not significant increase in viral loads of LGTV:prME infected animals between day 6 and 8. In line with this, IF imaging analysis of brains from later end-point animals (day 10-14) has shown limited staining of viral antigen in the brain (data not included in manuscript but could be provided to reviewers if requested). This suggests that inflammation is driving the pathology in these animals rather than uncontrolled viral replication. This has also been noted in the text. The tropism and imaging is done in WT mice infected i.c.. and the time/infectious dose has been adjusted to ensure similar clinical manifestation as presented in supplemental Figure 2A. These mice are then euthanized around day 5-6 and processed for brain imaging, line 189.

      • Line 174-182/Figure 3 - Why were serum titres not included in these experiments? These would help to strengthen your argument. (also nice to look at neutralisation in this context, though maybe not essential thanks to your data in figure 2). Viral serum titers have been analyzed previously in MAVS-/- mice in Kurhade et al 2016, and they are high at day 2 and go down to almost detection limit day 4, meaning earlier drop than in peripheral organ and was not included in these experiments. For neutralization, the included time points for the experiments in Figure 3E-H the time points are too short for robust detection of IgG antibody responses.

      • Line 183 - Good to overtly state that this is via i.c. inoculation and the justification for use of this route, and that the mice are assumedly WT. I understand LGTV struggles to get to the brain in mice, but is this representative of how neurotropism looks in animals inoculated via a more "natural" route for TBEV? We appreciate the comment and we have clarified that WT mice are i.c. inoculated. Since we wanted to compare the three viruses, we needed to use an inoculation route that is working for all three viruses. While the tropism after peripheral infection of TBEV is a very interesting question, it remains outside the scope of this study as this cannot be compared with LGTV in WT mice.

      • Figure 4B - What could account for the large variation seen in the TBEV group? This is a very good question that is difficult to answer. Although these are inbred mice, we have previously seen that there are differences in infection rate between different mice using whole brain imaging (Chotiwan et al 2023).

      • Line 200-201 - This image doesn't answer the question of tropism, but contributes to that of microglial activation. A neuronal marker should be included to surmise the cell type infected, rather than using staining for a viral protein to indicate cell morphology/type. Also, the justification for use of the microglial marker over neuronal is lacking, especially as microglia are not mentioned anywhere in the discussion. Also, see suggestion regarding cell death markers above. Thank you for this suggestion we have added a neuronal marker. We have also clarified in the text that we confirm the infection pattern in rhinal cortex with confocal microscopy. Microglia activation has been added to the discussion.

      • Line 203/Figure 4E - Are these images quantifiable? Are any differences observed between the viruses? Quantification of microglial activation is sensitive to imaging quality and area of imaging and requires large sample sets to ensure validity in the conclusions. Here we do not observe any clear differences nor claim that the microglia activation is different between the different viral strains.

      • Line 210 - Bit strange to mention figure 4D again after figure 4E, and I also couldn't spot reference to figure 4F? Thank you for pointing this out the Figure 4D should be Figure 4E, this has been corrected.

      • Are both figures 5A and 5C required for the message you wish to get across? I would suggest either only use 5C or only include the white matter/grey matter comparison for TBEV, in combination with 5A. Thank you we have now removed the mock, LGTV and LGTVT:prME from fig 5C to more clearly communicate the message of difference in infection between GM and WM for TBEV specifically.

      • Figure 5D: does the method of quantification you use/the conclusions you arrive at account for cell size/number? The Purkinje cell bodies are very large and the virus signal in these cells looks saturated - however within the granular layer the nuclei are much smaller but have what seem like large foci of NS5 positivity. Though the overall signal is likely much lower, how does relative distribution look when you account for cell size/number or a binary positive/negative quantification? Relatedly, does the primary anti-NS5 antibody have the same affinity for both LGTV and TBEV NS5? The quantification of OPT in figure 5C is not at the level of single cell resolution but rather virus signal over mock. We agree the cells in the cerebellum has different sizes but we do not claim that the Purkinje layer is more infected compared to the granular cell layer, only that Purkinje cells are infected which is relevant in human TBE.

      NS5 antibody is raised against a peptide in the TBEV NS5 protein which is highly conserved. The aa identity between TBEV and LGTV is 93%, we have not seen a difference in the staining between the different viruses using this antibody.

      • Line 242: Please clarify what you mean by "higher infection" - higher titres? Higher fluorescent signal? We have added "as measured by stronger fluorescent signal" to better explain what we mean with higher infection.

      • Line 242: Can you really say anything about replication here? Infection, yes, but the AU readout and lack of multiple time points doesn't allow for much of an insight into replication, especially when TBEV was left out of the comparison in figure 3F, though even this did not look at live virus. We have changed the wording to infected cells.

      • Line 269-271: Exactly what I was wondering and maybe worth discussing a bit more - is there appropriate literature that you could cite here? We were unsure about the specific concern raised by the reviewer in this comment and, therefore, have not made any changes. If the reviewer could clarify their request, we would be happy to address it accordingly.

      • Line 274-275: Also mosquito borne viruses. See nice paper related to impact of TBEV vaccination on ADE for mosquito borne flaviviruses. Very interesting and would increase the impact of this point. https://doi.org/10.1038/s41467-024-45806-x Thank you for this suggestion we have added this point into the discussion.

      • Line 290-291: Are clinical signs associated with cerebellar injury common for TBEV patients? i.e. does this have translatability to human disease and diagnosis? We have now added some information about cerebellum symptoms in human TBE infection to the discussion.

      • Line 308 conclusions; Your point about the potential use of the chimera for vaccine research/to understand cross-reactivity is worth reiterating here, and potentially something about "highlighting the role of non-structural proteins on tropism determination" Thank you for these suggestions we have now added these aspects in the conclusions.

      • Methods: whilst I realise the statistics are described in the figure legends, it is usually customary to include a short statistics section in the methods to indicate which program was used and why certain statistical tests were chosen, e.g. in figure 1 you use both parametric and non-parametric testing. Thank you for this suggestion. We have added a section describing the statistics in the methods.

      Significance

      Broad ranging characterisation of a novel chimera which has potential applications for vaccine/cross-reactivity research and highlights a key role of non-structural proteins in the determination of viral fitness and tropism. Some limitations regarding cell-specific tropism and kinetics of neuroinvasion and neurovirulence. Likely of interest for basic researchers from range of disciplines within arbovirology.

      • Expertise: arboviruses, imaging, neurovirulence, animal models*
      • Limited expertise: in-depth structural biology, therefore my comments on figure 2 are limited.*

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): * SUMMARY: The authors generated an LGTV chimeric virus harboring the prM and ectodomain of E from TBEV. Aim of the study is to understand how the virals structural proteins influence the distribution and tropism of the virus in the brain. They solved the atomic structures of LGTV and the chimeric virus demonstrating that the chimeric virus is structurally and antigenically similar to TBEV. In vivo experiments demonstrate that the chimeric virus is less pathogenic than LGTV. Finally using 3D whole brain OPT imaging techniques the authors demonstrate that the three viruses show a similar viral distribution in cerebral cortex with the rhnial cortex being the primary site of cortical infection for all viruses. In general TBEV exhibit higher infection rates and is more widespread in the brain, particularly in cerebellum, compared to LGTV and the chimeric virus. The authors concluded that the distribution and tropism of LGTV and TBEV are not solely dependent on receptor tropism. *

      MAJOR COMMENTS: * The conclusions are supported by the data.*

      • However, I think the work can be improved if the authors investigate the differences in the antiviral response induced by the chimeric virus compared to LGTV. The authors speculate that the non-structural proteins may play a role in shaping tropism, likely through their immunomodulating role. These data become especially important if you consider that in the experiments of fig 1 the chimeric virus behave similar to the LGTV wt with even an advantage in cell-to-cell spread but in the in vivo experiments with MAVS-/- mice the chimeric virus behave differently, being less pathogenic than LGTV suggesting that the chimeric virus could not escape the antiviral response even in MAVS-/- condition. We thank the reviewer for this suggestion. In line with this we have now added Ifnb1 and Rsad2 RNA levels in different peripheral organs and we see that early on in infection most mice infected with LGTVT:prME show higher upregulation of these genes. These data have been added as a new panel F and G in figure 3.

      • Moreover, in the discussion, line 270 the authors speculate that the observed attenuation could also be due to sub-optial interactions between TBEV prM and C and transmembrane domain of LGTV E. I think it is important to explain and justify why they decided to do not include C protein of TBEV in the chimeric virus, as well as the transmembrane domain of E. The rational for not using the C protein of TBEV is that we did not want to reduce the RNA to C interaction which, could affect the packaging or encapsidation. In line with this, previous research on chimeric flaviviruses has shown that exchanging the prM-E proteins are usually well tolerated while exchanging the C-protein may lead to attenuation or even failure to rescue the virus.

      • Finally, the authors first used A549 cells for studying the kinetics and viral spread of the chimeric virus in vitro. Than they switch to A549-/- cells for studying structure and antigenicity. The different pathogenicity was assessed in Mavs-/- mice but lastly they used mice WT for the 3D whole brain OPT imaging. I found this discrepancy confusing. The authors should justify, including the explanation in the text, why they switch from WT to A549-/- from experiment to experiment. A549 cells were used in the spread and kinetic study because it is an IFN competent cell type which TBEV and LGTV grows well in. The structural studies were performed in A549 MAVS cells because the lack of MAVS results in higher virus titers. The ability of these cells to produce large amount of virus while grown without serum greatly facilitated the purification protocols for cry-EM and mass spectrometry analysis. This has been highlighted in the text of both the material and method and very briefly in the result.

      The pathogenicity with peripheral infection can only be done with MAVS-/- mice as they are more sensitive to LGTV and it is a lethal model. Adult WT mice are resistant to LGTV infection i.p.. As the immune response is important in shaping the tropism, a direct comparison of the viruses is best analyzed in a WT mouse model.

      MINOR COMMENTS:

        • Line 96 - "recombinant parental LGTV" and "recombinant TBEV", the word recombinant is misused in the sentence.* We have removed recombinant.
      • Line 143-144-145 - I believe the authors are referring to Fig 2I and not 2H as written. Moreover, the authors should clarify if all the experiemtns of fig 2 have been performed in A549-/- cells or only the one of fig 2I All experiments in figure 2 are performed in A549 MAVS-/- as highlighted in the material and methods.

      • Line 158 - to be change "Fig 2I" with "fig 2J" Corrected

      • Line 159 - as above: fig 2J to be change with figure 2k Corrected

      *Significance: *

      • The authors designed a chimeric low pathogenic model virus to study the importance of the structural proteins in determing viral tropism and pathogenicity. The strengths of this work is that they combined the use of the chimeric virus with in vivo experiments and 3D whole brain OPT imaging. Integrating together these tools and assays the authors provided an example of complete investigation method for studying neuroinvasive viruses. *

      • My field of expertise: virus-host interaction, at molecular level.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: The authors generated an LGTV chimeric virus harboring the prM and ectodomain of E from TBEV. Aim of the study is to understand how the virals structural proteins influence the distribution and tropism of the virus in the brain. They solved the atomic structures of LGTV and the chimeric virus demonstrating that the chimeric virus is structurally and antigenically similar to TBEV. In vivo experiments demonstrate that the chimeric virus is less pathogenic than LGTV. Finally using 3D whole brain OPT imaging techniques the authors demonstrate that the three viruses show a similar viral distribution in cerebral cortex with the rhnial cortex being the primary site of cortical infection for all viruses. In general TBEV exhibit higher infection rates and is more widespread in the brain, particularly in cerebellum, compared to LGTV and the chimeric virus. The authors concluded that the distribution and tropism of LGTV and TBEV are not solely dependent on receptor tropism.

      Major Comments: The conclusions are supported by the data.

      However, I think the work can be improved if the authors investigate the differences in the antiviral response induced by the chimeric virus compared to LGTV. The authors speculate that the non-structural proteins may play a role in shaping tropism, likely through their immunomodulating role. These data become especially important if you consider that in the experiments of fig 1 the chimeric virus behave similar to the LGTV wt with even an advantage in cell-to-cell spread but in the in vivo experiments with MAVS-/- mice the chimeric virus behave differently, being less pathogenic than LGTV suggesting that the chimeric virus could not escape the antiviral response even in MAVS-/- condition.

      Moreover, in the discussion, line 270 the authors speculate that the observed attenuation could also be due to sub-optial interactions between TBEV prM and C and transmembrane domain of LGTV E. I think it is important to explain and justify why they decided to do not include C protein of TBEV in the chimeric virus, as well as the transmembrane domain of E.

      Finally, the authors first used A549 cells for studying the kinetics and viral spread of the chimeric virus in vitro. Than they switch to A549-/- cells for studying structure and antigenicity. The different pathogenicity was assessed in Mavs-/- mice but lastly they used mice WT for the 3D whole brain OPT imaging. I found this discrepancy confusing. The authors should justify, including the explanation in the text, why they switch from WT to A549-/- from experiment to experiment.

      Minor comments:

      Line 96 - "recombinant parental LGTV" and "recombinant TBEV", the word recombinant is misused in the sentence.

      Line 143-144-145 - I believe the authors are referring to Fig 2I and not 2H as written. Moreover, the authors should clarify if all the experiemtns of fig 2 have been performed in A549-/- cells or only the one of fig 2I

      Line 158 - to be change "Fig 2I" with "fig 2J"

      Line 159 - as above: fig 2J to be change with figure 2k

      Significance

      The authors designed a chimeric low pathogenic model virus to study the importance of the structural proteins in determing viral tropism and pathogenicity. The strengths of this work is that they combined the use of the chimeric virus with in vivo experiments and 3D whole brain OPT imaging. Integrating together these tools and assays the authors provided an example of complete investigation method for studying neuroinvasive viruses.

      My field of expertise: virus-host interaction, at molecular level.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript entitled "The influence of the pre-membrane and envelope proteins on structure, pathogenicity and tropism of tick-borne encephalitis virus" Ebba Rosendal and colleagues present a wealth of data regarding generation and characterisation of a chimeric LGTV virus with TBEV structural proteins, comparing this virus to both LGTV and TBEV across a number of different basic and advanced readouts. They present interesting data regarding the ability of the LGTV-TBEV chimera to spread cell-cell, and the prolonged survival of immunocompromised mice compared with LGTV, which the authors associate with reduced replication in the periphery. As well as an overall increased ability of TBEV to replicate in vitro, and lead to mortality in WT mice in vivo, TBEV was found to be able to infect the cerebellum, whilst this region was rarely infected by LGTV and the chimera. The authors also demonstrate the cross-reactivity of these three viruses via neutralisation using serum of TBEV vaccinated individuals.

      General comment:

      In general, I am impressed by the amount of work and breadth of techniques included in this manuscript, which I think speaks to the benefit of multidisciplinary collaboration. However, in my opinion, some points are lacking. My primary concerns lie with the in vivo experiments. The comparison of LGTV and the chimera at the same timepoints isn't ideal as the shift in mortality means these animals are at a different stage of disease at different time points. Whilst this is interesting in itself, it leaves questions about viral titres and tropism of i.p. inoculated animals at end points, in addition to the exclusion of serum titre analysis, the strength of discussion regarding peripheral replication and its potential impact on neuroinvasion/virulence is weakened. Further, claims of neuronal infection are made in figure 4 in total absence of a neuron marker. If the authors wish to claim cell-specific tropism, the cell-specific markers must be included. For figures dependent upon fluorescent imaging, further clarification as to what the AU axes indicate would aid in better interpretation of the data, especially regarding comparison of cerebellar layers for TBEV infection (described in more detail in my specific comments). Finally, In general, I think some opportunities are missed to describe the big picture of potential applicability/impact/translatability of the results obtained, especially the conclusions can be expanded to better highlight this.

      Specific points:

      • Line 67: "It" is a bit of a shaky antecedent - assumedly the authors are referring to tropism, but would be good to state this, as they could also be referring to the underlying mechanisms of pathology. i.e. Tropism is determined by....
      • Line 70 - Low pathogenicity in which species? All? Humans?
      • Line 79 - Strange wording - "and which viral factors influence tropism" is sufficient
      • Line 82 - What does "low pathogenic" mean in this context? Good survivability? No clinical signs?
      • Line 95: Good to mention in the text the cell type in which the foci are seen
      • Line 133 - What is the rationale for the different TBEV strains used? (Kuutsalo-14 here but 93/783 before)
      • Line 175/Figure 3 - Why these time points and not later ones for the LGTV chimera? I understand the early time points for replication in the periphery, but would also be good to see brain titres around day 14 when the survival of the chimera inoculated mice decreases quite rapidly. Further, imaging at timepoints at which mortality is somewhat comparable (meaning that virus is likely in the brain) would enable additional readouts to characterise neurovirulence such as cell death markers etc. and allow for a more solid comparative characterisation.
      • Line 174-182/Figure 3 - Why were serum titres not included in these experiments? These would help to strengthen your argument. (also nice to look at neutralisation in this context, though maybe not essential thanks to your data in figure 2)
      • Line 183 - Good to overtly state that this is via i.c. inoculation and the justification for use of this route, and that the mice are assumedly WT. I understand LGTV struggles to get to the brain in mice, but is this representative of how neurotropism looks in animals inoculated via a more "natural" route for TBEV?
      • Figure 4B - What could account for the large variation seen in the TBEV group?
      • Line 200-201 - This image doesn't answer the question of tropism, but contributes to that of microglial activation. A neuronal marker should be included to surmise the cell type infected, rather than using staining for a viral protein to indicate cell morphology/type. Also, the justification for use of the microglial marker over neuronal is lacking, especially as microglia are not mentioned anywhere in the discussion. Also, see suggestion regarding cell death markers above.
      • Line 203/Figure 4E - Are these images quantifiable? Are any differences observed between the viruses?
      • Line 210 - Bit strange to mention figure 4D again after figure 4E, and I also couldn't spot reference to figure 4F?
      • Are both figures 5A and 5C required for the message you wish to get across? I would suggest either only use 5C or only include the white matter/grey matter comparison for TBEV, in combination with 5A.
      • Figure 5D: does the method of quantification you use/the conclusions you arrive at account for cell size/number? The Purkinje cell bodies are very large and the virus signal in these cells looks saturated - however within the granular layer the nuclei are much smaller but have what seem like large foci of NS5 positivity. Though the overall signal is likely much lower, how does relative distribution look when you account for cell size/number or a binary positive/negative quantification? Relatedly, does the primary anti-NS5 antibody have the same affinity for both LGTV and TBEV NS5?
      • Line 242: Please clarify what you mean by "higher infection" - higher titres? Higher fluorescent signal?
      • Line 242: Can you really say anything about replication here? Infection, yes, but the AU readout and lack of multiple time points doesn't allow for much of an insight into replication, especially when TBEV was left out of the comparison in figure 3F, though even this did not look at live virus.
      • Line 269-271: Exactly what I was wondering and maybe worth discussing a bit more - is there appropriate literature that you could cite here?
      • Line 274-275: Also mosquito borne viruses. See nice paper related to impact of TBEV vaccination on ADE for mosquito borne flaviviruses. Very interesting and would increase the impact of this point. https://doi.org/10.1038/s41467-024-45806-x
      • Line 290-291: Are clinical signs associated with cerebellar injury common for TBEV patients? i.e. does this have translatability to human disease and diagnosis?
      • Line 308 conclusions; Your point about the potential use of the chimera for vaccine research/to understand cross-reactivity is worth reiterating here, and potentially something about "highlighting the role of non-structural proteins on tropism determination"
      • Methods: whilst I realise the statistics are described in the figure legends, it is usually customary to include a short statistics section in the methods to indicate which program was used and why certain statistical tests were chosen, e.g. in figure 1 you use both parametric and non-parametric testing.

      Significance

      Broad ranging characterisation of a novel chimera which has potential applications for vaccine/cross-reactivity research and highlights a key role of non-structural proteins in the determination of viral fitness and tropism. Some limitations regarding cell-specific tropism and kinetics of neuroinvasion and neurovirulence. Likely of interest for basic researchers from range of disciplines within arbovirology.

      Expertise: arboviruses, imaging, neurovirulence, animal models

      Limited expertise: in-depth structural biology, therefore my comments on figure 2 are limited.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, authors investigate the impact of pre-membane (prM) and envelope (E) proteins of tick-borne encephalitis virus (TBEV) on viral distribution and tropism, mostly in the brain.

      To do so, authors use high resolution imaging of whole mouse brain after infection by either LGTV, a low pathogenic orthoflavivirus also transmitted by ticks, TBEV, or TBEV/LGTV chimeric virus where prM and E of TBEV are inserted in a LGTV background. Structural and antigenic characterization of the chimeric virus reveal that it remains a low pathogenic virus exhibiting TBEV structural and antigenic features. Those viruses are then used to infect wt or mavs -/- mice and viral propagation / tropism is explored, revealing that LGTV and LGTVT:prM predominantly infect cerebral cortex while TBEV infects cerebellum.<br /> Authors work at characterizing their viruses is nicely done and convincing, showing that LGTVT:prM replicated just like LGTV, and exhibited increased viral spread in cellulo. However LGTVT:prM appear to be less pathogenic in vivo and its brain tropism in mavs -/- mice seems to be similar to wt LGTV virus, stressing the fact that the role of structural proteins prM/E is only modest in TBEV specific tropism to cerebellum.

      Major comments:

      • It is stated in the introduction that prior work on LGTV/TBEV chimera have already been done, and that both LGTV and LGTV/TBEV are neuroinvasive and neurovirulent in animal models. In this study, both LGTV and LGTVT:prM fails to establish infection in wt mouse model. Were previous published data on LGTV and derivatives also only performed in mavs, or ifnar deficient mice?

      The fact that the whole "tropism" part of the study is performed in mavs -/- mice limits the impact of the study as escape from innate immune response is central in shaping viral tropism. Authors should advertise more this fact (absent from the abstract) and discuss more the links between LGTV / TBEV and innate immune response (escape mechanisms and NS proteins, implication of prM in controlling MDA5, MAVS)

      Minor comments:

      Figures need some re-working :

      Figure 1 :

      1D : only the difference between TBEV and LGTVT:prM is shown. Plotting the difference LGTV / LGTVT:prM would be a nice upgrade.

      Figure 2 : Numbering in the panels is wrong (2j in the text is 2K, 2H is 2I, ...) and should be corrected.

      Figure 3 : Route of infection could be added to figure labels for more clarity.

      Figure 4A : Labelling the Mock pannel with areas of concern in the brain(Cerebrum, Cerebellum, ...) would help a lot readers not familiar with brain anatomy.

      Figure 4 E : images are too small to be convincing. What is staining Iba-1 is not mentioned in the figure legend.

      Significance

      Prior studies already described the generation and characterization of TBEV/LGTV chimeric viruses. The main addition of this paper to the field is the use of impressive high-resolution imaging of whole mouse brains, to explore viral infection and tropism in the brain.

      However, presented data remain mostly descriptive, and experiments are performed in a model that may not be optimal to study tropism. As the ability of the virus to escape type I interferon participates to tropism, the fact that infections are only performed in mavs -/- mice limits the relevance of those findings.

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The authors describe a genome-wide CRISPR screen in mouse ES cells to identify factors and genes that regulate positively and negatively FGF/ERK signaling during differentiation. Out of known and potentially novel regulating signals, Mediator subunit Med12 was a strong hit in the screen and it was clearly and extensively shown by that the loss of Med12 results in impaired FGF/ERK signal responsiveness, modulation of mRNA levels and disturbed cell differentiation leading to reduced stem cell plasticity.<br /> This is a very concise and well written manuscript that demonstrates for the first time the important role of Med12 in ES cells and during early cell differentiation. The results support data that had been previously observed in Med12 mouse models and in addition show that Med12 cooperates with various signaling systems to control gene expression during early lineage decision.

      We thank the reviewer for their positive evaluation of our work.

      Fig. 3 Supp1A-B:<br /> The loci of all three independent Med12 mutant clones and the absence of Med12 should be included. Are all three Med12 loss-of-function mutants?

      In the revised version of the manuscript, we have updated the scheme in Fig. 3 Supp 1A to represent both deletions that were obtained with the CRISPR guides used. Both the more common 97 bp deletion as well as the 105 bp deletion that occurred in one clonal line result in a complete loss of the protein on the western blot (Fig. 3 Supp. 1B), suggesting that all mutant clones used for further experiments are loss-of-function mutants.

      Minor:<br /> Line 466: Should be Fig. 6F, not 6E.

      We have removed this figure panel and the corresponding text in response to the other reviewers' comments.

      Reviewer #1 (Significance):

      The CRISPR screen identified list of some novel interesting factors that regulate FGF/ERK signaling in ES cells. Med12 was then analyzed in very detail on various levels and under various differentiation conditions, resulting in a complex picture how Med12 controls stem cell plasticity. These data support results observed in mouse models and identified novel regulating mechanisms of Med12.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In the manuscript "Med12 cooperates with multiple differentiation signals to enhance embryonic stem cell plasticity" Ferkorn and Schröter report on the role of Med12 in mouse embryonic stem cells. The perform an elegant genetic screen to identify regulators of Spry4 in mouse ESCs, screening for mutations that increase and decrease Spr4-reporter expression in serum/LIF conditions. They find that Med12 deletion results in defects in the exit from naïve pluripotency and in PrE-formation upon Gata-TF overexpression. Using scRNAseq experiments they report a reduction in biological noise in Med12 KO cells differentiating towards PrE upon Gata6 OE.

      Major points:<br /> 1) The title might not exactly reflect the scientific findings of the manuscript. There is little direct evidence for a decrease in plasticity upon Med12 depletion.

      We have changed the title to "Med12 cooperates with multiple differentiation signals to facilitate efficient lineage transitions in embryonic stem cells". In addition, we have toned down claims that Med12 regulates plasticity throughout the manuscript.

      2) Fig 1G: From the data provided it is not entirely clear how well screen results can be validated. Did some of the mutants identified in the screen also produce no detectable phenotypes? What would be the phenotype of knocking out an unrelated gene? In other words, are some of the weak phenotypes really showing Spry4 downregulation or are they withing the range of biological variance?

      Fluorescence levels in Fig. 1G have been normalized to control wild-type cells (dashed red line). Absence of a detectable phenotype would have resulted in normalized fluorescence values around 1. Fluorescence values of all tested mutants were significantly different from 1, as indicated in the statistical analysis given in the figure legend. Furthermore, H2B-Venus fluorescence of cells transfected with a non-targeting control vector are shown in Fig. 1F, and are not different from that of untransfected control wild-type cells. We have now added an explicit explanation how we normalized the data to the figure legend of Fig. 1G, and hope that this addresses the reviewer's concern.

      3) Rescue experiments by re-expressing Med12 in Med12 KO ESCs are missing. Can the differentiation and transcriptional phenotypes be rescued?

      We agree with the reviewer that a rescue experiment re-expressing Med12 would be ideal to ensure that the observed phenotypes are specifically due to loss of Med12. However, we could not identify commercially available full-length Med12 cDNA clones. Even though we managed to amplify full-length Med12 cDNA after reverse transcription, we were unable to clone it into expression vectors. These observations suggest that specific properties of the Med12 cds make the construction of expression vectors by conventional means difficult, and solving these issues is beyond the scope of this study.

      Throughout the study we used multiple independent clonal lines in multiple experimental readouts and obtained congruent results. The reduced expression of pluripotency genes for example was observed in bulk sequencing of the lines introduced in Fig. 3, and by single-cell sequencing of independently generated _Med12-_mutant GATA6-mCherry inducible lines (Fig. 5 Supp. 1B). We argue that this congruence makes it unlikely that the results are dominated by off-target effects.

      4) L365: The subheading "Transitions between embryonic... buffered against loss of Med12" is confusing. The data simply shows that Med12 KOs can still, albeit less efficiently generate PrE upon Gata TF OE. Is there evidence for some active buffering? I think the authors could simply report the data as is, stating that the phenotypes are not a complete block but an impairment of differentiation.

      Prompted by the reviewer's comment as well as remarks along similar lines by reviewer #4, we have completely reorganized this section and now present all the analysis pertaining to PrE differentiation in a new figure 4. In the revised text (lines 316 - 378), we refrain from any speculations about possible buffering and simply report the data as is, as suggested by the reviewer.

      5) L386: Would it not make more sense to reduce dox concentrations in control cells to equalize Gata6 OE to equalize levels between Med12 KO and controls? A shorter pulse of Gata6 does not really directly address unequal expression levels due to loss of Med12. Different pulse length of OE might have consequences that the authors do not control for. This also impacts scRNAseq experiments which suffer from the same, in my opinion, suboptimal experimental setup. This is a point that needs to be addressed.

      We agree with the reviewer that it would have been desirable to equalize GATA6 overexpression levels between wild-type and Med12-mutant cells while keeping induction time the same. In our experience however, reducing the dox concentration is not suitable to achieve this: Rather than reducing transgene expression levels across the board, lower dox concentrations tend to increase the variability within the population - see Fig. 2 in PMID: 16400644 for an example. Since we agree with the reviewer that the setup of the scRNAseq experiment limits our ability to draw conclusion regarding the separation of cell states, we have decided remove these analyses in the revised manuscript. In doing so, we have reorganized the previous figures 5 and 6 into a new single figure 4. This has made the manuscript more concise and allowed us to focus on the main phenotype of the Med12 mutant cells, namely their delayed exit from pluripotency.

      6) The reduced transcript number in Med12 KOs is interesting, but how does it come about. Is there indeed less transcriptional activity or is reduced transcript numbers a side effect of slower growth or the different cell states between WT and Med12 mutants. Appropriate experiments to address this should be performed.

      To address this point, we have performed EU labeling experiments, to compare RNA synthesis rates between wild-type and Med12-mutant during the exit from pluripotency. These experiments confirmed an increase in the mRNA production upon differentiation for both wild-type and Med12 mutant cells, but the method was not sensitive enough to detect any differences between wild-type and Med12 mutant cells within the same condition. The EU labeling thus supports the notion that overall transcriptional rate increases during differentiation, but leaves open the possibility that reduced mRNA levels in Med12 mutant cells arise from effects other than reduced transcriptional output. These new analyses areshown in Fig. 4 Supp. 3 and described in the main text in lines 373 - 378.

      7) I the proposed reduction of biological noise a feature of the PrE differentiation experiments or can it also be observed in epiblast differentiation.

      To address this question, we have carried out single-cell measurements of Spry4 and Nanog mRNA numbers to compare transcriptional variability between wild-type and Med12-_mutant cells during epiblast differentiation (new Fig. 3 Supp. 1G, H). These measurements confirmed the differences between genotypes in mean expression levels detected by RNA sequencing. However, this analysis did not reveal strong differences in mRNA number distributions. Furthermore, as discussed in point 6 above, our interpretations of noise levels in the PrE differentiation paradigm could have been influenced by the unequal GATA6 induction times. Finally, reviewer #4 pointed out that 10x genomics scRNAseq is not ideal to compare noise levels when total mRNA content differ between samples, as is the case in our dataset. We therefore decided to tone down our conclusions regarding altered noise levels in _Med12-mutant cells.

      8) I cannot follow the authors logic that Med12 loss results in enhanced separation between lineages. How is this experimentally supported.

      As discussed in point 6 above, this result could have been influenced by the unequal induction times between wild type and Med12-mutant cells. We have therefore decided to remove this analysis in the revised version of the manuscript.

      Minor points:<br /> Fig 3, Supp1 A: What exactly are the black and blue highlighted letters?

      The black and blue highlighted letters indicate whether bases are part of an intron or an exon. Exon 7 is now explicitly labelled in the figure, and the meaning of the highlighting is explained in the figure legend.

      Reviewer #2 (Significance):

      Overall, this is an interesting study. The screen has been performed to a high technical standard and differentiation defects were appropriately analyzed. The manuscript has some weaknesses in investigating the molecular mode of action of Med12 which could be improved to provide more significant insights.

      Reviewer #3 (Evidence, reproducibility and clarity):

      The authors sought to identify genes important for the transcriptional changes needed during mouse ES cell differentiation. They identified a number of genes and focussed on Med12, as it was the strongest hit from a cluster of Mediator components.

      Using knockout ES cells, differentiation assays, bulk and scRNAseq, they clearly show that Med12 is important for transgene activation and for gene activation generally during exit from self-renewal, but it is not specifically influencing differentiation efficacy per se. Rather, cells lacking Med12 display "a reduced ability to react to changing culture conditions" and, by inference, to environmental changes. They conclude that Med12 "contributes to the maintenance of cellular plasticity during differentiation and lineage transitions."

      Med12 is a structural component of the kinase module of Mediator, but it is not clear what this study tells us about Mediator function. The authors state that their results contrast with those obtained using a Cdk8 inhibitor, which resulted in increased self-renewal (lines 577-580). I'm not sure where their results show "...that loss of Med12 leads to reduced pluripotency." (lines 579-580). They do not test potency of these cells. There is reduced expression of some pluripotency-associated markers and fewer colonies formed in a plating assay, but these assays to not test cellular potency.

      We agree with the reviewer that our RNA sequencing and colony formation assays do not exhaustively test cellular potency. We have therefore changed the wording in the paragraphs that describe these assays and now talk about "reduced pluripotency gene expression" (e.g. lines 20, 228, 461, 512).

      While their phenotype certainly appears different from that reported in cells treated with Cdk8 inhibitor, it's not clear to me what to make of it, or what it might tell us about the function of the Mediator Kinase module or of Mediator. That a co-activator is important for gene expression in general, or even for gene activation upon receipt of some signal, is not really surprising.

      We believe that reporting differences in the phenotypes obtained with Cdk8 inhibition versus knock-out of Med12 is relevant, because it yields new insight into the different functions that the components of the Mediator kinase module have in pluripotent cells. We have previously discussed possible reasons for these functional differences (discussion line 519 - 528), and further expand on them in the revised manuscript.

      Minor points:

      It is surprising they don't relate their work to that of Hamilton et al (https://doi.org/10.1038/s41586-019-1732-z) who conclude that differentiation from the ES cell state towards primitive endoderm is compromised without Med24.

      Thank you for pointing out this omission. We now cite the work of Hamilton et al., in line 317 (related to new Fig. 4) and 537 - 538 in the discussion.

      Stylistic point: please make the separation between paragraphs more obvious. With no indentation or extra spacing between paragraphs it looks like one solid mass of words.

      Reviewer #3 (Significance):

      There is a lot of careful work here, but I'm not getting a big conclusion here. Perhaps the authors could argue their main points somewhat more stridently and what we've learned beyond this current system.

      Prompted by the reviewer's comment, we have re-organized the functional analyses of Med12 function in the manuscript by condensing the previous figures 5 and 6 into a new single figure 4. We have removed all discussions of transcriptional noise and plasticity, and now focus more strongly on the slowed pluripotency transitions as the main phenotype of the Med12 mutant cells. These changes make the manuscript more concise, and we hope that they help to deliver a single, clear message to the reader.

      Reviewer #4 (Evidence, reproducibility and clarity):

      Fernkorn and Schröter report the results of a screen in mESCs based on modulation of the fluorescent intensity of the Spry4:H2B-Venus reporter. They identify candidate genes that both positively and negatively modulate the expression of the reporter. Amongst those, are several known regulators of the FGF pathway (transcriptional activator of Spry4) that serve as a positive control for the screen. The manuscript focuses on characterisation of Med12, and the authors conclude that Med12 does not specifically affect FGF-targets. Paradoxically, the authors show that based on the expression of key naïve markers Med12 cells show delayed differentiation. Functionally, however, Med12 mutant cells at 48hrs can form less colonies when plated back in naïve conditions (that would normally indicate accelerated differentiation ). The authors conclude that Med12 mutants have "a reduced ability to react to changing culture conditions". Next, they examine the Med12 mutation affects embryonic/extraembryonic differentiation using an inducible Gata6 expression system. They show that transgene induction is slower and dampened in mutant cells and that overall the balance of fates is skewed towards embryonic cells. Finally, they use single cell RNA sequencing and observe differences in the number of mRNAs detected, as well as the separation between clusters in the mutant cells. They conclude that the mutants have reduce transcriptional noise levels.

      Overall, it was an interesting article exploring the molecular consequences of knocking out a subunit of the mediator complex. The characterisation focuses primarily on the description of the screen and the more functional consequences of the KO, rather than delving onto the molecular aspects (e.g. whether mediator complex assembly is affected, or it's binding etc). The analysis of the transcriptional noise will be of particular interest to the community, although I have some suggestions to exclude the possibility that the analysis simply reflects changes in global transcription levels. I have a small number of concerns and requests for clarification on the data but all of them should be relatively easy to address.

      Mayor points:

      • Med12, transcription levels and noise (Figure 6G, J-L). This is an intriguing observation. The labelling and multiplexing helped resolve many of the issue typically associated with comparing 10x dataset. I have two observations about this analysis:<br /> 1) Clarify how number of mRNA counts per cell is calculated (figure 6F) - the methods only described a value normalised by the total number of counts per cell.

      The mRNA counts shown in the figure correspond to the raw number of UMIs detected per cell. We now explicitly state this in the figure legend. Please note that after re-organizing the manuscript, former Fig. 6F has become Fig. 4 Supp. 3A.

      I feel this observation is key and has repercussions for the interpretation of the data (see point below) and should be independently validated (although I recognise it's difficult!). Since the authors observed differences in a randomly integrated transgene (iGata experiments), it's possible/likely that the dysregulation of transcription output is more generic. A possible suggestion is measuring global mRNA synthesis and degradation rates, either using inhibitors or by adding modified nucleotides and measuring incorporation rate and loss through pulse/chase labelling.

      We have performed an EU labeling experiment to address this point, which is shown in Fig. 4 Supp. 3 and described in the main text in lines 373 - 378 of the revised manuscript. Please refer to our response to reviewer #2, point 6 for a short description of the results.

      2) 10x is not the ideal for looking at heterogeneity/noise since it has a low capture efficiency and there are a lot of gaps/zeros in the lower expression range. Therefore, it's simply possible that mutant cells have dampened transcriptional output, meaning lowly expressed genes which in the WT contribute to the apparent heterogeneity (because there is a higher chance of not being captured), are below the 10x detection range in the mutant. This can be seen by plotting the cumulative sum of the mean gene count across each sample - the 50% mark (=mean gene count at 50% detection) reflects a measure of the "capture efficiency" (either because of technical reasons or lower mRNA input). Generally (e.g. also seen across technical repeats), the mean coefficient of variation, entropy and other measures of population heterogeneity directly scale with this "mean gene count at 50% detection", while the cell-cell correlation inversely scales with the "mean gene count at 50% detection". If this scaling relationships are observed for the WT and mutant, then it is impossible to say from the single cell RNA-seq whether the differences in heterogeneity are due to biological or technical reasons. Unfortunately, down-sampling the reads does not generally correct or normalise for this type of technical noise since the technical errors accumulate at every step of sample prep. Of course, it's possible that the technical noise in the RNAseq obfuscates real differences in the level of noise. The failure of mutant cells to re-establish the naïve network certainly suggest there is something going on. Therefore, I suggest performing the analysis of capture efficiency vs CV2 mentioned above and adjusting the discussion accordingly, and potentially perform single molecule FISH of key variable genes at the interface of the two clusters to validate the difference in heterogeneity.

      As suggested by the reviewer, we have performed single molecule FISH measurements of variable genes (Fig. 3 Supp. 1 G, H), but these did not provide independent evidence for increased noise levels in Med12 mutant cells. In light of the caveats raised by reviewer #4 when estimating noise levels from 10x scRNAseq data, and the suggestion of reviewer #3 to sharpen the focus of the manuscript, we have decided to remove any strong conclusions about different noise levels between the genotypes. Instead, we focus on the slowed pluripotency transitions as the main phenotype of the Med12 mutant cells to make the manuscript more concise, to deliver a single, clear message.

      • Are Oct4 levels affected? Reduction of Oct4 is sufficient to block differentiation (Radzisheuskaya et al. 2013 - PMID: 23629142).

      We thank the reviewer for this idea. We measured OCT4 expression levels in single cells via quantitative immunostaining and found that that there is no difference between wild-type and Med12-mutant cells. It is therefore unlikely that lowered OCT4 levels block differentiation in the mutant. These new results are shown in Fig. 5, Supp. 1 D, E.

      • Med12 mutants showing transcriptionally delayed differentiation (related to figure 4C). Is this delay also reflected in the expression of formative genes? If I understand correctly, Figure 4C is made from a panel of naïve markers. It would be good to determine if the formative network is equally affected (and in the same direction - suggesting a delay), or if the transcriptional changes speak to a global dysregulation/dampened expression.

      Prompted by the reviewer's suggestion, we have extended our analysis of the differentiation delays to genes that are upregulated during differentiation, such as formative genes. Rather than trying to come up with an new set of formative markers to produce a variation of the original Fig. 4C (Fig. 5C in the revised manuscript), we have taken an unbiased approach and extended Fig. 5E with a panel showing the distribution of expression slopes of the 100 most upregulated genes determined as in Fig. 5D. This analysis demonstrates a lower upregulation slope in Med12-mutant cells. This result confirms that both the upregulation and downregulation of genes is less efficient upon the loss of MED12, in line with our conclusion of delayed differentiation.

      • Control for the re-plating experiments in 2i/LIF (Figure 4B). Replating in 2iLIF + FBS can have a large selective effect in certain mutant backgrounds (e.g. Nodal mutants) which don't accurately reflect the differentiation status. To exclude such effects, it would be good to repeat the replating assays in serum-free conditions (laminin coating can help with attachment) and include undifferentiated controls to ensure that the mutant doesn't have a clonal disadvantage.

      The reason we have included FBS in the re-plating assays is that in our experience, Fgf4-_mutant cells show strongly impaired growth standard in 2i+LIF medium. We anticipate that using laminin coating to help with attachment would not overcome this requirement. We have therefore decided against repeating the re-plating assays. Instead, we state the reason why we used FBS in the main text, and also explicitly acknowledge the reviewers' concern of the risk of selective effects of the FBS and the possible clonal disadvantages of the _Med12 mutant line.

      Minor points:<br /> - I found figure 3D and the corresponding text and caption difficult to understand. It is unclear what a "footprint", "relative pathway activity" or "spearman correlation of footprint" mean. Were all the genes listed below Med12 knocked out and sequenced in this study? I suggest re-working and maybe simplifying the text and figure.

      We re-worked the description about the pathway analysis and stated more clearly that:

      • The footprint is a quantitative measure of the differences in gene expression change of a defined list of target genes between wild-type and perturbation.
      • Only the Med12 mutant data is new data produced in this manuscript and all examples below are from Lackner et al., 2021.

      We think that a more extensive explanation of the terms "relative pathway activity" and "spearman correlation of footprint" would disturb the flow of the manuscript too much. Therefore, we now cite the original paper just next to the sentence these terms are mentioned.

      In figure S1 Sup1 the authors report the dose response of targets to FGF - are those affected in the mutant?

      In this manuscript we have not tested if the dose response of FGF target genes changes upon perturbation of Med12. We argue that such an experiment would be beyond the scope of the current manuscript, since - as acknowledged by the reviewer - "Med12 does not specifically affect FGF-targets".

      • Similarly, it would be helpful to guide the reader through figure 5H-I and the corresponding text and caption since it's not immediately obvious how the analysis/graphs lead to the conclusion stated.

      As a consequence of our reorganization of the manuscript, the original figure 5H-I has been moved to Fig. 4, Supp. 1 in the revised version. The analysis strategy has been described in more detail in one of our previous publications (PMID: 26511924). In keeping with our general decision to make the manuscript more focused and concise, we have decided against further expanding on these data, but instead refer the reader to the original publication.

      • Role of Med12 in regulating FGF signalling. There are two observations that seems a bit at odds with the text description and it would be helpful to clarify: "ppERK levels were indistinguishable between wild-type and Med12-mutant lines" (line 222) - 5/6 datapoints show an increase. "[...] overall these results argue against a strong and specific role of Med12 in regulation of FGF target genes." (line 274). If I understood correctly, ~50% of genes are differentially transcribed because of Med12 KO.

      To address the reviewers' first question, we have performed a statistical test on the quantifications of the western blots. This test indicates that there is no significant change of ppERK levels upon loss-of MED12, which now stated clearly in the text (line 217).

      Second, to clarify why our data argues against a strong and specific role of Med12 in regulation of FGF target genes, we now formulate an expectation (lines 276 - 277): If MED12 specifically regulated FGF target genes, the number of differentially expressed genes would be higher in the wild-type than in the Med12-mutant upon stimulation with FGF. This however is not the case.

      • "[...] as well as transitions between different pluripotent states" (line 41) - references missing.

      We have added a reference to PMID: 28174249 (line 39).

      • Line 447: "differentiation conditions" - it's unclear what it's mean by differentiation and how it relates to the diagram in figure 6A. Are those the 20hr cells? Do the -8h, -4hr and 0hr cells (if I understand the meaning of the diagram) cluster all together?

      We now specify in the text that pluripotency conditions refer to cells maintained in 2i + LIF medium, whereas differentiation refers to cells switched to N2B27 after the doxycycline pulse (lines 341 - 342).

      • The difference in dynamics of mCherry activation as a consequence of Med12 KO are not apparent from figure 5E. It might be easier to visualise this observation if x-axis was normalised to the starting point plotting "time from start of induction".

      We agree with the reviewer that the current alignment has not been optimized to compare GATA6 induction dynamics between wild-type and Med12-mutant cells. If we changed the alignment however, it would not be clear any longer that both genotypes were in N2B27 for the same amount of time before analyzing Epi and PrE differentiation. Since our focus is on the differentiation of the two lineages rather than GATA6-mCherry induction dynamics, we decided to keep the original alignment.

      • Figure 3H/I - what does "gene expression changes" and "fold change ratio" mean?

      In Fig. 3H, we plot the the fold change of gene expression upon FGF4 stimulation in _Med12-_mutant versus that in wild-type cells; in Fig. 3I we plot the distribution of the ratio of these two fold changes across all genes. To make this strategy clearer, we have changed the axis label in Fig. 3H to "expression fold change upon FGF", to make it consistent with the axis label "fold-change ratio" in Fig. 3I.

      • Line 579-580 - please clarify what is meant by "reduced pluripotency".

      Prompted by a similar concern raised by reviewer #3, we have changed the wording throughout this paragraph and now talk of "reduced pluripotency gene expression". See also our response to reviewer #3 above.

      • Title: "enhance ESC plasticity". not sure enhance is the right word? There is no evidence that the plasticity of cells is affected.

      We have changed the title; see also our response to reviewer #2, point 1.

      Reviewer #4 (Significance):

      Overall, it was an interesting article exploring the molecular consequences of knocking out a subunit of the mediator complex. The characterisation focuses primarily on the description of the screen and the more functional consequences of the KO, rather than delving onto the molecular aspects (e.g. whether mediator complex assembly is affected, or it's binding etc). The analysis of the transcriptional noise will be of particular interest to the community, although I have some suggestions to exclude the possibility that the analysis simply reflects changes in global transcription levels. I have a small number of concerns and requests for clarification on the data but all of them should be relatively easy to address.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Fernkorn and Schröter report the results of a screen in mESCs based on modulation of the fluorescent intensity of the Spry4:H2B-Venus reporter. They identify candidate genes that both positively and negatively modulate the expression of the reporter. Amongst those, are several known regulators of the FGF pathway (transcriptional activator of Spry4) that serve as a positive control for the screen. The manuscript focuses on characterisation of Med12, and the authors conclude that Med12 does not specifically affect FGF-targets. Paradoxically, the authors show that based on the expression of key naïve markers Med12 cells show delayed differentiation. Functionally, however, Med12 mutant cells at 48hrs can form less colonies when plated back in naïve conditions (that would normally indicate accelerated differentiation ). The authors conclude that Med12 mutants have "a reduced ability to react to changing culture conditions". Next, they examine the Med12 mutation affects embryonic/extraembryonic differentiation using an inducible Gata6 expression system. They show that transgene induction is slower and dampened in mutant cells and that overall the balance of fates is skewed towards embryonic cells. Finally, they use single cell RNA sequencing and observe differences in the number of mRNAs detected, as well as the separation between clusters in the mutant cells. They conclude that the mutants have reduce transcriptional noise levels.

      Overall, it was an interesting article exploring the molecular consequences of knocking out a subunit of the mediator complex. The characterisation focuses primarily on the description of the screen and the more functional consequences of the KO, rather than delving onto the molecular aspects (e.g. whether mediator complex assembly is affected, or it's binding etc). The analysis of the transcriptional noise will be of particular interest to the community, although I have some suggestions to exclude the possibility that the analysis simply reflects changes in global transcription levels. I have a small number of concerns and requests for clarification on the data but all of them should be relatively easy to address.

      Major points:

      • Med12, transcription levels and noise (Figure 6G, J-L). This is an intriguing observation. The labelling and multiplexing helped resolve many of the issue typically associated with comparing 10x dataset. I have two observations about this analysis:
      • Clarify how number of mRNA counts per cell is calculated (figure 6F) - the methods only described a value normalised by the total number of counts per cell. I feel this observation is key and has repercussions for the interpretation of the data (see point below) and should be independently validated (although I recognise it's difficult!). Since the authors observed differences in a randomly integrated transgene (iGata experiments), it's possible/likely that the dysregulation of transcription output is more generic. A possible suggestion is measuring global mRNA synthesis and degradation rates, either using inhibitors or by adding modified nucleotides and measuring incorporation rate and loss through pulse/chase labelling.
      • 10x is not the ideal for looking at heterogeneity/noise since it has a low capture efficiency and there are a lot of gaps/zeros in the lower expression range. Therefore, it's simply possible that mutant cells have dampened transcriptional output, meaning lowly expressed genes which in the WT contribute to the apparent heterogeneity (because there is a higher chance of not being captured), are below the 10x detection range in the mutant. This can be seen by plotting the cumulative sum of the mean gene count across each sample - the 50% mark (=mean gene count at 50% detection) reflects a measure of the "capture efficiency" (either because of technical reasons or lower mRNA input). Generally (e.g. also seen across technical repeats), the mean coefficient of variation, entropy and other measures of population heterogeneity directly scale with this "mean gene count at 50% detection", while the cell-cell correlation inversely scales with the "mean gene count at 50% detection". If this scaling relationships are observed for the WT and mutant, then it is impossible to say from the single cell RNA-seq whether the differences in heterogeneity are due to biological or technical reasons. Unfortunately, down-sampling the reads does not generally correct or normalise for this type of technical noise since the technical errors accumulate at every step of sample prep. Of course, it's possible that the technical noise in the RNAseq obfuscates real differences in the level of noise. The failure of mutant cells to re-establish the naïve network certainly suggest there is something going on. Therefore, I suggest performing the analysis of capture efficiency vs CV2 mentioned above and adjusting the discussion accordingly, and potentially perform single molecule FISH of key variable genes at the interface of the two clusters to validate the difference in heterogeneity.
      • Are Oct4 levels affected? Reduction of Oct4 is sufficient to block differentiation (Radzisheuskaya et al. 2013 - PMID: 23629142).
      • Med12 mutants showing transcriptionally delayed differentiation (related to figure 4C). Is this delay also reflected in the expression of formative genes? If I understand correctly, Figure 4C is made from a panel of naïve markers. It would be good to determine if the formative network is equally affected (and in the same direction - suggesting a delay), or if the transcriptional changes speak to a global dysregulation/dampened expression.
      • Control for the re-plating experiments in 2i/LIF (Figure 4B). Replating in 2iLIF + FBS can have a large selective effect in certain mutant backgrounds (e.g. Nodal mutants) which don't accurately reflect the differentiation status. To exclude such effects, it would be good to repeat the replating assays in serum-free conditions (laminin coating can help with attachment) and include undifferentiated controls to ensure that the mutant doesn't have a clonal disadvantage.

      Minor points:

      • I found figure 3D and the corresponding text and caption difficult to understand. It is unclear what a "footprint", "relative pathway activity" or "spearman correlation of footprint" mean. Were all the genes listed below Med12 knocked out and sequenced in this study? I suggest re-working and maybe simplifying the text and figure. In figure S1 Sup1 the authors report the dose response of targets to FGF - are those affected in the mutant?
      • Similarly, it would be helpful to guide the reader through figure 5H-I and the corresponding text and caption since it's not immediately obvious how the analysis/graphs lead to the conclusion stated.
      • Role of Med12 in regulating FGF signalling. There are two observations that seems a bit at odds with the text description and it would be helpful to clarify: "ppERK levels were indistinguishable between wild-type and Med12-mutant lines" (line 222) - 5/6 datapoints show an increase. "[...] overall these results argue against a strong and specific role of Med12 in regulation of FGF target genes." (line 274). If I understood correctly, ~50% of genes are differentially transcribed because of Med12 KO.
      • "[...] as well as transitions between different pluripotent states" (line 41) - references missing .
      • Line 447: "differentiation conditions" - it's unclear what it's mean by differentiation and how it relates to the diagram in figure 6A. Are those the 20hr cells? Do the -8h, -4hr and 0hr cells (if I understand the meaning of the diagram) cluster all together?
      • The difference in dynamics of mCherry activation as a consequence of Med12 KO are not apparent from figure 5E. It might be easier to visualise this observation if x-axis was normalised to the starting point plotting "time from start of induction".
      • Figure 3H/I - what does "gene expression changes" and "fold change ratio" mean?
      • Line 579-580 - please clarify what is meant by "reduced pluripotency".
      • Title: "enhance ESC plasticity". not sure enhance is the right word? There is no evidence that the plasticity of cells is affected.

      Significance

      Overall, it was an interesting article exploring the molecular consequences of knocking out a subunit of the mediator complex. The characterisation focuses primarily on the description of the screen and the more functional consequences of the KO, rather than delving onto the molecular aspects (e.g. whether mediator complex assembly is affected, or it's binding etc). The analysis of the transcriptional noise will be of particular interest to the community, although I have some suggestions to exclude the possibility that the analysis simply reflects changes in global transcription levels. I have a small number of concerns and requests for clarification on the data but all of them should be relatively easy to address.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors sought to identify genes important for the transcriptional changes needed during mouse ES cell differentiation. They identified a number of genes and focussed on Med12, as it was the strongest hit from a cluster of Mediator components.

      Using knockout ES cells, differentiation assays, bulk and scRNAseq, they clearly show that Med12 is important for transgene activation and for gene activation generally during exit from self-renewal, but it is not specifically influencing differentiation efficacy per se. Rather, cells lacking Med12 display "a reduced ability to react to changing culture conditions" and, by inference, to environmental changes. They conclude that Med12 "contributes to the maintenance of cellular plasticity during differentiation and lineage transitions."

      Med12 is a structural component of the kinase module of Mediator, but it is not clear what this study tells us about Mediator function. The authors state that their results contrast with those obtained using a Cdk8 inhibitor, which resulted in increased self-renewal (lines 577-580). I'm not sure where their results show "...that loss of Med12 leads to reduced pluripotency." (lines 579-580). They do not test potency of these cells. There is reduced expression of some pluripotency-associated markers and fewer colonies formed in a plating assay, but these assays to not test cellular potency. While their phenotype certainly appears different from that reported in cells treated with Cdk8 inhibitor, it's not clear to me what to make of it, or what it might tell us about the function of the Mediator Kinase module or of Mediator. That a co-activator is important for gene expression in general, or even for gene activation upon receipt of some signal, is not really surprising.

      Minor points:

      It is surprising they don't relate their work to that of Hamilton et al (https://doi.org/10.1038/s41586-019-1732-z) who conclude that differentiation from the ES cell state towards primitive endoderm is compromised without Med24.

      Stylistic point: please make the separation between paragraphs more obvious. With no indentation or extra spacing between paragraphs it looks like one solid mass of words.

      Significance

      There is a lot of careful work here, but I'm not getting a big conclusion here. Perhaps the authors could argue their main points somewhat more stridently and what we've learned beyond this current system.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript "Med12 cooperates with multiple differentiation signals to enhance embryonic stem cell plasticity" Ferkorn and Schröter report on the role of Med12 in mouse embryonic stem cells. The perform an elegant genetic screen to identify regulators of Spry4 in mouse ESCs, screening for mutations that increase and decrease Spr4-reporter expression in serum/LIF conditions. They find that Med12 deletion results in defects in the exit from naïve pluripotency and in PrE-formation upon Gata-TF overexpression. Using scRNAseq experiments they report a reduction in biological noise in Med12 KO cells differentiating towards PrE upon Gata6 OE.

      Major points:

      1. The title might not exactly reflect the scientific findings of the manuscript. There is little direct evidence for a decrease in plasticity upon Med12 depletion.
      2. Fig 1G: From the data provided it is not entirely clear how well screen results can be validated. Did some of the mutants identified in the screen also produce no detectable phenotypes? What would be the phenotype of knocking out an unrelated gene? In other words, are some of the weak phenotypes really showing Spry4 downregulation or are they withing the range of biological variance?
      3. Rescue experiments by re-expressing Med12 in Med12 KO ESCs are missing. Can the differentiation and transcriptional phenotypes be rescued?
      4. L365: The subheading "Transitions between embryonic... buffered against loss of Med12" is confusing. The data simply shows that Med12 KOs can still, albeit less efficiently generate PrE upon Gata TF OE. Is there evidence for some active buffering? I think the authors could simply report the data as is, stating that the phenotypes are not a complete block but an impairment of differentiation.
      5. L386: Would it not make more sense to reduce dox concentrations in control cells to equalize Gata6 OE to equalize levels between Med12 KO and controls? A shorter pulse of Gata6 does not really directly address unequal expression levels due to loss of Med12. Different pulse length of OE might have consequences that the authors do not control for. This also impacts scRNAseq experiments which suffer from the same, in my opinion, suboptimal experimental setup. This is a point that needs to be addressed.
      6. The reduced transcript number in Med12 KOs is interesting, but how does it come about. Is there indeed less transcriptional activity or is reduced transcript numbers a side effect of slower growth or the different cell states between WT and Med12 mutants. Appropriate experiments to address this should be performed.
      7. I the proposed reduction of biological noise a feature of the PrE differentiation experiments or can it also be observed in epiblast differentiation.
      8. I cannot follow the authors logic that Med12 loss results in enhanced separation between lineages. How is this experimentally supported.

      Minor points:

      Fig 3, Supp1 A: What exactly are the black and blue highlighted letters?

      Significance

      Overall, this is an interesting study. The screen has been performed to a high technical standard and differentiation defects were appropriately analyzed. The manuscript has some weaknesses in investigating the molecular mode of action of Med12 which could be improved to provide more significant insights.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors describe a genome-wide CRISPR screen in mouse ES cells to identify factors and genes that regulate positively and negatively FGF/ERK signaling during differentiation. Out of known and potentially novel regulating signals, Mediator subunit Med12 was a strong hit in the screen and it was clearly and extensively shown by that the loss of Med12 results in impaired FGF/ERK signal responsiveness, modulation of mRNA levels and disturbed cell differentiation leading to reduced stem cell plasticity.<br /> This is a very concise and well written manuscript that demonstrates for the first time the important role of Med12 in ES cells and during early cell differentiation. The results support data that had been previously observed in Med12 mouse models and in addition show that Med12 cooperates with various signaling systems to control gene expression during early lineage decision.

      Fig. 3 Supp1A-B:<br /> The loci of all three independent Med12 mutant clones and the absence of Med12 should be included. Are all three Med12 loss-of-function mutants?

      Minor:

      Line 466: Should be Fig. 6F, not 6E.

      Significance

      The CRISPR screen identified list of some novel interesting factors that regulate FGF/ERK signaling in ES cells. Med12 was then analyzed in very detail on various levels and under various differentiation conditions, resulting in a complex picture how Med12 controls stem cell plasticity. These data support results observed in mouse models and identified novel regulating mechanisms of Med12.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript addresses the important topic of cell-cell junction maturation and mechanical stability, with a specific focus on how mechanotransduction through the Piezo1 channel regulates these processes. The authors present compelling in vivo evidence demonstrating that Piezo1 plays a role in junction stability and barrier function, particularly in aged tissue. The work makes a valuable contribution to our understanding of mechanotransduction in epithelial biology. However, several aspects of the mechanistic model and in vitro experiments require additional development to fully support the authors' conclusions.

      Major Strengths:

      • The in vivo experiments are well-designed and provide convincing evidence for Piezo1's role in barrier function
      • The study identifies an important connection between mechanical sensing and junction maturation
      • The age-dependent phenotype provides interesting insights into tissue mechanics

      • Areas Requiring Additional Development:

      a. Mechanistic Model Definition A major issue is that the central concept of Piezo1 "balancing membrane and cortical tension" requires more precise definition and experimental support. The authors need to clearly explain what this balance means mechanistically and how it is achieved.

      b. Localization-Function Discrepancy There is an important inconsistency between the authors' claims about Piezo1's role and its localization: while they conclude that Piezo1 is crucial for mechanical stability, they also show that Piezo1 is not localized at mature junctions. This apparent contradiction needs to be addressed with a clear mechanistic explanation.

      c. Quantification and Statistical Analysis Several key conclusions would benefit from more rigorous quantification: - The quantitation of junction maturation in Fig. 1a and 2a should include independent analysis of each experiment rather than pooling cells from multiple experiments - Actin morphology and pMLC2 levels at junctions in Fig. 1 need systematic quantification - Cytoskeletal dynamics and morphological changes in Piezo1-eKO cells (Fig. 2a) require quantification

      d. Methodological and Timeline Clarity The analysis methods and temporal aspects of several experiments need better documentation: Analysis Methods:

      The quantification method for mature adhesions (used in Figs. 1a, 1e, 1f, 2a) needs clarification. The Methods section states that "The transition from zipper-like adhesions to mature continuous intercellular junctions were quantified manually," but crucial details are missing: - What specific criteria defined a "continuous junction"? - Was this based on complete visibility of the cell perimeter as one junction? - How were cells classified as having continuous versus zipper-like adhesions?

      e. The protein intensity quantification at junctions requires methodological clarification. The Methods state "For quantifying intensities at junctions, max projection images were generated, and region of interests (ROIs) were restricted to ZO1-positive junctions." However: - Were ROIs drawn empirically by the user? Or was the ZO-1 signal used to make a mask? - Was there an automated step to determine junctional areas (e.g., intensity threshold)? - Was the analysis blinded? If subjective methods were used, this should be clearly stated and potential variability addressed. 2. Timeline Documentation:

      For blebbistatin experiments (Fig. 1e), specify observation timeframes and quantify the extent of accelerated maturation

      The hypotonic shock experiment (Fig. 3e) timeline needs clarification: - When were measurements taken relative to Ca2+ switch? - Duration of hypotonic media exposure? - Were there time-dependent effects in cell response? 3. Data Support and Interpretation

      a. Several conclusions require additional support or clarification: - The claim about "more dynamic cytoskeletal motion and irregularly shaped" cells (Fig. 2a) is not supported by the provided data. Quantification of dynamics and cell shape are needed to support this conclusion. Cytoskeletal imaging data would also be useful.

      b. The interpretation of junctional tension requires revision: - Current conclusions about increased junctional tension are inferred indirectly from vinculin (Fig. 1c) and a18-catenin (Fig. S1a) immunostaining images. - Consider either:

      a) Adding direct junctional tension measurements (e.g., optical measurements, PMID 31964776)
      
      b) Limiting claims to well-supported morphological differences and moving tension-related interpretations to the Discussion as speculative elements
      

      c. The description "Analysis of vinculin translocation to intercellular junctions showed reduced levels of vinculin at cell-cell contacts, but abundant vinculin at cell-matrix adhesions (Supplementary Fig. S2a), indicating abnormal build-up of stresses at intercellular junctions of Piezo1-eKO cells" needs revision: - "Build-up" suggests higher tensions in Piezo1-eKO cells, which contradicts impaired adhesion maturation findings. Suggest replacing with "distribution" or "organization" "Intercellular" is used ambiguously to include both cell-cell and cell-matrix adhesions 4. Literature Context:

      The discussion should incorporate recent relevant literature on Piezo1's role in tight junction regulation (e.g., PMID 37005489, PMID 33636174, PMID 31409093) 5. Technical Considerations - For localization studies (Fig. 2), using keratinocytes from Piezo1-tdTomato mouse (JAX #029214) would be preferable to heterologously-expressed Piezo1-FLAG, as it would avoid potential artifacts from non-physiological expression levels - Supp Fig. 1b requires additional replicates - The Fig. 3A legend states "Note increase in FLIPPER-TR lifetime indicative of elevated membrane tension in Piezo1-eKO" when the data actually shows the opposite - a decrease in Flipper-TR lifetime indicating lower membrane tension 6. Conceptual and Experimental Clarity Needed Several statements require clearer explanation or additional supporting evidence:

      a. Regarding junction maturation mechanisms:

      The authors state: "This indicated that formation of belt-like adhesions was associated with initial contractility build-up by actomyosin stress fibers linked to junctions, followed by a switch to parallel actomyosin bundles and reduced contractility at adhesions, while the junctions themselves were stabilized in a stressed state indicated by a strengthened actin-junction link." Each part of this claim needs experimental support: - The "initial contractility build-up by actomyosin stress fibers linked to junctions" needs to be demonstrated - The "switch to parallel actomyosin bundles and reduced contractility at adhesions" requires quantification - The claim about "junctions themselves were stabilized in a stressed state" needs stronger evidence

      b. The statement "contact expansion from zippers to a belt requires collaborative regulation of adhesion tension and actomyosin cytoskeleton to lower interfacial tension at the contact" is unclear and needs clarification

      c. The claim "Concomitant with emergence of continuous junctions (8h), the stress fibers were replaced by thick actin bundles positioned perpendicularly to junctions (Fig. 1b)" is not clearly supported by the data 7. Regarding experimental interpretation: - In Fig. 1e, the authors claim that 5µM blebbistatin accelerates junction maturation, but this conclusion is not supported by the statistics (p = 0.0784). Additionally, the timeframe of observation and the quantification of maturation speed should be specified - The results section describing Fig. 3 presents seemingly disconnected observations without clear mechanistic links between them, making it difficult to follow the authors' logic and support their conclusions - The mechanism by which both reduced contractility (blebbistatin) and increased membrane tension can accelerate maturation (Fig. 1e, f; and also in Piezo1-eKO Fig. 3d, e) needs explanation. The fact that these interventions also accelerate maturation also in Piezo1-eKO suggests a mechanism independent of Piezo1 which is at odds with their broad conclusion that Piezo1 balances membrane tension and cortical contractility in the maturation process. The precise mechanism of Piezo1's role in sensing membrane and cortex tension requires clarification. - How Piezo1 maintains mechanical stability of mature junctions despite not being localized there needs to be explained 8. Suggested Additional Experiments:

      a. Optional: Given the age-dependent tissue stiffness effects proposed by the authors, examining keratinocyte behavior in vitro on substrates of varying stiffness would provide valuable insights

      b. Optional: Direct measurements of tension at cell-cell junctions where Piezo1 localizes would help validate the proposed mechanical model 9. Minor Points: - The cell biology sections, particularly descriptions of in vitro experiments, would benefit from a thorough revision to improve precision and clarity. For instance, the Results section describes "Analysis of vinculin translocation to intercellular junctions" when no translocation is actually being studied - Figure legends should clearly indicate what individual data points represent - Several conclusions are overstated. For example, the authors conclude that "Piezo1 controls the maturation process" and that "Piezo1 is required for cell junction maturation into junctional belts" based on Fig. 2. These are exaggerated claims since maturation still progresses in Piezo1's absence, just more slowly. "Regulates" or "modulates" would be more appropriate terminology

      In conclusion, while this manuscript presents important findings regarding Piezo1's role in junction maturation and stability, addressing the mechanistic and quantification issues outlined above is essential for supporting the authors' conclusions. The authors have laid groundwork for understanding an important biological process, and addressing these points would help readers better appreciate the significance of their findings.

      Significance

      General Assessment: This study investigates the critical role of mechanosensing in epithelial barrier formation and maintenance, with a particular focus on Piezo1's contribution to junction maturation and stability. The work's primary strengths lie in its compelling in vivo demonstrations of Piezo1's importance for barrier function, particularly in aged tissue, and its identification of a novel connection between mechanical sensing and junction maturation. The age-dependent phenotype provides valuable insights into tissue mechanics and barrier maintenance. However, the mechanistic understanding of how Piezo1 coordinates these processes requires further development, particularly regarding the proposed balance between membrane and cortical tension.

      Advance: This work provides several important advances:

      1. First demonstration of Piezo1's role in regulating the maturation of cell-cell junctions from zipper-like to belt-like structures
      2. Novel insights into how mechanical forces influence junction maturation through mechanosensitive ion channels
      3. Important connection between aging, tissue mechanics, and barrier function
      4. Integration of mechanical sensing with junction assembly and stability

      The findings extend our understanding of epithelial barrier formation beyond traditional molecular pathways to include mechanotransduction, suggesting new therapeutic possibilities for barrier dysfunction. The age-dependent phenotype is particularly significant as it reveals how mechanical properties of tissue influence barrier maintenance over time.

      Audience: This research will be of broad interest to multiple communities:

      • Cell biologists studying junction assembly and epithelial organization
      • Mechanobiologists interested in force transmission and sensing
      • Ion channel researchers interested in the physiological roles of channels
      • Aging researchers investigating tissue barrier function
      • Bioengineers developing therapeutic strategies for epithelial barriers

      The findings have both basic research and translational implications, particularly for understanding and treating age-related barrier dysfunction in epithelia.

      Reviewer Expertise: Cell biology, mechanobiology, live cell imaging, quantitative image analysis, ion channels I have sufficient expertise to evaluate all aspects of the manuscript except for the specific age-related physiological changes in mouse skin, which falls outside my area of expertise.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes the role of the mechanosensitive ion channel Piezo1 in epithelial junction assembly, using Piezo-1-KO primary epidermal keratinocytes in vitro and mouse skin in vivo. The authors conclude that Piezo1 allows balancing of membrane versus cortical tension to stabilize junctions and promote tight juntion (TJ) barrier integrity assembly. The conclusion that Piezo1 has an important function in the formation and maintenance of apical junctions of keratinocytes both in vitro and in vivo is well documented by experiments in WT, KO and rescue cells/tissues where different parameters are carefully measured: protein localization, quantification of mature junctions, membrane tension using the flipper probe, use of the myosin inhibitor blebbistatin, analysis of cortical stiffness by AFM, etc. Although, the physiological relevance and the mechanism through which Piezo operates in young skin are not clear, the authors make reasonable claims, that are not too speculative.

      Major comments:

      1. The Supplementary Figure 4d (panel d) that is described in the Results section is missing. It supposedly shows that 1 year-old Piezo1-eKO mice diplay an increase in transepidermal water loss, inducating that TJ barrier function is compromised. The Figure legend for the panel is also missing. Please provide the Figure panel and the legend.
      2. TJ barrier function depends on claudins, and the loss of claudin-1 leads to transepidermal water loss (please cite the relevant paper from the Tsukita lab). Considering that altered TJ barrier function is observed only in 1-yr old mice (Supplementary Figure to be shown, see point n.1) and not in young mice (Suppl. Fig. 3f-h), the expression pattern of the main claudin isoforms, and especially claudin-1, in the different cell populations (see Suppl. Fig. 3b, or by IF analysis) in young vs old and WT vs KO mice must provided, to provide a mechanistic basis for the observed TJ barrier phenotype. This would help to determine if the phenotype is linked to altered claudin expression or to altered (increased) perijunctional tension.
      3. Mechanistically, the authors mention in the Discussion that Piezo1 might act through RhoA signaling. In Rübsam et al 2017 the authors showed that the uppermost viable layer of the skin has increased apical junctional tension, due to anisotropy of AJ distribution which correlates with EGFR activation and localization. In this context, it is important to know if KO of Piezo-1 affects EGFR localization and signaling, and to probe the RhoA pathway using for example the ROCK inhibitor, instead of blebbistatin.

      Minor comments:

      1. The Methods sections should be improved with additional details. For example, the description of quantification of junctional labeling is vague, and there is often no or little indication in the Legends that specifies number of experiments and junctional segments. In addition, quantification of junctional stainings for specific proteins should be done using a junctional reference marker and not as "absolute" values, because there can be variability of staining between samples and experiments. This is especially important when measuring ZO-1, which is a dual AJ-TJ protein (for example at zipper-like junctions ZO-1 colocalizes with AJ markers). Double labelling with a true TJ marker (occludin or cingulin) and/or a true AJ marker (PLEKHA7, afadin, Ecadherin or a catenin) and quantifying junctional labeling by ratio is highly recommended. This is particularly important when evaluating tension-sensitive epitopes/antigens (alpha-catenin, vinculin, etc)
      2. Please use ZO-1 (and ZO-2) consistently, instead of ZO1 (or ZO2), which is completely inaccurate.
      3. Plase cite Furuse et al 2002 JCB (see above).
      4. Please include statistical data in Figure Legends, specifying the number of separate experiments and number of samples. At least three experiments is recommended.
      5. At the end of the introduction the authors mention "putative" occludin-containing TJs. I would delete putative. Epithelial junctions that contain a continuous circumferential linear distribution of occludin/ZO-1/cingulin and form a barrier comply with the definition of a TJs (Citi et al JCS 2024) .
      6. Please insert page numbers in the manuscript.

      Significance

      The notion that mechanosensitive calcium channels contribute to the formation of continuous apical junctions (repair and assembly) was introduced by the Miller lab, using Xenopus oocytes. This manuscript provides a significant conceptual advance, not only by using in vitro and in vivo mouse (mammalian) epidermal keratinocytes as model system, but especially by using Piezo1-KO and rescue experiments, which was not done in the Xenopus model.

      This research would be of great interest to cell biologists interested in epithelial differentiation, polarization and junction assembly, and to clinicians that are interested in the molecular basis of skin pathophysiology.

      My expertise is in the biochemistry, cell biology and mechanobiology of epithelial junctions. I have used Xenopus embryos, cultured epithelial cells, primary keratinocytes and keratinocyte cell lines and KO mice as model systems. The research of my group focuses on how specific cytoskeletal proteins are organized to transmit forces and are recruited to junctions, and how junctional proteins respond to mechanical force. I have experience in all of the methods described in this paper, except for transepidermal water loss measurement, in situ RNA hybridization and mechanical stretching experiments.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The studies described in this manuscript investigated the mechanical regulation of tight junction (TJ) maturation in the epidermis using a combination of in vitro and in vivo analysis. The findings indicate that during calcium-induced cell-cell adhesion in keratinocytes, there is an initial build up cortical tension in the actin cytoskeleton, followed by an increase in membrane tension, which is required for formation of mature TJs. The studies also demonstrate that loss of Piezo1 delays TJ maturation via defects in membrane tension. Loss of Piezo1 also impaired epidermal homeostasis and barrier function in aged mice. The authors propose that the balance in forces between the cortex and membrane is essential for TJ assembly and is mediated by Piezo1.

      Overall, the studies are carefully designed and executed and provide a clear role for membrane tension and Piezo1 in TJ development, making use of molecular forces sensors, imaging, and chemical and genetic perturbations. However, not all of the conclusions are fully supported by the data, and some key findings require additional quantitative and statistical analysis.

      1. The statement at the end of page 5 ("This indicated that formation of belt-like...) is somewhat overinterpreted from the data shown. To draw conclusions about a switch to reduced contractility at adhesions requires more careful spatio-temporal quantification of F-actin and pmyosin beyond the example single cells shown in 1b. It would also help to see the localization of Ecadherin during this process.
      2. To avoid confusion, the authors should pay careful attention to terminology and be specific when referring to adherens junctions or TJs, rather than just junctions generally.
      3. The labelling of Figure 2b could be clearer. Were the CNL cells also transfected with Piezo1 or mock transfected to control for general effects of transfection? This was not clear from the figure captions.
      4. In Figure 2c-g it is not specified which timepoints the images represent, and the qualitative description of changes in localisation require quantification.
      5. The importance of Piezo1 in junction maturation is somewhat overstated throughout. While Piezo KO clearly delays TJ maturation, the process can still be completed. In the absence of Piezo1 what triggers the rise in membrane tension? Could there be any compensation from Piezo2?
      6. Some of the differences noted are subtle and not strongly significant, such as K6expression, Ca++ induced Piezo1 expression, and F4/80 staining. The conclusions related to these responses should be tempered or qualified.
      7. Analysis of the immune infiltration and the suggested inflammatory response in aged mice is fairly preliminary and not well supported by the data. A second marker of macrophages and addition of T cell markers would help clarify the type of immune response. It would also help to describe the localisation of specific immune cells in more detail and include a direct marker of inflammation (e.g. inflammatory cytokines).
      8. OPTIONAL: Although not essential for the conclusions of the study, the impact and insight could be improved by providing more analysis of the mechanism for the role of Piezo1. For example, does the build up of cortical tension trigger changes in ion channel signalling, and how does this then regulate membrane tension? Is RhoA or aPKC involved?

      Significance

      The process by which epithelia assemble and maintain effective barriers is complex and requires precise spatio-temporal regulation. This study provides some new insight into the mechanical regulation of TJ assembly within the epidermis. It builds upon previous work that identified essential biomechanical cross-talk between adherens junctions and TJs and adds some new information on the timings and specific roles of membrane tension and Piezo1. The interplay between cortical and membrane tension is noteworthy, and this mechanism may have important implications in other barrier tissues. A limitation of the study is a lack of mechanistic detail in how the mechanical switch occurs during TJ maturation, including the specific molecules, structures, and interactions with Piezo1.

      The study also describes the functional implications, whereby loss of Piezo1 in the mouse disrupts barrier integrity. However, these effects were quite subtle. Barrier homeostasis was only disrupted in aged mice, and in vitro, loss of Piezo1 delayed but did not prevent junction maturation. It is therefore interesting to speculate what other mechanisms may be involved in TJ maturation. A potential limitation here is also a lack of detail in the analysis of the inflammatory and immune response in Piezo KO skin.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewers

      We thank the reviewers for their comments and suggestions, which we think are helpful and will improve the manuscript, and intend to address with the changes and planned revisions below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Bello et al look at the SNP rs28834970 associated with Alzheimer's disease (AD), with C being the risk allele, on chromatin accessibility and expression of a nearby gene, PTK2B, in microglia. Their contention is that the single SNP affects chromatin accessibility and binding of the transcription factor CEBP[beta] in an intronic region of PTK2B and thereby affects PTKB expression. I had a few questions that I think are critical to be addressed. Please note that my numbering of panels is based on the figures, not the legends, which do not seem to quite agree with each other. There are also some figure legends that say "IFNg" while the figures say "LPS", which should be fixed.

      We apologise for the mistake in the figure legend that made this confusing, which we have now revised.

      The abstract says that editing a line that is homozygous for protective alleles to homozygous for risk results in "subtle downregulation of PTK2B expression". It isn't clear to me that the presented data fully supports this contention, which is central to the argument of the paper. In figure 2e, the authors show in both RNAseq and ddPCR that there is numerically lower PTK2B expression but this is not indicated to be statistically significant by one-way paired ANOVA. If there is no nominally significant difference in the edited lines, compared to the proposed significant differences in lines carrying the full risk haplotype (figure 1), then it would not seem sensible to ascribe the effects to the single edited base pair.

      We agree with the reviewer that given the effect of the SNP on PTK2B expression in the edited lines is small and only significant in macrophages, we should not interpret the effects to be mediated solely through PTK2B expression, and have substantially reworded the manuscript accordingly.

      Whilst the effects in the eQTL analysis are significant, it is worth noting that this is likely due to the much larger number of donors (133-217) giving greater power to detect the subtle changes in expression (~1.1 to 2 fold in eQTL). This change is of a similar magnitude in our SNP edited lines (~1.2 fold in SNP edited lines) as would be expected of most common regulatory variants so we believe that it could be the primary causal variant. However, we cannot exclude that other variants in the haplotype could contribute to the effect, so have also reworded accordingly to make this clear.

      Given this uncertainty about the overall strength of effect of the single base pair change it would seem important to evaluate the proposed mechanism of CEBPb binding. It wasn't clear whether the ATAC-seq data summarized in the volcano plot in 2C is proposed to be a cause or a consequence of the CEBPb binding change. I am assuming that the 'fold change' estimate here is CC compared to TT, which would be consistent with direction of effect in figure 1, but please clarify.

      We apologise for the mistake in the figure legend that made this confusing, which we have now revised along with clarification in the revised text. It is difficult to be sure whether changes in chromatin accessibility are a cause or consequence of CEBPb binding, but the fact that the binding of CEBPb is increased in the CC allele (Fig 2a, Fig 2c), that the C allele better matches the consensus sequence (Fig 2b) and there is increased chromatin accessibility (Fig 2a, Supp Fig 3b) suggests that CEBPb binding is causing the formation of the region of chromatin accessibility.

      In contrast to the subtle effects at PTK2B, the global transcriptional effects in figure 3 look quite strong. Are any of these changes dependent on PTK2B, that is to say, are they mimicked by partial suppression of PTK2B expression or activity?

      We agree that the downstream effects of the SNP are much stronger than the effects on PTK2B expression, and we have substantially reworded the manuscript to make it clear that we are unsure that the effects of the SNP are all mediated via PTK2B.

      However, we note that there is evidence in the literature of a loss in CCL4 and CCL5 expression upon PTK2B knockout in macrophages (https://www.nature.com/articles/s41467-021-27038-5) and inhibition of PTK2B in monocytes results in a reduction in CCL5 and CXCL1 (https://www.nature.com/articles/s41598-019-44098-2) consistent with our observations.

      Experiments to manipulate PTK2B expression in microglia and readout changes at the RNA level would take a few months to complete, but we would be willing to do this if the reviewer felt this was necessary.

      Finally, in figure 4, it should be clarified as to why lower expression of PTK2B would be expected to have a detrimental effect on Alzheimer's risk. If understood correctly, and again fixing the figure legends would be helpful, the CC edited lines (risk) have lower chemokine induction than the unedited TT lines.

      We apologise for the error in this figure which we have corrected in the revised version. You are correct that the CC lines have a lower chemokine level in both unstimulated and stimulated cells, and we have now discussed further how this may be linked to increased disease risk.

      "Even though overexpression of these chemokines is characteristic of neuroinflammation, correlated with disease progression and found in late stages of AD, knockout of chemokines, such as CCL2, and chemokine receptors, such as CCR2 and CCR5, in mice is associated with increased Aβ deposition and accumulation [47,50-52,107]. It has also been found that patients carrying CCR5Δ32 mutation, which prevents CCR5 surface expression, develop AD at a younger age[108]. Therefore, we hypothesize that in individuals carrying the C/C allele of rs28834970 downregulation of these chemokines in macrophages and microglia harbouring the C/C allele of rs28834970 affects Aβ-induced microglia chemotaxis, leukocytes recruitment and clearance of Aβ, and may increase the risk of developing symptomatic AD"

      Reviewer #1 (Significance (Required)):

      Going from GWAS hits, which represent blocks of high LD inherited variants, to single functional variants is a difficult problem in human genetics. The current paper attempts to isolate the effect of a single variant within an LD block on IPSC derived macrophages and microglia. This idea might be useful in nominating PTK2B as a therapeutic target for AD, although there is some question in my mind as to direction of effect.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      SUMMARY: In this manuscript the authors explore the biological effects of an intronic SNP in the PTK2B gene, previously shown to be associated with late onset Alzheimer's disease (AD) risk. Based on the likely effect of the SNP locus on PTK2B expression in the macrophage lineage, the authors explore the consequences of introducing with the Crispr/Cas9 technique the biallelic SNP base change (C/C vs T/T) in a human IPSC line that is then differentiated into macrophages or microglia. They observe that C/C increases chromatin accessibility and CEBPb binding in comparison to T/T, with a slight decrease in PTK2B expression, significant in macrophages but not in microglia. The authors then investigate the transcriptome changes induced by the C/C mutation and find alteration in many genes, including a decreased expression of a number of cytokine or receptor proteins involved in inflammatory responses. The authors also mention a decreased effect on IFNg-induced reduced mobility but the data are missing (see Figure errors below). Overall the authors propose that the risk SNP is associated with a decreased PTK2B expression and hypothesize a link between this change and a decreased function of macrophages/microglia that may contribute to AD pathology.

      MAJOR COMMENTS

      1- The authors claim that their results show that the investigated SNP has a causal effects in "microglial function" (Title) and in Alzheimer's disease (AD) (Abstract 2nd sentence "Here we validate a causal single nucleotide polymorphism (SNP) associated with an increased risk of Alzheimer's disease". The word "causal" is repeated many times. However the authors should qualify their claim with respect to AD. Their results do show that the SNP has an effect on chromatin accessibility, CEBP binding, PTK2B expression and transcriptome, but the link between these changes is not formally demonstrated and their potential role in AD-like phenotype is not explored. The "causal" role is not formally and logically demonstrated. It remains an interesting, plausible hypothesis and the results provide strong arguments in support of that hypothesis but do not prove it, yet.

      Concerning the title, "causal effects on microglial function" is awkward, anything that has effects is logically "causal" in these effects. The title should be "... has effects on microglial functions" or "... alters microglial function".

      We agree with the reviewer that given the effect of the SNP on PTK2B expression in the edited lines is small and only significant in macrophages, we should not interpret the effects to be mediated solely through PTK2B expression, or that they cause AD. We have substantially reworded the manuscript throughout to account for this.

      2- One major difficulty in the results is to link the slight decrease in PTK2B transcript, which is only significant in macrophages, with the rest of the phenotype. Because what matters to make this link is not the mRNA but the protein, and because mRNA levels are often not strictly correlated with the protein levels, the authors should measure the PTK2B/PYK2 protein levels in their differentiated cell lines in basal conditions and following activation (as they do for other readouts) using immunoblotting. A robust and significant diminution in PYK2 protein would strongly support its role in linking PTK2B expression and transcriptome change.

      We have performed preliminary analyses of PTK2B expression by Western blot in these cell lines after differentiation, but were unable to observe a significant change in abundance in the edited cell lines. This is not unexpected given the results at the RNA level, since the effect size of this common regulatory variant is likely very small (estimated to be ~1.2 fold from the eQTL analysis), and likely within the variability of this assay.

      As mentioned above, we have reworded the manuscript to avoid interpreting that the effects of rs28834970 are mediated solely through effects on PTK2B expression. We think that an experiment to manipulate PTK2B levels (see next point) may be a better way to demonstrate whether these effects are mediated through PTK2B expression.

      An optional additional key experiment would be to reverse the transcriptome phenotype by increasing the expression of PTK2B (e.g. by cDNA transfection). Note that these points are important because an alternative hypothesis to explain the effects of C/C mutation on macrophage function would be that the C/C mutation has a long distance effect on other chromatin regions with key role in regulating these cells.

      We agree that this would be a valuable experiment, and are planning additional experiments to investigate the effect of manipulating PTK2B levels (through knockout) on microglia.

      3- The manuscript contains several errors in the figures and figure legends. In Fig. 2 the legends for the figure items are shuffled. Figure 4 and Supplementary Figure 5 are duplicates of the same one. Consequently important data are not presented.

      We apologise for the errors in these figures that were due to a mistake during uploading where the incorrect versions were used. The legends for figure 2 and panels in figure 4 have now been corrected, and show the effects of rs28834970 on microglial migration and chemokine release in the presence or absence of IFNg.

      4- When the number of replicates is small (e.g. n = 3) it is preferable to use non parametric tests (rank analysis, e.g. Mann Whitney's test) rather than t test. This applies to Figures 2D (current legend 2A), 2E (current legend 2B), Figure 4A-C, Supplementary Figures 2A, 2B. In Supplementary Fig 4E (MARCO) the number of replicates (presumably 3 because based on RNAseq) and the used test are not indicated. Is it the RNAseq statistical analysis?

      We thank the reviewer for this comment. We acknowledge that the t-test may lead to inflated false discovery rates. However, it has been shown that for small sample sizes parametric tests have a power advantage compared to non-parametric ones that may outweigh the possibly exaggerated false positives. See https://genomebiology.biomedcentral.com/articles/10.1186/s13059-022-02648-4#Sec3 which states:

      "In conclusion, when the per-condition sample size is less than 8, parametric methods may be used because their power advantage may outweigh their possibly exaggerated false positives."

      We have also modified the legend of supplementary figure 4E to clarify the number of replicates used.

      5- In addition to the above comment on tests, when the number of replicates is small it is not appropriate (and misleading) to show box plots or bars with SEM. In the indicated figures the individual data points should be shown.

      We now show individual replicates on box plots (Figure 2D, 2E and supp figure 4E).

      MINOR COMMENTS:

      a- Macrophages and microglia are very similar cell types. Could the authors comment more on the differences they observe and how they are related to those previously described?

      We have now referenced the original papers and commented on the markers that we see differentially expressed, notably P2RY12 which is a key homeostatic microglia marker that distinguishes these cells from macrophages.

      b- In Fig. 2A CEBPb cut and run plot, the differences are not limited to the SNP immediate vicinity, there are also visible differences between T/T and C/C plots in at least a 40-kb range. Is it due to multiple interactions of CEBPb? How can the point difference have broad consequences? Please explain this potentially interesting and relevant finding.

      Whilst there may be small changes in CEBPb binding at the second intronic PTK2B chromatin peak, this is not statistically significant given the variability between repeats. In fact, the only significant change we see in CEBPb binding genome-wide is at the locus overlapping the SNP (Fig 2c).

      c- Potentially cis-altered genes near the SNP include CHRNA2 and EPHX2 (see Sup. Fig. 3a). Their expression may not be detected in macrophage lineage. If this is the case please indicate in the text, otherwise please include the corresponding data in Sup. Fig. 3b to show the presence or absence of SNP-induced change.

      You are correct that CHRNA2 and EPHX2 are not expressed in our macrophages or microglia, and we have now explicitly stated this in the revised text.

      d- In general the Figures are not of very high quality and are difficult to read or understand without constantly going back and forth to the legends (which are mislabeled in some instances). To improve:

      . Please increase font size whenever possible.

      . Please improve Fig. 1d by indicating the position of the SNP, numbering the exons (an intermediate scale plot may be necessary and lines on bottom trace are hardly visible).

      . Please indicate the correct color code for T/T and C/C in Fig 3a and b, left panels, which currently doesn't match.

      . Please label the Venn's diagrams comparisons in Sup. Fig. 4b.

      . In the text and legends the Figure items are identified with letters in upper case, in the figures they are in lower case. Please be consistent.

      We have improved the resolution of the images in the pdf and Fig 1d has been revised to include the position of the SNP. The colour code for T/T and C/C is correct in fig 3a and 3b, but since the PCA plots are independently created, we would not always expect the position of the T/T and C/C alleles to be the same. The Venn diagrams in Sup Fig 4b have been updated, and the letters for the figure panels made consistently upper case throughout.

      e- In Fig. 2D and 2E, the Y axes should start at zero to avoid artificially increasing the visual differences. If there is a strong reason not to do so (I don't see any here), the Y axis should be clearly interrupted to avoid confusion.

      We have altered this accordingly.

      f- In the introduction the authors provide some background about previous work about the potential role of PTK2B/PYK2 in AD pathophysiology. The cited preclinical results suggest that PTK2B activity could have a deleterious effect (references in the manuscript). In contrast, some other reports (PMID: 29803828, 33718872) suggest a protective effect of PTK2B/PYK2. Because the evidence in the current manuscript suggests that the risk-associated SNP results in a decreased function of PTK2B/PYK2 (through decreased levels), at least in cells of the macrophage lineage, the authors could broaden their discussion to include these results.

      We have now discussed the conflicting evidence in the revised manuscript.

      Reviewer #2 (Significance (Required)):

      ADVANCE: Late onset Alzheimer's disease is a major medical issue. It has a complex genetic risk component with many associated loci identified in GWAS. Most of these have only a small individual impact on the risk. One of the SNPs associated with increased risk (rs28834970) is located in an intron of the PTK2B gene. Although various reports have investigated the role of the PTK2B gene product, the tyrosine kinase PYK2, in several AD models, the possible link with rs28834970, is unclear.

      An important point is to determine whether TàC SNP corresponding to rs28834970 alters PTK2B expression and how it does so. An alternative hypothesis could be that the SNP has a strong linkage disequilibrium with an unidentified allele in human populations that could be responsible for AD risk. The current manuscript is a significant step forward in addressing that question. By generating a biallelic C/C SNP mutation in a human IPSC line the current study allows to eliminate such linked contribution.

      The strength of the manuscript is to show an effect on chromatin accessibility, CEBP binding and possibly PTK2B transcripts. It also provides interesting evidence of a broad effect of the C/C mutation on the transcriptome of macrophage lineage cells. In its current form the manuscript presents weaknesses that could be improved. These flaws include issues with the presentation discussed above and the uncomplete demonstration that it is the decrease in PTK2B expression that causes the macrophage/microglia phenotype. If these flaws were overcome the paper would represent a significant advance.

      AUDIENCE: The expected audience is specialized in AD with a possible broader range if all weaknesses are addressed.

      REVIEWER EXPERTISE: Basic science close to the field.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this manuscript the authors explore the biological effects of an intronic SNP in the PTK2B gene, previously shown to be associated with late onset Alzheimer's disease (AD) risk. Based on the likely effect of the SNP locus on PTK2B expression in the macrophage lineage, the authors explore the consequences of introducing with the Crispr/CAS9 technique the biallelic SNP base change (C/C vs T/T) in a human IPSC line that is then differentiated into macrophages or microglia. They observe that C/C increases chromatin accessibility and CEBPb binding in comparison to T/T, with a slight decrease in PTK2B expression, significant in macrophages but not in microglia. The authors then investigate the transcriptome changes induced by the C/C mutation and find alteration in many genes, including a decreased expression of a number of cytokine or receptor proteins involved in inflammatory responses. The authors also mention a decreased effect on IFNg-induced reduced mobility but the data are missing (see Figure errors below). Overall the authors propose that the risk SNP is associated with a decreased PTK2B expression and hypothesize a link between this change and a decreased function of macrophages/microglia that may contribute to AD pathology.

      Major comments:

      1. The authors claim that their results show that the investigated SNP has a causal effects in "microglial function" (Title) and in Alzheimer's disease (AD) (Abstract 2nd sentence "Here we validate a causal single nucleotide polymorphism (SNP) associated with an increased risk of Alzheimer's disease". The word "causal" is repeated many times. However the authors should qualify their claim with respect to AD. Their results do show that the SNP has an effect on chromatin accessibility, CEBP binding, PTK2B expression and transcriptome, but the link between these changes is not formally demonstrated and their potential role in AD-like phenotype is not explored. The "causal" role is not formally and logically demonstrated. It remains an interesting, plausible hypothesis and the results provide strong arguments in support of that hypothesis but do not prove it, yet. Concerning the title, "causal effects on microglial function" is awkward, anything that has effects is logically "causal" in these effects. The title should be "... has effects on microglial functions" or "... alters microglial function".
      2. One major difficulty in the results is to link the slight decrease in PTK2B transcript, which is only significant in macrophages, with the rest of the phenotype. Because what matters to make this link is not the mRNA but the protein, and because mRNA levels are often not strictly correlated with the protein levels, the authors should measure the PTK2B/PYK2 protein levels in their differentiated cell lines in basal conditions and following activation (as they do for other readouts) using immunoblotting. A robust and significant diminution in PYK2 protein would strongly support its role in linking PTK2B expression and transcriptome change. An optional additional key experiment would be to reverse the transcriptome phenotype by increasing the expression of PTK2B (e.g. by cDNA transfection). Note that these points are important because an alternative hypothesis to explain the effects of C/C mutation on macrophage function would be that the C/C mutation has a long distance effect on other chromatin regions with key role in regulating these cells.
      3. The manuscript contains several errors in the figures and figure legends. In Fig. 2 the legends for the figure items are shuffled. Figure 4 and Supplementary Figure 5 are duplicates of the same one. Consequently important data are not presented.
      4. When the number of replicates is small (e.g. n = 3) it is preferable to use non parametric tests (rank analysis, e.g. Mann Whitney's test) rather than t test. This applies to Figures 2D (current legend 2A), 2E (current legend 2B), Figure 4A-C, Supplementary Figures 2A, 2B. In Supplementary Fig 4E (MARCO) the number of replicates (presumably 3 because based on RNAseq) and the used test are not indicated. Is it the RNAseq statistical analysis?
      5. In addition to the above comment on tests, when the number of replicates is small it is not appropriate (and misleading) to show box plots or bars with SEM. In the indicated figures the individual data points should be shown.

      Minor comments:

      • a. Macrophages and microglia are very similar cell types. Could the authors comment more on the differences they observe and how they are related to those previously described?
      • b. In Fig. 2A CEBPb cut and run plot, the differences are not limited to the SNP immediate vicinity, there are also visible differences between T/T and C/C plots in at least a 40-kb range. Is it due to multiple interactions of CEBPb? How can the point difference have broad consequences? Please explain this potentially interesting and relevant finding.
      • c. Potentially cis-altered genes near the SNP include CHRNA2 and EPHX2 (see Sup. Fig. 3a). Their expression may not be detected in macrophage lineage. If this is the case please indicate in the text, otherwise please include the corresponding data in Sup. Fig. 3b to show the presence or absence of SNP-induced change.
      • d. In general the Figures are not of very high quality and are difficult to read or understand without constantly going back and forth to the legends (which are mislabeled in some instances). To improve:
        • Please increase font size whenever possible.
        • Please improve Fig. 1d by indicating the position of the SNP, numbering the exons (an intermediate scale plot may be necessary and lines on bottom trace are hardly visible).
        • Please indicate the correct color code for T/T and C/C in Fig 3a and b, left panels, which currently doesn't match.
        • Please label the Venn's diagrams comparisons in Sup. Fig. 4b.
        • In the text and legends the Figure items are identified with letters in upper case, in the figures they are in lower case. Please be consistent.
      • e. In Fig. 2D and 2E, the Y axes should start at zero to avoid artificially increasing the visual differences. If there is a strong reason not to do so (I don't see any here), the Y axis should be clearly interrupted to avoid confusion.
      • f. In the introduction the authors provide some background about previous work about the potential role of PTK2B/PYK2 in AD pathophysiology. The cited preclinical results suggest that PTK2B activity could have a deleterious effect (references in the manuscript). In contrast, some other reports (PMID: 29803828, 33718872) suggest a protective effect of PTK2B/PYK2. Because the evidence in the current manuscript suggests that the risk-associated SNP results in a decreased function of PTK2B/PYK2 (through decreased levels), at least in cells of the macrophage lineage, the authors could broaden their discussion to include these results.

      Significance

      Advance: Late onset Alzheimer's disease is a major medical issue. It has a complex genetic risk component with many associated loci identified in GWAS. Most of these have only a small individual impact on the risk. One of the SNPs associated with increased risk (rs28834970) is located in an intron of the PTK2B gene. Although various reports have investigated the role of the PTK2B gene product, the tyrosine kinase PYK2, in several AD models, the possible link with rs28834970, is unclear.

      An important point is to determine whether TC SNP corresponding to rs28834970 alters PTK2B expression and how it does so. An alternative hypothesis could be that the SNP has a strong linkage disequilibrium with an unidentified allele in human populations that could be responsible for AD risk. The current manuscript is a significant step forward in addressing that question. By generating a biallelic C/C SNP mutation in a human IPSC line the current study allows to eliminate such linked contribution.

      The strength of the manuscript is to show an effect on chromatin accessibility, CEBP binding and possibly PTK2B transcripts. It also provides interesting evidence of a broad effect of the C/C mutation on the transcriptome of macrophage lineage cells. In its current form the manuscript presents weaknesses that could be improved. These flaws include issues with the presentation discussed above and the uncomplete demonstration that it is the decrease in PTK2B expression that causes the macrophage/microglia phenotype. If these flaws were overcome the paper would represent a significant advance.

      Audience: The expected audience is specialized in AD with a possible broader range if all weaknesses are addressed.

      Reviewer Expertise: Basic science close to the field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Bello et al look at the SNP rs28834970 associated with Alzheimer's disease (AD), with C being the risk allele, on chromatin accessibility and expression of a nearby gene, PTK2B, in microglia. Their contention is that the single SNP affects chromatin accessibility and binding of the transcription factor CEBP[beta] in an intronic region of PTK2B and thereby affects PTKB expression. I had a few questions that I think are critical to be addressed. Please note that my numbering of panels is based on the figures, not the legends, which do not seem to quite agree with each other. There are also some figure legends that say "IFNg" while the figures say "LPS", which should be fixed.

      The abstract says that editing a line that is homozygous for protective alleles to homozygous for risk results in "subtle downregulation of PTK2B expression". It isn't clear to me that the presented data fully supports this contention, which is central to the argument of the paper. In figure 2e, the authors show in both RNAseq and ddPCR that there is numerically lower PTK2B expression but this is not indicated to be statistically significant by one-way paired ANOVA. If there is no nominally significant difference in the edited lines, compared to the proposed significant differences in lines carrying the full risk haplotype (figure 1), then it would not seem sensible to ascribe the effects to the single edited base pair.

      Given this uncertainty about the overall strength of effect of the single base pair change it would seem important to evaluate the proposed mechanism of CEBPb binding. It wasn't clear whether the ATAC-seq data summarized in the volcano plot in 2C is proposed to be a cause or a consequence of the CEBPb binding change. I am assuming that the 'fold change' estimate here is CC compared to TT, which would be consistent with direction of effect in figure 1, but please clarify.

      In contrast to the subtle effects at PTK2B, the global transcriptional effects in figure 3 look quite strong. Are any of these changes dependent on PTK2B, that is to say, are they mimicked by partial suppression of PTK2B expression or activity?

      Finally, in figure 4, it should be clarified as to why lower expression of PTK2B would be expected to have a detrimental effect on Alzheimer's risk. If understood correctly, and again fixing the figure legends would be helpful, the CC edited lines (risk) have lower chemokine induction than the unedited TT lines.

      Significance

      Going from GWAS hits, which represent blocks of high LD inherited variants, to single functional variants is a difficult problem in human genetics. The current paper attempts to isolate the effect of a single variant within an LD block on IPSC derived macrophages and microglia. This idea might be useful in nominating PTK2B as a therapeutic target for AD, although there is some question in my mind as to direction of effect.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      I would like to thank the reviewers for their comments and interest in the manuscript and the study.

      Referee #1

      1. I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning.

      Response: The directional positioning of CTCF-binding sites at chromatin interaction sites was analyzed by CRISPR experiment (Guo Y et al. Cell 2015). We found that the machine learning and statistical analysis showed the same directional bias of the CTCF-binding motif sequence at chromatin interaction sites as the experimental analysis of Guo Y et al. (lines 229-245, Figure 3b, c, d and Table 1). Since CTCF is involved in different biological functions (Braccioli L et al. Essays Biochem. 2019 ResearchGate webpage), the directional bias of binding sites may be reduced in all binding sites including those at chromatin interaction sites (lines 68-73). In our study, we investigated the DNA-binding sites of proteins using the ChIP-seq data of DNA-binding proteins and DNase-seq data. We also confirmed that the DNA-binding sites of SMC3 and RAD21, which tend to be found in chromatin loops with CTCF, also showed the same directional bias as CTCF by the computational analysis.

      1. Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure.

      Response: Following the reviewer's advice, I performed the same analysis with the DNA-binding sites that do no overlap with the DNA-binding sites of CTCF and cohesin (RAD21 and SMC3) (Fig. 6 and Supplementary Fig. 4). The result showed the same tendency in the distribution of DNA-binding sites. The height of a peak on the graph became lower for some DNA-binding proteins after removing the DNA-binding sites that overlapped with those of CTCF and cohesin. I have added the following sentence on lines 427 and 817: For the insulator-associated DBPs other than CTCF, RAD21, and SMC3, the DNA-binding sites that do not overlap with those of CTCF, RND21, and SMC3 were used to examine their distribution around interaction sites.

      1. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      Response: As suggested by the reviewer, I have added the insulator scores and boundary sites from the 4D nucleome data portal as tracks in the UCSC genome browser. The insulator scores seem to correspond to some extent to the H3K27me3 histone marks from ChIP-seq (Fig. 4a and Supplementary Fig. 3). The direction of DNA-binding sites on the genome can be shown with different colors (e.g. red and green), but the directionality of insulator-associated DNA-binding sites is their overall tendency, and it may be difficult to notice the directionality from each binding site because the directionality may be weaker than that of CTCF, RAD21, and SMC3 as shown in Table 1 and Supplementary Table 2.

      I found that the CTCF binding sites examined by a wet experiment in the previous study may not always overlap with the boundary sites of chromatin interactions from Micro-C assay (Guo Y et al. Cell 2015). The chromatin interaction data do not include all interactions due to the high sequencing cost of the assay. The number of the boundary sites may be smaller than that of CTCF binding sites acting as insulators and/or some of the CTCF binding sites may not be locate in the boundary sites. It may be difficult for the boundary location algorithm to identify a short boundary location. Due to the limitations of the chromatin interaction data, I planned to search for insulator-associated DNA-binding proteins without using chromatin interaction data in this study. I have added the statistical summary of the analysis in lines 364-387 as follows: Overall, among 20,837 DNA-binding sites of the 97 insulator-associated proteins found at insulator sites identified by H3K27me3 histone modification marks (type 1 insulator sites), 1,315 (6%) overlapped with 264 of 17,126 5kb long boundary sites, and 6,137 (29%) overlapped with 784 of 17,126 25kb long boundary sites in HFF cells. Among 5,205 DNA-binding sites of the 97 insulator-associated DNA-binding proteins found at insulator sites identified by H3K27me3 histone modification marks and transcribed regions (type 2 insulator sites), 383 (7%) overlapped with 74 of 17,126 5-kb long boundary sites, 1,901 (37%) overlapped with 306 of 17,126 25-kb long boundary sites. Although CTCF-binding sites separate active and repressive domains, the limited number of DNA-binding sites of insulator-associated proteins found at type 1 and 2 insulator sites overlapped boundary sites identified by chromatin interaction data. Furthermore, by analyzing the regulatory regions of genes, the DNA-binding sites of the 97 insulator-associated DNA-binding proteins were found (1) at the type 1 insulator sites (based on H3K27me3 marks) in the regulatory regions of 3,170 genes, (2) at the type 2 insulator sites (based on H3K27me3 marks and gene expression levels) in the regulatory regions of 1,044 genes, and (3) at insulator sites as boundary sites identified by chromatin interaction data in the regulatory regions of 6,275 genes. The boundary sites showed the highest number of overlaps with the DNA-binding sites. Comparing the insulator sites identified by (1) and (3), 1,212 (38%) genes have both types of insulator sites. Comparing the insulator sites between (2) and (3), 389 (37%) genes have both types of insulator sites. From the comparison of insulator and boundary sites, we found that (1) or (2) types of insulator sites overlapped or were close to boundary sites identified by chromatin interaction data.

      1. The suggested alternative transcripts function, also highlighted in the manuscripts abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.

      Response: According to the reviewer's comment, I performed the genome-wide analysis of alternative transcripts where the DNA-binding sites of insulator-associated proteins are located near splicing sites. The DNA-binding sites of insulator-associated DNA-binding proteins were found within 200 bp centered on splice sites more significantly than the other DNA-binding proteins (Fig. 4e and Table 2). I have added the following sentences on lines 397 - 404: We performed the statistical test to estimate the enrichment of insulator-associated DNA-binding sites compared to the other DNA-binding proteins, and found that the insulator-associated DNA-binding sites were significantly more abundant at splice sites than the DNA-binding sites of the other proteins (Fig 4e and Table 2; Mann‒Whitney U test, p value 5. Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.

      Response: I believe that the Figure 1 would help researchers in other fields who are not familiar with biological phenomena and functions to understand the study. More explanation has been included in the Figures and legends of Figs. 4 and 5 to help readers outside the immediate research field understand the figures.

      1. Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Response: Reviewer #2's comments would be related to this comment. I have introduced a more detailed explanation of the method in the Results section, as shown in the responses to Reviewer #2's comments.

      Referee #2

      1. Introduction, line 95: CTCF appears two times, it seems redundant.

      Response: On lines 91-93, I deleted the latter CTCF from the sentence "and examined the directional bias of DNA-binding sites of CTCF and insulator-associated DBPs, including those of known DBPs such as RAD21 and SMC3".

      1. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?

      Response: Although CTCF is known to be the main insulator protein in vertebrates, we found that 97 DNA-binding proteins including CTCF and cohesin are associated with insulator sites by modifying and developing a machine learning method to search for insulator-associated DNA-binding proteins. Most of the insulator-associated DNA-binding proteins showed the directional bias of DNA-binding motifs, suggesting that the directional bias is associated with the insulator.

      I have added the sentence in lines 96-99 as follows: Furthermore, statistical testing the contribution scores between the directional and non-directional DNA-binding sites of insulator-associated DBPs revealed that the directional sites contributed more significantly to the prediction of gene expression levels than the non-directional sites. I have revised the statement in lines 101-110 as follows: To validate these findings, we demonstrate that the DNA-binding sites of the identified insulator-associated DBPs are located within potential insulator sites, and some of the DNA-binding sites in the insulator site are found without the nearby DNA-binding sites of CTCF and cohesin. Homologous and heterologous insulator-insulator pairing interactions are orientation-dependent, as suggested by the insulator-pairing model based on experimental analysis in flies. Our method and analyses contribute to the identification of insulator- and chromatin-associated DNA-binding sites that influence EPIs and reveal novel functional roles and molecular mechanisms of DBPs associated with transcriptional condensation, phase separation and transcriptional regulation.

      1. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS.

      Response: On lines 121-124, to explain the procedure for the SNP of an eQTL, I have added the sentence in the Methods: "If a DNA-binding site was located within a 100-bp region around a single-nucleotide polymorphism (SNP) of an eQTL, we assumed that the DNA-binding proteins regulated the expression of the transcript corresponding to the eQTL".

      1. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details.

      Response: On line 119, I have included the explanation of the eQTL dataset of GTEx v8 as follows: " The eQTL data were derived from the GTEx v8 dataset, after quality control, consisting of 838 donors and 17,382 samples from 52 tissues and two cell lines". On lines 681 and 865, I have added the filename of the eQTL data "(GTEx_Analysis_v8_eQTL.tar)".

      1. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.

      Response: The reviewer would mention Figure 2, not Figure 1. If so, the matrices in panels a and b in Figure 2 are equivalent. I have shown it in the figure: The same figure in panel a is rotated 90 degrees to the right. The green boxes in the matrix show the regions with the ChIP-seq peak of a DNA-binding protein overlapping with a SNP of an eQTL. I used eQTL data to associate a gene with a ChIP-seq peak that was more than 2 kb upstream and 1 kb downstream of a transcriptional start site of a gene. For each gene, the matrix was produced and the gene expression levels in cells were learned and predicted using the deep learning method. I have added the following sentences to explain the method in lines 133 - 139: Through the training, the tool learned to select the binding sites of DNA-binding proteins from ChIP-seq assays that were suitable for predicting gene expression levels in the cell types. The binding sites of a DNA-binding protein tend to be observed in common across multiple cell and tissue types. Therefore, ChIP-seq data and eQTL data in different cell and tissue types were used as input data for learning, and then the tool selected the data suitable for predicting gene expression levels in the cell types, even if the data were not obtained from the same cell types.

      1. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?

      Response: As suggested by the reviewer, to help readers understand the observation, I have added Supplementary Fig. S4c to show the distribution of DNA-binding sites of "CTCF, RAD21, and SMC3" and "BACH2, FOS, ATF3, NFE2, and MAFK" around chromatin interaction sites. I have modified the following sentence to indicate the figure on line 493: Although a DNA-binding-site distribution pattern around chromatin interaction sites similar to those of CTCF, RAD21, and SMC3 was observed for DBPs such as BACH2, FOS, ATF3, NFE2, and MAFK, less than 1% of the DNA-binding sites of the latter set of DBPs colocalized with CTCF, RAD21, or SMC3 in a single bin (Fig. S4c).

      In Aljahani A et al. Nature Communications 2022, we find that depletion of cohesin causes a subtle reduction in longer-range enhancer-promoter interactions and that CTCF depletion can cause rewiring of regulatory contacts. Together, our data show that loop extrusion is not essential for enhancer-promoter interactions, but contributes to their robustness and specificity and to precise regulation of gene expression. Goel VY et al. Nature Genetics 2023 mentioned in the abstract: Microcompartments frequently connect enhancers and promoters and though loss of loop extrusion and inhibition of transcription disrupts some microcompartments, most are largely unaffected. These results suggested that chromatin loops can be driven by other DBPs independent of the known CTCF/Cohesin.

      FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates (Ji D et al. Molecular Cell 2024). CTCF have also found to form transcriptional condensate and phase separation (Lee R et al. Nucleic acids research 2022). FOS was found to be an insulator-associated DNA-binding protein in this study and is potentially involved in chromatin remodeling, transcription condensation, and phase separation with the other factors such as BACH2, ATF3, NFE2 and MAFK. I have added the following sentence on line 548: FOXA1 pioneer factor functions as an initial chromatin-binding and chromatin-remodeling factor and has been reported to form biomolecular condensates.

      1. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?

      Response: Goel VY et al. Nature Genetics 2023 identified highly nested and focal interactions through region capture Micro-C, which resemble fine-scale compartmental interactions and are termed microcompartments. In the section titled "Most microcompartments are robust to loss of loop extrusion," the researchers noted that a small proportion of interactions between CTCF and cohesin-bound sites exhibited significant reductions in strength when cohesin was depleted. In contrast, the majority of microcompartmental interactions remained largely unchanged under cohesin depletion. Our findings indicate that most P-P and E-P interactions, aside from a few CTCF and cohesin-bound enhancers and promoters, are likely facilitated by a compartmentalization mechanism that differs from loop extrusion. We suggest that nested, multiway, and focal microcompartments correspond to small, discrete A-compartments that arise through a compartmentalization process, potentially influenced by factors upstream of RNA Pol II initiation, such as transcription factors, co-factors, or active chromatin states. It follows that if active chromatin regions at microcompartment anchors exhibit selective "stickiness" with one another, they will tend to co-segregate, leading to the development of nested, focal interactions. This microphase separation, driven by preferential interactions among active loci within a block copolymer, may account for the striking interaction patterns we observe.

      The authors of the paper proposed several mechanisms potentially involved in microcompartments. These mechanisms may be involved in looping with insulator function. Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently (Hsieh TS et al. Nature Genetics 2022). Among the identified insulator-associated DNA-binding proteins, Maz and MyoD1 form loops without CTCF (Xiao T et al. Proc Natl Acad Sci USA 2021 ; Ortabozkoyun H et al. Nature genetics 2022 ; Wang R et al. Nature communications 2022). I have added the following sentences on lines 563-567: Another group reported that enhancer-promoter interactions and transcription are largely maintained upon depletion of CTCF, cohesin, WAPL or YY1. Instead, cohesin depletion decreased transcription factor binding to chromatin. Thus, cohesin may allow transcription factors to find and bind their targets more efficiently. I have included the following explanation on lines 574-576: Maz and MyoD1 among the identified insulator-associated DNA-binding proteins form loops without CTCF.

      As for the directionality of CTCF, if chromatin loop anchors have some structural conformation, as shown in the paper entitled "The structural basis for cohesin-CTCF-anchored loops" (Li Y et al. Nature 2020), directional DNA binding would occur similarly to CTCF binding sites. Moreover, cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops (Davidson IF et al. Nature Reviews Molecular Cell Biology 2021). Regarding loop extrusion, the 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions (Guerin TM et al. EMBO Journal 2024). I have added the following sentences on lines 535-539: Cohesin complexes that interact with convergent CTCF sites, that is, the N-terminus of CTCF, might be protected from WAPL, but those that interact with divergent CTCF sites, that is, the C-terminus of CTCF, might not be protected from WAPL, which could release cohesin from chromatin and thus disrupt cohesin-mediated chromatin loops. I have included the following sentences on lines 569-574: The 'loop extrusion' hypothesis is motivated by in vitro observations. The experiment in yeast, in which cohesin variants that are unable to extrude DNA loops but retain the ability to topologically entrap DNA, suggested that in vivo chromatin loops are formed independently of loop extrusion. Instead, transcription promotes loop formation and acts as an extrinsic motor that extends these loops and defines their final positions.

      Another model for the regulation of gene expression by insulators is the boundary-pairing (insulator-pairing) model (Bing X et al. Elife 2024) (Ke W et al. Elife 2024) (Fujioka M et al. PLoS Genetics 2016). Molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies. Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent. I have summarized the model on lines 551-559: Other types of chromatin regulation are also expected to be related to the structural interactions of molecules. As the boundary-pairing (insulator-pairing) model, molecules bound to insulators physically pair with their partners, either head-to-head or head-to-tail, with different degrees of specificity at the termini of TADs in flies (Fig. 7). Although the experiments do not reveal how partners find each other, the mechanism unlikely requires loop extrusion. Homologous and heterologous insulator-insulator pairing interactions are central to the architectural functions of insulators. The manner of insulator-insulator interactions is orientation-dependent.

      1. Do the authors think that the identified DBPs could work in that way as well?

      Response: The boundary-pairing (insulator-pairing) model would be applied to the insulator-associated DNA-binding proteins other than CTCF and cohesin that are involved in the loop extrusion mechanism (Bing X et al. Elife 2024) (Ke W et al. Elife 2024) (Fujioka M et al. PLoS Genetics 2016).

      Liquid-liquid phase separation was shown to occur through CTCF-mediated chromatin loops and to act as an insulator (Lee, R et al. Nucleic Acids Research 2022). Among the identified insulator-associated DNA-binding proteins, CEBPA has been found to form hubs that colocalize with transcriptional co-activators in a native cell context, which is associated with transcriptional condensate and phase separation (Christou-Kent M et al. Cell Reports 2023). The proposed microcompartment mechanisms are also associated with phase separation. Thus, the same or similar mechanisms are potentially associated with the insulator function of the identified DNA-binding proteins. I have included the following information on line 546: CEBPA in the identified insulator-associated DNA-binding proteins was also reported to be involved in transcriptional condensates and phase separation.

      1. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?

      Response: Snead WT et al. Molecular Cell 2019 mentioned that protein post-transcriptional modifications (PTMs) facilitate the control of molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin (Tang X et al. Nature Communications 2024). I found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Supplementary Fig. 2d). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation by PTMs. I have added the following explanation on lines 576-582: Furthermore, protein post-transcriptional modifications (PTMs) facilitate control over the molecular valency and strength of protein-protein interactions. O-GlcNAcylation as a PTM inhibits CTCF binding to chromatin. We found that the identified insulator-associated DNA-binding proteins tend to form a cluster at potential insulator sites (Fig. 4f and Supplementary Fig. 3c). These proteins may interact and actively regulate chromatin interactions, transcriptional condensation, and phase separation through PTMs.

      1. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Response: Structural molecular model of cohesin-CTCF-anchored loops has been published by Li Y et al. Nature 2020. The structural conformation of CTCF and cohesin in the loops would be the cause of the directional bias of CTCF binding sites, which I mentioned in lines 531 - 535 as follows: These results suggest that the directional bias of DNA-binding sites of insulator-associated DBPs may be involved in insulator function and chromatin regulation through structural interactions among DBPs, other proteins, DNAs, and RNAs. For example, the N-terminal amino acids of CTCF have been shown to interact with RAD21 in chromatin loops. To investigate the principles underlying the architectural functions of insulator-insulator pairing interactions, two insulators, Homie and Nhomie, flanking the Drosophila even skipped locus were analyzed. Pairing interactions between the transgene Homie and the eve locus are directional. The head-to-head pairing between the transgene and endogenous Homie matches the pattern of activation (Fujioka M et al. PLoS Genetics 2016).

      Referee #3

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.

      Response: When a protein complex binds to DNA, one protein of the complex binds to the DNA directory, and the other proteins may not bind to DNA. However, the DNA motif sequence bound by the protein may be registered as the DNA-binding motif of all the proteins in the complex. The molecular structure of the complex of CTCF and Cohesin showed that both CTCF and Cohesin bind to DNA (Li Y et al. Nature 2020). I think there is a possibility that if the molecular structure of a protein complex becomes available, the previous recognition of the DNA-binding ability of a protein may be changed. Therefore, I searched the Pfam database for 99 insulator-associated DNA-binding proteins identified in this study. I found that 97 are registered as DNA-binding proteins and/or have a known DNA-binding domain, and EP300 and SIN3A do not directory bind to DNA, which was also checked by Google search. I have added the following explanation in line 249 to indicate direct and indirect DNA-binding proteins: Among 99 insulator-associated DBPs, EP300 and SIN3A do not directory interact with DNA, and thus 97 insulator-associated DBPs directory bind to DNA. I have updated the sentence in line 20 of the Abstract as follows: We discovered 97 directional and minor nondirectional motifs in human fibroblast cells that corresponded to 23 DBPs related to insulator function, CTCF, and/or other types of chromosomal transcriptional regulation reported in previous studies.

      1. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.

      Response: As the reviewer mentioned, I recognize enhancers are relatively small regions. In the paper, I intended to examine further upstream and downstream of promoter regions where enhancers are found. Therefore, I have modified the sentence in lines 917 - 919 of the Fig. 2 legend as follows: Enhancer-gene regulatory interaction regions consist of 200 bins of 10 kbp between -1 Mbp and 1 Mbp region from TSS, not including promoter.

      1. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.

      Response: Following the reviewer's advice, I have added the ChIP-seq data of H3K9me3 as a truck of the UCSC Genome Browser. The distribution of H3K9me3 signal was different from that of H3K27me3 in some regions. I also found the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions and took some screenshots of the UCSC Genome Browser of the regions around the sites in Supplementary Fig. 3b. I have modified the following sentence on lines 962 - 964 in the legend of Fig. 4: a Distribution of histone modification marks H3K27me3 (green color) and H3K9me3 (turquoise color) and transcript levels (pink color) in upstream and downstream regions of a potential insulator site (light orange color). I have also added the following result on lines 348 - 352: The same analysis was performed using H3K9me3 marks, instead of H3K27me3 (Fig. S3b). We found that the distribution of H3K9me3 signal was different from that of H3K27me3 in some regions, and discovered the insulator-associated DNA-binding sites close to the edges of H3K9me3 regions (Fig. S3b).

      1. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Response: The resolution of the Micro-C assay is considered to be 100 bp and above, as the human nucleome core particle contains 145 bp (and 193 bp with linker) of DNA. However, internucleosomal DNA is cleaved by endonuclease into fragments of multiples of 10 nucleotides (Pospelov VA et al. Nucleic Acids Research 1979). Highly nested focal interactions were observed (Goel VY et al. Nature Genetics 2023). Base pair resolution was reported using Micro Capture-C (Hua P et al. Nature 2021). Sub-kilobase (20 bp resolution) chromatin topology was reported using an MNase-based chromosome conformation capture (3C) approach (Aljahani A et al. Nature Communications 2022). On the other hand, Hi-C data was analyzed at 1 kb resolution. (Gu H et al. bioRxiv 2021). If the resolution of Micro-C interactions is at best at 1 kb, the binding sites of a DNA-binding protein will not show a peak around the center of the genomic locations of interaction edges. Each panel shows the number of binding sites of a specific DNA-binding protein at a specific distance from the midpoint of all chromatin interaction edges. I have modified and added the following sentences in lines 585-589: High-resolution chromatin interaction data from a Micro-C assay indicated that most of the predicted insulator-associated DBPs showed DNA-binding-site distribution peaks around chromatin interaction sites, suggesting that these DBPs are involved in chromatin interactions and that the chromatin interaction data has a high degree of resolution. Base pair resolution was reported using Micro Capture-C.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.

      Response: The directional bias of CTCF binding sites was identified by ChIA-pet interactions of CTCF binding sites. The analysis of the contribution scores of DNA-binding sites of proteins considering the binding sites of CTCF as an insulator showed the same tendency of directional bias of CTCF binding sites. In the analysis, to remove the false-positive prediction of DNA-binding sites, I used the binding sites that overlapped with a ChIP-seq peak of the DNA-binding protein. This result suggests that the DNA-binding sites of CTCF obtained by the current analysis have sufficient quality. Therefore, if the accuracy of prediction of DNA-binding sites is improved, althought the number of DNA-binding sites may be different, the overall tendency of the directionality of DNA-binding sites will not change and the results of this study will not change significantly.

      As for the first reference in the reviewer's comment, chromatin interaction data from Micro-C assay does not include all chromatin interactions in a cell or tissue, because it is expensive to cover all interactions. Therefore, it would be difficult to predict all chromatin interactions based on machine learning. As for the second reference in the reviewer's comment, pioneer factors such as FOXA are known to bind to closed chromatin regions, but transcription factors and DNA-binding proteins involved in chromatin interactions and insulators generally bind to open chromatin regions. The search for the DNA-binding motifs is not required in closed chromatin regions.

      1. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Response: In the DeepLIFT paper, the authors explain that DeepLIFT is a method for decomposing the output prediction of a neural network on a specific input by backpropagating the contributions of all neurons in the network to every feature of the input (Shrikumar A et al. ICML 2017). DeepLIFT compares the activation of each neuron to its 'reference activation' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

      Truly explainable AI would be able to find cause and reason, and to make choices and decisions like humans. DeepLIFT does not perform causal inferences. I did not use the term "Explainable AI" in our manuscript, but I briefly explained it in Discussion. I have added the following explanation in lines 615-620: AI (Artificial Intelligence) is considered as a black box, since the reason and cause of prediction are difficult to know. To solve this issue, tools and methods have been developed to know the reason and cause. These technologies are called Explainable AI. DeepLIFT is considered to be a tool for Explainable AI. However, DeepLIFT does not answer the reason and cause for a prediction. It calculates scores representing the contribution of the input data to the prediction.

      Furthermore, to improve the readability of the manuscript, I have included the following explanation in lines 159-165: we computed DeepLIFT scores of the input data (i.e., each binding site of the ChIP-seq data of DNA-binding proteins) in the deep leaning analysis on gene expression levels. DeepLIFT compares the importance of each input for predicting gene expression levels to its 'reference or background level' and assigns contribution scores according to the difference. DeepLIFT calculates a metric to measure the difference between an input and the reference of the input.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Osato and Hamada propose a systematic approach to identify DNA binding proteins that display directional binding. They used a modified Deep Learning method (DEcode) to investigate binding profiles of 1356 DBP from GTRD database at promoters (30 of 100bp bins around TSS) and enhancers (200 bins of 10Kb around eSNPs) and use this to predict expression of 25,071 genes in Fibroblasts, Monocytes, HMEC and NPC. This method achieves a good prediction power (Spearman correlation between predicted and actual expression of 0.74). They then use PIQ, and overlap predicted binding sites with actual ChIP-seq data to investigate the motifs of TFs that are controlling gene expression. They find 99 insulator proteins showing either a specific directional bias or minor non-directional bias, corresponding to 23 DBP previously reported to have insulator function. Of the 23 proteins they identify as regulating enhancer promoter interactions, 13 are associated with CTCF. They also show that there are significantly more insulator proteins binding sites at borders of polycomb domains, transcriptionally active or boundary regions based on chromatin interactions than other proteins.

      Major Comments:

      1. Some of these TFs do not have specific direct binding to DNA (P300, Cohesin). Since the authors are using binding motifs in their analysis workflow, I would remove those from the analysis.
      2. I am not sure if I understood correctly, by why do the authors consider enhancers spanning 2Mb (200 bins of 10Kb around eSNPs)? This seems wrong. Enhancers are relatively small regions (100bp to 1Kb) and only a very small subset form super enhancers.
      3. I think the H3K27me3 analysis was very good, but I would have liked to see also constitutive heterochromatin as well, so maybe repeat the analysis for H3K9me3.
      4. I was not sure I understood the analysis in Figure 6. The binding site is with 500bp of the interaction site, but micro-C interactions are at best at 1Kb resolution. They say they chose the centre of the interaction site, but we don't know exactly where there is the actual interaction. Also, it is not clear what they measure. Is it the number of binding sites of a specific or multiple DBP insulator proteins at a specific distance from this midpoint that they recover in all chromatin loops? Maybe I am missing something. This analysis was not very clear.

      Minor comments:

      1. PIQ does not consider TF concentration. Other methods do that and show that TF concentration improves predictions (e.g., https://www.biorxiv.org/content/10.1101/2023.07.15.549134v2 or https://pubmed.ncbi.nlm.nih.gov/37486787/). The authors should discuss how that would impact their results.
      2. DeepLIFT is a good approach to interpret complex structures of CNN, but is not truly explainable AI. I think the authors should acknowledge this.

      Referee Cross-Commenting

      I would like to mention that I agree with the comments of reviewers 1 and 2.

      Significance

      General assessment:

      This is the first study to my knowledge that attempts to use Deep Learning to identify insulators and directional biases in binding. One of the limitations is that no additional methods were used to show that these DBP have directional binding bias. It is not necessarily to employ additional methods, but it would definitely strengthen the paper.

      Advancements:

      This is a useful catalogue of potential DNA binding proteins of interest, beyond just CTCF. Some known TFs are there, but also new ones are found.

      Audience:

      Basic research mainly, with particular focus on chromatin conformation and TF binding fields.

      My expertise:

      ML/AI methods in genomics, TF binding models, epigenetics and 3D chromatin interactions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see my points below). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the following list.

      Also, I encourage the authors to integrate the current presentation of the data with other (published) data about chromatin architecture, to make more robust the claims and go deeper into the biological implications of the current work. Se my list below.

      It follows a specific list of relevant points to be addressed:

      Specific points:

      1. Introduction, line 95: CTCF appears two times, it seems redundant;
      2. Introduction, lines 99-103: Please stress better the novelty of the work. What is the main focus? The new identified DPBs or their binding sites? What are the "novel structural and functional roles of DBPs" mentioned?
      3. Results, line 111: How do the SNPs come into the procedure? From the figures it seems the input is ChIP-seq peaks of DNBPs around the TSS;
      4. Again, are those SNPs coming from the different cell lines? Or are they from individuals w.r.t some reference genome? I suggest a general restructuring of this part to let the reader understand more easily. One option could be simplifying the details here or alternatively including all the necessary details;
      5. Figure 1: panel a and b are misleading. Is the matrix in panel a equivalent to the matrix in panel b? If not please clarify why. Maybe in b it is included the info about the SNPs? And if yes, again, what is then difference with a.
      6. Line 386-388: could the author investigate in more detail this observation? Does it mean that loops driven by other DBPs independent of the known CTCF/Cohesin? Could the author provide examples of chromatin structural data e.g. MicroC?
      7. In general, how the presented results are related to some models of chromatin architecture, e.g. loop extrusion, in which it is integrated convergent CTCF binding sites?
      8. Do the authors think that the identified DBPs could work in that way as well?
      9. Also, can the authors comment about the mechanisms those newly identified DBPs mediate contacts by active processes or equilibrium processes?
      10. Can the author provide some real examples along with published structural data (e.g. the mentioned micro-C data) to show the link between protein co-presence, directional bias and contact formation?

      Significance

      In this work, the authors describe a deep learning computational tool to identity binding motifs of DNA binding proteins associated to insulators that led to the discovery of 99 motifs related to insulation. This is in turn related to chromatin architecture and highlight the importance of directional bias in order to form chromatin loops.

      In general, chromatin organization is an important topic in the context of a constantly expanding research field. Therefore, the work is timely and could be useful for the community. The paper appears overall well written and the figures look clear and of good quality. Nevertheless, there are some aspects to be clarified and better explored to make stronger conclusions. In particular, there are some aspects to clarify in the text about the Machine Learning procedure (see list of specific points). In addition, I have some general questions about the biological implications of the discussed findings, listed in detail in the above reported points.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study by Osato and Hamada aims at computationally identifying a set of novel putative insulator-associated DNA binding proteins (DBPs) via estimation of their contribution to the expression of genes in the same chromosome region of their binding sites (+- 1Mbp from TSS). To achieve this, the authors leverage a deep learning architecture already published via which ChIP-seq peaks of DBPs in the TSS of a given gene are used to predict its expression level in four human cell lines.

      Building on this, the authors used another tool called DeepLIFT to evaluate the weight of each DBP binding site on the final gene expression value. Hence they made the assumption that if a given DBP had an insulator function they could restrict the prediction of the gene's expression to the region included between pairs of that DBP binding sites, and evaluate the pair's motif directionality bias in the distribution of weights. They exemplify their approach's validity by the fact that they can predict the known directionality bias of CTCF/cohesin-bound sites as the highest of the lot, with the F-R orientation of the pairs the most enriched, recapitulating what already known in literature: i.e., that F-R chromatin interaction peaks are the most enriched. In addition, they find several new DBPs showing significant directionality bias; hence they could be candidates for insulation activity. They then provide correlation between these putative insulator binding sites and sites of transition between euchromatin and heterochromatin by independently using histone mark and gene expression datasets. This, of course, is not surprising because (a) there is insulation between regions with heterotypic chromatin identities, and (b) it was already known from the first papers describing insulated chromatin domains that their boundaries were well-enriched for active transcription and transcriptional regulators (e.g., Dixon et al, Nature 2012).

      Finally, they use chromatin interaction (looping) sites to check the overlap between CTCF and all other DBPs and define a subset of putative insulator DBPs not overlapping CTCF peaks, suggesting potentially new insulatory mechanisms. These factors were all known transcriptional activators, but this part of the findings carry most of the novelty in the work and have the potential of opening up new directions for research in chromatin organization.

      Overall, the methodology applied here is adequate, clear, and reproducible. The major issue, in our view, is that the entire manuscript's findings relies on the usage of deepLIFT, a tool which was not benchmarked previously or by the current study. In fact, deepLIFT is public as regards its code, and also appears as a preprint from 2017 on biorXiv and published in the Proceedings of Machine Learning Research conference. Also, this key tool was developed by the Kundaje lab (who produce high quality alogrithms), and not by the authors. Therefore, the manuscript is predominantly based on the execution of existing workflows to publicly-available data. This does not take anything away from the interesting question posed here, but at the same time does not provide the community with any new algorithm/workflow.

      Finally, although I appreciate that the authors are purely computational and have likely no capacity for experimental validation of their claims of new DBPs having insulator roles, I would assume that there are RNA-seq and/or ChIP-seq data out there produced after knockdown of one or more of these DBPs that show directional positioning. Using this kind of data, effects on gene expression can at least be tested in regard to the authors' predictions. Moreover, in terms of validation, Figure 6 should be expanded to incorporate analysis of DBPs not overlapping CTCF/cohesin in chromatin interaction data that is important and potentially more interesting than the simple DBPs enrichment reported in the present form of the figure. Critically, I would like to see use of Micro-C/Hi-C data and ChIP-seq from these factors, where insulation scores around their directionally-bound sites show some sort of an effect like that presumed by the authors - and many such datasets are publicly-available and can be put to good use here.

      As secondary issues, we would point out that:

      • The suggested alternative transcripts function, also highlighted in the manuscript;s abstract, is only supported by visual inspection of a few cases for several putative DBPs. I believe this is insufficient to support what looks like one of the major claims of the paper when reading the abstract, and a more quantitative and genome-wide analysis must be adopted, although the authors mention it as just an 'observation'.
      • Figure 1 serves no purpose in my opinion and can be removed, while figures can generally be improved (e.g., the browser screenshots in Figs 4 and 5) for interpretability from readers outside the immediate research field.
      • Similarly, the text is rather convoluted at places and should be re-approached with more clarity for less specialized readers in mind.

      Significance

      The scientific novelty of the work lies primarily in the identification of a set of DBPs that are proposed to confer insulator activity genome-wide. This has been long sought after in human data (whilst it is well understood and defined in Drosophila). The authors produce a quantitative ranking of the putative insulation effect of these DBPs and, most importantly, go on to identify a smaller subset that are apparently non-overlapping with anchors of CTCF-cohesin loop anchors; the presence of strong motif orientation biases in many DBPs can also be of broad interest, especially those that cannot be trivially ascribable to the loop extrusion process.

      However, although these findings open the way for speculation on multiple insulation mechanisms via proteins with multiple regulatory functions, the manuscript provide no experimental or computational means to test the proposed roles of these DBPs - and, as such, this limits the potential impact of the work and mostly targets researchers in the field of genome organization that can test these findings. Having said this, if validated, this work can significantly broaden our understanding of how chromatin is organized in 3D nuclear space.

      I typically identify myself to the authors: A. Papantonis, expertise in 3D genome architecture, chromatin biology, and genomics/bioinformatics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Authors has provided a mechanism by which how presence of truncated P53 can inactivate function of full length P53 protein. Authors proposed this happens by sequestration of full length P53 by truncated P53.

      In the study, performed experiments are well described.

      My area of expertise is molecular biology/gene expression, and I have tried to provide suggestions on my area of expertise. The study has been done mainly with overexpression system and I have included few comments which I can think can be helpful to understand effect of truncated P53 on endogenous wild type full length protein. Performing experiments on these lines will add value to the observation according to this reviewer.

      Major comments:

      1. What happens to endogenous wild type full length P53 in the context of mutant/truncated isoforms, that is not clear. Using a P53 antibody which can detect endogenous wild type P53, can authors check if endogenous full length P53 protein is also aggregated as well? It is hard to differentiate if aggregation of full length P53 happens only in overexpression scenario, where lot more both of such proteins are expressed. In normal physiological condition P53 expression is usually low, tightly controlled and its expression get induced in altered cellular condition such as during DNA damage. So, it is important to understand the physiological relevance of such aggregation, which could be possible if authors could investigate effect on endogenous full length P53 following overexpression of mutant isoforms. Response: Thank you very much for your insightful comments. 1) To address "what happens to endogenous wild-type full-length P53 in the context of mutant/truncated isoforms," we employed a human A549 cell line expressing endogenous wild-type p53 under DNA damage conditions such as an etoposide treatment1. We choose the A549 cell line since similar to H1299, it is a lung cancer cell line (www.atcc.org). For comparison, we also transfected the cells with 2 μg of V5-tagged plasmids encoding FLp53 and its isoforms Δ133p53 and Δ160p53. As shown in Figure R1A, lanes 1 and 2, endogenous p53 expression, remained undetectable in A549 cells despite etoposide treatment, which limits our ability to assess the effects of the isoforms on the endogenous wild-type FLp53. We could, however, detect the V5-tagged FLp53 expressed from the plasmid using anti-V5 (rabbit) as well as with anti-DO-1 (mouse) antibody (Figure R1). The latter detects both endogenous wild-type p53 and the V5-tagged FLp53 since the antibody epitope is within the N-terminus (aa 20-25). This result supports the reviewer's comment regarding the low level of expression of endogenous p53 that is insufficient for detection in our experiments. (Figure R1 is included in the file "RC-2024-02608 Figures of Response to Reviewer.)__

      In summary, in line with the reviewer's comment that 'under normal physiological conditions p53 expression is usually low,' we could not detect p53 with an anti-DO-1 antibody. Thus, we proceeded with V5/FLAG-tagged p53 for detection of the effects of the isoforms on p53 stability and function. We also found that protein expression in H1299 cells was more easily detectable than in A549 cells (Compare Figures R1A and B). Thus, we decided to continue with the H1299 cells (p53-null), which would serve as a more suitable model system for this study.

      2) We agree with the reviewer that 'It is hard to differentiate if aggregation of full-length p53 happens only in overexpression scenario'. However, it is not impossible to imagine that such aggregation of FLp53 happens under conditions when p53 and its isoforms are over-expressed in the cell. Although the exact physiological context is not known and beyond the scope of the current work, our results indicate that at higher expression, p53 isoforms drive aggregation of FLp53. Given the challenges of detecting endogenous FLp53, we had to rely on the results obtained with plasmid mediated expression of p53 and its isoforms in p53-null cells.

      Can presence of mutant P53 isoforms can cause functional impairment of wild type full length endogenous P53? That could be tested as well using similar ChIP assay authors has performed, but instead of antibody against the Tagged protein if the authors could check endogenous P53 enrichment in the gene promoter such as P21 following overexpression of mutant isoforms. May be introducing a condition such as DNA damage in such experiment might help where endogenous P53 is induced and more prone to bind to P53 target such as P21.

      Response: Thank you very much for your valuable comments and suggestions. To investigate the potential functional impairment of endogenous wild-type p53 by p53 isoforms, we initially utilized A549 cells (p53 wild-type), aiming to monitor endogenous wild-type p53 expression following DNA damage. However, as mentioned and demonstrated in Figure R1, endogenous p53 expression was too low to be detected under these conditions, making the ChIP assay for analyzing endogenous p53 activity unfeasible. Thus, we decided to utilize plasmid-based expression of FLp53 and focus on the potential functional impairment induced by the isoforms.

      3. On similar lines, authors described:

      "To test this hypothesis, we escalated the ratio of FLp53 to isoforms to 1:10. As expected, the activity of all four promoters decreased significantly at this ratio (Figure 4A-D). Notably, Δ160p53 showed a more potent inhibitory effect than Δ133p53 at the 1:5 ratio on all promoters except for the p21 promoter, where their impacts were similar (Figure 4E-H). However, at the 1:10 ratio, Δ133p53 and Δ160p53 had similar effects on all transactivation except for the MDM2 promoter (Figure 4E-H)."

      Again, in such assay authors used ratio 1:5 to 1:10 full length vs mutant. How authors justify this result in context (which is more relevant context) where one allele is Wild type (functional P53) and another allele is mutated (truncated, can induce aggregation). In this case one would except 1:1 ratio of full-length vs mutant protein, unless other regulation is going which induces expression of mutant isoforms more than wild type full length protein. Probably discussing on these lines might provide more physiological relevance to the observed data.

      Response: Thank you for raising this point regarding the physiological relevance of the ratios used in our study. 1) In the revised manuscript (lines 193-195), we added in this direction that "The elevated Δ133p53 protein modulates p53 target genes such as miR34a and p21, facilitating cancer development2, 3. To mimic conditions where isoforms are upregulated relative to FLp53, we increased the ratios to 1:5 and 1:10." This approach aims to simulate scenarios where isoforms accumulate at higher levels than FLp53, which may be relevant in specific contexts, as also elaborated above.

      2) Regarding the issue of protein expression, where one allele is wild-type and the other is isoform, this assumption is not valid in most contexts. First, human cells have two copies of TPp53 gene (one from each parent). Second, the TP53 gene has two distinct promoters: the proximal promoter (P1) primarily regulates FLp53 and ∆40p53, whereas the second promoter (P2) regulates ∆133p53 and ∆160p534, 5. Additionally, ∆133TP53 is a p53 target gene6, 7 and the expression of Δ133p53 and FLp53 is dynamic in response to various stimuli. Third, the expression of p53 isoforms is regulated at multiple levels, including transcriptional, post-transcriptional, translational, and post-translational processing8. Moreover, different degradation mechanisms modify the protein level of p53 isoforms and FLp538. These differential regulation mechanisms are regulated by various stimuli, and therefore, the 1:1 ratio of FLp53 to ∆133p53 or ∆160p53 may be valid only under certain physiological conditions. In line with this, varied expression levels of FLp53 and its isoforms, including ∆133p53 and ∆160p53, have been reported in several studies3, 4, 9, 10.

      3) In our study, using the pcDNA 3.1 vector under the human cytomegalovirus (CMV) promoter, we observed moderately higher expression levels of ∆133p53 and ∆160p53 relative to FLp53 (Figure R1B). This overexpression scenario provides a model for studying conditions where isoform accumulation might surpass physiological levels, impacting FLp53 function. By employing elevated ratios of these isoforms to FLp53, we aim to investigate the potential effects of isoform accumulation on FLp53.

      4. Finally does this altered function of full length P53 (preferably endogenous one) in presence of truncated P53 has any phenotypic consequence on the cells (if authors choose a cell type which is having wild type functional P53). Doing assay such as apoptosis/cell cycle could help us to get this visualization.

      Response: Thank you for your insightful comments. In the experiment with A549 cells (p53 wild-type), endogenous p53 levels were too low to be detected, even after DNA damage induction. The evaluation of the function of endogenous p53 in the presence of isoforms is hindered, as mentioned above. In the revised manuscript, we utilized H1299 cells with overexpressed proteins for apoptosis studies using the Caspase-Glo® 3/7 assay (Figure 7). This has been shown in the Results section (lines 254-269). "The Δ133p53 and Δ160p53 proteins block pro-apoptotic function of FLp53.

      One of the physiological read-outs of FLp53 is its ability to induce apoptotic cell death11. To investigate the effects of p53 isoforms Δ133p53 and Δ160p53 on FLp53-induced apoptosis, we measured caspase-3 and -7 activities in H1299 cells expressing different p53 isoforms (Figure 7). Caspase activation is a key biochemical event in apoptosis, with the activation of effector caspases (caspase-3 and -7) ultimately leading to apoptosis12. The caspase-3 and -7 activities induced by FLp53 expression was approximately 2.5 times higher than that of the control vector (Figure 7). Co-expression of FLp53 and the isoforms Δ133p53 or Δ160p53 at a ratio of 1: 5 significantly diminished the apoptotic activity of FLp53 (Figure 7). This result aligns well with our reporter gene assay, which demonstrated that elevated expression of Δ133p53 and Δ160p53 impaired the expression of apoptosis-inducing genes BAX and PUMA (Figure 4G and H). Moreover, a reduction in the apoptotic activity of FLp53 was observed irrespective of whether Δ133p53 or Δ160p53 protein was expressed with or without a FLAG tag (Figure 7). This result, therefore, also suggests that the FLAG tag does not affect the apoptotic activity or other physiological functions of FLp53 and its isoforms. Overall, the overexpression of p53 isoforms Δ133p53 and Δ160p53 significantly attenuates FLp53-induced apoptosis, independent of the protein tagging with the FLAG antibody epitope."

      **Referees cross-commenting**

      I think the comments from the other reviewers are very much reasonable and logical.

      Especially all 3 reviewers have indicated, a better way to visualize the aggregation of full-length wild type P53 by truncated P53 (such as looking at endogenous P53# by reviewer 1, having fluorescent tag #by reviewer 2 and reviewer 3 raised concern on the FLAG tag) would add more value to the observation.

      Response: Thank you for these comments. The endogenous p53 protein was undetectable in A549 cells induced by etoposide (Figure R1A). Therefore, we conducted experiments using FLAG/V5-tagged FLp53. To avoid any potential side effects of the FLAG tag on p53 aggregation, we introduced untagged p53 isoforms in the H1299 cells and performed subcellular fractionation. Our revised results, consistent with previous FLAG-tagged p53 isoforms findings, demonstrate that co-expression of untagged isoforms with FLAG-tagged FLp53 significantly induced the aggregation of FLAG-FLp53, while no aggregation was observed when FLAG-tagged FLp53 was expressed alone (Supplementary Figure 6). These results clearly indicate that the FLAG tag itself does not contribute to protein aggregation.

      Additionally, we utilized the A11 antibody to detect protein aggregation, providing additional validation (Figure R3). Given that the fluorescent proteins (~30 kDa) are substantially bigger than the tags used here (~1 kDa) and may influence oligomerization (especially GFP), stability, localization, and function of p53 and its isoforms, we avoided conducting these vital experiments with such artificial large fusions.

      Reviewer #1 (Significance (Required)):

      The work in significant, since it points out more mechanistic insight how wild type full length P53 could be inactivated in the presence of truncated isoforms, this might offer new opportunity to recover P53 function as treatment strategies against cancer.

      Response: Thank you for your insightful comments. We appreciate your recognition of the significance of our work in providing mechanistic insights into how wild-type FLp53 can be inactivated by truncated isoforms. We agree that these findings have potential for exploring new strategies to restore p53 function as a therapeutic approach against cancer.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Zhao and colleagues presents a novel and compelling study on the p53 isoforms, Δ133p53 and Δ160p53, which are associated with aggressive cancer types. The main objective of the study was to understand how these isoforms exert a dominant negative effect on full-length p53 (FLp53). The authors discovered that the Δ133p53 and Δ160p53 proteins exhibit impaired binding to p53-regulated promoters. The data suggest that the predominant mechanism driving the dominant-negative effect is the co-aggregation of FLp53 with Δ133p53 and Δ160p53.

      This study is innovative, well-executed, and supported by thorough data analysis. However, the authors should address the following points:

        • Introduction on Aggregation and Co-aggregation: Given that the focus of the study is on the aggregation and co-aggregation of the isoforms, the introduction should include a dedicated paragraph discussing this issue. There are several original research articles and reviews that could be cited to provide context.* Response: Thank you very much for the valuable comments. We have added the following paragraph in the revised manuscript (lines 74-82): "Protein aggregation has become a central focus of modern biology research and has documented implications in various diseases, including cancer13, 14, 15. Protein aggregates can be of different types ranging from amorphous aggregates to highly structured amyloid or fibrillar aggregates, each with different physiological implications. In the case of p53, whether protein aggregation, and in particular, co-aggregation with large N-terminal deletion isoforms, plays a mechanistic role in its inactivation is yet underexplored. Interestingly, the Δ133p53β isoform has been shown to aggregate in several human cancer cell lines16. Additionally, the Δ40p53α isoform exhibits a high aggregation tendency in endometrial cancer cells17. Although no direct evidence exists for Δ160p53 yet, these findings imply that p53 isoform aggregation may play a major role in their mechanisms of actions."

      2. Antibody Use for Aggregation: To strengthen the evidence for aggregation, the authors should consider using antibodies that specifically bind to aggregates.

      Response: Thank you for your insightful suggestion. We addressed protein aggregation using the A11 antibody which specifically recognizes amyloid-like protein aggregates. We analyzed insoluble nuclear pellet samples prepared under identical conditions as described in Figure 6B. To confirm the presence of p53 proteins, we employed the anti-p53 M19 antibody (Santa Cruz, Cat No. sc-1312) to detect bands corresponding to FLp53 and its isoforms Δ133p53 and Δ160p53. The monomer FLp53 was not detected (Figure R3, lower panel), which may be attributed to the lower binding affinity of the anti-p53 M19 antibody to it. These samples were also immunoprecipitated using the A11 antibody (Thermo Fischer Scientific, Cat No. AHB0052) to detect aggregated proteins. Interestingly, FLp53 and its isoforms, Δ133p53 and Δ160p53, were clearly visible with Anti-A11 antibody when co-expressed at a 1:5 ratio suggesting that they underwent co-aggregation__.__ However, no FLp53 aggregates were observed when it was expressed alone (Figure R2). These results support the conclusion in our manuscript that Δ133p53 and Δ160p53 drive FLp53 aggregation.

      (Figure R2 is included in the file "RC-2024-02608 Figures of Response to Reviewer.)__

      3. Fluorescence Microscopy: Live-cell fluorescence microscopy could be employed to enhance visualization by labeling FLp53 and the isoforms with different fluorescent markers (e.g., EGFP and mCherry tags).

      Response: We appreciate the suggestion to use live-cell fluorescence microscopy with EGFP and mCherry tags for the visualization FLp53 and its isoforms. While we understand the advantages of live-cell imaging with EGFP / mCherry tags, we restrained us from doing such fusions as the GFP or corresponding protein tags are very big (~30 kDa) with respect to the p53 isoform variants (~30 kDa). Other studies have shown that EGFP and mCherry fusions can alter protein oligomerization, solubility and aggregation18, 19. Moreover, most fluorescence proteins are prone to dimerization (i.e. EGFP) or form obligate tetramers (DsRed)20, 21, 22, potentially interfering with the oligomerization and aggregation properties of p53 isoforms, particularly Δ133p53 and Δ160p53.

      Instead, we utilized FLAG- or V5-tag-based immunofluorescence microscopy, a well-established and widely accepted method for visualizing p53 proteins. This method provided precise localization and reliable quantitative data, which we believe meet the needs of the current study. We believe our chosen method is both appropriate and sufficient for addressing the research question.

      Reviewer #2 (Significance (Required)):

      The manuscript by Zhao and colleagues presents a novel and compelling study on the p53 isoforms, Δ133p53 and Δ160p53, which are associated with aggressive cancer types. The main objective of the study was to understand how these isoforms exert a dominant negative effect on full-length p53 (FLp53). The authors discovered that the Δ133p53 and Δ160p53 proteins exhibit impaired binding to p53-regulated promoters. The data suggest that the predominant mechanism driving the dominant-negative effect is the co-aggregation of FLp53 with Δ133p53 and Δ160p53.

      Response: We sincerely thank the reviewer for the thoughtful and positive comments on our manuscript and for highlighting the significance of our findings on the p53 isoforms, Δ133p53 and Δ160p53.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript entitled "Δ133p53 and Δ160p53 isoforms of the tumor suppressor protein p53 exert dominant-negative effect primarily by co-aggregation", the authors suggest that the Δ133p53 and Δ160p53 isoforms have high aggregation propensity and that by co-aggregating with canonical p53 (FLp53), they sequestrate it away from DNA thus exerting a dominant-negative effect over it.

      First, the authors should make it clear throughout the manuscript, including the title, that they are investigating Δ133p53α and Δ160p53α since there are 3 Δ133p53 isoforms (α, β, γ), and 3 Δ160p53 isoforms (α, β, γ).

      Response: Thank you for your suggestion. We understand the importance of clearly specifying the isoforms under study. Following your suggestion, we have added α in the title, abstract, and introduction and added the following statement in the Introduction (lines 57-59): "For convenience and simplicity, we have written Δ133p53 and Δ160p53 to represent the α isoforms (Δ133p53α and Δ160p53α) throughout this manuscript."

      One concern is that the authors only consider and explore Δ133p53α and Δ160p53α isoforms as exclusively oncogenic and FLp53 dominant-negative while not discussing evidences of different activities. Indeed, other manuscripts have also shown that Δ133p53α is non-oncogenic and non-mutagenic, do not antagonize every single FLp53 functions and are sometimes associated with good prognosis. To cite a few examples:

      • Hofstetter G. et al. D133p53 is an independent prognostic marker in p53 mutant advanced serous ovarian cancer. Br. J. Cancer 2011, 105, 1593-1599.
      • Bischof, K. et al. Influence of p53 Isoform Expression on Survival in High-Grade Serous Ovarian Cancers. Sci. Rep. 2019, 9,5244.
      • Knezovi´c F. et al. The role of p53 isoforms' expression and p53 mutation status in renal cell cancer prognosis. Urol. Oncol. 2019, 37, 578.e1-578.e10.
      • Gong, L. et al. p53 isoform D113p53/D133p53 promotes DNA double-strand break repair to protect cell from death and senescence in response to DNA damage. Cell Res. 2015, 25, 351-369.
      • Gong, L. et al. p53 isoform D133p53 promotes efficiency of induced pluripotent stem cells and ensures genomic integrity during reprogramming. Sci. Rep. 2016, 6, 37281.
      • Horikawa, I. et al. D133p53 represses p53-inducible senescence genes and enhances the generation of human induced pluripotent stem cells. Cell Death Differ. 2017, 24, 1017-1028.
      • Gong, L. p53 coordinates with D133p53 isoform to promote cell survival under low-level oxidative stress. J. Mol. Cell Biol. 2016, 8, 88-90. Response: Thank you very much for your comment and for highlighting these important studies.

      We agree that Δ133p53 isoforms exhibit complex biological functions, with both oncogenic and non-oncogenic potentials. However, our mission here was primarily to reveal the molecular mechanism for the dominant-negative effects exerted by the Δ133p53α and Δ160p53α isoforms on FLp53 for which the Δ133p53α and Δ160p53α isoforms are suitable model systems. Exploring the oncogenic potential of the isoforms is beyond the scope of the current study and we have not claimed anywhere that we are reporting that. We have carefully revised the manuscript and replaced the respective terms e.g. 'pro-oncogenic activity' with 'dominant-negative effect' in relevant places (e.g. line 90). We have now also added a paragraph with suitable references that introduces the oncogenic and non-oncogenic roles of the p53 isoforms.

      After reviewing the papers you cited, we are not sure that they reflect on oncogenic /non-oncogenic role of the Δ133p53α isoform in different cancer cases. Although our study is not about the oncogenic potential of the isoforms, we have summarized the key findings below:

      • Hofstetter et al., 2011: Demonstrated that Δ133p53α expression improved recurrence-free and overall survival (in a p53 mutant induced advanced serous ovarian cancer, suggesting a potential protective role in this context.
      • Bischof et al., 2019: Found that Δ133p53 mRNA can improve overall survival in high-grade serous ovarian cancers. However, out of 31 patients, only 5 belong to the TP53 wild-type group, while the others carry TP53 mutations.
      • Knezović et al., 2019: Reported downregulation of Δ133p53 in renal cell carcinoma tissues with wild-type p53 compared to normal adjacent tissue, indicating a potential non-oncogenic role, but not conclusively demonstrating it.
      • Gong et al., 2015: Showed that Δ133p53 antagonizes p53-mediated apoptosis and promotes DNA double-strand break repair by upregulating RAD51, LIG4, and RAD52 independently of FLp53.
      • Gong et al., 2016: Demonstrated that overexpression of Δ133p53 promotes efficiency of cell reprogramming by its anti-apoptotic function and promoting DNA DSB repair. The authors hypotheses that this mechanism is involved in increasing RAD51 foci formation and decrease γH2AX foci formation and chromosome aberrations in induced pluripotent stem (iPS) cells, independent of FL p53.
      • Horikawa et al., 2017: Indicated that induced pluripotent stem cells derived from fibroblasts that overexpress Δ133p53 formed non-cancerous tumors in mice compared to induced pluripotent stem cells derived from fibroblasts with complete p53 inhibition. Thus, Δ133p53 overexpression is "non- or less oncogenic and mutagenic" compared to complete p53 inhibition, but it still compromises certain p53-mediated tumor-suppressing pathways. "Overexpressed Δ133p53 prevented FL-p53 from binding to the regulatory regions of p21WAF1 and miR-34a promoters, providing a mechanistic basis for its dominant-negative inhibition of a subset of p53 target genes."
      • Gong, 2016: Suggested that Δ133p53 promotes cell survival under low-level oxidative stress, but its role under different stress conditions remains uncertain. We have revised the Introduction to provide a more balanced discussion of Δ133p53's dule role (lines 62-73):

      "The Δ133p53 isoform exhibit complex biological functions, with both oncogenic and non-oncogenic potentials. Recent studies demonstrate the non-oncogenic yet context-dependent role of the Δ133p53 isoform in cancer development. Δ133p53 expression has been reported to correlate with improved survival in patients with TP53 mutations23, 24, where it promotes cell survival in a non-oncogenic manner25, 26, especially under low oxidative stress27. Alternatively, other recent evidences emphasize the notable oncogenic functions of Δ133p53 as it can inhibit p53-dependent apoptosis by directly interacting with the FLp53 4, 6. The oncogenic function of the newly identified Δ160p53 isoform is less known, although it is associated with p53 mutation-driven tumorigenesis28 and in melanoma cells' aggressiveness10. Whether or not the Δ160p53 isoform also impedes FLp53 function in a similar way as Δ133p53 is an open question. However, these p53 isoforms can certainly compromise p53-mediated tumor suppression by interfering with FLp53 binding to target genes such as p21 and miR-34a2, 29 by dominant-negative effect, the exact mechanism is not known."

      On the figures presented in this manuscript, I have three major concerns:

      *1- Most results in the manuscript rely on the overexpression of the FLAG-tagged or V5-tagged isoforms. The validation of these construct entirely depends on Supplementary figure 3 which the authors claim "rules out the possibility that the FLAG epitope might contribute to this aggregation. However, I am not entirely convinced by that conclusion. Indeed, the ratio between the "regular" isoform and the aggregates is much higher in the FLAG-tagged constructs than in the V5-tagged constructs. We can visualize the aggregates easily in the FLAG-tagged experiment, but the imaging clearly had to be overexposed (given the white coloring demonstrating saturation of the main bands) to visualize them in the V5-tagged experiments. Therefore, I am not convinced that an effect of the FLAG-tag can be ruled out and more convincing data should be added. *

      Response: Thank you for raising this important concern. We have carefully considered your comments and have made several revisions to clarify and strengthen our conclusions.

      First, to address the potential influence of the FLAG and V5 tags on p53 isoform aggregation, we have revised Figure 2 and removed the previous Supplementary Figure 3, where non-specific antibody bindings and higher molecular weight aggregates were not clearly interpretable. In the revised Figure 2, we have removed these potential aggregates, improving the clarity and accuracy of the data.

      To further rule out any tag-related artifacts, we conducted a co-immunoprecipitation assay with FLAG-tagged FLp53 and untagged Δ133p53 and Δ160p53 isoforms. The results (now shown in the new Supplementary Figure 3) completely agree with our previous result with FLAG-tagged and V5-tagged Δ133p53 and Δ160p53 isoforms and show interaction between the partners. This indicates that the FLAG / V5-tags do not influence / interfere with the interaction between FLp53 and the isoforms. We have still used FLAG-tagged FLp53 as the endogenous p53 was undetectable and the FLAG-tagged FLp53 did not aggregate alone.

      In the revised paper, we added the following sentences (Lines 146-152): "To rule out the possibility that the observed interactions between FLp53 and its isoforms Δ133p53 and Δ160p53 were artifacts caused by the FLAG and V5 antibody epitope tags, we co-expressed FLAG-tagged FLp53 with untagged Δ133p53 and Δ160p53. Immunoprecipitation assays demonstrated that FLAG-tagged FLp53 could indeed interact with the untagged Δ133p53 and Δ160p53 isoforms (Supplementary Figure 3, lanes 3 and 4), confirming formation of hetero-oligomers between FLp53 and its isoforms. These findings demonstrate that Δ133p53 and Δ160p53 can oligomerize with FLp53 and with each other."

      Additionally, we performed subcellular fractionation experiments to compare the aggregation and localization of FLAG-tagged FLp53 when co-expressed either with V5-tagged or untagged Δ133p53/Δ160p53. In these experiments, the untagged isoforms also induced FLp53 aggregation, mirroring our previous results with the tagged isoforms (Supplementary Figure 5). We've added this result in the revised manuscript (lines 236-245): "To exclude the possibility that FLAG or V5 tags contribute to protein aggregation, we also conducted subcellular fractionation of H1299 cells expressing FLAG-tagged FLp53 along with untagged Δ133p53 or Δ160p53 at a 1:5 ratio. The results showed (Supplementary Figure 6) a similar distribution of FLp53 across cytoplasmic, nuclear, and insoluble nuclear fractions as in the case of tagged Δ133p53 or Δ160p53 (Figure 6A to D). Notably, the aggregation of untagged Δ133p53 or Δ160p53 markedly promoted the aggregation of FLAG-tagged FLp53 (Supplementary Figure 6B and D), demonstrating that the antibody epitope tags themselves do not contribute to protein aggregation."

      We've also discussed this in the Discussion section (lines 349-356): "In our study, we primarily utilized an overexpression strategy involving FLAG/V5-tagged proteins to investigate the effects of p53 isoforms Δ133p53 and Δ160p53 on the function of FLp53. To address concerns regarding potential overexpression artifacts, we performed the co-immunoprecipitation (Supplementary Figure 6) and caspase-3 and -7 activity (Figure 7) experiments with untagged Δ133p53 and Δ160p53. In both experimental systems, the untagged proteins behaved very similarly to the FLAG/V5 antibody epitope-containing proteins (Figures 6 and 7 and Supplementary Figure 6). Hence, the C-terminal tagging of FLp53 or its isoforms does not alter the biochemical and physiological functions of these proteins."

      In summary, the revised data set and newly added experiments provide strong evidence that neither the FLAG nor the V5 tag contributes to the observed p53 isoform aggregation.

      2- The authors demonstrate that to visualize the dominant-negative effect, Δ133p53α and Δ160p53α must be "present in a higher proportion than FLp53 in the tetramer" and the need at least a transfection ratio 1:5 since the 1:1 ration shows no effect. However, in almost every single cell type, FLp53 is far more expressed than the isoforms which make it very unlikely to reach such stoichiometry in physiological conditions and make me wonder if this mechanism naturally occurs at endogenous level. This limitation should be at least discussed.

      Response: Thank you for your insightful comment. However, evidence suggests that the expression levels of these isoforms such as Δ133p53, can be significantly elevated relative to FLp53 in certain physiological conditions3, 4, 9. For example, in some breast tumors, with Δ133p53 mRNA is expressed at a much levels than FLp53, suggesting a distinct expression profile of p53 isoforms compared to normal breast tissue4. Similarly, in non-small cell lung cancer and the A549 lung cancer cell line, the expression level of Δ133p53 transcript is significantly elevated compared to non-cancerous cells3. Moreover, in specific cholangiocarcinoma cell lines, the Δ133p53 /TAp53 expression ratio has been reported to increase to as high as 3:19. These observations indicate that the dominant-negative effect of isoform Δ133p53 on FLp53 can occur under certain pathological conditions where the relative amounts of the FLp53 and the isoforms would largely vary. Since data on the Δ160p53 isoform are scarce, we infer that the long N-terminal truncated isoforms may share a similar mechanism.

      Figure 5C: I am concerned by the subcellular location of the Δ133p53α and Δ160p53α as they are commonly considered nuclear and not cytoplasmic as shown here, particularly since they retain the 3 nuclear localization sequences like the FLp53 (Bourdon JC et al. 2005; Mondal A et al. 2018; Horikawa I et al, 2017; Joruiz S. et al, 2024). However, Δ133p53α can form cytoplasmic speckles (Horikawa I et al, 2017) when it colocalizes with autophagy markers for its degradation.

      3-The authors should discuss this issue. Could this discrepancy be due to the high overexpression level of these isoforms? A co-staining with autophagy markers (p62, LC3B) would rule out (or confirm) activation of autophagy due to the overwhelming expression of the isoform.

      Response: Thank you for your thoughtful comments. We have thoroughly reviewed all the papers you recommended (Bourdon JC et al., 2005; Mondal A et al., 2018; Horikawa I et al., 2017; Joruiz S. et al., 2024)4, 29, 30, 31. Among these, only the study by Bourdon JC et al. (2005) provided data regarding the localization of Δ133p534. Interestingly, their findings align with our observations, indicating that the protein does not exhibit predominantly nuclear localization in the Figure below. The discrepancy may be caused by a potentially confusing statement in that paper4

      (The Figure from Bourdon JC et al. (2005) is included in the file "RC-2024-02608 Figures of Response to Reviewer.)__

      The localization of p53 is governed by multiple factors, including its nuclear import and export32. The isoforms Δ133p53 and Δ160p53 contain three nuclear localization sequences (NLS)4 . However, the isoforms Δ133p53 and Δ160p53 were potentially trapped in the cytoplasm by aggregation and masking the NLS. This mechanism would prevent nuclear import.

      Further, we acknowledge that Δ133p53 co-aggregates with autophagy substrate p62/SQSTM1 and autophagosome component LC3B in cytoplasm by autophagic degradation during replicative senescence33. We agree that high overexpression of these aggregation-prone proteins may induce endoplasmic reticulum (ER) stress and activates autophagy34. This could explain the cytoplasmic localization in our experiments. However, it is also critical to consider that we observed aggregates in both the cytoplasm and the nucleus (Figures 6B and E and Supplementary Figure 6B). While cytoplasmic localization may involve autophagy-related mechanisms, the nuclear aggregates likely arise from intrinsic isoform properties, such as altered protein folding, independent of autophagy. These dual localizations reflect the complex behavior of Δ133p53 and Δ160p53 isoforms under our experimental conditions.

      In the revised manuscript, we discussed this in Discussion (lines 328-335): "Moreover, the observed cytoplasmic isoform aggregates may reflect autophagy-related degradation, as suggested by the co-localization of Δ133p53 with autophagy substrate p62/SQSTM1 and autophagosome component LC3B33. High overexpression of these aggregation-prone proteins could induce endoplasmic reticulum stress and activate autophagy34. Interestingly, we also observed nuclear aggregation of these isoforms (Figure 6B and E and Supplementary Figure 6B), suggesting that distinct mechanisms, such as intrinsic properties of the isoforms, may govern their localization and behavior within the nucleus. This dual localization underscores the complexity of Δ133p53 and Δ160p53 behavior in cellular systems."

      Minor concerns:

      - Figure 1A: the initiation of the "Δ140p53" is shown instead of "Δ40p53"

      Response: Thank you! The revised Figure 1A has been created in the revised paper.

      • Figure 2A: I would like to see the images cropped a bit higher, so the cut does not happen just above the aggregate bands

      Response: Thank you for this suggestion. We've changed the image and the new Figure 2 has been shown in the revised paper.

      • Figure 3C: what ratio of FLp53/Delta isoform was used?

      Response: We have added the ratio in the figure legend of Figure 3C (lines 845-846) "Relative DNA-binding of the FLp53-FLAG protein to the p53-target gene promoters in the presence of the V5-tagged protein Δ133p53 or Δ160p53 at a 1: 1 ratio."

      • Figure 3C suggests that the "dominant-negative" effect is mostly senescence-specific as it does not affect apoptosis target genes, which is consistent with Horikawa et al, 2017 and Gong et al, 2016 cited above. Furthermore, since these two references and the others from Gong et al. show that Δ133p53α increases DNA repair genes, it would be interesting to look at RAD51, RAD52 or Lig4, and maybe also induce stress.

      Response: Thank you for your thoughtful comments and suggestions. In Figure 3C, the presence of Δ133p53 or Δ160p53 only significantly reduced the binding of FLp53 to the p21 promoter. However, isoforms Δ133p53 and Δ160p53 demonstrated a significant loss of DNA-binding activity at all four promoters: p21, MDM2, and apoptosis target genes BAX and PUMA (Figure 3B). This result suggests that Δ133p53 and Δ160p53 have the potential to influence FLp53 function due to their ability to form hetero-oligomers with FLp53 or their intrinsic tendency to aggregate. To further investigate this, we increased the isoform to FLp53 ratio in Figure 4, which demonstrate that the isoforms Δ133p53 and Δ160p53 exert dominant-negative effects on the function of FLp53.

      These results demonstrate that the isoforms can compromise p53-mediated pathways, consistent with Horikawa et al. (2017), which showed that Δ133p53α overexpression is "non- or less oncogenic and mutagenic" compared to complete p53 inhibition, but still affects specific tumor-suppressing pathways. Furthermore, as noted by Gong et al. (2016), Δ133p53's anti-apoptotic function under certain conditions is independent of FLp53 and unrelated to its dominant-negative effects.

      We appreciate your suggestion to investigate DNA repair genes such as RAD51, RAD52, or Lig4, especially under stress conditions. While these targets are intriguing and relevant, we believe that our current investigation of p53 targets in this manuscript sufficiently supports our conclusions regarding the dominant-negative effect. Further exploration of additional p53 target genes, including those involved in DNA repair, will be an important focus of our future studies.

      • Figure 5A and B: directly comparing the level of FLp53 expressed in cytoplasm or nucleus to the level of Δ133p53α and Δ160p53α expressed in cytoplasm or nucleus does not mean much since these are overexpressed proteins and therefore depend on the level of expression. The authors should rather compare the ratio of cytoplasmic/nuclear FLp53 to the ratio of cytoplasmic/nuclear Δ133p53α and Δ160p53α.

      Response: Thank you very much for this valuable suggestion. In the revised paper, Figure 5B has been recreated. Changes have been made in lines 214-215: "The cytoplasm-to-nucleus ratio of Δ133p53 and Δ160p53 was approximately 1.5-fold higher than that of FLp53 (Figure 5B)."

      **Referees cross-commenting**

      I agree that the system needs to be improved to be more physiological.

      Just to precise, the D133 and D160 isoforms are not truncated mutants, they are naturally occurring isoforms expressed in almost every normal human cell type from an internal promoter within the TP53 gene.

      Using overexpression always raises concerns, but in this case, I am even more careful because the isoforms are almost always less expressed than the FLp53, and here they have to push it 5 to 10 times more expressed than the FLp53 to see the effect which make me fear an artifact effect due to the overwhelming overexpression (which even seems to change the normal localization of the protein).

      To visualize the endogenous proteins, they will have to change cell line as the H1299 they used are p53 null.

      Response: Thank you for these comments. We've addressed the motivation of overexpression in the above responses. We needed to use the plasmid constructs in the p53-null cells to detect the proteins but the expression level was certainly not 'overwhelmingly high'.

      First, we tried the A549 cells (p53 wild-type) under DNA damage conditions, but the endogenous p53 protein was undetectable. Second, several studies reported increased Δ133p53 level compared to wild-type p53 and that it has implications in tumor development2, 3, 4, 9. Third, the apoptosis activity of H1299 cells overexpressing p53 proteins was analyzed in the revised manuscript (Figure 7). The apoptotic activity induced by FLp53 expression was approximately 2.5 times higher than that of the control vector under identical plasmid DNA transfection conditions (Figure 7). These results rule out the possibility that the plasmid-based expression of p53 and its isoforms introduced artifacts in the results. We've discussed this in the Results section (lines 254-269).

      Reviewer #3 (Significance (Required)):

      Overall, the paper is interesting particularly considering the range of techniques used which is the main strength.

      The main limitation to me is the lack of contradictory discussion as all argumentation presents Δ133p53α and Δ160p53α exclusively as oncogenic and strictly FLp53 dominant-negative when, particularly for Δ133p53α, a quite extensive literature suggests a not so clear-cut activity.

      The aggregation mechanism is reported for the first time for Δ133p53α and Δ160p53α, although it was already published for Δ40p53α, Δ133p53β or in mutant p53.

      This manuscript would be a good basic research addition to the p53 field to provide insight in the mechanism for some activities of some p53 isoforms.

      My field of expertise is the p53 isoforms which I have been working on for 11 years in cancer and neuro-degenerative diseases

      Response: Thank you very much for your positive and critical comments. We've included a fair discussion on the oncogenic and non-oncogenic function of Δ133p53 in the Introduction following your suggestion (lines 62-73).

      References

      1. Pitolli C, Wang Y, Candi E, Shi Y, Melino G, Amelio I. p53-Mediated Tumor Suppression: DNA-Damage Response and Alternative Mechanisms. Cancers 11, (2019).

      Fujita K, et al. p53 isoforms Delta133p53 and p53beta are endogenous regulators of replicative cellular senescence. Nature cell biology 11, 1135-1142 (2009).

      Fragou A, et al. Increased Δ133p53 mRNA in lung carcinoma corresponds with reduction of p21 expression. Molecular medicine reports 15, 1455-1460 (2017).

      Bourdon JC, et al. p53 isoforms can regulate p53 transcriptional activity. Genes & development 19, 2122-2137 (2005).

      Ghosh A, Stewart D, Matlashewski G. Regulation of human p53 activity and cell localization by alternative splicing. Molecular and cellular biology 24, 7987-7997 (2004).

      Aoubala M, et al. p53 directly transactivates Δ133p53α, regulating cell fate outcome in response to DNA damage. Cell death and differentiation 18, 248-258 (2011).

      Marcel V, et al. p53 regulates the transcription of its Delta133p53 isoform through specific response elements contained within the TP53 P2 internal promoter. Oncogene 29, 2691-2700 (2010).

      Zhao L, Sanyal S. p53 Isoforms as Cancer Biomarkers and Therapeutic Targets. Cancers 14, (2022).

      Nutthasirikul N, Limpaiboon T, Leelayuwat C, Patrakitkomjorn S, Jearanaikoon P. Ratio disruption of the ∆133p53 and TAp53 isoform equilibrium correlates with poor clinical outcome in intrahepatic cholangiocarcinoma. International journal of oncology 42, 1181-1188 (2013).

      Tadijan A, et al. Altered Expression of Shorter p53 Family Isoforms Can Impact Melanoma Aggressiveness. Cancers 13, (2021).

      Aubrey BJ, Kelly GL, Janic A, Herold MJ, Strasser A. How does p53 induce apoptosis and how does this relate to p53-mediated tumour suppression? Cell death and differentiation 25, 104-113 (2018).

      Ghorbani N, Yaghubi R, Davoodi J, Pahlavan S. How does caspases regulation play role in cell decisions? apoptosis and beyond. Molecular and cellular biochemistry 479, 1599-1613 (2024).

      Petronilho EC, et al. Oncogenic p53 triggers amyloid aggregation of p63 and p73 liquid droplets. Communications chemistry 7, 207 (2024).

      Forget KJ, Tremblay G, Roucou X. p53 Aggregates penetrate cells and induce the co-aggregation of intracellular p53. PloS one 8, e69242 (2013).

      Farmer KM, Ghag G, Puangmalai N, Montalbano M, Bhatt N, Kayed R. P53 aggregation, interactions with tau, and impaired DNA damage response in Alzheimer's disease. Acta neuropathologica communications 8, 132 (2020).

      Arsic N, et al. Δ133p53β isoform pro-invasive activity is regulated through an aggregation-dependent mechanism in cancer cells. Nature communications 12, 5463 (2021).

      Melo Dos Santos N, et al. Loss of the p53 transactivation domain results in high amyloid aggregation of the Δ40p53 isoform in endometrial carcinoma cells. The Journal of biological chemistry 294, 9430-9439 (2019).

      Mestrom L, et al. Artificial Fusion of mCherry Enhances Trehalose Transferase Solubility and Stability. Applied and environmental microbiology 85, (2019).

      Kaba SA, Nene V, Musoke AJ, Vlak JM, van Oers MM. Fusion to green fluorescent protein improves expression levels of Theileria parva sporozoite surface antigen p67 in insect cells. Parasitology 125, 497-505 (2002).

      Snapp EL, et al. Formation of stacked ER cisternae by low affinity protein interactions. The Journal of cell biology 163, 257-269 (2003).

      Jain RK, Joyce PB, Molinete M, Halban PA, Gorr SU. Oligomerization of green fluorescent protein in the secretory pathway of endocrine cells. The Biochemical journal 360, 645-649 (2001).

      Campbell RE, et al. A monomeric red fluorescent protein. Proceedings of the National Academy of Sciences of the United States of America 99, 7877-7882 (2002).

      Hofstetter G, et al. Δ133p53 is an independent prognostic marker in p53 mutant advanced serous ovarian cancer. British journal of cancer 105, 1593-1599 (2011).

      Bischof K, et al. Influence of p53 Isoform Expression on Survival in High-Grade Serous Ovarian Cancers. Scientific reports 9, 5244 (2019).

      Gong L, et al. p53 isoform Δ113p53/Δ133p53 promotes DNA double-strand break repair to protect cell from death and senescence in response to DNA damage. Cell research 25, 351-369 (2015).

      Gong L, et al. p53 isoform Δ133p53 promotes efficiency of induced pluripotent stem cells and ensures genomic integrity during reprogramming. Scientific reports 6, 37281 (2016).

      Gong L, Pan X, Yuan ZM, Peng J, Chen J. p53 coordinates with Δ133p53 isoform to promote cell survival under low-level oxidative stress. Journal of molecular cell biology 8, 88-90 (2016).

      Candeias MM, Hagiwara M, Matsuda M. Cancer-specific mutations in p53 induce the translation of Δ160p53 promoting tumorigenesis. EMBO reports 17, 1542-1551 (2016).

      Horikawa I, et al. Δ133p53 represses p53-inducible senescence genes and enhances the generation of human induced pluripotent stem cells. Cell death and differentiation 24, 1017-1028 (2017).

      Mondal AM, et al. Δ133p53α, a natural p53 isoform, contributes to conditional reprogramming and long-term proliferation of primary epithelial cells. Cell death & disease 9, 750 (2018).

      Joruiz SM, Von Muhlinen N, Horikawa I, Gilbert MR, Harris CC. Distinct functions of wild-type and R273H mutant Δ133p53α differentially regulate glioblastoma aggressiveness and therapy-induced senescence. Cell death & disease 15, 454 (2024).

      O'Brate A, Giannakakou P. The importance of p53 location: nuclear or cytoplasmic zip code? Drug resistance updates : reviews and commentaries in antimicrobial and anticancer chemotherapy 6, 313-322 (2003).

      Horikawa I, et al. Autophagic degradation of the inhibitory p53 isoform Δ133p53α as a regulatory mechanism for p53-mediated senescence. Nature communications 5, 4706 (2014).

      Lee H, et al. IRE1 plays an essential role in ER stress-mediated aggregation of mutant huntingtin via the inhibition of autophagy flux. Human molecular genetics 21, 101-114 (2012).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript entitled "Δ133p53 and Δ160p53 isoforms of the tumor suppressor protein p53 exert dominant-negative effect primarily by co-aggregation", the authors suggest that the Δ133p53 and Δ160p53 isoforms have high aggregation propensity and that by co-aggregating with canonical p53 (FLp53), they sequestrate it away from DNA thus exerting a dominant-negative effect over it.

      First, the authors should make it clear throughout the manuscript, including the title, that they are investigating Δ133p53α and Δ160p53α since there are 3 Δ133p53 isoforms (α, β, γ), and 3 Δ160p53 isoforms (α, β, γ).

      One concern is that the authors only consider and explore Δ133p53α and Δ160p53α isoforms as exclusively oncogenic and FLp53 dominant-negative while not discussing evidences of different activities. Indeed, other manuscripts have also shown that Δ133p53α is non-oncogenic and non-mutagenic, do not antagonize every single FLp53 functions and are sometimes associated with good prognosis. To cite a few examples: Hofstetter G. et al. D133p53 is an independent prognostic marker in p53 mutant advanced serous ovarian cancer. Br. J. Cancer 2011, 105, 1593-1599. Bischof, K. et al. Influence of p53 Isoform Expression on Survival in High-Grade Serous Ovarian Cancers. Sci. Rep. 2019, 9,5244. Knezovi´c F. et al. The role of p53 isoforms' expression and p53 mutation status in renal cell cancer prognosis. Urol. Oncol. 2019, 37, 578.e1-578.e10. Gong, L. et al. p53 isoform D113p53/D133p53 promotes DNA double-strand break repair to protect cell from death and senescence in response to DNA damage. Cell Res. 2015, 25, 351-369. Gong, L. et al. p53 isoform D133p53 promotes efficiency of induced pluripotent stem cells and ensures genomic integrity during reprogramming. Sci. Rep. 2016, 6, 37281. Horikawa, I. et al. D133p53 represses p53-inducible senescence genes and enhances the generation of human induced pluripotent stem cells. Cell Death Differ. 2017, 24, 1017-1028. Gong, L. p53 coordinates with D133p53 isoform to promote cell survival under low-level oxidative stress. J. Mol. Cell Biol. 2016, 8, 88-90.

      On the figures presented in this manuscript, I have three major concerns:

      1. Most results in the manuscript rely on the overexpression of the FLAG-tagged or V5-tagged isoforms. The validation of these construct entirely depends on Supplementary figure 3 which the authors claim "rule[s] out the possibility that the FLAG epitope might contribute to this aggregation. However, I am not entirely convinced by that conclusion. Indeed, the ratio between the "regular" isoform and the aggregates is much higher in the FLAG-tagged constructs than in the V5-tagged constructs. We can visualize the aggregates easily in the FLAG-tagged experiment, but the imaging clearly had to be overexposed (given the white coloring demonstrating saturation of the main bands) to visualize them in the V5-tagged experiments. Therefore, I am not convinced that an effect of the FLAG-tag can be ruled out and more convincing data should be added.
      2. The authors demonstrate that to visualize the dominant-negative effect, Δ133p53α and Δ160p53α must be "present in a higher proportion than FLp53 in the tetramer" and the need at least a transfection ratio 1:5 since the 1:1 ration shows no effect. However, in almost every single cell type, FLp53 is far more expressed than the isoforms which make it very unlikely to reach such stoichiometry in physiological conditions and make me wonder if this mechanism naturally occurs at endogenous level. This limitation should be at least discussed.
      3. Figure 5C: I am concerned by the subcellular location of the Δ133p53α and Δ160p53α as they are commonly considered nuclear and not cytoplasmic as shown here, particularly since they retain the 3 nuclear localization sequences like the FLp53 (Bourdon JC et al. 2005; Mondal A et al. 2018; Horikawa I et al, 2017; Joruiz S. et al, 2024). However, Δ133p53α can form cytoplasmic speckles (Horikawa I et al, 2017) when it colocalizes with autophagy markers for its degradation. The authors should discuss this issue. Could this discrepancy be due to the high overexpression level of these isoforms? A co-staining with autophagy markers (p62, LC3B) would rule out (or confirm) activation of autophagy due to the overwhelming expression of the isoform.

      Minor concerns:

      • Figure 1A: the initiation of the "Δ140p53" is shown instead of "Δ40p53"
      • Figure 2A: I would like to see the images cropped a bit higher, so the cut does not happen just above the aggregate bands
      • Figure 3C: what ratio of FLp53/Delta isoform was used?
      • Figure 3C suggests that the "dominant-negative" effect is mostly senescence-specific as it does not affect apoptosis target genes, which is consistent with Horikawa et al, 2017 and Gong et al, 2016 cited above. Furthermore, since these two references and the others from Gong et al. show that Δ133p53α increases DNA repair genes, it would be interesting to look at RAD51, RAD52 or Lig4, and maybe also induce stress.
      • Figure 5A and B: directly comparing the level of FLp53 expressed in cytoplasm or nucleus to the level of Δ133p53α and Δ160p53α expressed in cytoplasm or nucleus does not mean much since these are overexpressed proteins and therefore depend on the level of expression. The authors should rather compare the ratio of cytoplasmic/nuclear FLp53 to the ratio of cytoplasmic/nuclear Δ133p53α and Δ160p53α.

      Referees cross-commenting

      I agree that the system needs to be improved to be more physiological.

      Just to precise, the D133 and D160 isoforms are not truncated mutants, they are naturally occurring isoforms expressed in almost every normal human cell type from an internal promoter within the TP53 gene.

      Using overexpression always raises concerns, but in this case I am even more careful because the isoforms are almost always less expressed than the FLp53, and here they have to push it 5 to 10 times more expressed than the FLp53 to see the effect which make me fear an artifact effect due to the overwhelming overexpression (which even seems to change the normal localization of the protein).

      To visualize the endogenous proteins, they will have to change cell line as the H1299 they used are p53 null.

      Significance

      Overall, the paper is interesting particularly considering the range of techniques used which is the main strength. The main limitation to me is the lack of contradictory discussion as all argumentation presents Δ133p53α and Δ160p53α exclusively as oncogenic and strictly FLp53 dominant-negative when, particularly for Δ133p53α, a quite extensive literature suggests a not so clear-cut activity.

      The aggregation mechanism is reported for the first time for Δ133p53α and Δ160p53α, although it was already published for Δ40p53α, Δ133p53β or in mutant p53.

      This manuscript would be a good basic research addition to the p53 field to provide insight in the mechanism for some activities of some p53 isoforms.

      My field of expertise is the p53 isoforms which I have been working on for 11 years in cancer and neuro-degenerative diseases