6,066 Matching Annotations
  1. Mar 2024
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Through Review Commons, we received some highly favorable and constructive feedback from reviewers who are clearly knowledgeable about phylogenomics and/or the field of bacterial anti-phage immunity. We have responded to all suggestions made by the reviewers, which we feel have substantially improved and clarified the manuscript. We thank all three reviewers for their thoughtfulness and time.

      Reviewer #1

      Evidence, reproducibility and clarity

      Culbertson and Levin present an elegant computational analysis of the evolutionary history of several families of immune proteins conserved in bacteria and metazoan cells. The authors' work is impressive, revealing interesting insight into previously known connections and identifying exciting new connections that further link bacterial anti-phage defense and animal innate immunity. The results are overall well-presented and will have an important impact on multiple related fields. I have a few comments for the authors to help explain some of the new connections observed in their findings and clarify the results for a general audience.

      We thank the reviewer for their kind appraisal of our manuscript as well as their helpful comments. We found their comments to be very useful in strengthening our work and increasing the clarity of the writing.

      Comments: 1) The authors adeptly navigate difficult and changing nomenclature around cGAS-STING signaling but there may be room for clarifying terminology. Although historically the term "CD-NTase" has been used to describe both bacterial and animal enzymes (including by this reviewer's older work as well), the field has now settled on consistent use of the name "CD-NTase" to describe bacterial cGAS/DncV-like enzymes and the use of the names "cGAS" and "cGLR" to describe animal cGAS-like receptor proteins. Nearly all papers describing bacterial signaling use the term CD-NTase, and since 2021 most papers describing divergent cGAS-like enzymes in animal signaling now use the term "cGLR" (for recent examples see primary papers Holleufer et al 2021 PMID 34261128; Slavik et al 2021 PMID 34261127; Li et al 2023 PMID 37379839; Cai et al 2023 PMID 37659413 and review articles Cai et al 2022 PMID 35149240; Slavik et al 2023 PMID 37380187; Fan et al 2021 PMID 34697297; West et al 2021 PMID 34373639 Unterholzner Cell 2023 PMID 37478819). Kingdom-specific uses of CD-NTase and cGLR may help add clarity to the manuscript especially as each group of enzyme is quite divergent and many protein members synthesize signaling molecules that are distinct from cyclic GMP-AMP (i.e. not cGAS).

      Related to this point, the term "SMODS" is useful for describing the protein family domain originally identified in the elegant work of Burroughs and Aaravind (Burroughs et al 2015 PMID 26590262), but this term is rarely used in papers focused on the biology of these systems. "eSMODS" is a good name, but the authors may want to consider a different description to better fit with current terminology.

      We appreciate the reviewer’s suggestion and have updated the text to try to be more clear (ex: using cGLR as a more specific term whenever possible). However, as OAS is distinctly not a cGLR, strict kingdom-specific use of the terms CD-NTase and cGLR is not possible. We have updated the Mab21 superfamily to be re-named as the cGLR superfamily, as those seem to be synonymous based on recent literature. At this time we are choosing to stick with the eSMODS terminology as it remains to be shown that these eukaryotic proteins have a CD-NTase-like biochemical function.

      An example of how we have tried to navigate this naming issues is:

      “The cGLR superfamily passed all four of these HGT thresholds, as did another eukaryotic clade of CD-NTases that were all previously undescribed. We name this clade the eukaryotic SMODS (eSMODS) superfamily, because the top scoring domain from hmmscan for each sequence in this superfamily was the SMODS domain (PF18144), which is typically found only in bacterial CD-NTases (Supplementary Data).”

      2) The authors state that proteins were identified using an iterative HMM-based search until they "began finding proteins outside of the family of interest" (Line 86). Is it possible to please explain in more detail what this means? A key part of the analysis pipeline is knowing when to stop, especially as some proteins like CD-NTases and cGLRs share related-homology to other major enzyme groups like pol-beta NTases while other proteins like STING and viperin are more unique.

      We have updated the text to better explain how we determined that a given protein sequence was excluded:

      “After using this approach to create pan-eukaryotic HMMs for each protein family, we then added in bacterial homologs to generate universal HMMs (Fig. 1A and Supp. Fig. 1), continuing our iterative searches until we either failed to find any new protein sequences or began finding proteins outside of the family of interest (Supp. Fig. 1). To define the boundaries that separated our proteins of interest from neighboring gene families, we focused on including homologs that shared protein domains that defined that family (see Materials and Methods for domain designations) and were closer to in-group sequences than the outgroup sequences on a phylogenetic tree (outgroup sequences are noted in the Materials and Methods). “

      We also added a section to the Methods specifically defining our outgroups:

      “As outgroup sequences, we used Poly(A) RNA polymerase (PAP) sequences for the CD-NTases, and molybdenum cofactor biosynthetic enzyme (MoaA) for viperin. We did not have a suitable outgroup for STING domains, nor did any diverged outgroups come up in our searches.”

      3) The authors comment on several controls to guard against potential contaminating bacterial sequences present in metazoan genome sequencing datasets (Lines 174-182). It may be helpful to include this very important part of the analysis as part of the stepwise schematic in Figure 1a. Additionally, have the authors used other eukaryotic features like the presence of introns or kingdom specific translation elements (e.g. Shine-Dalgarno- vs. Kozak-like sequences) as part of the analysis?

      We agree that it will be very interesting to look for these eukaryotic gene features, both to rule out contamination and to discern how eukaryotes have acquired and domesticated bacteria-like immune proteins. However, one limitation when working with the data in EukProt is that many species are represented by de novo transcriptome datasets and therefore information about the local gene environment, introns, or promoters are unavailable.

      4) A particularly surprising result of the analysis is a proposed connection between oligoadenylate synthase-like (OAS-like) enzymes and bacterial Clade C CD-NTases. A concern with these results is that previous structural analysis has demonstrated that bacterial CD-NTase enzymes and animal cGLRs are more closely related to each other than they are to OAS (Slavik et al 2021 PMID 34261127). Can the authors provide further support for a connection between OAS and Clade C CD-NTases? The C-terminal alpha-helix bundle of OAS is known to be distinct (Lohöfener et al 2015 PMID 25892109) and perhaps AlphaFold2 modeling of bacterial Clade C CD-NTases and additional OAS sequences may provide further bioinformatic evidence to support the authors' conclusions.

      We were also surprised by this finding as it seems to be in opposition to structural comparisons in studies such as Whiteley et. al 2019 (PMID 30787435). As the reviewer suggests,e used AlphaFold to predict the structures of two CD-NTases, that of Bacterioides uniformis (Clade C016) and Escherichia coli (Clade C018) as well as a previously uncharacterized OAS-like protein (Tripos fusus P058904) and compared those structural predictions to those of cGAS (PDB: 6CTA), OAS1 (PDB: 4RWO), and DncV (PDB: 4TY0). We used the DALI server to make these all vs all comparisons.

           We have not included these analyses in the manuscript as the results were largely inconclusive. The average pairwise z-score between any of these structures was around 20, with a narrow range of scores between 16 (e.g. OAS vs. DncV) and 22 (e.g. DncV vs. the Clade C CD-NTases). For reference, the z-score of a given protein compared to itself was ~50 and a z-score of 20 is a general DALI benchmark used to determine if structures are homologous ( z-scores between 8-20 are in a gray area, and 20+ are generally considered homologous).
      

      In our view, these pairwise structural comparisons suffer from essentially the same problem that is evident in phylogenetic trees containing only animal and bacterial homologs. Namely, all structures/sequences under consideration are extremely different from each other, on very long branches that are difficult to place with confidence when few homologs are being considered. The benefit of our approach is that we have the ideal species diversity to break up the long branches (particularly with respect to the OAS superfamily), allowing us to place those sequences confidently on the phylogeny.

      That said, while we have strong support for the topology of OAS within the CD-NTase tree, the interpretation of the relationships relies partly on the inferred root of the tree. In our analyses, we opted not to include a distant outgroup such as pol-beta for rooting purposes, as these sequences aligned poorly with the CD-NTases, resulting in a substantial decrease in alignment and tree quality. Instead, in Fig. 2 we present a tree that is arbitrarily rooted within the bacterial CD-NTases, as this root allows for clade C to be phylogenetically coherent. Our data are also consistent with an alternative rooting, placing OAS as an outgroup. If so, this would yield a tree that implies that OAS-like sequences could have given rise to all other CD-NTases and that, within the non-OAS sequences, all bacterial CD-NTases emerged from within Clade C. We thought it slightly more likely that the root of CD-NTases was solidly within bacteria, hence the display we chose. However, we were not intending to rule out an OAS-outgroup model here. As this response to reviewers will be publically available alongside the final manuscript, we hope this clarifies our claims about the placement of OAS.

      5) One of the most exciting results in the paper is identification of a family of putative CD-NTase enzymes conserved in metazoans. Although full description may be beyond the scope of this paper, if possible, some more analysis would be interesting here: a. Are these CD-NTase enzymes in a conserved gene neighborhood within the metazoan genomes (i.e. located next to a potential cyclic nucleotide receptor?) b. Do these metazoan genomes encode other known receptors for cyclic nucleotide signaling (PFAM searches for CARF or SAVED domains for instance). c. Similar to points 3 and 4, is it possible to add further evidence for support of these proteins as true metazoan sequences that have predicted structural homology to bacterial CD-NTase enzymes?

      Yes agreed, we think point a is an exciting avenue of questioning to pursue. However, as mentioned above, the Eukprot dataset often does not provide the relevant information for the analyses proposed. Therefore, we feel that answering questions about the genomic region of these proteins is beyond the scope of the current manuscript. In particular, all 6 of the eSMODS species are represented only by transcriptomes, making these analyses impossible.

      For point b, we searched EukProt with HMMs for SAVED domains (PF18145), finding 24 total SAVED-containing proteins in EukProt. (We did not find a CARF HMM in Pfam, Tigrfam or other databases, and so could not easily carry out these searches.) Five of the 24 SAVED-containing sequences came from species encoding an eSMODS gene. This represented 3 species out of the total 20 species where we detected a SAVED domain. While this is a potentially intriguing overlap, we cannot make a strong claim about whether these SAVED sequences derive from eukaryotes vs. bacterial contamination without undergoing the extensive searching and phylogenetic tree construction methods for SAVED domains that we have performed for our three families of interest. We expect this will be an interesting line of inquiry for a future study.

      For point c, we agree that additional evidence to support the finding that the eSMODS are eukaryotic rather than bacterial sequences would be helpful. To us, the strongest pieces of evidence would be: 1) presence of eukaryotic gene architecture, 2) adjacency to clearly eukaryotic genes in the contig, and/or 3) fluorescence in situ hybridization experiments in these species to localize where the genes are encoded. Unfortunately, the transcriptome data available does not provide this level of information. We hope that other groups will follow up on these genes and species to decide the matter more definitively. In the meantime, we feel that our filters for HGT vs. contamination have done as much as possible with the existing dataset. We have modified the text in this region to leave open potential scenarios that could be fooling us, such as the presence of unusual, long-term, eukaryote-associated symbionts in the taxa where we detect eSMODS:

      “For species represented only by transcriptomes, these criteria may still have difficulty distinguishing eukaryote-bacteria HGT from certain specific scenarios such as the long-term presence of dedicated, eukaryote-associated, bacterial symbionts. However, because these criteria allow us to focus on relatively old HGT events, they give us higher confidence these events are likely to be real. ”

      6) The authors state that obvious CD-NTase/cGLR enzymes are not present in organisms that encode the group of divergent eukaryotic "blSTINGs". Have the authors analyzed the protein-coding genes encoded immediately upstream and downstream of the blSTING proteins with AlphaFold2 and FoldSeek? It would be very exciting if putative cyclic nucleotide generating enzymes are predicted to be encoded within the nearby gene neighborhood.

      Similar to the eSMODS, the majority of the species with blSTINGs were represented by transcriptomes (22/26). We do agree that this type of analysis would be very interesting. However, we feel that this is beyond the scope of this manuscript.

      7) Line 144 appears to reference the incorrect supplementary figure. SI Figure 4 may be the correct reference?

      We agree and have made this change. We thank the reviewer for catching this error.

      I hope the authors will find my comments useful, thank you for the opportunity to read this exciting manuscript.

      Significance

      Culbertson and Levin present an elegant computational analysis of the evolutionary history of several families of immune proteins conserved in bacteria and metazoan cells. The authors' work is impressive, revealing interesting insight into previously known connections and identifying exciting new connections that further link bacterial anti-phage defense and animal innate immunity. The results are overall well-presented and will have an important impact on multiple related fields. I have a few comments for the authors to help explain some of the new connections observed in their findings and clarify the results for a general audience.

      Reviewer #2

      Evidence, reproducibility and clarity

      Describe your expertise? Molecular Evolution, Mechanisms of Protein evolution, Phylogenomics, Adaptation.

      Summary: This manuscript broadly aims to improve our understanding the evolutionary relationships between eukaryote and bacterial protein families where members of those families have immune roles. The study focuses on three such families and samples deeply across the eukaryotic tree. The approaches taken include a nice application of the EukProt database and the use of homology detection approaches that are sensitive to the issues of assigning homology through deep time. The main findings show the heterogeneity in means by which these families have arisen, with some of the families originating at least as far back as the LCA of eukaryotes, in contrast the wide spread yet patchy distribution of other families is the result of repeated independent HGT events and/or convergent domain shuffling.

      We thank the reviewer for this excellent review and their helpful comments and suggestions. We firmly believe that these comments will strengthen and clarify our work.

      Major Comments: 1. Overall the level of detail provided throughout the manuscript is lacking, perhaps the authors were constrained by a word limit for initial submission, if so then this limit needs to be extended to include the detail necessary. In addition, there are some structural issues throughout, e.g. some of the very brief intro (see later comment) reads a little more like methods (paragraph 2) and abstract (paragraph 3). The results section is lacking detail of the supporting evidence from the clever analyses that were clearly performed and the statistics underpinning conclusions are not included.

      Good suggestion, we have updated the paper to include more details and statistics on the analyses that were performed. We have also expanded on some of the most interesting findings about these bacterial innate immune proteins in the introduction (see Comment 2 below for our changes), as well as shifting the methods-like paragraph mentioned (paragraph 2) to later on in the paper. For paragraph 3, we have slimmed this down to include fewer details, but leave the final paragraph of the Introduction as a brief synopsis to prime the reader for the rest of the paper.

      1. The intro and discussion both include statements about some recent discoveries that bacteria and mammals share mechanisms of innate immunity - but there is no further detail into what would appear to be important work leading to this study. This context needs to be provided in more detail therefore I would encourage the authors to expand on the intro to include specific detail on these significant prior studies. In addition, more background information on the gene families investigated in detail here would be useful e.g. how the proteins produced influence immunity etc should be a feature of the intro. A clear and concise rationale for why these 3 particular gene families (out of all the possible innate immune genes known) were selected for analysis.

      We have added in additional background about some of the most exciting discoveries made in the past few years. We also included specific rationale as to why we chose to look at cGAS, STING, and Viperin.

      Specifically, we have added the following to the introduction:

      “ For example, bacterial cGAS-DncV-like nucleotidyltransferases (CD-NTases), which generate cyclic nucleotide messengers (similar to cGAS), are massively diverse with over 6,000 CD-NTase proteins discovered to date. Beyond the cyclic GMP-AMP signals produced by animal cGAS proteins, bacterial CD-NTases are capable of producing a wide array of nucleotide signals including cyclic dinucleotides, cyclic trinucleotides, and linear oligonucleotides [11,14]. Many of these bacterial CD-NTase products are critical for bacterial defense against viral infection[8]. Interestingly, these discoveries with the CD-NTases mirror what has been discovered with bacterial viperins. In mammals, viperin proteins restrict viral replication by generating 3’-deoxy-3’,4’didehdro- (ddh) nucleotides[4,15–17] block RNA synthesis and thereby inhibit viral replication[15,18]. Mammalian viperin generates ddhCTP molecules while bacterial viperins can generate ddhCTP, ddhUTP, and ddhGTP. In some cases, a single bacterial protein is capable of synthesizing two or three of these ddh derivatives[4]. These discoveries have been surprising and exciting, as they imply that some cellular defenses have deep commonalities spanning across the entire Tree of Life, with additional new mechanisms of immunity waiting to be discovered within diverse microbial lineages. But despite significant homology, these bacterial and animal immune proteins are often distinct in their molecular functions and operate within dramatically different signaling pathways (reviewed here[5]). How, then, have animals and other eukaryotes acquired these immune proteins?”

      In regards to why we choose to investigate CD-NTases, STING, and Viperin specifically, we have added the following to the third paragraph of the introduction:

      “We choose to focus on the cGAS, STING, and Viperin for a number of reasons. First, in metazoans cGAS and STING are part of the same signaling pathway whereas bacterial CD-NTases often act independently of bacterial STINGs[21], raising interesting questions about how eukaryotic immune proteins have gained their signaling partners. Also, given the vast breadth of bacterial CD-NTase diversity, we were curious as to if any eukaryotes had acquired CD-NTases distinct from cGAS. For similar reasons, we investigated Viperin, which also has a wide diversity in bacteria but a much more narrow described function in eukaryotes.”

      1. Context: Genome quality is always a concern, and confirming the absence of an element/protein in a genome is challenging given the variation in quality of available genomes. Low BUSCO scores mean that the assessment of gene loss is difficult to evaluate (but we are not provided with said scores). Query: in the results section it states that the BUSCO completeness scores (which need to be provided) etc were insufficient to explain the pattern of gene loss. I would like to know how they reached this conclusion - what statistical analyses (ANOVA?? OTHER??) have been performed to support this statement and please include the associated P values etc. Similarly, throughout the paper, including in the discussion section, the point is brushed over. If, given a statistical test, you find that some of the disparity in gene presence is explained by BUSCO score, most of your findings are still valid. It would just be difficult to make conclusions about gene loss.

      We have rewritten this section to be more clear about what we feel we can and cannot say about gene loss and BUSCO scores. This section now reads:

      “However, outside of Metazoa, these homologs were sparsely distributed, such that for most species in our dataset (711/993), we did not recover proteins from any of the three immune families examined (white space, lack of colored bars, Fig. 1B). While some of these absences may be due to technical errors or dataset incompleteness (Supp. Fig. 2), we interpret this pattern as a reflection of ongoing, repeated gene losses across eukaryotes, as has been found for other innate immune proteins[27–29] and other types of gene families surveyed across eukaryotes[28,30–32]. Indeed, many of the species that lacked any of the immune homologs were represented by high-quality datasets (Ex: Metazoa, Chlorplastida, and Fungi). Thus, although it is always possible that our approach has missed some homologs, we believe the resulting data represents a fair assessment of the diversity across eukaryotes, at least for those species currently included within EukProt.”

      In addition, we direct readers to EukProt v3, where the BUSCO scores are publicly available.

      “BUSCO scores can also be viewed on EukProt v3 (https://evocellbio.com/SAGdb/images/EukProtv3.busco.output.txt).”

      1. In terms of the homolog search strategy - line 394 - can you please state what an "outgroup gene family" means in this context. It is unclear but very important to the downstream interpretation of results.

      We have updated the materials and methods to specifically name our outgroups:

      “As outgroup sequences, we used Poly(A) RNA polymerase (PAP) sequences for the CD-NTases, and molybdenum cofactor biosynthetic enzyme (MoaA) for viperin. We did not have a suitable outgroup for STING domains, nor did any diverged outgroups come up in our searches.”

      1. For reproducibility, the materials and methods section needs to provide more detail/sufficient detail to reproduce these results. E.g the section describing phase 1 of the euk searches the text here repeats what is in the results section for the crystal structure work but doesn't give me any information on how, what method was used to "align the crystal structures", what scoring scheme is used and how the scoring scheme identifies "the core"? What specific parameters are used throughout. Why is MAFFT the method of choice for some of the analyses? Whereas, in other cases both MAFFT and MUSCLE are employed. What are the specific settings used for the MAFFT alignments throughout - is it default (must state if that is the case) or is it MAFFT L-INS-I with default settings etc.

      We have updated the text to include the specific settings used each time a particular software package was deployed. We also have included information for STING as to how we aligned 3 published crystal structures to determine the boundaries of homology.

      Here is how we now discuss identifying the “core” STING domain:

      “ For STING, where the Pfam profile includes regions of the protein outside of the STING domain, we generated a new HMM for the initial search. First, we aligned crystal structures of HsSTING (6NT5), Flavobacteriaceae sp. STING (6WT4) and Crassostrea gigas STING (6WT7) with the RCSB PDB “Pairwise Structure Alignment” tool with a jFATCAT (rigid) option[73,74]. We defined a core “STING” domain, as the ungapped region of 6NT5 that aligned with 6WT7 and 6WT4 (residues G152-V329 of 6NT5).Then we aligned 15 eukaryotic sequences from PF15009 (all 15 of the “Reviewed” sequences on InterPro) with MAFFT(v7.4.71)[75] with default parameters and manually trimmed the sequences down to the boundaries defined by our crystal alignment (residues 145-353 of 6NT5). We then trimmed the alignment with TrimAI (v1.2)[76] with options -gt 0.2. The trimmed MSA was then used to generate an HMM profile with hmmbuild from the hmmer (v3.2.1) package (hmmer.org) using default settings. “

      We employed three alignment softwares at specific times throughout our analyses. MAFFT was used as our default aligner for most of the analysis. Hmmalign (part of the hmmer package) was used to make the alignments prior to hmmbuild. The overall goal of this work was to reconstruct the evolutionary history of these proteins via a phylogenetic tree. To ensure that this tree topology was as robust as possible we employed the more computationally intensive, but more accurate, tree builder MUSCLE. We have updated the text in the methods section to be more clear as to why we used each software.

      We have updated the methods section to read:

      “MUSCLE was deployed in parallel with MAFFT to generate these final alignments to ensure that the final tree topology would be as robust as possible. MUSCLE is a slightly more accurate but more computationally intensive alignment software[79].”

      1. The justification for the number of HMM searches needs to be included. The choice of starting points for the HMMs was cryptic - please provide details. It is likely that you ran the search until no more sequences were found or until sequences were added from a different gene family, and that these happened to be between 3 and 5 searches, but it reads like you wanted to run it 3 or 5 times and that corresponds to the above condition. Something like this would be clearer: "The profile was [...] until no more sequences were found or until sequences from other gene families were found which was between 3 and 5 times in all cases" - the same is true of figure 1.

      We agree that this could have been worded better. We have updated the text to make it more clear that we searched until saturation which happened to occur between 3-5 searches and not that we arbitrarily wanted to do 3-5 searches.

      We have updated the text, which now reads:

      “After using this approach to create pan-eukaryotic HMMs for each protein family, we then added in bacterial homologs to generate universal HMMs (Fig. 1A and Supp. Fig. 1), continuing our iterative searches until we either failed to find any new protein sequences or began finding proteins outside of the family of interest (Supp. Fig. 1). To define the boundaries that separated our proteins of interest from neighboring gene families, we focused on including homologs that shared protein domains that defined that family (see Materials and Methods for domain designations) and were closer to in-group sequences than the outgroup sequences on a phylogenetic tree (outgroup sequences are noted in the Materials and Methods). “

      We also updated the figure legend to Fig. 1. It now reads:

      “Each set of searches was repeated until few or no additional eukaryotic sequences were recovered which was between 3-5 times in all cases.”

      1. Why do you limit hits to 10 per species - might this lead to misleading findings about gene family diversity? Info and justification for approach is required (411-412).

      We limited the hits to 10 per species to limit the influence of any one species on our alignments and subsequent phylogenetic trees. This 10-per-species cap was never reached with any search for STING or Viperin, but was used to throttle the number of Metazoan hits when searching for CD-NTases. Because of this, we probably have missed some amount of the diversity of Metazoan Mab21-like/OAS-like sequences, although this was not a focus of our manuscript. We have updated the text to be more clear about why we have included this limit and when the limit was invoked.

      We have update the text, which now reads:

      “HMM profiles were used to search EukProt via hmmsearch (also from hmmer v3.2.1) with a statistical cutoff value of 1e-3 and -hit parameter set to 10 (i.e. the contribution of a single species to the output list is capped at 10 sequences). It was necessary to cap the output list, as EukProt v3 includes de novo transcriptome assemblies with multiple splice isoforms of the same gene and we wanted to limit the overall influence a single species had on the overall tree. We never reached the 10 species cap for any search for STING or viperin homologs; only for the CD-NTases within Metazoa did this search cap limit hits.”

      1. The information in Supplementary Figure 3 is quite difficult to assess visually, but I think that is what is expected from that figure. However, this is an important underpinning element of the work and should really be quantitatively assessed. A metric of comparison of trees, with defined thresholds etc there are many out there, even a simple Robinson-Foulds test perhaps? Essentially - comparing the panels in Supplementary Figure 3 by eye is unreliable and in this case not possible given there are no labels. It would also be important to provide these full set of phylogenies generated and associated RF/other scores as supplementary file.

      We agree that this Supplementary Figure is difficult to assess by eye, however we feel that it is vital to show this data. Visually, we do feel like this figure conveys the idea that while individual branches may move around, the major clades/areas of interest are stable across the different alignments and tree builders. To increase robustness, we have included the weighted Robinson-Foulds test results into a new panel of this figure (Supplementary Fig. 3B).

      We have added a section to the methods on how this weighted Robinson-Foulds test was conducted:

      “Weighted Robinson-Foulds distances for Supp. Fig. 3B were calculated with Visual TreeCmp (settings: -RFWeighted -Prune trees -include summary -zero weights allowed)[83].”

      We added the weighted Robinson-Foulds data to Supplemental Fig. 3 and have updated the figure legend to reflect this new data. The new legend for Supp. Fig. 3B reads:

      “(B) The average weighted Robinson-Foulds distances all pairwise comparisons between the four tree types (MAFFT/MUSCLE alignment built with IQTREE/RAXML-ng). Although the distances were higher for the CD-NTase tree (as expected for this highly diverse gene family), all of the key nodes defining the cGLR, OAS, and eSMODS superfamilies, as well as their nearest bacterial relatives, were well supported (>70 ultrafast bootstrap value).”

      1. Does domain shuffling mean that phylogenetic reconstruction is less valid? How was the alignment performed in these cases to account for this.

      Thank you for bringing this up, this is a point we have now clarified in the text. Our searches, alignments, and trees are all of single protein domains, as typically only conservation within domains is retained across the vast distances between bacteria and eukaryotes. As such, domain shuffling should have no impact on the validity of that phylogenetic reconstruction. We have updated the text to be more clear about the scope of the alignments and searches. We made changes to our wording throughout the manuscript. One specific example of this is:

      “Using maximum likelihood phylogenetic reconstruction on the STING domain alone, we identified STING-like sequences from 26 diverse microeukaryotes whose STING domains clustered in between bacterial and metazoan sequences, breaking up the long branch.”

      Minor Comments: 10. I am not sure about the use of the term "truly ancestral" or variants thereof, same issues with "significant homology" and "inherited since LECA and possibly longer" .. these are awkwardly phrased. E.g. I think perhaps "homologous across the whole length" might be clearer, and elsewhere "present in LECA and possibly earlier" may be more fitting.

             We have updated the text for these phrases throughout the manuscript and have replaced them with more specific language.
      
      1. Line 75 - "Detecting" rather than discovering?

      We appreciate the suggestion. However, because many of these gene families have never been described in the eukaryotic lineages considered here, we think ‘discovering’ is more appropriate. Indeed, the eSMODS lineage demonstrates that our search approach has the power to find not just new homologs but to discover totally new subfamilies of these eukaryotic proteins.

      1. 132-133 - more justification is needed for the choice of bacterial genes.

      We have clarified that our selection of bacterial CD-NTases included every known CD-NTase at the time of our analysis. The text now reads:

      “As representative bacterial CD-NTases, we used 6,132 bacterial sequences, representing a wide swath of CD-NTase diversity[43]. To our knowledge, this dataset included every known bacterial CD-NTase at the time of our analysis.”

      1. For the downsizing from 6000 to 500 what were the criteria and thresholds.

      We have updated the text to include the PDA software options for downsampling.The text now reads:

      “We downsampled the CD-NTase bacterial sequences from ~6000 down to 500 using PDA software (options -k 500) on a FastTree (default settings) tree built upon a MAFFT (default parameters) tree, to facilitate more manageable computation times on alignments and tree construction.“

      1. How are you rooting your trees e.g. figure 2? Information is provided for Viperin but not others.

      We have updated the text to ensure that the root of every tree is specifically stated.

      1. In the results section on CD-NTases I think it would be best to place the second paragraph detailing the role of cGAS earlier in this section, perhaps after the first sentence.

      We have moved the second paragraph, which introduces cGAS, OAS, and the other CD-NTases to the beginning of the CD-NTase section.The first paragraph of the CD-NTase section of the results now reads:

      “We next studied the evolution of the innate immune proteins, beginning with cGAS and its broader family of CD-NTase enzymes. Following infections or cellular damage, cGAS binds cytosolic DNA and generates cyclic GMP-AMP (cGAMP)[32–35], which then activates downstream immune responses via STING [34,36–38]. Another eukaryotic CD-NTase, 2’5’-Oligoadenylate Synthetase 1 (OAS1), synthesizes 2',5'-oligoadenylates which bind and activate Ribonuclease L (RNase L)[39]. Activated RNase L is a potent endoribonuclease that degrades both host and viral RNA species, reducing viral replication (reviewed here[40,41]). Some bacterial CD-NTases such as DncV behave similar to animal cGAS; they are activated by phage infection and produce cGAMP[8,42,43]. These CD-NTases are commonly found within cyclic oligonucleotide-based anti-phage signaling systems (CBASS) across many bacterial phyla and archaea[8,27,43].”

      1. Is FASTtree really necessary to include as it will underperform in all instances? Removing that method and comparing the remaining two (i.e. IQTREE and RAXML) - what level of disagreement do you find between the 2 alignment and 2 tree building methods? The cases that disagree should also be detailed.

      We agree that FASTtree underperforms against IQTREE and RAXML and have eliminated those trees from the supplement. We initially had included FASTtree, as it still seems to be widely used in phylogenetic analyses within the recent papers on bacterial immune homologs, but we completely agree with the reviewer and have removed it. In addition, we have calculated and added in the average weighted Robinson-Foulds Distance to Supplemental Figure 3. Our manuscript focuses on features of the phylogenetic trees that were consistent across all the replicate methods. However, given the numerous sequences and high degree of divergence involved, there were many cases where individual branches shifted between the methods, e.g. if individual CD-NTases within bacterial clade G swapped positions with one another. The differences we observed between the trees were inconsequential to our overall conclusions.

      1. Again a structural point - the start to paragraph "To understand the evolutionary history of CD-NTases we used the Pfam domain PF03281 as a starting point", I don't know at this point why or how you have done this. The sentence seems a little premature. I would therefore suggest that you start that paragraph with your motivation, "In order to..." and then finish that paragraph with your sentence in quotes above which actually summarizes the paragraph.

      We have updated the text to clear up this paragraph (in addition to other structural changes in the CD-NTase section. The paragraph containing information about how we started the HMM searches for the CD-NTases now reads:

      “ To begin our sequence searches for eukaryotic CD-NTases, we used the Pfam domain PF03281, representing the main catalytic domain of cGAS, as a starting point. As representative bacterial CD-NTases, we used 6,132 bacterial sequences, representing a wide swath of CD-NTase diversity[21]. Following our iterative HMM searches, we recovered 313 sequences from 109 eukaryotes, of which 34 were metazoans (Supplemental Data and Fig. 1B). Within the phylogenetic trees, most eukaryotic sequences clustered into one of two distinct superfamilies: the cGLR superfamily (defined by clade and containing a Mab21 PFAM domain: PF03281) or the OAS superfamily (OAS1-C: PF10421) (Fig. 2A). Bacterial CD-NTases typically had sequences matching the HMM for the Second Messenger Oligonucleotide or Dinucleotide Synthetase domain (SMODS: PF18144).”

      1. Line 148 - "within" change to "before"?

      We have updated the text with this suggestion.

      1. Unclear from text as is whether you found any STING homologs in arthropods (~line 157). Please update the text for clarity. Would also suggest that "agreeing" should be replaced with "aligning".

      We found several STING homologs in arthropods and have updated the text to specifically note this. We also have updated the text as per the suggestion of using the term “aligning” instead of “agreeing”.The text now reads:

      “Almost half of these species (10/19) were arthropods, aligning with prior findings of STING sparseness among arthropods(Wu et al. 2014). We did find STING homologs in 8/19 arthropod species in EukProt v3, including the previously identified STINGs of Drosophila melanogaster, Apis mellifera and Tribolium castaneum(Wu et al. 2014; Margolis, Wilson, and Vance 2017).”

      1. Line 169 - If clade D is not a clade, maybe it should be called something different.

      Yes, unfortunate naming, isn’t it? Clade D is not a coherent clade in our results nor when it was first described, but we feel that for consistency with the rest of the field, it is best if we adhere to previously published nomenclature.

      1. Line 188-190 - In principle, max likelihood should be able to infer the right tree even with high divergence.

      Yes, we agree that maximum likelihood methods should be able to infer the correct tree. However, we are not sure what change the reviewer is suggesting here.

      1. Paragraph starting at 199 - eSMODS - always unknown function or mostly - could be important.

      To our knowledge the function of the two closest bacterial CD-NTases to the eSMODS group have an unknown function.

      1. For calling HGT you state that one of the criteria is that the euk and bac sequences branched near one another, what is "near" in this scenario?

      “Near” in this case refers to being adjacent on the phylogenetic tree. We have updated the text for clarity. The text now reads:

      “To minimize such false positive HGT calls, we took a conservative approach in our analyses, considering potential bacteria-eukaryote HGT events to be trustworthy only if: 1) eukaryotic and bacterial sequences branched adjacent to one another with strong support (bootstrap values >70); 2) the eukaryotic sequences formed a distinct subclade, represented by at least 2 species from the same eukaryotic supergroup; 3) the eukaryotic sequences were produced by at least 2 different studies; and 4) the position of the horizontally transferred sequences was robust across all alignment and phylogenetic reconstruction methods used (Supp. Fig. 3A).”

      1. In legends be specific about what type of support value, e.g. bootstrap or jack-knife.. I think it is always bootstrap but would be good to have that precision.

      Our phylogenetic trees only use bootstrap values for support and so have updated the figure legends and methods to provide this information. Apologies for this lack of clarity.

      1. Throughout the text if stating e.g. "clustered robustly and with high support" please provide the appropriate values.

      We have updated the text to provide bootstrap values when invoking statements about support. An example of this is:

      “There are two clades of Chloroplastida (a group within Archaeplastida) sequences that branch robustly (>80 ultrafast bootstrap value) within the bacteria clade.”

      1. It is unclear from the text how the animal origin of the TIR domain is supported (~line 274). Please provide necessary details to support your statements in the results section.

      Our phylogenetic tree of TIR domains (Supp. Fig. 7), places C. gigas’ TIR domain (of its STING protein) clusters with high support next to other metazoan TIR domains.

      We have updated the STING section to include these lines:

      “We also investigated the possibility that C. gigas acquired the TIR-domain of its TIR-STING protein via HGT from bacteria, however this analysis also suggested an animal origin for the TIR domain (Supp. Fig. 7), as the C. gigas TIR domain clustered with other metazoan TIR domains such as Homo sapiens TICAM1 and 2 (ultrafast bootstrap value of 75). Eukaryotic TIR-STINGs are also rare, further supporting the hypothesis that this protein resulted from recent convergence, where animals independently fused STING and TIR domains to make a protein resembling bacterial TIR-STINGs, consistent with previous reports[19].”

      1. Replace similar with -> similar "to"

      We have accepted the suggestion and replaced “with” with “to”.

      1. Line 266: It was previously shown .. or it is known but not "it was previously known"

      We have rephrased the sentence to be clearer: “Some eukaryotes like C. gigas…”.

      1. The last sentence in paragraph ~line 277: "Our work also identified a number of non-metazoan STINGS...." Please expand on this and provide some of the details on this finding in the text or point to the figure that supports the statement and provide a little more detail here.

      The intent of the words on line 277 was a summary of what we had previously discussed in the STING section. For clarity we updated the text, which now reads:

      “Interestingly the non-metazoan, blSTINGs (Fig. 3C) that are found in the Stramenopiles, Haptista, Rhizaria, Choanoflagellates and Amoebozoa have a TM-STING domain architecture similar to animal STINGs but a STING domain more similar to bacterial STINGs..”

      blSTINGs are discussed in more detail earlier in the STING section (specifically paragraph 3) where we say:

      “Using maximum likelihood phylogenetic reconstruction on the STING domain alone, we identified STING-like sequences from 26 diverse microeukaryotes whose STING domains clustered in between bacterial and metazoan sequences, breaking up the long branch. We name these sequences the bacteria-like STINGs (blSTINGs) because they were the only eukaryotic group of STINGs with a bacteria-like Prok_STING domain (PF20300) and because of the short branch length (0.86 vs. 1.8) separating them from bacterial STINGs on the tree (Fig. 3C). While a previous study reported STING domains in two eukaryotic species (one in Stramenopiles and one in Haptista) [19], we were able to expand this set to additional species and also recover blSTINGs from Amoebozoa, Rhizaria and choanoflagellates. This diversity allowed us to place the sequences on the tree with high confidence (bootstrap value >70), recovering a substantially different tree than previous work[19]. As for CD-NTases, the tree topology we recovered was robust across multiple different alignment and phylogenetic tree construction algorithms (Supp. Fig. 3A).”

      1. Line 294: it is unclear which are the orphan taxa -we are directed to figure 1 but there is no notation for orphan taxa here perhaps add something to the figure to make obvious which these are.

      We have updated the text to mention these orphan taxa specifically by name.

      The text now reads:

      “The 194 viperin-like proteins we recovered came from 158 species spanning the full range of eukaryotic diversity, including organisms from all of the major eukaryotic supergroups, as well as some orphan taxa whose taxonomy remains open to debate (Fig. 1, Ancyromonadida, Hemimastigophora, Malawimonadida).”

      1. Lines 340-341 - some redundant use of eukaryotic/eukaryotes

      We have updated the text to reduce redundancy.

      1. Lines 475-480 - some further detail needed - how were sequences trimmed to the TIR domain? - what were your starting sequences? etc.

      We have updated the text detailing how we acquired a set of proteins from Interpro and how we used hmmscan to determine the coordinates for the TIR domains in those proteins. We then isolated the TIR domains (using the coordinates defined by hmmscan) and proceeded to align those sequences

      The text now reads:

      “We used hmmscan to identify the coordinates of TIR domains in a list of 203 TIR domain containing-sequences from InterPro (all 203 proteins from curated “Reviewed” selection of IPR000157 (Toll/interleukin-1 receptor homology (TIR) domain as of 2023-04-04)) and 104 bacterial TIR-STING proteins (the same TIR-STING proteins used in Fig. 3)[3]. Next, we trimmed the sequences down to the hmmscan identified TIR coordinates and aligned the TIR domains with MUSCLE (-super5). We trimmed the alignments with TrimAL and built a phylogenetic tree with IQtree (-s, -bb 1000, -m TEST, -nt AUTO).”

      1. Check that the colour schemes for branches etc are detailed in the legends of supplementary as well as main.

      We have updated the text of figure legends to be more clear about our maintenance of the same color scheme throughout the manuscript. This involved ensuring that the following statement (or an equivalent statement) was present in the figure legends of Figures 2, 3, 4, S2, S3,S4,S5,S6, and S7:

      “Eukaryotic sequences are colored according to eukaryotic group as in Fig. 1B.”

      1. The threshold set for gaps is very strict at 0.2. This seems quite strict given the sequences are potentially quite highly divergent. What length are the alignments that you are using after trimming - these details need to be included and considered.

      We have updated the text to specifically detail how long our alignments were after trimming and how that post-trimming length compares to the length of the alignment for each PFAM group.

      Specifically, the text now reads:

      “The length of these final alignments were 232, 175, and 346 amino acids long for CD-NTases, STING, and viperin respectively. These alignments represent ≥75% of the length of alignment their respective PFAM domain (PF3281 (Mab-21 protein nucleotidyltransferase domain) for CD-NTases, PF20300 (Prokaryotic STING domain) for STING, and PF404055 (Radical SAM family) for viperin.”

      1. How were sequences downsampled with PDA? Line 424.

      We have updated the text to include the PDA settings that were used to downsample sequences. The text now reads:

      “To ensure the combined HMM did not have an overrepresentation of either bacterial or eukaryotic sequences, we downsampled the bacterial sequences and eukaryotic sequences to obtain 50 phylogenetically diverse sequences of each, and then combined the two downsampled lists. To do this, eukaryotic and bacterial sequences were each separately aligned with MAFFT (default parameters), phylogenetic trees were built with FastTree (v2.1.10)[77], and the Phylogenetic Diversity Analyzer (pda/1.0.3)[78] software with options -k 50 or -k 500 with otherwise default parameters was run the the FastTree files to downsample the sequences while maximizing remaining sequence diversity.”

      1. Please provide adequate descriptions for the materials in the supplementary files for the manuscript, they currently lack description. They are useful and we fully support their inclusion with sufficient information.

      We have expanded the descriptions of the provided supplementary files.

      1. The starting sequences, hmm pipeline and scripts would be great to include, apologies if we have missed them.

      We have added the starting bacterial sequences to the supplementary data, as well as the final HMMs, and the one script that we used in our analysis. All other software (including the included script) is freely and publicly available.

      Significance

      This study provides us with examples of instances where a medley of different mechanisms have resulted in the emergence of innate immune proteins across eukaryotes. The study is entirely bioinformatic in nature and provides some nice cases for future study. The thorough search strategies are to be commended. The limitations of the work are that we don't know whether the functions have also been conserved across deep time and/or in the independent events described. Nevertheless, this work contributes to a growing body of evidence on the complex, and sometimes shared, nature of the evolution of animal and bacterial immunity. I would classify this nice study as a conceptual advance of our understanding of the evolution of protein families through deep time and would imagine it is of interest to a broad audience of biologists from immunologists to evolutionary biologists and structural biologists.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Culbertson and Levin takes a bioinformatic approach to investigate the evolutionary origins/trajectories of three different proteins domains involved in innate immunity in both bacteria and eukaryotes: cGAS/CD-NTases, STING, and Viperins. To perform this analysis, the authors apply an iterative homology search model to the EukProt database of eukaryotic genomes. Their analysis finds that that eukaryotic CD-NTases arose from multiple horizontal gene transfer events between bacteria and eukaryotes. They also fill in an important gap in understanding how STING from bacteria evolved into modern human STING by identifying blasting in diverse eukaryotes. Finally, they determine that Viperins are an ancient protein family that likely existed in LECA, but found two more recent HGT events for proteins related in Vipirin.

      Major comments

      1. The hypothesis for the origin of STING via convergent domain shuffling could be handled with a little more care in the text. The authors show that homologs of STING from animals can also be found in the genomes of diverse eukaryotes outside the metazoa, demonstrating (1) STING and cGAS have had different histories, and (2) that these sequences are more bacteria-like than metazoan STING. However, in multiple places (the title, line 275, elsewhere) the term "convergence" could be misleading. "Convergence" leaves the reader with the impression that there is no common ancestor between the STING domain from bacteria and eukaryotes. I understand that the authors are using "convergent domain shuffling" to draw this distinction, but I'm unsure if a naïve reader will glean the distinction between domain shuffling and STING itself converging. I would argue that we simply cannot place eukaryotic STING and blSTING proteins on the tree of bSTING sequences. i.e. blSTING are no more related to bacterial TM-STING than bacterial TIR-STING (likely the missing bSTING sequences are simply extinct?). Can the authors curate their language to state more simply that STING likely arose through horizontal gene transfer, but it is unlikely that bacterial TM-STING is the unequivocal progenitor?

      We thank the reviewer for this comment, and we absolutely agree that we should be clearer about the distinction between convergence and convergent domain shuffling. We have changed the title and edited the text to increase clarity. In addition, we have clarified what our data does and does say about the evolutionary history of STING. We feel that our STING tree (Fig.3 C), due to a general sparseness of eukaryotic and bacterial sequences, is insufficient to confidently call if eukaryotes acquired STING by HGT or if STING was present in the LECA.

      We have added the following to clear up this issue:

      “Overall, the phylogenetic tree we constructed (Fig. 3C) suggests that there is domain-level homology between bacterial and eukaryotic STINGs, but due to sparseness and lack of a suitable outgroup, this tree does not definitively explain the eukaryotic origin of the STING domain. However, the data does clearly support a model in which convergent domain shuffling in eukaryotes and bacteria generated similar TM-STING and TIR-STING proteins independently.”

      Minor Comments

      1. Spelling error in Figure 3B and 3C: "cannoical"

      Thanks, we have corrected this error.

      1. Figure 5 could be improved to more clearly articulate the findings of the manuscript. In A, it's unclear how OAS relates to Mab21 and a reader not paying close attention might think that OAS was part of the gene duplications after Mab21 was acquired. The LECA origins of OAS are also not presented (albeit, these are still defined in the legend). In B, this panel would suggest that there was not horizontal transfer of STING from bacteria to eukaryotes but rather both domains of life received STING from a separate source. My understanding is STING did likely arise in bacteria, however, the assumption that extant TM-STING in bacteria is the predecessor of TM-STING in eukaryotes is not well supported. Similarly for the TIR domain.

      We have updated Fig. 5 to more clearly show that OAS was likely in the LECA and that eSMODS and cGLRs were HGT’d from bacteria to other eukaryotic lineages. For STING, it was not our intent to imply that the extant TM-STING in bacteria is the predecessor of TM-STING in eukaryotes, and we agree with the reviewer that this is unlikely. Although we do not have sufficient data to speak to the origin of the STING domain itself, we do feel confident in our evidence of domain shuffling. Our illustration in Fig 5B was meant to correspond to the following statement: “Drawing on a shared ancient repertoire of protein domains that includes STING, TIR, and transmembrane (TM) domains, bacteria and eukaryotes have convergently evolved similar STING proteins through domain shuffling.” We believe this inference valid and best describes our results for STING.

      1. Line 119: While the role of Mab21L1-2 are established for development, I'm unaware of a role for MB21D2 in development (or any other phenotype).

      We agree with the reviewer that MB21D2 has not been shown to have any phenotype and have corrected the wording to clarify this point.

      The line now reads “However, the immune functions of Mab21L1 and MB21D2 remain unclear, although Mab21L1they has been shown to be important for development[29–31].”

      1. Line 210: "Gamma" should be "genes"

      We have corrected this error and replaced the word.

      Reviewer #3 (Significance (Required)):

      This work is of high quality, is timely, and will have a large impact on shaping the field. The origins and evolution of antiviral immunity from bacteria to eukaryotes have been investigated from multiple angles. While the phylogeny and evolutionary trajectory of these genes have been traced in bacteria, there have been relatively fewer analyses across diverse (non-metazoan) eukaryotes. For this reason, I am confident that this manuscript will help future researchers select homologs for investigation and guide similar analyses of other bacterial defense systems.

      A particular challenge of this work is accounting for gene loss across taxa and weighing that possibility against horizontal gene transfer. The authors are conservative in their conclusions and well-reasoned. The comments I have can be addressed with changes to the writing and emphasis of certain points.

      I expect these findings to be of interest to a broad audience of evolutionary biologists, microbiologists, and immunologists.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Culbertson and Levin takes a bioinformatic approach to investigate the evolutionary origins/trajectories of three different proteins domains involved in innate immunity in both bacteria and eukaryotes: cGAS/CD-NTases, STING, and Viperins. To perform this analysis, the authors apply an iterative homology search model to the EukProt database of eukaryotic genomes. Their analysis finds that that eukaryotic CD-NTases arose from multiple horizontal gene transfer events between bacteria and eukaryotes. They also fill in an important gap in understanding how STING from bacteria evolved into modern human STING by identifying blasting in diverse eukaryotes. Finally, they determine that Viperins are an ancient protein family that likely existed in LECA, but found two more recent HGT events for proteins related in Vipirin.

      Major comments

      1. The hypothesis for the origin of STING via convergent domain shuffling could be handled with a little more care in the text. The authors show that homologs of STING from animals can also be found in the genomes of diverse eukaryotes outside the metazoa, demonstrating (1) STING and cGAS have had different histories, and (2) that these sequences are more bacteria-like than metazoan STING. However, in multiple places (the title, line 275, elsewhere) the term "convergence" could be misleading. "Convergence" leaves the reader with the impression that there is no common ancestor between the STING domain from bacteria and eukaryotes. I understand that the authors are using "convergent domain shuffling" to draw this distinction, but I'm unsure if a naïve reader will glean the distinction between domain shuffling and STING itself converging. I would argue that we simply cannot place eukaryotic STING and blSTING proteins on the tree of bSTING sequences. i.e. blSTING are no more related to bacterial TM-STING than bacterial TIR-STING (likely the missing bSTING sequences are simply extinct?). Can the authors curate their language to state more simply that STING likely arose through horizontal gene transfer, but it is unlikely that bacterial TM-STING is the unequivocal progenitor?

      Minor Comments

      1. Spelling error in Figure 3B and 3C: "cannoical"
      2. Figure 5 could be improved to more clearly articulate the findings of the manuscript. In A, it's unclear how OAS relates to Mab21 and a reader not paying close attention might think that OAS was part of the gene duplications after Mab21 was acquired. The LECA origins of OAS are also not presented (albeit, these are still defined in the legend). In B, this panel would suggest that there was not horizontal transfer of STING from bacteria to eukaryotes but rather both domains of life received STING from a separate source. My understanding is STING did likely arise in bacteria, however, the assumption that extant TM-STING in bacteria is the predecessor of TM-STING in eukaryotes is not well supported. Similarly for the TIR domain.
      3. Line 119: While the role of Mab21L1-2 are established for development, I'm unaware of a role for MB21D2 in development (or any other phenotype).
      4. Line 210: "Gamma" should be "genes"

      Significance

      This work is of high quality, is timely, and will have a large impact on shaping the field. The origins and evolution of antiviral immunity from bacteria to eukaryotes have been investigated from multiple angles. While the phylogeny and evolutionary trajectory of these genes have been traced in bacteria, there have been relatively fewer analyses across diverse (non-metazoan) eukaryotes. For this reason, I am confident that this manuscript will help future researchers select homologs for investigation and guide similar analyses of other bacterial defense systems.

      A particular challenge of this work is accounting for gene loss across taxa and weighing that possibility against horizontal gene transfer. The authors are conservative in their conclusions and well-reasoned. The comments I have can be addressed with changes to the writing and emphasis of certain points.

      I expect these findings to be a interest to a broad audience of evolutionary biologists, microbiologists, and immunologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Describe your expertise? Molecular Evolution, Mechanisms of Protein evolution, Phylogenomics, Adaptation.

      Summary: This manuscript broadly aims to improve our understanding the evolutionary relationships between eukaryote and bacterial protein families where members of those families have immune roles. The study focusses on three such families and samples deeply across the eukaryotic tree. The approaches taken include a nice application of the EukProt database and the use of homology detection approaches that are sensitive to the issues of assigning homology through deep time. The main findings show the heterogeneity in means by which these families have arisen, with some of the families originating at least as far back as the LCA of eukaryotes, in contrast the wide spread yet patchy distribution of other families is the result of repeated independent HGT events and/or convergent domain shuffling.

      Major Comments:

      1. Overall the level of detail provided throughout the manuscript is lacking, perhaps the authors were constrained by a word limit for initial submission, if so then this limit needs to be extended to include the detail necessary. In addition, there are some structural issues thorughout, e.g. some of the very brief intro (see later comment) reads a little more like methods (paragraph 2) and abstract (paragraph 3). The results section is lacking detail of the supporting evidence from the clever analyses that were clearly performed and the statistics underpinning conclusions are not included.
      2. The intro and discussion both include statements about some recent discoveries that bacteria and mammals share mechanisms of innate immunity - but there is no further detail into what would appear to be important work leading to this study. This context needs to be provided in more detail therefore I would encourage the authors to expand on the intro to include specific detail on these significant prior studies. In addition, more background information on the gene families investigated in detail here would be useful e.g. how the proteins produced influence immunity etc should be a feature of the intro. A clear and concise rationale for why these 3 particular gene families (out of all the possible innate immune genes known) were selected for analysis.
      3. Context: Genome quality is always a concern, and confirming the absence of an element/protein in a genome is challenging given the variation in quality of available genomes. Low BUSCO scores mean that the assessment of gene loss is difficult to evaluate (but we are not provided with said scores). Query: in the results section it states that the BUSCO completeness scores (which need to be provided) etc were insufficient to explain the pattern of gene loss. I would like to know how they reached this conclusion - what statistical analyses (ANOVA?? OTHER??) have been performed to support this statement and please include the associated P values etc. Similarly, throughout the paper, including in the discussion section, the point is brushed over. If, given a statistical test, you find that some of the disparity in gene presence is explained by BUSCO score, most of your findings are still valid. It would just be difficult to make conclusions about gene loss.
      4. In terms of the homolog search strategy - line 394 - can you please state what an "outgroup gene family" means in this context. It is unclear but very important to the downstream interpretation of results.
      5. For reproducibility, the materials and methods section needs to provide more detail/sufficient detail to reproduce these results. E.g the section describing phase 1 of the euk searches the text here repeats what is in the results section for the crystal structure work but doesn't give me any information on how, what method was used to "align the crystal structures", what scoring scheme is used and how the scoring scheme identifies "the core"? What specific parameters are used throughout. Why is MAFFT the method of choice for some of the analyses? Whereas, in other cases both MAFFT and MUSCLE are employed. What are the specific settings used for the MAFFT alignments throughout - is it default (must state if that is the case) or is it MAFFT L-INS-I with default settings etc.
      6. The justification for the number of HMM searches needs to be included. The choice of starting points for the HMMs was cryptic - please provide details. It is likely that you ran the search until no more sequences were found or until sequences were added from a different gene family, and that these happened to be between 3 and 5 searches, but it reads like you wanted to run it 3 or 5 times and that corresponds to the above condition. Something like this would be clearer: "The profile was [...] until no more sequences were found or until sequences from other gene families were found which was between 3 and 5 times in all cases" - the same is true of figure 1.
      7. Why do you limit hits to 10 per species - might this lead to misleading findings about gene family diversity? Info and justification for approach is required (411-412)
      8. The information in Supplementary Figure 3 is quite difficult to assess visually, but I think that is what is expected from that figure. However, this is an important underpinning element of the work and should really be quantitatively assessed. A metric of comparison of trees, with defined thresholds etc there are many out there, even a simple Robinson-Foulds test perhaps? Essentially - comparing the panels in Supplementary Figure 3 by eye is unreliable and in this case not possible given there are no labels. It would also be important to provide these full set of phylogenies generated and associated RF/other scores as supplementary file.
      9. Does domain shuffling mean that phylogenetic reconstruction is less valid? How was the alignment performed in these cases to account for this.

      Minor Comments:

      1. I am not sure about the use of the term "truly ancestral" or variants thereof, same issues with "significant homology" and "inherited since LECA and possibly longer" .. these are awkwardly phrased. E.g. I think perhaps "homologous across the whole length" might be clearer, and elsewhere "present in LECA and possibly earlier" may be more fitting.
      2. Line 75 - "Detecting" rather than discovering?
      3. 132-133 - more justification is needed for the choice of bacterial genes.
      4. For the downsizing from 6000 to 500 what were the criteria and thresholds.
      5. How are you rooting your trees e.g. figure 2? Information is provided for Viperin but not others.
      6. In the results section on CD-NTases I think it would be best to place the second paragraph detailing the role of cGAS earlier in this section, perhaps after the first sentence.
      7. Is FASTtree really necessary to include as it will underperform in all instances? Removing that method and comparing the remaining two (i.e. IQTREE and RAXML) - what level of disagreement do you find between the 2 alignment and 2 tree building methods? The cases that disagree should also be detailed.
      8. Again a structural point - the start to paragraph "To understand the evolutionary history of CD-NTases we used the Pfam domain PF03281 as a starting point", I don't know at this point why or how you have done this. The sentence seems a little premature. I would therefore suggest that you start that paragraph with your motivation, "In order to..." and then finish that paragraph with your sentence in quotes above which actually summarises the paragraph.
      9. Line 148 - "within" change to "before"?
      10. Unclear from text as is whether you found any STING homologs in arthropods (~line 157). Please update the text for clarity. Would also suggest that "agreeing" should be replaced with "aligning".
      11. Line 169 - If clade D is not a clade, maybe it should be called something different.
      12. Line 188-190 - In principle, max likelihood should be able to infer the right tree even with high divergence.
      13. Paragraph starting at 199 - eSMODS - always unknown function or mostly - could be important.
      14. For calling HGT you state that one of the criteria is that the euk and bac sequences branched near one another, what is "near" in this scenario?
      15. In legends be specific about what type of support value, e.g. bootstrap or jack-knife.. I think it is always bootstrap but would be good to have that precision.
      16. Throughout the text if stating e.g. "clustered robustly and with high support" please provide the appropriate values.
      17. It is unclear from the text how the animal origin of the TIR domain is supported (~line 274). Please provide necessary details to support your statements in the results section.
      18. Replace similar with -> similar "to"
      19. Line 266: It was previously shown .. or it is known but not "it was previously known"
      20. The last sentence in paragraph ~line 277: "Our work also identified a number of non-metazoan STINGS...." Please expand on this and provide some of the details on this finding in the text or point to the figure that supports the statement and provide a little more detail here.
      21. Line 294: it is unclear which are the orphan taxa -we are directed to figure 1 but there is no notation for orphan taxa here perhaps add something to the figure to make obvious which these are.
      22. Lines 340-341 - some redundant use of eukaryotic/eukaryotes
      23. Lines 475-480 - some further detail needed - how were sequences trimmed to the TIR domain? - what were your starting sequences? etc.
      24. Check that the colour schemes for branches etc are detailed in the legends of supplementary as well as main.
      25. The threshold set for gaps is very strict at 0.2. This seems quite strict given the sequences are potentially quite highly divergent. What length are the alignments that you are using after trimming - these details need to be included and considered.
      26. How were sequences downsampled with PDA? Line 424.
      27. Please provide adequate descriptions for the materials in the supplementary files for the manuscript, they currently lack description. They are useful and we fully support their inclusion with sufficient information.
      28. The starting sequences, hmm pipeline and scripts would be great to include, apologies if we have missed them.

      Significance

      This study provides us with examples of instances where a medley of different mechanisms have resulted in the emergence of innate immune proteins across eukaryotes. The study is entirely bioinformatic in nature and provides some nice cases for future study. The thorough search strategies are to be commended. The limitations of the work are that we don't know whether the functions have also been conserved across deep time and/or in the independent events described. Nevertheless, this work contributes to a growing body of evidence on the complex, and sometimes shared, nature of the evolution of animal and bacterial immunity. I would classify this nice study as a conceptual advance of our understanding of the evolution of protein families through deep time and would imagine it is of interest to a broad audience of biologists from immunologists to evolutionary biologists and structural biologists.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Culbertson and Levin present an elegant computational analysis of the evolutionary history of several families of immune proteins conserved in bacteria and metazoan cells. The authors' work is impressive, revealing interesting insight into previously known connections and identifying exciting new connections that further link bacterial anti-phage defense and animal innate immunity. The results are overall well-presented and will have an important impact on multiple related fields. I have a few comments for the authors to help explain some of the new connections observed in their findings and clarify the results for a general audience.

      Comments:

      1. The authors adeptly navigate difficult and changing nomenclature around cGAS-STING signaling but there may be room for clarifying terminology. Although historically the term "CD-NTase" has been used to describe both bacterial and animal enzymes (including by this reviewer's older work as well), the field has now settled on consistent use of the name "CD-NTase" to describe bacterial cGAS/DncV-like enzymes and the use of the names "cGAS" and "cGLR" to describe animal cGAS-like receptor proteins. Nearly all papers describing bacterial signaling use the term CD-NTase, and since 2021 most papers describing divergent cGAS-like enzymes in animal signaling now use the term "cGLR" (for recent examples see primary papers Holleufer et al 2021 PMID 34261128; Slavik et al 2021 PMID 34261127; Li et al 2023 PMID 37379839; Cai et al 2023 PMID 37659413 and review articles Cai et al 2022 PMID 35149240; Slavik et al 2023 PMID 37380187; Fan et al 2021 PMID 34697297; West et al 2021 PMID 34373639 Unterholzner Cell 2023 PMID 37478819). Kingdom-specific uses of CD-NTase and cGLR may help add clarity to the manuscript especially as each group of enzyme is quite divergent and many protein members synthesize signaling molecules that are distinct from cyclic GMP-AMP (i.e. not cGAS).

      Related to this point, the term "SMODS" is useful for describing the protein family domain originally identified in the elegant work of Burroughs and Aaravind (Burroughs et al 2015 PMID 26590262), but this term is rarely used in papers focused on the biology of these systems. "eSMODS" is a good name, but the authors may want to consider a different description to better fit with current terminology. 2. The authors state that proteins were identified using an iterative HMM-based search until they "began finding proteins outside of the family of interest" (Line 86). Is it possible to please explain in more detail what this means? A key part of the analysis pipeline is knowing when to stop, especially as some proteins like CD-NTases and cGLRs share related-homology to other major enzyme groups like pol-beta NTases while other proteins like STING and viperin are more unique. 3. The authors comment on several controls to guard against potential contaminating bacterial sequences present in metazoan genome sequencing datasets (Lines 174-182). It may be helpful to include this very important part of the analysis as part of the stepwise schematic in Figure 1a. Additionally, have the authors used other eukaryotic features like the presence of introns or kingdom specific translation elements (e.g. Shine-Dalgarno- vs. Kozak-like sequences) as part of the analysis? 4. A particularly surprising result of the analysis is a proposed connection between oligoadenylate synthase-like (OAS-like) enzymes and bacterial Clade C CD-NTases. A concern with these results is that previous structural analysis has demonstrated that bacterial CD-NTase enzymes and animal cGLRs are more closely related to each other than they are to OAS (Slavik et al 2021 PMID 34261127). Can the authors provide further support for a connection between OAS and Clade C CD-NTases? The C-terminal alpha-helix bundle of OAS is known to be distinct (Lohöfener et al 2015 PMID 25892109) and perhaps AlphaFold2 modeling of bacterial Clade C CD-NTases and additional OAS sequences may provide further bioinformatic evidence to support the authors' conclusions. 5. One of the most exciting results in the paper is identification of a family of putative CD-NTase enzymes conserved in metazoans. Although full description may be beyond the scope of this paper, if possible, some more analysis would be interesting here:

      a. Are these CD-NTase enzymes in a conserved gene neighborhood within the metazoan genomes (i.e. located next to a potential cyclic nucleotide receptor?)

      b. Do these metazoan genomes encode other known receptors for cyclic nucleotide signaling (PFAM searches for CARF or SAVED domains for instance).

      c. Similar to points 3 and 4, is it possible to add further evidence for support of these proteins as true metazoan sequences that have predicted structural homology to bacterial CD-NTase enzymes? 6. The authors state that obvious CD-NTase/cGLR enzymes are not present in organisms that encode the group of divergent eukaryotic "blSTINGs". Have the authors analyzed the protein-coding genes encoded immediately upstream and downstream of the blSTING proteins with AlphaFold2 and FoldSeek? It would be very exciting if putative cyclic nucleotide generating enzymes are predicted to be encoded within the nearby gene neighborhood. 7. Line 144 appears to reference the incorrect supplementary figure. SI Figure 4 may be the correct reference?

      I hope the authors will find my comments useful, thank you for the opportunity to read this exciting manuscript.

      Philip Kranzusch

      Significance

      Culbertson and Levin present an elegant computational analysis of the evolutionary history of several families of immune proteins conserved in bacteria and metazoan cells. The authors' work is impressive, revealing interesting insight into previously known connections and identifying exciting new connections that further link bacterial anti-phage defense and animal innate immunity. The results are overall well-presented and will have an important impact on multiple related fields. I have a few comments for the authors to help explain some of the new connections observed in their findings and clarify the results for a general audience.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thanks the reviewers for their critique of our report and our responses to all of their comments are given below.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary Toxoplasma gondii is an obligate intracellular parasite. Intracellular survival critical depends on secretory vesicles named dense granules. These vesicles are predicted to contain >100 different proteins that are released into PV, PV membrane and the host cell to control the parasites intracellular environment and host cell gene expression and immune response. How and where these vesicles are released from the parasite is a long-standing question in the field because T. gondii, and other apicomplexan parasites contained a complex pellicular cytoskeletal structure called the IMC which limits dense granule access to the plasma membrane. In this manuscript by Chelaghma, Ke and colleagues demonstrates for the first time that dense granules are secreted from the parasite at pore structures called the apical annuli. The authors used their previously generated HyperLOPIT data set and identified a plasma membrane protein that is specifically enriched at the apical annuli. Using BioID the authors then identify three SNARE proteins that also localize at the apical annuli. The localization of these proteins is determined using excellent super-resolution structured illumination microscopy. Conditional protein knockdowns for all four proteins were created and both proteomics and microscopy used to demonstrate a reduction in dense granule secretion in the absence of these proteins. Collectively, these data make new and substantial contributions to our understanding of mechanisms of dense granule secretion. Major comments: Overall, these data is convincing and well-described. The text is clear and well written. There are a few instances (see below) where the authors doesn't adequately describe the data or over state the strength of the results. These issues could all be addressed editorially or by process existing data.

      Comment 1.1

      The authors use proteomics and IFA to show that there is a reduction (rather than an inhibition of) in dense granule secretion. However, from the phase images in figure 5, the vacuoles of KD parasites look normal and so not have the phenotypes that one would expect after a significant reduction in dense granule secretion, such as the "bubble" phenotype described for GRA17 and GRA23 knockouts (Gold et al 2015; PMID: 25974303). Authors should describe their findings in the context of the expected phenotypes based on the published literature. The statement on line 369-371 is too strong and should imply a reduction rather than an inhibition of dense granule secretion.

      Authors’ response: It is difficult to compare our results to individual dense granule protein mutants described in the literature because such phenotypes are the result of the loss of only a single protein being exported to the host, whereas we are observing the effects of the reduction of secretion of up to 120+ different proteins. Furthermore, we agree with this reviewer that none of the protein knockdowns appear to completely prevent dense granule secretion, which we implied by ‘inhibition’, and this could be either due to incomplete knockdown of each of these proteins with some residue function, or some redundancy where other proteins can contribute to secretion. We have changed the statement flagged by this reviewer to: ‘Depletion of all four of these proteins affects dense granule secretion*’ to avoid the interpretation of complete loss of function. We now further state that residual secretion may still occur and consider this in the light of possible reasons for this (Discussion, paragraph 4). In any case, none of these considerations change our conclusion that these proteins, at the site of the apical annuli, are implicated in dense granule secretion. *

      __Comment 1.2 __

      The more severe phenotype observed in the AAQa iKD and the additional localizations of AAQa and AAQc suggests an additional role for these protein in protein trafficking that is supported by the authors data. In both AAQa and AAQc there appears to be an accumulation of GRA1 in a post-Golgi compartment and is less vesicular in appearance than the phenotype observed in the AAQb iKD parasites. Additionally, I disagree with the authors assessment that KD of these proteins does not effect microneme localization. In both AAQa and AAQc there appears to be increased number of micronemes at the basal end of the parasites compared with controls. Although this is not a direct focus of the authors papers, a description of these findings should be included in the results and discussion sections.

      Authors’ response: We have included a more complete discussion that considers the differences in phenotypes of the four mutants, including additional locations of two SNAREs, all of which is consistent with known SNARE biology (Discussion, fourth paragraph). These considerations, however, have no impact on our conclusions where all four proteins, including two that are exclusive to the apical annuli, have equivalent effects on dense granule exocytosis.

      Concerning the effects on microneme and rhoptries of the different knockdowns, we have modified and limited our interpretation to overall IFA staining strength and protein organelle protein abundance by proteomics, where we see no differences. This addresses if there is a major post-Golgi trafficking defect that could affect biogenesis of all of micronemes, rhoptries and dense granules, for which we see no evidence. Whether there are subtle differences in the location of these organelles, which are known to show some variability, is beyond the scope or relevance to our central questions. Given that growth phenotypes are seen for all mutants, it is quite possible that secondary effects of retarded cells might present as some disorder within the cell, although we saw nothing conspicuous of this nature in many hundreds of examples observed.

      __ Comment 1.3__

      Presentation of the data in Figure 5. This figure contains images where the fluorescent dense granule signal is overlaid on phase images. However, in some cases (AAQb, AAQc, AAQa, GRA1 KD) the merged imaged looks like a straight merges of the two images, whereas in the rest of the images it looks like a thresholded fluorescent image is merge with phase. Authors need to process the images in consistent manner and provide a description of the image processing in the figure legend and materials and methods.

      Authors’ response: Thank you for this suggestion, we have now processed all of these merges the same way (ImageJ -> merge channels -> Composite Sum). While the merges are only intended to aid in aligning the fluorescence signal with the phase image, we agree that it is better to present them the same way.

      Minor comments:

      Comment 1.4

      The discussion is overly long and could be shorted in some places. Lines 373 and 388 in particularly don't seems directly relevant to the manuscript.

      Authors’ response: The paragraph identified by this reviewer considers the LMBD protein that is the first, and currently only, trans plasma membrane protein specific to the apical annuli that implies that this structure is exposed to the exterior of the cell. It is, therefore, of considerable significance to how we interpret the function and behaviour of these annular structures. We believe that it is very relevant to our study to consider what else is known about these relatively mysterious, but widely conserved, eukaryotic proteins, which is the subject of this paragraph. The other reviewers highlight the relevance of LMBD3 to the interpretation of this structure. This reviewer hasn’t identified any further superfluous discussion elements, and we believe that the current length is not excessive and is justified.

      Comment 1.5

      Line 184 - Remove question mark from this sentence

      Authors’ response: The question mark has been removed.

      Comment 1.6

      Line 321. Should read Figure 7A, not figure 6A.

      Authors’ response: Thank you, corrected.

      Comment 1.7

      Line 139 - should read Figure 1B instead of 2C

      Authors’ response: Thank you, corrected (although to 1C, which is in fact correct).

      Comment 1.8

      Figure 3- Column labels for early, mid, or late endodyogeny would help with the clarity of this figure, especially for readings who are unfamiliar with the field.

      Authors’ response: We have labelled the figure as suggested.

      Comment 1.9

      Figure S2 - the letter n is missing from knockdown labels. And the number 3 from LMBD 3 is covering the word knockdown in the last panel.

      Authors’ response: Thank you, corrected.

      Reviewer #1 (Significance (Required)):

      The manuscript provides, for the first time, insight into the mechanism of dense granule secretion in Toxoplasma and identifies the sites on parasite pellicle where these vesicles can traverse the IMC to reach the plasma membrane. This is a significant conceptual advance in our understanding of this cellular vital process, one that is required for T. gondii intracellular survival. This paper would have broad interest from other research groups studying parasitology, secretion and protein trafficking.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript reports on characterizing the function of the long-known apical annuli, which are pores embedded in the membrane skeleton of Toxoplasma gondii. Since their function has remained long elusive, this manuscript is a major breakthrough.

      Comment 2.1

      It is of note, however, that this breakthrough, using the same three SNAREs, was recently, in parallel, also reported by Fu et al in PLoS Pathogens (PMID 36972314), which work is cited here. The additional novelty here is the finding of LMDB3 in the plasma membrane at the site of the annuli. This is a widely conserved protein for which little function is known except roles in signaling, The connection between LMDB3 and the SNAREs is through BioID, but they are preys quite far down the list. Furthermore, the function of LMDB3 is not explored here. As such, the additional advance compared to the Fu et al report is limited. The function of the SNAREs in dense granule exocytosis is much more robustly done here through the proteomics data displaying an accumulation of DG proteins.

      Authors’ response: While it is true that the discovery of the three SNAREs at the apical annuli was made and reported in parallel by Fu et al (2023), a major difference in their conclusions is that they suggest that dense granules are not secreted at this site (this reviewer has mistakenly thought that this was their conclusion — “In our experiments, none of the SNAREs were shown to be related to the exocytosis of GRAs. Therefore, the mechanism that mediates exocytosis of GRAs at the plasma membrane remains to be elucidated.” Fu et al (2023)*). The failure of Fu et al to detect this was almost certainly because they only tested for dense granule secretion defects by inducing depletion of the apical annuli SNAREs after the parasites had invaded the host cells. It is known that dense granule protein secretion happens rapidly in the initial moments after invasion, so apical annuli perturbation in their assay would have only occurred after these secretion events. We directly discuss this experimental difference in our revised discussion and how it accounts for their different conclusions (Discussion, fourth paragraph). We independently tested for this effect by quantitative proteomics which further supported our conclusions. *

      As this reviewer indicates, we additionally discovered that a protein (LMBD3) also spans the plasma membrane at these structures, and this implicates signalling or events at the cell surface. We show that this protein is also required for normal dense granule secretion. While we have not identified an explicit mechanistic role for LMBD3 in this process, such insight is also lacking for all LMBD proteins, including those in humans where they are implicated in disease. While we continue to pursue this interesting question of LMBD3 function, we are by no means alone in cell biology for these answers to be outstanding still.

      Comment 2.2

      The presentation of the data is very clean and convincing, and the broader evolutionary context is well-presented as well. The discussion on whether maintaining the IMC during cell division is an innovation or ancestral is an open debate where the authors seem to come down on the side of innovation, but the evidence could go either way, so I would caution a bit more.

      Authors’ response: We are puzzled by this reviewer’s comment because we do not make reference to the maintenance of the IMC during cell division in this evolutionary context — ancestral or a recent innovation. We describe the case of Toxoplasma and its close relatives maintaining the maternal IMC during division as ‘unusual’, not ancestral (second sentence of the last paragraph of the Discussion), and this is the only statement that we think might have elicited this query from the reviewer. But this does not imply what the ancestral state might have been which is not a subject of any of our considerations here.

      Major comments: - Are the key conclusions convincing?

      Comment 2.3

      The identification of the three SNARE proteins through BioID is not very convincingly represented in Table S1. These SNAREs were not showing significant changes and were not detected universally across the three bio-reps, and thyn were also present in the controls. Although this does not diminish the message of the work, this appears to be quite Cherry-picked, while other top hits in the BioID were overlooked, e.g. Nd6 and Nd2 are right in the top ten, which have a demonstrated role in rhoptry exocytosis. This certainly piqued my interest, but is not even discussed.

      *Authors’ response: We have used BioID as a protein discovery strategy, not to directly measure protein proximity for which it is an imperfect measure for many technical reasons. Accordingly, discovered ‘candidates’ for proteins that might occur at the annuli were all independently verified by protein reporter tagging. We focused our efforts on discovering apical annuli plasma membrane-tethered proteins and, therefore, parsed our BioID data for those shown previously to be in the plasma membrane by LOPIT spatial proteomics (Barylyuk et al, 2020). It is true that the SNARE proteins were not favoured over many other proteins in the BioID signal, but their verified location at these sites justified our pursuit of them as new apical annuli proteins. *

      Other proteins, including the previously identified apical proteins Nd6 and Nd2 that are implicated in rhoptry secretion, similarly piqued our interest! But when we reporter-tagged them they were revealed as BioID false positives, consistent with published work on these proteins, and other ‘top hits’ included some other false positives. Table S1 is included as a further recourse for the field, but it only served as a first step in functional protein discovery in our study.

      Comment 2.4

      TgAAQa, TgAAQb and TgAAQc were recently also reported to localize to the annuli by Fu et al 2023 (PMID: 36972314; this report is even cited in this manuscript for Rab11a accumulation), who gave them different names: TgStx1, TgStx20, and TgStx21 (not in this order). I see no reason to adopt a new nomenclature here, which will be very confusing in the future literature. Please adopt the Stx names in this manuscript.

      *Authors’ response: We agree that where there is precedent in naming it is better to use the earliest used names. Naming of proteins is also best done to reflect orthologues found between species so that consistent names indicate common functions. The naming system proposed by Fu et al for the Qa, Qb and Qc SNAREs unfortunately does not fulfil this second important criterion. They based their names on ‘Syntaxin’ which was first used for an animal SNARE of the nervous system that is almost exclusively used for Qa paralogues. Furthermore, in animals Stx1-4 are all vertebrate-specific Qa paralogues that have arisen only in this group. So, to name the Qa SNARE of Toxoplasma according to one of these animal-specific nerve proteins (Stx1) implies an evolutionary inheritance that is very unlikely (i.e., lateral gene transfer from an animal) and is unsupported by published phylogenies. Furthermore, Fu et al also give the Qb and Qc SNAREs the animal Qa name ‘syntaxin’, and arbitrarily number them Stx21 and Stx20. So, while they have named these proteins first, we think that the names given provide confusing and misleading labels for these proteins. *

      We initially proposed a simpler system according to the location of the SNARE in Toxoplasma (AA = Apical Annuli) and the Q domain type (Qa, Qb, Qc), e.g., AAQa. But on reflection we propose using precedent and orthology and adopt the existing orthologue names as the most useful solution. Klinger et al (2022) have resolved the phylogeny of the three Toxoplasma SNAREs, and they group with strong phylogenetic support with known eukaryote-wide orthogroups with previous names: Qa=StxPM (Syntaxin Plasma Membrane); Qb=NPSN (Novel Plant ‘Syntaxin’); and Qc=Syp7 (a Qc SNARE family originally thought to be specific to plants). These SNARE types are all known to operate at the plasma membrane, and accordingly the names TgStxPM, TgNPSN, and TgSyp7 would indicate their orthology and similar functional location known in other eukaryotes. We have justified this preferred naming system in the text of our report (Discussion, third paragraph), but making it clear which Fu et al names correspond to these more universally consistent names so that these can be easily cross-referenced.

      Comment 2.5

      No knock-down of LMBD3 is pursued: how would this impact SNARE distribution and/or other annuli proteins? The fitness score is very severe, -4.07, so this is somewhat puzzling. Lower comment is related. This could provide tantalizing insights in the architecture of the annuli, and/or their function as a secretory conduit.

      LMBD3 relative to the SNAREs is not explored: co-IPs or detergent extraction to see if they are all in a physically interacting complex. What keeps them together. Is LBCDR3 interfacing with any annuli proteins Cen2 is suggested through the image in Fig 2A, though there appears to be some separation in some images: AAP2, 3 and 5 were previously shown to have smaller diameters than Cen2 and therefore appear better positioned.

      Authors’ response: LMBD3 knockdowns were pursued in so far as identifying that they also have a phenotype of reduced dense granule secretion as for the SNAREs, but it will indeed require further studies of this intriguing molecule to define its specific function. Our central questions of this study were what is the association of the apical annuli with respect to the IMC and plasma membrane, and what is the overall significance and function of these structures. These core questions have been answered in our study. The questions that this review raises here are further and logical questions specifically related to LMBD3 that we are now pursuing as an independent follow-on study.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Comment 2.6

      The discussion on whether maintaining the IMC during cell division is an innovation or ancestral is an open debate where the authors seem to come down on the side of innovation, but the evidence could go either way, so I would caution a bit more.

      Authors’ response: This comment (2.2) is already made and addressed above.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Comment 2.7

      The heavy focus on the LMBD3 in Fig 1 and the evolutionary discussion would warrant a more direct functional dissection. Either through an LMDB3 known-down, or its interface with the SNAREs or annuli more directly.

      Authors’ response: This reviewer has not made it clear that further work on LMBD3 is necessary to support the conclusions of the paper or address the questions that we have asked, only that they would like to see more insight into LMBD3. We would also! But we do present knock-down studies and show that there are functional consequences for dense granule secretion. The question of if LMBD3 is involved in the maintenance of apical annuli structure and/or integrity is an interesting one, but a further question to those that we have presented in this first study. LMBD proteins have poorly characterised molecular functions throughout eukaryotes, and while we are also motivated to understand their role more, this has not proven a straightforward task in other systems also.

      Comment 2.8

      The claim that the annuli are the conduits though which the dense granules travel to get exocytosis is not directly supported by any of the experiments as it is solely based on co-localization studies, not even direct interactions.

      Authors’ response: We agree that we have not directly observed dense granules in the act of secretion at the apical annuli. Dense granules are known to be very mobile in the cell and traffic dynamically on actin networks. So, they do not accumulate at any one site, and their fusion and exocytosis is likely a rapid, transient event. Multiple lines of evidence for them pausing and fusing with the plasma membrane, while indirect, independently support this conclusion:

      • SNARE proteins restricted to the apical annuli in the plasma membrane are required for normal dense granule secretion
      • When these SNAREs are depleted dense granule proteins accumulate in the parasite
      • Rab11A is a further vesicle-tethering molecule that has been shown to be attached to dense granules and its mutation also leads to inhibition of dense granule proteins (Venugopal et al, 2020)
      • When the apical annuli SNAREs are depleted Rab11A accumulates at the annuli (Fu et al, 2023) Collectively, we believe that the claim that the apical annuli are the sites of dense granule secretion is very strongly supported, particularly by the very molecules that would be required for vesicle docking and fusing at these sites, and is justified to be noted in the title. We have, however, made it clear in our report now that these data are indirect and that dense granules are yet to be captured in the act of secreting their contents at these sites (Discussion, paragraph five).

      **Referees cross-commenting**

      The consolidating themes I see (and value) in the reviews:

      Comment 2.9

      1. functional follow up of role of LMDB3 Authors’ response: This work is already part of a follow-up project.

      Comment 2.10

      adopt nomenclature of Fu et al, to avoid confusion in literature

      Authors’ response: Please see our response to Comment 2.4

      Comment 2.11

      better integrate the findings in light of the Fu et al publication throughout this manuscript

      *Authors’ response: We have further acknowledged and compared our findings to those of the parallel study of Fu et al with additional text in the discussion. *

      Comment 2.12

      no direct evidence of dense granules at annuli; attenuate the claims (in title etc), or include supportive data

      Authors’ response: Please see our response to the equivalent Comment 2.8 above.

      Reviewer #2 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Comment 2.13

      The presented manuscript reports on a novel protein, LMBD3, embedded in the plasma membrane of Toxoplasma gondii at the site of the apical annuli, which are pores across the inner membrane complex (IMC) skeleton. This provides a novel, putative connection between the cytoplasm and plasma membrane, although this is not directly explored here. Through LMDB3 proximity biotinylation, three SNAREs are identified that were recently reported to be involved in dense granule exocytosis, which is is confirmed here through robust proteomic experiments.

      Authors’ response: This reviewer has made an error here in stating that the parallel study of Fu et al implicated the apical annuli SNAREs with dense granule exocytosis. See our response to Comment 2.1 where we describe why the experimental design used for Fu et al was unlikely to test this question effectively.

      • Place the work in the context of the existing literature (provide references, where appropriate). The annuli were first reported in 2006, and understanding of their proteomic composition has expanded over the years, however, a function has remained long elusive. This report, together with another parallel performed work, now uses three SNAREs, named TgAAQa, TgAAQb and TgAAQc in this report but previously named TgStx1, TgStx20, and TgStx21 (not in this orthologous order), localizing to the annuli as tool to assign the function of the annuli to exocytosis of the dense granules during intracellular parasite multiplication. The evolutionary context and concepts of the new findings are very well-embedded in the existing literature and insights.

      • State what audience might be interested in and influenced by the reported findings. The audience comprises people with a specific interest beyond apicomplexan biology, basically all Alveolates as they all share a similar membrane skeleton. Assigning a putative function to widely conserved LMBD3 will be of high interest to this completely different audience as well.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the submitted work "Apical annuli are specialised sites of post-invasion secretion of dense granules in Toxoplasma", the authors explore the role of the apical annuli in T. gondii. They identify a number of proteins that localize to the membranes at the annuli, including SNARE proteins that are known players in vesicle fusion. They also shown that knockdown of several annuli localized proteins blocks replication and secretion of dense granule cargo into the parasitophorous vacuole. Overall, the work is well done and an important contribution to the field.

      Major comments

      Comment 3.1

      1. In the title and throughout the manuscript the authors claim that the apical annuli are sites of dense granule secretion (e.g. "firmly implicating the apical annuli as the site of dense granule docking and membrane fusion." or "that the apical annuli are sites of vesicle fusion and exocytosis"). However, there does not appear to be direct evidence of the dense granules docking and fusing at these sites.

      It would be ideal to see vesicles docked via EM at the annuli, either in wildtype or knockdown parasites. This may not be possible - if not, I recommend toning down the conclusions on docking (or "specialized sites of secretion" as this has not been shown) and instead stating that these structures play a critical role in dense granule secretion. Authors’ response: Please see our response to Comments 2.8 & 2.12, and we have toned down this conclusion as requested to make it clear that direct observations of dense granule fusion are yet to be made. Capturing the transient event of dense granule docking by EM would indeed be a very challenging ambition.

      Comment 3.2

      The authors should discuss earlier (in the results) the findings of Fu et al. which:

      Authors’ response: The parallel study of Fu et al (2023) has indeed generated some similar data, but there are also multiple points of difference including their conclusions. We discuss all of these relevant points in the Discussion, and believe that it would make the Results narrative confusing to introduce this element of discussion there. Our study has not been performed in response to theirs, but rather was conducted in parallel.

      • show the localization of some of the same SNAREs at the apical annuli. Fu et al also see localization to the plasma membrane separate from the annuli for some of these proteins. Do you see plasma membrane spots as well upon longer exposures? Can differences be explained by the position or type of tag used?

      Authors’ response: Fu et al have indeed used different reporters and expressed the SNARE fusion proteins with different non-native promoters. They used a very bulky reporter which combined 12 HA tags as well as the large Auxin-Inducible Degron (AID), and together it is possible that they observe some mistargeting artefacts. For our location studies we used the small epitope 3xV5 only. We did not see the additional locations that they report, and this may be due to the larger modification that they made to these proteins.

      • Fu et al also shows similar plaque defects in the knockdowns and loss of trafficking of plasma membrane proteins to the periphery. In general, the studies from this group are very complementary - they should be better acknowledged.

      Authors’ response: We have included more frequent reference and comparison to the Fu et al study now in our Discussion.

      • Fu et al see an invasion defect but no defect in GRA secretion - Do you see an invasion defect? These differences should be discussed

      Authors’ response: See our response to Comments 2.1 & 2.13 regarding why the Fu et al could not detect the GRA secretion defect. We discuss this in our Discussion now (Discussion paragraph four). We also consider the Fu et al study of an invasion defect as flawed. Both our and their study show that depletion of apical annuli SNAREs has a strong replication phenotype of parasites within the host vacuole. Given induced SNARE depletion must occur during this growing stage of the parasites, to ask if apical annuli could be involved directly in invasion processes requires testing for invasion competence of already very sick cells. It is, therefore, not possible to control for secondary effects on invasion incompetence due to general cell malaise. Furthermore, Fu et al report on invasion efficiency using an assay that relies on SAG1 presentation on the cell surface. However, they conclude independently in their study that SAG1 delivery to the surface is inhibited in their SNARE knockdowns. This further confounds any attempt to reliable measure invasion and any role for these SNAREs in this process. Therefore, for biological as well as technical reasons, we have not tested for a possible role of annuli in invasion.

      • It would be helpful for the field to use the same nomenclature whenever possible. Is it possible to use the naming described earlier?

      Authors’ response: Please see our response to Comment 2.4.

      Comment 3.3

      Fig 1C - The authors use trypsin shaving to demonstrate plasma membrane localization of LMBD3. They are probably correct - but it is important to definitively distinguish between plasma membrane and IMC membrane localization. a. The western blot bands for GAP40 should be quantified. It appears that GAP40 is also reduced and it could be reduced to a similar extent as SAG1 without quantification. In addition, this protection from digestion could be confirmed with a second marker in the space between the PM and IMC membranes like GAP45 (whereas cytoplasmic/mito markers like profilin and Tom40 are likely further protected by the IMC membranes and are thus less relevant here).

      Authors’ response: Quantitation of Western blots is notoriously inaccurate and, rather, we use it here as a qualitative indication of trypsin sensitivity of proteins in intact cells. The LMBD3 protein is completely transformed within the first time point (1 hour) to stable products of proteolysis of this polytopic membrane protein — presumably to those now protected within the cell. Known GPI-anchored surface protein SAG1 shows similar immediate sensitivity, although it is known that internalised SAG1 pools are constantly recycled to the surface and hence gradual elimination of the residual SAG1 band over 4 hours. The internal protein markers (GAP40, PRF, TOM40) show no discernible change in the first hour and little if any beyond that (within the variation common to Western blotting). GAP40 shares an equivalent polytopic membrane topology to LMBD3 except it occurs in the IMC membrane directly below the plasma membrane, so we think this is the more suitable control. Thus, this trypsin shaving experiment gives a binary output: sensitive or insensitive. This conclusion is further supported by the published spatial proteomics study (Barylyuk et al. 2020) which shows that LMBD3 segregates with other integral membrane proteins specific to the plasma membrane and not with the IMC proteins. Our super resolution imaging of LMBD3 relative to inner membrane complex markers (Centrin2, GAP45, IMC1) also show it as peripheral to them, further corroborating the plasma membrane location.

      1. Is it possible to N-terminally tag LMBD3 and then examine plasma membrane localization by detection of the tag without permeabilization? (this would also confirm the proposed topology) Authors’ response: We have tried to N-terminally tag LMBD3 with an epitope reporter but this integration was not tolerated by the cell, presumable because it interferes with membrane insertion of this protein that is essential for cell viability. So, this experimental option is not available.

      Comment 3.4

      I think it is important to make clear for the reader what is happening here. The paper sounds as though the dense granules directly dock at the annuli for release. It also seems possible from this work and Fu et al that secretion at the annuli occurs via small vesicles that originate from the dense granules. Perhaps a diagram or model would help the reader here (and discuss why DGs or other vesicles are not routinely seen at the annuli if this is the critical portal - and perhaps why the organelles are not clustered in the apical end of the cell if this is where they are needed)

      Authors’ response: This comment is related to that of review 2 (Comments 2.8/12), although we note again that Fu et al did not conclude that dense granules are exocytosed at this site. It is also unclear why this reviewer envisages that small vesicles arise from the dense granules, rather than the dense granule itself fusing at the annuli to the plasma membrane. Indeed, the occurrence of Rab11A on the dense granules, and the accumulation of this protein at the annuli with SNARE knockdown, supports that it is the dense granules that dock at this site. Why dense granules don’t otherwise cluster at their sites of secretion but are instead motile in the cell, their movement driven by Myosin F on actin filaments, is not known. Perhaps these otherwise bulky organelles would create too much cellular crowding that could interfere with other processes. We have addressed all of these points in additions to the discussion so that these interesting unknowns are transparent to the reader (Discussion paragraph 5).

      Comment 3.5

      Figure 5. The authors state the knockdown results in "strong phenotypes of reduced plaque development" - The plaque assays should be quantified.

      • Are there no plaques or just very small ones here?

      Authors’ response: The reviewer provides no rationale for this request or states what questions could be addressed by doing so. Indeed, none of our conclusions would be affected. We use the plaque assays to test whether each of the proteins tested are independently necessary for some facet of normal parasite growth where the result is binary — no difference in plaque size versus near or complete absence of plaque development. The interpretation of differing plaque sizes between different knockdown mutations is a very inexact science with assumptions of equal rates of protein depletion, sensitivity of relative protein abundance, modes of action of mutation, and kinetics of plaque growth very difficult to validate for meaningful comparisons to be made. Therefore, we don’t see any useful role for plaque quantification in the research questions that we’ve addressed or the conclusions that we present.

      Comment 3.6

      Figure 6 a. Fig 6A - The use of digitonin for semipermeabilization requires controls as there is typically a lot of variability across the monolayer. This is ideally done with something to show that the host plasma membrane has been permeabilized (e.g. host tubulin) and the PVM has not been permeabilized (e.g. SAG1). Otherwise, perhaps the authors could state what percent of cells showed the data like the representative images shown or describe further how selective permeabilization was assessed? (or wider fields with many cells and vacuoles?)

      *Authors’ response: As requested, we have included a supplemental figure showing wider fields of view where multiple vacuoles are seen. These data show that the vacuoles are similarly stained with no evidence of variability of digitonin permeabilization. The reduction in GRA5 secretion shown by microscopy is further supported by this protein being quantified using proteomics as enriched in the parasites when the apical annuli proteins are depleted (Fig 7). *

      Comment 3.7

      1. Fig 6B - "the GRA signal seen within the parasite was increased compared to the control" This is not clear from the AAQb image shown as it appears more is also present in the vacuole (or perhaps residual body?) Can this be clarified? Authors’ response: Yes, in this image it appears that the ‘residual body’, which is also an integral internal compartment of the growing parasite rosette, is a site of dense granule accumulation. We have modified the text to make it clear that the observations of IFA images showing ‘apparent’ increase in dense granule staining were then directly tested by quantitative proteomics. These subsequent data (Fig 7) provided a clear measure of the increase in dense granule proteins in the parasites when apical annuli function was perturbed.

      Minor comments

      Comment 3.8

      1. Line 215-217 The authors state that "Collectively these data imply that the apical annuli provide coordinated gaps in the IMC barrier that forms at the earliest point of IMC development and that they maintain access of the cytosol to these specialised locations in the plasma membrane."
      2. However, their data shows that LMBD3 only recruits once daughters are emerging (not earliest point of IMC development). Please clarify? Is this just referring to Centrin2 or LMBD3 as well? Authors’ response: Yes, the other AAPs indicate that these structures form early, and they were mentioned as such in the sentences preceding this statement — hence ‘collectively’.

      Comment 3.9

      Fig 5. Regarding growth arrest. AAQa appears to show an arrest but is it possible the others just grow slower? Do they arrest later and hence fail to form a plaque? Is there incomplete knockdown which enables a few parasites to persist?

      *Authors’ response: It is true that it is difficult to discern complete growth arrest from *

      *very retarded growth. However, neither alternative would affect our conclusions where we use these phenotypes as an indication of apical annuli participating in process required for normal growth. All plaque assays show strong growth phenotypes. Nevertheless, we have removed the use of the term ‘growth arrest’ with respect to these phenotypes (including in the Abstract) and replaced it with growth impairment. *

      Comment 3.10

      Line 132, Fig 1 A-C. For clarity it may be better for the reader if LMBD3 is named earlier, or if Fig 1 refers to the gene ID for panels A-C before its named.

      Authors’ response: This is a good idea and we have made this change, making note of the rationale for this name when we present the phylogeny.

      Comment 3.11

      Line 30 - "represent a second structure in the IMC specialised for protein secretion" this is confusing - do the authors mean in addition to the micronemes/rhoptries at the apical complex? Maybe "a second structure in the parasite" would be clearer

      Authors’ response: To clarify we have reworded as follows: ‘The apical annuli, therefore, represent a second type of IMC-embedded structure to the apical complex that is specialised for protein secretion

      Comment 3.12

      Line 440 - the author states that "these pre- and post-invasion secretion processes are also biochemically separated because both microneme and rhoptry secretion are SNARE-independent" Is this from the Cova and Dubios papers cited a line later? I took a quick scan of these papers and neither appear to show this? Cova claims still this is still unclear and Dubios says SNAREs are likely involved?

      Authors’ response: While both microneme and rhoptry secretion use distinctive molecular machineries for controlling membrane fusion for exocytosis, it is true that it is not formally known that these processes completely lack SNARE involvement, and neither paper cited here can eliminate this possibility. We have therefore, removed this short part of the discussion where we consider that dense granules might be unique amongst these three compartments in relying on SNAREs.

      Text editing

      Comment 3.13

      1. Line 94 - plasma membrane or cell surface. Clarify here - do you mean plasma membrane or under the membrane at the periphery? Authors’ response: We have modified as: ‘plasma membrane including the cell surface’.

      Comment 3.14

      Line 321 refers to Fig 6A but should say 7A. Panel 7B is never referenced in the text.

      Authors’ response: Thank you, we have corrected this and only sited Fig7 because A and B are both relevant to the statement made in the text.

      Comment 3.15

      Line 347-242 and fig 4A - the discussion of Q-SNARES and diagram could use some references for the reader

      Authors’ response: Thank you for this suggestion, we have acted on this request.

      Comment 3.16

      The methods says plaque assays were 7 days, fig 5 legend says 8 days

      Authors’ response: Thank you, this is corrected as 8 days.

      **Referees cross-commenting**

      • I completely agree with Rev 2
      • I also think examining invasion given Rev1 comment on the micronemes and the data from Fu et al would be worthwhile and straightforward to do

      Authors’ response: Please see our response to Comment 3.2 where the validity of measuring invasion competence of poorly growing, and/or arrested, parasites is scientifically questionable. It would require controls of similarly unhealthy parasites where the apical annuli are unaffected, but it is difficult to imagine how one would deliver such a control.

      Reviewer #3 (Significance (Required)):

      This is an excellent study that assesses the role of apical annuli in parasite secretion. It is an important addition to the field (and outstanding imaging that provides a high level of detail to the study). The study could be improved by better integrating a recent similar study noted by the authors and in the review

      Authors’ response: We have provided more direct discussion of the Fu et al paper in our Discussion section.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the submitted work "Apical annuli are specialised sites of post-invasion secretion of dense granules in Toxoplasma", the authors explore the role of the apical annuli in T. gondii. They identify a number of proteins that localize to the membranes at the annuli, including SNARE proteins that are known players in vesicle fusion. They also shown that knockdown of several annuli localized proteins blocks replication and secretion of dense granule cargo into the parasitophorous vacuole. Overall, the work is well done and an important contribution to the field.

      Major comments

      1. In the title and throughout the manuscript the authors claim that the apical annuli are sites of dense granule secretion (e.g. "firmly implicating the apical annuli as the site of dense granule docking and membrane fusion." or "that the apical annuli are sites of vesicle fusion and exocytosis"). However, there does not appear to be direct evidence of the dense granules docking and fusing at these sites.

      It would be ideal to see vesicles docked via EM at the annuli, either in wildtype or knockdown parasites. This may not be possible - if not, I recommend toning down the conclusions on docking (or "specialized sites of secretion" as this has not been shown) and instead stating that these structures play a critical role in dense granule secretion.<br /> 2. The authors should discuss earlier (in the results) the findings of Fu et al. which: - show the localization of some of the same SNAREs at the apical annuli. Fu et al also see localization to the plasma membrane separate from the annuli for some of these proteins. Do you see plasma membrane spots as well upon longer exposures? Can differences be explained by the position or type of tag used? - Fu et al also shows similar plaque defects in the knockdowns and loss of trafficking of plasma membrane proteins to the periphery. In general, the studies from this group are very complementary - they should be better acknowledged. - Fu et al see an invasion defect but no defect in GRA secretion - Do you see an invasion defect? These differences should be discussed - It would be helpful for the field to use the same nomenclature whenever possible. Is it possible to use the naming described earlier? 3. Fig 1C - The authors use trypsin shaving to demonstrate plasma membrane localization of LMBD3. They are probably correct - but it is important to definitively distinguish between plasma membrane and IMC membrane localization. - a. The western blot bands for GAP40 should be quantified. It appears that GAP40 is also reduced and it could be reduced to a similar extent as SAG1 without quantification. In addition, this protection from digestion could be confirmed with a second marker in the space between the PM and IMC membranes like GAP45 (whereas cytoplasmic/mito markers like profilin and Tom40 are likely further protected by the IMC membranes and are thus less relevant here). - b. Is it possible to N-terminally tag LMBD3 and then examine plasma membrane localization by detection of the tag without permeabilization? (this would also confirm the proposed topology) 4. I think it is important to make clear for the reader what is happening here. The paper sounds as though the dense granules directly dock at the annuli for release. It also seems possible from this work and Fu et al that secretion at the annuli occurs via small vesicles that originate from the dense granules. Perhaps a diagram or model would help the reader here (and discuss why DGs or other vesicles are not routinely seen at the annuli if this is the critical portal - and perhaps why the organelles are not clustered in the apical end of the cell if this is where they are needed) 5. Figure 5. The authors state the knockdown results in "strong phenotypes of reduced plaque development" - The plaque assays should be quantified. - Are there no plaques or just very small ones here? 6. Figure 6

      a. Fig 6A - The use of digitonin for semipermeabilization requires controls as there is typically a lot of variability across the monolayer. This is ideally done with something to show that the host plasma membrane has been permeabilized (e.g. host tubulin) and the PVM has not been permeabilized (e.g. SAG1). Otherwise, perhaps the authors could state what percent of cells showed the data like the representative images shown or describe further how selective permeabilization was assessed? (or wider fields with many cells and vacuoles?)

      b. Fig 6B - "the GRA signal seen within the parasite was increased compared to the control" This is not clear from the AAQb image shown as it appears more is also present in the vacuole (or perhaps residual body?) Can this be clarified?

      Minor comments

      1. Line 215-217 The authors state that "Collectively these data imply that the apical annuli provide coordinated gaps in the IMC barrier that forms at the earliest point of IMC development and that they maintain access of the cytosol to these specialised locations in the plasma membrane."
      2. However, their data shows that LMBD3 only recruits once daughters are emerging (not earliest point of IMC development). Please clarify? Is this just referring to Centrin2 or LMBD3 as well?
      3. Fig 5. Regarding growth arrest. AAQa appears to show an arrest but is it possible the others just grow slower? Do they arrest later and hence fail to form a plaque? Is there incomplete knockdown which enables a few parasites to persist?
      4. Line 132, Fig 1 A-C. For clarity it may be better for the reader if LMBD3 is named earlier, or if Fig 1 refers to the gene ID for panels A-C before its named.
      5. Line 30 - "represent a second structure in the IMC specialised for protein secretion" this is confusing - do the authors mean in addition to the micronemes/rhoptries at the apical complex? Maybe "a second structure in the parasite" would be clearer
      6. Line 440 - the author states that "these pre- and post-invasion secretion processes are also biochemically separated because both microneme and rhoptry secretion are SNARE-independent" Is this from the Cova and Dubios papers cited a line later? I took a quick scan of these papers and neither appear to show this? Cova claims still this is still unclear and Dubios says SNAREs are likely involved?

      Text editing

      1. Line 94 - plasma membrane or cell surface. Clarify here - do you mean plasma membrane or under the membrane at the periphery?
      2. Line 321 refers to Fig 6A but should say 7A. Panel 7B is never referenced in the text.
      3. Line 347-242 and fig 4A - the discussion of Q-SNARES and diagram could use some references for the reader
      4. The methods says plaque assays were 7 days, fig 5 legend says 8 days

      Referees cross-commenting

      • I completely agree with Rev 2
      • I also think examining invasion given Rev1 comment on the micronemes and the data from Fu et al would be worthwhile and straightforward to do

      Significance

      This is an excellent study that assesses the role of apical annuli in parasite secretion. It is an important addition to the field (and outstanding imaging that provides a high level of detail to the study). The study could be improved by better integrating a recent similar study noted by the authors and in the review

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript reports on characterizing the function of the long-known apical annuli, which are pores embedded in the membrane skeleton of Toxoplasma gondii. Since their function has remained long elusive, this manuscript is a major breakthrough. It is of note, however, that this breakthrough, using the same three SNAREs, was recently, in parallel, also reported by Fu et al in PLoS Pathogens (PMID 36972314), which work is cited here. The additional novelty here is the finding of LMDB3 in the plasma membrane at the site of the annuli. This is a widely conserved protein for which little function is known except roles in signaling, The connection between LMDB3 and the SNAREs is through BioID, but they are preys quite far down the list. Furthermore, the function of LMDB3 is not explored here. As such, the additional advance compared to the Fu et al report is limited. The function of the SNAREs in dense granule exocytosis is much more robustly done here through the proteomics data displaying an accumulation of DG proteins. The presentation of the data is very clean and convincing, and the broader evolutionary context is well-presented as well. The discussion on whether maintaining the IMC during cell division is an innovation or ancestral is an open debate where the authors seem to come down on the side of innovation, but the evidence could go either way, so I would caution a bit more.

      Major comments:

      • Are the key conclusions convincing?

      The identification of the three SNARE proteins through BioID is not very convincingly represented in Table S1. These SNAREs were not showing significant changes and were not detected universally across the three bio-reps, and thyn were also present in the controls. Although this does not diminish the message of the work, this appears to be quite Cherry-picked, while other top hits in the BioID were overlooked, e.g. Nd6 and Nd2 are right in the top ten, which have a demonstrated role in rhoptry exocytosis. This certainly piqued my interest, but is not even discussed.

      TgAAQa, TgAAQb and TgAAQc were recently also reported to localize to the annuli by Fu et al 2023 (PMID: 36972314; this report is even cited in this manuscript for Rab11a accumulation), who gave them different names: TgStx1, TgStx20, and TgStx21 (not in this order). I see no reason to adopt a new nomenclature here, which will be very confusing in the future literature. Please adopt the Stx names in this manuscript.

      No knock-down of LMBD3 is pursued: how would this impact SNARE distribution and/or other annuli proteins? The fitness score is very severe, -4.07, so this is somewhat puzzling. Lower comment is related. This could provide tantalizing insights in the architecture of the annuli, and/or their function as a secretory conduit.

      LMBD3 relative to the SNAREs is not explored: co-IPs or detergent extraction to see if they are all in a physically interacting complex. What keeps them together. Is LBCDR3 interfacing with any annuli proteins Cen2 is suggested through the image in Fig 2A, though there appears to be some separation in some images: AAP2, 3 and 5 were previously shown to have smaller diameters than Cen2 and therefore appear better positioned. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The discussion on whether maintaining the IMC during cell division is an innovation or ancestral is an open debate where the authors seem to come down on the side of innovation, but the evidence could go either way, so I would caution a bit more. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The heavy focus on the LMBD3 in Fig 1 and the evolutionary discussion would warrant a more direct functional dissection. Either through an LMDB3 known-down, or its interface with the SNAREs or annuli more directly. The claim that the annuli are the conduits though which the dense granules travel to get exocytosis is not directly supported by any of the experiments as it is solely based on co-localization studies, not even direct interactions.

      Referees cross-commenting

      The consolidating themes I see (and value) in the reviews: 1. functional follow up of role of LMDB3 2. adopt nomenclature of Fu et al, to avoid confusion in literature 3. better integrate the findings in light of the Fu et al publication throughout this manuscript 4. no direct evidence of dense granules at annuli; attenuate the claims (in title etc), or include supportive data

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The presented manuscript reports on a novel protein, LMBD3, embedded in the plasma membrane of Toxoplasma gondii at the site of the apical annuli, which are pores across the inner membrane complex (IMC) skeleton. This provides a novel, putative connection between the cytoplasm and plasma membrane, although this is not directly explored here. Through LMDB3 proximity biotinylation, three SNAREs are identified that were recently reported to be involved in dense granule exocytosis, which is is confirmed here through robust proteomic experiments. - Place the work in the context of the existing literature (provide references, where appropriate).

      The annuli were first reported in 2006, and understanding of their proteomic composition has expanded over the years, however, a function has remained long elusive. This report, together with another parallel performed work, now uses three SNAREs, named TgAAQa, TgAAQb and TgAAQc in this report but previously named TgStx1, TgStx20, and TgStx21 (not in this orthologous order), localizing to the annuli as tool to assign the function of the annuli to exocytosis of the dense granules during intracellular parasite multiplication. The evolutionary context and concepts of the new findings are very well-embedded in the existing literature and insights. - State what audience might be interested in and influenced by the reported findings.

      The audience comprises people with a specific interest beyond apicomplexan biology, basically all Alveolates as they all share a similar membrane skeleton. Assigning a putative function to widely conserved LMBD3 will be of high interest to this completely different audience as well.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Toxoplasma gondii is an obligate intracellular parasite. Intracellular survival critical depends on secretory vesicles named dense granules. These vesicles are predicted to contain >100 different proteins that are released into PV, PV membrane and the host cell to control the parasites intracellular environment and host cell gene expression and immune response. How and where these vesicles are released from the parasite is a long-standing question in the field because T. gondii, and other apicomplexan parasites contained a complex pellicular cytoskeletal structure called the IMC which limits dense granule access to the plasma membrane. In this manuscript by Chelaghma, Ke and colleagues demonstrates for the first time that dense granules are secreted from the parasite at pore structures called the apical annuli. The authors used their previously generated HyperLOPIT data set and identified a plasma membrane protein that is specifically enriched at the apical annuli. Using BioID the authors then identify three SNARE proteins that also localize at the apical annuli. The localization of these proteins is determined using excellent super-resolution structured illumination microscopy. Conditional protein knockdowns for all four proteins were created and both proteomics and microscopy used to demonstrate a reduction in dense granule secretion in the absence of these proteins. Collectively, these data make new and substantial contributions to our understanding of mechanisms of dense granule secretion.

      Major comments:

      Overall, these data is convincing and well-described. The text is clear and well written. There are a few instances (see below) where the authors doesn't adequately describe the data or over state the strength of the results. These issues could all be addressed editorially or by process existing data.

      The authors use proteomics and IFA to show that there is a reduction (rather than an inhibition of) in dense granule secretion. However, from the phase images in figure 5, the vacuoles of KD parasites look normal and so not have the phenotypes that one would expect after a significant reduction in dense granule secretion, such as the "bubble" phenotype described for GRA17 and GRA23 knockouts (Gold et al 2015; PMID: 25974303). Authors should describe their findings in the context of the expected phenotypes based on the published literature. The statement on line 369-371 is too strong and should imply a reduction rather than an inhibition of dense granule secretion.

      The more severe phenotype observed in the AAQa iKD and the additional localizations of AAQa and AAQc suggests an additional role for these protein in protein trafficking that is supported by the authors data. In both AAQa and AAQc there appears to be an accumulation of GRA1 in a post-Golgi compartment and is less vesicular in appearance than the phenotype observed in the AAQb iKD parasites. Additionally, I disagree with the authors assessment that KD of these proteins does not effect microneme localization. In both AAQa and AAQc there appears to be increased number of micronemes at the basal end of the parasites compared with controls. Although this is not a direct focus of the authors papers, a description of these findings should be included in the results and discussion sections.

      Presentation of the data in Figure 5. This figure contains images where the fluorescent dense granule signal is overlaid on phase images. However, in some cases (AAQb, AAQc, AAQa, GRA1 KD) the merged imaged looks like a straight merges of the two images, whereas in the rest of the images it looks like a thresholded fluorescent image is merge with phase. Authors need to process the images in consistent manner and provide a description of the image processing in the figure legend and materials and methods.

      Minor comments:

      The discussion is overly long and could be shorted in some places. Lines 373 and 388 in particularly don't seems directly relevant to the manuscript.

      Line 184 - Remove question mark from this sentence

      Line 321. Should read Figure 7A, not figure 6A.

      Line 139 - should read Figure 1B instead of 2C

      Figure 3- Column labels for early, mid, or late endodyogeny would help with the clarity of this figure, especially for readings who are unfamiliar with the field.

      Figure S2 - the letter n is missing from knockdown labels. And the number 3 from LMBD 3 is covering the word knockdown in the last panel.

      Significance

      The manuscript provides, for the first time, insight into the mechanism of dense granule secretion in Toxoplasma and identifies the sites on parasite pellicle where these vesicles can traverse the IMC to reach the plasma membrane. This is a significant conceptual advance in our understanding of this cellular vital process, one that is required for T. gondii intracellular survival. This paper would have broad interest from other research groups studying parasitology, secretion and protein trafficking.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their critiques of our manuscript and for recognizing the importance of the questions about 3D genome organisation that it addresses. We plan to address most of their comments in our revised manuscript.

      Reviewer #1

      1. The aneuploid karyotype of the MCF-7 cells used is a concern. GREB1 is present in four copies, with two on abnormal chromosomes which may not be regulated in the same way as primary cells. The authors should include caveats to this effect in the text to account for this.

      We indicated (pg 5) that there are 4 copies of GREB1, 2 of which are on re-arranged chromosomes. RNA FISH (Figure 1C) suggests all 4 of these alleles are induced by estrogen. On each allele, the GREB1 enhancer and promoter remain closely apposed by imaging (Figure 2, DNA FISH) indicating no gross chromosomal rearrangements around the GREB1 locus. This is confirmed by our Hi-C data (Figure 2A), where any genomic rearrangements at the GREB1 locus would be detectable when the sequencing data were aligned to the reference genome. In the revised manuscript we highlight these points in the respective results sections (pgs 5 and 6). Our data suggest that all 4 alleles of GREB1 in MCF7 cells are regulated in the same way.

      2. The authors should also include more information on the generation and verification of the enhancer deletion cell lines. An illustration of the PCR primers used for screening, as well as an illustration of the sequenced product traces aligned with the reference genome (as opposed to just showing the deleted regions) should be included in Fig. S1D. This would give the reader more confidence that the designed knockout has occurred in the same way on all alleles. Furthermore, long-range PCRs and sequencing should be considered to confirm that no larger deletions have occurred (e.g. Owens et al., 2019 PMID: 31127293).

      We have replaced FigS1D with a new Figure Supplement (Figure S1.2A) that incorporates a more comprehensive diagram of the strategy used for the generation and screening of the enhancer deletion cell lines. This also includes the sequencing traces aligned to the reference genome for each of the clones used in this work. Additionally, in the revised manuscript, we will check the deletions using the C-TALE sequencing data obtained from the enhancer-deleted clones.

      1. The changes in the measured E-P interaction frequency following gene activation are __weak __at best and make visual interpretation of the results difficult. Showing the reciprocal virtual 4C plots from the promoter would help to reassure the reader that the observed effect is real.

      We thank the reviewer for this suggestion, and we will now include virtual 4C plots from the GREB1 and NRIP1promoters in our revised manuscript. These will be in figures 2B, 2E, 3C, 4C and in the supplementary figures 2B, 4B, 5C and 6C.

      4. Furthermore, the precise 3C method used is not clear. The authors repeatedly refer to "Capture-C" (a commonly used 3C-based approach using biotinylated oligos to pull down targets of interest) but the citation used (Golov et al. 2019) refers to a conceptually similar method called "C-TALE". This should be clarified in the text.

      We thank the reviewer for pointing out this potential confusion. We replace the term Capture-C with C-TALE throughout the revised manuscript.

      5. As for the changes in contact frequency, the observed changes in distance measurements between conditions are very small (although statistically significant). We acknowledge that this is likely due to the relatively small linear distances between enhancers and promoters in this study. However, it would be helpful to see the effects of the induction/treatments on a one or more control loci which is not affected by oestrogen signalling given that global changes in nuclear shape/volume and/or cell cycle effects could occur within this time (e.g. effects of tamoxifen treatment on MCF-7 cell cycle distribution, (Osborne et al. 1983 PMID: 6861130), which could impact nuclear volume.

      Data from DNA FISH control probes are already included in Supplementary Figure S3 showing no change in intra-nuclear distances and thus no general effects on chromatin compaction due to nuclear volume or cell cycle. Virtual 4C data for the entire captured regions around GREB1and NRIP1 are show in Fig S2C, also showing no general effect on the wider capture windows. We will include similar data from the viewpoint of the gene promoters in the revised manuscript. Hi-C and imaging data from the enhancer deletion cell lines (Fig S4) also supports that we are looking at an ER-specific effect, not a global one. With the regard to the comment on the effects of tamoxifen treatment on MCF-7 cell cycle distribution, we see no effects of tamoxifen on 3D genome organisation at GREB1 and NRIP1 by Hi-C or by imaging.

      6. The authors discuss previous studies demonstrating that E2 and 4OH recruit different sets of proteins to their target genes. Given that this is central to the conclusion that the ER ligand (and its recruited co-factors) determines the E-P interaction frequency and 3D distances observed, it would be important to demonstrate this at the GREB1/NRIP1 loci specifically. ChIP data of the co-activators/repressors recruited by E2 and 4OH, respectively, would greatly strengthen this claim.

      We acknowledge that investigating co-activator and co-repressor recruitment to the studied loci will strengthen our interpretation our conclusions. In the revision we will perform and include ChIP-qPCR at NRIP1, GREB1 and control loci assaying for PolII, co-activators such as p300, mediator and SRC-3 and the co-repressors N-CoR in control, estradiol and tamoxifen treated cells. We will also perform ChIP-qPCR of PolII and co-activators in cell treated with flavopiridol and triptolide.

      1. The observed uncoupling of E-P contact frequency and 3D distance upon transcriptional inhibition is interesting and offers clues to the molecular details underlying E-P interactions. However, the use of flavopiridol and triptolide, while common in the field, should be carefully qualified given the potential for their indirect effects on transcription. This is particularly important for flavopiridol given its ability to target multiple cyclin-dependent kinases beyond CDK9 and its role in transcription initiation.

      In the revised manuscript we indicate that “Flavopiridol inhibits several CDKs, including CDK9/PTEF-b”

      Minor comments:

      i. In the introduction and beginning of discussion, it would be helpful to detail previous studies where FISH-based analyses have shown more proximal E-P positioning upon activation, to make it clear that differences in E-P proximity appear to be gene-specific. Some examples include Williamson et al. (2016; PMID 27402708) and Chen et al. (2018; PMID 30038397). Speculation as to why some genes behave in this way while others do not, would also be worthwhile.

      We have followed the reviewer’s suggestion and noted these two studies in the Introduction of a revised manuscript. Given that the focus of this current manuscript is to explore discrepancies between Hi-C and DNA FISH, we do not think that this is the right forum for a wider discussion of why there might be differences in E-P proximity between different biological systems.

      ii. On page 6, the authors state that after deletion of the NRIP1 enhancer there is "almost total loss of NRIP1 induction in response to E2". This does not seem to match the data where in 3 out of 4 replicates (2 for each clone) there is a statistically significant increase in number of RNA FISH foci upon E2 stimulation in the NRIP1 enhancer KOs. This suggests that, as for GREB1, the regulation of these genes is not solely controlled by the deleted enhancers. This should be clarified in the text.

      The reviewer is referring to the data on NRIP1 expression in two NRIP1 enhancer deletion clones in Fig 1D and the replicate data in Supplementary Fig S1 (upper-right panel). These data show almost no induction of NRIP12 by E2 compared to wild-type cells. We stand by our statement.

      iii. The labelling of the FISH probes in Supp. Fig. S2 could be improved as it is currently very difficult to read these.

      We will try to improve this in a revised Figure S2.

      iv Given that the authors have referenced a distance of 200 nm as potentially being an important threshold for gene activation, it would be useful to include the fraction of alleles which are below this distance alongside the cumulative frequency plots in Figure 2D and elsewhere in the paper as the cumulative frequency plots can be hard to read in some cases (e.g. Supp. Fig. S3B e-p). This would also allow the authors to show consistency across replicates.

      We thank the reviewer for this suggestion to make the data easier to interpret. In a revised manuscript, we will incorporate the fraction of alleles below and above 200 nm for the DNA-FISH experiments in Figure 2D and Figure S4A-B.

      v. For clarity, it would be helpful to include the difference map between the vehicle-treated unstimulated/stimulated conditions for the 3C plots in Fig. 4. This would help contextualise the resulting differences observed with the drug treatments. Same for Supp. Fig. S6.

      We will include the difference heatmap between the vehicle- and estradiol treated samples for vehicle, flavopiridol and triptolide treated samples.

      vi. Statistical comparisons are not shown for all 3D FISH-based distance measurements (e.g. Supp. Figs. S3A, S4C, D, S6E). If this is because the tests were done and the results were non-significant this should be indicated.

      We had omitted all non-significant p values (>0.05) from the graphs to stop them getting too cluttered. All p values are documented in the supplementary tables. However, following the reviewer’s comment, we will indicate all non-significant statistical comparisons on the graphs.

      vii. On page 13, the authors state that increased E-P separation occurs "before nascent transcription of the gene is detected by either TT-seq or RNA FISH". This does not appear to be correct given that baseline levels of transcription are observed in the absence of ER stimulation by both methods (Fig. 1). This should be clarified in the text.

      We have amended this statement to now indicate that “This is before an induction of nascent transcription of the gene….”

      Reviewer #2

      1. The authors make strong claims and although these are generally reasonably well supported by the data, it is important to acknowledge that they are based on two loci. This manuscript would be stronger if the authors could include additional loci in their study design. If this is not possible, it would be good to acknowledge that the conclusions are preliminary/speculative at this stage.

      The reviewer makes a fair point, and we emphasized throughout the text – including at the end of the Discussion - that we are examining just two gene loci. In a revised manuscript we will include DNA-FISH data for a third locus comprising the CCND1 gene, for which we have preliminary data.

      *2. It would be helpful if the authors could clarify the strategy they used for their FISH probe design. The enhancer and promoter fosmid probes (which are used for the majority of the experiments) are not centered on the active elements and do not even seem to overlap in the case of the GREB1 enhancer fosmid probe. The 10 kb enhancer probe seems better placed for the GREB1 locus, but the 10 kb enhancer probe does not seem to overlap with the enhancer in the NRIP1 locus. It is conceivable that the exact location of the probes has a big impact on the measurements and it would therefore be helpful if the authors could comment on the location of the probes and add additional probes if required to strengthen their conclusions. In addition, the fosmid probes are very large (40 kb). Although the authors acknowledge this, it would be helpful if they could comment on how overlap between 40 kb probes should be interpreted in relation to a potential rather focal contact between (proteins bound to) regions of In the case of GREB1, the fosmid probes were chosen to maximize the distance between them as the promoter and the enhancer of the gene are genomically relatively close to each other. This was not an issue in the case of the NRIP1 locus where fosmid probes could be placed centered on the TSS and the enhancer region. In the case of the 10 kb probes, these were designed to be centered on the regions where higher E2-induced C-TALE contact frequencies were detected. Virtual 4C plots using the TSS regions as viewpoints (incorporated into the revised manuscript) clearly show that, in the case of NRIP1, the contact frequency peak does not fall on the main ER peak.

      1. It is not clear to me why the authors would choose to work with a locus that is present in 4 copies in their cell line. Is the entire regulatory region (incl. enhancers) preserved for the two additional copies of the gene? Can the authors comment on how this may impact on their measurements?

      See response to Reviewer 1, point 1. Our Hi-C data would have revealed if there were genomic rearrangements in the 600kb window surrounding GREB1.

      4. Figure 2D shows an increase in E-P separation for the NRIP1 locus across all timepoints, with cumulative frequency plots shown for the 10 min timepoint. However, the data for the second replicate shown in Figure S2D are a lot less robust and not significant for the 10 min timepoint. It is important that the authors either provide additional data to support the robustness of this experiment or acknowledge that the results are not fully reproducible.

      We acknowledge this, but we would like to note that there is an increase in the median distance for all time points, although this difference is not significant in some of the timepoints. Additionally, DNA-FISH data obtained using the 10 kb probes confirm these observations.

      5. The data presented in Figure 2F for clone 2 of the GREB1 enhancer deletion still show increased E-P distance upon activation. How do the authors explain this?

      This increase in distance is not statistically significant (p-0.33 – see Table S2) and is not seen for the replicate data in Fig. S4.

      Minor comments:

      i. Could the authors comment on the observation that the NRIP1 promoter is not bound by ERa or p300 upon estrogen activation? Are there ATAC-seq or H3K27ac ChIP-seq data available for these conditions?

      We included ATAC-seq tracks in Figure 1A where a peak on the NRIP1 promoter is clearly seen.

      ii. It is not obvious which timepoint is shown in Figure 1D.

      Pre-mRNA FISH in enhancer deleted clones was done in cells treated with vehicle or E2 for 60 minutes. This will be made clearer in the figure legend.

      iii. Why did the authors choose e-i and p-i instead of e-c and p-c in Supplementary Figure 3B?

      We apologize as it was an oversight not to include the e-c data for this experiment. This is now included in Supplementary figure S4B.

      iv. "We treated hormone starved MCF-7 cells with flavopiridol or triptolide for 5 min before adding E2 for 30 min (Fig. 4A)." Does this mean that the FLV/TRP treatment lasted for 35 min or did the authors wash it out before adding E2? Please clarify.

      This observation is correct, and it was made clear in Figure 4A and in the figure legend.

      v. The authors refer to their Capture-C data as "high-resolution". However, the methods section mentions that the data for the GREB1 and NRIP1 locus are 5 kb and 10 kb resolution, respectively. This is not particularly high for a targeted approach, certainly not in light of the MNase-based approaches that have recently been developed. I therefore think that the "high-resolution" claims should be removed from the paper.

      In line with the reviewer’s suggestion, we have removed the term high-resolution when referring from our own data.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Gómez Acuña and colleagues have investigated changes in enhancer-promoter (E-P) interactions with both 3C and DNA FISH. As a model system, they have used the activation of estrogen receptor-dependent enhancers, which allows for examination of changes in E-P interactions at relatively high temporal resolution. Surprisingly, they find that gene activation is associated with increased E-P interactions as measured by 3C but reduced spatial proximity as measured by DNA FISH. The authors show that both these measurements are dependent on the presence of the enhancer. In contrast, blocking transcription with inhibitors does not have a strong effect on the 3C measurements, but abolishes the increased spatial E-P separation as measured by DNA FISH following estrogen induction.

      Overall, this is an interesting and thought-provoking study. However, the strong conclusions are not fully supported by the data, as explained in further detail below.

      Major comments:

      • The authors make strong claims and although these are generally reasonably well supported by the data, it is important to acknowledge that they are based on two loci. This manuscript would be stronger if the authors could include additional loci in their study design. If this is not possible, it would be good to acknowledge that the conclusions are preliminary/speculative at this stage.
      • It would be helpful if the authors could clarify the strategy they used for their FISH probe design. The enhancer and promoter fosmid probes (which are used for the majority of the experiments) are not centered on the active elements and do not even seem to overlap in the case of the GREB1 enhancer fosmid probe. The 10 kb enhancer probe seems better placed for the GREB1 locus, but the 10 kb enhancer probe does not seem to overlap with the enhancer in the NRIP1 locus. It is conceivable that the exact location of the probes has a big impact on the measurements and it would therefore be helpful if the authors could comment on the location of the probes and add additional probes if required to strengthen their conclusions. In addition, the fosmid probes are very large (40 kb). Although the authors acknowledge this, it would be helpful if they could comment on how overlap between 40 kb probes should be interpreted in relation to a potential rather focal contact between (proteins bound to) regions of <1 kb.
      • It is not clear to me why the authors would choose to work with a locus that is present in 4 copies in their cell line. Is the entire regulatory region (incl. enhancers) preserved for the two additional copies of the gene? Can the authors comment on how this may impact on their measurements?
      • Figure 2D shows an increase in E-P separation for the NRIP1 locus across all timepoints, with cumulative frequency plots shown for the 10 min timepoint. However, the data for the second replicate shown in Figure S2D are a lot less robust and not significant for the 10 min timepoint. It is important that the authors either provide additional data to support the robustness of this experiment or acknowledge that the results are not fully reproducible.
      • The data presented in Figure 2F for clone 2 of the GREB1 enhancer deletion still show increased E-P distance upon activation. How do the authors explain this?

      Minor comments:

      • Could the authors comment on the observation that the NRIP1 promoter is not bound by ERa or p300 upon estrogen activation? Are there ATAC-seq or H3K27ac ChIP-seq data available for these conditions?
      • It is not obvious which timepoint is shown in Figure 1D.
      • Why did the authors choose e-i and p-i instead of e-c and p-c in Supplementary Figure 3B?
      • "We treated hormone starved MCF-7 cells with flavopiridol or triptolide for 5 min before adding E2 for 30 min (Fig. 4A)." Does this mean that the FLV/TRP treatment lasted for 35 min or did the authors wash it out before adding E2? Please clarify.
      • The authors refer to their Capture-C data as "high-resolution". However, the methods section mentions that the data for the GREB1 and NRIP1 locus are 5 kb and 10 kb resolution, respectively. This is not particularly high for a targeted approach, certainly not in light of the MNase-based approaches that have recently been developed. I therefore think that the "high-resolution" claims should be removed from the paper.

      Referees cross-commenting I agree with the comments raised by Reviewer 1

      Significance

      Since 3C and DNA FISH are widely used, the discrepancy between these measurements that is described here is of potential broad interest to the field. Since these claims are rather strong and have potential far-reaching implications, it would be helpful if the authors could strengthen their conclusions further, by improving the robustness of the data and including additional loci and additional probes to show that the measurements are not specific for these two loci or dependent on the location of the probes. I think that the paper is in principle also publishable without these additional experiments, but in that case, it would be very important to explicitly acknowledge the limitations of the data throughout the manuscript and clarify that the conclusions are preliminary/speculative at this stage.

      Expertise: 3D genome organization.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Gómez Acuña et al.

      Transcription decouples estrogen-dependent changes in enhancer-promoter contact frequencies and spatial proximity

      Summary

      This study by Gómez Acuña et al. addresses a highly relevant question in the field of gene regulation: what are the mechanisms by which enhancers and promoters communicate to control expression of a target gene? This work tackles an apparent dichotomy as to whether enhancers (E) and promoters (P) need to come into close physical proximity in order to activate gene expression. While 3C-based methods have consistently suggested that the frequency of 'contacts' between enhancers and promoters increases during gene activation, imaging approaches have, in some instances, shown that these elements move further away from each other during this process. Gómez Acuña et al. have used 3C- and imaging-based methods in parallel at two inducible genes in the MCF-7 breast cancer cell line to confront this issue.

      The study focusses on two oestrogen-responsive genes, GREB1 and NRIP1, which are fully activated within 1 h of oestradiol (E2) treatment of MCF-7 cells in in vitro culture. Using a capture-based 3C method, the authors show small increases in contacts between putative enhancers and promoters during gene induction. In contrast, for both genes they also observe increased distances between these elements by DNA FISH. The authors used CRISPR-Cas9 to delete the putative nearby enhancers and showed that these phenomena are likely dependent on these elements.

      Next, the authors questioned whether recruitment of the oestrogen receptor (ER) itself or co-activators recruited by ER upon stimulation are responsible for the observed effects. By comparing E2 with tamoxifen (4OH), which recruit co-activators and -repressors, respectively, they only observed the effects on E-P contact and proximity in the presence of the activating stimulus (E2), suggesting recruitment of activating protein machinery is required.

      Finally, they tested the effects of small molecules known to inhibit different stages of the transcription cycle. Both flavopiridol and triptolide abrogated the effect on E-P proximity observed by DNA FISH but did not affect the increase in E-P contacts seen by C-TALE. This suggested that these phenomena can be uncoupled, and that the act of transcription itself (and perhaps the associated production of RNA) is required for the physical separation of enhancers and promoters but not the increase in E-P contact frequency.

      Major comments

      Given the significance of the question posed in this study (see Significance section), it is important that the claims highlighted in this paper are appropriately supported by the experimental evidence. There are a number of issues with the design of the experiments in the current manuscript which are as follows:

      1. The aneuploid karyotype of the MCF-7 cells used is a concern. GREB1 is present in four copies, with two on abnormal chromosomes which may not be regulated in the same way as primary cells. The authors should include caveats to this effect in the text to account for this.
      2. The authors should also include more information on the generation and verification of the enhancer deletion cell lines. An illustration of the PCR primers used for screening, as well as an illustration of the sequenced product traces aligned with the reference genome (as opposed to just showing the deleted regions) should be included in Fig. S1D. This would give the reader more confidence that the designed knockout has occurred in the same way on all alleles. Furthermore, long-range PCRs and sequencing should be considered to confirm that no larger deletions have occurred (e.g. Owens et al., 2019 PMID: 31127293).
      3. The changes in the measured E-P interaction frequency following gene activation are weak at best and make visual interpretation of the results difficult. Showing the reciprocal virtual 4C plots from the promoter would help to reassure the reader that the observed effect is real. Furthermore, the precise 3C method used is not clear. The authors repeatedly refer to "Capture-C" (a commonly used 3C-based approach using biotinylated oligos to pull down targets of interest) but the citation used (Golov et al. 2019) refers to a conceptually similar method called "C-TALE". This should be clarified in the text.
      4. As for the changes in contact frequency, the observed changes in distance measurements between conditions are very small (although statistically significant). We acknowledge that this is likely due to the relatively small linear distances between enhancers and promoters in this study. However, it would be helpful to see the effects of the induction/treatments on a one or more control loci which is not affected by oestrogen signalling given that global changes in nuclear shape/volume and/or cell cycle effects could occur within this time (e.g. effects of tamoxifen treatment on MCF-7 cell cycle distribution, (Osborne et al. 1983 PMID: 6861130), which could impact nuclear volume.
      5. The authors discuss previous studies demonstrating that E2 and 4OH recruit different sets of proteins to their target genes. Given that this is central to the conclusion that the ER ligand (and its recruited co-factors) determines the E-P interaction frequency and 3D distances observed, it would be important to demonstrate this at the GREB1/NRIP1 loci specifically. ChIP data of the co-activators/repressors recruited by E2 and 4OH, respectively, would greatly strengthen this claim.
      6. The observed uncoupling of E-P contact frequency and 3D distance upon transcriptional inhibition is interesting and offers clues to the molecular details underlying E-P interactions. However, the use of flavopiridol and triptolide, while common in the field, should be carefully qualified given the potential for their indirect effects on transcription. This is particularly important for flavopiridol given its ability to target multiple cyclin-dependent kinases beyond CDK9 and its role in transcription initiation.

      Minor comments

      1. In the introduction and beginning of discussion, it would be helpful to detail previous studies where FISH-based analyses have shown more proximal E-P positioning upon activation, to make it clear that differences in E-P proximity appear to be gene-specific. Some examples include Williamson et al. (2016; PMID 27402708) and Chen et al. (2018; PMID 30038397). Speculation as to why some genes behave in this way while others do not, would also be worthwhile.
      2. On page 6, the authors state that after deletion of the NRIP1 enhancer there is "almost total loss of NRIP1 induction in response to E2". This does not seem to match the data where in 3 out of 4 replicates (2 for each clone) there is a statistically significant increase in number of RNA FISH foci upon E2 stimulation in the NRIP1 enhancer KOs. This suggests that, as for GREB1, the regulation of these genes is not solely controlled by the deleted enhancers. This should be clarified in the text.
      3. The labelling of the FISH probes in Supp. Fig. S2 could be improved as it is currently very difficult to read these.
      4. Given that the authors have referenced a distance of 200 nm as potentially being an important threshold for gene activation, it would be useful to include the fraction of alleles which are below this distance alongside the cumulative frequency plots in Figure 2D and elsewhere in the paper as the cumulative frequency plots can be hard to read in some cases (e.g. Supp. Fig. S3B e-p). This would also allow the authors to show consistency across replicates.
      5. For clarity, it would be helpful to include the difference map between the vehicle-treated unstimulated/stimulated conditions for the 3C plots in Fig. 4. This would help contextualise the resulting differences observed with the drug treatments. Same for Supp. Fig. S6.
      6. Statistical comparisons are not shown for all 3D FISH-based distance measurements (e.g. Supp. Figs. S3A, S4C, D, S6E). If this is because the tests were done and the results were non-significant this should be indicated.
      7. On page 13, the authors state that increased E-P separation occurs "before nascent transcription of the gene is detected by either TT-seq or RNA FISH". This does not appear to be correct given that baseline levels of transcription are observed in the absence of ER stimulation by both methods (Fig. 1). This should be clarified in the text.

      Referees cross-commenting

      Reviewer #2 is in good agreement with our review. The study is interesting but there are several caveats that need addressing as pointed out by both reviewers. We agree that it could take a lot of work to sort these points out in full experimentally but without this the authors should be careful not to overinterpret their data and comment on the shortcomings as it stands.

      Significance

      The question the authors are addressing here is of importance to the field. For several years, researchers have been attempting to uncover the mechanisms by which enhancers deliver information to their target promoters and this study highlights some potentially fundamental issues with the way in which these problems are typically addressed. It is generally accepted that in most cases, the frequency of 'contacts' between enhancers and promoters as measured by 3C-based methods increases as a gene becomes active. It is still unclear what exactly these 3C methods are measuring given that the radius of crosslinking and the extent to which protein-protein linkages between two DNA helices contribute to the observed contact frequency are unknown. In short, despite its widespread use, it is not clear whether contact frequency as measured by 3C is proportional to three-dimensional distance or something more nebulous. The issues raised in this paper are therefore critical for the accurate conception of mechanistic models of gene regulation.

      The use of a well-studied inducible system to enable sampling of gene activation events at high temporal resolution means that the authors can ensure a level of control over the experimental conditions. The experiments appear to have been carried out to a high standard, with necessary controls generally included where appropriate. The analyses are well-documented and the results are consistent over the various reported experiments within the manuscript. The conclusions are thought-provoking and will challenge the field to be clearer about what 3C-based methods can and cannot show. This study will also surely lead to follow-up work to more specifically address the possible molecular mechanisms explaining the above observations. The main limitations of this study are the cell line and genes studied, with GREB1 being present in four copies in the aneuploid genome of MCF-7 cells. As stated above, there is some concern that using cells with such a chaotic genome as a starting point for investigating E-P interactions might prevent the authors from drawing clear and unambiguous conclusions. Perhaps related to this, the effect size of the E-P interactions upon stimulation as measured by C-TALE/Capture-C are underwhelming to say the least. Particularly for NRIP1, it was sometimes hard to tell where to look to see the supposed increase in E-P interactions, even with arrows as guidance.

      Despite these limitations, this work raises an important issue that needs further clarification. It will require the field to reconsider what 3C-based analyses actually tell us with respect to E-P contact/proximity, and as a result models of E-P communication may need revising. This work is of broad interest to those work in the fields of nuclear biology and more specifically gene regulation. This will include both basic research scientists as well as those attempting to exploit enhancer-promoter dependencies for therapeutic purposes.

      Expertise

      Gene regulation, genome organisation and 3D structure.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank the 3 reviewers for their comments and suggestions for our manuscript. We believe that the revisions we plan to make, based on the comments by the reviewers, will greatly enhance the quality of our manuscript.

      We would like to respond to some reviewer comments here, since they do not fit into any of the subsequent sections.

      Reviewer #3

      In the Results section that describes the delay in gata2b expression (page 4 and Supp. Fig. 4), the authors show that the mutant embryos start expressing more gata2b at 30 - 36hpf after the decreased expression at earlier time points, with no difference at 48hpf. What could explain that recovery?

      We thank reviewer 3 for this question. The partial functionality of the Cx41.8 channel in cx41.8tq/tq mutants may explain why the HSPC program is eventually induced (leading to sufficient mitochondrial ROS production for Hif1/2α stabilisation). However, this could also result from functional redundancy between Cx41.8 and other connexins such as Cx43 or Cx45.6 in the mitochondria, since they are also expressed in zebrafish arterial ECs at 24hpf (Gurung et al, Sci Rep, 2022) and cx43 knockdown has previously been shown to result in an HSPC specification defect in zebrafish (Jiang et al, Fish Physiol Biochem, 2010). Together, these aspects may explain the recovery, although delayed, of gata2b expression in the cx41.8tq/tq mutant, as discussed in detail in our manuscript.

      The authors showed that gata2b expression can be rescued by ROS induction in the dose-dependent manner (page 6 and Fig.3 and Supp. Fig. 6). Is this what rescues gata2b expression at 30hpf in the cx41.8 mutants?

      This is exactly right, we hypothesize that in cx41.8tq/tq mutants, it takes longer for mitochondrial ROS production to reach above the threshold required to stabilise Hif1/2α and hence induce gata2b expression, which is supported by the data referred to by this reviewer.

      Are any vascular defects in the mutant embryos?

      Our lab previously reported that cx41.8tq/tq embryos have faster ISV growth rate (Denis et al, Front Physiol, 2019). However, we found no evidence of a link between the ISV growth rate increase and the HSPC specification defect in these embryos. Importantly, we show that aorta specification is normal in cx41.8tq/tq mutants, as determined by dll4 expression at 24 (Supp. Fig. 1C) and 28 hpf (Supp. Fig. 1D).

      2. Description of the planned revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary

      The manuscript by Petzold et al. explores the functions of connexin 41.8 (cx41.8) (mammalian homologue Connexin 40) in hematopoietic stem cell (HSC) formation in the zebrafish dorsal aorta. The authors use a cx41.8 allele that appears to be hypomorphic, as the phenotype is milder than a previous cx41.8 allele that the same group published (Cacialli et al., 2021). cx41.8tq/tq mutants exhibit delayed onset of hemogenic endothelial specification, as marked by gata2b at 24 hpf, but HSPC development proceeds normally from 48 hpf onwards. A new reporter line for cx41.8, Tg(cx41.8:GFP), was generated and is expressed in the floor of the dorsal aorta, consistent with the location of hemogenic endothelial cells. Lower ROS production in the whole cell and in the mitochondria was reported in the cx41.8tq/tq mutants, and treatment with ROS enhancers, H2O2 and menadione, appeared to rescue the mutant phenotype of reduced HSPCs at 28 hpf. Finally, the authors tested a link between cx41.8 and Hif1α by pharmaceutically (DMOG/CoCl2) or genetically (vhl morpholino) inhibiting Hif inhibitors, and observed a rescue of HSPC formation in cx41.8 mutants.

      I think it would be important for the authors to address the mechanisms of why cx41.8tq/tq and the other cx41.8-/- (leot1/t1) mutant phenotypes are different, with the latter allele showing more severe phenotypes of increased HSPC apoptosis and reduced HSPCs during later development. The authors speculate the cx41.8tq/tq allele encodes a missense mutation in one of the channel domains, and as such, might be a hypomorph. The authors cited the original paper by Watanabe et al. (2006); however, this paper actually noted that the cx41.8tq/tq allele is likely to be a dominant negative - and as such, should have exhibited a stronger phenotype than the leot1/t1 mutant allele. From the paper: "leotw28 and leotq270 heterozygotes have phenotypes different from that of WT; thus, they represent dominant-negative alleles." Importantly, no data are shown to provide evidence that the allele is a hypomorph - at minimum, qPCR data should be provided to show whether there is NMD of the mRNA in cx41.8tq/tq mutants.

      We would like to thank the reviewer for this comment and suggestion. As the reviewer has rightly pointed out, the cx41.8tq/tq mutation is thought to result in a protein with dominant-negative function (Watanabe et al, EMBO Rep, 2006; Watanabe et al, J Biol Chem, 2016).

      In fact, we agree that the mutant cx41.8tq/tq protein acts as a dominant-negative and although the reviewer is right to point out that the cx41.8t1/t1 mutant may thus exhibit a stronger phenotype which we found not to be the case (runx1 expression was found to be normal in the cx41.8t1/t1 mutant, Cacialli et al, Nature Commun, 2021), we provided our explanation for this in the discussion of the manuscript:

      “The partial functionality of the Cx41.8 channel in cx41.8tq/tq mutants [14] may explain why the HSPC program is eventually induced. However, this could also result from functional redundancy between Cx41.8 and other connexins such as Cx43 or Cx45.6 in the mitochondria, since they are also expressed in zebrafish arterial ECs at 24hpf [18] and cx43 knockdown has previously been shown to result in an HSPC specification defect in zebrafish [36]. This potential functional redundancy may also provide an explanation as to why HSPCs are specified normally, without any delay, in cx41.8t1/t1 embryos [12]. In these null mutants, cx41.8 expression is completely absent but may be functionally compensated by other connexins, whereas in cx41.8tq/tq mutants, although cx41.8 is expressed, its channel function is reduced [14]. Moreover, as Cx41.8 may form heterotypic channels with Cx43 and/or Cx45.6 (and potentially also with others), the function of these chimeric channels would also be altered”

      We believe this addresses the reviewers concern regarding this, especially given the fact that Cx43 and Cx45.6 have been found to be expressed in arterial ECs at 24 hpf, as cited in the manuscript. With regards to the reviewer’s question about whether there is NMD of the cx41.8 transcript, given that the cx41.8tq/tq mutation is missense and does not result in a premature stop codon (usually required for NMD to be induced, Kurosaki et al, J Cell Sci, 2016), we do not believe that there is NMD of the cx41.8 transcript in cx41.8tq/tq mutants. We will however verify this by carrying out the experiment suggested by this reviewer, qPCR analysis of cx41.8 expression in cx41.8tq/tq embryos and wild-type controls.

      The quantification data in this manuscript are not satisfactory. The authors only provide graphs that show embryos with "low", "medium" and "high" numbers of HSPCs, which is incredibly subjective. Considering that the authors already have the cx41.8tq/tq in the Tg(myb:GFP) background (Figure 1E), they could have quantified the precise numbers of Tg(myb:GFP)-positive cells at different timepoints and with the different pharmaceutical rescue experiments. Ideally, this should be combined with other HSPC markers such as Tg(cd41:GFP) or Tg(runx1:GFP) - although this could be limited by the authors' access to the lines or time it takes to cross the mutants to the transgenes.

      We thank reviewer 1 for their concern regarding this. Indeed the reviewer is correct, it would take us too long (at least 6 months) to generate the cx41.8tq/tq cd41:GFP or cx41.8tq/tq runx1:GFP lines, however, as stated, we do already have the cx41.8tq/tq cmyb:GFP zebrafish line. That said, repeating the pharmacological experiments using the cx41.8tq/tq cmyb:GFP zebrafish line would demand months of work and we do not currently have the personnel to perform all of this. However, we will perform the same experiment as performed previously to generate figure 1E but also at earlier timepoints. The cmyb:EGFP transgene marks nascent HSPCs from 28 hpf, and so we will aim to image, and quantify differences in budding HSPCs in cx41.8tq/tq cmyb:EGFP and cmyb:EGFP controls between 28 hpf and 36 hpf. We agree with the reviewer that this will add depth to our study and will provide evidence to back up our conclusions.

      The link between cx41.8 and Hif1α is tenuous. The authors should perform in situ hybridization for the hif1 genes and their downstream effector notch1 which is known to be important for the HSPC specification (Gerri et al., 2018).

      We thank the reviewer for this point. we do not expect hif1/2α expression to be affected in this mutant. Mitochondrial ROS has been shown to stabilise Hif1/2α at the protein level, not the mRNA level. Our data, and that of others (Harris et al, Blood, 2013), suggest that in the absence of mitochondrial ROS, prolyl hydroxylases are not inhibited by mitochondrial ROS, and they target Hif1/2α for ubiquitination and subsequent destruction in a Vhl-dependent manner (as shown in Fig. 4D). We have changed the text in the manuscript to clarify that Hif is stabilised on the protein level (please see the section below).

      Since we do however expect notch1a and notch1b expression to be altered in our mutant embryos, as they are transcriptionally regulated by Hif1/2α (Gerri, Blood, 2018), we will perform in situ hybridisation and qPCR analysis of these 2 genes at 18-24 hpf in cx41.8tq/tq mutants and controls to clarify this point and solidify our model.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary

      Petzold et al are here addressing the potential function of the connexin Cx48.1, a protein involved in the structure of gap junctions, in the specification of future hematopoietic stem cells and progenitors (HSPCs). This piece of work is complementing their previous results showing the function of this connexin isoform in HSPC expansion in the transient hematopoietic niche in the caudal tissue of the zebrafish embryo. They explore phenotypes triggered by the expression of a mutant form bearing a single amino-acid substitution in the fourth transmembrane domain of the protein. Using whole mount in situ hybridization (WISH) of the two transcription factors Gata2b and Runx1, a novel transgenic fish line that expresses eGFP under the control of the Cx48.1 promoter region, and a series of drug treatments interfering with, or promoting, the formation of reactive oxygen species (ROS) production and oxidative stress, they propose that Cx48.1 is also involved upstream of HSPC amplification, rather in their specification at the level of the hemogenic endothelium constituting the ventral floor of the dorsal aorta. Mechanistically, they hypothesize that this function relies on mitochondria-derived ROS that would destabilize the VHL protein involved in mediating the degradation of Hif1/2a transcription factors, thereby stabilizing the Hif1/2a-Notch1a/b signaling axis involved in specification of the hemogenic endothelium.

      The WISH and quantitative analyses.

      Most of the quantitative analyses in the work are based on chromogenic WISH, which is not sufficiently accurate because leading to highly variable results, in addition to its lack of linearity. WISH is also subjected to important variations, particularly for transcription factors that are expressed at low levels such as Runx1, and to some extent Gata2b also. One obvious example in the paper is the inconsistency of signals that are observed Fig1C (Gata2b, left, wt, 24hpf) and FigS3B (Gata2b, left, wt, 24hpf) in which the signal is barely visible and is comparable to the signal for the cx41.8tq/tq mutant Fig1C, right.

      In addition, in the timings that are analyzed in FigS3 (Gata2b, 26 and 28hpf) to argue on temporal delay of expression in the cx41.8tq/tq mutant, the Gata2b signal is masked by the strong increase in tissues other than the hemogenic endothelium in the dorsal aorta (including signal in the somites as well as, possibly, increase in background). In this very example, it is legitimate to question the accuracy of the quantification methodology when the signal in the tissue of interest is drowned in the overall signal from surrounding tissues; how can the authors explain the 100% of embryos that have a 'Low' signal in the region of interest (FigS3C, cx41.8tq/tq mutant in comparison to WT)? This point is also valid for the data quantified FigS4 in which the fitting between WISH data and the quantifications appears to be questionable (for all timing points: 30, 32, 36, 48hpf and comparing mutant with the WT.

      My suggestion would be to complement the WISH data and improve the quantitative analyses using another, more accurate approach such as qRT-PCR for example (on dissected trunk regions and, if necessary because of expression in other surrounding tissues (in the case of Gata2b at later time points), after FACS-sorting using a fish line expressing a fluorescent reporter driven by a vascular promoter, ex: the kdrl:mCherry line used in the work). This is particularly important for the expression of the two transcription factors Runx1 and the more upstream Gata2b, the latter being involved in HSPC specification which is taken as a reference. qRT-PCR experiments should be feasible relatively easily and in a reasonable time frame as the technics is not very time consuming and easily accessible.

      We thank reviewer 2 for their concerns regarding the in situ quantifications used during this study. Although the approach we have used is widely used in the field to quantify gene expression differences, we appreciate that our data could be strengthened by complementing it with another approach. As such we will do the following:

      • We will complement our in situ hybridisation characterisation of delayed hemogenic endothelium formation and HSPC specification with qPCR experiments. For this, we will dissect the trunks of 8tq/tq embryos and controls and perform qPCR analysis of gata2b expression at the timepoints analysed during development (Supp. Fig. 3 A-D and Supp. Fig. 4 A-D), whilst also using the same approach to compliment the data for gata2b and runx1 expression at 24 hpf (Figure 1C and D). We agree with the reviewer that this is a feasible approach and would add robustness to the data we already show.

        2- Fluorescence imaging and associated interpretation/conclusions.

      The fluorescence images (Fig1E; Fig2B,D; Fig3A) are very difficult to analyze; they lack resolution because they appear to be epifluorescence images and not confocal images. When the signal is low, which is in particular the case for the novel Cx41.8:EGFP fish line, Fig2B (which is confirmed with the FACS GFP signal in comparison to the mCherry of the kdrl:mCherry fish line), it is not possible to provide convincing images on the vascular/aortic expression because of the high background of diffusion (the authors state 'likely to be the aortic floor', indeed it is not possible to validate the fact that the expression is truly in potential hemogenic cells). The double positive population in the FACS (Fig2C, right) does not resolve the issue because if indeed cx41.8 is expressed in endothelial cells (as expected from previous studies), the double positive population could equally be endothelial cells from inter-somitic vessels, for example (not to mention the underlying vein which is very close to the aorta in the trunk)). Fig2D, images are too small and, again, the resolution is not good enough to say that double positive cells are on the aortic floor. It is recommended to convince the reader that the authors try to confirm their statements by using confocal microscopy and increase the magnification of the relevant regions of interest.

      We thank this reviewer for this point. We will address this concern by using, as they suggest, confocal microscopy to try to get higher resolution images. In particular, we will do the following:

      • We will use confocal microscopy to image the 8:EGFP line as was done previously (Fig 2B), in order to obtain higher resolution images of expression of cx41.8 in the floor of the aorta.
      • We will also use confocal microscopy to image the 8:EGFP;kdrl:mCherry line as was done in Fig 2D, in order to gain higher resolution images.
      • We will also increase the magnification of the relevant regions of our confocal microscopy images as suggested by this reviewer.

        There is an inconsistency in the data between Fig1E (40hpf, in vivo imaging using the cmyb:GFP fish line) and FigS2 (48hpf, WISH cmyb); how can we observe 'HSPCs budding from the dorsal aorta' (see legend Fig1, arrowheads) which seems very much decreased in the imaging experiment for the cx41.8tq/tq mutant in comparison to WT, and have no effect on the cmyb signals FigS2B? What are the GFP+ cells that are aligned along the elongated yolk Fig1E and that appeared to be decreased in number in the mutant?

      We agree that this disparity is confusing for the reader. We believe the disparity between these results is due firstly to the fact that the experiment in Supp. Fig 2C was performed 8 hours after that in Fig 1E and secondly due to the time it takes for GFP to fold (in the case of Fig 1E). It is also important to keep in mind that the phenotype is not a complete absence of HSPC budding, but only a delay in the onset of EHT.

      • We will however address this concern by carrying out the experiment described above - we will perform the same experiment as performed previously to generate figure 1E but also at earlier timepoints. The cmyb:EGFP transgene marks nascent HSPCs from 28 hpf, and so we will aim to image, and quantify differences in budding HSPCs in 8tq/tq cmyb:EGFP and cmyb:EGFP controls at numerous timepoints from 28 hpf to 36 hpf. This will add depth to our study by providing evidence to back up our conclusions.
      • We will remove the 40-hpf timepoint (Fig 1E) to avoid confusion regarding the disparity with cmyb expression by WISH in Supp. Fig 2C.
      • Regarding the GFP+ cells aligned along the yolk in 1E, we thank the reviewer for pointing this out. These cells are multiciliated cells, from the kidney tubules (Wang et al, Development 2013). We will determine whether their numbers do indeed differ between 8tq/tq;cmyb:EGFP and cmyb:EGFP controls in our new confocal experiments and will mention this in the manuscript if they do.

        It would be important to investigate/show, at least with qualitative WISH experiments all along the time-window of HSPC specification as stated by the authors (26-54hpf, see main text third paragraph of Results), that Cx41.8 is detected in arterial endothelial cells (and perhaps enriched in the hemogenic endothelium?), in complement to the work they are referring to on transcriptomic data at 24hpf (Ref18 Gurung et al Sci Rep 2022). Ideally, these WISH data should be resolutive enough to provide clear localization in aortic cells versus cells in the aortic floor to bring significant added value to the work that lacks spatial resolution (ex: fluorescent WISH using confocal microscopy, allowing to superpose signal with cell types (either by double fluorescent WISH (vascular marker + Cx41.8) or superposing fluorescence signals with transmitted light)).

      We agree with this reviewer regarding this point. The way we will address this is to use confocal microscopy at different timepoints from 24-40 hpf using the cx41.8:EGFP; kdrl:mCherry line to show that expression of cx41.8 is indeed present and enriched in the floor of the dorsal aorta during the timeframe of HSPC specification. We believe that imaging this line using confocal microscopy will be sufficient to clearly show this.

      It would be more informative and secure, Fig2D, to show images of the double transgenics (Cx48.1:eGFP;kdrl:mCherry) at 28-30 hpf (rather than 48 hpf) which is more narrowed down to the specification of the hemogenic endothelium thus preventing any risk to visualize the fluorescence signals coming from recently born HSPCs rather than signals from cells embedded in the aortic floor.

      We thank the reviewer for this suggestion, which we believe would indeed improve the manuscript. As discussed above, we will indeed use confocal microscopy at different timepoints, including 28-30 hpf using the cx41.8:EGFP;kdrl:mCherry line to show that expression of cx41.8 is indeed present and enriched in the floor of the dorsal aorta during the timeframe of HSPC specification. We believe that imaging this line using confocal microscopy will be sufficient to clearly show this and so thank the reviewer for this excellent suggestion.

      To make the data more convincing on the ROS production in the ventral side of the cord in wild type embryos (which suggests that future hemogenic cells are already ventralized at that stage), it would be important to obtain confocal images of the region of interest and perform reconstitution of Z-stacks with a sagittal view (rather than longitudinal). It would be nice also to obtain comparable images later on, after lumenization and before initiation of HSPC emergence (before 28hpf).

      We thank the reviewer for this suggestion and agree that the suggested approach will solidify our data. As such, we will carry out the proposed experiment, using confocal imaging to gain longitudinal and sagittal images of mitoSOX staining in WT embryos and cx41.8tq/tq mutants at both 16 hpf and 26 hpf.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1

      Related to the above point, the authors should test whether the gap junction function of Cx41.8 is intact in the cx41.8tq/tq mutants by assessing calcium waves in the GCamp transgenic line.

      …we have also now found additional published in vivo evidence that Cx41.8 channel function is reduced in the cx41.8tq/tq mutant, which is now also cited in the new version of the manuscript (please see our full response to this point below).

      Please see the section “Description of analyses that authors prefer not to carry out” for additional information regarding the GCamp experiment suggestion.

      The link between cx41.8 and Hif1α is tenuous. The authors should perform in situ hybridization for the hif1 genes…

      We thank reviewer 1 for making this point. To clarify this, we do not expect hif1/2α expression to be affected in this mutant. Mitochondrial ROS has been shown to stabilise Hif1/2α at the protein level, not the mRNA level. Our data, and that of others (Harris et al, Blood, 2013), suggest that in the absence of mitochondrial ROS, prolyl hydroxylases are not inhibited by mitochondrial ROS, and they target Hif1/2α for ubiquitination and subsequent destruction in a Vhl dependent manner (as shown in Fig. 4D).

      To clarify this in the manuscript, we have adjusted the text in three places (including in the abstract) to clarify that Hif1/2α is stabilised at the protein level, as shown below. We believe these changes have made this important point more understandable for the reader:

      1. “… Mitochondrial-derived reactive oxygen species (ROS) have been shown to stabilise the hypoxia-inducible factor 1/2a (Hif1/2a) proteins, allowing them..”
      2. “Recent research has demonstrated that hypoxia and mitochondrial ROS are required for the stabilisation of the transcription factors Hif1/2a at the protein level”
      3. “… as mitochondrial ROS generation may eventually reach the threshold required to sufficiently stabilise the Hif1/2a proteins for downstream”

        Reviewer #2

      Importantly, it appears also that all over the WISH quantifications, the reader cannot appreciate the accuracy of the categories High/Medium/Low, which is not at all developed in the Methods section (paragraph Image processing and WISH phenotypic analyses).

      We have developed the Methods section (paragraph Image processing and WISH phenotypic analyses), which was highlighted as a concern by this reviewer, in order to detail exactly how we performed our image analysis and statistical analyses using this approach. We believe this will satisfy the concerns reviewer 2 has regarding this and appreciate that they have a point that this was indeed underdeveloped in the original submission.

      Finally, there is a confusion in the quantification regarding the number of HSPCs (see the beginning of the second paragraph of Results 'The HSPC specification defect in cx41.8tq/tq mutants is due to a delay in Gata2b expression') and the % of embryos falling into the 3 categories High/Medium/Low FigS2, cmyb 48hpf. The authors use this argument (based on the WISH cmyb signals) to infer that the deficit in the cx41.8tq/tq mutant is not due to controlling HSPC number (no difference in cmyb between WT and mutant) but rather upstream, at the level of the hemogenic endothelium, which is not a thorough argument at that point.

      We thank reviewer 2 for pointing this out to us and agree that the wording we used is a little confusing. We have therefore added to the first sentence of the second paragraph in the results section “'The HSPC specification defect in cx41.8tq/tq mutants is due to a delay in gata2b expression” which now reads:

      “Hence, since HSPC specification is initially reduced, but then recovers in cx41.8tq/tq embryos, we suspected a delay in the formation of the haemogenic endothelium in these mutants. To test this hypothesis…”

      We believe this change to the manuscript will satisfy the reviewers concern by making this section more logical for the reader.

      The authors should take care of the fact that at 16hpf, it is an overstatement to speak of an aorta when the cord is starting to lumenize at around 18hpf, Jin et al Development 2005 (see Main text referring to Fig3).

      We thank the reviewer for this clarification. We have changed the relevant text to state “vascular cord” instead of “aorta” and have mentioned that it begins to lumenize around 18hpf for clarification. We have also added the suggested reference.

      Reviewer #3

      As Gata2 has been shown to be a positive autoregulator of itself in mice (Nozawa 2009, Katsumura 2016) and might do so in zebrafish (Dobrzycki 2020), so could gata2b recover itself, in a dose-dependent manner, without the Hif-Nocth1 axis once enough of it is expressed?

      We thank reviewer 3 for this suggestion. We believe that our data show that Cx41.8 is required for mitochondrial ROS production, which stabilises Hif1/2α and switches on downstream gata2b via Notch1a/b (which will be added, see previous section). As such, we believe that the Hif1/2α/Notch1a/b axis is required, at least for the initial induction of gata2b expression. However, reviewer 3 makes a very interesting point regarding the potential for gata2b to positively autoregulate itself, which may of course occur once gata2b expression has been induced by the Cx41.8-mitoROS-Hif1/2α-Notch1a/b-gata2b pathway. We thank the reviewer again for this interesting proposition and have added this suggestion into our discussion in the following paragraph:

      “GATA2 has been shown to positively autoregulate its own expression in mice (Nozawa et al, Genes to Cells, 2009; Katsumura et al, Cell Reports 2016), and Gata2b may also act in this way in zebrafish (Dobrzycki et al, Commun Biol, 2020). Therefore, it is interesting to speculate that once gata2b expression has been induced by the Cx-mitoROS-Hif1/2α-Notch1a/b-gata2b pathway, it may also further induce its own expression, which would make the induction of the haematopoietic transcriptional program more robust”

      Is Hif1/2a expression affected in the mutant? Is it expressed normally but then degraded faster due to the absence of mitochondrial ROS or is it less Hif1/2a expressed overall?

      We thank reviewer 3 for this question, which is similar to a point made by reviewer 1. To clarify, we do not expect hif1/2α expression to be affected in this mutant. Mitochondrial ROS has been shown to stabilise Hif1/2α at the protein level, not the mRNA level. Our data, and that of others (Harris et al, Blood, 2013), suggest that in the absence of mitochondrial ROS, prolyl hydroxylases are not inhibited by mitochondrial ROS, and they target Hif1/2α for ubiquitination and subsequent destruction in a Vhl dependent manner (as shown in Fig. 4D).

      To clarify this in the manuscript, we have adjusted the text in three places (including in the abstract) to clarify that Hif1/2α is stabilised at the protein level, as shown below. We believe these changes have made this important point more understandable for the reader:

      1. “… Mitochondrial-derived reactive oxygen species (ROS) have been shown to stabilise the hypoxia-inducible factor 1/2a (Hif1/2a) proteins, allowing them..”
      2. “Recent research has demonstrated that hypoxia and mitochondrial ROS are required for the stabilisation of the transcription factors Hif1/2a at the protein level”
      3. “… as mitochondrial ROS generation may eventually reach the threshold required to sufficiently stabilise the Hif1/2a proteins for downstream”

        Does MO-mediated knockdown of vhl in the wildtype and mutant (page 7and Fig. ) result in more HSPCs, following the increase in gata2b expression from WT baseline? Does that high expression persist, or does it drop?

      This is an important question. We had already clarified this in the case of cx41.8tq/tq, since we showed that the vhl MO results in more HSPCs (as determined by runx1 expression) at 28 hpf (Supp. Fig. 8A) but we have now added data for the same marker at the same timepoint for WT embryos (Supp. Fig. 8B).

      Although the vhl MO results in an increase in runx1 signal in WT embryos, since the majority of WT embryos injected with the control MO already have “high” runx1 WISH signal at 28 hpf, the difference between injected and control MO injected WT embryos is not significant (Supp. Fig. 8B), as can be expected. This is now explained in the manuscript following the relevant data addition.

      4. Description of analyses that authors prefer not to carry out

      Reviewer #1

      One major missing component is experimental data that distinguish the gap junction/plasma membrane- related and the mitochondrial membrane-related functions of Cx41.8. This is critical, as the role of Connexins in the mitochondria remains poorly understood (and Connexin 43 is the best understood one). Thus, it is a big claim by the authors that Cx41.8 primarily acts through the mitochondria and not the gap junctions. Suggested experiment: The authors should generate a fluorophore-tagged Cx41.8 - under a ubiquitous (ubb or actin) or HSPC-/hemogenic endothelium-specific (gata2b) promoter to monitor the protein localization of Cx41.8. Providing data on whether Cx41.8 protein indeed localizes to the mitochondria is important to support their claim.

      We thank the reviewer for this suggestion, which we agree would be a nice experimental approach to try to investigate whether Cx41.8 does indeed localise to the mitochondria in zebrafish endothelial cells.

      However, EGFP fused full-length cx41.8 has previously been generated and was reported to be nonfunctional, and it was suggested that the amount of localised Cx41.8 is also too small to detect using this approach (Watanabe et al, Pigment Cell Melanoma Res, 2012; Usui et al, BBA Advances, 2021). An EGFP tagged CT-truncated Cx41.8 construct has also been generated and shown to rescue the cx41.8t1/t1 mutant (Usui et al, BBA Advances, 2021), but EGFP expression again could not be detected using this construct in zebrafish.

      As such, since efforts to carry out such an approach have failed in previous attempts and since it has already been demonstrated that CX40 (orthologous to cx41.8) localises to the mitochondria of endothelial cells (Guo et al, Am J Physiol Cell Physiol, 2017), we believe that confirmation of Cx41.8 localisation to the mitochondria in vivo in zebrafish endothelial cells will be very difficult and too time-consuming in the context of this manuscript.

      Related to the above point, the authors should test whether the gap junction function of Cx41.8 is intact in the cx41.8tq/tq mutants by assessing calcium waves in the GCamp transgenic line.

      We agree with the reviewer that this would be a very elegant approach in order to analyse whether Cx41.8 channel function is affected in cx41.8tq/tq mutants. However, we feel that this experiment is definitely beyond the scope of this manuscript. Furthermore, carrying out this experiment would require the acquisition of the GCamp line as well as multiple crosses with the cx41.8tq/tq line which, together, we envisage would take at least 9 months before the experiments can be performed, as so this experiment would also be too time consuming for this manuscript. Finally, we believe there is already strong published evidence that the cx41.8tq/tq mutant results in disrupted channel function (Watanabe et al, EMBO Rep, 2006), as already cited in our manuscript. However, since then, we have also now found additional published in vivo evidence that cx41.8tq/tq channel function is reduced, which is now also cited in the new version of the manuscript.

      The authors might also want to consider performing transcriptomic analysis (bulk RNA sequencing) from purified HSCs in wild types and cx41.8 mutants and assess the downstream pathways affected by the loss of this gene.

      Although this is an interesting proposition, we consider this suggestion to be out of the scope of this manuscript, especially since our model involves changes in gene expression upstream of HSPC induction, and, expression of the key genes thought to be affected (notch1a/b and gata2b) can be checked using a much more cost and time efficient approach, by qPCR, which we will do, as discussed above.

      Are the authors sure of their statement on budding HSPCs when the GFP signal pointed by arrows could in majority be hemogenic cells? (which would be in favor of their hypothesis on Cx41.8 being involved rather in hemogenic endothelium/HSPC specification).

      Since cmyb is a marker of HSPCs and not of the haemogenic endothelium as demonstrated in numerous publications (North et al, Nature, 2007; Bertrand et al, Development, 2008; Bertrand et al, Nature, 2010 and others). Hence, we are confident that this transgene is marking nascent HSPCs and not the haemogenic endothelium.

      As mentioned by the authors in the Discussion, the other connexin Cx43 (Ref 36, Jiang et al 2010) is playing a significant role in HSPC specification in the zebrafish and is expressed in zebrafish arterial cells at 24 hpf. Hence there may be some functional redundancy between Cx43 and Cx48.1, as supported by previous work from the authors showing that a null mutant of Cx48.1 does not exhibit any phenotype in HSPC specification (Ref12, Cacialli et al 2021). This may be problematic for the experiments using drug treatments in the present work, because they are not selective for the different connexins (ex: anti-oxydants (NAC), connexin blockers (heptanol, CBX)), thus blurring interpretations on the specific function of Cx48.1 versus the ones exerted by Cx43 (this should be also valid for the vhl MO treatments).

      This comment is strengthened by the fact that the authors do not systematically address, for both WT and mutant embryos (Fig3 E, F; FigS6; FigS8), if expression levels with drugs/H2O2/MO are different for the 2 conditions (if relatively equal, it would indeed indicate that these drugs/conditions possibly act on another connexin, which would help the authors in their analyses and interpretations).

      We thank the reviewer for these comments and we agree with their concerns regarding the possibility of other Connexins being affected by our experiments using drug treatments. However, we do not rule this out in our manuscript and actually discuss it as being a very realistic prospect, as written about in the discussion section.

      Sadly, to the best of our knowledge, no selective Cx41.8 inhibitors have been described for use in zebrafish, otherwise we would of course have used this. Hence, this was the reason for our choice of compounds, many of which we also used in our previous publication (Cacialli et al, Nature Commun, 2021).

      The haemogenic endothelium/HSPC phenotype in cx41.8tq/tq embryos confirms that this connexin plays a role in HSPC specification, whilst we believe disentangling which other connexins are also involved in this process will be interesting to look into in other future studies but is beyond the scope of this one – we believe that together, the data presented in our manuscript, along with the revisions we plan to carry out, will be convincing to demonstrate a role for Cx41.8 in the mechanism we describe.

      The authors may try to rescue the wt phenotype by expressing, in the Cx48.1tq/tq mutants, the mRNA encoding for the wt protein.

      Although we appreciate this suggestion, we do not believe this experiment will add much in terms of value to the conclusions of our manuscript and, as such, we believe this suggestion is surplus to requirements for this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors have successfully shown how disruption in connexin (cx)41.8 results in delayed gata2b expression due to Hif1/2a instability in the absence of mitochondrial ROS. The data is presented well, and the paper is written clearly. The paper is well structured, and the data supports the authors' argument. This study provides a valuable contribution to the field.

      Could the authors clarify the following questions:

      1. In the Results section that describes the delay in gata2b expression (page 4 and Supp. Fig. 4), the authors show that the mutant embryos start expressing more gata2b at 30 - 36hpf after the decreased expression at earlier time points, with no difference at 48hpf. What could explain that recovery? The authors showed that gata2b expression can be rescued by ROS induction in the dose-dependent manner (page 6 and Fig.3 and Supp. Fig. 6). Is this what rescues gata2b expression at 30hpf in the cx41.8 mutants? As Gata2 has been shown to be a positive autoregulator of itself in mice (Nozawa 2009, Katsumura 2016) and might do so in zebrafish (Dobrzycki 2020), so could gata2b recover itself, in a dose-dependent manner, without the Hif-Nothc1 axis once enough of it is expressed?
      2. Does MO-mediated knockdown of vhl in the wildtype and mutant (page 7and Fig. 4) result in more HSPCs, following the increase in gata2b expression from WT baseline? Does that high expression persist, or does it drop?
      3. Is Hif1/2a expression affected in the mutant? Is it expressed normally but then degraded faster due to the absence of mitochondrial ROS or is it less Hif1/2a expressed overall? Are any vascular defects in the mutant embryos?

      Significance

      his study provides a valuable contribution to the field of developmental hematopoiesis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Petzold et al are here addressing the potential function of the connexin Cx48.1, a protein involved in the structure of gap junctions, in the specification of future hematopoietic stem cells and progenitors (HSPCs). This piece of work is complementing their previous results showing the function of this connexin isoform in HSPC expansion in the transient hematopoietic niche in the caudal tissue of the zebrafish embryo. They explore phenotypes triggered by the expression of a mutant form bearing a single amino-acid substitution in the fourth transmembrane domain of the protein. Using whole mount in situ hybridization (WISH) of the two transcription factors Gata2b and Runx1, a novel transgenic fish line that expresses eGFP under the control of the Cx48.1 promoter region, and a series of drug treatments interfering with, or promoting, the formation of reactive oxygen species (ROS) production and oxidative stress, they propose that Cx48.1 is also involved upstream of HSPC amplification, rather in their specification at the level of the hemogenic endothelium constituting the ventral floor of the dorsal aorta. Mechanistically, they hypothesize that this function relies on mitochondria-derived ROS that would destabilize the VHL protein involved in mediating the degradation of Hif1/2a transcription factors, thereby stabilizing the Hif1/2a-Notch1a/b signaling axis involved in specification of the hemogenic endothelium.

      Major comments

      My major comments on the work are on the accuracy of the data in regard to the two main experimental approaches used by the authors and their subsequent analysis/quantification.

      1. the WISH and quantitative analyses. Most of the quantitative analyses in the work are based on chromogenic WISH, which is not sufficiently accurate because leading to highly variable results, in addition to its lack of linearity. WISH is also subjected to important variations, particularly for transcription factors that are expressed at low levels such as Runx1, and to some extent Gata2b also. One obvious example in the paper is the inconsistency of signals that are observed Fig1C (Gata2b, left, wt, 24hpf) and FigS3B (Gata2b, left, wt, 24hpf) in which the signal is barely visible and is comparable to the signal for the cx41.8tq/tq mutant Fig1C, right. In addition, in the timings that are analyzed in FigS3 (Gata2b, 26 and 28hpf) to argue on temporal delay of expression in the cx41.8tq/tq mutant, the Gata2b signal is masked by the strong increase in tissues other than the hemogenic endothelium in the dorsal aorta (including signal in the somites as well as, possibly, increase in background). In this very example, it is legitimate to question the accuracy of the quantification methodology when the signal in the tissue of interest is drowned in the overall signal from surrounding tissues; how can the authors explain the 100% of embryos that have a 'Low' signal in the region of interest (FigS3C, cx41.8tq/tq mutant in comparison to WT)? This point is also valid for the data quantified FigS4 in which the fitting between WISH data and the quantifications appears to be questionable (for all timing points: 30, 32, 36, 48hpf and comparing mutant with WT). Importantly, it appears also that all over the WISH quantifications, the reader cannot appreciate the accuracy of the categories High/Medium/Low, which is not at all developed in the Methods section (paragraph Image processing and WISH phenotypic analyses); hence, it is not possible to evaluate the accuracy/validity of statistics in particular in the experiments in which the quantification into these categories is used for CoCl2 and morpholino analyses to address the contribution of the Hif1/2a-Notch1a/b pathway Fig4 (these experiments generating results that are not as 'black and white' than the other ones in the paper, hence requiring more accuracy; for example, are the differences in the quantification (% of embryos) significant between the WT+vhl MO and Cx41.8tq/tq mutant + vhl MO? Comparing the 2 WISH results for those conditions does not appear to be very convincing).

      My suggestion would be to complement the WISH data and improve the quantitative analyses using another, more accurate approach such as qRT-PCR for example (on dissected trunk regions and, if necessary because of expression in other surrounding tissues (in the case of Gata2b at later time points), after FACS-sorting using a fish line expressing a fluorescent reporter driven by a vascular promoter, ex: the kdrl:mCherry line used in the work). This is particularly important for the expression of the two transcription factors Runx1 and the more upstream Gata2b, the latter being involved in HSPC specification which is taken as a reference. qRT-PCR experiments should be feasible relatively easily and in a reasonable time frame as the technics is not very time consuming and easily accessible.

      Finally, there is a confusion in the quantification regarding the number of HSPCs (see the beginning of the second paragraph of Results 'The HSPC specification defect in cx41.8tq/tq mutants is due to a delay in Gata2b expression') and the % of embryos falling into the 3 categories High/Medium/Low FigS2, cmyb 48hpf. The authors use this argument (based on the WISH cmyb signals) to infer that the deficit in the cx41.8tq/tq mutant is not due to controlling HSPC number (no difference in cmyb between WT and mutant) but rather upstream, at the level of the hemogenic endothelium, which is not a thorough argument at that point.<br /> 2. Fluorescence imaging and associated interpretation/conclusions.

      The fluorescence images (Fig1E; Fig2B,D; Fig3A) are very difficult to analyze; they lack resolution because they appear to be epifluorescence images and not confocal images. When the signal is low, which is in particular the case for the novel Cx41.8:EGFP fish line, Fig2B (which is confirmed with the FACS GFP signal in comparison to the mCherry of the kdrl:mCherry fish line), it is not possible to provide convincing images on the vascular/aortic expression because of the high background of diffusion (the authors state 'likely to be the aortic floor', indeed it is not possible to validate the fact that the expression is truly in potential hemogenic cells). The double positive population in the FACS (Fig2C, right) does not resolve the issue because if indeed cx41.8 is expressed in endothelial cells (as expected from previous studies), the double positive population could equally be endothelial cells from inter-somitic vessels, for example (not to mention the underlying vein which is very close to the aorta in the trunk)). Fig2D, images are too small and, again, the resolution is not good enough to say that double positive cells are on the aortic floor. It is recommended to convince the reader that the authors try to confirm their statements by using confocal microscopy and increase the magnification of the relevant regions of interest.

      There is an inconsistency in the data between Fig1E (40hpf, in vivo imaging using the cmyb:GFP fish line) and FigS2 (48hpf, WISH cmyb); how can we observe 'HSPCs budding from the dorsal aorta' (see legend Fig1, arrowheads) which seems very much decreased in the imaging experiment for the cx41.8tq/tq mutant in comparison to WT, and have no effect on the cmyb signals FigS2B? What are the GFP+ cells that are aligned along the elongated yolk Fig1E and that appeared to be decreased in number in the mutant? Are the authors sure of their statement on budding HSPCs when the GFP signal pointed by arrows could in majority be hemogenic cells? (which would be in favor of their hypothesis on Cx41.8 being involved rather in hemogenic endothelium/HSPC specification).

      Other Major Comments:

      • It would be important to investigate/show, at least with qualitative WISH experiments all along the time-window of HSPC specification as stated by the authors (26-54hpf, see main text third paragraph of Results), that Cx41.8 is detected in arterial endothelial cells (and perhaps enriched in the hemogenic endothelium?), in complement to the work they are referring to on transcriptomic data at 24hpf (Ref18 Gurung et al Sci Rep 2022). Ideally, these WISH data should be resolutive enough to provide clear localization in aortic cells versus cells in the aortic floor to bring significant added value to the work that lacks spatial resolution (ex: fluorescent WISH using confocal microscopy, allowing to superpose signal with cell types (either by double fluorescent WISH (vascular marker + Cx41.8) or superposing fluorescence signals with transmitted light)).
      • As mentioned by the authors in the Discussion, the other connexin Cx43 (Ref 36, Jiang et al 2010) is playing a significant role in HSPC specification in the zebrafish and is expressed in zebrafish arterial cells at 24 hpf. Hence there may be some functional redundancy between Cx43 and Cx48.1, as supported by previous work from the authors showing that a null mutant of Cx48.1 does not exhibit any phenotype in HSPC specification (Ref12, Cacialli et al 2021). This may be problematic for the experiments using drug treatments in the present work, because they are not selective for the different connexins (ex: anti-oxydants (NAC), connexin blockers (heptanol, CBX)), thus blurring interpretations on the specific function of Cx48.1 versus the ones exerted by Cx43 (this should be also valid for the vhl MO treatments). This comment is strengthened by the fact that the authors do not systematically address, for both WT and mutant embryos (Fig3 E, F; FigS6; FigS8), if expression levels with drugs/H2O2/MO are different for the 2 conditions (if relatively equal, it would indeed indicate that these drugs/conditions possibly act on another connexin, which would help the authors in their analyses and interpretations).

      Minor comments

      • The authors should take care of the fact that at 16hpf, it is an overstatement to speak of an aorta when the cord is starting to lumenize at around 18hpf, Jin et al Development 2005 (see Main text referring to Fig3). To make the data more convincing on the ROS production in the ventral side of the cord in wild type embryos (which suggests that future hemogenic cells are already ventralized at that stage), it would be important to obtain confocal images of the region of interest and perform reconstitution of Z-stacks with a sagittal view (rather than longitudinal). It would be nice also to obtain comparable images later on, after lumenization and before initiation of HSPC emergence (before 28hpf).
      • The authors may try to rescue the wt phenotype by expressing, in the Cx48.1tq/tq mutants, the mRNA encoding for the wt protein.
      • It would be more informative and secure, Fig2D, to show images of the double transgenics (Cx48.1:eGFP;kdrl:mCherry) at 28-30 hpf (rather than 48 hpf) which is more narrowed down to the specification of the hemogenic endothelium thus preventing any risk to visualize the fluorescence signals coming from recently born HSPCs rather than signals from cells embedded in the aortic floor.

      Significance

      Petzold et al propose a potentially appealing function of connexin Cx48.1 expressed in the zebrafish in the specification of the vascular aortic subtype of cells that will ultimately lead to the formation of hematopoietic stem cell precursors, ie the hemogenic endothelium. They build the work on a possible translation of the function of connexin Cx40 in mammals that is described to localize to mitochondrial membranes in endothelial cells and promote the production of ROS in mitochondria. They propose a function of mitochondria-derived ROS in stabilizing the Hif1/2-Notch1 pathway that is essential for HSPC precursor specification and that may be extended to developmental hematopoiesis in mammals (the putative ortholog of zebrafish Cx48.1 in mammals (Cx40) is highly expressed in the hemogenic endothelium of mouse and human species (see the Discussion paragraph)).

      The proposed model is potentially of high significance for the field of hematopoiesis and more generally for translation of knowledge to regenerative medicine aimed at producing hematopoietic stem cells endowed with long term regenerative potential. However, the current work remains preliminary, suffering from lack of resolution in the main experimental axes that are undertaken (WISH analyses and their low accuracy quantifications; low resolution of in situ live imaging; apparent weaknesses of methodologies that are difficult to fully appreciate since poorly detailed in the Method section, in particular regarding WISH quantification and, hence, statistical significance). My recommendation is that the authors should put some efforts in completing the work with other, more quantitative, methodologies (ex: qRT-PCR) and improving the quality/resolution of imaging (by providing confocal images to alleviate any ambiguity on what is visualized and strengthen the results); these are technical approaches that are relatively standard in the field and the authors have extensively used qRT-PCR and FACS-sorting in their previously published work. Also, the endogenous expression of Cx48.1 in the hemogenic endothelium, during the time-window of its specification (20-28hpf), should be addressed; this would be essential to complement the imaging performed with the new transgenic line that expresses eGFP under the control of the Cx48.1 promoter and which provides weak fluorescence signals).

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Petzold et al. explores the functions of connexin 41.8 (cx41.8) (mammalian homologue Connexin 40) in hematopoietic stem cell (HSC) formation in the zebrafish dorsal aorta. The authors use a cx41.8 allele that appears to be hypomorphic, as the phenotype is milder than a previous cx41.8 allele that the same group published (Cacialli et al., 2021). cx41.8tq/tq mutants exhibit delayed onset of hemogenic endothelial specification, as marked by gata2b at 24 hpf, but HSPC development proceeds normally from 48 hpf onwards. A new reporter line for cx41.8, Tg(cx41.8:GFP), was generated and is expressed in the floor of the dorsal aorta, consistent with the location of hemogenic endothelial cells. Lower ROS production in the whole cell and in the mitochondria was reported in the cx41.8tq/tq mutants, and treatment with ROS enhancers, H2O2 and menadione, appeared to rescue the mutant phenotype of reduced HSPCs at 28 hpf. Finally, the authors tested a link between cx41.8 and Hif1α by pharmaceutically (DMOG/CoCl2) or genetically (vhl morpholino) inhibiting Hif inhibitors, and observed a rescue of HSPC formation in cx41.8 mutants.

      Major comments

      • I think it would be important for the authors to address the mechanisms of why cx41.8tq/tq and the other cx41.8-/- (leot1/t1) mutant phenotypes are different, with the latter allele showing more severe phenotypes of increased HSPC apoptosis and reduced HSPCs during later development. The authors speculate the cx41.8tq/tq allele encodes a missense mutation in one of the channel domains, and as such, might be a hypomorph. The authors cited the original paper by Watanabe et al. (2006); however, this paper actually noted that the cx41.8tq/tq allele is likely to be a dominant negative - and as such, should have exhibited a stronger phenotype than the leot1/t1 mutant allele. From the paper: "leotw28 and leotq270 heterozygotes have phenotypes different from that of WT; thus, they represent dominant-negative alleles." Importantly, no data are shown to provide evidence that the allele is a hypomorph - at minimum, qPCR data should be provided to show whether there is NMD of the mRNA in cx41.8tq/tq mutants.
      • One major missing component is experimental data that distinguish the gap junction/plasma membrane- related and the mitochondrial membrane-related functions of Cx41.8. This is critical, as the role of Connexins in the mitochondria remains poorly understood (and Connexin 43 is the best understood one). Thus, it is a big claim by the authors that Cx41.8 primarily acts through the mitochondria and not the gap junctions. Suggested experiment: The authors should generate a fluorophore-tagged Cx41.8 - under a ubiquitous (ubb or actin) or HSPC-/hemogenic endothelium-specific (gata2b) promoter to monitor the protein localization of Cx41.8. Providing data on whether Cx41.8 protein indeed localizes to the mitochondria is important to support their claim.
      • Related to the above point, the authors should test whether the gap junction function of Cx41.8 is intact in the cx41.8tq/tq mutants by assessing calcium waves in the GCamp transgenic line.
      • The quantification data in this manuscript are not satisfactory. The authors only provide graphs that show embryos with "low", "medium" and "high" numbers of HSPCs, which is incredibly subjective. Considering that the authors already have the cx41.8tq/tq in the Tg(myb:GFP) background (Figure 1E), they could have quantified the precise numbers of Tg(myb:GFP)-positive cells at different timepoints and with the different pharmaceutical rescue experiments. Ideally, this should be combined with other HSPC markers such as Tg(cd41:GFP) or Tg(runx1:GFP) - although this could be limited by the authors' access to the lines or time it takes to cross the mutants to the transgenes.
      • The link between cx41.8 and Hif1α is tenuous. The authors should perform in situ hybridization for the hif1 genes and their downstream effector notch1 which is known to be important for the HSPC specification (Gerri et al., 2018). The authors might also want to consider performing transcriptomic analysis (bulk RNA sequencing) from purified HSCs in wild types and cx41.8 mutants and assess the downstream pathways affected by the loss of this gene.

      Significance

      Overall, this study presents another piece of evidence that Connexin 41.8 regulates HSC formation. It provides a potential link between Connexin 41.8, mitochondrial ROS regulation and Hif/hypoxia-sensitive pathways in promoting endothelial-to-hematopoietic transition. The role of the mitochondrial ROS in particular is quite interesting and might provide a new angle into the role of connexins in regulating hemato-vascular development; however, the authors would need to strengthen the link between Cx41.8 and mitochondrial respiration.

      It is important to note that the quantitative data in this manuscript need to be strengthened and refined to strengthen the conclusions. The study is not very deeply mechanistic and appears to be more at an observational/correlational level. The manuscript might be of interest for people in the hematopoietic field but does not shed much more insight into the cellular and molecular mechanisms that govern HSC formation, particularly in light of the paper on Cx41.8 role by the same group (Cacialli et al., 2021).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2.

      In the manuscript "Regulation of adaptive growth decisions via phosphorylation of the TRAPPII complex in Arabidopsis" the authors investigate the TRAPPII interactome carried out by an already published IP-MS screen. They study previously identified shaggy-like kinases SK as TRAPII interactors and the phosphorylation sites by Y2H (interactions of wild type, deletion mutants and phosphomutants) and kinase assays (in vitro) and pharmacological inhibition in the subunit AtTRS120. The authors provide a deeper phenotypical analysis of trapii null mutant lines and classification as "decision mutants", based on "limited budget" and "conflict of interest" experiments (previously described) as a starting point of investigations of TGN function in comparison with hormone mutants. Cell elongation is used as a response phenotype. Authors focus on mainly TRS120 and phosphorylation by SK and partly on another TRAPP component, CLUB. Authors study the assays with differing kinases, e.g. Y2H with BIN2, phosphorylation with SK11.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      A major issue is that new claims and conclusions are not supported by the new data provided here. The title says "in Arabidopsis", but only components are from Arabidopsis. Interactions, instead, are studied in this manuscript by Y2H in yeast, and phosphorylation in vitro. The abstract is very misleading and does not distinguish which aspects are studied in vitro and which in vivo. The Abstract does not mention that this study is based on previously identified interactome data.

      o The figure legends are often not sufficiently detailed to understand what exactly is represented.

      Therefore, it is not possible to judge in every case whether experiments are supported by data. E.g.

      Fig. 1: A, describe which data were used and which control for IP-MS had been taken into account. B, this is a plot, please describe what is represented. Explain better why Shaggy kinases were chosen. C, explain the principle and what is represented. How is this experiment controlled and how is it ensured that negative results are not caused by absent proteins.

      Fig. 2: Indicate the phosphorylation sites in the other subfigures. Fig. 2E: How was it generated, explain what is seen. Since this is the only figure illustrating the protein complex of TRAPP, this figure should be more thoroughly prepared and labeled. I recommend a better visualized protein complex. As before, Fig. 2F remains unclear.

      Fig. 3: Please add a figure illustrating the mutations. 3C: what has been diluted? Other examples are found in other figures.

      Fig. 4: Shouldn't the wild type be compared with all the mutants? Then statistics have to be conducted accordingly. Better explain G and H. If there are quotients, explain of what exactly.

      Fig. 5. Same as before. How do I see that there is a phenotype? There is no comparison with wild type. It is also unclear to which values the statistics refer to.

      Fig. 6: Please guide the reader through the figure and experiment.

      Fig. 8: I miss the connection with other shaggy-like kinases. This summary could be more complete. What about phosphorylation sites?

      o Line 133-134: "we focus on the TRAPPII complex as a starting point as it is required for all aspects of TGN function, including the sorting of proteins such as PINs to distinct membrane domains" I did not find an obvious connection to the PIN transporters as well as clear data to TGN functions. This sentence was for me misleading about the context of this manuscript.

      o Figure 1C: A supporting Western Blot control is needed, to fully validate the missing interaction of BIN2 with the truncated variants of TRS120 and CLUB. Additionally, swapping the constructs from DB to AD and vice versa will provide a better set-up of the interaction screen. This should be easily done in a few weeks.

      o Line 431-432: "This presents intriguing implications regarding the potential role of the AtSK-TRAPPII module in meeting the unique demands of endomembrane traffic in plants." Why do the authors come to this assumption? Further discussion is needed here.

      o Figure 2F: What serves as positive controls? What is the purpose of showing every panel between each TRS120-T2 variant with CLUB-C2, CLUB-C3, TRS120-T1 and TRS120-T3 and not only interactions between BIN2 and the TRS120-T2 variants? Why are there six negative controls as it is every time the same control? - Please request additional experiments only if they are essential for the conclusions. Alternatively, ask the authors to qualify their claims as preliminary or speculative, or to remove them altogether.

      Clearly, title, abstract and statements have to be formulated differently. The discussion should contain a limitations paragraph in which the authors detail that conclusions are based on in vitro, yeast and plant IP-MS screening data, and they should describe approaches how the study can be continued in the future. Which alternative explanations are possible. Are SKs and TRAPP expressed and present in the same locations? - If you have constructive further reaching suggestions that could significantly improve the study but would open new lines of investigations, please label them as "OPTIONAL". - Demonstrating interactions and phosphorylation by other approaches in vivo - demonstrating effects of TRAPP phosphomutants and lack of kinases in vivo - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated time investment for substantial experiments. - Are the data and the methods presented in such a way that they can be reproduced?

      o Figure S9: Is it not a loss of chlorophyll instead of GFP? Does not look like a fluorescent image.

      o Lacking information of pH of in vitro kinase assay solution with Mass-spectrometry.

      o What is the purpose of transferring 10 days old seedlings to fresh plates for scanning? Needs additional information for understanding, at the moment it sounds more like unnecessary extra stress for the seedlings.

      o Why are seedlings grown under constant light? - Are the experiments adequately replicated and statistical analysis adequate?

      o Figure 5: It will be good to use ANOVA for statistics here. I personally doubt the high significance of some parameters, e.g. for club-2 cell width and cell surface area between dark and darkW due to the high standard errors. Rechecking with the original values is necessary. Why is there no comparison between wild-type and the two mutants?

      o Figure 7A - C, statistic is probably not correct. For example: in A statistical differences with P<0.001 between wild-type (~100 %) and TRS120SαβγD (~80 %), in C statistical difference of only P<0.05 between wild-type type (90 %) and TRS120SαβγD (60 %)

      o No information on IP-MS replicate numbers mentioned.

      o Also see comments above to figures

      Minor comments:

      • Specific experimental issues that are easily addressable.
        • Specific experimental issues that are easily addressable.

      o Figure 4, S7, S8, S11 and S12: It will be helpful to support the data with images of the seedlings. - Are the text and figures clear and accurate? Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      o The introduction is quite lengthy with unnecessary information, e.g. about PIN transporters, but useful information about shaggy-like kinases and connection to brassinosteroid signaling is lacking.

      o Figure 1C: In the figure legend is no explanation of abbreviation "Co"; no explanation of BET3, TRS31, Tca17 and TRIPP; no indication that spots come from different plates (just visible by different brightness of the squares). Why are there eleven negative controls as it is every time the same control?

      o Figure 1C is specifically for BIN2, but BIN2 was not identified in the IP-MS screen represented in Figure 1B. Why does 1C not focus on SK11/12/32, identified in 1B?

      o Figure 1C shows several truncated variants of TRS120 and CLUB, a schematic overview as represented in Figure 2A will be helpful for the understanding of 1C. Order of variants should be the same (now: in 1C first TRS than CLUB in 2A first CLUB than TRS).

      o Figure 1C: Interaction of TRS120 full-length with BIN2 is missing in this figure but is presented in Figure 2F.

      o Result of Figure 2F is described after Figure 3. Better arrangement of Figures or text is needed here.

      o Figure 3A: Why was AtSK11 and not BIN2 used for the main figure? Better change Figure 3A with Figure S4 to keep the focus on BIN2. No explanation of the result in the text.

      o Figure 3A: In the figure legend is no explanation of abbreviation CBB. What are the non-phosporylated variants? Where are they shown? Description sounds that only TRS120-T2-SαβγA versus TRS120-T2 WT was tested by t-test is this correct? And if yes, why?

      o No need for Figure 3B, information was already given in Figure 2A + B.

      o Figure 3C: Why BIL2 for Clade II and not BIN2?

      o Figure 4: Why are A-E not directly compared to wild-type but trs120-4 as seen in 4F? What is the purpose of using different types of diagram?

      o Figure 4H: Why are phyAphyBcry1cry2 and pyrpyl1pyl2pyl4 depicted? No description in the text.

      o Figure 6: Confusing order of given information in the figure legend. Sentence one belongs to D and H only, second sentence describes whole figure. o Figure 6D + H, color difference between black and blue is hard to see, better change one into e.g. red.

      o Figure 7D - F wrong indication of D to F, named in the description as A) - C). Why is E different to D in F (D and F: 0-1 is attenuated, >1 enhanced; E the other way around).

      o Figure S9A: Indication of protein size on the Coomassie gel is missing and the respective position of 160 kDa is not visible on the gel.

      o Figure S12D: No explanation of the color code in the figure legend.

      o Consistent labelling and layout of all Figures and Supplemental Figures will be helpful. E.g., Figure 3A and S4; in S8A-E + S11A-C bars of different conditions have the same color. Most of the figure legends are quite shortly described and lack information about what kind of data is presented.

      o YFP parameters are described in material and methods, but no YFP construct appeared in the manuscript to my knowledge. - Are prior studies referenced appropriately? - Lines 243-245: Text is nearly identical to Kalbfuß et al., 2022. - Lines 246-254: Text is identical to Kalbfuß et al., 2022. - Are the text and figures clear and accurate?

      Please see the above and below comments to figures and figure legends. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Overall, the manuscript may have very interesting data and new findings. It is very interesting that the authors study the regulation of a protein complex that may mediate environment responses and intracellular Golgi functions. However, it is very difficult to follow and understand the ideas and concept of the manuscript. This manuscript is based on a previously published interactome study by Rybek et al. 2014, Steiner et al. 2016, Kalde et al. 209. Moreover, a physiological approach is published in Kalbfuß et al. 2022. The outcomes and conclusions from these previously published manuscripts and the emanating open questions addressed here should be clearly described in the introduction. This is currently not the case. Moreover, many experimental approaches and results (e.g. figures, figure legends) are not properly described. Overall, it is therefore not possible to understand the manuscript without studying in depth all other manuscripts. Before the manuscript can be more thoroughly judged, it is necessary that the authors rewrite the manuscript, reorganize it and explain better their ideas and approaches. It is also necessary to explain and define unusual terms such as "decision mutants", "limited budget" and "conflict of interest" experiments, which are crucial for the understanding. The importance of the TRAPPII complex should be illustrated using specific physiological examples and the context in which this complex is studied here has to be explained. Before this is not corrected, the following assessment will remain rather incomplete. Another complication is that two subunits of TRAPP were studied and different types of SKs, however, authors did not systematically analyze all interactions. At least it should be thoroughly described, and a flow chart would be helpful as supplemental figure clearly describe which types of proteins were tested in the different assays. The introduction is not well written. It is very lengthy, however the important messages from previous publications are left out. Thus the open question is not understandable (see above). Instead, the results parts start with introduction again. Explanations are also lacking in every result paragraph on the approach and expected data. The Discussion is also not very well written. It is much focused on physiological and molecular actions and consequences in plants. However, there should be at first a technical discussion on the relevance since in the study is based on in vitro and heterologous expression data, and the physiological analysis was only conducted with knockouts but not phosphomutants. Therefore, the link between the protein interaction and physiological functions needs to be worked out.

      Referees cross-commenting

      My colleague and I have read thoroughly the manuscript and found a number of issues which we indicated in our review. These points can be fixed by the authors, if they formulate more carefully and remove the overstatements. They should also work on reorganizing and including more explanations.

      Significance

      Provide contextual information to readers (editors and researchers) about the novelty of the study, its value for the field and the communities that might be interested.

      The following aspects are important:

      One new aspect of this story is the validation of interaction of TRAPII subunits as substrate for AtSKs and their action as phosphorylation agents shown in vitro. The other new aspect is the phenotypical characterization of trapii mutants under stress-conditions (grown in darkness) and additive stress (with additional drought stress). The potential interaction with brassinosteroid signaling via BIN2 is intriguing.

      • General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed? A strength is that a new interaction is further studied. A weakness is that the studies are primarily conducted in yeast and in vitro, leaving open how relevant this process is in plants. A strength is further studies and phenotypic analysis of trapii mutant effects. A weakness is that this mutant analysis is disconnected from the action of SKs.

      Further, the writing should be improved and more clear (see comments above). - Advance: compare the study to the closest related results in the literature or highlight results reported for the first time to your knowledge; does the study extend the knowledge in the field and in which way? Describe the nature of the advance and the resulting insights (for example: conceptual, technical, clinical, mechanistic, functional,...). The introduction gives the impression of a stronger investigation of TGN function, which is from my point of view not the case and should be reformulated and/or put into a deeper context with known literature. The authors switch several times between the different TRAPPII subunits and shaggy-like kinases in the main figures which made it for me very confusing. I believe that rearranging some data/figures will improve the understanding of the story. The text is also lacking explanations of many abbreviations and gene names which caused more difficulties in understanding the story and slowed down the reviewing process. From my point of view it seems to be necessary to read the often cited Kalbfuß et al., 2022 publication before, as many important technical aspects and scientific background, e.g. the reason to use specific control mutants, are well explained there, but are lacking in this manuscript and needs improvement. - Audience: describe the type of audience ("specialized", "broad", "basic research", "translational/clinical", etc...) that will be interested or influenced by this research; how will this research be used by others; will it be of interest beyond the specific field? Based on the cited literature in this manuscript the direction of the story with "limited budget" and "conflict of interest" situations to classify mutants methodically seems to be a recently emerged approach. Apart from that this manuscript provides only new impact on TRAPII and AtSKs specific knowledge based on well-established and frequently used techniques that address the problem in vitro and in a heterologous system. Therefore, this story will be interesting for researchers specialized in stress responses, TGN and growth defects as well as important for basic research. Limitations in interpretation are present. - Please define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Our field of research is related to nutrition-regulated processes especially in Arabidopsis with a strong methodological background in interactomics, physiological, morphological and molecular responses and biochemical approaches and microscopy.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This manuscript adopted the concept that plants have cognitive ability and proposed the hypothesis that the trans-Golgi network plays a role in the decision process of cells. It investigates how this organelle function in order to reach the right decision when exposed to the combined drought stress and dark or to germination under osmotic stress. The study tests the hypothesis focusing on the on the TRAPPII complex. The authors demonstrated that defects (mutations) on TRAPPII complex cause wrong growth decisions, particularly when seedling are exposed to decisions with trade-offs.

      Major comments

      The experiments made for this study are enough to support the conclusions, and they were performed with adequate procedures. I just make the comment that follows.

      Lines 468-472: The authors propose that "signal integration and decision-making occur at the AtSK-TRAPPII interface". It should be considered whether the decision is made in a specific step and location or if all the cascade of responses is the decision process. It is proposed that TRAPPII makes the decision and the Rab GTPase cascades (or the downstream signals) implement the decision. The authors demonstrated that a defect on TRAPPII causes wrong-decisions, but what would happen if TRAPPII were normal but something downstream was defective and could would proceed the regular process? Would it also lead to wrong growth decisions? I am afraid that any defect may cause wrong decisions because the decision is the full metabolic process and not a single step in the route.

      Minor comments

      Lines 259-260: To make it easier to follow the reasoning, the reader should be informed what were the expected responses in case of "primary defects in cytokinesis".

      Line 413: Correct Kim et al (2023).

      Significance

      This study offers an important contribution to the discussion on how plants make decisions. The signals and the cascade of stimuli flows through an intricate network. This study demonstrates that on of the streams of information used for growth decisions passes through the Golgi complex.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Universally present across the eukaryotic world, TRAPPII is a hetero-oligomeric complex that plays a key role in the regulation of the TGN that has often been classified as a multisubunit tethering complex, although conclusive evidence for the tethering role is still lacking. In contrast, it is well established that the complex acts as a GEF for RAB11, primarily by studies carried out with model fungi such as Saccharomyces cerevisiae and Aspergillus nidulans. Atomic structures have revealed that a few amino acids in one of the subunits of the complex suffice to "kick off" GDP from the active site of the substrate RAB. While some of the subunits are necessary to place the RAB nucleotide binding pocket at the right distance from the key TRAPPII residues on the target membrane, it seems unlikely that the sole function played by this Md complex is catalysing the exchange of GDP by GTP in the target GTPase. In this particular regard, our understanding of other potential physiological roles of the complex is utterly incomplete

      This very well written manuscript explores the regulation of TRAPPII by phosphorylation, more precisely, the role of phospho-sites in the regulation of TRS120/TRAPPC9. Using co-immunoprecipitation strategies coupled to mass spectrometry, the authors identified a member of the saggy/GSK family of kinases. Given that the interaction of protein kinases with their substrates is supposed to be transient, this is technically sound result that testifies to the impeccable methodology used by authors. They further exploit MS methodology and two hybrid analysis to identify region of TRAPPC9 interacting with the kinase, as well as the phosphorylated residues. The authors close the circle by establishing that substitutions of these residues result in modified responses to abiotic stresses. Thus, a significant merit of this work is opening up the can of regulation by phosphorylation of TRAPP functions

      Experiments/challenges for the future are determining the downstream components that govern these physiological changes in response to phosphorylation of TRAPPII, and expanding these phosphorylation studies to TRAPPIII. This latter complex is involved both in exocytosis and in autophagy, and it seems plausible that alternation between these two different fates is governed by post-translational modifications of the type studied here.

      My suggestions to the authors: there has been a burst of atomic structures of TRAPPs recently, with three papers authored by the Fromme, Munro and Sui labs. It mighty worth comparing the longer Trs120 in these structures with the prediction of the shorter protein from Arabidopsis. A very minor point is that possibly the most thorough report on the localisation of TRAPPII to the TGN is that co-signed by Mario Pinar and myself in the Journal of cell science

      Referees cross-commenting

      I have nothing to add to my review, which was made from the point of view of TRAPP researcher. if my colleagues understand that the manuscript is hyperbolic in places, over statements should be removed

      Significance

      TRAPPII is regulated by TRAPPC9 phosphorylation in cruciferae. Convincing evidence. Impeccable presentation and writing.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Review Commons Refereed Preprint #RC-2023-02149

      Dear Reviewers #1 and #2,

      We extend our deepest gratitude for your dedication to reviewing our manuscript during such a busy period. We have diligently addressed the insightful feedback provided in our revisions. The variable quality of human fetal tissues, due to fixation and extended preservation times, is acknowledged as a limitation that may affect the quality of our immunostaining results. Despite this, we maintain that the findings from these experiments are crucial for human applications. The extrapolation of the results from mice experiments to human biology is a critical step in propelling research forward. We are confident that our paper, with its acknowledged limitations, still offers valuable contributions to our understanding in this domain.

      Please find the primary amendments of our revision detailed below for your review.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary Yamaguchi et al. performed a comprehensive characterisation of lymphatic vessel development in human embryos, spanning stages C8 to GW9. Through the utilisation of immunohistochemistry targeting proteins expressed in the lymphatic endothelium and blood endothelium, the authors have discerned the presence of lymphatic endothelial cells within the cardinal vein and in extraveinal locations. By systematically analysing the progression of embryonic stages, the authors identified the emergence of lymph sacs. Furthermore, they confirmed the presence of lymphatics in various organs, such as the heart, kidney, lung, and mesentery. However, lymphatics were not detected in the central nervous system during the embryonic stages. At the molecular level, lymphatic endothelial cells express similar factors as in mice, including Prox1, Vegfr3, Lyve1, and PDPN, although the timing and combination of these factors may vary depending on the tissue. This study significantly contributes to our knowledge of lymphatic development in humans.

      Major comments Human embryo samples are exceptionally valuable and ethically sensitive, making their maximum utilisation crucial. While the authors conducted a thorough anatomical and molecular analysis, it raises questions about whether more insights can be gleaned.

      Specifically, the authors should clarify whether data from embryos collected at CS8-CS10 were processed, and what was the status of venous and lymphatic development?

      Response: After a careful review of the clinical data for the specimen previously classified as CS8, we found a record indicating the initial detection of a heartbeat in the preceding week, an observation not made earlier. When correlating the last menstrual period with the morphological features, such as the open neural tube, it suggests that the specimen may actually be at CS 9-10, rather than CS8. We have revised the details in our records to reflect this more accurate staging in Table 1. We have included sections of this particular specimen for Figures for reviewer 1. Despite exhaustive sectioning until the sample was depleted, the heart structure was not located. The developmental stage of the specimen seems comparable to that of a mouse embryo at approximately embryonic day 7.5, evidenced by what appears to be a caudal neuropore. In addition, we observed surrounding blood vessels expressing PECAM, which contained nucleated red blood cells, but these did not exhibit Prox1 expression.

      Figure for reviewer 1. Prox1 Expression Pattern in a CS9-10 Human Embryo.

      Cross-section of a CS9-10 human embryo. Immunostaining for PECAM and Prox1.

      The authors commented that CS11 lymphatic vessels were not identified in the vein. Was there any indication of LECs outside the vein? Could the authors include images of this stage?

      Response: The CS11 embryo is depicted in Supplemental Figure 2A-C’. In this section, identification of one side of the precardinal vein was possible. Furthermore, formation of the pharyngeal arch was observed. Prox1 expression was absent in the precardinal vein at this stage.

      For embryos at CS12, it would be insightful to know the proportion of LECs versus VECs within the vein, the quantity of LECs outside the veins, and whether there was section-dependent variability in these observations. Response:

      For a single section, the numerical data for the right and left anterior cardinal veins were averaged. This process was repeated and the results were then averaged across two sections.

      1. The proportion of LECs to VECs within the vein. On average, there were 18.25 nuclei per cross-section of the CV; of these, 4.5 were Prox1-/PECAM+ blood endothelial cells (BECs), and 13.75 were Prox1+/PECAM+ LECs. Therefore, BECs constituted 24.7%, and LECs constituted 75.3%.

      The number of LECs outside the veins.

      There were an average of 9.75 Prox1+/PECAM+ cells located externally to the CV."

      This point is described in Figure 1 legends as follows.

      On average, there were 18.25 nuclei per cross-section of the CV; of these, 4.5 were Prox1-/PECAM+ blood endothelial cells (BECs), and 13.75 were Prox1+/PECAM+ LECs. Therefore, BECs constituted 24.7%, and LECs constituted 75.3%. There were an average of 9.75 Prox1+/PECAM+ cells located externally to the CV. (Page 11, lines 486-490)

      It would be helpful if Table 1, "Information of human embryos and fetuses", could be complemented with a summary of the main findings at each stage, including which markers LECs expressed and their distribution.

      To strengthen the assertion that this study provides unique insights compared to those of mice, a schematic summarizing the similarities and differences between mouse and human observations should be included. Response:

      We have enriched the information presented in Table 1 and introduced Figure 5 as a new comprehensive illustration. Figure 5 provides a comparative analysis of lymphatic vessel development between mice and humans, with a particular emphasis on the early stages of development, meticulously summarizing the alterations in lymphatic marker expression at specific stages.

      The authors mentioned differences in lymphatic markers at various regions of the embryo and different developmental stages. It is essential to clarify whether all regions express the same markers at the latest developmental stage. Response:

      We have added immunostaining for Podoplanin and LYVE1 at GW9 as Supplemental Figure 4X-Y''. This demonstrates the expression of Podoplanin and LYVE1 in lymphatic vessels of the lung, heart, kidney, mesentery, intestinal wall, and lower jaw. This information regarding the expression of LYVE1 and PDPN has also been incorporated into the main body of the text under the section of ‘The Development of Lymphatic Vessels Varies Among Organs’.

      A discussion of the limitations of analysing embryos from abnormal pregnancies is necessary. In addition to the determined lack of chromosomal abnormalities, it is crucial to consider phenotypical and morphological integrity. The authors should address the possibility of developmental defects and mutations causing abnormalities in the lymphatic vessels.

      Response:

      In the "Tissue Collection and Ethical Considerations" section of the Materials and Methods, we have addressed the possibility that developmental defects and mutations may cause abnormalities in the lymphatic vessels.

      This is depicted as follows:

      Detailed information regarding each sample is presented in Table 1. The sex of each sample was not determined, with the exception of one case of miscarriage. In this particular case, chromosomal analysis verified the absence of any karyotypic abnormalities. There were no malformations observed in any of the embryos or fetuses. Nevertheless, for the remaining embryos, there is a possibility that developmental defects or mutations could lead to abnormalities in the lymphatic vessels. (Page 7, lines340-346)

      Minor comments In the abstract, the authors refer to lymphatic malformations as a specific type of lymphatic disease. We recommend acknowledging the broader implications of this study beyond such specific cases. 

      Response:

      We have modified the concluding paragraph of the Abstract to reflect a more expansive and encompassing narrative as follows.

      Our research clarifies the early development of human lymphatic vessels, contributing to a better understanding of the evolution and phylogenetic relationships of lymphatic systems, and enriching our knowledge of the role of lymphatics in various human diseases. (Page2, lines 58-60)

      The term "lymph-related disease" should be clarified for better understanding. Response:

      To make it clearer, we have modified the last paragraph of the Introduction that includes 'lymph-related disease' as follows.

      Our research offers essential insights into the evolution and phylogeny of lymphatic vessels, and may also illuminate the pathogenesis of lymphatic-related diseases, which include lymphedema, obesity, cardiovascular disorders, Crohn's disease, and congenital lymphatic disease, such as lymphatic malformation. (Page 3, lines127-131)

      Figure 3S shows kidney samples, not the myocardium or endocardium, as indicated. Response:

      No, it is correct. Figure 3P-S represents the heart, which is surrounded by the lungs on both sides. Figure 3S depicts the endocardium, indicating that lymphatic vessels are not present within the endocardial layer.

      Reviewer #1 (Significance (Required)):

      This study largely reaffirms the existing knowledge from mouse models and previous human data. Given the absence of a cure for lymphatic diseases, gaining a deeper understanding of how lymphatic vessels develop in humans could serve as a crucial stepping stone in this field of research.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This study by Yamaguchi et al., explores the progression of lymphatic vessel growth in different stages of human embryos. They also try to identify the origin of the lymphatic vessels in different organs. The study first shows that lymphatic endothelial cells (LECs) first show up in the anterior cardinal veins (ACVs) of CS12 in human embryos, which is similar to what is known to occur in mouse embryos. They also checked whether the PROX1+ LECs of the heart are derived from Flk1+/Isl+/PECAM- cells. However, Flk1+/Isl+/PECAM- cells do not co-express PROX1. These results suggest that in human embryos LECs originate from the ACVs. The authors then identify that lympho-venous valves formed between lymph sacs and the cardinal veins at around Carnegie Stage (CS)18. The valves have showed obvious bicuspid shape at Gestational week (GW)9. Finally, the authors demonstrate that the development of lymphatic vessels happens at different time points in various organs. At CS16, lymphatic vessels and LECs can be detected in the lower jaw, heart and the lungs; mesenteric and intestinal lymphatic vessels can be detected between CS17 and CS18; kidney lymphatic vessels can be found at CS23; At GW9, the lymphatic vessels are observed around the aorta, which may combine to form the future thoracic duct. Together, this informative study sheds light on the progression of lymphatic vasculatures during embryonic stage in humans.

      This study has many strengths, in addition to some areas that if addressed, would further increase the impact of the findings. These include:

      1. Since immunostaining is the major method that the authors have used for their work, they could use positive and negative controls (secondary antibody only or IgG control) for different antibodies. The authors can also show some Isl1 and Flk1 staining in GW9 fetus or adult tissue, like PROX1 or LYVE1 in Supplemental figure 1.

      Response:

      We have introduced new Supplemental Figures 1I-N. Included are negative controls for fluorescent staining with only the secondary antibody (Supplemental Figure I-I’’’’) and for DAB staining with only the secondary antibody (Supplemental Figure J-L). Furthermore, we have added images showing Flk1 staining within lymph sacs (Supplemental Figure 1M) and Isl1 staining (Supplemental Figure 1N). Flk1 expression was confirmed in the lymph sacs; however, Isl1 expression was not observed.

      The description regarding the negative controls is as follows.

      Additionally, the specificity of the staining was confirmed with controls using only the secondary antibodies (Supplemental Figure 1I-L). (Page 3, lines146-147)

      The description regarding Flk1 and Isl1 in the lymph sac is as follows.

      Additionally, at GW9, Flk1 expression was detected in the cervical lymph sac, but Isl1 expression was not (Supplemental Figure 1M and N). (Page4, lines186-187)

      Figures 1 F-H, S' and S", U', U", and U'" are hard to appreciate. Can the authors offer higher quality images or show some confocal images?

      Response:

      In response to the reviewer's comments, we conducted several trials to improve image quality. However, due to fixation issues, we were unable to enhance the quality beyond the original for the CS12 specimen. Therefore, all images except those of VEGFR3 have been left unchanged. It is possible that the quality appeared reduced in the initial submission due to compression, making them difficult to view. We will resubmit without reducing the image quality as much as possible and ask for your understanding in this matter. Additionally, the CS12 specimen was very small, and there was a limited number of sections available, making further attempts challenging. This is also a limitation of research using human embryos. Regarding Figure 1R-U’’’, we have revised and replaced the images, although the quality has not significantly changed. We believe this may also be due to the compression of the image quality at the time of submission. There is no change in the conclusions drawn.

      According to the author's previous publications (ref 17 and 30) and literature (ref 31), Flk+/Isl+/PECAM- cells differentiate into LECs. However, in this work they did not observe any PROX1+Isl1+ cells at CS13 and CS14. I am curious to know if they found any PROX1+Isl1+ cells at later time points such as GW9.

      Response:

      Isl1 is posited to be an early transcription factor that directs the differentiation of undifferentiated mesodermal cells towards a cardiac lineage. Our prior research utilizing tamoxifen-inducible mice indicated that a cohort of cells expressing Isl1 at a defined interval (E6.5 to E9.5 in mice) contributes to the formation of lymphatic structures in the head, neck, mediastinum, and heart before subsequently losing this expression(Maruyama et al., eLife, 2022). However, in human studies, it is not possible to trace the lineage and differentiation trajectories of Isl1+ cells. Consequently, we anticipated finding LECs that initially express Isl1 in the embryonic stage, with this expression diminishing as development ensues. Nevertheless, such cell groups were not observed in human embryos. In mice, our search for cells concurrently expressing Isl1, Prox1, Flk1, or PECAM from E9.0 to E11.5 (referenced in Maruyama et al., eLife, 2022, Supplemental Figure 3) also yielded no such populations. This evidence suggests that Isl1 protein expression in the cardiac pharyngeal mesoderm likely ceases during the differentiation into lymphatic endothelium. Given the hypothesis that Isl1+/Prox1+ LECs might exist at an earlier developmental stage, we examined specimens from CS16, 17, and 18 for the presence of such LECs but to no avail. This investigation has been documented as Supplemental Figure 3Q-S for the CS16 sample. With the GW9 sample, due to its substantial size, we initially conducted a DAB staining search for lumen structures that might express Isl1. However, no such structures were identified. Moreover, despite conducting triple immunostaining for PECAM, Isl1, and Prox1, we were unable to locate any LECs or lymphatic vessels expressing Isl1.

      The description regarding Isl1 and Prox1 expression for CS16 and GW9 is as follows:

      At CS16, cells co-expressing Prox1 and Isl1 were not observed in the lower jaw or the cardiac outflow tract regions (Supplemental Figure 3Q-S'). Additionally, at GW9, Flk1 expression was detected in the cervical lymph sac, but Isl1 expression was not (Supplemental Figure 1M and N). (Page 4, lines184-187)

      For the GW9 stage, we have provided images of lymphatic vessels in the lung and heart stained with PECAM, Isl1, and Prox1 as a Figure for the reviewer's consideration.

      Figure for reviewer 2. Isl1 is not expressed in GW9 lymphatic vessels.

      Fluorescent immunostaining of PECAM, Prox1, and VEGFR3 was conducted at GW 9 fetuses. Scale bars 100μm.

      Figure 3 N and O show comparable VEGFR3+PROX1+ cell numbers in different time points, however it shows increased VEGFR3+PROX1+ vessel numbers. If so, do LECs become more elongated and form the vessel-like structures?

      Response:

      In our previous findings (Maruyama et al., Dev bio, 2019, Maruyama et al., iScience, 2021), we documented that surrounding the heart, LECs progressively interconnect to form a reticular network, which is subsequently remodeled into more substantial lumen-bearing vessels. This sequence appears to be conserved in humans, with LECs initially presenting as solitary entities that gradually interlace into a network. Presumably, a portion of this network is then streamlined, giving rise to increasingly luminal structures. Therefore, while the count of LECs remains constant, there is an augmentation in the number of defined luminal vessels. This observation has been depicted as follows.

      Throughout this process, the initially mesh-like capillary lymphatics undergo progressive remodeling to establish lumen-bearing vessels. Consequently, while the density of LECs per unit area remains relatively stable, there is an increase in the number of lymphatic vessels possessing distinct luminal structures (Figure 3N and O). (Page 5, lines 220-223)

      The authors have mentioned that the staging of the embryos and fetuses was done by Carnegie stage and clinical information. The authors should offer more detailed information about those embryos and fetuses. For example, crown-rump length, menstrual weeks, craniofacial features etc. This information will be useful for other researchers in this field.

      Reply:

      We have substantially expanded the data presented in Table 1 regarding embryos and fetuses. For specimens dating back over 15 years, some lacked echo graphic details. In those instances, we estimated the developmental stage by integrating available data, such as the date of the last menstrual period or morphological features of the fetus. For a case initially assessed as CS 8, which had no recorded cardiac activity in the preceding week, a subsequent ultrasound noted a heartbeat. Considering this alongside the specimen's size, we revised the estimated stage to CS 9-10, correlating with the onset of heart formation. Despite exhaustive sectioning of this particular embryo until the samples were depleted, the heart structure remained undetected. Nevertheless, taking into account morphological observations, such as an open neural tube, the stage was adjudged to be CS9-10. Furthermore, for ectopic pregnancies, which frequently necessitated emergency surgeries due to symptoms like abdominal pain or bleeding, preoperative embryonic data was often unavailable.

      Reviewer #2 (Significance (Required)):

      Strengths: Very informative results for human embryonic lymphatic development. They have performed the experiments at various developmental stages.

      Limitations: Image quality need to be improved. Many high magnification images are not clear. Human samples come from certain diseases, which might have affected the embryo's development.

      Advance: this study clarified the process of early lymphatic vessel formation in human embryos.

      Audience: clinical and basic science in developmental biology and lymphatic biology.

      Reviewer expertise: lymphatic development, lymphatic biology, vascular biology.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study by Yamaguchi et al., explores the progression of lymphatic vessel growth in different stages of human embryos. They also try to identify the origin of the lymphatic vessels in different organs. The study first shows that lymphatic endothelial cells (LECs) first show up in the anterior cardinal veins (ACVs) of CS12 in human embryos, which is similar to what is known to occur in mouse embryos. They also checked whether the PROX1+ LECs of the heart are derived from Flk1+/Isl+/PECAM- cells. However, Flk1+/Isl+/PECAM- cells do not co-express PROX1. These results suggest that in human embryos LECs originate from the ACVs. The authors then identify that lympho-venous valves formed between lymph sacs and the cardinal veins at around Carnegie Stage (CS)18. The valves have showed obvious bicuspid shape at Gestational week (GW)9. Finally, the authors demonstrate that the development of lymphatic vessels happens at different time points in various organs. At CS16, lymphatic vessels and LECs can be detected in the lower jaw, heart and the lungs; mesenteric and intestinal lymphatic vessels can be detected between CS17 and CS18; kidney lymphatic vessels can be found at CS23; At GW9, the lymphatic vessels are observed around the aorta, which may combine to form the future thoracic duct. Together, this informative study sheds light on the progression of lymphatic vasculatures during embryonic stage in humans.

      This study has many strengths, in addition to some areas that if addressed, would further increase the impact of the findings. These include:

      1. Since immunostaining is the major method that the authors have used for their work, they could use positive and negative controls (secondary antibody only or IgG control) for different antibodies. The authors can also show some Isl1 and Flk1 staining in GW9 fetus or adult tissue, like PROX1 or LYVE1 in Supplemental figure 1.
      2. Figures 1 F-H, S' and S", U', U", and U'" are hard to appreciate. Can the authors offer higher quality images or show some confocal images?
      3. According to the author's previous publications (ref 17 and 30) and literature (ref 31), Flk+/Isl+/PECAM- cells differentiate into LECs. However, in this work they did not observe any PROX1+Isl1+ cells at CS13 and CS14. I am curious to know if they found any PROX1+Isl1+ cells at later time points such as GW9.
      4. Figure 3 N and O show comparable VEGFR3+PROX1+ cell numbers in different time points, however it shows increased VEGFR3+PROX1+ vessel numbers. If so, do LECs become more elongated and form the vessel-like structures?
      5. The authors have mentioned that the staging of the embryos and fetuses was done by Carnegie stage and clinical information. The authors should offer more detailed information about those embryos and fetuses. For example, crown-rump length, menstrual weeks, craniofacial features etc. This information will be useful for other researchers in this field.

      Significance

      Strengths: Very informative results for human embryonic lymphatic development. They have performed the experiments at various developmental stages.

      Limitations: Image quality need to be improved. Many high magnification images are not clear. Human samples come from certain diseases, which might have affected the embryo's development.

      Advance: this study clarified the process of early lymphatic vessel formation in human embryos.

      Audience: clinical and basic science in developmental biology and lymphatic biology.

      Reviewer expertise: lymphatic development, lymphatic biology, vascular biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Yamaguchi et al. performed a comprehensive characterisation of lymphatic vessel development in human embryos, spanning stages C8 to GW9. Through the utilisation of immunohistochemistry targeting proteins expressed in the lymphatic endothelium and blood endothelium, the authors have discerned the presence of lymphatic endothelial cells within the cardinal vein and in extraveinal locations. By systematically analysing the progression of embryonic stages, the authors identified the emergence of lymph sacs. Furthermore, they confirmed the presence of lymphatics in various organs, such as the heart, kidney, lung, and mesentery. However, lymphatics were not detected in the central nervous system during the embryonic stages. At the molecular level, lymphatic endothelial cells express similar factors as in mice, including Prox1, Vegfr3, Lyve1, and PDPN, although the timing and combination of these factors may vary depending on the tissue. This study significantly contributes to our knowledge of lymphatic development in humans.

      Major comments

      Human embryo samples are exceptionally valuable and ethically sensitive, making their maximum utilisation crucial. While the authors conducted a thorough anatomical and molecular analysis, it raises questions about whether more insights can be gleaned.

      Specifically, the authors should clarify whether data from embryos collected at CS8-CS10 were processed, and what was the status of venous and lymphatic development?

      The authors commented that CS11 lymphatic vessels were not identified in the vein. Was there any indication of LECs outside the vein? Could the authors include images of this stage?

      For embryos at CS12, it would be insightful to know the proportion of LECs versus VECs within the vein, the quantity of LECs outside the veins, and whether there was section-dependent variability in these observations.

      It would be helpful if Table 1, "Information of human embryos and fetuses", could be complemented with a summary of the main findings at each stage, including which markers LECs expressed and their distribution.

      The authors mentioned differences in lymphatic markers at various regions of the embryo and different developmental stages. It is essential to clarify whether all regions express the same markers at the latest developmental stage.

      To strengthen the assertion that this study provides unique insights compared to those of mice, a schematic summarising the similarities and differences between mouse and human observations should be included.

      A discussion of the limitations of analysing embryos from abnormal pregnancies is necessary. In addition to the determined lack of chromosomal abnormalities, it is crucial to consider phenotypical and morphological integrity. The authors should address the possibility of developmental defects and mutations causing abnormalities in the lymphatic vessels.

      Minor comments

      In the abstract, the authors refer to lymphatic malformations as a specific type of lymphatic disease. We recommend acknowledging the broader implications of this study beyond such specific cases.

      The term "lymph-related disease" should be clarified for better understanding.

      Figure 3S shows kidney samples, not the myocardium or endocardium, as indicated.

      Significance

      This study largely reaffirms the existing knowledge from mouse models and previous human data. Given the absence of a cure for lymphatic diseases, gaining a deeper understanding of how lymphatic vessels develop in humans could serve as a crucial stepping stone in this field of research.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time since our responses contain graphs and tables that might contain formatting errors in the conversion process.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors characterized circRNAs that can escape degradation in cells infected by alpha-, beta and gamma herpesviruses both in cultured cells and in mice. During lytic infection, ten circRNAs were modulated across the virus subfamilies, and 67 circRNAs were upregulated after virus infection or treatment with interferon- or interferon-. The authors examined in detail interferon-induced circRNA circRELL1 and noted that this circRNA suppresses lytic infection, likely by interacting with the mTOR pathway and promoting cell proliferation. They also made the astonishing observation that the circRNAs were more resistant to cleavage by virus-encoded nucleases than their linear counterparts. This is a comprehensive study that reveals circRNA key players that control lytic and latent herpesvirus infections.

      Comments:

      1. Linear mRNA abundances are decreased after infection, but linear-derived circRNA abundances are upregulated. What determines increased circRNA abundances when their linear counterparts become limiting? Is the rate of splicing/back-splicing altered in infected cells? Is nuclear-cytoplasmic transport of circRNAs changed?
      2. CircRELL1 loss of function experiment: Does the employed siRNA affect the abundance of linear RELL1 mRNA?
      3. CircRELL1 gain of function experiment: How was this performed?
      4. Why are circRNA and, especially circRELL1, resistant to degradation in interferon-treated cells when OAS genes, and presumably RNAseL, are upregulated?

      Significance

      This study enhances our knowledge of circRNAs in viral infections.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Dremel et al explore the interplay between herpesvirus and circRNAs. This team has been a pioneer in the field and has made important discoveries about the regulation of circRNAs during herpesviral infection. While past work focused on the gamma-herpesviruses, here in this study, they expand their work to alpha and beta HV as well as extending their findings into animal models. They found consistent increase in CircRNA levels and found that these circRNAs are mostly resistant to viral-induced RNA decay. They then uncover that these circRNA can be induced by interferon which indicates that circRNAs could act as a first line of anti-viral defense. Consistently, they found that circRELL1, previously identified as a circRNA inhibiting KSHV lytic infection, can also restrict HSV-1. This seemingly conserved anti-viral capacity could point to an evolutionarily conserved mechanism of defense against infection. This study combines together an impressive amount of sequencing data and elegantly draws parallels between the various HV. While this is a complex story, bringing together expression levels of mRNA, circRNA and even diving into miRNA networks, this is extremely well written and as a reader, you feel transported into a well-crafted journey that keeps uncovering novel and exciting findings.

      Major comments:

      • there is a back and forth between the HVs that are included in the study: fig 1 has HSV, CMV and KSHV; while fig 2 is HSV, KSHV and MHV68. For clarity and consistency, the authors should consider including all 4 representative HV in their figures.
      • this might be a misunderstanding that just needs clarification: CIRCScores provide a measure of CircRNA reads over their linear counterparts: during host shutoff, the linear counterpart would likely decrease and therefore the CIRCscore would artificially go up. So the fact that the CIRCscore remains constant over lytic reactivation in KSHV and MHV68, wouldn't that indicate that instead the CircRNA level go down?
      • what happens to circRNAs in HCMV infection as they do not encode an endoRNase?
      • line 238: the authors should discuss why CpG and poly I:C treatment largely failed to induce expression of CircRELL1
      • line 289: if CircRELL1 is induced in response to infection, it could be interesting to induce its expression after infection (maybe a DOX-inducible promoter on the lentivirus) instead of prior to infection (which might hinder infection and hide more significant effects later).
      • line 329: the authors mention that the mRNA targets of SOX carry a "degenerate motif": do the circRNA downregulated during KSHV infection contain such motif?

      Minor comments:

      • figure 3B should show p-values
      • figure 3B: showing expression levels of the endoRNases could provide some context for the extent of their effect
      • I would refrain from using sentences referring to phenotype "trending upward" which basically reflects that the results are not significant in either upward or downward direction.

      Significance

      CircRNAs are only beginning to emerge as important regulators of gene expression in cells. Only recent developments in sequencing technology have allowed scientists to even detect the presence of these small RNA. Their roles and mechanism of induction remain largely uncharacterized, let alone in the context of viral infection. This team has pioneered the exploration of CircRNA in KSHV and is now poised to extend their findings to other members of the herpesvirus family. This study will be of interest to a broad audience, both virologist looking to better understand the viral-host battle during infection and RNA biologists seeking to better characterize how gene expression can be controlled with circRNA. There is also major therapeutic potential with these types of approaches.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Dremel et al reports on IFN-induced circRNAs that are produced in response to herpesviral infections and that are resistant to virus-mediated degradation (e.g. host shut-off). They provide a wealth of interesting data on circRNA production changes in response to infection with alpha, beta, and gamma herpesviruses and identify a number of host circRNAs that are commonly regulated across all subfamilies. Using circRELL1 as an example, they demonstrate a modest impact on productive HSV-1 infections (which follows up on their previous report of circRELL1 impacting KSHV lytic infection). They subsequently propose a new model in which a subset of IFN-stimulated genes produce both mRNAs and circRNAs with the potential for antiviral activity, the latter operating as a 'workaround' for avoiding virus-mediated shutoff.

      Major Comments

      • Line 146: The authors extended their analysis to HSV-1 latently infected mouse trigeminal ganglia but potentially missed a trick but not including an analysis of HSV-1/VZV infected human trigeminal ganglia for which potentially compatible datasets are available (PMID: 29563516).
      • Line 264: The authors note that a direct relationship in gene expression changes was observed between some circRNAs and mRNAs and not others. However, a gene level analysis may be problematic here as many genes encode multiple distinct transcript isoforms that are variably regulated e.g. during infection. A transcript level analysis (e.g. using Kallisto/Salmon) might enable the authors to specifically link circRNAs with individual mRNA isoforms which would be valuable information in the context of interferon-driven gene expression. Alternatively, the lack of a direct relationship in gene expression changes may also link to variability in circRNA decay / halflives. The authors could potentially solve this using a metabolic labelling approach to measure mRNA and circRNA decay rates for a subset of the genes of interest.
      • Line 273: The weak element in the paper relates to the role of circRELL1 in restricting HSV-1 productive infections. The effects observed are modest at best and the biological impact appears limited. It is also entirely unclear how circRELL1 might act to restrict HSV-1. The paper would significantly benefit from the authors extending their analysis to include 2-3 additional circRNAs that are commonly regulated by herpesviruses to determine (i) whether similar effects are observed and (iii) to determine whether a compound effect can be achieved by targeted silencing of multiple IFN-responsive circRNAs at the same time.

      Minor comments

      • Line 55: To this reviewers knowledge, only eight routinely infect humans (HSV1, HSV2, VZV, HCMV, HHV6, HHV7, EBV, and KSHV. It's possible the authors are considering HHV6A and HHV6B as distinct but this is not really the case.
      • Line 57: As written it implies that therapeutic agents capable of clearing VZV exist which is not really the case.
      • Line 132: The authors inclusion of previously published data (e.g. HCMV) along with reams of their own makes for a compelling analysis.
      • Line 582: A number of the sequencing datasets are not yet publicly available. This should be rectified before publication.

      Significance

      It is clear that a lot of work has gone into this manuscript and there a reams of data that will provide a great resource for mining to the wider herpesvirus community. While this naturally leads to a more descriptive nature, it does not undermine the value of the data. However, as indicated below, more functional validation is required if this work is to provide a significant step forward in our understanding of how and why IFN-induced circRNAs might be important for combating viral infections.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We have thoroughly revised the manuscript, taking into account all comments from all four reviewers. We have added new data (Supplemental Figure 2 and Supplemental Figure 4) in response to these comments.

      Reviewer 1

      The assessment of data reproducibility is currently uncertain due to the absence of replication and statistical analysis in the dataset. It is essential to provide explicit information regarding sample sizes or replicates for all data and figures, data should be presented as mean +/- SD/SEM, and the interpretation of results should be grounded in rigorous statistical analysis. The lack of experimental replicates and statistical analysis in most of the figures presented raises major concerns regarding the validity of the result.

      We have now added error bars for the graphs in Figure 3D, E, F, G, H; Figure 4 D, F, G, H, I, J; Figure 5 B, C, D, E, F, G; and Figure 6B, C, D. All GTPase assays have repeated three times. The mean ± S.D. (n = 3) is plotted for each condition. For high-speed pelleting assays, all assays have been conducted three times, and a representative assay is shown.

      Why was only one of the MiD proteins, specifically MiD49, studied, while MiD51 was not includedin the investigation?

      This is an excellent point. In our previous work (doi:10.1101/2023.07.31.551267), we found that MiD49 and MiD51 were strikingly similar in their abilities to activate Drp1 after their own activation with fatty acyl-CoA. We feel that the demonstration here with MiD49 suggests that a similar effect would occur with MiD51. Due to time constraints for the lead author, preparing more MiD51 protein was out of the scope of what could be done. We now add a line in the Discussion that results for MiD51 may be different.

      The author suggestion of Drp1 phosphorylation, based on the mobility of protein observed in SDS-PAGE gel (fig 4A, 5A, 6A), is not a sufficiently valid assessment. While western blot analysis is a valid method to assess Drp1 phosphorylation, it is essential to include replicates for semi-quantitation and demonstrate the reproducibility of the results. Moreover, it is recommended to incorporate Western blot analyses to provide additional support for the findings presented in Figures 5 and 6.

      • We agree with the reviewer that additional information on the phosphorylation state of these proteins should be provided. We now include phospho-proteomic analysis for Erk2 phosphorylation of WT Drp1 and Drp1-S600D (Supplemental Table 1), showing that S579 is by far the predominant phosphorylation site. For WT Drp1, three lines of evidence now suggest efficient Erk2 phosphorylation of S579:
      • Western blot using anti-phosphoS579
      • Phosphoproteomic analysis
      • Gel shift

      For the Drp1-S600D phosphorylation, we have phosphoproteomic and gel shift analysis. For isoform 6, we regrettably only have gel shift. However, given the fact that the effect of Erk2 treatment on actin-stimulated GTPase activity mimics what we found for WT-Drp1 and for Drp1-phosphoS579/S600D, we think it is highly likely that the equivalent phosphorylation (S629 in this case) has been affected.

      Data on phosphorylated peptides with replicates experiments should be presented.

      We now present these data, which have been significantly expanded since the initial submission (new Supplemental Table 1). While non-phophorylated S579 is still detected in both the WT and S600D phosphorylation reactions, the phosphorylated peptide is 2.2 and 2.3-fold more abundant, respectively. Our conclusion is that Erk2 efficiently phosphorylates S579, although stoichiometric phosphorylation was not obtained here. We have added statements in the relevant sections of the Result, and in the Methods. We have also added Supplemental Table 1 to show the spectral counts obtained from phospho-proteomic analysis, and have deposited the raw data files with the PRIDE consortium (access information in the Methods).

      Please provide additional context or specific details about the GFP-tagged Drp1 protein, such as the protein site where GFP was attached, as well as whether this tag could potentially impact the Drp1 GTPase activity and oligomerization. Figure 7C and D appear to suggest an increase in the GTPase activity of the GFP-Drp1 protein.

      We have now added these details to the Methods section, and have also added the complete amino acid sequence for the final purified construct in Supplemental Figure 4. We have also added that a previous study (PMID: 32901052) found that inclusion of GFP strongly inhibited Drp1 GTPase activity. We do not observe this effect here or in a previous study (PMID: 27559132), and provide possible reasons for this difference in the Methods. The reviewer points out that the activity of GFP-Drp1 appears higher than that of un-tagged Drp1 (comparing 7C with 7D). We find that the GTPase activity of Drp1 alone varies between 1 and 2 uM/min/uM protein depending on the preparation. This variation occurs for both untagged and GFP-tagged Drp1. This difference in basal activity from prep-to-prep might relate to differences between protein preparations, or exact amount of time required to freeze the aliquots of purified protein (we freeze small aliquots ( An optional experiment that would significantly enhance the biological relevance of the findings presented in the current study is to assess the morphology of mitochondria in cells expressing the phospho-mimetic mutant Drp1 proteins. This experiment would provide valuable insights into the functional consequences of Drp1 S579 and S600 phosphorylation on mitochondrial structure and dynamics.

      We fully agree that these would be valuable experiments. The issue is that a large number of experiments using phospho-mimetic mutants in cells have already been conducted, with varying results (Taguchi et al., 2007; Qi et al., 2011; Yu et al., 2011; Strack et al., 2013; Kashatus et al., 2015; Serasinghe et al., 2015; Xu et al., 2016; Brand et al., 2018; Han et al., 2020, Chang and Blackstone, 2007; Cribbs and Strack, 2007; Cereghetti et al., 2008; Wikstrom et al., 2013, Han et al., 2008,Wang et al., 2012 Jhun BS, Sheu, 2018, J Physiol). To conduct more targeted tests examining specific forms of Drp1 activation in cells (for example, through Mff, MiD proteins, actin, or cardiolipin) will require extensive work that is outside the scope here. Our feeling is that S579 phosphorylation is likely to recruit another molecule (probably a protein) that has an activating effect. We tried to test one possibility (NME3, mentioned in the Discussion) but failed to produce useable NME3 protein for these tests and, given time constraints for the lead author, could not address this further.

      Provide reference for method on actin polymerization.

      We have now added a reference in the ‘Actin preparation for biochemical assays’ section of the Methods (PMID 16472659).

      Rectify the error in referencing figure 3 panels within the figure legends of Supplemental Fig S1.

      Thank you, we have changed this.

      The inclusion of full length isoform 6 is commendable. However, there is no mentioned of isoform6 in the method section.

      Thank you for pointing this out. We have added description of the construct and referenced our previous paper that used it.

      Since papers deposited in bioRxiv have not undergone peer review, reference #7 should not becited as references in scholarly work.

      Reference 7 has so far been reviewed by a peer-review journal. e are addressing reviewers’ concerns and will re-submit soon. We do not know how to rectify the issue of referencing this work, because it describes an extensive amount of groundwork for the MiD proteins. Our hope is that this work will be in press by the time the work reviewed here is ready for publication.

      Please provide details about the calculation of GTPase activity and the distinctions between the specific GTPase activity and total GTPase activity shown in figure 8D-F.

      We now describe these calculations in the “GTPase assay” section of the Methods.

      Reviewer 2

      Overall, the experiments described here are carried out with rigor and the conclusions drawn are of significance to understanding how phosphorylation regulates Drp1 functions.

      Thank you for these kind comments!

      Phosphorylation of both the serine residues appears to elicit a common effect in that they inhibitDrp1's stimulated GTPase activity. This would suggest that phosphorylation affects Drp1's self-assembly as tightly packed helical scaffolds. Instead of sedimentation analysis, an EM analysis of helical scaffolds on cardiolipin-containing membrane nanotubes or in the presence of soluble adaptors causing Drp1 to form filaments would provide a direct readout for defects in self-assembly.

      This is an excellent point, and we would love to conduct this work. Given our current EM infrastructure and expertise, these experiments would take extensive time for us to do. We do have a collaborator who could carry these out, but feel that the time it would take even for them to do this correctly is beyond that which we have (the lead author is transitioning to their next career phase). We have added the point that further EM studies of this type are necessary to test the effect on Drp1 assembly more directly.

      I am not sure of the rationale for experiments reported in Fig. 7 and 8. If the idea was to test if hetero oligomerization with WT Drp1 rescues defects associated with phosphorylated Drp1 then this could be stated explicitly in the manuscript. GFP-Drp1 is used as a WT mimic but a previous report (PMID: 30531964) indicates that this construct is severely defective in stimulated GTPase assays, much like the K38A mutant. But the rationale of using these constructs is not quite apparent. Is the intention to test if defects seen in the phospho-mimetic mutants of Drp1 can be rescued by the presence of a 'seed' of WT Drp1. If so, then this could be stated explicitly in the manuscript. But regardless, I am not quite sure what this data set achieves in terms of addressing mechanism.

      We apologize for not being clearer in our explanation of these experiments. Our goal was to test the effects of partial Drp1 phosphorylation on overall Drp1 activity, which likely mimics more accurately the cellular situation (wherein only a portion of the Drp1 population is likely to be phosphorylated even upon kinase activation). We now discuss these experiments in a clearer manner. For the GFP-Drp1, we do not observe the effect on GTPase activity shown in that previous manuscript by another laboratory, either here or in previous studies (eg, PMID: 27559132). In the Methods, we now provide a discussion of these differences and possible reasons for them, as well as providing the complete amino acid sequence of our GFP-fusion construct in Supplemental Figure 4.

      Finally, it would have been nice to see if the phospho-mimetic mutants of Drp1 produce the same effects on mitochondrial structure as those reported earlier. Reanalyzing their effects in a cellular assay becomes important because it would consolidate this work for the readers to evaluate the'true' effects of phosphorylation on Drp1 functions. If the phospho-mimetic mutants fare in a manner like those previously reported, then it signifies that stimulation in GTPase activity is not a readout that directly correlates with Drp1 functions. If not, then the results presented here would establish a comprehensive analysis of in vitro biochemical activities and in vivo functions of the phospho-mimetic mutants.

      We fully agree that these would be valuable experiments. The issue is that a large number of experiments using phospho-mimetic mutants in cells have already been conducted, with varying results (Taguchi et al., 2007; Qi et al., 2011; Yu et al., 2011; Strack et al., 2013; Kashatus et al., 2015; Serasinghe et al., 2015; Xu et al., 2016; Brand et al., 2018; Han et al., 2020, Chang and Blackstone, 2007; Cribbs and Strack, 2007; Cereghetti et al., 2008; Wikstrom et al., 2013, Han et al., 2008,Wang et al., 2012 Jhun BS, Sheu, 2018, J Physiol). To conduct more targeted tests examining specific forms of Drp1 activation in cells (for example, through Mff, MiD proteins, actin, or cardiolipin) will require extensive work that is outside the scope here. Our feeling is that S579 phosphorylation is likely to recruit another molecule (probably a protein) that has an activating effect. We tried to test one possibility (NME3, mentioned in the Discussion) but failed to produce useable NME3 protein for these tests and, given time constraints for the lead author, could not address this further.

      Previous work reports that the effect of actin on the GTPase activity of Drp1 is biphasic but the binding to actin is not. This is quite confounding, and the authors could perhaps explain why this is the case.

      The reviewer makes an excellent point, which we now explain further in the manuscript. We have also discussed this in doi:10.1101/2023.07.31.551267 (see Figure 2D in that work). Our interpretation is that it is the density of Drp1 bound to the actin that provides the activation, by positioning the GTPase domains in close proximity. As the amount of actin increases, the Drp1 becomes more dispersed on the filaments, and activation decreases. We observe the same effect for MiD49 and MiD51 oligomers (see the above-mentioned reference).

      The manuscript cites PMID: 23798729 for expression analysis of slice variants but PMID:29853636 provides a more compressive analysis. The authors could cite this work.

      Thank you for this reference. We were unaware of it, but are very glad to know of it now. We now include this reference. In particular, in the legend to Figure 1C (table of splice variants), we now state that this table is for human Drp1, and that additional splice variants have been identified for murine Drp1 (PMID 29853636).

      Reviewer 3

      The splendid results of the manuscript willbe interesting to the researchers in the related fields.

      Thank you for this nice comment!

      The manuscript provided well-organized biochemistry results for comparisons between phosphorylation of Drp1 S579 and S600. It is the reviewer's comments that the authors may include experiments that manipulate Drp1 phosphorylation at different amino acids in cells. Such experiments will provide strong support for this manuscript.

      • We fully agree that these would be valuable experiments. The issue is that a large number of experiments using phospho-mimetic mutants in cells have already been conducted (Taguchi et al., 2007; Qi et al., 2011; Yu et al., 2011; Strack et al., 2013; Kashatus et al., 2015; Serasinghe et al., 2015; Xu et al., 2016; Brand et al., 2018; Han et al., 2020, Chang and Blackstone, 2007; Cribbs and Strack, 2007; Cereghetti et al., 2008; Wikstrom et al., 2013, Han et al., 2008,Wang et al., 2012 Jhun BS, Sheu, 2018, J Physiol). To conduct more targeted tests examining specific forms of Drp1 activation in cells (for example, through Mff, MiD proteins, actin, or cardiolipin) will require extensive work that is outside the scope here. Our feeling is that S579 phosphorylation is likely to recruit another molecule (probably a protein) that has an activating effect. We tried to test one possibility (NME3, mentioned in the Discussion) but failed to produce useable NME3 protein for these tests and, given time constraints for the lead author, could not address this further.

        The authors discussed the known factors that involved in Drp1 activation, such as its receptors, actin and cardiolipin. Recent JCB paper (J. Cell Biol. 2023 Vol. 222 No. 10 e202303147) indicates that intermembrane space protein Mdi1/Atg44 may play a role in coordinating mitochondria fission with Dnm1 (Drp1 in yeast cells). It will be valuable if the manuscript could also discuss the potential factor.

      • Thank you for this comment. We now include Mdi1/Atg44 as a possible factor that might be influenced by Drp1 phosphorylation. Two points we would like to make here are: there doesn’t seem to be an Mdi1 homologue in mammals, so the equivalent factor must be identified before testing; and Mdi1 is an inter-membrane space protein, so any effect of Drp1 phosphorylation on coordinated functioning with Mdi1 would either require an intermediary factor or exposure of the IMS in some way.

        Keywords cannot represent the manuscript. It is recommended that the authors use other words to for the current manuscript.

      We have removed K38A from this list. The other key words are not mentioned in the Abstract.

      Reviewer 4

      The authors showed that the binding of Drp1 to actin depends on salt concentrations (Fig. 2Band C). In the presence of 65 mM NaCl, the phosphomimetic mutants showed decreased binding to actin. The GTPase assay is performed with 65 mM KCl, in which actin did not stimulate GTP hydrolysis of the phosphomimetic mutants. In contrast, with 140 mM NaCl, the S579D Drp1 exhibits slightly enhanced actin binding compared to WT Drp1. Could the authors assess the actin-activated GTPase activity in the 140 mM salt condition to test if actin activates GTP hydrolysis ofS579D Drp1 more potently than WT?

      This is a good point by the reviewer. However, with limited time for the first author, we chose to focus on the reviewer’s other comments (see below).

      Both phosphomimetic mutants show reduced activation for GTP hydrolysis in the presence of cardiolipin, Mff, and MiD49. Is this because the mutants have a lower affinity for these interactors? Or do they bind with the same affinity but experience diminished activation? The data suggests the latter scenario, potentially resulting from decreased oligomerization properties. Can the authors provide more insights on this, for example, by measuring their interaction in the presence of GMP- PCP, which fully induces oligomerization in all three forms of Drp1?

      • These are interesting ideas, and we conducted experiments similar to what the reviewer described: co-sedimentation experiments with combinations of Drp1 and Mff under three nucleotide states: no nucleotide, GMP-PCP, and GTP. We used Mff for these experiments because we have this protein in abundance, and have previously characterized this construct as a trimer in PMID 34347505. We use a high concentration of Mff (50 mM) versus Drp1 (1.3 mM) because of the relatively low affinity between the two proteins (shown in PMID 34347505). We find the following:
      • In the absence of nucleotide, Mff does not cause an increase in pelletable Drp1 for any of the Drp1 constructs.
      • In the GTP state, the presence of Mff greatly increases the amount of Drp1 in the pellet, suggestive of increased Drp1 oligomerization. This effect occurs for all Drp1 constructs (WT, S579D and S600D mutants), but the amounts of both Drp1 and Mff in the pellets are about 50% less for both mutants than for the WT construct. This result suggests a decrease in oligomerization for the mutants, but not necessarily a decrease in Mff binding.

      I'm curious what happens to oligomerization if GTP is added instead of nonhydrolyzable GMP-PCP (Fig. 1D). Does this lead to higher oligomerization in the mutants compared to WT since the mutants seem to have lower GTPase activity? This might explain why phosphorylation increases mitochondrial localization of Drp1 in cells seen in some studies.

      This is another interesting thought, and we describe the new experiments we conducted in the response to the previous comment. Essentially, while GTP does cause a slight increase in pelletable Drp1, the increase is somewhat similar for all constructs. As described in the last comment, the addition of Mff causes a substantial increase in pelletable Drp1 for both WT and the mutants. This result suggests that, while the basal oligomeric state of Drp1 (in the absence of nucleotide) is reduced for the mutants (our original analytical ultracentrifugation data), the mutants appear to be capable of responding to GTP and Mff in a similar manner to WT. We acknowledge that the assay used here (pelleting) lacks the precision required to draw detailed conclusions on oligomerization or interaction with Mff, and we try to reflect this in our discussion of the data. We do feel, however, that these data are useful to report, in guiding future study.

      Please include the number of experimental repeats and error bars where applicable.

      We have now added number of experimental repeats and error bars for the graphs in Figure 3D, E, F, G, H; Figure 4 D, F, G, H, I, J; Figure 5 B, C, D, E, F, G; and Figure 6B, C, D.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      During mitochondrial division, a mechanochemical GTPase, Drp1, interacts with its receptor proteins, phospholipids, and the actin cytoskeleton. These interactions regulate the mitochondrial recruitment of Drp1 and its activities, including oligomerization and GTP hydrolysis. Drp1 undergoes serine phosphorylation at two primary sites (S579 and S600 in human isoform 3). It has been suggested that S579 phosphorylation activates Drp1, while S600 phosphorylation inhibits Drp1. However, the biochemical effects of these phosphorylations on Drp1's activity are mostly unexplored. The current study by Liu et al. addresses this crucial question in extensive biochemical assays using recombinant proteins. First, the authors showed that phosphomimetic Drp1 mutations (S579D or S600D) have a reduced ability to oligomerize. Second, both mutants exhibited decreases in their activation for GTP hydrolysis by actin, and concurrently, they demonstrated reduced binding to actin. Third, the Drp1 phosphomimetic mutants showed decreased activation for GTP hydrolysis by cardiolipin and two receptor proteins, Mff and MiD49. This reduction was also evident when Mff was combined with actin. The authors confirmed these results by phosphorylating WT Drp1 at S579 in vitro using the protein kinase Erk2. The effects of phosphorylation seem consistent across Drp1 isoforms with different alternative exons.

      Specific comments

      1. The authors showed that the binding of Drp1 to actin depends on salt concentrations (Fig. 2B and C). In the presence of 65 mM NaCl, the phosphomimetic mutants showed decreased binding to actin. The GTPase assay is performed with 65 mM KCl, in which actin did not stimulate GTP hydrolysis of the phosphomimetic mutants. In contrast, with 140 mM NaCl, the S579D Drp1 exhibits slightly enhanced actin binding compared to WT Drp1. Could the authors assess the actin-activated GTPase activity in the 140 mM salt condition to test if actin activates GTP hydrolysis of S579D Drp1 more potently than WT?
      2. Both phosphomimetic mutants show reduced activation for GTP hydrolysis in the presence of cardiolipin, Mff, and MiD49. Is this because the mutants have a lower affinity for these interactors? Or do they bind with the same affinity but experience diminished activation? The data suggests the latter scenario, potentially resulting from decreased oligomerization properties. Can the authors provide more insights on this, for example, by measuring their interaction in the presence of GMP-PCP, which fully induces oligomerization in all three forms of Drp1?
      3. I'm curious what happens to oligomerization if GTP is added instead of nonhydrolyzable GMP-PCP (Fig. 1D). Does this lead to higher oligomerization in the mutants compared to WT since the mutants seem to have lower GTPase activity? This might explain why phosphorylation increases mitochondrial localization of Drp1 in cells seen in some studies.
      4. Please include the number of experimental repeats and error bars where applicable.

      Significance

      Overall, the current study provides a comprehensive set of biochemical data for the role of Drp1 phosphorylation using cutting-edge in vitro assays. One might expect that S579 phosphorylation would enhance some of Drp1's action while S600 phosphorylation would show the opposite impact. Unexpectedly and interestingly, the authors found that both phosphorylations decrease Drp1's activities. Therefore, this work significantly advances our mechanistic understanding of mitochondrial division.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The manuscript aimed to clarify the effects of Drp1 phosphorylation on its activation under different factors, such as receptors, actin and cardiolipin. The manuscript performed experiments with purified proteins to examine the correlation of protein-protein interactions and enzymatic activity. The authors specifically focused on the phosphorylation of s S579 and S600 of Drp1 isoform 3, which is the most abundant in HeLa, HL60 and PC12 cells. The results demonstrated the difference of post-translational modification on S579 and S600. The manuscript went further to suggest that additional factors may exist for the Drp1 activation by S579 phosphorylation.

      Major Concerns

      1. The manuscript provided well-organized biochemistry results for comparisons between phosphorylation of Drp1 S579 and S600. It is the reviewer's comments that the authors may include experiments that manipulate Drp1 phosphorylation at different amino acids in cells. Such experiments will provide strong support for this manuscript.
      2. The authors discussed the known factors that involved in Drp1 activation, such as its receptors, actin and cardiolipin. Recent JCB paper (J. Cell Biol. 2023 Vol. 222 No. 10 e202303147) indicates that intermembrane space protein Mdi1/Atg44 may play a role in coordinating mitochondria fission with Dnm1 (Drp1 in yeast cells). It will be valuable if the manuscript could also discuss the potential factor.

      Minor Concerns

      Keywords cannot represent the manuscript. It is recommended that the authors use other words to for the current manuscript.

      Significance

      The authors provided a model for Drp1 catalytic activity. The splendid results of the manuscript will be interesting to the researchers in the related fields.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The results presented here indicate that phosphorylation of either serine residue negatively affects Drp1's tendency to oligomerize and substantially reduces stimulation of its GTPase activity in the presence of actin, adaptor proteins and cardiolipin-containing vesicles.

      Overall, the experiments described here are carried out with rigor and the conclusions drawn are of significance to understanding how phosphorylation regulates Drp1 functions.

      Major Comments:

      1. Phosphorylation of both the serine residues appears to elicit a common effect in that they inhibit Drp1's stimulated GTPase activity. This would suggest that phosphorylation affects Drp1's self-assembly as tightly packed helical scaffolds. Instead of sedimentation analysis, an EM analysis of helical scaffolds on cardiolipin-containing membrane nanotubes or in the presence of soluble adaptors causing Drp1 to form filaments would provide a direct readout for defects in self-assembly.
      2. I am not sure of the rationale for experiments reported in Fig. 7 and 8. If the idea was to test if hetero oligomerization with WT Drp1 rescues defects associated with phosphorylated Drp1 then this could be stated explicitly in the manuscript. GFP-Drp1 is used as a WT mimic but a previous report (PMID: 30531964) indicates that this construct is severely defective in stimulated GTPase assays, much like the K38A mutant. But the rationale of using these constructs is not quite apparent. Is the intention to test if defects seen in the phospho-mimetic mutants of Drp1 can be rescued by the presence of a 'seed' of WT Drp1. If so, then this could be stated explicitly in the manuscript. But regardless, I am not quite sure what this data set achieves in terms of addressing mechanism.
      3. Finally, it would have been nice to see if the phospho-mimetic mutants of Drp1 produce the same effects on mitochondrial structure as those reported earlier. Reanalyzing their effects in a cellular assay becomes important because it would consolidate this work for the readers to evaluate the 'true' effects of phosphorylation on Drp1 functions. If the phospho-mimetic mutants fare in a manner like those previously reported, then it signifies that stimulation in GTPase activity is not a readout that directly correlates with Drp1 functions. If not, then the results presented here would establish a comprehensive analysis of in vitro biochemical activities and in vivo functions of the phospho-mimetic mutants.

      Minor comments:

      1. Previous work reports that the effect of actin on the GTPase activity of Drp1 is biphasic but the binding to actin is not. This is quite confounding, and the authors could perhaps explain why this is the case.
      2. The manuscript cites PMID: 23798729 for expression analysis of slice variants but PMID: 29853636 provides a more compressive analysis. The authors could cite this work.

      Significance

      This manuscript reports an extensive and systematic evaluation of the effects of serine phosphorylation on Drp1's GTPase activity in the presence of actin, adaptor proteins and cardiolipin-containing vesicles. Drp1 is predominantly phosphorylated at two sites, S579 and S600 (numbering based on isoform 1). A large body of literature indicates that phosphorylation at S579 activates Drp1 functions while phosphorylation at S600 inhibits Drp1 functions. But as the authors point out, the effects of phosphorylation on the biochemical functions of Drp1 have not been reported, which is what the present manuscript does.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The present study aims to delineate the effect of S579 and S600 phosphorylation on Drp1 oligomerisation and GTPase activity. Using phospho-mimetic mutant Drp1 proteins, in conjunction with GTPase activity and phosphorylation assays, as well as size exclusion chromatography, the authors conclude that phosphorylation of residue S579 does not activate Drp1 directly. Notably, the authors did not perform cell-based assays to assess mitochondrial fission. The abstract concludes by stating, "our results suggest that nearest neighbour interactions within the Drp1 oligomer affect catalytic activity". However, this assertion appears to lack clarity and direct support from the presented results. Further clarification or evidence linking the observed data to this conclusion would enhance the overall comprehensibility and validity of the study's findings.

      Major comments

      • The assessment of data reproducibility is currently uncertain due to the absence of replication and statistical analysis in the dataset. It is essential to provide explicit information regarding sample sizes or replicates for all data and figures, data should be presented as mean +/- SD/SEM, and the interpretation of results should be grounded in rigorous statistical analysis. The lack of experimental replicates and statistical analysis in most of the figures presented raises major concerns regarding the validity of the result.
      • Why was only one of the MiD proteins, specifically MiD49, studied, while MiD51 was not included in the investigation?
      • The author suggestion of Drp1 phosphorylation, based on the mobility of protein observed in SDS-PAGE gel (fig 4A, 5A, 6A), is not a sufficiently valid assessment. While western blot analysis is a valid method to assess Drp1 phosphorylation, it is essential to include replicates for semi-quantitation and demonstrate the reproducibility of the results. Moreover, it is recommended to incorporate Western blot analyses to provide additional support for the findings presented in Figures 5 and 6.
      • Data on phosphorylated peptides with replicates experiments should be presented.
      • Please provide additional context or specific details about the GFP-tagged Drp1 protein, such as the protein site where GFP was attached, as well as whether this tag could potentially impact the Drp1 GTPase activity and oligomerisation. Figure 7C and D appear to suggest an increase in the GTPase activity of the GFP-Drp1 protein.
      • An optional experiment that would significantly enhance the biological relevance of the findings presented in the current study is to assess the morphology of mitochondria in cells expressing the phospho-mimetic mutant Drp1 proteins. This experiment would provide valuable insights into the functional consequences of Drp1 S579 and S600 phosphorylation on mitochondrial structure and dynamics.

      Minor comments

      • Provide reference for method on actin polymerisation.
      • Rectify the error in referencing figure 3 panels within the figure legends of Supplemental Fig S1.
      • The inclusion of full length isoform 6 is commendable. However, there is no mentioned of isoform 6 in the method section.
      • Since papers deposited in bioRxiv have not undergone peer review, reference #7 should not be cited as references in scholarly work.
      • Please provide details about the calculation of GTPase activity and the distinctions between the specific GTPase activity and total GTPase activity shown in figure 8D-F.

      Significance

      The current investigation holds promise for advancing our understanding of the impact of post-translational modifications, specifically those occurring at the S579 and S600 sites, on Drp1 activity. Nevertheless, the absence of experimental replication and comprehensive statistical analysis introduces notable concerns regarding the credibility and replicability of the findings.

      Audience: Basic research that focus on mitochondrial morphology and Drp1 biology.

      I lack expertise in velocity analytical ultracentrifugation and, as a result, am unable to provide an assessment regarding the validity and accuracy of the assay.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Replies to Reviewers

      Thank you for inviting us to submit our revised manuscript titled, “Diffusive mediator feedbacks control the health-to-disease transition of skin inflammation.” We appreciate the time and effort the editor and each of the reviewers have dedicated to providing insightful feedback on ways to strengthen our manuscript. The revisions in the main text in response to the detailed comments are highlighted in red and were proofread by professional English editors. We hope that our revision and responses address all the concerns raised by the reviewer, and we look forward to hearing from you regarding this submission.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript provides a model of interacting populations of pro- and anti-inflammatory mediators to explain spatial patterns associated with various inflammatory conditions. The work is robust and articulated well, and is certainly scientifically relevant.

      Authors: Thank you for your positive evaluation and many insightful comments on our manuscript. We have incorporated your feedback, and hope that our revisions satisfy all the comments.

      Minor amendments:

      Personally, I feel that the model should be reported prior to the results, as the choice of model is likely to have great significance on the observations. It would be preferable for the reader to have a clear picture of the governing equations in their mind as they digest the results.

      Au: Following this reviewer's suggestion, we have relocated the Method section including the model description to be written prior to the Result section (p.9-14 lines 152-232; revised manuscript).

      The literature review is largely relatively thorough; however, I think it is important that the previous works of Joanne Dunster (University of Reading) and collaborators are included, as these are very closely related to this work. In particular, the authors should note the following two papers, which take a spatial approach:

      • Bayani, A., Dunster, J.L., Crofts, J.J. et al. Mechanisms and Points of Control in the Spread of Inflammation: A Mathematical Investigation. Bull Math Biol 82, 45 (2020). https://doi.org/10.1007/s11538-020-00709-y

      • Bayani A, Dunster JL, Crofts JJ, Nelson MR (2020) Spatial considerations in the resolution of inflammation: Elucidating leukocyte interactions via an experimentally-calibrated agent-based model. PLoS Comput Biol 16(11): e1008413. https://doi.org/10.1371/journal.pcbi.1008413

      Au: We have incorporated this comment by adding the two suggested papers to the relevant sentences in the literature review (p.6 line 118-119; revised manuscript) as follows: “Previous reaction-diffusion models, including chemotactic cells, have reproduced the resolution of inflammation in the lung [Bayani et al. 2020a, Bayani et al. 2020b]”

      One key point that should be mentioned in the discussion is that the model neglects any immune cells (e.g. neutrophils, macrophages) which contribute greatly to the inflammatory condition. Since these cells are motile, and also can contribute both pro- and anti-inflammatory effects, they are likely to influence spatial patterns significantly. It is not necessarily a problem that these aren't included in the model, but I feel that it is important that their omission be discussed in the manuscript.

      Au: We have now discussed the immune cells in the “Future implications” as the reviewer suggested (p.29 line 477-483; revised manuscript) as follows: “This is probably because the present model focuses on the non-chemotactic cells (e.g., including keratinocytes), whereas chemotactic cells (e.g., macrophages and neutrophils) also contribute to skin inflammation [Zhang and An 2007, Coondoo 2011]. Moreover, the present model focuses on the innate immune response, whereas the skin initiates an acquired immune response in the persistence of the innate immune response. Therefore, incorporating the chemotactic cells and acquired immune response into the model will reproduce the end of the expansion.”

      Reviewer #1 (Significance (Required)):

      The manuscript advances our current understanding of spatially spreading inflammation and corresponding patterns, but needs to be contextualized against existing literature as described above.

      This manuscript will appeal to theoreticians (Mathematicians) and clinicians/experimentalists alike.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors propose a minimal mechanistic mathematical model able to reproduce qualitatively different spatial patterns observed in healthy and disease epidermis. The starting point is a systematic review of medical images of different dermatological conditions, which they classify and successfully capture according to the spatial patterns. It is an interesting piece of work, but I consider that it will gain significance if the theoretical results are compared again with the clinical data. Specifically, the authors show a very interesting map between parameter regions and different spatial patterns; this result should be compared back to clinical data, to confirm that specific changes in spatial patterns indeed result from predicted changes in a specific parameter (e.g., due to a genetic condition that affects a feedback strength).

      Authors: We thank you for providing your valuable comments on our manuscript.

      Following your suggestion about the comparison of theoretical results with the clinical data, we have predicted which specific parameters including the feedback strength cause specific transitions of spatial patterns in the respective diseases. The discussion was added on p.26 lines 415-438 in the revised manuscript as follows: “The parameter-to-patterning correspondence (Fig. 4A, B, S2 Fig., and S3 Fig.) allows us to infer the pathogenesis mechanism in various diseases exhibiting each of diverse expanding patterns (seen in Table 2). For instance, psoriasis exhibits all five expanding patterns (Table 2) and increased levels of pro-inflammatory mediator (TNF-α) [Ringham et al. 2019], which is consistent with our theoretical results. The elevated pro-inflammatory mediator in psoriatic skin has been suggested to be caused by genetic mutations affecting regulatory feedback [Valeyev et al. 2010]. Considering these previous studies, our model predicts a psoriasis progression where fading pattern transits to arcuate, polycyclic, gyrate, annular, and circular pattern where increase in the TNF-α level is possibly due to mutation-induced alteration in the feedback parameters, e.g., increase of the production of pro-inflammatory mediator qa (Fig. 4A). Alternatively, Lyme disease exhibits circular, annular, and polycyclic patterns (Table 2). A clinical report showed that patients in Missouri predominantly exhibit an annular pattern without prognostic symptoms, while those in New York tend to exhibit a circular pattern with prognostic symptoms following the same treatment [Wormser et al. 2005]. Considering our theoretical result that the overproduction of pro-inflammatory mediators and the depletion of anti-inflammatory mediators leads to the annular and circular pattern, respectively (Fig 4, 5A, and B), altered levels of pro-inflammatory and anti-inflammatory mediators may significantly impact the development and prognosis of Lyme disease in Missouri and New York patients, respectively.

      These qualitative parameter estimations will be verified in the future through parameter quantification in each diseased skin exhibiting any expanding patterns. By incorporating this quantitative correspondence between patterns and parameters measured in each disease into the present model, we would develop each disease-specific model with a quantitative predictability of how much change of the skin parameters transit from healthy to diseased pattern or vice versa. Therefore, this study provides the first step to controlling the healthy-to-diseased transition of skin inflammation via diffusive mediator feedback.”

      Another shortcoming of this work is that some of the conclusions are rushed: the parameter-to-spatial patterns analysis would strongly benefit from adding a quantitative to the qualitative description, e.g., mapping how changes in a given parameter value results in gradual changes in fading speed. Along the same line, the stability analysis for the different fading pattens was performed only for selected parameter values, it is not clear how variations in parameter values affect the sizes of the basins of attraction of the different steady states; we want to make sure that the parameter values were not cherry-picked. Further, given that the authors show bistability for some parameter values, then the dependency on initial conditions on the final spatial pattern should be more extensively investigated.

      Au: We have incorporated these comments by adding a quantitative description including new results and future research strategies following each of the three constructive suggestions raised by the reviewer.

      First, regarding “the fading speed” the reviewer suggested, fading speed is affected by changes in parameters involved in mediator production. In particular, the speed is reduced by an increase in the production parameters of pro-inflammatory mediators (pa, qa) and a decrease in those of anti-inflammatory mediators (pi, qi) (Fig.2. C and D). Moreover, “the size of the basins” the reviewer pointed out corresponds to the distance between ST (Threshold) and SH (Healthy state) in the cases with excitability. The distance between ST and SH becomes closer indicating the health state being less stable when pro-inflammatory mediators (pa, qa) increase or anti-inflammatory mediators (pi, qi) decrease from the healthy fading pattern. The imbalance of the mediator production transits the fast fading pattern with a small trajectory into a slow fading pattern with a larger trajectory. As imbalance goes on, the expanding pattern appears in the order of arcuate, polycyclic, and gyrate (Fig. 5). In cases with bistability, the size of basins corresponds to the relative distance ST to SH and ST to SI (Inflamed state). The circular and annular patterns appear when the distance between ST and SH is closer. On the other hand, when the distance between ST and SI was closer, the inflamed area shrank rather than expanded. The shrinking pattern appeared by reducing the production of pro-inflammatory mediators (pa, qa) or increasing the production of anti-inflammatory mediators (pi, qi) under conditions of stability. We have added a new figure and described this finding in Results (p.24 lines 384-388; revised manuscript) as follows: “As a result, we found that the distance between the healthy state (SH) and the threshold state (ST, a closer unstable steady state to SH) was the smallest in the gyrate pattern and increased in the order of polycyclic, arcuate, slow fading pattern, and fast fading pattern (Fig. 5C–F, S4 Fig. B and C). The fast fading pattern showed a smaller trajectory (green curve in S4 Fig. B and C) of change in the mediator concentration than the slow fading pattern.”

      Second, regarding “the dependency on initial conditions”, we have further added a new result (p.24 line 374-382; revised manuscript) as follows: “The number of stable states determines the pattern regardless of the initial condition in the spatial distribution of mediator concentration. Similar to the fading pattern (Fig. 2), the arcuate, polycyclic, and gyrate patterns with the excitability appeared reproducibly, independently of the initial conditions due to a single stable state SH (Fig. 5C-F). Even in circular and annular patterns with bistability where the threshold ST was closer to the inflamed state SI than the healthy state SH (Fig. 5A-B), the final spatial pattern was dominated by the SI independently of the initial condition. On the contrary, when ST was closer to the SH than the SI, the inflamed area shrank rather than fading (S4 Fig. A). These results are general outcomes of the traveling wave of bistable systems [Murray 2002], and consistent with the previous theoretical studies on inflammations [Sudo and Fujimoto 2022, Volpert 2009]. ”

      Finally, we have added “a quantitative to the qualitative description as a future research strategy (p.27 line 432-438; revised manuscript) as follows: “These qualitative parameter estimations will be verified in the future through parameter quantification in each diseased skin exhibiting any expanding patterns. By incorporating this quantitative correspondence between patterns and parameters measured in each disease into the present model, we would develop each disease-specific model with a quantitative predictability of how much change of the skin parameters transit from healthy to diseased pattern or vice versa. Therefore, this study provides the first step to controlling the healthy-to-diseased transition of skin inflammation via diffusive mediator feedback.”

      For reproducibility it is essential that the authors add a much more detailed description of the methods, including the software tools / numerical analysis tools used. Making the code publicly available would also be very beneficial to ensure the reproducibility of the results.

      Au: Following your suggestion, we have added a description of the methods, including the simulation code, to the “Methods” (p.13 lines 231-232; revised manuscript) as follows: “A simulation code written in C language is available from GitHub: https://github.com/MakiSudo/Erythema-Patterns/blob/main/AInondim.c.”

      In conclusion, the work is very interesting and worth publishing, but requires (a) to come back to the clinical data for validation of model predictions, (b) a more thorough and quantitative investigation of the effects of parameter variations on model behaviors, (c) a more rigorous and systematic presentation of the methods, (d) carefully explaining how the proposed model is similar / differs to the classical activator -inhibitor model proposed by Turing, and (e) discussing / showing if the fading patterns result from a turning instability.

      Au: For (a) “validation of model predictions,” (b) “model behaviors,” and (c) “a more rigorous and systematic presentation of the methods,” we have reflected your suggestions in the revised manuscript as described above.

      Regarding (d) and (e), we have added an explanation of “how the proposed model is similar/differs to the classical activator–inhibitor model” and “if the fading patterns result from Turing instability” after the model construction in Methods (p.11-12 line 210-216; revised manuscript) as follows: “Reaction terms of this model are similar to the classical activator-inhibitor model proposed by Turing [Turing 1952], which includes the negative feedback of the activator through the inhibitor and the positive feedback of the activator. These reaction terms potentially result in Turing instability. However, the present model setting does not show Turing instability. The reason is that Turing instability requires a large difference between the diffusion coefficients of the activator and inhibitor [Murray 2002], whereas these coefficients in the present model were set to be equal based on molecular findings that these molecular weights are close in proximity [Coondoo 2011]. ”

      **Referees cross-commenting**

      I agree with the comments from Reviewer #1.

      Reviewer #2 (Significance (Required)):

      The work aims to bridge mathematical modelling to dermatological practice, which is much needed to enable the use of theoretical and computational tools to clinical decision-making. While some mathematical models of skin inflammation have been proposed in the past (refer to papers from the RJ Tanaka group in systems dermatology), most of these do not consider explicitly the spatial component, which is crucial for modelling the clinically visible spatial patterns. Potentially interested audience includes biomathematicians, systems biologists, systems dermatologists, and, if the validation of the model predictions is achieved (as suggested above), also dermatologists.

      I am a systems biologists working on multi-scale mechanistic mathematical modelling of epithelial tissue diseases. The work I just reviewed falls exactly within my area of expertise.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors propose a minimal mechanistic mathematical model able to reproduce qualitatively different spatial patterns observed in healthy and disease epidermis. The starting point is a systematic review of medical images of different dermatological conditions, which they classify and successfully capture according to the spatial patterns. It is an interesting piece of work, but I consider that it will gain significance if the theoretical results are compared again with the clinical data. Specifically, the authors show a very interesting map between parameter regions and different spatial patterns; this result should be compared back to clinical data, to confirm that specific changes in spatial patterns indeed result from predicted changes in a specific parameter (e.g., due to a genetic condition that affects a feedback strength). Another shortcoming of this work is that some of the conclusions are rushed: the parameter-to-spatial patterns analysis would strongly benefit from adding a quantitative to the qualitative description, e.g., mapping how changes in a given parameter value results in gradual changes in fading speed. Along the same line, the stability analysis for the different fading pattens was performed only for selected parameter values, it is not clear how variations in parameter values affect the sizes of the basins of attraction of the different steady states; we want to make sure that the parameter values were not cherry-picked. Further, given that the authors show bistability for some parameter values, then the dependency on initial conditions on the final spatial pattern should be more extensively investigated.

      For reproducibility it is essential that the authors add a much more detailed description of the methods, including the software tools / numerical analysis tools used. Making the code publicly available would also be very beneficial to ensure the reproducibility of the results.

      In conclusion, the work is very interesting and worth publishing, but requires (a) to come back to the clinical data for validation of model predictions, (b) a more thorough and quantitative investigation of the effects of parameter variations on model behaviors, (c) a more rigorous and systematic presentation of the methods, (d) carefully explaining how the proposed model is similar / differs to the classical activator -inhibitor model proposed by Turing, and (e) discussing / showing if the fading patterns result from a turning instability.

      Referees cross-commenting

      I agree with the comments from Reviewer #1.

      Significance

      The work aims to bridge mathematical modelling to dermatological practice, which is much needed to enable the use of theoretical and computational tools to clinical decision-making. While some mathematical models of skin inflammation have been proposed in the past (refer to papers from the RJ Tanaka group in systems dermatology), most of these do not consider explicitly the spatial component, which is crucial for modelling the clinically visible spatial patterns. Potentially interested audience includes biomathematicians, systems biologists, systems dermatologists, and, if the validation of the model predictions is achieved (as suggested above), also dermatologists.

      I am a systems biologists working on multi-scale mechanistic mathematical modelling of epithelial tissue diseases. The work I just reviewed falls exactly within my area of expertise.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript provides a model of interacting populations of pro- and anti-inflammatory mediators to explain spatial patterns associated with various inflammatory conditions. The work is robust and articulated well, and is certainly scientifically relevant.

      Minor amendments:

      Personally, I feel that the model should be reported prior to the results, as the choice of model is likely to have great significance on the observations. It would be preferable for the reader to have a clear picture of the governing equations in their mind as they digest the results.

      The literature review is largely relatively thorough; however, I think it is important that the previous works of Joanne Dunster (University of Reading) and collaborators are included, as these are very closely related to this work. In particular, the authors should note the following two papers, which take a spatial approach:

      • Bayani, A., Dunster, J.L., Crofts, J.J. et al. Mechanisms and Points of Control in the Spread of Inflammation: A Mathematical Investigation. Bull Math Biol 82, 45 (2020). https://doi.org/10.1007/s11538-020-00709-y
      • Bayani A, Dunster JL, Crofts JJ, Nelson MR (2020) Spatial considerations in the resolution of inflammation: Elucidating leukocyte interactions via an experimentally-calibrated agent-based model. PLoS Comput Biol 16(11): e1008413. https://doi.org/10.1371/journal.pcbi.1008413

      One key point that should be mentioned in the discussion is that the model neglects any immune cells (e.g. neutrophils, macrophages) which contribute greatly to the inflammatory condition. Since these cells are motile, and also can contribute both pro- and anti-inflammatory effects, they are likely to influence spatial patterns significantly. It is not necessarily a problem that these aren't included in the model, but I feel that it is important that their omission be discussed in the manuscript.

      Significance

      The manuscript advances our current understanding of spatially spreading inflammation and corresponding patterns, but needs to be contextulised against existing literature as described above.

      This manuscript will appeal to theoreticians (Mathematicians) and clinicians/experimentalists alike.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      Thank you for the constructive comments and suggestions from the reviewers to further strengthen our manuscript (RC-2023-02156) entitled, “CNTN4 modulates neural elongation through interplay with APP”. In response to the reviewer’s comments, we have outlined the following revision plan. Please find the point-by-point responses to the reviewer comments in red. All the additions and changes in the manuscript are shown as track changes. We trust that the revised manuscript and revision plan will meet the approval of the editor and reviewers. We would also be glad to respond to any further questions and comments that you may have.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer 2 6) Figure 8 C-E shows a reduction of APP mRNA in SH-SY5Y knockdown of CNTN4 and a reduction CNTN4 mRNA in SH-SY5Y knockdown for APP. These data suggest might suggest that "interaction between CNTN4 and APP contributes to their gene expression". However, this observation needs to be proved mainly in the CNTN4 and APP KO mice.

      Thank you for your insightful comment. We recognize the importance of validating this observation in CNTN4 and APP knockout mice. In line with your suggestion, we are currently conducting these experiments and plan to incorporate the results into our revised manuscript.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer #1 Page 12, lines 7-9. The conclusion here is that CNTN4 KO and APP KO phenotypes are different. But a more interesting way to look at it is that the Scholl analysis of dendrites shows that they are almost exactly the reverse of each other with regard to near and distal morphologic differences. That could be interesting. Maybe CNTN4 binding inhibits APP signaling or binding to another partner, or APP binding inhibits CNTN4 signaling or binding to another partner, or both. Then loss of either one would hyper-activate the signal induced by the other and give the observed yin-yang phenotypic relationship. I note that this would not fit with the neuroblastoma phenotypes, which seem to be in the same direction; but a developing brain is different in many ways from a neuroblastoma cell in culture. The Discussion is also somewhat vague about possible interpretations of the two phenotypes in vivo.

      Thank you for your valuable insights. In response to your comments, we have expanded our discussion (line 420) to include the following considerations: “Our hypothesis is that when CNTN4 is deficient there are two possibilities considered, 1) the function to which the binding of CNTN4 to APP contributes is lost, and the ability of CNTN4 to regulate dendritic spine formation diminishes which would cause abnormal neurite outgrowth (Figure 8); 2) the loss of CNTN4 would cause other proteins to alternately bind to the E1 domain of APP and affect neurite outgrowth and arborization. For example, the arborization trends of the near and distal Sholl apical and basal dendrites are the opposite of one another in the Cntn4- and App-deficient mice, respectively. Loss of either CNTN4 or APP may activate these opposing scenarios through inhibition of signaling, binding to another partner or both. However, further studies are needed to understand the interplay.”

      Reviewer #1 Where is APP normally expressed in the developing mouse brain? A few sentences in the introduction or discussion would be helpful. This information was provided for Cntn4 in the introduction.

      Thank you for this suggestion. We have now included important background information on APP expression in the Introduction (line 95): “Osterfield et al. has previously shown a direct binding between CNTN4 and transmembrane amyloid-beta precursor protein (APP). Expression of APP in mice has been observed early in development, and is ubiquitously expressed in adult mice (45).”

      Reviewer #1 Minor grammatical items.

      Page 3, line 6. "Over 1000 genes..."

      Page 4, line 17. "...battery to study Cntn4-deficient mice, which revealed subtle..."

      Page 5. First sentence in the Results section. "the cortical layer thickness is involved in migration" - that phrase needs to be reworked.

      Page 6, line 16. "total numbers of cells or of neurons"

      Page 8, line 12. "maturity morphologies" - that phrase needs to be reworked.

      Page 8, line 16. "In vitro primary cell culturing..." should be "Primary cell culturing...". After all, the only kind of cell culturing is in vitro.

      Thank you for pointing out these grammatical issues. We appreciate your attention to detail and have carefully revised each of the mentioned sections in the manuscript to ensure clarity and accuracy. The corrections have been implemented as follows:

      Page 3, line 6: The phrase "Over 1000 genes..." has been added into line 52

      Page 4, line 17: "...battery to study Cntn4-deficient mice, which revealed subtle..." has been added to line 88

      Page 5, first sentence in the Results section is now “In the cerebral cortex, the cortical layer thickness is related to migration and may be an indicator of neurodevelopment abnormalities.” (line 113)

      Page 6, line 16: "total numbers of cells or of neurons" has been added to line 138

      Page 8, line 12: The term "maturity morphologies" has been replaced by “spine morphologies” in line 183.

      Page 8, line 16: The phrase "In vitro primary cell culturing..." has been replaced by "Primary cell culturing..." (line 188)

      Reviewer #2 1) Figure 1A-E shows the organization of cortical layers in the CNTN4+/- and CNTN4-/- respect to WT mice. Looking at the NeuN staining the images in C show a reduction of the NeuN+ neurons of the upper layer in the CNTN4-/- mice with respect to WT mice, confirmed by the quantification and an increase of the NeuN+ neurons of the lower layer in the CNTN4-/-mice respect to WT mice, not confirmed by the quantification. Or upper layer thickness is reduced and the density of the NeuN+ is not changed. Also, surprisingly the Cux1/NeuN+ neurons are reduced in the CNTN4+/- compared to CNTN4-/- and WT mice, but images for the CNTN4+/- were not shown. These results need to be better clarified.

      Thank you for your insightful comments. We understand the importance of clearly presenting the staining differences among the layers however were faced with limitations due to space constraints in including the Cntn4+/- images initially. In response, we have revised Figure 1 to display the three phenotypes (Cntn4+/+, Cntn4+/-, and Cntn4-/-) side-by-side using smaller images for a more comprehensive comparison. Regarding the quantification presented in Figure 1D and 1E, our analysis indicates that while there is a reduction in the number of NeuN+ neurons in the upper layers of the Cntn4-/- mice, this reduction is not statistically significant when compared to the Cntn4+/+ mice. A similar pattern is observed in the lower layers. However, we did observe a significant reduction in the thickness of the upper layer in the Cntn4-/- mice, which suggests a change in cell density, even though the overall number of cells (DAPI+) remains consistent across phenotypes. This implies a shift in the proportion of cells in the Cntn4-/- mice. We have now clarified this in the results section, emphasizing the change in neuronal proportion rather than density (line 144), to better convey our findings.

      Reviewer #2 2) Figure 3 shows the quantification of dendritic spines but there are no images to support the quantification, in particular, it's unclear how "abnormal spines" were morphologically defined.

      Thank you for your valuable feedback regarding Figure 3. In response to your comment, we have now incorporated representative images of the apical dendritic spines into Figure 3. These images feature white arrows pointing to specific examples of spine morphology, thereby visually supporting our quantification. To further clarify how the dendritic spines were morphologically categorized, we have updated both the Materials and Methods section under 'Golgi Staining' and the legend of Figure 3. Furthermore, we have referenced a pertinent study in the Methods section where similar categorization of spine morphology has been undertaken. This citation provides a methodological context and validation for our approach in spine classification. Additionally, the reader can refer to Figure 3A schematic for examples.

      Reviewer #2 5) Figure 6 shows a nice characterization of dendrite arborizations in the APP-/- mice. However, these results are not really related to the function of CNTN4. Indeed, the minimal alteration in the number of apical dendrite tips that have been described in the APP-/- mice might be due to a function of APP unrelated to the interaction with CNTN4.

      Thank you for highlighting this aspect of our study. We acknowledge that the findings in the APP-/- mice, as presented in Figure 6, might initially seem tangential to the primary focus on CNTN4. However, our intention in examining the App-/- mice was guided by prior studies indicating a potential link between APP and the pyramidal neuron phenotypes observed in cortical neurons. This exploration was aimed at broadening our understanding of APP's role in neuronal development, which, while not exclusively tied to its interaction with CNTN4, is nevertheless relevant to the overarching context of our research. To address your concern and enhance clarity, we have made an explicit statement in the manuscript's discussion section (line 383): “Results in the App-/- mice cannot be attributed solely to any interaction APP may have with CNTN4.”

      Reviewer #2 4) Figures 6 B and C should show the colocalization of CNTN4 with APP, however, it's difficult to see any colocalization with images at this low magnification. Please provide images at higher magnification.

      Thank you for the comment. We have adjusted Figure 6B and 6C to include zoomed in versions of the existing image to highlight regions of colocalization. It should also be noted from the description in the main text results that the expression pattern isn’t just colocalization.

      Reviewer #2 3) The results of Figure 4 do not really provide a significant clue regarding the function of CNTN4 in relation to all the other data presented in this paper. Also, the staining of CNTN4 should be shown.

      Thank you for your feedback regarding Figure 4. We understand your concerns about the relevance of these results to the overall function of CNTN4 as explored in our study. Our objective with Figure 4 was to contrast the effects of CNTN4 overexpression in primary cultured neurons with the phenotypes observed in the CNTN4 knockout detailed elsewhere in the manuscript. This comparison was intended to provide a more comprehensive understanding of CNTN4's role in neuronal development and function. To address your point about the visualization of CNTN4, we have now included more explicit details in the legend of Figure 4 (line 1210). Both the full-length Cntn4 construct and the empty pcDNA3.1 control vector used in our experiments are tagged with GFP.

      Reviewer #1 Page 12, bottom. One would expect that the effect of CRISPR KO would be a complete elimination of the western blot band. It appears from the Figure and text that there is a little bit of residual signal. Could that band simply be cross-reactivity with another protein? Could it be protein contamination from serum? Or is the cell line not clonally pure?

      Thank you for your comment and for pointing out the ambiguity in our manuscript. We have now revised the relevant sections to describe mRNA and protein expression levels more accurately.

      To clarify, in our study, CRISPR knockout effectively eliminated CNTN4 protein expression in the CNTN4 knockout cell line, and similarly, APP protein expression was completely diminished in the APP knockout cell line. This complete reduction aligns with the expected outcomes of successful CRISPR knockout. Furthermore, we observed that the level of CNTN4 protein expression in the APP knockout cell lines showed a reduction of approximately 50%. Similarly, the level of APP protein expression in the CNTN4 knockout cell lines was reduced by about 50%. We believe these findings suggest an interdependent regulatory mechanism between CNTN4 and APP, which we have now elaborated upon in the revised manuscript (line 285).

      Reviewer #2 7) The discussion is too long and needs to be more concise.

      Thank you for your feedback regarding the length of our discussion section. We appreciate your guidance on enhancing the manuscript's clarity and focus. In response to your comment, we have thoroughly reviewed and condensed the discussion. Our aim was to streamline the content without compromising the coverage of our broad study.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reviewer #1 Page 11. It isn't clear whether the binding of a soluble protein ligand to a cell-surface protein is measuring cis or trans binding configurations. It could be either or both, depending on the geometry of the interaction. Demonstrating a bone fide cis interaction is not easy - that requires a FRET experiment with tagged cell-surface proteins or a cryoEM structure.

      Thank you for your insightful suggestion regarding the experimental approach to differentiate between cis and trans binding configurations. We acknowledge the importance of distinguishing these interactions and the potential insights that such experiments, like FRET with tagged proteins or cryoEM structure analysis, could provide. However, after careful consideration, we have concluded that incorporating these specific methodologies would extend beyond the current scope of our paper. While the suggestion of a more detailed examination through FRET or cryoEM is undoubtedly valuable, it would necessitate a separate set of experimental conditions and analyses, potentially forming the basis for future research.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, Bamford et al. showed a reduced cortical thickness in the motor cortex of Cntn4-/- mice, but cortical cell migration and differentiation were unaffected. They also found morphological changes in neurons in the M1 region of the motor cortex, indicating that CNTN4 is also involved in the morphology and spine density of neurons in the motor cortex. With mass spectrometry analysis they identified a number of interaction partners for CNTN4, among then they confirmed a previously demonstrated interaction between CNTN4 and APP. Thus, this study demonstrates that CNTN4 contributes to cortical development and that its binding and interplay with APP might also be important for neural elongation.

      The paper shows nice and convincing results, but the following points should be fully addressed in order to improve the significance of these findings.

      1. Figure 1A-E shows the organization of cortical layers in the CNTN4+/- and CNTN4-/- respect to WT mice. Looking at the NeuN staining the images in C show a reduction of the NeuN+ neurons of the upper layer in the CNTN4-/- mice with respect to WT mice, confirmed by the quantification and an increase of the NeuN+ neurons of the lower layer in the CNTN4-/-mice respect to WT mice, not confirmed by the quantification. Or upper layer thickness is reduced and the density of the NeuN+ is not changed. Also, surprisingly the Cux1/NeuN+ neurons are reduced in the CNTN4+/- compared to CNTN4-/- and WT mice, but images for the CNTN4+/- were not shown. These results need to be better clarified.
      2. Figure 3 shows the quantification of dendritic spines but there are no images to support the quantification, in particular, it's unclear how "abnormal spines" were morphologically defined.
      3. The results of Figure 4 do not really provide a significant clue regarding the function of CNTN4 in relation to all the other data presented in this paper. Also, the staining of CNTN4 should be shown.
      4. Figures 6 B and C should show the colocalization of CNTN4 with APP, however, it's difficult to see any colocalization with images at this low magnification. Please provide images at higher magnification.
      5. Figure 6 shows a nice characterization of dendrite arborizations in the APP-/- mice. However, these results are not really related to the function of CNTN4. Indeed, the minimal alteration in the number of apical dendrite tips that have been described in the APP-/- mice might be due to a function of APP unrelated to the interaction with CNTN4.
      6. Figure 8 C-E shows a reduction of APP mRNA in SH-SY5Y knockdown of CNTN4 and a reduction CNTN4 mRNA in SH-SY5Y knockdown for APP. These data suggest might suggest that "interaction between CNTN4 and APP contributes to their gene expression". However, this observation needs to be proved mainly in the CNTN4 and APP KO mice.
      7. The discussion is too long and needs to be more concise.

      Significance

      This study demonstrates that CNTN4 contributes to cortical development and that its binding and interplay with APP might also be important for neural elongation. However, the role of APP interaction on CNTN4 function was not well demonstrated.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      9-17-2023

      Bamford et al present a very thorough series of experiments to explore the role of CNTN4 and its interacting partner APP in the development of the mouse cerebral cortex. The manuscript presents the following analyses/experiments: (1) a thickness analysis of different cortical areas in the CNTN4 KO mouse, finding reduced cortical thickness and reduced cell numbers in motor cortex; (2) a pyramidal dendrite and spine morphologic analysis in motor cortex in the CNTN4 KO mouse, finding defects in proximal dendritic morphology and altered spine distribution; (3) a biochemical (mass spectrometry) search for CNTN binding partners in 293 cells, finding APP among others and confirming a previous finding; (4) a cell adhesion analysis in transfected 293 cells showing a trans interactions between the extracellular domains of CNTN4 and APP; (5) an analysis of pyramidal dendritic morphology in APP KO mouse motor cortex that looks like the reciprocal of the Scholl analysis phenotype observed with CNTN4 KO mouse cortex, and (6) a demonstration that neurite outgrowth is reduced in cultured SH-SY5Y neuroblastoma cells with CRISPR-mediated deletion mutations of either CNTN4, APP, or both. The experiments look to be technically well done, and they data are interpreted with an appropriate level of caution.

      This work will be of substantial interest to scientists working on CNS development in general and cortical development in particular. Genetic variation in CNTN4 has been implicated in developmental disorders, such as autism spectrum disorder, and the present work represents an important step forward in defining the mechanistic underpinning of that phenotype. This work will also be of interest to those studying Alzheimer disease who wonder (as many have) what the normal function of APP is.

      Minor comments:

      Page 11. It isn't clear whether the binding of a soluble protein ligand to a cell-surface protein is measuring cis or trans binding configurations. It could be either or both, depending on the geometry of the interaction. Demonstrating a bone fide cis interaction is not easy - that requires a FRET experiment with tagged cell-surface proteins or a cryoEM structure.

      Page 12, lines 7-9. The conclusion here is that CNTN4 KO and APP KO phenotypes are different. But a more interesting way to look at it is that the Scholl analysis of dendrites shows that they are almost exactly the reverse of each other with regard to near and distal morphologic differences. That could be interesting. Maybe CNTN4 binding inhibits APP signaling or binding to another partner, or APP binding inhibits CNTN4 signaling or binding to another partner, or both. Then loss of either one would hyper-activate the signal induced by the other and give the observed yin-yang phenotypic relationship. I note that this would not fit with the neuroblastoma phenotypes, which seem to be in the same direction; but a developing brain is different in many ways from a neuroblastoma cell in culture. The Discussion is also somewhat vague about possible interpretations of the two phenotypes in vivo.

      Page 12, bottom. One would expect that the effect of CRISPR KO would be a complete elimination of the western blot band. It appears from the Figure and text that there is a little bit of residual signal. Could that band simply be cross-reactivity with another protein? Could it be protein contamination from serum? Or is the cell line not clonally pure?

      Where is APP normally expressed in the developing mouse brain? A few sentences in the introduction or discussion would be helpful. This information was provided for Cntn4 in the introduction.

      Minor grammatical items.

      Page 3, line 6. "Over 1000 genes..."

      Page 4, line 17. "...battery to study Cntn4-deficient mice, which revealed subtle..."

      Page 5. First sentence in the Results section. "the cortical layer thickness is involved in migration" - that phrase needs to be reworked.

      Page 6, line 16. "total numbers of cells or of neurons"

      Page 8, line 12. "maturity morphologies" - that phrase needs to be reworked.

      Page 8, line 16. "In vitro primary cell culturing..." should be "Primary cell culturing...". After all, the only kind of cell culturing is in vitro.

      Significance

      This work will be of substantial interest to scientists working on CNS development in general and cortical development in particular. Genetic variation in CNTN4 has been implicated in developmental disorders, such as autism spectrum disorder, and the present work represents an important step forward in defining the mechanistic underpinning of that phenotype. This work will also be of interest to those studying Alzheimer disease who wonder (as many have) what the normal function of APP is.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their comments and constructive criticisms of the manuscript. We thank them for positive comments on the high quality of the genetic screen and recognizing our contributions in respect of RDGB work in Drosophila.

      In general, there is one important comment by the reviewers about the candidates identified in the proteomic screen as potential VAP interactors which have then been tested in the genetic screen. Reviewers have noted that many proteins in the proteomics study identified as VAP interactors do not have the classical FFAT motif that mediates VAP interaction. Therefore, what is the significance of such genes?

      Response: It is important to reflect on the fact that while VAP interacts with FFAT motifs in proteins , a VAP immunoprecipitation will identify two classes of proteins (i) those with classical FFAT motifs (ii) those proteins without FFAT motifs that interact indirectly with VAP via proteins which themselves have FFAT motifs. We have already depicted this in Fig 2A as category C proteins.

      We believe that the in vivo genetic screen does in fact serve the specific purpose of testing the functional significance of such non-FFAT containing proteins identified in the proteomic screen by functional validation of their ability to modulate rdgB degeneration.

      Key modifications to the text and a few experiments planned are listed in the next sections against pointwise response to reviewer comments. We believe that this will strengthen the manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Mishra and colleagues have conducted a large genetic screen to identify modulators of a Drosophila model for retinal degeneration. Using biomolecular techniques, they selected a few hundred proteins that interact with an ER bound protein VAP, to further test them in the retinal degeneration model. This was done by downregulating their expression using interference RNA (RNAi). Degeneration was first measured through a pseudopupil analysis, then suppressors of degeneration were further tested in a different retinal degeneration model, and finally through an ERG experiment. Finally, they focused on a strong suppressor of degeneration, dCert, by using a mutant allele to confirm the findings from the RNAi. The results suggest that a handful of these candidates are suppressors in this model of retinal degeneration, and identify at which stage of retinal degeneration these proteins may be involved in. These proteins may have a significance in forms of neurodegeneration.

      Major comment: - Perhaps a larger number of replicates could be done in the optical neutralization experiment as well as in the ERG. Figure 4.A(i) and (ii), please clearly state n values. I would suggest this as optional, but perhaps ut could help to increase n?

      Each optical neutralization experiment was done using 5 independent animals and 10 ommatidia were scored from each animal. For the optical neutralization experiment in 4.A(i) and (ii) we did 5 independent animals with 10 ommatidia/ animal for the statistical score. Based on our past experience, this number is sufficient to capture the intra-ommatidial variability in each eye and the inter animal variability between animals. This information will be added to the figure legend.

      For the ERGs minimum 5 animals were used per experiment which is already mentioned in the figure legend and conforms to the standards of analysis in the field for such experiments.<br /> - For human orthologs (Table 1), it could be worthwhile to add alignment scores between fly and human?

      We will add the table with the alignment score

      Minor comment: - Clarify the purpose in focusing on dCert specifically in the last results section and discussion - Several typos - Affect vs effect

      • Following the initial genetic screen, it was necessary to characterize a genes to understand in detail the temporal and spatial aspects of it role in modulating degeneration. Dcert was chosen for several reasons (i) a classical germ line mutant allele was available (ii) Prior papers had established its role as a protein that functions at contact sites. We will clarify our purpose of including dCert as proof of principle in the discussion part.

      -Typos will be corrected.

      Reviewer #1 (Significance (Required)):

      General assessment:

      The main significance of this study comes from the focus on proteins that are known to interact with VAP. This implies that the suppressors of degeneration that they have identified in the RdgB9 model may have an effect in other neurodegenerative models, namely in ALS models. This could have a very high significant potential in therapeutic avenues for neurodegenerative diseases.

      Among six candidates that had an effect with knocked down through RNAi, they pursued a single one (dCert) as proof of principle. It would help to add a justification for this choice in the main text and whether the authors have performed or intend to perform experiments using mutant forms of the other candidate proteins.

      Although six candidate genes were available for analysis, there were no mutants available in two of them (SET and CG3071). Mutants in Yeti are homozygous lethal making it difficult to work on it in this setting. However a viable mutant in APC is now available and a CG9205 CRISPR germ line deletion mutant has recently been generated in our lab. We will use these two alleles to test their, interaction with rdgB like we did for dcert. Since dCert and CG9205 have membrane interacting domains we prefer to focused on these two genes for this study as proof of principle.

      The work from Raghu and his team have been leading the research surrounding this model of degeneration in Drosophila. This study naturally further extends their field of research, identifying more candidates that modulate this form of degeneration, and helping elucidate the pathways leading to cellular degeneration.

      These results will be of high interest for specialized researchers studying the molecular pathways that lead to cellular degeneration, both in the context of retinal degeneration as well as neurodegeneration. Specifically, researchers that may be interested in these candidate proteins and how they may play a role in the pathogenesis of various degenerative diseases.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In this study, Mishra and colleagues combine proteomic techniques with Drosophila genetics to identify interactors of the endoplasmic reticulum (ER) protein VAP and analyse their indirect implication in the light-dependent retinal degeneration phenotype caused by misfunction of the lipid transfer protein RdgB, another VAP interactor. This is an ideal model system to study neurodegeneration in vivo via which the authors aim to further understand the molecular mechanisms involved in VAP-RdgB's function in maintenance of membrane lipid homeostasis. Of an initial list of 403 VAP mammalian interactors found via Immunoprecipitation - Mass Spectromety performed in a human cell line, 52 homologous Drosophila genes are found to suppress the RdgB degeneration phenotype upon knockdown. The authors then test a series of genetic interactions to dissect the potential role of these genes in the degeneration phenotype and identify six genes that likely suppress the phenotype by directly acting on the same molecular processes as RdgB, two of which (Cert and CG9205) could have a direct mechanistic role in the lipid homeostasis maintained by the VAP-RdgB complex.

      Major comments: - The model suggested by the authors in Fig 2A is one by which the tested genes influence the VAP-RdgB function via direct binding to VAP. The direct interaction with VAP is a core aspect of the study as the IP-MS experiment biases the list of genes tested in Drosophila, and this list is often referred to as "VAP interacting proteins". However, the interactions between the proteins coded by the most relevant genes in the genetic screen and VAP are not tested. In fact, of the six genes that are found to likely modulate the same processes as RdgB, four probably do so via affecting gene expression, as discussed by the authors, therefore making it unlikely that they are true VAP interactors (unless they shuttle from the ER to the nucleus). Additionally, it would seem that most of the 52 genes found to suppress the degeneration phenotype are not necessarily VAP interactors either as only a handful of genes in this list have predicted strong FFAT motifs. In general, the authors should provide additional comments/evidence on the interaction, or likelihood of the interaction, of these proteins with VAP, that should include: o The IP-MS data should be made available (at least on my end, in the current submission the supplementary lists available cover only the 52 genes found to suppress the phenotype, I found the list of 403 genes in the biorXiv submission but this also does not include data on the enrichment of each hit in the IP-MS). It should be made clear how enriched were each of the proteins in the analysis, and how high the most relevant genes in the genetic screen rank within this interactor list.

      We will provide more details on the MS data. We can add the ratio (VAP WT vs VAP mutant) and the PSM, number of peptides etc in the table, and we will deposit the raw data on PRIDE repository.

      It is true that some of the genetic interactors, especially those that do not have an obvious FFAT motif likely influence the retinal degeneration phenotype either by indirect interactions with VAP or functional interactions rather than structural interaction. This point can be emphasized in the discussion. It is important to note that this is in some senses a good reason to couple of protein interaction screen with a genetic screen., the genetic screen sometimes uncovering functional interactions via indirect mechanism. The likely mode of interaction can be made clear in a revision.

      o The authors should provide further detail on the rationale behind defining the list of 403 genes to be tested (for example, what was the threshold enrichment considered as interaction). Also in relation to this, the authors should at least provide speculation as to why more than half of the 403 genes defined do not have an FFAT motif, despite the fact that the proteomic data was normalized to a non-FFAT motif binding mutant of VAP which should be capable of maintaining non-FFAT mediated interactions of the protein.

      As the reviewer correctly mentions, the 403 genes from the proteomics screen used for the genetic screen all satisfied the criterion of being differentially enriched in binding seen to wild type VAP but not the non-binding version of VAP. This is despite that fact that many of these proteins do not have an identifiable FFAT motif. The reason for this is most likely that the candidates without the FFAT motif most likely bind to VAP indirectly via a protein which itself has an FFAT motif. This is depicted in Figure 2A. This has already been explained in the text but this can be elaborated further.

      The rationale for including candidates from the proteomic screen without an FFAT motif in the genetic screen is that we were interested in all candidates that might influence RDGB function whether they are direct or indirect binders of VAP.

      o The authors should acknowledge the limitations of their experimental design in regards to identifying real interactors of VAP in Drosophila and avoid referring to this set of genes as "VAP interacting proteins" and rather use a more accurate description such as "proteins enriched in the IP-MS" or at least "potential VAP interacting proteins".

      We agree that the use of ‘potential VAP interactors’ may be more appropriate.

      o Testing interaction between VAP and all the 52 genes found to suppress the phenotype would be a huge amount of work. But the finding most relevant to the initial premise of the study (i.e. "molecular mechanisms underlying lipid transfer protein function at membrane contact sites") is that Cert (a lipid transfer protein) and CG9205 (fly homolog of a mammalian lipid transfer protein) influence RdgB function. Demonstrating an interaction between these proteins and VAP would argue for the experimental design and support the hypothesis and model of the study. Cert is already a well established interactor of VAP, hence the authors would not need to add anything regarding this protein. Is CG9205 expected to be also a true interactor of VAP? Biochemical experiments could be used to test this idea, or even recently developed in silico modelling of interactions (i.e. AlphaFold Multimer) could be of help. If no interaction is observed/expected this should also be pointed out in the manuscript. Optionally, showing localization of Cert or CG9205 to the ER-PM interface would also greatly support the model of VAP-RdgB regulation suggested by the authors.

      We agree that further experimental evidence to support the interaction of CG9205 would add useful information. This can be attempted by co-IP or by in silico methods such as Alpha fold multimer.

      Minor comments: - In the model shown in Fig2A, it would seem that many proteins can bind VAP in addition to RdgB, however, VAP proteins have only one FFAT binding pocket. This model would only be possible if oligomerization of VAP is considered (oligomerization of VAP has been reported to occur, see for example PMID:20207736). The model should be redrawn considering this fact.

      We agree.

      • In line 161 of the text VPS13D is mentioned, however VPS13C is the gene indicated in Fig 1D.

      The text will be corrected.

      Reviewer #2 (Significance (Required)):

      Previous studies by some of the authors and others have shown that RdgB can transfer lipids between the ER, to which it binds via VAP, and the plasma membrane (PM), and is required for proper replenishment of PI(4,5)P2 in the PM which is in turn necessary for sustained PLC signaling in Drosophila photoreceptors. Lack of RdgB leads to light-dependent degeneration of the retina, and hence it is utilized by the authors as a model for neurodegeneration. Given the clear phenotype of RdgB loss-of-function and the ease of Drosophila genetics, this system represents an ideal model to perform screens for the identification of new genes involved in maintaining neuronal lipid homeostasis required for proper function of the photoreceptors in vivo, and this aspect is the main strength of this study. Importantly, the use of this system could also shed light on the mechanisms behind human neurodegenerative disorders, as many of these involve dysregulation of lipid signaling and lipid transfer at membrane contact sites. A novel and interesting finding is the identification of another lipid transfer protein, Cert, to be involved in the degeneration of photoreceptors.

      The main limitation lies in the experimental design proposed by the authors to define the genes that are studied in their system. These are identified as potential interactors of human VAP in a mammalian cell line. Despite the fact that VAP is a highly conserved protein, and the genes identified are present in Drosophila as well, there is no evidence that these interactions are in fact occurring in Drosophila photoreceptors, and in fact, based on the function and the lack of VAP-binding motif in many of the 52 genes identified to have an effect on the RdgB phenotype, it is likely that many of the interactions are purely genetic and indirect, and that the modulation of the phenotype could in fact be due to a wide variety of factors (including, as discussed by the authors, gene expression, post-translational modifications, trafficking of proteins, etc) unrelated to mechanisms of VAP-RdgB mediated lipid transfer at ER-PM membrane contact sites.

      A more unbiased screen could have been carried out to identify VAP interactors involved in this degeneration phenotype by testing all of the FFAT or FFAT-related motif containing proteins. Due to this initial bias in the selection of genes to be tested, it is possible that other important VAP interactors that play a role at the ER-PM interface of photoreceptors have not been identified.

      We agree that an alternative approach might have been to perform the VAP interaction proteomics in fly photoreceptors rather than start with a proteomics data set from mammalian cells. At this late stage in the project this will not be a feasible approach. However, we could consider testing any FFAT containing proteins, identified bioinformatically in the fly genome in the future.

      An initial bioinformatics analysis has revelated that there are only 51 genes in the entire fly genome with an identifiable conventional FFAT motif. Of these 7 are already part of the genetic screen already completed. Of the remaining 44 genes, 11 show no expression in the eye and 9 show very low expression. Thus, using the approach suggested by the reviewer ca. 24 genes with FFAT motifs could have been missed and therefore could be screened, subject to genetic tools, i.e RNAi lines being available for these.

      This study provides great functional advance in the understanding of genes implicated in photoreceptor degeneration, and in those regards it is a great resource for a specialized audience, as it enables further characterization by others of the different processes implicated in this neurodegeneration phenotype. However, the advance is small in regards to the core mechanism of RdgB function at VAP-mediated ER-PM, which was the main aim of the article and the most broadly interesting aspect of the study. Many of the VAP-interacting proteins identified in the proteomic approach were already expected to be VAP interactors as they contain FFAT motifs, and these FFAT-containing proteins do not seem to have a major role in VAP-RdgB maintenance of neuronal lipid homeostasis, with the exception of Cert. The implication of Cert in RdgB-mediated lipid homeostasis is certainly interesting as it touches on a current topic in the field of membrane contact sites related to how the multiple interactions of VAP, a universal contact site adaptor at the ER, are regulated and influenced by each other.

      Reviewer's field of expertise: Lipid transfer at membrane contact sites; membrane lipid homeostasis in neurons. All of the Drosophila data seem to be of good general quality to me, but I do not have any expertise in Drosophila work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this study, Mishra and colleagues combine proteomic techniques with Drosophila genetics to identify interactors of the endoplasmic reticulum (ER) protein VAP and analyse their indirect implication in the light-dependent retinal degeneration phenotype caused by misfunction of the lipid transfer protein RdgB, another VAP interactor. This is an ideal model system to study neurodegeneration in vivo via which the authors aim to further understand the molecular mechanisms involved in VAP-RdgB's function in maintenance of membrane lipid homeostasis. Of an initial list of 403 VAP mammalian interactors found via Immunoprecipitation - Mass Spectromety performed in a human cell line, 52 homologous Drosophila genes are found to suppress the RdgB degeneration phenotype upon knockdown. The authors then test a series of genetic interactions to dissect the potential role of these genes in the degeneration phenotype and identify six genes that likely suppress the phenotype by directly acting on the same molecular processes as RdgB, two of which (Cert and CG9205) could have a direct mechanistic role in the lipid homeostasis maintained by the VAP-RdgB complex.

      Major comments:

      • The model suggested by the authors in Fig 2A is one by which the tested genes influence the VAP-RdgB function via direct binding to VAP. The direct interaction with VAP is a core aspect of the study as the IP-MS experiment biases the list of genes tested in Drosophila, and this list is often referred to as "VAP interacting proteins". However, the interactions between the proteins coded by the most relevant genes in the genetic screen and VAP are not tested. In fact, of the six genes that are found to likely modulate the same processes as RdgB, four probably do so via affecting gene expression, as discussed by the authors, therefore making it unlikely that they are true VAP interactors (unless they shuttle from the ER to the nucleus). Additionally, it would seem that most of the 52 genes found to suppress the degeneration phenotype are not necessarily VAP interactors either as only a handful of genes in this list have predicted strong FFAT motifs. In general, the authors should provide additional comments/evidence on the interaction, or likelihood of the interaction, of these proteins with VAP, that should include:
      • The IP-MS data should be made available (at least on my end, in the current submission the supplementary lists available cover only the 52 genes found to suppress the phenotype, I found the list of 403 genes in the biorXiv submission but this also does not include data on the enrichment of each hit in the IP-MS). It should be made clear how enriched were each of the proteins in the analysis, and how high the most relevant genes in the genetic screen rank within this interactor list.
      • The authors should provide further detail on the rationale behind defining the list of 403 genes to be tested (for example, what was the threshold enrichment considered as interaction). Also in relation to this, the authors should at least provide speculation as to why more than half of the 403 genes defined do not have an FFAT motif, despite the fact that the proteomic data was normalized to a non-FFAT motif binding mutant of VAP which should be capable of maintaining non-FFAT mediated interactions of the protein.
      • The authors should acknowledge the limitations of their experimental design in regards to identifying real interactors of VAP in Drosophila and avoid referring to this set of genes as "VAP interacting proteins" and rather use a more accurate description such as "proteins enriched in the IP-MS" or at least "potential VAP interacting proteins".
      • Testing interaction between VAP and all the 52 genes found to suppress the phenotype would be a huge amount of work. But the finding most relevant to the initial premise of the study (i.e. "molecular mechanisms underlying lipid transfer protein function at membrane contact sites") is that Cert (a lipid transfer protein) and CG9205 (fly homolog of a mammalian lipid transfer protein) influence RdgB function. Demonstrating an interaction between these proteins and VAP would argue for the experimental design and support the hypothesis and model of the study. Cert is already a well established interactor of VAP, hence the authors would not need to add anything regarding this protein. Is CG9205 expected to be also a true interactor of VAP? Biochemical experiments could be used to test this idea, or even recently developed in silico modelling of interactions (i.e. AlphaFold Multimer) could be of help. If no interaction is observed/expected this should also be pointed out in the manuscript. Optionally, showing localization of Cert or CG9205 to the ER-PM interface would also greatly support the model of VAP-RdgB regulation suggested by the authors.

      Minor comments:

      • In the model shown in Fig2A, it would seem that many proteins can bind VAP in addition to RdgB, however, VAP proteins have only one FFAT binding pocket. This model would only be possible if oligomerization of VAP is considered (oligomerization of VAP has been reported to occur, see for example PMID:20207736). The model should be redrawn considering this fact.
      • In line 161 of the text VPS13D is mentioned, however VPS13C is the gene indicated in Fig 1D.

      Significance

      Previous studies by some of the authors and others have shown that RdgB can transfer lipids between the ER, to which it binds via VAP, and the plasma membrane (PM), and is required for proper replenishment of PI(4,5)P2 in the PM which is in turn necessary for sustained PLC signaling in Drosophila photoreceptors. Lack of RdgB leads to light-dependent degeneration of the retina, and hence it is utilized by the authors as a model for neurodegeneration. Given the clear phenotype of RdgB loss-of-function and the ease of Drosophila genetics, this system represents an ideal model to perform screens for the identification of new genes involved in maintaining neuronal lipid homeostasis required for proper function of the photoreceptors in vivo, and this aspect is the main strength of this study. Importantly, the use of this system could also shed light on the mechanisms behind human neurodegenerative disorders, as many of these involve dysregulation of lipid signaling and lipid transfer at membrane contact sites. A novel and interesting finding is the identification of another lipid transfer protein, Cert, to be involved in the degeneration of photoreceptors.

      The main limitation lies in the experimental design proposed by the authors to define the genes that are studied in their system. These are identified as potential interactors of human VAP in a mammalian cell line. Despite the fact that VAP is a highly conserved protein, and the genes identified are present in Drosophila as well, there is no evidence that these interactions are in fact occurring in Drosophila photoreceptors, and in fact, based on the function and the lack of VAP-binding motif in many of the 52 genes identified to have an effect on the RdgB phenotype, it is likely that many of the interactions are purely genetic and indirect, and that the modulation of the phenotype could in fact be due to a wide variety of factors (including, as discussed by the authors, gene expression, post-translational modifications, trafficking of proteins, etc) unrelated to mechanisms of VAP-RdgB mediated lipid transfer at ER-PM membrane contact sites. A more unbiased screen could have been carried out to identify VAP interactors involved in this degeneration phenotype by testing all of the FFAT or FFAT-related motif containing proteins. Due to this initial bias in the selection of genes to be tested, it is possible that other important VAP interactors that play a role at the ER-PM interface of photoreceptors have not been identified.

      This study provides great functional advance in the understanding of genes implicated in photoreceptor degeneration, and in those regards it is a great resource for a specialized audience, as it enables further characterization by others of the different processes implicated in this neurodegeneration phenotype. However, the advance is small in regards to the core mechanism of RdgB function at VAP-mediated ER-PM, which was the main aim of the article and the most broadly interesting aspect of the study. Many of the VAP-interacting proteins identified in the proteomic approach were already expected to be VAP interactors as they contain FFAT motifs, and these FFAT-containing proteins do not seem to have a major role in VAP-RdgB maintenance of neuronal lipid homeostasis, with the exception of Cert. The implication of Cert in RdgB-mediated lipid homeostasis is certainly interesting as it touches on a current topic in the field of membrane contact sites related to how the multiple interactions of VAP, a universal contact site adaptor at the ER, are regulated and influenced by each other.

      Reviewer's field of expertise: Lipid transfer at membrane contact sites; membrane lipid homeostasis in neurons. All of the Drosophila data seem to be of good general quality to me, but I do not have any expertise in Drosophila work.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Mishra and colleagues have conducted a large genetic screen to identify modulators of a Drosophila model for retinal degeneration. Using biomolecular techniques, they selected a few hundred proteins that interact with an ER bound protein VAP, to further test them in the retinal degeneration model. This was done by downregulating their expression using interference RNA (RNAi). Degeneration was first measured through a pseudopupil analysis, then suppressors of degeneration were further tested in a different retinal degeneration model, and finally through an ERG experiment. Finally, they focused on a strong suppressor of degeneration, dCert, by using a mutant allele to confirm the findings from the RNAi.

      The results suggest that a handful of these candidates are suppressors in this model of retinal degeneration, and identify at which stage of retinal degeneration these proteins may be involved in. These proteins may have a significance in forms of neurodegeneration.

      Major comment:

      • Perhaps a larger number of replicates could be done in the optical neutralization experiment as well as in the ERG. Figure 4.A(i) and (ii), please clearly state n values. I would suggest this as optional, but perhaps ut could help to increase n?
      • For human orthologs (Table 1), it could be worthwhile to add alignment scores between fly and human?

      Minor comment:

      • Clarify the purpose in focusing on dCert specifically in the last results section and discussion
      • Several typos
      • Affect vs effect

      Significance

      General assessment:

      The main significance of this study comes from the focus on proteins that are known to interact with VAP. This implies that the suppressors of degeneration that they have identified in the RdgB9 model may have an effect in other neurodegenerative models, namely in ALS models. This could have a very high significant potential in therapeutic avenues for neurodegenerative diseases.

      Among six candidates that had an effect with knocked down through RNAi, they pursued a single one (dCert) as proof of principle. It would help to add a justification for this choice in the main text and whether the authors have performed or intend to perform experiments using mutant forms of the other candidate proteins.

      The work from Raghu and his team have been leading the research surrounding this model of degeneration in Drosophila. This study naturally further extends their field of research, identifying more candidates that modulate this form of degeneration, and helping elucidate the pathways leading to cellular degeneration.

      These results will be of high interest for specialized researchers studying the molecular pathways that lead to cellular degeneration, both in the context of retinal degeneration as well as neurodegeneration. Specifically, researchers that may be interested in these candidate proteins and how they may play a role in the pathogenesis of various degenerative diseases.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for their time and both thoughtful and constructive comments. Their specific points are addressed below but a general point that we would like to comment on is that in the original version it appears we did not make our model clear enough. The dogma in the field is that Rab7 is recruited to endosomes from a cytosolic pool via exchange with Rab5 (mediated by Mon1/Ccz1). Our work instead indicates that the majority of Rab7 is delivered to Dictyostelium phagosomes by fusion with other endocytic compartments. It was not our intention to imply there was no canonical recruitment of Rab7 from a cytosolic pool, and indeed we provide data to show this happens at a low level and discuss this in the manuscript. Nonetheless, we clearly over-stated the exclusivity of Rab7 recruitment to phagosomes via fusion at several points and our original model cartoon, and have tried to better explain or more nuanced model with multiple routes for Rab7 acquisition in this revision, including a completely redrawn model figure (Fig. 7).

      2. Description of the planned revisions

      Reviewer 1:

      1. The observation that macropinosomes undergo retrograde fusion with newly formed phagosomes to facilitate phagosome maturation is an interesting notion that challenges the traditional model. However, not all phagocytes exhibit a high level of macropinocytosis, and axenic Dictyostelium cells used in the study may be an exception. Thus, it remains unclear whether fusion with macropinosomes is universally required for phagosome maturation. WT Dictyostelium cells or axenic cells cultured under SorMC/Ka condition (Paschke et al., PLoS One, 2018) exhibit significantly reduced macropinocytosis. The authors could examine whether the accumulation of Rab7 and V-ATPase on large-sized phagosomes is delayed in these cells. These experiments may help broaden the applicability of the authors’ finding.

      As our previous work (Buckley et al. PloS pathogens 2019) demonstrated that bacterially-grown PIKfyve mutants are also defective in bacterial killing and growth it is highly likely that cells also are defective in V-ATPase and Rab7 acquisition. However, we agree that formally testing this will further support our conclusions and improve the paper and should be quite straightforward.

      We will therefore co-express GFP-V-ATPase and RFP-Rab7 in both Ax2 and non-axenic cells grown on bacteria and repeat our analysis of recruitment to phagosomes – with the caveat that non-axenic cells do not phagocytose large particles such as yeast (Bloomfield et al. eLife 2015), so the imaging and quantification will be more challenging in this case.

      PIKfyve seems to play a specific role in the maturation of phagosomes but not macropinosomes. The differences may be driven by signaling from phagocytic receptors, as the author suggested. Alternatively, the large size of the yeast-containing phagosomes may require additional steps for efficient lysosomal delivery. The authors should consider examining whether PIKfyve is needed for the delivery of Rab7 and V-ATPase to phagosomes of comparable size to regular macropinosomes, such as those containing K. aerogenes or small beads. In addition, whether the process also involves fusion between phagosomes and macropinosomes should be verified.

      Whilst it is possible that large size of yeast-containing phagosomes requires additional mechanisms to process them, our previous data demonstrate that PIKfyve is also required to kill much smaller bacteria such as Klebsiella and Legionella (Buckley et al. PloS pathogens 2019). Furthermore, in this paper we also showed that loss of PIKfyve disrupts phagosomal proteolysis using 3um beads, and showed that V-ATPase recruitment was reduced on purified phagosomes containing 1um beads. We therefore find consistent defects on phagosomes of different size, with different cargos. Nonetheless, the experiments above, observing V-ATPase and Rab7 in cells grown on bacteria should directly address this point.

      As suggested, we will also perform a dextran pulse-chase prior to addition of bacteria to test if we can observe macropinocytic delivery to bacteria-containing phagosomes - perhaps using E. coli as their elongated shape may help phagosome visualisation.

      In the previous study from the authors' group (Buckley et al., PLoS Pathog, 2019), it was shown that the accumulation of V-ATPase on phagosomes begins immediately after internalization in both PIKfyve mutant and WT, although V-ATPase accumulation reaches only half of the levels seen in WT. This partial accumulation of V-ATPase differs from the almost complete absence of Rab7 recruitment found in this study, which raises the question of whether there exists yet another population of fusogenic vesicles that are positive for V-ATPase but negative for Rab7. This could be checked by simultaneously examining the dynamics of V-ATPase and Rab7 during yeast phagocytosis in the PIKfyve mutant.

      We agree with the referee that there are multiple pools of V-ATPase, and we show that there is both a very early PIKfyve-independent recruitment of both V-ATPase and Rab7 as well as a later and more substantial pool delivered in a PIKfyve-dependent manner. It is clear that V-ATPase and Rab7 do not always co-localise however - the clearest example being on the contractile vacuole, which has lots of V-ATPase but no Rab7 (the large bright magenta structure in Fig 2G.).

      We suspect that the dramatically reduced, but not completely absent levels of both V-ATPase and Rab7 recruitment in the absence of PIKfyve are similar, but the challenges with imaging these very small low levels means we cannot formally exclude that there is a pool of V-ATPase vesicles that lack Rab7 which fuse to very early phagosomes. Nonetheless, as we will already be looking at V-ATPase and Rab7 in PIKfyve KO's in the experiments above will also attempt to unequivocally differentiate a pool of V-ATPase positive/Rab7 negative vesicles fusing with phagosomes.

      Reviewer 2:

      (1) The authors show that deletion of PIKfyve results "in an almost complete block in Rab7 delivery to phagosomes" (page 17) indicating that the delivery of Rab7 depends on fusion with Rab7-positive structures. This would suggest that the Rab7-GEF Mon1-Ccz1 is not localized to the membrane of the phagosomes. Could the authors test for the presence of Mon1-Ccz1 in either fluorescence microscopy experiments or on purified phagosomes to exclude the possibility of a "canonical" Rab7 recruitment by its GEF? If the GEF is found on phagosomal membranes it would indicate that a Rab-transition from Rab5 to Rab7 occurs on the phagosome during maturation, but on a low level. The later fusion event might be a homotypic fusion of two Rab7-positive compartments. The observed fusion events could still deliver the bulk of Rab7 and other endolysosomal proteins to the phagosome. If the Rab7-GEF is not found on phagosomes how do the authors envision that the organelle keeps its identity? Is it solely dependent on PI(3,5)P2? What is the fate of the Rab7-negative phagosome in ∆PIKfyve cells if Rab7 is not delivered to the membrane, is there degradation happening over longer periods of time?

      This is an excellent suggestion, for which we thank the reviewer. Mon1 and Ccz1 are highly conserved, with clear Dictyostelium orthologues that have never been studied. Our model is that there is a small proportion of Rab7 driven by this canonical pathway so would expect Ccz1/Mon1 to coincide with loss of Rab5 and be unaffected by loss of PIKfyve - although subsequent Rab7 delivery would be lost. This is easy to test by cloning and expressing GFP-fusions of both Ccz1 and Mon1 and would be highly informative. Note we do not exclude canonical Rab7 recruitment in our model (see discussion), our data just indicate this has a minor contribution.

      Reviewer 3:

      The focus is on their manuscript is loading of Rab7 on phagosomes, but there's no indication about Rab7 activation (GTP-loading). Would the RILP-C33 probe work in Dictyostelium? If not possible, the activation state of Rab7 should still be discussed. Despite Rab7 on other organelles in PIKfyve-inhibited cells, is this active or not?

      The GTP-loading status of Rab7 is a good question, although the general dogma is that membrane-localised Rabs are active. We will try the RILP-C33 probe in Dictystelium as suggested, but as these cells lack an endogenous RILP orthologue there is a high chance it will not work. Sadly, reliable tools to asses active Rab status are a general limitation for the field, so if the RILP-C33 probe does not work we will add this caveat to the discussion.

      The authors need to better address the confusing kinetics of early Rab7 recruitment, followed by SnxA (Fig. 4G, same for VatM - Fig. 4I ) - which is counterintuitive if PIKfyve activity is required to recruit Rab7. How do the authors explain this? Are phagosomes prevented from acquiring Rab7 in PIKfyve deficient cells because of a defect on phagosomes or the endo-lysosomes loaded with Rab7 (but not active).

      We believe this again relates to the over-simplification of our model. Our data indicate both PIKfyve dependent and independent Rab7 recruitment. In contrast to the abrupt recruitment of SnxA at ~120 seconds (Vines et al. JCB 2023), both Rab7 and VatM accumulate gradually over time starting from almost immediately following engulfment (Buckley et al. 2019, and Figure 2F). Our data indicate that the first stage of this is PIKfyve independent, and is responsible for ~10% of the total Rab7/V-ATPase accumulation by both the imaging in this paper, and Western blot for V-ATPase on purified phagosomes in Buckley et al. PLoS pathogens 2019. The arrival of some Rab7/V-ATPase prior to PI(3,5)P2 therefore supports our model where there are multiple sources of Rab7.

      As the reviewer quite rightly points out, interpretation of the defects observed in the absence of PIKfyve becomes complex and we cannot completely differentiate between a defect on the phagosome, or the Rab7 compartments that fuse with them (or indeed both). In fact, we already note that small Rab7 compartments that we observe in wild-type cells are much more sparse in PIKfyve mutants. Therefore whilst the requirement for PI(3,5)P2 in the clustering and fusion of macropinosomes with phagosomes is clear, additional effects on the PI(3,5)P2-independent Rab7 compartments cannot be excluded.

      The experiments above using the RILP-C33 active Rab7 biosensor as well as observation of the Mon1/Ccz complex should further clarify this, but we will also add further discussion of these points.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer 1:

      Minor comments.

      1. It is unclear how the experiment in Figure 3G was conducted. If microscopic analysis was involved, the corresponding images should be included.

      We apologise that we overlooked this and have now added a full description in the materials and methods (P8 L16-21). Fluorescence measurements were performed using a plate reader, so there are no images.

      Page 11-Line 2, the sentence "there was no obvious clustering around the nascent phagosome (Figure 2D)." It is Figure 2E, not Figure 2D.

      Corrected.

      There is an inconsistency regarding the description of fluorescent fusion proteins. For example, both GFP (RFP)-2xFyve and 2xFyve-GFP (RFP), as well as GFP-Rab5 and Rab5-GFP, were used. Typically, placing GFP (or RFP) before a gene suggests N-terminal tagging, while placing it after the gene implies C-terminal tagging. The authors should clarify the position of the fluorescent tag and ensure consistency in their descriptions.

      We apologise for this oversight, and have been through and corrected all fusion protein references accordingly.

      One of the videos was not referred in the manuscript or described in the Video legends. This video seems to correspond to Figure 5A, albeit with a different pseudo-color scheme.

      This has been corrected. Video 7 does correspond to Fig 5A, and we have corrected the colour scheme to match and added references to the video in the text and figure legend.

      Reviewer 2:

      (2) In their abstract, the authors state that they "...delineate multiple subpopulations of Rab7-positive endosomes that fuse sequentially with phagosomes" (page 2, line 14,15). However, the data provides only evidence for V-ATPase or PI(3,5) P2-containing structures and the authors conclude to my understanding that macropinosomes are the main source for vesicular structures fusing with phagosomes. I would ask the authors to please be clear on the identity of the "Rab7-donor"-structures throughout the manuscript. Saying that they delineate multiple subpopulations of endosomes seems to be overstated.

      We identify that macropinosomes are one source (subpopulation) of Rab7/PI(3,5)P2 vesicles but our data clearly show that they are the only source of Rab7 - there is clearly an additional early Rab positive / PI(3,5)P2-negative subpopulation of vesicles that cluster and fuse too at earlier stages. For example, in Figure 4F we co-express Rab7a/SnxA and show that whilst all the SnxA vesicles also contain Rab7 (and dextran), there is a clear separate population of small and early-fusing population of Rab7-containing vesicles that do not possess PI(3,5)P2. This is further validated in Figure 5B and C. To our mind this clearly demonstrates and defines different Rab7 endosomal populations, although we do not yet know the origins of the initial Rab7-positive/PI(3,5)P2 negative population - as discussed in our response to their point (3) below.

      Minor points:

      (1) The sentence "...which both deactivates and dissociates Rab5, and recruits and activates Rab7 on endosomes" is at least problematic as it suggests that Mon1-Ccz1 directly drives GTP-hydrolysis of Rab5 and dissociates it from the membrane. Indeed, Mon1-Ccz1 is shown to interfere with the positive feedback loop of the Rab5-GEF by interacting with Rabex (Poteryaev et al., 2010), so a rather indirect effect of Mon1-Ccz1. A GAP and the GDI are needed for Rab5 deactivation and dissociation from the membrane. How both are involved in the endosomal Rab-conversion is not clarified.

      We have changed the text to better represent this complexity (P4 L4-6)

      (2) Signals of RFP-labeled proteins are difficult to interpret throughout the experiments. What are the structures that show a strong accumulation of red signal in Fig. 1A,B, Fig 2G and Fig4A (20sec.) If these are fluorescently labeled proteins it would suggest that most of the proteins cluster/accumulate in the cell. Can the authors provide better images?

      We appreciate that some of these reporters with multiple localisations can be difficult to interpret. This is major challenge for these sort of studies and main reason we use the large and easily-identified yeast containing phagosomes for quantification. In Fig. 1 the large structure is the large peri-nuclear cluster of Rab5 previously reported (Tu et al. JCB 2022). In Fig. 2G the bright structure is the recruitment of V-ATPase on the CV. Both these large structures easily distinguished from the phagosomal pool we are interested in. Whilst we would love to provide better images, this is simply not possible - both these other structures are unavoidable and we are already using some of the best microscopy methods available. We have however clarified the additional localisations seen in these images in the revised figure legends.

      (3) On page 11 the authors state "...macropinosomes in ∆PIKfyve cells still appeared much larger. Quantification of their size and fluorescence intensity demonstrated that although macropinosomes started off the same size,...". This statement is not reflected in the data depicted in Fig. 3A,B. The size of the single labeled macropinosome appears to be larger in wildtype than in ∆PIKfyve cells from the beginning on. However, the quantification in Fig 3F is clear. So, are these bad examples in 3A,B, are they swapped or is this due to the additional expression of GFP-Rab7A? Could you please comment on the effect that the (over-)expression of GFP-tagged Rab-GTPases might have on the observations described in this paper in the discussion part?

      As you can see from the error bars in Figure 3F, macropinosomes are extremely variable in size - ranging from ~0.2-5 microns in size in axenic Dicytostelium. The image in Figure 3B is therefore indicative of this heterogeneity, rather than being a "bad example". This is why we designed the experiment to quantify several hundred vesicles in order to make any conclusions - as well as doing it in the absence of any GFP-fusion expression.

      Although we have not noticed any issues (enlarged vesicles are also clear in GFP-Rab7 expressing cells in Figure 1B), we do of course accept that GFP-Rab7 expression itself may have some detrimental effects on maturation and this is why we quantified macropinosome size in untransformed cells. We have clarified this in the results section (P12 L28).

      (4) In Fig. 6E it is hard to distinguish if the dextran is accumulating inside the phagosome. I would suggest conducting a 3D reconstruction of these images to allow judging if macropinosomes fused with the phagosomes or if they cluster around the neck of the phagosome.

      This would be nice, but not possible as these images are from single confocal sections, rather than a complete high-resolution Z-stack. We have however added an enlargement of both Figure 6D and E which we feel now more clearly shows the presence of dextran within the bounding PI(3)P membrane of the phagosome.

      (5) In the discussion, the authors state that the small pool of "PIKfyve-independent Rab7" is "insufficient to for subsequent fusion with other Rab7A-positive compartments, further Rab7 enrichment, and lysosomal fusion." What is the rationale for this conclusion? Is it shown how many Rabs are necessary to induce a tethering and fusion event? It would be good to revise this part of the discussion also in respect of the first major point of my comments above.

      Our data show that in the absence of PIKfyve, phagosomes still remove Rab5 and gain a small pool of Rab7 but progress no further. This is consistent with some block in the HOPS-mediated homotypic fusion of Rab7 compartments. However, we accept that this is not necessarily due to simply not having enough Rab's so have rephrased the discussion accordingly.

      (6) The intention of the paragraph about phagosomal ion channels is for this reviewer somehow out of context. It is not clear to me how the authors relate this to their findings. It would be could to bring this into a broader context.

      __ __We mention ion channels in the background as they represent the main class of PI(3,5)P2 effectors known so far. We feel this is important background context, even if our studies do not directly relate to this.

      Reviewer 3:

      Their disclosure and use of statistics is incomplete and/or inconsistent, and potentially wrong in some cases. For example, the authors disclose the number biological repeats in a few experiments (Fig. 3C, F) but not in the majority. Instead, they state the number of phagosomes without indicating biological repeats (eg. Fig. 2 and others). So, it is not possible to know if their data are reproducible. Despite not indicating independent experiments in some cases, they speak of SEM, which applies to mean of means from biological repeats. In other cases, none of this is disclosed (eg Fig. 3G). Often there is no indication of what statistical test was done OR if a statistical test was done (eg. Fig. 3G, Fig. 4, etc). I would recommend the authors review the excellent resource paper published in JCB on SuperPlots to better follow statistical expectations. This is essential to improve reproducibility and confidence in their observations.

      We apologise if this was unclear for the referee, but we have tried to be clear in each case. The confusion likely lies in the definition of a biological repeat, which depends on the type of experiment. For quantification of phagocytic events over time, we feel it reasonable to take each individual event (each from an individual organism) as a biological repeat. This is because events are relatively rare and taken from multiple different movies, and it is not technically possible to film both mutants and controls simultaneously. In all these sort of experiments (e.g. Figure 2) we have shown standard deviation, which indicates the reproducibility between phagocytic events. We have clarified that these events are from movies obtained on at least 3 independent days in the methods.

      In other cases, such as Figure 3C and F and Figures 5-6, we are able to take measurements across multiple cells simultaneously at each timepoint. It is therefore appropriate to average over multiple independent experimental repeats rather than individual cells. We have therefore used SEM in our analysis, and both the number of individual cells and independent repeats are stated on the graphs and legend. This was incomplete in a few cases but has now been clarified in all cases.

      Regarding statistical tests, which ones were used now been clarified in each figure legend. Note that in Fig 3G, we do not apply any test as both lines essentially overlap and it is clear there would not be any convincing differences. In Figure 4, the graphs all compare co-expression of different reporters rather than different mutants or conditions and are from single events. We therefore feel statistical tests are unnecessary and inappropriate. Comparison of the same reporters between strains averaged across multiple events, with statistical analysis is shown in Fig 2 instead. All these points have now been added to the statistics section of the methods (P9 L1-6)

      Minor Comments

      It is interesting that 2FYVE-GFP stays on phagosomes for 50 min or more - this is distinct from macrophages. Please comment. Have the authors tried other PI(3)P probes to see if the same (PX-GFP).

      We have not used other probes but we have no reason to believe 2xFYVE does not behave as predicted as it is the same probe used for most macrophage studies (FYVE domain from human Hrs), and gets removed from macropinosomes exactly as expected. We did not originally comment in this manuscript but PI3P dynamics are even more interesting as our previous data indicate that latex-bead containing phagosomes lose PI3P after 10 minutes (Buckley et al 2019, Figure 4F-G) This indicates phagosome maturation can be regulated by the cargo (under further investigation). Importantly however, both bead and yeast-containing phagosomes have comparable defects in the absence of PIKfyve. This is more fully discussed in our previous paper (Vines et al. JCB 2023) where we characterise PI(3)P and PI(3,5)P2 dynamics in more detail.

      Fig. 7 model: the macropinosome in the diagram seems like a dead end as depicted - is there any arrow or change that could be added to show that it doesn't just sit there in the middle? Also, the light green on yellow hurts the eyes!

      We apologise, there was actually supposed to be an arrow there but it was lost somewhere in the drafting process. The whole figure has now been updated to more clearly describe our full and more complex model.

      Fig. 3F, could be converted to volume assuming macropinosomes are spheres.

      This is true, however as these images are taken from single planes we cannot know where in the sphere the slices are and therefore what the maximum diameter would be. We therefore prefer to keep it as area so as not to confuse and over-interpret the data.

      Pg. 10, line 10 - Vps34 is Class III PI3K, not Class II.

      Corrected.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *

      Reviewer 2:

      (3) ("OPTIONAL") Optionally, the authors could also try to clarify these structures' identity by including further colocalization studies with additional early and late endosomal marker proteins. Are they for example positive for early or late endosomal markers like EEA1, ESCRT or Retromer? How about organelle-specific SNAREs? This would give further insights into the character of the "Rab7-donor" structures and would allow to clarify if multiple subpopulations are contributing to phagosome maturation in a sequential order as stated in the abstract. As I am not an expert on Dictyostellium I can`t estimate the effort that would go into such an experimental setup. However, since the time scale of the events in the cell is nicely worked out in this study, these colocalization studies would not need to be conducted as live-cell microscopy experiments.

      This is a sensible suggestion that would in theory help define these populations. However many of these markers are poorly defined with respect to phagosomes and/or Dictyostelium. Dictyostelium does not posses an EEA orthologue, but our data also indicate that these vesicles do not possess PI3P so cannot be canonical early endosomes. We have previously characterised WASH/retromer and whilst it is recruited to phagosomes at around the time of Rab5/7 transition Retromer appears to be recruited from the cytosol and drive recycling rather than being delivered on endosomes that fuse (see King et al. PNAS 2016). We have also previously looked at ESCRT (Lopez-Jimenez et al. PLoS Pathogens 2018) which also does not appear to have any recruitment to early phagosomes that would be consistent with a Rab7-sub-population. The SNAREs are yet to be studied in any detail, as they are often too divergent to assign a direct mammalian orthologue.

      Therefore, whilst this is a sensible suggestion, and something we would like to follow up in the future, this is not straight-forward and we feel outside the scope of the current study. We have however included additional discussion of this in the revised manuscript (P20 L21-26).

      Reviewer 3:

      Major Comments:

      1. Based on the current data, I am not entirely convinced that Rab7 is delivered mostly by fusion with other compartments. At least the data as provided cannot exclude other models. For example, Rab7-containing organelles that cluster with phagosomes may form contact sites that provide a local environment to load cytosolic Rab7. There's also a possibility that some of their Rab7 clusters are membrane sub-domains and not vesicles. Or perhaps, there is a first wave of cytosolic Rab7 recruitment, which then initiates fusion with Rab7 compartments, i.e., there is a two-phase Rab7 recruitment. While this last possibility is consistent with recruitment of Rab7 by fusion (the second phase), the authors present a model that is too simplistic and conclusive based on the data. The authors may be right, but they need to strengthen their evidence towards their claim. Maybe EM could help determine some of these issues. Perhaps better would be the use of FRAP, photo-activation, or optigenetics of Rab7. For example, if Rab7 is acquired on phagosomes after photobleaching clusters of Rab7, this would suggest a cytosolic Rab7 contribution, and if not, this would support their model. I recognize that these experiments are not necessarily trivial, but either the authors augment their data (as suggested or with other approaches) or significantly pare down their conclusions.

      We agree with the Referee that we cannot completely exclude other models, and as we talk about in the discussion, we do not wish to do so. We apologise if the role of fusion was over-stated but the model we propose is as the referee suggests: there is likely an early first wave of canonical Rab7 recruitment from the cytosol that is independent of PIKfyve before the majority of Rab7 is subsequently delivered by fusion in a PIKfyve-dependent manner. Our data indicate that the second wave is both quantitively and functionally more significant (see functional data in Buckley et al. 2019).

      We do however agree with the referee that we cannot formally exclude things such as contact-site mediated recruitment from the cytosol or sub-domains but not fusion however there is no data to support these either. In contrast, the hypothetical clustered Rab7 contacts/subdomains often (but not always) contain the transmembrane V-ATPase complex (Figure 2G) which must be delivered by fusion.

      However we do not wish to over-simplify our conclusions and as we state in the discussion, we do think there is probably a small amount of Rab7 recruited from the cytosol by the canonical pathway. We accept that our cartoon in Figure 7 is over-focussed on fusion so we have substantially revised this, as well as the discussion to give a more balanced and complex view.

      Regarding the proposed experiments, unfortunately, the imaging required to acquire these movies is already at the very limit of what is possible so we do not believe it would be technically feasible to employ methods such as FRAP and optogenetics on these relatively fast-moving phagosomes with the temporal resolution required. Furthermore, to differentiate recruitment from a cytosolic pool, every GFP-Rab7 cluster would need to be photobleached, which could not be reliably achieved.

      However, this point will be largely addressed by the suggestion of Reviewer 2 to look at the Mon1/Ccz complex. The presence or absence of this will give strong evidence for canonical Rab5/7 transition and Rab7 recruitment from the cytosol which would significantly clarify our model and define the two different mechanisms of Rab7 recruitment to phagosomes.

      Early macropinosomes fuse with early phagosomes more readily than 10-min old macropinosomes. Do 10-min old macropinosomes not fuse with older phagosomes? Is this not an issue of mismatched age?

      This is an interesting point that we have clarified in the text. We agree with reviewer that it appears the ages of the macropinosomes and phagosomes must match but our data indicate this only occurs when both parties possess PI(3,5)P2 as macropinosome fusions appears to happen in a single burst at about 240 seconds (Figure 6F) rather than as a continuous process. We also do not start to see any fusion of these older macropinosomes when the phagosomes get past the initial first 10 minutes of maturation (Figure 6G).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In Vines et al., the authors used time-lapse imaging of Dictyostelium to investigate the spatial-temporal maturation of macropinosomes labelled with a short pulse of dextran and phagosomes using yeast particles. The phagocytes expressed fluorescent Rab5 and/or Rab7 and/or biosensors for PI(3)P using 2FYVE-GFP and PI(3,5)P2 using the authors recently disclosed SnxA. They quantified the dynamics of these probes in wild-type and PIKfyve-deleted cells. The authors provide evidence for their main observations, which are that: i) Rab5 and PI(3)P are acquired early and independently of PIKfyve on phagosomes and macropinosomes, ii) but phagosomes require PIKfyve to acquire Rab7, iii) that phagosomes acquire Rab7 by fusing with Rab7-containing vesicles that cluster around the phagosome, iv) that macropinosomes do not require PIKfyve for Rab7 acquisition, and v) that PI(3,5)P2 on phagosomes follows Rab7. While the imaging data is high quality and supports several of the claims, the major discovery as proposed here is not fully supported by the data provided. I think the authors must address the following to strengthen their otherwise beautiful work.

      Major Comments:

      1. Based on the current data, I am not entirely convinced that Rab7 is delivered mostly by fusion with other compartments. At least the data as provided cannot exclude other models. For example, Rab7-containing organelles that cluster with phagosomes may form contact sites that provide a local environment to load cytosolic Rab7. There's also a possibility that some of their Rab7 clusters are membrane sub-domains and not vesicles. Or perhaps, there is a first wave of cytosolic Rab7 recruitment, which then initiates fusion with Rab7 compartments, i.e., there is a two-phase Rab7 recruitment. While this last possibility is consistent with recruitment of Rab7 by fusion (the second phase), the authors present a model that is too simplistic and conclusive based on the data. The authors may be right, but they need to strengthen their evidence towards their claim. Maybe EM could help determine some of these issues. Perhaps better would be the use of FRAP, photo-activation, or optigenetics of Rab7. For example, if Rab7 is acquired on phagosomes after photobleaching clusters of Rab7, this would suggest a cytosolic Rab7 contribution, and if not, this would support their model. I recognize that these experiments are not necessarily trivial, but either the authors augment their data (as suggested or with other approaches) or significantly pare down their conclusions.
      2. The focus is on their manuscript is loading of Rab7 on phagosomes, but there's no indication about Rab7 activation (GTP-loading). Would the RILP-C33 probe work in Dictyostelium? If not possible, the activation state of Rab7 should still be discussed. Despite Rab7 on other organelles in PIKfyve-inhibited cells, is this active or not?
      3. The authors need to better address the confusing kinetics of early Rab7 recruitment, followed by SnxA (Fig. 4G, same for VatM - Fig. 4I ) - which is counterintuitive if PIKfyve activity is required to recruit Rab7. How do the authors explain this? Are phagosomes prevented from acquiring Rab7 in PIKfyve deficient cells because of a defect on phagosomes or the endo-lysosomes loaded with Rab7 (but not active).
      4. Their disclosure and use of statistics is incomplete and/or inconsistent, and potentially wrong in some cases. For example, the authors disclose the number biological repeats in a few experiments (Fig. 3C, F) but not in the majority. Instead, they state the number of phagosomes without indicating biological repeats (eg. Fig. 2 and others). So, it is not possible to know if their data are reproducible. Despite not indicating independent experiments in some cases, they speak of SEM, which applies to mean of means from biological repeats. In other cases, none of this is disclosed (eg Fig. 3G). Often there is no indication of what statistical test was done OR if a statistical test was done (eg. Fig. 3G, Fig. 4, etc). I would recommend the authors review the excellent resource paper published in JCB on SuperPlots to better follow statistical expectations. This is essential to improve reproducibility and confidence in their observations.
      5. Early macropinosomes fuse with early phagosomes more readily than 10-min old macropinosomes. Do 10-min old macropinosomes not fuse with older phagosomes? Is this not an issue of mismatched age?

      Minor Comments

      1. It is interesting that 2FYVE-GFP stays on phagosomes for 50 min or more - this is distinct from macrophages. Please comment. Have the authors tried other PI(3)P probes to see if the same (PX-GFP).
      2. Fig. 7 model: the macropinosome in the diagram seems like a dead end as depicted - is there any arrow or change that could be added to show that it doesn't just sit there in the middle? Also, the light green on yellow hurts the eyes!
      3. Fig. 3F, could be converted to volume assuming macropinosomes are spheres.
      4. Pg. 10, line 10 - Vps34 is Class III PI3K, not Class II.

      Significance

      Overall, the potential novelty of this work is the authors' proposal that phagosomes acquire Rab7 mostly by fusion with Rab7-labelled organelles rather than a cytosolic tool. This is distinct from existing models that assume phagosomes acquire Rab7 from a cytosolic pool that is loaded onto the membrane. They also suggest that PIKfyve plays a role in this process. However, as noted above, this claim needs to stronger data as the current data allows for other possible models, in my opinion.

      This work is of relevance to cell biologists interested in membrane trafficking, phagocytosis, model organisms, and microscopy.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Vines et al. investigate in their manuscript the contribution of the lipid kinase PIKfyve on the maturation of phagosome. They follow the fate of the two main classes of identity markers of endocytic organelles: PIPs (PI(3)P and PI(3,5)P2) and Rab-GTPases (Rab5 and Rab7) in wildtype and ∆PIKfyve Dictyostelium discoideum cells. To follow the two species of PIPs they utilize the established reporter-proteins GFP-2xFYVE (PI(3)P-binder) and GFP-SnxA (PI(3,5)P2-binder) and correlate them with the appearance of fluorescently tagged Rab-GTPases on membranes of phagosomes in live-cell imaging. They find that the deletion of PIKfyve does not alter the recruitment and behaviour of Rab5. Therefore a lack of PI(3,5)P2 does not affect the early stages of phagosome formation. However, later stages of phagosome maturation are apparently affected by the lack of PI(3,5)P2: the Rab-GTPase Rab7 is not localizing to the membrane of the phagosome. Closer inspection of their live cell imaging data led the authors to the conclusion that Rab7 is delivered by the fusion of Rab7-positive structures with the phagosomes in wildtype cells. This fusion seems to be dependent on PIKfyve and PI(3,5)P2. However, surprisingly, the recruitment of Rab7 to macropinosomes or endosomal structures is independent of PIKfyve. The authors conclude (i) that lysosomal components are delivered to phagosomes by fusion of PI(3,5)P2-positive macropinosomes and (ii) a non-canonical delivery from Rab7 to phagosomes by fusion instead of GEF-dependent recruitment from the cytosol.

      Major comments

      Overall, the submitted manuscript of Vines et al. is of very good quality. The presented data supports mainly the conclusions that the authors draw. Methods and statistical analysis are sound and well-described. Their rationale, the description of results, and the presentation of data are easy to follow and understand. However, there are two major points that I would like to address here:

      1. The authors show that deletion of PIKfyve results "in an almost complete block in Rab7 delivery to phagosomes" (page 17) indicating that the delivery of Rab7 depends on fusion with Rab7-positive structures. This would suggest that the Rab7-GEF Mon1-Ccz1 is not localized to the membrane of the phagosomes. Could the authors test for the presence of Mon1-Ccz1 in either fluorescence microscopy experiments or on purified phagosomes to exclude the possibility of a "canonical" Rab7 recruitment by its GEF? If the GEF is found on phagosomal membranes it would indicate that a Rab-transition from Rab5 to Rab7 occurs on the phagosome during maturation, but on a low level. The later fusion event might be a homotypic fusion of two Rab7-positive compartments. The observed fusion events could still deliver the bulk of Rab7 and other endolysosomal proteins to the phagosome. If the Rab7-GEF is not found on phagosomes how do the authors envision that the organelle keeps its identity? Is it solely dependent on PI(3,5)P2? What is the fate of the Rab7-negative phagosome in ∆PIKfyve cells if Rab7 is not delivered to the membrane, is there degradation happening over longer periods of time?
      2. In their abstract, the authors state that they "...delineate multiple subpopulations of Rab7-positive endosomes that fuse sequentially with phagosomes" (page 2, line 14,15). However, the data provides only evidence for V-ATPase or PI(3,5) P2-containing structures and the authors conclude to my understanding that macropinosomes are the main source for vesicular structures fusing with phagosomes. I would ask the authors to please be clear on the identity of the "Rab7-donor"-structures throughout the manuscript. Saying that they delineate multiple subpopulations of endosomes seems to be overstated.

      ("OPTIONAL") Optionally, the authors could also try to clarify these structures' identity by including further colocalization studies with additional early and late endosomal marker proteins. Are they for example positive for early or late endosomal markers like EEA1, ESCRT or Retromer? How about organelle-specific SNAREs? This would give further insights into the character of the "Rab7-donor" structures and would allow to clarify if multiple subpopulations are contributing to phagosome maturation in a sequential order as stated in the abstract. As I am not an expert on Dictyostellium I can`t estimate the effort that would go into such an experimental setup. However, since the time scale of the events in the cell is nicely worked out in this study, these colocalization studies would not need to be conducted as live-cell microscopy experiments.

      Minor comments

      Minor points:

      1. The sentence "...which both deactivates and dissociates Rab5, and recruits and activates Rab7 on endosomes" is at least problematic as it suggests that Mon1-Ccz1 directly drives GTP-hydrolysis of Rab5 and dissociates it from the membrane. Indeed, Mon1-Ccz1 is shown to interfere with the positive feedback loop of the Rab5-GEF by interacting with Rabex (Poteryaev et al., 2010), so a rather indirect effect of Mon1-Ccz1. A GAP and the GDI are needed for Rab5 deactivation and dissociation from the membrane. How both are involved in the endosomal Rab-conversion is not clarified.
      2. Signals of RFP-labeled proteins are difficult to interpret throughout the experiments. What are the structures that show a strong accumulation of red signal in Fig. 1A,B, Fig 2G and Fig4A (20sec.) If these are fluorescently labeled proteins it would suggest that most of the proteins cluster/accumulate in the cell. Can the authors provide better images?
      3. On page 11 the authors state "...macropinosomes in ∆PIKfyve cells still appeared much larger. Quantification of their size and fluorescence intensity demonstrated that although macropinosomes started off the same size,...". This statement is not reflected in the data depicted in Fig. 3A,B. The size of the single labeled macropinosome appears to be larger in wildtype than in ∆PIKfyve cells from the beginning on. However, the quantification in Fig 3F is clear. So, are these bad examples in 3A,B, are they swapped or is this due to the additional expression of GFP-Rab7A? Could you please comment on the effect that the (over-)expression of GFP-tagged Rab-GTPases might have on the observations described in this paper in the discussion part?
      4. In Fig. 6E it is hard to distinguish if the dextran is accumulating inside the phagosome. I would suggest conducting a 3D reconstruction of these images to allow judging if macropinosomes fused with the phagosomes or if they cluster around the neck of the phagosome.
      5. In the discussion, the authors state that the small pool of "PIKfyve-independent Rab7" is "insufficient to for subsequent fusion with other Rab7A-positive compartments, further Rab7 enrichment, and lysosomal fusion." What is the rationale for this conclusion? Is it shown how many Rabs are necessary to induce a tethering and fusion event? It would be good to revise this part of the discussion also in respect of the first major point of my comments above.
      6. The intention of the paragraph about phagosomal ion channels is for this reviewer somehow out of context. It is not clear to me how the authors relate this to their findings. It would be could to bring this into a broader context.

      Referees cross-commenting

      Reviewer #1 provides valid questions. Addressing them would improve the manuscript by allowing consideration if the findings only apply to Dictyostellium or is of broader interest.

      I completely agree with the concern of Reviewer #3 that the data provided so far would also allow for alternative models. The authors need to include further controls to exclude Rab7 recruitment or activation by any other means than fusion.

      Significance

      The manuscript by Vines et al. describes a very interesting novel observation on how the organelle identity marker Rab7 is delivered to phagosomes. They propose a mechanism, the delivery of Rab7 by PIKfyve-dependent fusion events with Rab7-positive macropinosomes, which is in contrast to the canonical model that endosomal organelles gain their Rab7-identity by maturation from a Rab5-positive compartment with the help of the Rab7-GEF Mon1-Ccz1. In the proposed mechanism the lipid-kinase PIKfyve, which is also involved in cellular signaling processes, plays the key role. In this study the authors present profound live cell imaging experiments combined with pulse-chase uptake of phagosomal cargoes. The obtained data is giving surprising new insights on the order of events in the maturation of phagosomes and suggests an unprecedentedly important role for PIKfyve in the maturation process. These new insights are of broad interest to a readership interested in transport, maturation and signaling processes along the endolysosomal system as well as of interest in the perspective of pathogen invasion to host cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      PIKfyve/Fab1, a kinase responsible for phosphorylating PI3P to produce PI(3,5)P2, regulates phagosome maturation across various organisms. A previous work from the authors' group demonstrated that disrupting PIKfyve in Dictyostelium inhibits the delivery of V-ATPase and hydrolase, thus dramatically reducing the ability of cells to acidify newly formed phagosomes and digest their contents. The current manuscript further dissects the function of PIKfyve and PI(3,5)P2. Using live cell imaging, the authors show that nascent phagosomes acquire Rab7 by fusion with multiple populations of Rab7-positive vesicles, and the loss of PIKfyve abolishes this event. One of these fusogenic vesicle populations was identified as PI(3,5)P2-positive macropinosomes, which dock and fuse with phagosomes in a PIKfyve-dependent manner. Based on these findings, the authors propose that PIKfyve contributes to phagosome maturation by promoting heterotypic fusion between phagosomes and macropinosomes, which help deliver regulatory components necessary for phagosome acidification and digestion. This study provides fresh insights into the process of phagosome maturation. The work was well designed, performed and presented, and the manuscript is clearly written. However, there are several questions that should be addressed to strengthen the conclusions of the manuscript.

      Major points:

      1. The observation that macropinosomes undergo retrograde fusion with newly formed phagosomes to facilitate phagosome maturation is an interesting notion that challenges the traditional model. However, not all phagocytes exhibit a high level of macropinocytosis, and axenic Dictyostelium cells used in the study may be an exception. Thus, it remains unclear whether fusion with macropinosomes is universally required for phagosome maturation. WT Dictyostelium cells or axenic cells cultured under SorMC/Ka condition (Paschke et al., PLoS One, 2018) exhibit significantly reduced macropinocytosis. The authors could examine whether the accumulation of Rab7 and V-ATPase on large-sized phagosomes is delayed in these cells. These experiments may help broaden the applicability of the authors' finding.
      2. PIKfyve seems to play a specific role in the maturation of phagosomes but not macropinosomes. The differences may be driven by signaling from phagocytic receptors, as the author suggested. Alternatively, the large size of the yeast-containing phagosomes may require additional steps for efficient lysosomal delivery. The authors should consider examining whether PIKfyve is needed for the delivery of Rab7 and V-ATPase to phagosomes of comparable size to regular macropinosomes, such as those containing K. aerogenes or small beads. In addition, whether the process also involves fusion between phagosomes and macropinosomes should be verified.
      3. In the previous study from the authors' group (Buckley et al., PLoS Pathog, 2019), it was shown that the accumulation of V-ATPase on phagosomes begins immediately after internalization in both PIKfyve mutant and WT, although V-ATPase accumulation reaches only half of the levels seen in WT. This partial accumulation of V-ATPase differs from the almost complete absence of Rab7 recruitment found in this study, which raises the question of whether there exists yet another population of fusogenic vesicles that are positive for V-ATPase but negative for Rab7. This could be checked by simultaneously examining the dynamics of V-ATPase and Rab7 during yeast phagocytosis in the PIKfyve mutant.

      Minor points:

      1. It is unclear how the experiment in Figure 3G was conducted. If microscopic analysis was involved, the corresponding images should be included.
      2. Page 11-Line 2, the sentence "there was no obvious clustering around the nascent phagosome (Figure 2D)." It is Figure 2E, not Figure 2D.
      3. There is an inconsistency regarding the description of fluorescent fusion proteins. For example, both GFP (RFP)-2xFyve and 2xFyve-GFP (RFP), as well as GFP-Rab5 and Rab5-GFP, were used. Typically, placing GFP (or RFP) before a gene suggests N-terminal tagging, while placing it after the gene implies C-terminal tagging. The authors should clarify the position of the fluorescent tag and ensure consistency in their descriptions.
      4. One of the videos was not referred in the manuscript or described in the Video legends. This video seems to correspond to Figure 5A, albeit with a different pseudo-color scheme.

      Significance

      Disruption of PIKfyve results in severe defects in phagosomal maturation across different organisms, but the underlying mechanism remains unclear. This study demonstrates that PIKfyve plays a specific role in phagosome maturation by promoting heterotypic fusion between macropinosomes and newly formed phagosomes. These fusion events provide a means for the rapid delivery of lysosomal components to early phagosomes. The study challenges the conventional model of phagosome maturation and provides novel insights into the complex dynamics involved. Nonetheless, further investigations are needed to elucidate the exact role of PIKfyve/PI(3,5)P2 in regulating vesicle fusion and to explore whether the proposed model can be applied to other endocytic pathways or cell types.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1) List of the detailed experiments we plan to perform (including aforementioned experiments):

      • Careful analysis of the daughter cell size by measuring the real volume.

      • Quantifications of PCM (pericentrin and γ-tubulin) proteins and Plk1 with respect to centrosome age in G2 and metaphase (for Plk1) cells.

      • Analysis of the amount of Plk1 of metaphase cells when cenexin protein is absent (siControl vs siCenexin), and measurements of Plk1 in WT-cenexin vs. cenexinS796A mutant to test if Cenexin controls a subpool of Plk1 at centrosomes.

      • Careful analysis of Ctrl and TPX2 depletion experiment data in 1:1 cells. We plan to repeat the experiment to confirm or infirm on the contribution of TPX2 in spindle asymmetry.

      • Measurement of the PCM volume/intensity in 2:2 and 1:1 metaphase cells, to highlight on the contribution of the daughter centrioles in recruiting PCM proteins.

      • Live cell imaging of 2:2 cells and measurements of different parameters; cortex-to-centrosome and spindle pole to metaphase plate (half-spindle (a)symmetry) distances.

      • Long-term live cell imaging of 2:2 cells to investigate whether the asymmetry in centrosome-age dependent daughter cell size also affects the duration of the ensuing cell cycle. While we have carried out such long-term movies in the past, we are aware that they can be challenging due to high cell mobility over longer time courses.

      • Investigation of the microtubule nucleation capacity under different conditions of PCM protein depletion (depletion of Cdk5rap2 and/or pericentrin).

      • Analysis of the effect of the over-expression of PCM protein (Cdk5Rap2) on the (a)symmetry of the mitotic spindle size


      2) detailed answers (in green) to the reviewers’ comments:


      __Reviewer #1: __

      __(Major points) __

      1. The discovery of differences in half-spindle size during symmetric division is intriguing. However, the methodology for quantification of the data remains unclear. Key questions, such as how the center of the metaphase plate is determined from the image data, the definition of exact pole position when centrioles are located at spindle poles, the objective determination of daughter cell diameter and width from the image data, and the referential position of the cortex, need more detailed explanation in the manuscript. Additionally, it's crucial to elucidate the specific index used to quantify differences from the image data, especially when dealing with data that only varies by a few percent. Providing clarity on these aspects and, in some cases, re-quantifying the data should be necessary.

      We have already included clearer explanations in the method parts and results part about our methodology and will include a supplementary figure on how precisely we defined and measured the half-spindle sizes, as well as the index used for the asymmetry (using a methodology that we previously used in Dudka et al., Nature Comm., 2018). In addition, we will use a second method to measure the real daughter cell volume.

      The mechanism behind the difference in half-spindle size, related to the subdistal appendage (SDA), raises questions, especially considering that SDA is believed to disassemble during mitosis. Exploring whether differences in the localization of PCM components and half-spindle size result from disparities in Plk1 and PCM loading during G2/early mitosis, prior to SDA disassembly, necessitates experimental verification.

      As suggested by the reviewer we will quantify the amounts of PCM proteins on the old and young centrosome in G2 cells (and therefore prior SDA reorganization). This will also allow us to test whether the asymmetry depends on the SDA themselves, or the corresponding SDA proteins, which still accumulate specifically on the oldest centrosomes during mitosis

      For investigating the mechanism of half-spindle size asymmetry, many perturbation experiments employ knock-down techniques. To directly address the cause of asymmetry, it might be valuable to artificially localize Plk1 and PCM factors to one spindle pole using optogenetic tools or similar approaches and then quantify half-spindle and daughter cell sizes.

      We thank the reviewers for this suggestion, as it could indeed, be of great interest and provide a direct proof of principle. Unfortunately, based on our experience in establishing such a cell line we know that just the generation of such a light-manipulated stable cell line that contains markers for centrosomes and chromosomes or kinetochores takes 6-9 months, in the best-case scenario. This experiment is therefore not possible within a normal revision round (even if extended to 6 months).

      The asymmetry in Plk1 sub-population recruitment by SDA triggers the observed effects, but the evidence for this is relatively weak, given the small difference in spindle asymmetry. Quantifying the amount of Plk1 in its activated form, particularly in the context of SDA dismantling during metaphase, could strengthen this aspect of the study.

      While the commercial antibodies against the activated form of Plk1 (phospho-T210) work very well by immunoblotting, we have not been able to get it to work by immunofluorescence. We will nevertheless, test whether variation in the fixation methods can solve this issue. Alternatively, we will test to which extend depletion of Cenexin, or the presence of Cenexin WT vs the non-phosphorylatable Cenexin mutant affects the overall population of Plk1 on both spindle poles.

      While the focus on half-spindle size asymmetry during symmetric division is intriguing, it's important to address the broader physiological significance. The primary outcome of this asymmetry is differences in daughter cell size, which limits the broader significance of the study. Furthermore, the quantification method for daughter cell size warrants scrutiny and clarification.

      As mentioned above, we will use different method to measure and investigate daughter cell size (a)symmetry. Moreover, we will attempt with long-term live cell movies to test whether the variation in centrosome-age dependent daughter cell size also affects the duration of the ensuing cell cycle.

      (Minor points)

      1. Table 1 lists factors with asymmetric localization not analyzed in detail in this paper. It would be beneficial to discuss whether these factors play a role in spindle asymmetry, and the authors should address the completeness of the data in Table 1 in terms of selecting factors for analysis.

      We agree with this comment that other factors may participate in the regulation of spindle asymmetry. However, we performed this screening to identify key drivers of spindle (a)symmetry based on an investigation of the Pearson’s correlation coefficient and the value of slope.

      In addition, some of these proteins are known to control spindle size in acting in a same pathway (TPX2/Kif2A/Katanin) and (Pericentrin/CDK5RAP2/ϒ-tubulin). We will incorporate these points and the reasons for our selection in the discussion

      In Figure 1H, the impact of centriolin knock-out on the distribution of unaligned polar chromosomes is different from the effect of cenexin S796A in Figure 6H. This difference should be explained to provide clarity on the observed discrepancies.

      We will better explain this difference.

      In Figure 2A, there is no correlation data presented between daughter cell asymmetry and the presence or absence of cenexin signal. This relationship should be elucidated for a more comprehensive understanding.

      We will clarify this point. Specifically, we plotted the daughter cell symmetry index for 2:2 and 1:1 cells with respect to centrosome age. All the daughter cells display the presence of a cenexin signal at both grandmother and mother centrioles with a difference in fluorescence intensity that enables us to assign them to “old” vs “young centrosomes. We found a significant result indicating that there is a relationship between centrosome age and the formation of daughter cell with different sizes.

      In Figure 4G and H, the mean value of spindle asymmetry increases with siRNA treatment of Cdk5Rap2 or PCNT compared to the control. The possible interpretation of this finding should be discussed.

      This is an interesting observation that needs to be discussed in our revision.

      Figure 4K shows that the asymmetry of PCNT distribution is not eliminated by centriolin knock-down. This observation requires clarification and discussion.

      It has been shown that pericentrin is directly recruited by Plk1 at centriole (Soung et al., 2009). In addition, pericentrin has a PACT-domain that directly targets pericentrin to the centriole (Gillingham and Munro., 2000). Moreover, it has been demonstrated that the grandmother centriole is slightly longer than the mother one (Kong et al., 2020). Altogether, this suggests that the old and young centrosomes, based on this intrinsic property, may recruit different amount of pericentrin.

      We will add this explanation in the discussion.

      It appears that the difference in spindle asymmetry of the control group in Figure 5A is smaller than in other data. This discrepancy should be addressed. Additionally, the influence of TPX2 depletion on spindle formation, and any corresponding spindle staining data, should be included.

      This point will be discussed in the revised version of the manuscript.

      Claiming that the daughter centriole recruits PCM based on Figure 6A data alone may require additional supporting evidence. It is essential to investigate whether there is a clear PCM signal when the daughter centriole disengages in late mitosis and maintain consistency in the interpretation.

      As suggested by the reviewer 2, we will measure PCM volume/intensity in both 2:2 and 1:1 cells to demonstrate that daughter centrioles directly recruit PCM proteins.

      The lack of difference in TPX2 distribution in Figure 7E should be explained, along with a discussion of how this observation aligns with the spindle asymmetry data and any inconsistencies.

      We will discuss this point in the revised manuscript.

      The differing N numbers between samples in all the figures may affect the validity of comparisons. The authors should discuss whether it is necessary to have consistent N numbers in each experiment for more robust conclusions.

      Indeed, this is an important point that must be discussed.

      Reviewer #2____:

      Major comments:

      1) It is not completely clear how the authors determined whether a spindle was asymmetric or not. In the methods, they say that statistical tests are described in the legends. In Figure 1 legend they say: "Each condition was compared to a theoretical distribution centered at 0 (dashed line)". How did they generate this theoretical distribution?

      As explained under point 1 of reviewer 1, we will provide a more thorough explanation of our methodology and how we decide whether a spindle is symmetric or not. In brief, a perfectly symmetric spindle would yield an asymmetry index of 0, as there is no difference between the two half-spindle sizes.

      2) The authors claim that TPX2 depletion results in loss of spindle asymmetry in 1:1 cells, but the difference is very small (1.7% in control vs 1.3% in TPX2 depletion, Fig 5B) and the data is more variable in TPX2 depletion, which makes it less likely that a statistically significant difference from 0 would be found. Firstly, perhaps the authors could check the standard error of the mean, which provides a measure of how accurate the mean is with regard to N and variation. If a dataset is more spread (such as in TPX2 depletion) a higher N is required to attain the same accuracy in the mean value. This is normally not so important when directly comparing two datasets, but in this case the authors are comparing each dataset to 0. So, are the authors measuring enough cells in the TPX2 depletion to be sure that a 1.3% value is not significantly different from 0? Secondly, I don't understand why the control cells have such a low asymmetry index (1.7%), when previous data in the paper shows an asymmetry index of 4.1% (Fig 1D) and 3.4% (Fig 4E) in control 1:1 cells. This suggests that something about the way this experiment was carried out dampens the asymmetry, which could therefore lead the authors to conclude that TPX2 is more important than it really is.

      We agree with this comment, the mean of the control condition is smaller compared to others controls. As mentioned above, we will carefully look at the data (SD vs SEM) and in case add a new replicate to confirm or infirm the involvement of TPX2 in the formation of asymmetric spindles.

      3) The authors claim that daughter centrioles are associated with some Pericentrin and suggest that this may be why 2:2 centrosomes have less of an asymmetry than 1:1 centrosomes (Fig 6A). It is unclear whether the authors consider these daughter centrioles as being prematurely disengaged (they make reference to the fact that they previously showed how disengaged daughters recruit γ-tubulin, but it's unclear if this is related to their current observations). In Figure 6A, the Centrin spots look too far apart for engaged centrioles (~750nm). I appreciate that this may be the only way to dectect Pericentrin around the daughter at this resolution, but it may also force the authors to select cells where the centrioles have prematurely disengaged. For the asymmetry measurements, the authors presumably did not select cells where they could distinguish mother and daughter centrioles. One way to address this issue would be to compare PCM size at centrosomes in 2:2 cells with centrosomes in 1:1 cells. The expectation would be that centrosomes in 2:2 cells would have more PCM, due to the contribution of the daughter centrioles.

      We agree that on those high-resolution images the daughter centrioles seem to be far from the mother ones. The metaphase cells presented in this figure, are wild-type non-treated cells for which the daughter centrioles are engaged. Indeed, our own investigation of the centriole engagement status by expansion microscopy, indicates that over 98% of centriole pairs in metaphase RPE1 cells are engaged.

      Nevertheless, as suggested by the reviewer and to validate that daughter centrioles participate in this process, we will compare PCM size in 2:2 and 1:1 metaphase cells.

      4) The authors show that Plk1 recruitment by Cenexin (via S796 phosphorylation), which happens only at mother centrosomes, is important for asymmetry. Nevertheless, they show that Plk1 is symmetrically distributed between mother and daughter centrosomes (Table 1). This does not really fit, unless daughter centrosomes recruit more cenexin-independent Plk1 than mother centrosomes or if the cenexin-bound pool of Plk1 is only a minor fraction of total Plk1. If so, do the authors think that the Cenexin-bound pool of Plk1 is more potent than the rest of centrosomal Plk1?

      As indicated in point 4 of reviewer 1 we will test which proportion of the Plk1 pool at spindle poles depends on the presence of Cenexin, as we suspect that this Plk1 population is only a subpopulation.

      5) The circles drawn to measure cell size in Figures 2A,E and 7C do not look like a good representation of cell area (as the cells are not perfectly round). The authors use a formular for circle area with an approximation of the radius (based on mean length/width of an oval. It would be much better to use ImageJ to draw a freehand line around the perimeter of the cell and use the in-built tool to measure the area.

      As mentioned in point 1 of reviewer 1 we will use another method to measure daughter cell size.

      Minor comments:

      1) Asymmetry in centrosome size that correlates with centrosome age in apparently symmetrically dividing "cells" has been observed previously in Drosophila syncytial embryos (Conduit et al., 2010a, Curr. Bio.). I think this should be mentioned somewhere given the topic of the study.

      We thank the reviewer for this information. This paper will be discussed in the revised version.

      2) A full description of statistical tests and n numbers for each experiment should be provided in the methods, even if this duplicates information in the Figure legends.

      We will add this information in the method.

      OPTIONAL EXPERIMENTS:

      3) Given that chTOG is very important for microtubule nucleation, it seems strange that this protein was not analysed for a potential asymmetry.

      As suggested by the reviewers we will test for a potential chTOG asymmetry and its impact on spindle size asymmetry.

      4) Cooling-warming experiments could be done using higher concentration of formaldehyde, as it's likely that microtubule nucleation is not immediately halted when using 4% formaldehyde.

      The fixation solution was chilled at 4°C, which should halt any further depolymerization. We will specify this point in the Material and Methods section.

      Reviewer ____#____3:

      Major points:

      1) The evaluation of spindle and cell size asymmetry related to centrosome age only relies on fixed sample preparation. Cells should be followed by time-lapse microscopy as the metaphase plate position relative to the spindle poles and/or the cell cortex may fluctuate over time and as the observed differences remain in a very subtle range. This is an important possibility to consider for 1:1, 1:0 or 0:0 spindle pole configurations where centrosome integrity is impaired.

      We agree with the reviewer that this is a drawback of our approach, but the experiments the reviewer suggests is not possible for 1:0 or 0:0 or only in an approximate manner. Indeed, we do not have a centriole-independent spindle pole marker that would allow us to mark precisely the position of the spindle pole. In the past we used Sir-tubulin, which gave us an approximate position of the spindle poles, and which allowed to us monitor the spindle asymmetry over time of 1:0 cells (see Dudka et al., 2019), a point that we will discuss. Nevertheless, as suggested by the reviewer we will attempt to monitor these asymmetries in 2:2 and/or 1:1 cells expressing GFP-Centrin1 and GFP-CENPA (kinetochore marker) in WT conditions. Indeed, we cannot expand this approach to all the conditions, as the calculation of the spindle asymmetry index is based on a very high number of cells, and the monitoring of spindle asymmetry can only be achieved by selecting mitotic cells one-by-one and then monitoring them over a short period of them (Tan et al., eLife, 2015), which makes such an approach extremely time-consuming.

      2) Cell size asymmetry was evaluated based on cell area at the equator. Volumes will be a better indicator as daughter cell shapes can be different in telophase if they do not re-adhere at the same speed. This evaluation should also be confirmed with another readout, like the position of the cleavage furrow relative to the spindle poles in late anaphase, as again the observed differences are in a very subtle range.

      As indicated in the similar points of reviewer 1 and 2, we will improve our methodology to take this comment in account

      3) The authors propose that differential microtubule nucleation at the spindle poles underlies spindle size symmetry breaking without providing direct evidence. If the observed spindle symmetry in the 1:1 configuration after pericentrin, CDK5RAP2 or g-tubulin siRNA fuels this interpretation (Fig4C), the differential microtubule nucleation capacity at the spindle poles after microtubule-depolymerisation-repolymerisation assays was not evaluated in these conditions, as compared to the control situation.

      As suggested by the reviewer we will analyze the microtubule nucleation capacity after the downregulation of PCM proteins.

      4) If differential microtubule nucleation at the spindle poles is responsible for spindle asymmetry, overexpression of PCM proteins or g-tubulin should be sufficient for re-establishment of symmetric protein distribution, spindle and cell size symmetry in 2:2 or 1:1 configuration. The authors should evaluate whether this is the case or not.

      This is an interesting suggestion, which we will test, although overexpression of these proteins might also lead to other defects in the spindle, such as multipolar spindles.

      5) The authors describe that the cortex-centrosome distance is not changed according to centrosome age (Fig2C), but centrosome-metaphase plate distance is (Fig1D). These observations are difficult to reconcile if differential microtubule-nucleation capacity is at play. Again, time-lapse microscopy would enable to detect over time whether only metaphase plate position relative to spindle poles is changing or if spindle pole position relative to the cell cortex is also fluctuating.

      We plan to give a try to image WT 2:2 cells by time lapse microscopy and to measure several parameters such as half-spindle size, spindle (a)symmetry and the cortex to centrosome distance over time.

      Minor points:

      6) Main PCM and MT nucleation protein "depletion" do not appear to impact spindle assembly, but only spindle symmetry in 1:1 and 1:0 configurations (Fig4A and 4F-H). Can it be explained by the fact that their depletion is not always total (for pericentrin, Fig5F versus FigS2A or Fig7G)? Can they comment on this point?

      Spindles displaying abnormal centriole number at spindle poles (1:1 and 1:0) can still assemble bipolar spindle in absence of the main PCM proteins (Chinen et al., JCB, 2021, and Watanabe et al., JCB, 2020).

      In our study, the depletion of PCM protein is almost total (97% for pericentrin, 98% for Cdk5Rap2).

      7) If centrosome age dictates spindle and cell size asymmetry through differential MT-nucleation capacity at the spindle poles, how can this process be modulated? Indeed, centrosome age is common to all cell types, but cell size asymmetry is more or less pronounced. The authors should further discuss this point based on the literature.

      We will discuss this point in the discussion.

      __ Description of the revisions that we have already carried out in the revised manuscript__


      1. The discovery of differences in half-spindle size during symmetric division is intriguing. However, the methodology for quantification of the data remains unclear. Key questions, such as how the center of the metaphase plate is determined from the image data, the definition of exact pole position when centrioles are located at spindle poles, the objective determination of daughter cell diameter and width from the image data, and the referential position of the cortex, need more detailed explanation in the manuscript. Additionally, it's crucial to elucidate the specific index used to quantify differences from the image data, especially when dealing with data that only varies by a few percent. Providing clarity on these aspects and, in some cases, re-quantifying the data should be necessary.

      We have already included clearer explanations in the method parts and results part about our methodology and will include a supplementary figure on how precisely we defined and measured the half-spindle sizes, as well as the index used for the asymmetry (using a methodology that we previously used in Dudka et al., Nature Comm., 2018). In addition, we will use a second method to measure the real daughter cell volume.


      __ Description of the experiments that we prefer not to carry out:__


      Point 3 of reviewer 1 : For investigating the mechanism of half-spindle size asymmetry, many perturbation experiments employ knock-down techniques. To directly address the cause of asymmetry, it might be valuable to artificially localize Plk1 and PCM factors to one spindle pole using optogenetic tools or similar approaches and then quantify half-spindle and daughter cell sizes.

      We thank the reviewers for this suggestion, as it could indeed, be of great interest and provide a direct proof of principle. Unfortunately, based on our experience in establishing such a cell line we know that just the generation of such a light-manipulated stable cell line that contains markers for centrosomes and chromosomes or kinetochores takes 6-9 months, in the best-case scenario. This experiment is therefore not possible within a normal revision round (even if extended to 6 months).


    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript entitled "centrosome age breaks spindle size symmetry even in "symmetrically" dividing cells" by Thomas and Meraldi reports that centrosome age impacts microtubule-nucleation capacity and is sufficient to tune spindle symmetry and cell size in human culture cell lines. The manuscript is overall clear, well written, illustrated and discussed. Nonetheless, some key experiments are missing as the authors report very subtle differences that need to be confirmed with complementary experiments, including time-lapse microscopy and alternative evaluations of cell sizes. The mechanism by which spindle symmetry breaking is established by centrosome age is not clear, even if the authors have identified some important actors at the spindle poles.

      Major points:

      1. The evaluation of spindle and cell size asymmetry related to centrosome age only relies on fixed sample preparation. Cells should be followed by time-lapse microscopy as the metaphase plate position relative to the spindle poles and/or the cell cortex may fluctuate over time and as the observed differences remain in a very subtle range. This is an important possibility to consider for 1:1, 1:0 or 0:0 spindle pole configurations where centrosome integrity is impaired.
      2. Cell size asymmetry was evaluated based on cell area at the equator. Volumes will be a better indicator as daughter cell shapes can be different in telophase if they do not re-adhere at the same speed. This evaluation should also be confirmed with another readout, like the position of the cleavage furrow relative to the spindle poles in late anaphase, as again the observed differences are in a very subtle range.
      3. The authors propose that differential microtubule nucleation at the spindle poles underlies spindle size symmetry breaking without providing direct evidence. If the observed spindle symmetry in the 1:1 configuration after pericentrin, CDK5RAP2 or -tubulin siRNA fuels this interpretation (Fig4C), the differential microtubule nucleation capacity at the spindle poles after microtubule-depolymerisation-repolymerisation assays was not evaluated in these conditions, as compared to the control situation.
      4. If differential microtubule nucleation at the spindle poles is responsible for spindle asymmetry, overexpression of PCM proteins or -tubulin should be sufficient for re-establishment of symmetric protein distribution, spindle and cell size symmetry in 2:2 or 1:1 configuration. The authors should evaluate whether this is the case or not.
      5. The authors describe that the cortex-centrosome distance is not changed according to centrosome age (Fig2C), but centrosome-metaphase plate distance is (Fig1D). These observations are difficult to reconcile if differential microtubule-nucleation capacity is at play. Again, time-lapse microscopy would enable to detect over time whether only metaphase plate position relative to spindle poles is changing or if spindle pole position relative to the cell cortex is also fluctuating.

      Minor points:

      1. Main PCM and MT nucleation protein "depletion" do not appear to impact spindle assembly, but only spindle symmetry in 1:1 and 1:0 configurations (Fig4A and 4F-H). Can it be explained by the fact that their depletion is not always total (for pericentrin, Fig5F versus FigS2A or Fig7G)? Can they comment on this point?
      2. If centrosome age dictates spindle and cell size asymmetry through differential MT-nucleation capacity at the spindle poles, how can this process be modulated? Indeed, centrosome age is common to all cell types, but cell size asymmetry is more or less pronounced. The authors should further discuss this point based on the literature.

      Significance

      The question of whether centrosome age is translated into different capacity to nucleate microtubules and related consequences on spindle and cell size symmetry has already been addressed in different model systems. Nonetheless, cell lines were previously described as dividing symmetrically since their spindle is symmetric in size and since they give rise to daughter cells of equivalent sizes. The present manuscript reports a thorough re-evaluation of this question and provides evidence that subtle differences in PCM and spindle pole protein recruitment, microtubule-nucleation capacity and spindle symmetry can be observed as a function of centrosome age. They also identify some key actors whose differential recruitment at the spindle poles can underlie spindle symmetry breaking, even if their involvement seems to differ from one cell line to another one. This manuscript could be submitted after appropriate revisions as a report and will benefit to the basic research cell biology community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors show that in two types of human tissue culture cells the half spindles associated with mother centrosomes are slightly longer, on average, than the half spindles associated with daughter centrioles. They show that this correlates with centrosome age, with mother centrosomes tending to be associated with the longer half spindle, and with a correlative asymmetry in the size of daughter cells. They show that spindle asymmetry relates to asymmetries in the amount of certain PCM components at centrosomes, including Pericentrin, CDK5RAP2, TPX2, and γ-tubulin, which preferentially accumulate at mother centrosomes. Pericentrin/CDK5RAP2/TPX2/γ-tubulin are known to be involved, directly or indirectly, with microtubule nucleation, and the authors also show how that microtubule nucleation is more robust at mother centrosomes and that depletion of either Pericentrin, CDK5RAP2, TPX2, or γ-tubulin abolishes (or reduces) spindle asymmetry. The suggestion is that enhanced microtubule nucleation at the mother centrosome leads to longer half spindles and subsequent asymmetric positioning of the division plane and daughter cells of unequal size. Centrosome and spindle asymmetry is partially masked by the apparent equal accumulation of PCM at daughter centrioles, such that cells with centrosomes containing only mother centrioles show higher levels of asymmetry.

      Mechanistically, the authors show that a Cenexin-bound pool of Plk1, a kinase required for PCM assembly, is important for centrosome and spindle asymmetry. Cenexin is an "upstream" sub-distal appendage protein only found at mother centrosomes (due to appendage structures only being present on the grandmother centriole). Nevertheless, depletion of a more downstream sub-distal appendage protein, Centriolin, also abrogated spindle asymmetry, suggesting that multiple proteins of the sub-distal appendages are necessary for asymmetry. Results from some experiments show that spindle asymmetry and a known asymmetry in the distribution of polar centrosomes are mechanistically separable, while other experiments show a link.

      Major comments:

      1. It is not completely clear how the authors determined whether a spindle was asymmetric or not. In the methods, they say that statistical tests are described in the legends. In Figure 1 legend they say: "Each condition was compared to a theoretical distribution centered at 0 (dashed line)". How did they generate this theoretical distribution?
      2. The authors claim that TPX2 depletion results in loss of spindle asymmetry in 1:1 cells, but the difference is very small (1.7% in control vs 1.3% in TPX2 depletion, Fig 5B) and the data is more variable in TPX2 depletion, which makes it less likely that a statistically significant difference from 0 would be found. Firstly, perhaps the authors could check the standard error of the mean, which provides a measure of how accurate the mean is with regard to N and variation. If a dataset is more spread (such as in TPX2 depletion) a higher N is required to attain the same accuracy in the mean value. This is normally not so important when directly comparing two datasets, but in this case the authors are comparing each dataset to 0. So, are the authors measuring enough cells in the TPX2 depletion to be sure that a 1.3% value is not significantly different from 0? Secondly, I don't understand why the control cells have such a low asymmetry index (1.7%), when previous data in the paper shows an asymmetry index of 4.1% (Fig 1D) and 3.4% (Fig 4E) in control 1:1 cells. This suggests that something about the way this experiment was carried out dampens the asymmetry, which could therefore lead the authors to conclude that TPX2 is more important than it really is.
      3. The authors claim that daughter centrioles are associated with some Pericentrin and suggest that this may be why 2:2 centrosomes have less of an asymmetry than 1:1 centrosomes (Fig 6A). It is unclear whether the authors consider these daughter centrioles as being prematurely disengaged (they make reference to the fact that they previously showed how disengaged daughters recruit γ-tubulin, but it's unclear if this is related to their current observations). In Figure 6A, the Centrin spots look too far apart for engaged centrioles (~750nm). I appreciate that this may be the only way to dectect Pericentrin around the daughter at this resolution, but it may also force the authors to select cells where the centrioles have prematurely disengaged. For the asymmetry measurements, the authors presumably did not select cells where they could distinguish mother and daughter centrioles. One way to address this issue would be to compare PCM size at centrosomes in 2:2 cells with centrosomes in 1:1 cells. The expectation would be that centrosomes in 2:2 cells would have more PCM, due to the contribution of the daughter centrioles.
      4. The authors show that Plk1 recruitment by Cenexin (via S796 phosphorylation), which happens only at mother centrosomes, is important for asymmetry. Nevertheless, they show that Plk1 is symmetrically distributed between mother and daughter centrosomes (Table 1). This does not really fit, unless daughter centrosomes recruit more cenexin-independent Plk1 than mother centrosomes or if the cenexin-bound pool of Plk1 is only a minor fraction of total Plk1. If so, do the authors think that the Cenexin-bound pool of Plk1 is more potent than the rest of centrosomal Plk1?
      5. The circles drawn to measure cell size in Figures 2A,E and 7C do not look like a good representation of cell area (as the cells are not perfectly round). The authors use a formular for circle area with an approximation of the radius (based on mean length/width of an oval. It would be much better to use ImageJ to draw a freehand line around the perimeter of the cell and use the in-built tool to measure the area.

      Minor comments:

      1. Asymmetry in centrosome size that correlates with centrosome age in apparently symmetrically dividing "cells" has been observed previously in Drosophila syncytial embryos (Conduit et al., 2010a, Curr. Bio.). I think this should be mentioned somewhere given the topic of the study.
      2. A full description of statistical tests and n numbers for each experiment should be provided in the methods, even if this duplicates information in the Figure legends.

      OPTIONAL EXPERIMENTS: 3. Given that chTOG is very important for microtubule nucleation, it seems strange that this protein was not analysed for a potential asymmetry. 4. Cooling-warming experiments could be done using higher concentration of formaldehyde, as it's likely that microtubule nucleation is not immediately halted when using 4% formaldehyde.

      Significance

      This a well-conducted study with results being presented clearly and concisely. The methodology is solid in the main. The study reveals something unexpected - that apparently symmetrically dividing human tissue culture cells divide asymmetrically. While the asymmetry is only slight, it could be important - although the authors do not address its relevance for the cell population. Having analysed only 2 cultured cell types, it remains unclear if this is a widespread phenomenon, and whether this occurs in a more natural setting. Nevertheless, the proposed model (Plk1 at SDA's => increased PCM at mother => increased nucleation => offset division plane), which is supported by the data, would suggest this could be a widespread phenomenon. This study will be of interest to anyone studying cell division, but it would require some degree of insight into the importance of the observations for it to appeal to a very broad audience.

      I am a cell biologist with an interest in cell division and microtubule regulation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper, the authors demonstrated that there is asymmetry in mitotic spindles, which are usually considered symmetric. That is, they found that centrosome age causes asymmetry in the size of the half-spindle even when the number of centrioles forming the spindle poles is the usual pair or when there is only one mother centriole. It is also suggested that the difference in half-spindle size is due to the different microtubule-organizing activity of the centrosomes at each spindle pole. Furthermore, they observe that the difference in half-spindle size also results in asymmetries in the size of the daughter cells after cell division. In this study, they mainly analyze the mechanism by employing the condition of 1:1 number of centrioles, in which the difference in half-spindle size is more sharply pronounced. They showed that the subdistal appendage (SDA) of the centriole of the old centrosome is important for the molecular basis of this half-spindle size difference, as the SDA-dependent recruitment of the Plk1 pool mediates an asymmetric localization of pericentrin, Cdk5rap2, gamma-tubulin, TPX2, and other factors at the spindle poles. In addition, knockdown of these factors eliminated the half-spindle size asymmetry. They also confirm these findings using a different human cell line, BJ cells. In conclusion, they propose that, reflecting centrosome age, the old centrosome promotes asymmetric spindle formation by localizing a group of factors that promote microtubule organization, originating from the SDA-Plk1 pathway.

      Major points

      1. The discovery of differences in half-spindle size during symmetric division is intriguing. However, the methodology for quantification of the data remains unclear. Key questions, such as how the center of the metaphase plate is determined from the image data, the definition of exact pole position when centrioles are located at spindle poles, the objective determination of daughter cell diameter and width from the image data, and the referential position of the cortex, need more detailed explanation in the manuscript. Additionally, it's crucial to elucidate the specific index used to quantify differences from the image data, especially when dealing with data that only varies by a few percent. Providing clarity on these aspects and, in some cases, re-quantifying the data should be necessary.
      2. The mechanism behind the difference in half-spindle size, related to the subdistal appendage (SDA), raises questions, especially considering that SDA is believed to disassemble during mitosis. Exploring whether differences in the localization of PCM components and half-spindle size result from disparities in Plk1 and PCM loading during G2/early mitosis, prior to SDA disassembly, necessitates experimental verification.
      3. For investigating the mechanism of half-spindle size asymmetry, many perturbation experiments employ knock-down techniques. To directly address the cause of asymmetry, it might be valuable to artificially localize Plk1 and PCM factors to one spindle pole using optogenetic tools or similar approaches and then quantify half-spindle and daughter cell sizes.
      4. The asymmetry in Plk1 sub-population recruitment by SDA triggers the observed effects, but the evidence for this is relatively weak, given the small difference in spindle asymmetry. Quantifying the amount of Plk1 in its activated form, particularly in the context of SDA dismantling during metaphase, could strengthen this aspect of the study.
      5. While the focus on half-spindle size asymmetry during symmetric division is intriguing, it's important to address the broader physiological significance. The primary outcome of this asymmetry is differences in daughter cell size, which limits the broader significance of the study. Furthermore, the quantification method for daughter cell size warrants scrutiny and clarification.

      Minor points

      1. Table 1 lists factors with asymmetric localization not analyzed in detail in this paper. It would be beneficial to discuss whether these factors play a role in spindle asymmetry, and the authors should address the completeness of the data in Table 1 in terms of selecting factors for analysis.
      2. In Figure 1H, the impact of centriolin knock-out on the distribution of unaligned polar chromosomes is different from the effect of cenexin S796A in Figure 6H. This difference should be explained to provide clarity on the observed discrepancies.
      3. In Figure 2A, there is no correlation data presented between daughter cell asymmetry and the presence or absence of cenexin signal. This relationship should be elucidated for a more comprehensive understanding.
      4. In Figure 4G and H, the mean value of spindle asymmetry increases with siRNA treatment of Cdk5Rap2 or PCNT compared to the control. The possible interpretation of this finding should be discussed.
      5. Figure 4K shows that the asymmetry of PCNT distribution is not eliminated by centriolin knock-down. This observation requires clarification and discussion.
      6. It appears that the difference in spindle asymmetry of the control group in Figure 5A is smaller than in other data. This discrepancy should be addressed. Additionally, the influence of TPX2 depletion on spindle formation, and any corresponding spindle staining data, should be included.
      7. Claiming that the daughter centriole recruits PCM based on Figure 6A data alone may require additional supporting evidence. It is essential to investigate whether there is a clear PCM signal when the daughter centriole disengages in late mitosis and maintain consistency in the interpretation.
      8. The lack of difference in TPX2 distribution in Figure 7E should be explained, along with a discussion of how this observation aligns with the spindle asymmetry data and any inconsistencies.
      9. The differing N numbers between samples in all the figures may affect the validity of comparisons. The authors should discuss whether it is necessary to have consistent N numbers in each experiment for more robust conclusions.

      Significance

      In summary, while the study is intriguing for its exploration of spindle asymmetry during symmetric division, the major points raised here highlight areas where further clarification and data interpretation are needed. A more rigorous quantification method and additional evidence to support the proposed SDA-Plk1 signal as the initiator of asymmetry would enhance the study's validity. Moreover, addressing concerns about daughter cell size quantification and the physiological relevance of spindle asymmetry is essential for a more comprehensive understanding of the findings. This research presents an interesting challenge for researchers in the centrosome and mitotic spindle field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02172

      Corresponding author(s): Philip Elks

      [The “revision plan” should delineate the revisions that authors intend to carry out in response to the points raised by the referees. It also provides the authors with the opportunity to explain their view of the paper and of the referee reports.

      • *

      The document is important for the editors of affiliate journals when they make a first decision on the transferred manuscript. It will also be useful to readers of the reprint and help them to obtain a balanced view of the paper.

      • *

      If you wish to submit a full revision, please use our "Full Revision" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      • *

      In this paper we report the discovery that a member of the tribbles pseudokinase family, TRIB1 is expressed in human monocytes and is upregulated after stimulation with mycobacterial antigen in a human patient challenge model, the first direct link between immune cell Tribbles expression and innate immune response to infection. We then interrogated the mechanisms of Tribbles roles in TB using a human disease relevant whole-organism in vivo zebrafish model of TB. We show that specifically TRIB1 modulation can tip the battle between host and pathogen enhancing the innate immune response and reducing bacterial burden. We then uncover the molecular mechanisms responsible for the host protective effect of TRIB1, with enhanced antimicrobial reactive nitrogen species and il-1beta, via cooperation with Cop1 E3 ubiquitin ligase. Our findings demonstrate, for the first time, TRIB1 as a host moderator of antimicrobial mechanisms, whose manipulation is of benefit to the host during mycobacterial infection and as such, a potential novel therapeutic target against TB infection.

      We thank the reviewers for their positive appraisal of our work and for their helpful suggestions that will improve our manuscript. In particular we would like to highlight the reviewer’s comments on the gap/need for a new zebrafish in vivo model to understand the roles of tribbles in infection that can “be extrapolated into the human system”, and how they feel these findings will be of broad interest and “significance to cross section of the research community” attracting “interest from readers in the fields of infection, immunity, hematology and animal models” alongside “researchers studying all aspects of Tribbles pseudokinase function, especially researchers seeking models to test small molecule agonists and antagonists.”

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      • *

      Reviewer 1

      The major weakness of the manuscript is that the authors do not evaluate C/EBP transcription factors at all. It is rather surprising as they emphasize cooperation between Trib1 and Cop1 in the main title. C/EBP family proteins are key factors of Trib1-mediated modulation of granulocytes and monocytes. Also, slbo, a drosophila homolog of C/EBP, is a target of tribbles, indicating that the pathway is evolutionary conserved. I would request the following experiments and discussions.

      RESPONSE: We agree that possible C/EBP roles should be discussed in detail, and we will add a new discussion section on this.

      We stand by our data that the host protective mechanism of Trib1 acts requires Cop1, but we are not able to directly show a C/EBP mechanism within the scope of the current project due to a lack of tools/knowledge in the zebrafish on this (further points/comments below on this). It is important to note that we have not claimed a C/EBP mechanism in our manuscript, and we think it is possibly unlikely given that monocyte and granulocyte numbers are not altered after TRIB1 manipulation. Indeed, there are many other candidates other than C/EBP that COP1 could be acting through. Some examples include MAPK (Niespolo et al., Front Immunol, 2020), serine threonine kinases (Durzynska et al., Structure, 2017) and beta-Catenin (Zahid et al., Proteins, 2022).

      In response to this comment, we have modified the title from “Tribbles1 and Cop1 cooperate to protect the host during in vivo mycobacterial infection” to “Tribbles1 is host protective during in vivo mycobacterial infection”. We believe our data does show that the protective effect of Tribbles requires Cop1, but changing the title in this way removes any suggestion that they directly cooperate in the potential C/EBP dependent manner, suggested by the reviewer.

      Although the authors found the number of neutrophils and monocytes unchanged by Trib1 overexpression nor knockdown, they did not demonstrate the differentiation status of both cell types. This is quite an important issue, given that Trib1 knockout promotes granulocytic differentiation via C/EBPa accumulation in mice. Also, the analysis of granulocytic/monocytic differentiation will provide the crucial information how Trib1 protects the host from mycobacterial infection regulating hematopoietic cell functions. The authors should perform morphological analysis and examine cell surface marker expression to examine whether Trib1 and Cop1 modulates granulocytic and monocytic differentiation with and without Mm infection.

      RESPONSE: Unfortunately, we do not have the same level of immunology knowledge nor the antibodies to look at cell surface markers in zebrafish larvae (it is noted that the reviewer identifies that they “not have sufficient expertise in zebrafish models.” We agree with the reviewer that this would be an obvious and informative experiment to do in mouse models, but is not currently possible in zebrafish larval models). The transgenic promoters used (mpx for neutrophils and mpeg1) are robust and widely published to look at total neutrophil and macrophage numbers (Renshaw et al., Blood 2006; Ellett et al., Blood 2011). Mpx, encoding myeloperoxidase, is expressed late in neutrophil differentiation. It is also worth noting that the zebrafish larval model is still a developing organism, and neutrophil/macrophage numbers rise every day between 1 and 5 days post fertilisation, therefore any effect/delay in leukocyte differentiation would likely be captured at the 2dpf timepoint we have already quantified. We cannot perform leukocyte counts during Mm infection reliably as neutrophils/macrophages cluster around infected areas making counting challenging.

      However, in response to this comment we will:

      1. Use a new Tribbles 1 stable CRISPR-Cas9 knockout mutant we have generated and assess neutrophil differentiation using Sudan Black (SD). SD stains neutrophil granules the development of which is during a late phase of neutrophil differentiation.
      2. Interestingly, it has been shown that a zebrafish myeloid specific C/EBP (c/ebp1) is not required for initial macrophage or granulocyte development, but knockdown does result in a loss of the secondary granule gene LysC (Su et al., Zebrafish, 2007). Therefore, our findings are not inconsistent with existing literature, even if C/EBPs are regulated by Tribbles. However, to test this further we will use an LysC:mCherry transgenic line (Buchan et al., PLoS One 2019) to assess expression in developing neutrophils after trib1 manipulation.

      It is interesting that Cop1 knockdown zebrafish is viable, given its ubiquitous expression and multiple important targets of protein degradation. The authors should provide the details of phenotype of Cop1 KO larva and discuss on this issue.

      RESPONSE: Zebrafish mutants are much less often embryonic lethal than mice as maternally contributed protein stores allow for basic metabolic functions to occur throughout the short period of embryonic development (Rossant and Hopkins. Genes and Development 1992). However, in the case of Cop1 Crispant, this is a knockdown rather than a knockout, so there may be sufficient remaining Cop1 availability for development if it is indeed a requirement for larval viability. Although Cop1 knockout mice are non-viable, hypomorphs are viable and develop relatively normally (similar to our knockdown zebrafish) but are tumour prone as Cop1 is required for effective tumour suppression (Milgliorini et al., JCI, 2011).

      We had not commented on the Cop1 larvae phenotype as they look like they develop normally eg. normal body axis, development. However, we agree that this is a relevant point to incorporate into the manuscript and thus will add a comment on this in the Results section. Furthermore, we will add wholebody neutrophil counts into supplementary information, which we have performed and there is no change with cop1 knockdown, suggesting no difference in granulopoiesis.

      [Optional] To obtain the more solid evidence for the Cop1 dependent function of Trib1 on mycobacteria infection, it is better to use the Trib1 mutant that loses the Cop1 binding activity. This experiment will strength the authors' conclusion of the Trib1 and Cop1 cooperation.

      RESPONSE: We will address this comment by using a newly generated stable zebrafish CRISPR-Cas9 Tribbles 1 knockout line with a 14 base pair deletion that is predicted to lead a premature stop at 94aa in the middle of the pseudokinase domain, lacking the catalytic loop. This also lacks the predicted COP1 binding area at the C terminal of the protein. We will assess bacterial burden in this model.

      1. Previous studies have shown multiple defects in hematopoietic lineages such as M2-like macrophages and eosinophils in Trib1 KO mice, suggesting that Trib1 affects cellular functions of macrophages upon mycobacteria infection. I would request the authors to mention some ideas on this point in discussion.

      RESPONSE: We will add a section in the discussion to address this.

      • *

      Reviewer 2

      Structural comparisons are relatively descriptive of identity etc. Nowadays it should be relatively straightforward to comment on structural conservation based on Alphafold models. Specific details may not be accurate but gross folds will be, and comparing those may be more informative.

      RESPONSE: We have taken an initial look at Alphafold models and there are indeed structural similarities between zebrafish and human Tribbles. We will incorporate Alphafold structural models and comment on similarities/differences.

      Some discussion of the mechanisms regulating TRIB1/2/3 transcriptionally is probably relevant given the differential upregulation observed during infection. There is quite a bit of characterisation of different Tribble promoter regions in humans-how Edoes this translate to Zebrafish?

      RESPONSE: We will add a discussion point on what is known about Tribbles promoter regions in humans. We will assess whether anything is known about the promoter regions in zebrafish Tribbles (we have not identified literature on this currently). If nothing is known on this in zebrafish we will attempt to search for regulatory regions found in humans in the zebrafish promoters.

      In terms of Crispr use-can it be confirmed that Crispr modified cell lines have effects at the protein level? This is not my specific expertise, but the supplementary evidence shown seems to show some genomic editing is occurring, but not necessarily how it effects protein levels.

      RESPONSE: We do not have antibodies that work on zebrafish Tribbles proteins to assess this directly. However, we will address this comment by using a newly generated stable zebrafish CRISPR Tribbles 1 knockout line with a 14 base pair deletion that is predicted to lead a premature stop at 94aa in the middle of the pseudokinase domain, lacking the catalytic loop. Unlike the “CRISPant” knockdown work in the peer-reviewed version, this represents a full knockout of Tribbles 1. We will assess the trib1 cDNA of the full knockout line to assess the knockout in terms of transcript.

      A major conclusion of the paper seems to be that TRIB1 works with COP1 in Zebrafish to mediate response to infection. However the discussion does not particularly tie this with the other discussed mechanisms. E.g. JAK/STS, and EBP-linked responses are discussed separately from COP1, where they could well be linked?

      RESPONSE: We agree and this comment fits in with some comments from reviewer 1. We will rework areas of the discussion to address this and bring possible mechanisms together into a new discission section.

      • *

      Reviewer 3

      All comments addressed in new revision (see below).

      It is noted that this reviewer has “expertise from genetic studies of model organisms to assess all aspects of the tools and approaches used in the paper.”

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      • *

      Reviewer 1

      Figure 1D, E, F is mislabeled in lines 268-271.

      RESPONSE: Apologies for this typo, this has now been changed.

      Typo in line 399.

      RESPONSE: We have changed “suggesting” to “suggest”.

      Figure 6A-B is mislabeled in line 415

      RESPONSE: Apologies for this typo. We have changed this from “Figure 5A-B” to “Figure 6A-B”.

      • *

      Reviewer 2

      • *

      While the protective effect is stated as an effect size 'close to that of HIF-1a', is there additional rationale suggesting that the two may be linked?

      RESPONSE: Yes, there have been a number of studies that link Tribbles and Hif1-alpha. The best characterised link is in different cancer cells where Tribbles 3 has been linked to HIF-1alpha or hypoxia (in breast cancer (Wennemers, Breast Cancer Research 2011), renal cell carcinoma cells (Hong et al., Inj J Biol Sci, 2019) and adenocarcinoma (Xing et al., Cancer Management Research, 2020). In Drosophila Hif-1alpha induces TRIB in fat body tissue (Noguchi et al., Genes Cells 2022). We have now added references to these studies to the relevant section in the results.

      Reviewer 3

      • *

      Minor issues: small problems with clarity and figure panel correlation as detailed below:

      Mycobacterium marinum Lines 363-365 Refers to Fig 2C-D should be 3C-D

      RESPONSE: Apologies for this typo. This has now been changed to 3C-D.

      negative controls DN Hif-1alpha and PR (Figure 4A-B). Similarly, trib1 overexpression increased the levels of anti-nitrotyrosine staining, a proxy for immune cell antimicrobial nitric oxide production (Forlenza et al. 2008), to similar levels of DA Hif-1alpha (Elks et al. 2014; Elks et al) Not seeing this for Trib1

      RESPONSE: We are not completely sure what the reviewer is referring to here. We think possible confusion stems from the increase of nitric oxide in trib1 is compared to the phenol red control, so we have now clarified that in the text.

      As previously observed, overexpression of trib1 significantly reduced bacterial burden compared to phenol red controls when co-injected with tyrosinase guide (Figure 5A-B).

      The Fig 3 A-B is correct, although 6A-B appear to be novel panels showing this result

      RESPONSE: Yes, we agree, 6A-B has new results showing similar results to 3A-B, as it is necessary to include siblings from the same clutch in each graph to make direct comparisons. To avoid unnecessary confusion, we have removed the “as previously observed” for figure 6 as we had not previously had the tyrosinase co-injection so these are indeed new data.

      444 no comma 446 no comma 457 no comma after "activation"

      RESPONSE: We have removed these punctuations.

      472-475 confusing - better structure in particular in 474 what does "this" refer to?

      RESPONSE: We agree, and have clarified in the following new, clarified sentences:

      “Lipid droplets form in macrophages during Mtb infection that are potentially used as source of lipids by Mtb to allow for intracellular growth (Daniel et al. 2011). However, more recent findings suggest that lipid droplets are formed during the immune activation process after macrophage Mtb infection (Knight et al. 2018), that can subsequently influence the dynamics response of macrophage host defence (Menon et al. 2019). This macrophage lipid metabolism and handling could potentially be influenced by Tribbles.”

      525-526 confusing - better structure perhaps begin with 'Because...'

      RESPONSE: We have changed this confusing sentence to:

      “Here, we demonstrate il-1b and NO control by Trib1, suggesting that Trib1 controls multiple immune pathways and that therapeutic Trib1 manipulation may be more effective than targeting individual immune pathways alone.”

      confusing 538 "this and 539 pave the way for further research into TRIB1 as a target for host-derived therapies" Perhaps "further research into TRIB1 as a target for host-derived therapies could potentially improve infection outcome of mycobacterial infection via pharmacological targeted delivery methods and transient manipulation through genetic approaches"

      RESPONSE: We have changed this sentence as suggested.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      • *

      Reviewer 1

      1. The authors should investigate the expression of the C/EBPa protein p42 isoform and/or other C/EBP family proteins such as C/EBPb, and confirm that the p42 is degraded by Trib1 overexpression and recovered by Trib1 and Cop1 knockout. It is also important to determine both p42 and p30 isoforms are preserved in zebrafish.

      RESPONSE: This is a complex point to unpick in zebrafish and we believe this to be out of the scope of the current project. We do not claim a link to C/EBP. As mentioned in above comments we think that a link to C/EBP may be unlikely given that monocyte and granulocyte numbers are not altered after TRIB1 manipulation. We will add more data to look at different markers of neutrophils (see above comments). There are many other candidates other than C/EBP that COP1 could be acting through. Some examples include MAPK (Niespolo et al., Front Immunol, 2020), serine threonine kinases (Durzynska et al., Structure, 2017) and beta-Catenin (Zahid et al., Proteins, 2022). There is also evidence suggesting that COP1 and C/EBP have distinct binding sites on TRIB1, potentially unlinking their activity in some biological situations (Murphy et al., Structure, 2015).

      C/EBPa is found in zebrafish and is involved in myeloid differentiation and haematopoeisis (Yuan et al., Blood 2011). There is not a huge amount in the literature on this, but it has been shown in zebrafish models that the drug Tanshinone IIA reduces C/EBPa (Park et al., In J Mol Sci, 2017) and we know from previous work in our department that Tanshinone IIA does not affect total neutrophil numbers in the zebrafish larvae (Robertson et al., Sci Trans Medicine, 2014). The most involved C/EBP in zebrafish myelopoiesis appears is a zebrafish specific isoform called c/ebp1 that is myeloid expressed (Lyons et al., Blood 2001). This has a highly conserved carboxy-terminal bZIP domain but the amino-terminal domains are unique. Interestingly, reduction of c/ebp1 does not ablate initial macrophage or granulocyte development, but did result in loss of expression of LysC, a secondary granule marker (we are checking expression of this gene after Trib1 modulation using a LysC:mCherry transgenic zebrafish line).

      We do not have antibodies or tools to detect p42 and p30 in zebrafish. As Tribbles1 regulation of C/EBPa appears to be post-translational (Bauer et al., J Clin Invest, 2015), this would be incredibly challenging to unpick in the zebrafish model due to lack of tools to do this. Due to this and the reasons above we believe this to be out of the scope of the current project.

      [Optional] The effect of enhanced ERK phosphorylation by Trib1 for the protective effect against mycobacterial infection is another interesting point. It would be better if the authors could provide the ERK phosphorylation status upon Trib1 overexpression.

      RESPONSE: Unfortunately, we have no method to answer this question to a conclusive level within the scope of this project. There are limited reports of phosphorylated ERK antibodies that work in wholemount zebrafish (eg, Maurer and Sagerström, BMC Developmental Biology, 2018, that use a rabbit antibody), but this is widely expressed in many tissues of the zebrafish and immune cells would be challenging to resolve.

      Reviewer 2

      We have addressed or propose to address all of reviewer 2’s comments.

      • *

      Reviewer 3

      We have addressed or propose to address all of reviewer 3’s comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: In this manuscript, the authors test the role of Tribbles psuedokinase 1 in the primary immune defence against Mycobacterium tuberculosis, the pathogen in tuberculosis. After showing increased Trib1 and 2 in response to mycobacterial infection cell culture and from biopsies of challenged human tissue, they turn to a zebrafish model of infection of the caudal vein with Mycobacterium marinum, a natural fish pathogen similar in effects on macrophages to M. tuberculosis in humans.

      Authors find that overexpression of tribs 1, 2 and 3 had no strong effect on development but reduced the bacterial burder significantly for Trib1, less so for Trib2 and not at all for Trib3. Converesely, Trib1 Crisper-mediated knockdown increased Mm burde, which 3 had no effect ( and 2 guide RNAs were not effective at reducing Trib2).

      Trib1 increased levels of pro-inflammatory interleukin 1 beta, as measure by a reporter gne , which Trib3 had no similar effect, which was on par to the effect of the Hif-1alpha transcription factor, known to regulate il-1beta. The effect of Trib1 was not upon increased Hif-a (hence independent) but was dependent on the co-factor COP1, a known target of the Trib1 C-tail when activated

      No major problems seen

      Minor issues: small problems with clarity and figure panel correlation as detailed below Mycobacterium marinum Lines 363-365 Refers to Fig 2C-D should be 3C-D

      1. 386 negative controls DN Hif-1and PR (Figure 4A-B). Similarly, trib1 overexpression increased
      2. 387 the levels of anti-nitrotyrosine staining, a proxy for immune cell antimicrobial nitric oxide
      3. 388 production (Forlenza et al. 2008), to similar levels of DA Hif-1(Elks et al. 2014; Elks et al. Not seeing this for Trib1

      4. 414 As previously observed, overexpression of trib1 significantly reduced bacterial burden

      5. 415 compared to phenol red controls when co-injected with tyrosinase guide (Figure 5A-B). The Fig 3 A-B is correct, although 6A-B appear to be novel panels showing this result

      444 no comma 446 no comma 457 no comma after "activation" 472-475 confusing - better structure in particular in 474 what does "this" refer to? 525-526 confusing - better structure perhaps begin with 'Because...'

      confusing 538 "this and 539 pave the way for further research into TRIB1 as a target for host-derived therapies"

      Perhaps "further research into TRIB1 as a target for host-derived therapies could potentially improve infection outcome of mycobacterial infection via pharmacological targeted delivery methods and transient manipulation through genetic approaches"

      Significance

      General assessment: The paper is well written and clear. The importance of developing host derived therapies is stated well in the introduction and the discussion makes clear the significance of the work to developing the zebrafish as a useful alternative to traditional models for studying the pathophysiology of mycobacterial infection.

      Advance: for the Tribbles field, which I can comment on, this advances an important and neglected model organism (the zebrafish), introducing this set of three homologs Trib1, 2 and 3, that are well studied in mammals, Drosophila and C.elegans.

      Audience: The paper will be of significance to cross section of the research community on the one hand developing novel approaches to treat TB and on the other to researchers studying all aspects of Tribbles pseudokinase function, especially researchers seeking models to test small molecule agonists and antagonists.

      I have expertise from genetic studies of model organisms to assess all aspects of the tools and approaches used in the paper.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper by Hammond et al characterises the role of Tribbles proteins in Zebrafish in response to MTb infection. They show interesting upregulation of TRIB1 and 2, relative to TRIB3 which is not upregulated. This may provide an interesting system to further explore the role of Tribbles in response to infection, which is currently underexplored, but could do with some additional detail to stake that claim more strongly.

      Major Comments

      Some discussion of the mechanisms regulating TRIB1/2/3 transcriptionally is probably relevant given the differential upregulation observed during infection. There is quite a bit of characterisation of different Tribble promoter regions in humans-how does this translate to Zebrafish?

      Structural comparisons are relatively descriptive of identity etc. Nowadays it should be relatively straightforward to comment on structural conservation based on Alphafold models. Specific details may not be accurate but gross folds will be, and comparing those may be more informative.

      In terms of Crispr use-can it be confirmed that Crispr modified cell lines have effects at the protein level? This is not my specific expertise, but the supplementary evidence shown seems to show some genomic editing is occurring, but not necessairly how it effects protein levels.

      While the protective effect is stated as an effect size 'close to that of HIF-1a', is there additional rationale suggesting that the two may be linked?

      Minor comment

      A major conclusion of the paper seems to be that TRIB1 works with COP1 in Zebrafish to mediate response to infection. However the discussion does not particularly tie this with the other discussed mechanisms. E.g. JAK/STS, and EBP-linked responses are discussed seperately from COP1, where they could well be linked?

      Significance

      Tribbles are clearly important in immune cell development. The work is an interesting foray into Tribbles in Zebrafish, which is an interesting tool to be developed going forward. It also hints at a broader role of Tribbles in human infection, but this requires more work to play out.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The paper by Hammond et al. describes the protective role of Trib1 for mycobacterial infection such as tuberculosis. They first found that TRIB1 expression is upregulated upon mycobacterial antigen injection in human monocytes. They then tried to perform functional studies using a zebrafish model. To achieve the study, they confirmed the preservation of Trib family genes in zebrafish. The transgenic zebrafish expressing exogenous trib1 demonstrated decreased bacterial burden of M. marium, and trib1 knockdown showed the opposite effect. Trib1 expression also induced the production of Il1B and NO in the Hif independent manner. Finally, they found that cop1 knockdown reduced trib1-mediated protection against Mm infection.

      Major comments

      The major weakness of the manuscript is that the authors do not evaluate C/EBP transcription factors at all. It is rather surprising as they emphasize cooperation between Trib1 and Cop1 in the main title. C/EBP family proteins are key factors of Trib1-mediated modulation of granulocytes and monocytes. Also, slbo, a drosophila homolog of C/EBP, is a target of tribbles, indicating that the pathway is evolutionary conserved. I would request the following experiments and discussions.

      1. The authors should investigate the expression of the C/EBPa protein p42 isoform and/or other C/EBP family proteins such as C/EBPb, and confirm that the p42 is degraded by Trib1 overexpression and recovered by Trib1 and Cop1 knockout. It is also important to determine both p42 and p30 isoforms are preserved in zebrafish.
      2. Although the authors found the number of neutrophils and monocytes unchanged by Trib1 overexpression nor knockdown, they did not demonstrate the differentiation status of both cell types. This is quite an important issue, given that Trib1 knockout promotes granulocytic differentiation via C/EBPa accumulation in mice. Also, the analysis of granulocytic/monocytic differentiation will provide the crucial information how Trib1 protects the host from mycobacterial infection regulating hematopoietic cell functions. The authors should perform morphological analysis and examine cell surface marker expression to examine whether Trib1 and Cop1 modulates granulocytic and monocytic differentiation with and without Mm infection.
      3. It is interesting that Cop1 knockdown zebrafish is viable, given its ubiquitous expression and multiple important targets of protein degradation. The authors should provide the details of phenotype of Cop1 KO larva and discuss on this issue.
      4. [Optional] The effect of enhanced ERK phosphorylation by Trib1 for the protective effect against mycobacterial infection is another interesting point. It would be better if the authors could provide the ERK phosphorylation status upon Trib1 overexpression.
      5. [Optional] To obtain the more solid evidence for the Cop1 dependent function of Trib1 on mycobacteria infection, it is better to use the Trib1 mutant that loses the Cop1 binding activity. This experiment will strength the authors' conclusion of the Trib1 and Cop1 cooperation.

      Minor comments

      1. Previous studies have shown multiple defects in hematopoietic lineages such as M2-like macrophages and eosinophils in Trib1 KO mice, suggesting that Trib1 affects cellular functions of macrophages upon mycobacteria infection. I would request the authors to mention some ideas on this point in discussion.
      2. Figure 1D, E, F is mislabeled in lines 268-271.
      3. Typo in line 399.
      4. Figure 6A-B is mislabeled in line 415

      Referees cross-commenting

      I do not have any cross-comments, however, I believe comments of other reviewers are helpful for authors.

      Significance

      This study is the first report on the Tribbles protective function for mycobacterial infection. The use of the zebrafish model is unique and provides useful information on tribbles family genes are expressed in hematopoietic cells in zebrafish. As granulocytic and monocytic functions are conserved between fish and mammals, the results can be extrapolated into the human system to understand infection and pathogenic mechanisms of mycobacterium. The lack of investigation on C/EBP transcription factors is a major limitation to be improved in this study, since C/EBPa is a key molecule in the Trib1 and Cop1 cooperation in granulocytic differentiation. The manuscript will attract interest from readers in the fields of infection, immunity, hematology and animal models. This reviewer's field of expertise is hematology/oncology/model animals, although I do not have sufficient expertise in zebrafish models.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):*

      The mechanisms that differentiate ER from the nuclear envelope (NE) remain to be fully elucidated but likely depend at least in part on junctions between the ER and NE. How such junctions are formed and maintained is the subject of this manuscript where extensive correlative light and electron microscopy is used to observe and characterize ER-nuclear envelope (ER-NE) junctions at distinct phases of the cell cycle. The authors make use of their own electron tomography data as well as publicly available focused-ion beam scanning electron microscopy (FIB-SEM) datasets to compare the morphology of these junctions in different human cell types as well as in budding yeast. The major finding is that ER-NE junctions in human cell lines are more constricted than ER-ER junctions, often to the point of excluding lumen. The examination of mitotic cells suggests that this constriction likely occurs at the end of mitosis as the NE is completing its maturation from ER to NE. The implications of these morphological changes are discussed but there are no mechanistic or functional studies. Overall, the data are well presented, are of high quality and are rigorously evaluated. The manuscript is well written and scholarly, and the speculations as to the function of the constrictions are reasonable. I only have minor comments. *We thank the reviewer for the positive evaluation on our work and for the useful suggestions on how to further improve the manuscript.

      1. * In Figure 2D, the authors present evidence to demonstrate that an hourglass-like constriction occurs at ER-NE junctions. From the side view, it is difficult to interpret this on the plot, particularly for the ER-NE junctions with a lumen. Perhaps, in the supplemental data, the authors could plot both with and without lumen data separately, and color-code individual traces? I believe this would convey the hourglass nature of these constrictions more clearly.* To make it easier to see individual membrane profiles, we will plot the profiles with and without lumen separately and labelled each profile with distinct colour, as the reviewer suggested.

      * In the Methods section, the authors should describe how carbon-coating of sapphire discs was achieved. If these were provided from the manufacturer precoated, this should be specified.*

      We coated the sapphire discs with carbon by ourselves. We will specify how the carbon-coating was done in the revised manuscript.

      * On page 10, Figure 5F callout 9 lines from the bottom likely should be 5E. We will correct this error.

      Reviewer #1 (Significance (Required)):

      Overall, this work provides an important new morphological perspective on the nature of ER-NE junctions in human cells. As the authors describe in their introduction, such junctions have been noted previously in the literature but not in a dedicated study using modern imaging techniques in human cell lines. In describing the morphology of these junctions, the authors lay the groundwork for future mechanistic, functional, and structural studies. We thank the reviewer for appreciating the significance and the impact of our work.

      *

      • *

      • *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):*

      Summary: In this manuscript, Bragulat-Teixidor et al., use correlative live-cell imaging and electron tomography to study the structure of the endoplasmic reticulum-nuclear envelope (ER-NE) junction in HeLa cells (and also in S. cerevisiae). The authors also make use of publicly available whole-cell FIB-SEM datasets to study ER-NE junctions in mouse pancreatic islet, HeLa, and human macrophage cells to corroborate their findings in other cell types.

      The authors show that the structure of the ER-NE junction in interphase cells adopts an hourglass shape with a constricted neck. Comparing the ER-NE junction to the ER tubule-sheet junction, the authors show that these structures are different: the ER tubule-sheet junction is not constricted. Because the NE forms from the ER during postmitotic NE assembly, the authors compare the structure of the ER-NE junctions in anaphase, telophase, and interphase cells, and find that the junction becomes constricted in telophase. The number of ER-NE junctions increase going from telophase to interphase.

      While the authors do not provide any direct evidence for this, they propose a functional model where the ER-NE junction is constricted because it regulates the supply of certain lipids and proteins from the ER to the NE. One proposed example is that the constriction of the ER-NE junction might prevent the passage of large protein aggregates from entering the NE.

      The general question of how the structure of the ER-NE junction might regulate the passage of lipids and proteins from the ER to the NE is interesting and potentially important. However, the authors should address the following issues to improve the accuracy and completeness of this manuscript for it to be considered for publication. *We thank the reviewer for the appreciation of our work and the thoughtful suggestions for further improvements.

      * Major comments: 1. The authors compare the structure of the ER-NE junction to the structure of the ER tubule-sheet junction in interphase cells. They should instead or in addition be comparing the ER-NE junction to ER sheet-sheet junctions. This is likely a better comparison for two reasons:

      i) The NE is similar to an ER sheet due to its flat and extended structure. The ER membranes surrounding the NE consists mostly of a dense network of sheet-like ER (Zheng et al., 2022, PMID: 34912111). Therefore, the ER-NE junction should be compared to these NE-adjacent ER sheet-sheet junctions and not ER tubule-sheet junctions which are likely to be found in the cell periphery.

      ii) In HeLa cells, the NE assembles from large ER sheets and not ER tubules (Zhao et al., 2023, PMID: 37098350; Otsuka et al., 2018, PMID: 29323269; Lu et al., 2011, PMID: 21825076). Therefore, the ER-ER junctions the authors are already studying in anaphase cells are likely to be ER sheet-sheet junctions, which should be kept the same in their analysis of the ER-ER junctions in interphase cells.

      Related to this point, comparing the side view panels in Figure 2D with 2H, it seems that the width of the ER membranes on either side of the neck region of the ER-NE junction is in fact getting wider (more sheet-like). This is in contrast to the ER-ER junction where the width stays constant for the ER tubule that is fusing onto the ER sheet. This suggests that indeed, the ER-NE junction is more similar to an ER sheet-sheet junction. *It is a very interesting possibility that the ER-NE junction might be similar to the ER sheet-sheet junction. We will inspect whether the ER that forms the ER-NE junction consists of sheet or tubular ER in our EM tomograms, and describe the outcome in the revised manuscript.

      * The authors claim that in late anaphase cells, the ER-ER/NE (written like this because the ER and NE cannot be distinguished like the authors also point out) junctions are not constricted and had a similar morphology to ER-ER junctions in interphase. However, this claim is only qualitative at the moment, as the authors do not provide any quantification of the width of the ER-ER/NE junctions in late anaphase cells. To make the current claim that the ER-NE junction only becomes constricted in telophase, the authors should report the width of the ER-ER/NE junctions in late anaphase cells.

      In late anaphase cells, large ER sheets initially wrap around chromatin at the periphery of the chromosome mass (Zhao et al., 2023, PMID: 37098350; Otsuka et al., 2018, PMID: 29323269; Lu et al., 2011, PMID: 21825076). Therefore, the authors might find it easier to identify ER-ER/NE junctions in the so-called "non-core" regions, instead of in the current regions shown in Figure 3A. *As the reviewer pointed out, we did not provide quantification of the width of ER-ER/NE junctions in late anaphase cells. We will measure them and show the quantification in the revised manuscript.

      * Minor comments: 1. In the Supplementary Figures 1 A-D, make the scale bars white. Currently, the black scale bars are especially difficult to see in the top panels in Supplementary Figure 1C. *We will change the colour of some scale bars to make them more visible in the Supplementary Figure 1.

      * In the Results section entitled "The number of ER-NE junctions per cell increases from telophase to interphase", the authors should tone down this claim because the number of telophase cells examined is low (only 2 telophase versus 9 interphase cells). It would be better to include the word "slightly" in the title to change it to "slightly increases". *We will modify the text accordingly. * In the Results section entitled "The number of ER-NE junctions per cell increases from telophase to interphase", the authors state "These densities were much lower than those of ER-ER junctions...". For sure this is true for ER tubule-tubule junctions in the periphery of the cell as ER tubules form an intricate network by constantly fusing to each other, but it's not clear if this is also the case for ER tubule-sheet or ER sheet-sheet junctions. For clarity, the authors should state that they mean ER tubule-tubule junctions.

      Same comment also for the statement "...although their abundance remains considerably lower than that of ER-ER junctions or nuclear pores at both cell cycle stages". The authors should state that they mean ER tubule-tubule junctions. We will clarify what we mean by ER-ER junctions in the revised manuscript. * In the Results section entitled "The constricted morphology of ER-NE junctions is observed in different mammalian cells, but not in budding yeast", the authors state "...pancreatic islet cells (Figure 5A), HeLa (Figure 5B), and macrophage (Figure 5C) were significantly smaller than most ER-ER junctions (Figure 5F)". The last figure reference here is wrong and should be changed to Figures 5D-E. We will correct this error. * In Discussion, the authors state "Proteins known to form and stabilize junctions in the ER, including Atlastins and Lunapark...". The authors should specify that they mean ER tubule-tubule three-way junctions. Also more generally throughout the manuscript, the authors should be more careful in specifying which ER-ER junctions they mean in each case.*

      As pointed out in the Major comment 3 above, we will clarify this point in the revised manuscript.*

      1. In Discussion, the authors state "Thus, we favour a second scenario in which ER-NE junctions are generated from ER tubules that contact and eventually fuse with the ONM". Given that the ER membranes adjacent to the NE are mostly sheet-like (as pointed out in Major comment 1 above), the authors need to explain how they think an ER tubule (mostly found in the cell periphery) could access and fuse to the NE. As mentioned in the response to Major comment 1 above, we will examine if the ER that forms ER-NE junctions is tubule or sheet in our EM tomograms. Depending on the outcome of the examination, we will rephrase the text.

      *

      * Reviewer #2 (Significance (Required)):

      Although the ER-NE junction has been studied in other organisms before, this study represents the first structural characterisation of the ER-NE junction in mammalian cells. Therefore, this study represents an advance for the field in gaining a better understanding of different ER structures and morphologies. How the ER is remodelled during the cell cycle is also an interesting question and an active field of research (Merta et al., 2021 PMID: 34853314; Zhao et al., 2023, PMID: 37098350) which this study further contributes to. This study would therefore be interesting for anyone interested in ER structure/morphology, ER-NE connections, and cell cycle regulation of such ER-NE connections.

      My field expertise is in ER and NE. I do not have sufficient expertise to evaluate the methodology for the EM tomography part of this paper. We thank the reviewer for appreciating the novelty and the impact of our work.

      *

      *

      *

      * Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The manuscript by Bragulat-Teixidor et al. is a study of the connection of the ER with the nuclear envelope. It uses advanced ultrastructural techniques: high pressure freezing instead of chemical fixation and EM tomography instead of serial sectioning. Synchronized HeLa cell cultures were examined during interphase, late anaphase (4-6 min after anaphase onset) and early telophase (8-10 minutes after anaphase onset).

      The investigators find an unexpected, unusual structure - a constricted neck 7-20 wide and about 10 nm long where the ER connects to the nuclear envelope. The 7 nm connections had no apparent lumen. These are not seen in late anaphase when the NE has not yet formed, but they are seen a few minutes later during early telophase when there is a newly formed NE surrounding the chromosomes. A quantitation was made of their abundance, more was found later during interphase, and with wider lumens.

      It is very nice to show the EM images as uncolored and segmented (colored). The images shown in the figures are presumably the best that were obtained during the study. Heavy metals do not stain membranes uniformly or exclusively, and identification of structures doesn't always seem unambiguous. The three dimensional information can certainly make this easier though this information is difficult or not possible to show in journal format. In the end, the reader must depend on the judgment of the person who did the analysis. Overall, the analysis seems trustworthy. *We thank the reviewer for the comment. To better present the three-dimensional structure of ER-NE junctions, we will provide movies of the EM sub-tomograms containing the junctions. In this way, the readers will be able to inspect the three-dimensional structure of six ER-NE junctions.

      * HeLa cells are very convenient for getting information on cell cycle dependence. However, they are cancer cells in culture, so it is important to look at other cell types as well. The same methodology was used on budding yeast and they saw a wide tentlike connection, which reproduces an earlier study. This seems more consistent with what is known or expected from ER membranes. It is not less interesting but perhaps less puzzling. To get evidence on other mammalian cells, the authors did an analysis of data from OpenOrganelle. These are high pressure frozen cells / tissue imaged by FIB-SEM. The voxels are 4 nm, which is significantly larger than those in EM tomography. Unfortunately, the difficulty of identifying structures is correspondingly more significant. The images shown do not contradict the HeLa results but by themselves (without the HeLa cell data), a convincing case for narrow connections probably couldn't be made. *The reviewer raises a very good point about a limitation of the FIB-SEM datasets in OpenOrganelle. We agree with the reviewer that, as we had mentioned in the manuscript (line 6–11, page 10), the spatial resolution of the FIB-SEM datasets are not enough to gain insights into the exact morphology of the 7–20 nm wide ER-NE junctions because the voxel size is 4 nm. However, the resolution is good enough to examine if ER-NE junctions are narrower than ER-ER junctions, as shown in Figure 5A–E. The fact that we rarely found non-constricted ER-NE junctions in FIB-SEM datasets confirms the tiny nature of ER-NE junctions. To clarify this point, we will modify the text (line 24–25 on page 10) as below:

      Previous: This analysis of FIB-SEM images confirms the hourglass morphology that distinguishes ER–NE from ER–ER junctions as seen in our EM tomograms…

      Revised: This analysis of FIB-SEM images confirms that ER-NE junctions are narrower than ER-ER junctions as seen in our EM tomograms…

      * The work in this manuscript seems to have been done well. Assuming that this structure is confirmed in other mammalian cells, another kind of question comes to mind: is this the final word on ER to NE connections? The lumenless neck does not seem like it would be a stable structure, somehow it seems like a transient one. In the future, it would help if a new structural protein was identified or some theoretical analysis to help explain the shape. *Certainly, this will not be the final word on ER-NE junctions, which are crucial for the ER-to-NE transport of lipids and transmembrane proteins. In the future, it will be important to identify structural proteins regulating the junctions and reveal how their constricted morphology affects the ER-to-NE transport. We believe that, as you kindly mentioned in the last paragraph of your comments, our observations “serve as a starting point for further structural and functional work” for this unique yet fundamental junctions that connect the ER to nucleus.

      * It is generally now assumed that high pressure freezing preserves structure perfectly. However, in this reviewer's mind, there is a possibility that some structures are not. The sample is brought to 2000 atmospheres within a few milliseconds, frozen, then the high pressure is released after a second. Although many intracellular structures do seem well preserved, could the junction be susceptible to high pressure? A second source of uncertainty is that in order to embed the samples in resin, the water was removed by freeze substitution. This is known to cause a small amount of tissue shrinkage and possibly could alter a delicate structure. Another way to look at this kind of structure is cryo-EM tomography on hydrated lamellae from plunge frozen cells. I don't recommend that the authors do another arduous, possibly too arduous set of experiments with a completely different technique, but perhaps another group has data which could support their findings. *We think it is very unlikely that ER-NE junctions were deformed due to the high-pressure freezing. In general, high-pressure freezing allows vitrification of specimens up to 0.5 mm in thickness and the vitrification works better for thinner specimens. Our specimens are only 0.02 mm thick monolayer cells frozen in a chamber with 0.03 mm depth. Thus, the vitrification is expected to occur fast and the ER-NE junctions must have been frozen in the same way as in other regions of the cell.

      However, as the reviewer pointed out, it is possible that the dehydration of the samples due to freeze substitution might cause deformation in ER-NE junctions. To verify the structural preservation of ER-NE junctions in our protocol, we will compare the morphology of the ER and NE in cryo-EM datasets that are available in public databases with ours. We will describe the outcome in the revised manuscript.

      We think that our conclusion from the EM analysis is solid, because we observed significant structural difference between ER-NE junctions and ER-ER junctions in the same cells (Figure 2). In addition, we found the morphology change of ER-NE junctions in late-anaphase, early-telophase, and interphase cells that were high-pressure frozen and freeze-substituted on the same sapphire disc, and found that the ER-NE junctions became progressively constricted from telophase to interphase (Figure 3).

      * The following are suggestions for the Discussion:

      Yeast have many of the same biochemical processes as mammalian cells. Perhaps their lack of narrow connections can be used as a clue to the function of the narrow necks seen in HeLa cells. For instance, the authors speculate that the narrow connection serves to keep phosphatidylserine in the nuclear envelope low. If the yeast nucleus has the same concentration of phosphatidylserine as the ER, it would provide good evidence for this idea. Yes, it is indeed the case. It was shown that the yeast outer nuclear membrane has the same concentration of phosphatidylserine as the ER (Tsuji et al., Proc. Natl. Acad. Sci. U. S. A.*, 2019). We had described this in the discussion on page 14 “this phosphatidylserine enrichment occurs in mammalian cells and not in budding yeast (Tsuji et al., 2019)”, which was probably overlooked by the reviewer. In the revised manuscript, we will rephrase the text to make this point clearer.

      * There might be other instances of lumenless neck structures. Dynamin mutants can cause a stable constricted tubule - are the dimensions of this tubule similar to that of the ER / NE connections? Or possibly some ESCRT related structure? These are very interesting questions. As shown in Figure 2A-D and Supplementary Figure 1B, the inner diameter (an inner leaflet distance) of the lumenless ER-NE junctions is below 1 nm. In contrast, the inner diameter of most constricted membrane tubules that the dynamin mutant K44A Dynamin 1 generates is 3.7 nm (Antonny et al., EMBO J., 2016, doi: 10.15252/embj.201694613). The inner diameter of membrane tubules that ESCRT-III subunits CHMP1B and IST1 form is 4.4 nm (Nguyen et al., Nat. Struct. Mol. Biol.*, 2020, doi: 10.1038/s41594-020-0404-x). Thus, the lumenless ER-NE junctions is unique in their highly-constricted nature and might be regulated by proteins other than dynamin or ESCRT proteins. We will discuss this point in the revised manuscript.

      * There do not seem to be any recent studies of the ER / nuclear membrane connection in fixed cells. However, there is serial section data online which can be inspected. There are connections in mouse brain cortex in the data of Kasthuri et al., 2015 (https://neurodata.io/project/ocp/). Instead of a tubule connection, there seems to be a narrow sheet of ER that connects to the nuclear envelope. But there is something odd about these too. The authors may like to mention something about this or similar work in their manuscript. This reviewer has looked at chemically fixed data from several cell types from his own unpublished data and connections are surprisingly hard to find. Possibly, the connection is particularly sensitive to chemical fixation.* We inspected the serial section data of mouse brain cortex that was chemically fixed. The nuclear envelope in this dataset is deformed and does not seem well preserved. We do not think that we can extract useful information on the ultrastructure of ER-NE junctions from this dataset, and thus will not mention this work in our manuscript.

      It is great to hear that the reviewer tried to look for ER-NE junctions in their own EM data. The frequency of ER-NE junctions is rare (only 0.1 junction per square micrometer, Figure 4). Thus, we think that the reason why it was hard to find the junctions in the reviewer’s data is due to the low-frequent nature of this junction and not due to the chemical fixation.

      • *

      * Reviewer #3 (Significance (Required)):

      This is a careful and thorough study of the connection between the ER and the nuclear envelope. The discovery of reticulons and similar proteins, along with biophysical modeling, made the form of the ER accessible to analysis. The factors that govern ER structure are now much better understood. This is particularly true of sheets versus tubules, the three way tubule junctions and to some extent, the junction of ER tubules coming out of the edge of a sheet. However, with all this activity, the subject of the connection of the ER to the nucleus has not been examined in detail. What makes it different is that the tubule is connected perpendicular to the plane of a sheet.*

      We thank the reviewer for appreciating the quality and novelty of our work.

      * The manuscript uses the best ultrastructural techniques and provides strong evidence for a narrow neck at this connection in HeLa cells. With the same methodology, yeast cells (S. cerevisiae) have a wider connection. OpenOrganelle data from other mammalian cell types was examined. This data has less resolution and although it does not contradict the HeLa cell data, it does not support it strongly. *As mentioned in the response to one of this reviewer’s comments above, the spatial resolution of FIB-SEM datasets is good enough to examine if ER-NE junctions are narrower than ER-ER junctions. We think that our observation of several mammalian cells in FIB-SEM datasets strongly supports the conclusion that ER-NE junctions are narrower than ER-ER junctions and extends our findings in HeLa cells to two other mammalian cell types.

      * This work is of interest to cell biologists specializing in membranous organelles or those interested in nuclear physiology. The connection of ER to nuclear envelope is an interesting problem that has not been studied recently. This manuscript could very well serve as a starting point for further structural or functional work by the authors or other groups. *We thank the reviewer for appreciating the significance and impact of our work.

      *

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Summary: Membrane bound ribosomes and ER exit sites are present in the cytosolic side of nuclear envelope (NE), suggesting that NE shares protein translocation, folding and quality control functions with the endoplasmic reticulum (ER). Moreover, membrane continuity between the ER and outer NE membrane is evident, and, thus, NE is considered as a subdomain of the ER. To support this, during cell division, NE loses its identity, and participates to daughter cells as part of the ER. However, NE has also membrane proteins and luminal proteins that are enriched to NE and absent from ER during interface, and the segregation of NE specific proteins/lipids occurs concomitantly with NE formation during late anaphase/telophase. In this study, the ultrastructure of the ER-NE junctions is described using high resolution electron tomography. Results show convincingly a specific constriction at the ER-NE neck during interface in several mammalian cell types. This structure is absent during metaphase, and also from the budding yeast. Authors present a model for the formation of ER-NE junctions in higher eukaryotes and speculate about their functional role. *We thank the reviewer for the appreciation of our work and the valuable suggestions for further improvements.

      * Major comments: The main conclusion of the paper is that although the ER and outer NE membranes are continuous, a specific hourglass shaped constriction at the neck is found in higher mammalian cells during interphase. The structure is specific to ER-NE necks, as it is absent during metaphase and ER-ER junctions. For the analysis, authors used high pressure freezing to ensure best structural preservation. Unfortunately, fixation is not the only potential source of artifacts; during tomography at ambient temperature, the thinning of the plastic sections under the beam can be up to 30%. In evaluation of the results, authors should consider how this thinning could affect the measurements of membrane distances and luminal width, and what type of distortions may happen as a consequence of asymmetric shrinkage.*

      In addition to analysis of own samples, authors took advantage of the publicly available whole-cell datasets in OpenOrganelle and used these datasets to expand the number of cell types analyzed. Moreover, the 3D-datasets were generated with different imaging technique, FIB-SEM. Although this technique provides lower resolution in general, it provides isotropic resolution, and the data could be used to eliminate the shortcomings of the tomography, thinning of the sections and the missing wedge. The authors could expand the comparison of the data from these different sources from this perspective, especially since HeLa cells were used in their own tomography studies and FIB-SEM datasets in OpenOrganelle. Similarly, it would be interesting to see if similar approach could be used to compare their results to those obtained by cryo-EM by utilizing the cryo-EM database. Have authors checked if any suitable datasets for analysis of ER-NE junctions could be found from public archives? For the analysis of mitotic cells, double thymidine block was used to synchronize the cell culture. It is not clear, why synchronization was necessary, as CLEM was used to select the cells, and their number was rather low. Do cells continue growing and synthesizing new proteins during thymidine blocks? As one way to control potential artifacts due to the synchronization treatment, authors could compare the average thickness of ER and NE in naturally occurring interphase and mitotic cells vs. synchronized cells. We agree with the reviewer that it is important to clarify the degree of shrinkage and deformation of the sample that our EM protocol might introduce. To access the degree of sample shrinkage and deformation in the plastic sections, we will compare the ONM-INM distance measured in our plastic sections with the one in cryo-EM tomograms of rapidly-frozen and FIB-milled mammalian cells that are publically available (EMPIAR, the Electron Microscopy Public Image Archive, https://www.ebi.ac.uk/pdbe/emdb/empiar/), and describe the outcome in the revised manuscript.

      The reason why we synchronized the cell cycle is to enrich cells in late anaphase and early telophase in the same plastic sections, so that we can compare their ultrastructure side-by-side. In the revised manuscript, we will examine if the double thymidine block affects the ER-NE junction morphology by comparing the morphology of the ER and NE between the synchronised and non-synchronised cells.

      As we described in the response to Reviewer 3, we think that our conclusion from the EM analysis is solid because of the following reasons. (i) We observed a significant structural difference between ER-NE junctions and ER-ER junctions in the same cells (Figure 2). (ii) We discovered a morphology change of ER-NE junctions in late-anaphase, early-telophase, and interphase cells that were freeze-substituted on the same sapphire disc; the ER-NE junctions became progressively constricted from telophase to interphase (Figure 3).

      Minor comments: On page 5, last chapter (+ Fig.1 legend and materials and methods): "the quick tomograms covered the entire NE" is misleading, as the imaging covered a thin layer of the entire NE only. - Authors could have analyzed the entire NE from the FIB-SEM datasets but chose to use stereological approach to minimize their work.

      We will modify the text to make it clear that the quick tomograms covered the NE in a section and not the entire NE of the cell in the revised manuscript.

      * To save time from the readers to follow the reference, authors could describe how the specimens used in OpenOrganelle datasets were fixed and processed, especially as they emphasize the importance of high pressure freezing in their own sample prep. Similarly, in Fig.4 legend, authors refer to measurements done in the previous study without explaining how and from what type of data. *We thank the reviewer for pointing these out. We will describe how the OpenOrganelle datasets were generated and how the nuclear surface area measurement was done.

      • *

      Is there a difference between mesh generation and segmentation, or is it just two different terms used for the same thing by different programs? We apologize our short description of these terms. We will clarify these terms in the revised manuscript.

      *

      Reviewer #4 (Significance (Required)):

      General assessment: ER-NE gates were described earlier in the literature for specific cell types using standard thin-section TEM imaging, and in this study, the analysis was done with modern technology at 3D. The text is fluent and clear, and the quality of the images was excellent. The analysis of the data was thorough, and materials and methods including image analysis part were presented accurately and clearly. Ultrastructural analysis was done systematically, and generated models are beautiful and informative. Much thought has put into planning of the experiments and experimental approach. The shortcoming of the study is its limitation to ultrastructural analysis only without attempts to connect to any mechanism. The discussion part contains lot of speculation of the factors that might be needed for the formation and maintenance of the constriction and present several hypotheses for the function of the constriction. The paper would be much stronger if one of few of the leads would be followed, and if there would be any explanation for the role of these structures, or factors affecting them. *We thank the reviewer for the appreciation of the clarity and quality of our work. The molecular mechanism that regulates the function, shape and biogenesis of ER-NE junctions will be the subject of future studies, for which our discovery of a highly-constricted morphology of the ER-NE junctions lays the groundwork.

      * Advance: The paper provides a very nice example for the reuse of publicly archived imaging datasets to complement own experimental work. Hopefully this paper encourages others to the same path, as the large volumeEM datasets require significant investments and contain wealth of potential for reuse. *We strongly agree with the reviewer. The volume EM datasets that are publically available contain wealth of potential for new discoveries. We also hope that our paper encourages other scientists to make good use of those datasets and also to deposit their own data to the public databases. We will deposit our EM tomograms to EMPIAR, the Electron Microscopy Public Image Archive.

      * The paper strengthens the description of the ER-NE junction structure significantly and convincingly but does not further our understanding of the mechanisms behind the structure nor the function of them and raises more questions than provides answers. For structural analysis of this kind, the state-of-the-art technology is cryo-EM (e.g., preparation of lamella with cryo-FIB-SEM followed by cryo-tomography), and in this study, the technical limitations come from plastic embedding and ambient temperature imaging. The used techniques would be more adequate for cell biological study, where the described structure is somehow connected to the function in cell, or the factor(s) needed to the formation or maintenance are identified. *Indeed, a limitation of our current study is that we did not reveal the underlying molecular mechanism and the functions of the constricted morphology of ER-NE junctions. We do not think that cryo-EM is necessarily required because we have collected evidence that the ER-NE connections are distinct from the ER-ER junctions in not only our EM tomography data (Fig. 2) but also in the EM datasets deposited in public databases (Fig. 5).

      * Audience: This study will be of special interest to cell biology community. The study could be an opening to several lines of research, e.g., identification of the factors forming or maintaining the structure, the potential function of the structure, how the structure affects the dynamics of the NE/ER membrane and luminal proteins. *We thank the reviewer for appreciating the impact of our work.

      * Reviewer's expertise: The reviewer has long experience in electron microscopy, volumeEM techniques and image analysis, and operates mainly in the field of cell biology.*

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Summary:

      Membrane bound ribosomes and ER exit sites are present in the cytosolic side of nuclear envelope (NE), suggesting that NE shares protein translocation, folding and quality control functions with the endoplasmic reticulum (ER). Moreover, membrane continuity between the ER and outer NE membrane is evident, and, thus, NE is considered as a subdomain of the ER. To support this, during cell division, NE loses its identity, and participates to daughter cells as part of the ER. However, NE has also membrane proteins and luminal proteins that are enriched to NE and absent from ER during interface, and the segregation of NE specific proteins/lipids occurs concomitantly with NE formation during late anaphase/telophase. In this study, the ultrastructure of the ER-NE junctions is described using high resolution electron tomography. Results show convincingly a specific constriction at the ER-NE neck during interface in several mammalian cell types. This structure is absent during metaphase, and also from the budding yeast. Authors present a model for the formation of ER-NE junctions in higher eukaryotes and speculate about their functional role.

      Major comments:

      The main conclusion of the paper is that although the ER and outer NE membranes are continuous, a specific hourglass shaped constriction at the neck is found in higher mammalian cells during interphase. The structure is specific to ER-NE necks, as it is absent during metaphase and ER-ER junctions. For the analysis, authors used high pressure freezing to ensure best structural preservation. Unfortunately, fixation is not the only potential source of artifacts; during tomography at ambient temperature, the thinning of the plastic sections under the beam can be up to 30%. In evaluation of the results, authors should consider how this thinning could affect the measurements of membrane distances and luminal width, and what type of distortions may happen as a consequence of asymmetric shrinkage. In addition to analysis of own samples, authors took advantage of the publicly available whole-cell datasets in OpenOrganelle and used these datasets to expand the number of cell types analyzed. Moreover, the 3D-datasets were generated with different imaging technique, FIB-SEM. Although this technique provides lower resolution in general, it provides isotropic resolution, and the data could be used to eliminate the shortcomings of the tomography, thinning of the sections and the missing wedge. The authors could expand the comparison of the data from these different sources from this perspective, especially since HeLa cells were used in their own tomography studies and FIB-SEM datasets in OpenOrganelle. Similarly, it would be interesting to see if similar approach could be used to compare their results to those obtained by cryo-EM by utilizing the cryo-EM database. Have authors checked if any suitable datasets for analysis of ER-NE junctions could be found from public archives? For the analysis of mitotic cells, double thymidine block was used to synchronize the cell culture. It is not clear, why synchronization was necessary, as CLEM was used to select the cells, and their number was rather low. Do cells continue growing and synthesizing new proteins during thymidine blocks? As one way to control potential artifacts due to the synchronization treatment, authors could compare the average thickness of ER and NE in naturally occurring interphase and mitotic cells vs. synchronized cells.

      Minor comments:

      On page 5, last chapter (+ Fig.1 legend and materials and methods): "the quick tomograms covered the entire NE" is misleading, as the imaging covered a thin layer of the entire NE only. - Authors could have analyzed the entire NE from the FIB-SEM datasets but chose to use stereological approach to minimize their work. To save time from the readers to follow the reference, authors could describe how the specimens used in OpenOrganelle datasets were fixed and processed, especially as they emphasize the importance of high pressure freezing in their own sample prep. Similarly, in Fig.4 legend, authors refer to measurements done in the previous study without explaining how and from what type of data. Is there a difference between mesh generation and segmentation, or is it just two different terms used for the same thing by different programs?

      Significance

      General assessment:

      ER-NE gates were described earlier in the literature for specific cell types using standard thin-section TEM imaging, and in this study, the analysis was done with modern technology at 3D. The text is fluent and clear, and the quality of the images was excellent. The analysis of the data was thorough, and materials and methods including image analysis part were presented accurately and clearly. Ultrastructural analysis was done systematically, and generated models are beautiful and informative. Much thought has put into planning of the experiments and experimental approach. The shortcoming of the study is its limitation to ultrastructural analysis only without attempts to connect to any mechanism. The discussion part contains lot of speculation of the factors that might be needed for the formation and maintenance of the constriction and present several hypotheses for the function of the constriction. The paper would be much stronger if one of few of the leads would be followed, and if there would be any explanation for the role of these structures, or factors affecting them.

      Advance:

      The paper provides a very nice example for the reuse of publicly archived imaging datasets to complement own experimental work. Hopefully this paper encourages others to the same path, as the large volumeEM datasets require significant investments and contain wealth of potential for reuse.

      The paper strengthens the description of the ER-NE junction structure significantly and convincingly but does not further our understanding of the mechanisms behind the structure nor the function of them and raises more questions than provides answers. For structural analysis of this kind, the state-of-the-art technology is cryo-EM (e.g., preparation of lamella with cryo-FIB-SEM followed by cryo-tomography), and in this study, the technical limitations come from plastic embedding and ambient temperature imaging. The used techniques would be more adequate for cell biological study, where the described structure is somehow connected to the function in cell, or the factor(s) needed to the formation or maintenance are identified.

      Audience:

      This study will be of special interest to cell biology community. The study could be an opening to several lines of research, e.g., identification of the factors forming or maintaining the structure, the potential function of the structure, how the structure affects the dynamics of the NE/ER membrane and luminal proteins.

      Reviewer's expertise:

      The reviewer has long experience in electron microscopy, volumeEM techniques and image analysis, and operates mainly in the field of cell biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Bragulat-Teixidor et al. is a study of the connection of the ER with the nuclear envelope. It uses advanced ultrastructural techniques: high pressure freezing instead of chemical fixation and EM tomography instead of serial sectioning. Synchronized HeLa cell cultures were examined during interphase, late anaphase (4-6 min after anaphase onset) and early telophase (8-10 minutes after anaphase onset).

      The investigators find an unexpected, unusual structure - a constricted neck 7-20 wide and about 10 nm long where the ER connects to the nuclear envelope. The 7 nm connections had no apparent lumen. These are not seen in late anaphase when the NE has not yet formed, but they are seen a few minutes later during early telophase when there is a newly formed NE surrounding the chromosomes. A quantitation was made of their abundance, more was found later during interphase, and with wider lumens.

      It is very nice to show the EM images as uncolored and segmented (colored). The images shown in the figures are presumably the best that were obtained during the study. Heavy metals do not stain membranes uniformly or exclusively, and identification of structures doesn't always seem unambiguous. The three dimensional information can certainly make this easier though this information is difficult or not possible to show in journal format. In the end, the reader must depend on the judgment of the person who did the analysis. Overall, the analysis seems trustworthy.

      HeLa cells are very convenient for getting information on cell cycle dependence. However, they are cancer cells in culture, so it is important to look at other cell types as well. The same methodology was used on budding yeast and they saw a wide tentlike connection, which reproduces an earlier study. This seems more consistent with what is known or expected from ER membranes. It is not less interesting but perhaps less puzzling.

      To get evidence on other mammalian cells, the authors did an analysis of data from OpenOrganelle. These are high pressure frozen cells / tissue imaged by FIB-SEM. The voxels are 4 nm, which is significantly larger than those in EM tomography. Unfortunately, the difficulty of identifying structures is correspondingly more significant. The images shown do not contradict the HeLa results but by themselves (without the HeLa cell data), a convincing case for narrow connections probably couldn't be made.

      The work in this manuscript seems to have been done well. Assuming that this structure is confirmed in other mammalian cells, another kind of question comes to mind: is this the final word on ER to NE connections? The lumenless neck does not seem like it would be a stable structure, somehow it seems like a transient one. In the future, it would help if a new structural protein was identified or some theoretical analysis to help explain the shape.

      It is generally now assumed that high pressure freezing preserves structure perfectly. However, in this reviewer's mind, there is a possibility that some structures are not. The sample is brought to 2000 atmospheres within a few milliseconds, frozen, then the high pressure is released after a second. Although many intracellular structures do seem well preserved, could the junction be susceptible to high pressure? A second source of uncertainty is that in order to embed the samples in resin, the water was removed by freeze substitution. This is known to cause a small amount of tissue shrinkage and possibly could alter a delicate structure. Another way to look at this kind of structure is cryo-EM tomography on hydrated lamellae from plunge frozen cells. I don't recommend that the authors do another arduous, possibly too arduous set of experiments with a completely different technique, but perhaps another group has data which could support their findings.

      The following are suggestions for the Discussion:

      Yeast have many of the same biochemical processes as mammalian cells. Perhaps their lack of narrow connections can be used as a clue to the function of the narrow necks seen in HeLa cells. For instance, the authors speculate that the narrow connection serves to keep phosphatidylserine in the nuclear envelope low. If the yeast nucleus has the same concentration of phosphatidylserine as the ER, it would provide good evidence for this idea.

      There might be other instances of lumenless neck structures. Dynamin mutants can cause a stable constricted tubule - are the dimensions of this tubule similar to that of the ER / NE connections? Or possibly some ESCRT related structure?

      There do not seem to be any recent studies of the ER / nuclear membrane connection in fixed cells. However, there is serial section data online which can be inspected. There are connections in mouse brain cortex in the data of Kasthuri et al., 2015 (https://neurodata.io/project/ocp/). Instead of a tubule connection, there seems to be a narrow sheet of ER that connects to the nuclear envelope. But there is something odd about these too. The authors may like to mention something about this or similar work in their manuscript. This reviewer has looked at chemically fixed data from several cell types from his own unpublished data and connections are surprisingly hard to find. Possibly, the connection is particularly sensitive to chemical fixation.

      Significance

      This is a careful and thorough study of the connection between the ER and the nuclear envelope. The discovery of reticulons and similar proteins, along with biophysical modeling, made the form of the ER accessible to analysis. The factors that govern ER structure are now much better understood. This is particularly true of sheets versus tubules, the three way tubule junctions and to some extent, the junction of ER tubules coming out of the edge of a sheet. However, with all this activity, the subject of the connection of the ER to the nucleus has not been examined in detail. What makes it different is that the tubule is connected perpendicular to the plane of a sheet.

      The manuscript uses the best ultrastructural techniques and provides strong evidence for a narrow neck at this connection in HeLa cells. With the same methodology, yeast cells (S. cerevisiae) have a wider connection. OpenOrganelle data from other mammalian cell types was examined. This data has less resolution and although it does not contradict the HeLa cell data, it does not support it strongly.

      This work is of interest to cell biologists specializing in membranous organelles or those interested in nuclear physiology. The connection of ER to nuclear envelope is an interesting problem that has not been studied recently. This manuscript could very well serve as a starting point for further structural or functional work by the authors or other groups.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Bragulat-Teixidor et al., use correlative live-cell imaging and electron tomography to study the structure of the endoplasmic reticulum-nuclear envelope (ER-NE) junction in HeLa cells (and also in S. cerevisiae). The authors also make use of publicly available whole-cell FIB-SEM datasets to study ER-NE junctions in mouse pancreatic islet, HeLa, and human macrophage cells to corroborate their findings in other cell types.

      The authors show that the structure of the ER-NE junction in interphase cells adopts an hourglass shape with a constricted neck. Comparing the ER-NE junction to the ER tubule-sheet junction, the authors show that these structures are different: the ER tubule-sheet junction is not constricted. Because the NE forms from the ER during postmitotic NE assembly, the authors compare the structure of the ER-NE junctions in anaphase, telophase, and interphase cells, and find that the junction becomes constricted in telophase. The number of ER-NE junctions increase going from telophase to interphase.

      While the authors do not provide any direct evidence for this, they propose a functional model where the ER-NE junction is constricted because it regulates the supply of certain lipids and proteins from the ER to the NE. One proposed example is that the constriction of the ER-NE junction might prevent the passage of large protein aggregates from entering the NE.

      The general question of how the structure of the ER-NE junction might regulate the passage of lipids and proteins from the ER to the NE is interesting and potentially important. However, the authors should address the following issues to improve the accuracy and completeness of this manuscript for it to be considered for publication.

      Major comments:

      1. The authors compare the structure of the ER-NE junction to the structure of the ER tubule-sheet junction in interphase cells. They should instead or in addition be comparing the ER-NE junction to ER sheet-sheet junctions. This is likely a better comparison for two reasons:

      i) The NE is similar to an ER sheet due to its flat and extended structure. The ER membranes surrounding the NE consists mostly of a dense network of sheet-like ER (Zheng et al., 2022, PMID: 34912111). Therefore, the ER-NE junction should be compared to these NE-adjacent ER sheet-sheet junctions and not ER tubule-sheet junctions which are likely to be found in the cell periphery.

      ii) In HeLa cells, the NE assembles from large ER sheets and not ER tubules (Zhao et al., 2023, PMID: 37098350; Otsuka et al., 2018, PMID: 29323269; Lu et al., 2011, PMID: 21825076). Therefore, the ER-ER junctions the authors are already studying in anaphase cells are likely to be ER sheet-sheet junctions, which should be kept the same in their analysis of the ER-ER junctions in interphase cells.

      Related to this point, comparing the side view panels in Figure 2D with 2H, it seems that the width of the ER membranes on either side of the neck region of the ER-NE junction is in fact getting wider (more sheet-like). This is in contrast to the ER-ER junction where the width stays constant for the ER tubule that is fusing onto the ER sheet. This suggests that indeed, the ER-NE junction is more similar to an ER sheet-sheet junction. 2. The authors claim that in late anaphase cells, the ER-ER/NE (written like this because the ER and NE cannot be distinguished like the authors also point out) junctions are not constricted and had a similar morphology to ER-ER junctions in interphase. However, this claim is only qualitative at the moment, as the authors do not provide any quantification of the width of the ER-ER/NE junctions in late anaphase cells. To make the current claim that the ER-NE junction only becomes constricted in telophase, the authors should report the width of the ER-ER/NE junctions in late anaphase cells.

      In late anaphase cells, large ER sheets initially wrap around chromatin at the periphery of the chromosome mass (Zhao et al., 2023, PMID: 37098350; Otsuka et al., 2018, PMID: 29323269; Lu et al., 2011, PMID: 21825076). Therefore, the authors might find it easier to identify ER-ER/NE junctions in the so-called "non-core" regions, instead of in the current regions shown in Figure 3A.

      Minor comments:

      1. In the Supplementary Figures 1 A-D, make the scale bars white. Currently, the black scale bars are especially difficult to see in the top panels in Supplementary Figure 1C.
      2. In the Results section entitled "The number of ER-NE junctions per cell increases from telophase to interphase", the authors should tone down this claim because the number of telophase cells examined is low (only 2 telophase versus 9 interphase cells). It would be better to include the word "slightly" in the title to change it to "slightly increases".
      3. In the Results section entitled "The number of ER-NE junctions per cell increases from telophase to interphase", the authors state "These densities were much lower than those of ER-ER junctions...". For sure this is true for ER tubule-tubule junctions in the periphery of the cell as ER tubules form an intricate network by constantly fusing to each other, but it's not clear if this is also the case for ER tubule-sheet or ER sheet-sheet junctions. For clarity, the authors should state that they mean ER tubule-tubule junctions.

      Same comment also for the statement "...although their abundance remains considerably lower than that of ER-ER junctions or nuclear pores at both cell cycle stages". The authors should state that they mean ER tubule-tubule junctions. 4. In the Results section entitled "The constricted morphology of ER-NE junctions is observed in different mammalian cells, but not in budding yeast", the authors state "...pancreatic islet cells (Figure 5A), HeLa (Figure 5B), and macrophage (Figure 5C) were significantly smaller than most ER-ER junctions (Figure 5F)". The last figure reference here is wrong and should be changed to Figures 5D-E. 5. In Discussion, the authors state "Proteins known to form and stabilize junctions in the ER, including Atlastins and Lunapark...". The authors should specify that they mean ER tubule-tubule three-way junctions. Also more generally throughout the manuscript, the authors should be more careful in specifying which ER-ER junctions they mean in each case. 6. In Discussion, the authors state "Thus, we favour a second scenario in which ER-NE junctions are generated from ER tubules that contact and eventually fuse with the ONM". Given that the ER membranes adjacent to the NE are mostly sheet-like (as pointed out in Major comment 1 above), the authors need to explain how they think an ER tubule (mostly found in the cell periphery) could access and fuse to the NE.

      Significance

      Although the ER-NE junction has been studied in other organisms before, this study represents the first structural characterisation of the ER-NE junction in mammalian cells. Therefore, this study represents an advance for the field in gaining a better understanding of different ER structures and morphologies. How the ER is remodelled during the cell cycle is also an interesting question and an active field of research (Merta et al., 2021 PMID: 34853314; Zhao et al., 2023, PMID: 37098350) which this study further contributes to. This study would therefore be interesting for anyone interested in ER structure/morphology, ER-NE connections, and cell cycle regulation of such ER-NE connections.

      My field expertise is in ER and NE. I do not have sufficient expertise to evaluate the methodology for the EM tomography part of this paper.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The mechanisms that differentiate ER from the nuclear envelope (NE) remain to be fully elucidated but likely depend at least in part on junctions between the ER and NE. How such junctions are formed and maintained is the subject of this manuscript where extensive correlative light and electron microscopy is used to observe and characterize ER-nuclear envelope (ER-NE) junctions at distinct phases of the cell cycle. The authors make use of their own electron tomography data as well as publicly available focused-ion beam scanning electron microscopy (FIB-SEM) datasets to compare the morphology of these junctions in different human cell types as well as in budding yeast. The major finding is that ER-NE junctions in human cell lines are more constricted than ER-ER junctions, often to the point of excluding lumen. The examination of mitotic cells suggests that this constriction likely occurs at the end of mitosis as the NE is completing its maturation from ER to NE. The implications of these morphological changes are discussed but there are no mechanistic or functional studies. Overall, the data are well presented, are of high quality and are rigorously evaluated. The manuscript is well written and scholarly, and the speculations as to the function of the constrictions are reasonable. I only have minor comments.

      1. In Figure 2D, the authors present evidence to demonstrate that an hourglass-like constriction occurs at ER-NE junctions. From the side view, it is difficult to interpret this on the plot, particularly for the ER-NE junctions with a lumen. Perhaps, in the supplemental data, the authors could plot both with and without lumen data separately, and color-code individual traces? I believe this would convey the hourglass nature of these constrictions more clearly.
      2. In the Methods section, the authors should describe how carbon-coating of sapphire discs was achieved. If these were provided from the manufacturer precoated, this should be specified.
      3. On page 10, Figure 5F callout 9 lines from the bottom likely should be 5E.

      Significance

      Overall, this work provides an important new morphological perspective on the nature of ER-NE junctions in human cells. As the authors describe in their introduction, such junctions have been noted previously in the literature but not in a dedicated study using modern imaging techniques in human cell lines. In describing the morphology of these junctions, the authors lay the groundwork for future mechanistic, functional, and structural studies.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript entitled "The Drosophila Tumour Suppressor Lgl and Vap33 activate the Hippo pathway by a dual mechanism, involving RtGEF/Git/Arf79F and inhibition of the V-ATPase." by Portela et al. presents an interesting perspective of the molecular mechanism regulating Hippo pathway, revealing new proteins involved in this process. In this study, the authors try to show us that Lgl activates the Hippo pathway via Vap33 either by interacting with RtGEF/Git/Arf79F or by inhibiting V-ATPase, thus controlling epithelial tissue growth. The methodology used by the authors is adequate but could benefit from further experiments that would allow them to reach the conclusions stated in their research. Thus, based on the interpretation of the results presented by the authors some concerns were raised that should be addressed during the review process and that are explained in the major comments. Major comments: • It is not clear why in "The Hippo signaling pathway is negatively regulated by V-ATPase activity in Drosophila" section, the authors use Vha68-2 RNAi to reduce the activity of V-ATPase and later they use the overexpression of Vha44 to activate V-ATPase. The authors should explain why they used different proteins to regulate V-ATPase. The way the authors wrote their results sounds like different Vha proteins regulate V-ATPase, which means that cells may have different ways to activate V-ATPases, not being clear if regardless that the downstream effect of V-ATPase activation is always reflected in the Hippo pathway. Thus, the authors should state what other Vha proteins may have a similar effect, I would like to see evidence that Vha44 and Vha68 knockdown and overexpression leads to similar results.

      Response: Vha68-2 and Vha44 are both components of the V-ATPase. We have added further details to the results to make this clearer. We have previously shown that knocking down several components of the V-ATPase, which disrupt V-ATPase function, have a similar effect on the Notch pathway (Portela et al., 2018 Sci. Signal., PMID: 29871910). Vha44 overexpression had been documented to result in V-ATPase activation (Petzoldt et al., 2013, Dis Model Mech., PMID: 23335205), and no other Drosophila V-ATPase transgenes were available to conduct experiments with other lines.

      • In "Vap33 activates the Hippo pathway" section, the authors' conclusions represent a big statement considering the results obtained. Though Diap1 is a Hippo pathway target, it does not mean that this protein is solely regulated by this pathway. For example, there are studies that show that this gene can also be transcribed by STAT activity. Though in the following section the authors show how Vap33 activates this pathway, the results obtained in the section "Vap33 activates the Hippo pathway" are not enough to make this assumption. We suggest that the authors rephrase this section. (Optional: To maintain this statement, the authors should have performed, for example, a luciferase assay containing specifically Hippo pathway binding sites in the Diap1 gene, showing that the transcription factor of the Hippo pathway is somehow regulated by Vap33). Response: Whilst Jak-STAT signalling has been shown to induce Diap1 expression in the wing disc during development (PMID: 28045022), however expression profiling after activation of the Jak-STAT signalling in the eye epithelium did not identify Diap1 as a target (PMID: 19504457). Additionally, there are no reports that Lgl depletion in eye disc clones elevates Jak-STAT signalling (Stephens et al., J. Mol. Biol. 2018, PMID: 29409995), but instead loss of cell polarity in scrib mutant cells in the eye disc results in expression of the Jak-STAT pathway ligand, Upd, and non-cell autonomous induction of Jak-STAT signalling in the surrounding wild-type cells (PMID: 25719210, __PMID: __23108407). We have previously shown that Lgl depletion leads to inactivation of the Hippo pathway and elevates expression of the canonical Yki targets, Ex and Diap1 (Grzeschik et al., 2010, Curr Biol., PMID: 20362447). In this current study we show that Vap33 overexpression leads to the downregulation of Diap1 and in lgl mutant tissue reduces the elevated Diap1 expression. Since there is no evidence that either Lgl or Vap33 (VAPB) perturbations affect the Jak-STAT signalling pathway, we conclude from our results that Vap33 acts by reducing Yki activity and thus activating the Hippo pathway. We have added additional explanation to this section of our manuscript.

      • The authors present a highly speculative discussion, raising different hypotheses. Though such hypotheses are well supported by the literature, the authors would enrich the quality of their research if indeed they could prove them. Particularly, testing for vesicle acidification, testing if V-ATPase indeed blocks the interaction of Lgl/Vap33/RtGEF/Git/Arf79F, and alters Hpo localization, testing if Git/RtGEF inhibits Arf79F and consequent Hpo localization. Response: Although it would extend the paper to conduct further experiments, my lab is now closed so this is not possible. We have already published that vesicle acidification is increased in lgl mutant tissue (Portela et al., 2018, Sci. Signal., PMID: 29871910) and that Hpo localization is altered in lgl mutant tissue (Grzeschik et al., 2010, Curr. Biol., PMID: 20362447).

      • The authors should also apply more specific techniques to infer how the Hippo pathway is affected by such genetic manipulation since diap1 can be a target gene of different pathways. Response: We have shown that lgl mutant tissue also shows upregulation of the Hippo pathway target, Ex-LacZ, and affects the phosphorylation of Yki (Grzeschik et al., 2010, Curr. Biol., PMID: 20362447), and RtGEF/Git mutant tissue shows upregulation of the Yki target, Ex-LacZ (Dent et al., 2015, Curr. Biol., PMID: 25484297). Since RtGEF/Git are positive regulators of Hippo, but there is no evidence that they are involved in the regulation of the Jak-STAT pathway, the effect of Vap33 overexpression on Diap1 levels in the context of a RtGEF knockdown (Fig 5) is most likely to be due to effects on the Hippo pathway. Similarly, since Lgl deficiency upregulates Yki targets, Ex-LacZ and Diap1 (Grzeschik et al., 2010, Curr. Biol., PMID: 20362447), the reduction of the elevated Diap1 levels in lgl mutant clones by knocking down or reducing Arf79F activity (Fig 7), is most likely due to inhibition of Yki activity and therefore elevated Hippo pathway signalling.

      Minor comments: • The authors present a well-structured manuscript, that generally is easy to understand. However, at some points, the statements given by the authors seem highly speculative. • The figures presented in this manuscript and the statistical analysis seem adequate and are clearly described.

      Response: We thank the reviewer for their support of our study. We have added more explanation to support our conclusions.

      Reviewer #1 (Significance (Required)):

      The study presented by Portela et al. gives new insights into the regulation of the Hippo pathway with the discovery of new proteins involved in this mechanism, which can be interesting to those working on basic research and focused on studying signal transduction. However, this study lacks some novelty. Throughout the manuscript, the authors only observed the physiological consequences of manipulating this pathway based on the eye phenotypes, and in the discussion, many hypotheses were raised based on the already available literature, which shows that much is already known about the Hippo pathway. The advances shown in this study are limited to the description of the signaling pathway itself and to the eye morphology. As a suggestion, the authors should explore the knowledge of their findings in order to understand how we can use them to achieve advances in other fields and physiological conditions. For example, only at the end of the discussion, did the authors raise the questions that would really push their discoveries a step forward, namely how this mechanism acts during the response to tissue wounding and whether the mammalian orthologs of Lgl and Vap33 also act via these mechanisms to control tissue growth in mammals. It would be interesting if the authors could direct their research efforts to understand if the proteins identified can be targeted to improve wound healing or to delay aging for example. Altogether, the authors present an interesting study but, at this moment, it still lacks the significance and novelty needed for publication. We encourage the authors to keep up their good work to address these suggestions, which will definitely improve the quality of their study.

      Response: We respectfully disagree with the reviewer’s comments regarding the significance of our study. On the contrary, our study is significant since it has discovered a mechanism linking Lgl and Vap33-RtGEF/Git/Arf79F and the V-ATPase to the regulation of the Hippo pathway, an important tissue growth regulatory and tumour suppressor pathway. The Drosophila eye epithelium is a highly validated model for exploring mechanisms that are relevant to human epithelial biology and cancer. Whilst extending our studies of the mechanism by which Lgl controls the Hippo pathway to wound healing and mammalian systems would be the next step, this is beyond the scope of this discovery paper.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary This manuscript investigates potential mechanisms through which the lgl gene might affect the Hippo signaling pathway. The authors employ a combination of physical interaction studies and clonal analysis in Drosophila eye discs to investigate potential links between lgl and other genes. Some of the results are intriguing, but the analysis is rather preliminary, and there are technical concerns with some of the results presented.

      Main issues - The authors propose effects of genes involved in vesicle trafficking and acidification in Hippo signaling, but there is no clear cellular mechanism described by which these effects could be mediated. This deserves further consideration. eg if they think there are effects on the localization of Hippo, this could be directly examined. In the Discussion, the authors suggest that "The V-ATPase might therefore act to inhibit Hippo pathway signalling by blocking the interaction of Lgl/Vap33/RtGEF/Git/Arf79F with Hpo in vesicles, thereby altering Hpo localization and inhibiting its activity." but Hippo is a cytoplasmic protein and has never been reported to be within vesicles.

      Response: Whilst Hpo is a cytoplasmic protein there is evidence that it could also be associated with vesicles, since Hpo pathway components bind to several endocytic proteins by mass spectrometry analysis (Kwon et al., 2013, Science, PMID: 24114784; Verghese and Moberg, 2020, Front. Cell Dev. Biol., PMID: 32010696). We have previously published that Hippo localization is altered in lgl mutant tissue (Grzeschik et al., 2010, Curr. Biol., PMID: 20362447). For a better precision, we have updated the wording to state that the proteins described in our manuscript may alter Hippo localization “on endosomes” as opposed to the previous “in vesicles”.

      • The Yki stains in Fig. 1 are confusing. The nature of the signal throughout the wing disc looks very different in 1A vs 1B vs 1C, this needs to be explained or re-examined. Fig 1C (wts RNAi ) seems to show an elevated Yki signal in some cells, and lower in others in - prior studies have reported that wts affects the nuclear vs cytoplasmic localization of Yki, but not its levels, so this needs to be clarified.

      Response: There are some tissue folds in the eye disc tissues that might be confusing the reviewer, but Yki nuclear staining is lower in Vha68-2 mutant clones, and higher in wts knockdown and Vha44 over-expressing clones (arrowheads). When Yki is concentrated in the nucleus the staining appears more intense, as it does in the wts knockdown clones. Similar results on Yki staining upon Hippo pathway impairment in epithelial tissues have been obtained by other Hippo pathway researchers (eg PMID: 20362445, __PMID: 19900439, PMID: __19913529, __PMID: __26364751).

      • In Fig 1D the clones appear to have different effects in different regions of the eye disc; the authors should clarify. Also, the disc in 1D appears much younger than the discs in 1A-C, but similar age discs should be used for all comparisons.

      Response: All eye discs are from wandering 3rd instar larvae, but the mounting of the samples on the slide and the confocal Z-section could account for apparent different regions of the eye disc showing stronger upregulation of Ex-LacZ and Yki staining. The data has been statistically analysed from multiple eye discs and the effects observed are significantly different to the control (as plotted in Fig 1E).

      • The authors should clarify whether any the manipulations they perform are associated with Jnk activation, as this could potentially provide an alternative explanation for downregulation of Hippo signaling.

      Response: Lgl mutant clones only upregulate the JNK target MMP1 in some cells at the border of the clones but show elevated Yki activity within the clones. Vha44 overexpressing clones do show upregulation of JNK signalling (Petzoldt et al., 2013, Dis Model Mech., PMID: 23335205), but since JNK signalling is known to inhibit Yki activity in the eye epithelium (PMID: 22190496), it is unlikely that the upregulation of Yki activity (downregulation of Hippo signalling) in Vha44 overexpressing clones is due to JNK activation.

      • The authors report in Fig 2C,E that over-expression of Vap33 reduced expression of Diap1, which they interpret as evidence of increased Hippo pathway activity, but this experiment is lacking essential controls, as the apparent reduction of Diap1 could simply reflect increased cell death or a change in focal plane, and indeed the difference in the label stain makes it look like these cells are undergoing apoptosis. Thus it's important to also have a stain for a neutral protein, or at least a DNA stain. Additionally, it is important to stain for at least one additional marker of Hippo pathway activity (eg ex-lacZ or Yki localization), as there are other pathways that regulate Diap1

      Response: We have previously examined the effect of Vap33 overexpressing clones on the Notch signalling pathway and do not see a reduction in Notch target gene expression relative to the control (Portela et al., 2018, Sci. Signal., PMID: 29871910, Fig 3). Thus, although there might be some cell death in Vap33 overexpressing clones (possibly due to lower Diap1 levels), it is unlikely that cell death per se results in lower Diap1 levels. We are unable to conduct further experiments to examine other Hippo pathway activity markers since my lab is now closed.

      • In Fig. 4 the authors perform PLA experiments to examine potential association between various pairs of proteins, but they don't show us key controls. They report in the text using single antibodies as negative controls, but this doesn't control for non-specific localization of antibodies. The better negative control is to do the PLA experiments in parallel on tissues lacking the protein being detected (eg from animals not expressing the GFP- or RFP-tagged proteins they are examining). Also, there is a lot of variation in the apparent signals shown in different PLA experiments in fig 4, the authors should comment on this.

      Response: We have previously used the PLA assay to examine Lgl and Vap33 interactions (Portela et al., 2018, Sci. Signal., PMID: 29871910, Fig 2) and have conducted an experiment expressing Vap33 tagged with HA via the GMR driver in the posterior region of the eye disc and then detected Lgl-HA protein interactions, which only showed PLA foci in the posterior region where Vap33-HA is expressed but not in the anterior region where Vap33-HA is not expressed. This may be thought of as the best possible control since these differentially expressing regions were part of the same tissue sample. Furthermore, in our previous study (Portela et al., 2018, Sci. Signal., PMID: 29871910, Fig S2), we conducted a negative control PLA using the GFP and Vap33 antibodies in eye tissue not expressing GFP-Lgl and observed no PLA foci. We have edited the text to refer to these controls.

      The variation in PLA signal may be due to low levels of expression of certain proteins or lower levels of protein-protein interactions. We have edited the text to add this explanation.

      • The authors claim that RtGEF mutant cells increase Diap1 expression, and that Vap33 over-expression reverses this effect (Fig. 5). The effect of RtGEF looks very subtle and variable, it should be confirmed by examining additional reporters of Hippo pathway activity. It also seems like the disc in 5A is at a different stage &/or the quantitation is done from a different region as compared to the disc in 5C.

      Response: RtGEF mutant cells have also been shown to upregulate the Yki target, Ex-LacZ (Dent et al., 2015). Unfortunately, we were unable to construct an Ex-LacZ RtGEF mutant stock and there was no available Ex antibody.

      For Diap1 quantification, clones were chosen just posterior to the morphogenetic furrow of each eye disc and multiple clones were analysed relative to the adjacent wild-type clones in many samples and quantified and plotted in Fig 5E.

      • The analysis of the influence of Vha68-2 mutant clones, and their genetic interaction with Git, similarly suffers from missing controls and incomplete analysis. Additional Hippo reporters besides just Diap1 should be examined. The Diap1 analysis which shows reduced expression needs examination of neutral controls or nuclear markers to assess potential apoptosis within clones, or changes in focal plane.

      Response: We have also examined the effect of Vha68-2 clones on Ex-LacZ expression (Figure 1) and show that it is also reduced relative to the surrounding wild-type clones.

      We have previously examined Vha68-2 mutant clones for the expression of a Notch pathway target (Portela et al., 2018, Sci. Signal., PMID: 29871910, Fig S1) and show with DAP1 staining that cells are in the same plane and are retained in pupal retina, so are not dying. We now refer to our previous study in the text.

      Similarly, the analysis of Arf79F mutant clones in Fig 7E,G is compromised by lack of controls for viability and tissue layer, and analysis of an additional Hippo reporter is once again essential.

      Response: We don’t believe DAPI stains are necessary as the GFP membrane/cytoplasmic staining clearly shows the outline of the cells and where the nucleus is in the mutant clones and shows that the cells are intact and not dying.

      Reviewer #2 (Significance (Required)):

      The strength of the study is the potential dissection of novel connections between the lgl tumor suppressor and the Hippo pathway. However, there are signifiant limitations due to the preliminary nature of the study, which is incomplete and missing essential controls. If these limitations are overcome the work will be of interest to specialists in the field.

      Response: We are hoping that our explanations and responses to the main issues above alleviate concerns regarding controls.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this study, Portela and colleagues identified new regulators of Hippo pathway downstream of the core apico-basal polarity protein Lgl. While the impact of Lgl depletion of Yki activation was already characterised both in Drosophila and Vertebrates, the mechanism connecting these two pathways was still unclear. Using the Drosophila eye, mosaic analysis, epistatic analysis and mass-spectrometry, they identified two routes through which Lgl depletion can lead to Hippo pathway downregulation and eye overgrowth. This regulation required the previously characterised Lgl interactor Vap33, which on the one hand activates Hippo by inhibiting the V-ATPase, and on the other hand activates Hippo through its interactions with the actin regulators Git, RtGEF (two previously characterised regulators of Hippo, https://pubmed.ncbi.nlm.nih.gov/25484297/) . They also identified another regulator of Hippo downstream of Lgl, Arf79F, whose ortholog interact with Git in mammals and is also in close proximity with Hippo, Git and RtGEF in Drosophila, and whose depletion abolish Hippo downregulation and eye overgrowth in Lgl mutant. This is a well performed study which identified new links between Lgl and regulation of the Hippo pathway. Many of them are conserved in mammals and may be relevant in pathological conditions associated with Lgl loss of function and Yap missregulation. The experiments are well conducted with a quite thorough epistatic analysis combined with many assays to characterize protein interactions. Admittedly, the molecular mechanism remains uncharacterised and some experiments may help to indicate putative mechanisms, but the characterisation of these news regulators and clear genetic interactions results constitute already solid and interesting data. I have some suggestions though that could help to reinforce the conclusions.

      Main suggestions :

      1. While the precise molecular mechanisms is not absolutely necessary, it would be interesting to document the subcellular localisation of these new Hippo regulators in WT and Lgl mutant context (Git, RtGEF Vap33 and Arf79F), either with Antibody if they exist, or with fusion protein (which for a good part were already generated for the PLA results). This may reveal obvious misslocalisation which would support the role of Lgl as a scaffolding protein that maintain proper subcellular localisation of these factors.

      Response: Whilst this experiment would extend the study, we are unable to do this since my lab has now closed.

      Most of the epistatic experiments focus on factors that rescue the overgrowth and increase of diap1 expression in Lgl mutant. Did the author test if any of these core regulators are sufficient to recapitulate Lgl mutant eye phenotype, for instance Vap33 KD in the eye, or Arf79F overexpression. Negative results would still be informative as they would point to the existence of other downstream regulators of the eye phenotype

      Response: Vap33 knockdown by RNAi in clones does not phenocopy the lgl mutant mosaic adult eye phenotype (Portela et al., 2018, Sci. Signal., PMID: 29871910, Fig 2), presumably due to other functions of Vap33. We have added further details regarding this point In the Discussion.

      We have not examined Arf79F overexpressing clones.

      It is at the moment hard to interpret the relevance of the results obtained by PLA. While there are some negative controls based on the absence of secondary antibody, what is the number of particle obtained for two non-interacting cortical proteins ? Since this is based on proximity, I would expect that some positive particles would still appear by chance, but much less than for two physically interacting proteins or subunits of a complex. Could the author provide such a negative control by testing for instance Git/RtGEF with another non-interacting cortical protein ? That would help to assess the relevance of the conclusions based on PLA.

      Response: The PLA is a robust assay to assess protein-protein interactions of proteins that are

      Some of the epistatic links are a bit hard to interpret at the moment, and additional epistatic test may be relevant. For instance, the increase of diap1 upon Git depletion in the Vha68 mutant (Figure 6) is used to conclude that Git is required for the Hippo upregulation upon reduced V-ATPase activity. However this could be compatible with two independent branches regulating Hippo (in an opposite manner), which is more less what is suggested by the authors in their conclusion and the model of figure 8. I would suggest to reformulate this conclusion in the result part. Similarly, there is currently no experimental exploration of the epistatic link between Arf79F, Git and RtGEF (which is based on results in mammals). It would be interesting to check if Git and RtGEF mutant phenotype (Hippo downregulation) can also be suppressed by downregulation of Arf79F.

      Response: We have now added further explanation to the result section regarding Fig 6.

      Unfortunately, we are unable to do further experiments since my lab is now closed.

      Apart from very obvious phenotype (eye in Lgl mutant mosaic) it is a bit hard to interpret the picture of adult eye provided in this study (specially for mild phenotype). Could the authors provide more explanation in the legends, and if possible some quantitative evaluation of the phenotype when relevant? Otherwise, apart from obvious rescue of the Lgl mutant, it is a bit hard to interpret the other genotypes (e.g. : Vap33OE, RtGEF mutant, Vha68 mutant)

      Response: We have added more explanation of the adult eye phenotypes in the text/fig legends.

      Other minor points :

      1. I would recommend when possible to clearly indicate in Figure 8 which part of the pathway are clearly documented in this study, and which part are still hypothetical (eg: link with PAK).

      Response: We have re-drawn the model figure to highlight what we have found in this study by adding orange arrows between Lgl-Vap33-RtGEF/Git-Arf79F-Hpo and Lgl-Vap33-V-ATPase and V-ATPase-Hpo.

      1. Page 4, the sentence "as aPKC's association with the Hpo orthologs, MST1/2, and uncoupling MST from the downstream kinase, LATS (Wts), thereby leading to increased nuclear YAP (Yki) activity [17], consistent with what we observe in Drosophila [5]." may need to be reformulated (at least I had trouble to understand it).

      Response: We have edited the sentence to "In mammalian systems, deregulation of Lgl/aPKC impairs Hippo signalling and induces cell transformation, which mechanistically involves the association of aPKC with the Hpo orthologs, MST1/2, thereby uncoupling MST from the downstream kinase, LATS (Wts) and leading to increased nuclear YAP (Yki) activity [17], consistent with what we observe in Drosophila [5]."

      1. Page 11 : "a decrease in Diap1 expression was observed and clones were smaller than wild-type clones (Fig 7E), suggesting that the Arf79F knockdown clones were being out-competed" I am not sure one can conclude from this that the clone are "outcompeted" (which would suggest at context dependent disappearance of clone, while here the data could be totally compatible with a cell-autonomous decrease of growth and survival). This statement would only make sense if global eye depletion of Ar79F had no adult eye phenotype.

      Response: We have edited the sentence to "a decrease in Diap1 expression was observed and clones were smaller than wild-type clones (Fig 7E), suggesting that the Arf79F knockdown clones have reduced tissue growth ----."

      Reviewer #3 (Significance (Required)):

      This study identifies regulators of Hippo which through their interactions with Vap33 explains for the first time how Lgl depletion leads to Hippo misregulation (without impairing apico-basal polarity). This is an interesting study based on epistatic analysis and mass-spectrometry and identify several regulators conserved in mammals. While the molecular mechanism remained to be explored, it clarifies for the first time how Lgl depletion ( a core regulator of apico-basal polarity) leads to Hippo downregulation and tissue overgrowth, a phenotype also observed in mammals and characterised several years ago in Drosophila. The authors previously characterised the interaction between Vap33 and Lgl and its role in the regulation of Notch signaling through the V-ATPase. This study nicely complement these previous results and connect now Vap33 with Hippo and Lgl while answering a long unresolved question (how Lgl depletion affect Hippo pathway).

      This results will be interesting for the large community studying the hippo pathway, apico-basal polarity and tissue growth. It also outlines interesting factors that could be relevant for tumour neoplasia and hyperplasia.

      I have expertise in epithelial biology, apoptosis, cell competition, Drosophila, cell extrusion, mechanobiology, morphogenesis and growth regulation.

      Response: We thank the reviewer for recognizing the significance of our study.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, Portela and colleagues identified new regulators of Hippo pathway downstream of the core apico-basal polarity protein Lgl. While the impact of Lgl depletion of Yki activation was already characterised both in Drosophila and Vertebrates, the mechanism connecting these two pathways was still unclear. Using the Drosophila eye, mosaic analysis, epistatic analysis and mass-spectrometry, they identified two routes through which Lgl depletion can lead to Hippo pathway downregulation and eye overgrowth. This regulation required the previously characterised Lgl interactor Vap33, which on the one hand activates Hippo by inhibiting the V-ATPase, and on the other hand activates Hippo through its interactions with the actin regulators Git, RtGEF (two previously characterised regulators of Hippo, https://pubmed.ncbi.nlm.nih.gov/25484297/) . They also identified another regulator of Hippo downstream of Lgl, Arf79C, whose ortholog interact with Git in mammals and is also in close proximity with Hippo, Git and RtGEF in Drosophila, and whose depletion abolish Hippo downregulation and eye overgrowth in Lgl mutant.

      This is a well performed study which identified new links between Lgl and regulation of the Hippo pathway. Many of them are conserved in mammals and may be relevant in pathological conditions associated with Lgl loss of function and Yap missregulation. The experiments are well conducted with a quite thorough epistatic analysis combined with many assays to characterize protein interactions. Admittedly, the molecular mechanism remains uncharacterised and some experiments may help to indicate putative mechanisms, but the characterisation of these news regulators and clear genetic interactions results constitute already solid and interesting data. I have some suggestions though that could help to reinforce the conclusions.

      Main suggestions:

      1. While the precise molecular mechanisms is not absolutely necessary, it would be interesting to document the subcellular localisation of these new Hippo regulators in WT and Lgl mutant context (Git, RtGEF Vap33 and Arf79F), either with Antibody if they exist, or with fusion protein (which for a good part were already generated for the PLA results). This may reveal obvious misslocalisation which would support the role of Lgl as a scaffolding protein that maintain proper subcellular localisation of these factors.
      2. Most of the epistatic experiments focus on factors that rescue the overgrowth and increase of diap1 expression in Lgl mutant. Did the author test if any of these core regulators are sufficient to recapitulate Lgl mutant eye phenotype, for instance Vap33 KD in the eye, or Ar79C overexpression. Negative results would still be informative as they would point to the existence of other downstream regulators of the eye phenotype
      3. It is at the moment hard to interpret the relevance of the results obtained by PLA. While there are some negative controls based on the absence of secondary antibody, what is the number of particle obtained for two non-interacting cortical proteins ? Since this is based on proximity, I would expect that some positive particles would still appear by chance, but much less than for two physically interacting proteins or subunits of a complex. Could the author provide such a negative control by testing for instance Git/RtGEF with another non-interacting cortical protein ? That would help to assess the relevance of the conclusions based on PLA.
      4. Some of the epistatic links are a bit hard to interpret at the moment, and additional epistatic test may be relevant. For instance, the increase of diap1 upon Git depletion in the Vha68 mutant (Figure 6) is used to conclude that Git is required for the Hippo upregulation upon reduced V-ATPase activity. However this could be compatible with two independent branches regulating Hippo (in an opposite manner), which is more less what is suggested by the authors in their conclusion and the model of figure 8. I would suggest to reformulate this conclusion in the result part. Similarly, there is currently no experimental exploration of the epistatic link between Arf68C, Git and RtGEF (which is based on results in mammals). It would be interesting to check if Git and RtGEF mutant phenotype (Hippo downregulation) can also be suppressed by downregulation of Arf79C.
      5. Apart from very obvious phenotype (eye in Lgl mutant mosaic) it is a bit hard to interpret the picture of adult eye provided in this study (specially for mild phenotype). Could the authors provide more explanation in the legends , and if possible some quantitative evaluation of the phenotype when relevant ? Otherwhise, apart from obvious rescue of the Lgl mutant, it is a bit hard to interpret the other genotypes (e.g. : Vap33OE, RtGEF mutant, Vha68 mutant)

      Other minor points:

      1. I would recommend when possible to clearly indicate in Figure 8 which part of the pathway are clearly documented in this study, and which part are still hypothetical (eg: link with PAK).
      2. Page 4, the sentence "as aPKC's association with the Hpo orthologs, MST1/2, and uncoupling MST from the downstreamkinase, LATS (Wts), thereby leading to increased nuclear YAP (Yki) activity [17], consistent with what we observe in Drosophila [5]." may need to be reformulated (at least I had trouble to understand it).
      3. Page 11 : "a decrease in Diap1 expression was observed and clones were smaller than wild-type clones (Fig 7E),suggesting that the Arf79F knockdown clones were being out-competed" I am not sure one can conclude from this that the clone are "outcompeted" (which would suggest at context dependent disappearance of clone, while here the data could be totally compatible with a cell-autonomous decrease of growth and survival). This statement would only make sense if global eye depletion of Ar79F had no adult eye phenotype.

      Significance

      This study identifies regulators of Hippo which through their interactions with Vap33 explains for the first time how Lgl depletion leads to Hippo misregulation (without impairing apico-basal polarity). This is an interesting study based on epistatic analysis and mass-spectrometry and identify several regulators conserved in mammals. While the molecular mechanism remained to be explored, it clarifies for the first time how Lgl depletion ( a core regulator of apico-basal polarity) leads to Hippo downregulation and tissue overgrowth, a phenotype also observed in mammals and characterised several years ago in Drosophila. The authors previously characterised the interaction between Vap33 and Lgl and its role in the regulation of Notch signaling through the V-ATPase. This study nicely complement these previous results and connect now Vap33 with Hippo and Lgl while answering a long unresolved question (how Lgl depletion affect Hippo pathway).

      This results will be interesting for the large community studying the hippo pathway, apico-basal polarity and tissue growth. It also outlines interesting factors that could be relevant for tumour neoplasia and hyperplasia.

      I have expertise in epithelial biology, apoptosis, cell competition, Drosophila, cell extrusion, mechanobiology, morphogenesis and growth regulation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This manuscript investigates potential mechanisms through which the lgl gene might affect the Hippo signaling pathway. The authors employ a combination of physical interaction studies and clonal analysis in Drosophila eye discs to investigate potential links between lgl and other genes. Some of the results are intriguing, but the analysis is rather preliminary, and there are technical concerns with some of the results presented.

      Main issues

      • The authors propose effects of genes involved in vesicle trafficking and acidification in Hippo signaling, but there is no clear cellular mechanism described by which these effects could be mediated. This deserves further consideration. eg if they think there are effects on the localization of Hippo, this could be directly examined. In the Discussion, the authors suggest that "The V-ATPase might therefore act to inhibit Hippo pathway signalling by blocking the interaction of Lgl/Vap33/RtGEF/Git/Arf79F with Hpo in vesicles, thereby altering Hpo localization and inhibiting its activity." but Hippo is a cytoplasmic protein and has never been reported to be within vesicles.
      • The Yki stains in Fig. 1 are confusing. The nature of the signal throughout the wing disc looks very different in 1A vs 1B vs 1C, this needs to be explained or re-examined. Fig 1C (wts RNAi ) seems to show an elevated Yki signal in some cells, and lower in others in - prior studies have reported that wts affects the nuclear vs cytoplasmic localization of Yki, but not its levels, so this needs to be clarified.
      • In Fig 1D the clones appear to have different effects in different regions of the eye disc; the authors should clarify. Also, the disc in 1D appears much younger than the discs in 1A-C, but similar age discs should be used for all comparisons.
      • The authors should clarify whether any the manipulations they perform are associated with Jnk activation, as this could potentially provide an alternative explanation for downregulation of Hippo signaling.
      • The authors report in Fig 2C,E that over-expression of Vap33 reduced expression of Diap1, which they interpret as evidence of increased Hippo pathway activity, but this experiment is lacking essential controls, as the apparent reduction of Diap1 could simply reflect increased cell death or a change in focal plane, and indeed the difference in the label stain makes it look like these cells are undergoing apoptosis. Thus it's important to also have a stain for a neutral protein, or at least a DNA stain. Additionally, it is important to stain for at least one additional marker of Hippo pathway activity (eg ex-lacZ or Yki localization), as there are other pathways that regulate Diap1
      • In Fig. 4 the authors perform PLA experiments to examine potential association between various pairs of proteins, but they don't show us key controls. They report in the text using single antibodies as negative controls, but this doesn't control for non-specific localization of antibodies. The better negative control is to do the PLA experiments in parallel on tissues lacking the protein being detected (eg from animals not expressing the GFP- or RFP-tagged proteins they are examining). Also, there is a lot of variation in the apparent signals shown in different PLA experiments in fig 4, the authors should comment on this.
      • The authors claim that RtGEF mutant cells increase Diap1 expression, and that Vap33 over-expression reverses this effect (Fig. 5). The effect of RtGEF looks very subtle and variable, it should be confirmed by examining additional reporters of Hippo pathway activity. It also seems like the disc in 5A is at a different stage &/or the quantitation is done from a different region as compared to the disc in 5C.
      • The analysis of the influence of Vha68-2 mutant clones, and their genetic interaction with Git, similarly suffers from missing controls and incomplete analysis. Additional Hippo reporters besides just Diap1 should be examined. The Diap1 analysis which shows reduced expression needs examination of neutral controls or nuclear markers to assess potential apoptosis within clones, or changes in focal plane.

      Similarly, the analysis of Arf79F mutant clones in Fig 7E,G is compromised by lack of controls for viability and tissue layer, and analysis of an additional Hippo reporter is once again essential.

      Significance

      The strength of the study is the potential dissection of novel connections between the lgl tumor suppressor and the Hippo pathway. However, there are signifiant limitations due to the preliminary nature of the study, which is incomplete and missing essential controls. If these limitations are overcome the work will be of interest to specialists in the field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript entitled "The Drosophila Tumour Suppressor Lgl and Vap33 activate the Hippo pathway by a dual mechanism, involving RtGEF/Git/Arf79F and inhibition of the V-ATPase." by Portela et al. presents an interesting perspective of the molecular mechanism regulating Hippo pathway, revealing new proteins involved in this process.

      In this study, the authors try to show us that Lgl activates the Hippo pathway via Vap33 either by interacting with RtGEF/Git/Arf79F or by inhibiting V-ATPase, thus controlling epithelial tissue growth. The methodology used by the authors is adequate but could benefit from further experiments that would allow them to reach the conclusions stated in their research.

      Thus, based on the interpretation of the results presented by the authors some concerns were raised that should be addressed during the review process and that are explained in the major comments.

      Major comments:

      • It is not clear why in "The Hippo signaling pathway is negatively regulated by V-ATPase activity in Drosophila" section, the authors use Vha68-2 RNAi to reduce the activity of V-ATPase and later they use the overexpression of Vha44 to activate V-ATPase. The authors should explain why they used different proteins to regulate V-ATPase. The way the authors wrote their results sounds like different Vha proteins regulate V-ATPase, which means that cells may have different ways to activate V-ATPases, not being clear if regardless that the downstream effect of V-ATPase activation is always reflected in the Hippo pathway. Thus, the authors should state what other Vha proteins may have a similar effect, I would like to see evidence that Vha44 and Vha68 knockdown and overexpression leads to similar results.
      • In "Vap33 activates the Hippo pathway" section, the authors' conclusions represent a big statement considering the results obtained. Though Diap1 is a Hippo pathway target, it does not mean that this protein is solely regulated by this pathway. For example, there are studies that show that this gene can also be transcribed by STAT activity. Though in the following section the authors show how Vap33 activates this pathway, the results obtained in the section "Vap33 activates the Hippo pathway" are not enough to make this assumption. We suggest that the authors rephrase this section. (Optional: To maintain this statement, the authors should have performed, for example, a luciferase assay containing specifically Hippo pathway binding sites in the Diap1 gene, showing that the transcription factor of the Hippo pathway is somehow regulated by Vap33).
      • The authors present a highly speculative discussion, raising different hypotheses. Though such hypotheses are well supported by the literature, the authors would enrich the quality of their research if indeed they could prove them. Particularly, testing for vesicle acidification, testing if V-ATPase indeed blocks the interaction of Lgl/Vap33/RtGEF/Git/Arf79F, and alters Hpo localization, testing if Git/RtGEF inhibits Arf79F and consequent Hpo localization.
      • The authors should also apply more specific techniques to infer how the Hippo pathway is affected by such genetic manipulation since diap1 can be a target gene of different pathways.

      Minor comments:

      • The authors present a well-structured manuscript, that generally is easy to understand. However, at some points, the statements given by the authors seem highly speculative.
      • The figures presented in this manuscript and the statistical analysis seem adequate and are clearly described.

      Significance

      The study presented by Portela et al. gives new insights into the regulation of the Hippo pathway with the discovery of new proteins involved in this mechanism, which can be interesting to those working on basic research and focused on studying signal transduction. However, this study lacks some novelty. Throughout the manuscript, the authors only observed the physiological consequences of manipulating this pathway based on the eye phenotypes, and in the discussion, many hypotheses were raised based on the already available literature, which shows that much is already known about the Hippo pathway.

      The advances shown in this study are limited to the description of the signaling pathway itself and to the eye morphology. As a suggestion, the authors should explore the knowledge of their findings in order to understand how we can use them to achieve advances in other fields and physiological conditions. For example, only at the end of the discussion, did the authors raise the questions that would really push their discoveries a step forward, namely how this mechanism acts during the response to tissue wounding and whether the mammalian orthologs of Lgl and Vap33 also act via these mechanisms to control tissue growth in mammals. It would be interesting if the authors could direct their research efforts to understand if the proteins identified can be targeted to improve wound healing or to delay aging for example.

      Altogether, the authors present an interesting study but, at this moment, it still lacks the significance and novelty needed for publication. We encourage the authors to keep up their good work to address these suggestions, which will definitely improve the quality of their study.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In their manuscript, Tetzlaff et al. report a substantially improved protocol for the isolation of mitochondria from the parasitic apicomplexan Toxoplasma gondii, which allowed improved sequencing and in-depth analyses of the organism's peculiarly complex mitochondrial genome. Follow-up small RNA-sequencing made it then possible to confirm the expression of fragmented mitochondrial ribosomal RNAs (mt-rRNAs) and to identify a dozen new RNA species of unknown function. The authors document not only multiple Toxoplasma mitochondrial genes that overlap one another-including rRNA and protein-coding genes, otherwise a rare occurrence-but also show that some fragmented rRNA genes recombine, effectively leading to multifunctional sequence segments, another rare feature and consequence of the peculiar architecture of the organism's mitochondrial genome. Lastly, the authors confirm that products of three genes presumed to encode pieces of the highly fragmented mitochondrial large subunit (mtLSU) rRNA do indeed assemble-presumably with additional components-into large molecular-weight complex(es).

      Major comments

      Key conclusions of the manuscript are that Toxoplasma's mitogenome encodes overlapping rRNA and protein-coding genes, divergent and chimeric rRNA pieces, and several small RNAs (sRNAs) of unknown function. Provided evidence is very solid for certain aspects of the study, but objectionable for the others as detailed below.

      1. The extent of the presented analysis of rRNAs and unassigned sRNAs seems lacking. In several places of the manuscript, the authors wonder about potential implications of divergent rRNA sequences, but their analyses appear to have been limited to sequence similarity searches. Had modelling of secondary structure interactions been attempted, this conundrum could potentially be solved. Importantly, similarity searches (to conventional rRNAs) were performed using BLASTN, which is a rather crude tool for the purpose, instead of covariance models/HMMs. It is therefore not entirely surprising that some sRNAs remained unassigned. Admittedly, recognizing rRNA motifs in divergent RNAs is a challenging issue. However, it is important to not conflate similarity to conventional rRNA and the molecule's functionality as an rRNA, i.e., sequence divergence does not necessarily disqualify the unassigned sRNAs as potential rRNAs. Mitochondrial rRNA sequences are among the most divergent, often constrained only by base-pairing, if at all, as has shown the research on kinetoplastid and diplonemid mt-rRNAs, which contain very few conserved elements and very few base pairs (e.g., Ramrath,2018,Science & Valach,2023,NAR). Even in generally less divergent cases such as green algae, the fragment encoding a highly divergent and derived 5S-like rRNA has only been recognized as such only after the mitoribosome structures were determined (Waltz,2021,Nature Comm & Tobiasson,2022,Nature Comm). It would not be surprising if the same was the case for Toxoplasma's fairly quickly evolving mitochondrial genome.
      2. The discovery of overlapping protein-coding and rRNA genes is intriguing, but the authors do not explain why it should be considered as fundamentally groundbreaking as the 'Abstract' and 'Discussion' make it sound. Gene overlaps are found in mitochondria of many organisms (e.g., fungi, animals, various protists), especially of tRNA and protein-coding genes. Even in Plasmodium, a rather close relative of Toxoplasma studied in the presented work, LSUB (rRNA) gene overlaps cob (protein) gene in the antisense orientation. Admittedly, the extent of the overlaps in Toxoplasma does seem fairly high at a first glance, but it is necessary to provide more data and, importantly, broader context to make the case that Toxoplasma overlaps are in any way special. For instance, what is the average size of the overlaps? What is their cumulative size? How does their extent (i.e., the size of overlapping coding sequences compared to the total length of coding sequences) compare to gene overlaps in other (mitochondrial) genomes? Certain additional aspects of the analysis and interpretation of protein- and/or rRNA-coding sequence overlaps are somewhat underdeveloped. For example, are the RNA-coding regions that overlap protein-coding sequences more divergent in those three conserved proteins compared to other organisms, i.e., does their function as rRNA take precedence, or is the converse the truth, i.e., are the rRNA sections more divergent? RNA19 (overlapping coxIII and cob) is the only example discussed in depth, but at least a short sentence summarizing the overall picture would be useful. As for the authors' interpretations, proposed formation of sRNA:mRNA hybrids, through which sRNAs could by implicated in facilitating mRNA recognition by the mitoribosome, is an interesting hypothesis, but a simpler scenario, which is given very little space, is that the genes happen to overlap by chance and that the overlaps are merely a consequence of genome compaction (a phenomenon that the authors rightly highlight). Without a comprehensive analysis, it is impossible to conclude which possibility is more likely. For instance, if both protein-coding and non-protein-coding sequences are divergent, this would indicate that there are few evolutionary constraints and so the fact that these sequences overlap means very little and might be just due to neutral drift, an effect of genome compaction without much consequence for the organism. Lastly, considerable significance is attributed in the study to the presence of antisense overlaps, especially between rRNA- (or sRNA-) and protein-coding genes. Yet, the overall extent of sense and antisense overlaps in the Toxoplasma mitogenome is quite similar, which-again-seems to point to a neutral evolutionary process. Can the authors elaborate if this aspect of the genome architecture was taken into account and if they regard it as of lesser relevance (and why, if so)?
      3. Another controversial issue concerns prevalent sequence block combinations and their impact on mitochondrial gene expression regulation. The authors postulate that 5′-terminal blocks of protein-coding genes always occurring near other protein-coding blocks has some functional significance. However, concluding this from just two cases (even if out of two) is quite speculative and seems like reading too much into a pattern that could very well be due to chance alone. The authors argue that the fact that 5′ ends of coxI & coxIII genes overlap is another indication of potential gene expression coordination. While it is possible to envisage such a regulation because of the 5′ termini proximity, the overlap between these genes means that their connection is hardwired into the genome, making it difficult to compare this particular case to the other sequence blocks. Arguably, it is tempting to speculate that an evolutionary pressure exists to coordinate protein expression and such a coordination does not indeed seem implausible, but the presented data and arguments are not convincing. The authors should at least expand on their ideas in the 'Discussion' and indicate potential experiments and/or which additional data could support (or refute) their speculation.
      4. My last major point concerns the experimental examination of large-molecular weight complexes and the interpretation of its results. To prove incorporation of the sRNAs into the mitoribosome, i.e., confirm that they do indeed represent rRNAs, the authors opted to investigate their distribution across a sucrose velocity gradient. This is a relatively simple and powerful approach and although it does not provide an irrevocable proof, it can be used to gain very useful insights. However, the presented design has critical flaws: 1) all sRNAs selected for Northern blot were mtLSU components, so only the mtLSU would be detected; 2) a single cytosolic LSU component was used as the control, so the distribution of cyto-SSU subunit, cyto-ribosome, and cyto-polysomes is actually unclear; 3) the authors' interpretation relies on the assumption that both mitochondrial and cytosolic ribosomes preserve their association as polysomes, but no relevant control is provided for this. For example, in Figure 6, fractions 6-14 clearly contain cyto-LSU, but polysomes (e.g., disomes) might just as well start in fractions 12-14; without additional controls, or at least continuous monitoring of UV absorbance across the gradient (to show a typical polysomal pattern), it is not guaranteed that what was detected actually included cyto-polysomes. The main concern, however, is the migration of mitoribosomes. First, the authors presume that the fractions 7-8 contain the mitochondrial monosomes because they are the fractions closest to the gradient top. This is not guaranteed. In fact, based on the experience of our and our colleagues' labs and taking into consideration the conditions used for the described experiment (more precisely, the use of Triton and deoxycholate, which in many organisms lead to mitoribosome subunit dissociation), it seems quite likely that fractions 7-9 actually contain separated mtLSU, not monosomes. Fractions in higher sucrose concentration would then represent monosomes and possibly assembly intermediates, though perhaps also a minor polysomal fraction (if the interactions are preserved in the conditions used). In particular, if the assembly process in Apicomplexa is as complex as in Euglenozoa (e.g., see papers on kinetoplastid mitoribosomes Saurer,2019,Science & Tobiasson,2021,EMBO Journal), which does not seem unlikely in Toxoplasma given the necessity to incorporate ~15 distinct rRNA pieces per mitoribosomal subunit, then the assembly intermediates might form ribonucleoprotein complexes that migrate quite far into a sucrose gradient (e.g., as in kinetoplastid mtSSU, Maslov,2007,Mol Biol Parasit). Thus, while it can be reasonably well argued that the detected RNAs co-migrate with the mtLSU (and possibly mito-monosome), the claim that they associate with mito-polysomes is open to question. More critically, investigating only sRNAs that are clearly identifiable as rRNA pieces-and all from the mtLSU at that-does not automatically prove that all sRNAs associate with the mitoribosome. To argue that the unassigned sRNAs are associated with mitoribosomes, northern blots of as many as possible (but at the very least one) unassigned sRNAs are absolutely necessary. However, I encourage the authors to consider performing additional experiments to address the issues raised in the preceding paragraph: for example, a western blot of mitochondrial ribosomal protein(s), a northern blot with at least one mtSSU rRNA fragment (since all three shown are from mtLSU), as well as a test that would examine the influence of detergents on mitoribosome stability (e.g., use milder detergents such as digitonin or dodecylmaltoside). Furthermore, if experimental conditions are identified allowing subunit dissociation, it would be possible to discern to which subunit which sRNA belongs and, importantly, whether the unassigned sRNAs are just "disguised" rRNAs (simplest explanation) or something completely different (speculative explanation seemingly favoured by the authors). All this would substantially boost the significance of the presented work.

      Minor comments

      General comments

      The word "novel" is rather overused in the manuscript. At several places, it is inappropriate, as the presented results are not as unprecedented as the manuscript makes them sound; at other places, it might be acceptable, but as the word's meaning is vague, the text would benefit from using more informative term(s) instead. The former case is exemplified by the sentence at the lane 102 "Here, we present a novel method for enriching organellar nucleic acids" - "novel" does not simply mean "new", but alludes to "unprecedented"; yet, the devised method, albeit clever, is a modification of existing approaches. The sentence at the lane 182 illustrates the latter case where "novel blocks" are mentioned, but "previously not detected blocks" would be more appropriate and to the point. The labelling of 5′ and 3′ is inconsistent throughout the manuscript - sometimes the prime is used, sometimes the apostrophe, sometimes it is the single quotation mark.

      Abstract

      In light of the raised concerns, the authors should consider carefully rewording this section, as some of the formulations are mis-representing the data and lead to unjustified generalizations.

      Introduction

      lanes 72-73: "How rRNA fragments are assembled into functional ribosomes remains an enigma." - Without proper context, this statement feels like an exaggeration. Fragmented rRNAs are known from other species and their mitoribosome structures were determined in the past few years (i.e., Tetrahymena, Polytomella, Chlamydomonas). Arguably, these mt-rRNAs are not as fragmented as in Toxoplasma, but at the very least, it is clear that base-pairing of rRNA pieces and RNA-binding proteins play significant roles in the process. If the authors think that this is not the case in apicomplexans, this should be at least alluded to, if not explained. l. 80-83: The paragraph mixes information on Plasmodium and Toxoplasma. To a non-initiated reader, this can be quite confusing. It would be useful to specify which species the authors refer to. l. 83-86: The information on the atovaquone impact lacks reference(s). l. 105: "demonstrated that they are incorporated into polysomes" - In light of the issues raised above and if the authors opt not to expand the work as suggested above, this claim (and similar throughout the text) should be emended. l. 106-108: "allowed us to identify novel transcripts, many of which originate from block boundaries and contain mixed origins from coding and noncoding regions." - This sentence would benefit from rephrasing because it is difficult to comprehend (the sequences overlap protein-coding and non-protein-coding regions, but do not contain any origins).

      Results

      l. 115-117: "cell fractionation method that takes advantage of the differential cholesterol content in plasma membranes" - Does Toxoplasma contain cholesterol? Perhaps it might be more practical to refer to sterols (since the effect of digitonin is not limited to cholesterol). l. 147: "significant increase" - It might be useful to specify that the increase was ~42-fold, so that readers can see the extent of improvement; it has the advantage of really highlighting the achievement. l. 180: "have been lettered from A-V" - Rewording to "designated by letters from A to V" works better. l. 213-218: This section is essentially a discussion so should be moved the corresponding section of the manuscript. l. 262-265: cotranscripts/transcript isoforms - It is a matter of nomenclature, but it seems more appropriate to refer to "a transcript containing LSUF and LSUG regions" instead of a co-transcript, because in the latter case, one then expects that these two will be separated in a following processing step, which-as the authors demonstrate-is clearly not the case for the vast majority of the population of these rRNA pieces. Given the prevalence of the larger pieces, it seems more appropriate to refer to the "smaller transcript isoforms" as possible degradation products and not isoforms, which implies some kind of functional relevance. l. 281: In the section "Discovery of novel rRNA fragments", it might be useful to provide a graphical representation or at least a sentence summarizing all different categories of sRNAs. For instance, what is missing from the text is that there are 11 species for which homologous sequences in "conventional" rRNAs were not identified and out of these only 4 seem to have sequence homologs in other Apicomplexa. In addition, in Table S5, the authors could indicate where these homologs are located in Plasmodium, since these appear to be newly identified candidates for Plasmodium sRNA species/rRNA pieces. l. 313-314: "In general, block combinations lead to the expression of novel RNAs in T. gondii that are not found in apicomplexan species with a simpler genome organization. " - It is not clear where this generalization comes from: Fig.S5A shows that RNA5, RNA7, RNA23t extend across block borders (but based on Table S5 are not unique to Toxoplasma), while only RNA31 and RNA34 are both absent from other Apicomplexa and extend across block borders - yet, this is still less than half of all newly identified sRNAs. In addition, the novelty claim is not clear either: based on the presented data, several sRNAs that overlap are clearly present in other apicomplexans (e.g., RNA1 and RNA2) and thus are not completely new, but merely more divergent in Toxoplasma, because parts of their sequence have been replaced by the shared sequence segment. l. 319-320: "None of the three RNAs had detectable homologies to rRNA." - Specify to which rRNAs were the sequences compared to make the inference. l. 320-321: "For all five coding-noncoding RNAs, homologs are present in the mitochondrial genome of P. falciparum." - Does this mean that they remain unassigned in Plasmodium as well or that they have not been previously recognized in Plasmodium? Confusingly, RNA34 is labeled as not having homologs in Apicomplexa in Table S5. In addition, mentioning "coding-noncoding RNAs" is somewhat misleading because some of the sRNAs clearly code for mt-rRNA pieces. l. 335-338: This section contains contradictory statements that should be reformulated. A couple of sentences prior, the authors experimentally determined that RNA19 actually overlaps only a single protein-coding sequence (coxI), but then refer to the original and demonstrably incorrect annotation of RNA19 overlapping also the cob gene. l. 341: The authors mention similarity to rRNA, but do not specify which rRNA. Referring to similarity to known or conserved rRNA sequences or segments would work better. Still, the region of the block S (i.e., 5′ proximal segment of RNA19) falls into the region between helices H51 and H60 of the domain III in the LSU secondary structure, which is sequence-wise relatively poorly conserved-especially in mitochondrial rRNAs-so sequence divergence is not unexpected. l. 366: "Note that RNA1 and RNA2 are registered according to their shared sequence" - Unclear what "registered" means here. l. 416-421: Specifying when reference is made to cytosolic vs. mitochondrial monosomes and polysomes would make this section and the related parts of the 'Discussion' clearer. Also, the authors clearly state here that there might be technical reasons for what they observed, but ignore this possibility in the 'Discussion' and assume that they did indeed separate polysomes.

      Discussion

      l. 444: "the reshuffling appears limited to specific block borders and is not random" - How many biological replicates of nanopore sequencing were performed? Did the authors test other T. gondii strains? What about other apicomplexan species? Unless this has been done, there is no demonstration that the block order and block-joining frequencies documented here are (more or less) constant and that block order is under some kind of purifying selection. Hence, the conclusion that the block borders are not random is debatable. Arguably, it is not random in this particular experiment, but neither is it limited to specific blocks because most combinations have been detected (even if at low frequency; Figure S1). l. 450: "One intriguing finding is the obligate linkage of coding sequences" - Presuming this sentence is about protein-coding sequences, this should be reformulated because it mis-represents the actual data. Figure 2 clearly shows that protein-coding blocks are often linked to rRNA-coding blocks. l. 454: "balancing the expression of coxI and coxIII" - Not clear where this information comes from, as it is not from the cited papers. l. 460-461: "Our small RNA sequencing results revealed another potential advantage of the block organization of the T. gondii mitochondrial genome" - This should be reformulated. Clearly, the discovery of the 15 sRNAs was facilitated by the recognition of block order, but the presented argument is a bit confusing: how does the organization into blocks provide an "advantage" and what kind of advantage do the authors mean? (An evolutionary advantage or an advantage related to gene expression regulation or an advantage for their sRNA-Seq data mapping?) l. 462-478: Multiple explanations are provided for the existence of sRNAs at block borders and what these sRNAs represent. While I agree that it is important to consider all options, even the more debatable ones, the authors seem to forget the simplest possibility: the identified unassigned sRNAs could well be rRNA pieces and them being encoded across block borders is not any more, nor any less surprising than the fact that protein-coding genes are encoded across (several) gene blocks. l. 485: "antisense RNA surveillance" - In contrast to the nuclei, the existence of a genuine antisense RNA "surveillance" mechanism in mitochondria is uncertain. Given what is known from mitochondria of other organisms (especially plants and kinetoplastids), it seems more likely that certain regions of sense and antisense transcripts are protected from exonucleases by RNA-binding proteins (RBPs such as PPR and related helix-turn-helix repeat proteins, e.g., Toxoplasma's homologs HPRs discovered in Plasmodium [Hillebrand,2018,NAR]), leading to RNAs that partially overlap, but are actually protected from base-pairing by these RBPs. This is not taken into account in any presented explanation of the phenomenon of antisense gene overlaps. l. 490: "start codon. while also " - Typo: should be a comma, not a dot. l. 500: "discovery of block-border sRNAs highlights the complex regulatory mechanisms at play" - This should be reformulated: the claim is very speculative, since no hard data are provided on such regulatory mechanisms in the presented work. l. 504: "sRNAs are incorporated into polysome-size structures" - In light of the concerns raised in the preceding section, this should be reformulated. l. 539-540: The closing sentence should be reformulated. The mitogenome organization in blocks per se does not "allow" the sequences to function as both mRNA and rRNA. Rather, it seems to be a combination of 1) the compactness of the genome that seems to lead to the re-use of certain segments in both mRNA and rRNA or in two distinct rRNAs, and 2) the apparently dynamic nature of the genome (due to recombination among gene blocks) that brings together certain combinations of gene blocks.

      Methods

      l. 607: Only agarose gel separation is mentioned, but most experiments shown are of denaturing PAGE separations (which is actually mentioned in several figure legends). l. 636: "Paste your materials and methods section here." - To be removed. l. 662: "NUMTS" - This should be "NUMTs"; the same typo occurs at multiple places in the 'Methods' section. l. 704: "Homology search for novel transcript annotation" - Somewhat confusing title; it is possible to guess what the authors likely mean, but it is unclear. l. 715: "New block annotations can be found in GenBank." - 1) The whole community would very likely appreciate if the GenBank entries were properly annotated (i.e., genes added), not just showed sequences as is currently the case for all Namasivayam,2021,Genome Res entries (not sure about the authors' own entries because they were inaccessible). If impossible to update the entries of the Namasivayam,2021,Genome Res study, then just submitting anew properly annotated GenBank entries would be appropriate. 2) It was not possible to properly assess some of the claims in the manuscript because access to the files was not provided to reviewers, nor have been the newly submitted GenBank entries made public by the authors.

      Figures

      Figure 1B - The load of total proteins into each well is unclear. Ponceau stain does not show identical loads, so it is unclear what the reader should take as the reference. Figure 1D -The phrasing "fragments found in the pellet fractions of the protocol" is a bit awkward. The fragments are in the pellet fractions after plasma membrane permeabilization and benzonase incubation, not in the "fractions of the protocol". Figure 2 - The chosen hues of red and green (for coxI and coxIII) are of such similar intensity that they are virtually indistinguishable to ~2% of the readers. A colourblind-friendly palette would be very much appreciated. For guidelines, see for example: https://www.nature.com/articles/nmeth.1618 . Figure 3 - The use of lowercase letters to indicate the probes (instead of the full probe names) is a nice idea and simplifies the reading experience, but the use of the same letter 'a' in different figures for different probes is confusing. Labeling each probe with a unique ID/letter and indicating this ID in the Table S6 (e.g., by adding an additional column) would work much better. Figure 4A - The wiggle lines for rRNAs are coloured in purple shades, which contrast with the grey colour that is assigned to them in the Figure 2. Keeping a consistent colour palette across figures would be preferable. Figure 4C - If the E.coli sequence was on the outer lines, the Toxoplasma sequences could be closer to one another, which would make it easier for the reader to understand the alignment. Figure 5 - Purple shades for rRNA are somewhat difficult to discern from the blue cob. Also, the 'reference' wiggles would work better if demarcated as a key because this would make it visually clearer that they are shared by the A and B panels.

      Supplementary Information

      Figure S1 - An explanation what the A and B panels show is missing. Figure S5 - It is difficult to appreciate the extent of overlaps with protein-coding sequences if these are missing from the figure (unlike in Fig.5). Table S4 - Nuclear genome accession number is missing. Add "mitochondrial" to the label of the column "sequence blocks". Table S5 - 1) It is unclear what the 'rRNA homology' refers to. (It does not seem to be the nomenclature used by Feagin et al.,2012, PLoS One.) 2) An extension of the table (or perhaps a separate table) with the cumulative size of mtLSU and mtSSU rRNA pieces, as well as unassigned sRNAs, would be useful. 3) It should also be stated somewhere if homologs of any of the rRNA pieces known from Plasmodium are missing in Toxoplasma. (If so, they could be among the newly identified short RNAs.)

      Referees cross-commenting

      Referee #2 rightly pointed out that basic statistics on nanopore reads, as well as omitted methodological details (e.g., minimap2 and SAMtools settings) would be welcome. Similarly, Figure 2 should indicate the upstream/downstream block orientation. If the authors intend to position their work as a major achievement in mitochondrial enrichment for Toxoplasma (as the text currently indicates), I also agree that a comparison with previously published protocols would not be out of place.

      Significance

      Speaking from personal experience, devising a protocol for such a substantial mitochondrial enrichment, as the study presents, is a great technical achievement, which cannot be understated, especially for a protist or any somewhat unconventional model organism. The mitoribosomal community will certainly take notice of the improved catalogue of mitochondrial rRNA pieces, while the discovery of overlapping protein-coding and rRNA genes will be of interest to those working in the field of mitochondrial evolutionary biology. The study already provides a significant upgrade from the previous attempts to understand the nature of the mitochondrial genome in Toxoplasma (and in Apicomplexa in general), and is well positioned to become a source of inspiration for future studies in the field. However, being at a crossroad of genomics, evolution, and molecular biology, it has certain limitations in its current form, mainly because the evolutionary and molecular biology aspects would benefit from further development (see 'Major concerns'). The text is generally well written and accompanying figures well designed, but clarifications, broader context, and less speculative interpretation would be welcome (as detailed mostly in 'Minor concerns'). To justify publication in a journal with a broad readership, the authors should provide additional experimental evidence to strengthen their case and generalize their findings.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this article, the authors delve into an intriguing topic, aiming to enhance our understanding of the organization of the mitochondrial genome of T. gondii, a parasite of significant importance in both human and animal health contexts.

      In essence, their approach involves enriching mitochondrial material, followed by genome sequencing and the analysis of mitochondrial short RNAs. They achieve a remarkable depth of mitochondrial sequencing and generate valuable RNA data. Furthermore, their efforts lead to the discovery and annotation of new short RNAs.

      Overall, the article is well-crafted and presents compelling results. However, it's worth noting that, at times, the authors appear somewhat self-congratulatory, and certain results might be perceived as overly ambitious. Nevertheless, the discussion is aptly constructed.

      Major comment:

      They assert certain discoveries that had already been reported. Notably, they adapt an existing protocol for mitochondrial enrichment and describe it as 'We developed a protocol to enrich T. gondii mitochondria.' It's worth noting that they neither reference a more recently described protocol (PMC6851545) nor compare the performance of their modified protocol with the original.

      The protocol they employ does not seem to yield exceptionally high success rates, as mitochondrial DNA constitutes less than 10% of the total sequenced DNA.

      Additionally, they frequently mention the identification of specific combinations of sequence blocks previously identified by Namasivayam et al. (PMC8092004), which was also discussed in Namasivayam et al. 2021."

      Missing in the supplementary material are basic details on the sequences performed. Distribution of mitochondrial reads length, depth, etc.

      Further clarification is needed for Figure 2. Specifically, the frequency units or combinations of frequency (A, B, and C) are not clearly explained. While the matrix's asymmetry suggests a 5'- 3' orientation difference, this orientation difference is not explicitly specified (B). Additionally, the fragment Mp does not appear in the block combination figure (C).

      Some points to improve the introduction:

      Provide an evolutionary context for the following phrase: 'An idiosyncratic feature of Apicomplexa is a highly derived mitochondrial genome.' Specify what you intend to emphasize.

      Line56: The sentence must begin with a capital letter

      In line 58 "Nuclear genes encoding proteins with functions in mitochondria contribute strongly to P. falciparum and T. gondii cell fitness" Although it is mentioned later, it would be more effective to introduce the fact that all but three genes are encoded in the nucleus.

      Line68: "Apicomplexan mitogenomes usually code only for three proteins" It seems to me that 'usually' should not be included.

      Line 65-67: The sentence should include that the mitochondrial genome is composed of a total of 20 blocks of repeating sequences organized in multiple DNA molecules of varying length and non-random combinations

      At the end of the introduction, the authors state that they have developed a protocol for mitochondrial enrichment. The text should be modified taking into account that: 1- The new protocol is an adaptation of another existing protocol. In fact, the Methods the authors say the protocol was "slightly" modified. 2- There is already existing mitochondrial enrichment protocol available [Reference: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6851545/#mmi14357-bib-0074]. In any case, they should consider performing a comparative analysis between the proposed protocol and existing ones to determine its relative effectiveness. It should be noted that the proposed protocol enriches in organelles (including the nucleus and apicoplast), but when sequencing DNA, mitochondrial DNA accounts for only 5% of the total reads, which may raise doubts about its overall efficacy.

      Some points related to Results section:

      Lines 113-115: 'To distinguish between NUMTs (nuclear DNA sequences that originated from mitochondria) and true mitochondrial sequences, it is necessary to enrich mitochondrial DNA.' I disagree with this sentence. NUMTs, in general, consist of very short sequences. With long reads, it is relatively straightforward to differentiate mitochondrial sequences from those nuclear sequences that have small mitochondrial fractions. In my opinion, even many Illumina reads can be confidently identified as belonging solely to the mitochondria. I found this article that supports this argument, indicating that the majority of NUMTs are less than 100 nucleotides in length [Reference: https://pubmed.ncbi.nlm.nih.gov/37293002/].

      Lines 166-168: 'A previous sequencing study used Oxford Nanopore sequencing technology (ONT) to identify combinations of sequence blocks in T. gondii mitochondria (Namasivayam et al. 2021).' However, it's important to note that Namasivayam's group did not merely use ONT to identify combinations of blocks; rather, they discovered, identified, and defined these combinations based on sequencing with long reads.

      Line 177: "The length of mitochondrial reads ranged from 87 nt to 17,424 nt" It would be beneficial to include a histogram depicting the length distribution of the obtained reads. It's worth noting that nanopore reads tend to be shorter than Illumina reads

      Line 194-195 "we found that only a small fraction of all possible block combinations are prevalent within the genome" this has been previously described (PMC8092004)

      Line 201. "This indicates that the genome's flexibility is limited and that not all block combinations are realized". This is consistent with the findings published by Namasivayam et al. in 2021, which have already established that the combination of the 21 blocks is non-random.

      Line 205: "All combinations are well covered in our ONT results and helped to refine block borders relative to previous annotations (Fig. S2)" In the supplementary materials the authors say: "However, the blocks Fp, Kp, and Mp frequently occur separately in the mitochondrial genome We therefore treated Fp, Kp and Mp as separate blocks and have shortened the blocks F, K and M accordingly". As far as I understand, for this very reason, Namasivayam and collaborators annotate them as partial fragments, which may appear in other regions but are, in turn, parts of larger F, K, and M fragments. To redefine the segments F, K, and M without the sequences corresponding to Fp, Kp, and Mp, as shown in Figure S2, these fragments should be distinct from the 'partials.' In other words, segments of the type (F minus Fp), (K minus Kp), and (M minus Mp) should appear in the reads, and should be distinguishable from Fp, Kp, and Mp. If this distinction is made, I am satisfied with the new definition.However, if such a separation is not evident, it seems important to clarify it in the text or to reconsider this new definition.

      Lines 221-223: "This suggests that there is no need to postulate mechanisms of genomic or posttranscriptional block shuffling to arrive at full-length open reading frames." The authors argue that invoking mechanisms of genomic or post-transcriptional block shuffling is unnecessary to explain the presence of full-length open reading frames, given that genes represent 2-3% of mitochondrial sequences. However, there is a missing estimate regarding the probability of encountering all three genes within a single molecule or mitochondrial genome, as well as the total number of sequenced mitochondria. Consequently, the statement appears overly assertive. In the absence of alternative mechanisms for generating complete genes, this would mean that at most only 1646 mitochondrial genomes would have been sequenced. To comprehensively address this issue, the authors should consider discussing this scenario further. They should also provide information about how many reads they found containing all three genes and how many contained two of the genes.

      Lines 249-250 "using the block combinations identified here by ONT sequencing " which is the difference between blocks identified here with those on Namasivayam ? The division of M, K and F fragments?

      Line 287: "The six remaining small RNA fragments are specific to T. gondii" I would suggest being more cautious in this sentence by stating that they were not found in other organisms. Given the similarity of the mitochondrial genome between T. gondii, N. caninum, and other coccidians, it would be expected to find them in these organisms as well.

      Line 300 "Among the novel small RNAs identified, there is also a class that was only detectable due to our insights into genome block combinations." A valid strategy is to map the small RNAs to the generated nanopore reads or to an assembly made with these reads, rather than solely relying on the single blocks or combinations of blocks, as this approach would yield the same result.

      Line 444: "Upon closer scrutiny, however, the reshuffling appears limited to specific block borders and is not random" This was already established by Namasivayam et al 2021.

      I would like to highlight the potential for a more comprehensive examination of the mitochondrial genome in the discussion. While the proposed explanations for the presence of sRNAs at the 'block borders' appear plausible, it's worth noting that the definition of these blocks is artificial rather than biological. I think it is interesting to discuss without the concept of block sequences, but of sequences existing in the mitochondrial genome. Therefore, it's important to discuss whether these sequences (the block borders) are consistently present in all mitochondrial genomes. The total cumulative length of the blocks is 5.9 Kb, which is relatively small and comparable to one of the smallest mitochondrial genomes on record. It is conceivable that recombination and the generation of new sequences play a role in expanding genomic space for encoding, such as RNAs.

      Line 535-536 "We developed a protocol to enrich T. gondii mitochondria and used Nanopore sequencing to comprehensively map the genome with its repeated sequence blocks." I find this sentence to be somewhat assertive, especially considering that they modified an existing protocol and obtained results that may not be optimal. Additionally, they have not compared their protocol with other available methods for mitochondrial enrichment.

      Some points related to Method section: In none of the experiments is it specified how many parasites were initially used as a starting point

      "Masking NUMTs in the T. gondii nuclear genome" it's unclear whether the authors utilize all hits or filter the results of BLASTN. It would be helpful if they specify the criteria for filtering, such as identity percentage or query coverage. Additionally, it's not clear how they generate the GFF3 file from the BLAST results, or whether they instead create a BED file. Providing clarification on this process would enhance the reproducibility of their methods. Moreover, it would be beneficial if the authors include information regarding the number of sequences they intend to mask, the average length of the NUMTs, and the total percentage of the genome these masked sequences represent.

      Line 657 "Mapping results were filtered using SAMtools"<br /> The text does not specify the filtering criteria or the parameters used for this process.

      Line 673 establish "No matching reads were found" in the "Sequence comparisons of ONT reads found here with published ONT reads for the T. gondii mitochondrial genome" but in the results the authors say: "While smaller reads of our dataset are found in full within longer reads in the published datasets, we do not find any examples for reads that would be full matches between the dataset. Could you provide a more detailed explanation? Specifically, I would like to know how many reads from the dataset (including their length) are also present in other datasets, and at what minimum length do they cease to coincide?

      689 - The text does not specify the filtering criteria or the parameters used for Samtools filtering process.

      Lines 689-693 Please describe better the methodology used.

      Line 696: the program is fastp not fastq (Chen et al. 2018)

      Line 697: what do you mean only the ends of the reads were mapped? how many bases? Or do they mean that they map the reads fowrards and reverse reads?

      Significance

      In this article, the authors delve into an intriguing topic, aiming to enhance our understanding of the organization of the mitochondrial genome of T. gondii, a parasite of significant importance in both human and animal health contexts.

      In essence, their approach involves enriching mitochondrial material, followed by genome sequencing and the analysis of mitochondrial short RNAs. They achieve a remarkable depth of mitochondrial sequencing and generate valuable RNA data. Furthermore, their efforts lead to the discovery and annotation of new short RNAs.

      Overall, the article is well-crafted and presents compelling results. However, it's worth noting that, at times, the authors appear somewhat self-congratulatory, and certain results might be perceived as overly ambitious. Nevertheless, the discussion is aptly constructed.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Mitochondrial genomes of Apicomplexa parasites have undergone dramatic reductions during their evolution with genes for only three proteins remaining. In addition, ribosomal RNA genes are present in different, often species-specific gene arrangements. Toxoplasma exhibits massive variations in gene arrangement that are distributed over multiple copies. In this study the Schmitz-Linneweber lab not only re-analysed the mitochondrial genome of Toxoplasma gondii using a novel protocol for enriching the organellar nucleic acid, allowing to sequence the mitochondrial genome at unprecedented depth, they also addressed an enigma regarding the expression status of mitochondrial ribosomes. While indirect evidence of mitochondrial translation exists, no direct evidence for active mitoribosomes exist and their composition is still poorly understood. Here, using HTS or small RNAs the authors demonstrate that they are incorporated into polysomes. Furthermore, the authors developed the hypothesis that the block-based genome organization enables the dual utilization of mitochondrial sequences as both messenger RNAs and ribosomal RNAs.

      Own opinion/Major comments

      The mitochondria of the Apicomplexa are characterized by massive gene transfer into the cell nucleus, and sequence rearrangements, which has led to a single, questioned genome reorganization. The underlying mechanisms of gene transcription and translation are also poorly understood. In a previous study, the Kissinger lab demonstrate the unique organization of the mitochondrial genome that consists of minimally of 21 sequence blocks (SBs) totaling 5.9 kb that exist as nonrandom concatemers (Namasivayam et al. 2021). In this study the authors optimized a new isolation technique of organellar content to sequence the mitochondrial genome. This new purification protocol appears to be very robust and allowed the sequencing of mitochondrial genome at unprecedented depth. The obtained data not only validate previous studies, but they also suggest several new features, such as (potentially) continuous reshuffling of DNA blocks, leading to independent block combinations. The most important aspect of this study is the demonstration of polysomes and the presence of rRNAs within these complexes, taking previous studies (i.e. Lacombe et al., 2019) a step further. Taking all these efforts and data into account it is a very nice and interesting study that will certainly be of interest for a broader readership. All the presented data and analysis appear to be solid and well controlled. However, it must be mentioned that this reviewer is not an expert when it comes to the analysis and comparison of huge genomic datasets and the opinion of a bioinformatician would be helpful in assessing this study in more detail. All other data (organellar purification and analysis of polysomes) appear state of the art and no corrections are required.

      Referees cross-commenting

      I agree with reviewer 2 and 3. Some additional details on techniques and the enrichment should be added.

      Significance

      General assessment:

      Taking all these efforts and data into account it is a very nice and interesting study that will certainly be of interest for a broader readership. All the presented data and analysis appear to be solid and well controlled. However, it must be mentioned that this reviewer is not an expert when it comes to the analysis and comparison of huge genomic datasets and the opinion of a bioinformatician would be helpful in assessing this study in more detail. All other data (organellar purification and analysis of polysomes) appear state of the art and no corrections are required.

      Advance:

      The study fills an important gap in our knowledge regarding the organization and translational activity of the apicomplexan (Toxoplasma) mitoribosome. See also comments above.

      Audience: Cell Biology, Parasitology, Mitochondria

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02012

      Corresponding author(s): Frederic, Berger

      [Please use this template only if the submitted manuscript should be considered by the affiliate journal as a full revision in response to the points raised by the reviewers.

      If you wish to submit a preliminary revision with a revision plan, please use our "Revision Plan" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We thank reviewers for useful suggestions and comments on our manuscript which helped to improve and strengthen our conclusions. Our point-by-point answers are below. We have answered most of the points raised by the reviewers and added numerous new experimental data including detailed structural and biochemical analyses that led to support further that BCP4 (and not BCP3) is the plant functional counterpart of MDC1 because in response to DNA damage it binds phosphorylated H2A.X and recruits the MRN complex. In addition, we provide further support to the phylogenetic analysis and evidence for the plant counterpart of PAXIP1.

      We believe that our revised manuscript which includes a set of new experimental data strongly support our main conclusion that BCP4 is a functional counterpart of metazoan MDC1.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      MDC1 is a key regulator of DNA damage responses (DDR) in animals. MDC1 has multiple protein domains, in which the BRCT domain binds γH2A.X. However, plants lack the homolog of MDC1. In this study, the authors found that BCP4 binds γH2A.X and proposed that BCP4 is a functional counterpart of MDC1, which will greatly enhance our understanding of plant DDR pathway. I have the following concerns.

      1. The relationship between BCP3 and BCP4 needs to be clarified. Line 255, the authors mentioned that"we conclude that BCP3 and BCP4 have functional properties as human MDC1". In the Abstract, the authors mentioned that "we identified BCP4 as a candidate ortholog of human MDC1". I am confused about the conclusion. Both BCP3 and BCP4 are or only BCP4 is MDC1? In addition, in BCP3 and BCP4, only their BRCT domains share homology with MDC1. They lack other domains of MDC1. Therefore, "ortholog" may not be an appropriate term. I think "functional counterpart" may be a better term.

      Response: Our analysis emphasizes the fact that human MDC1 is very derived from an ancestral form MDC1 that did not share most domains found in MDC1 from mammals. Because it is still difficult to establish with certainly what the ancestral MDC1 was, we agree that functional counterpart is a more correct term, so we changed this accordingly throughout the manuscript.

      BCP1-4 all contains tandem BRCT domains. I am wondering whether it is possible to figure out why only BCP3 and BCP4 bind γH2A.X through sequence analysis. Are there any key residues essential for γH2A.X binding?

      Response: We used AlphaFold models of tBRCT domain of BCP1, BCP2, BCP3, and BCP4. While in Alphafold models the tBRCT domain of each BCP protein largely overlaps with a structure of human MDC1 tBRCT domain, only the tBRCT domain of BCP3 and BCP4 are predicted to make contacts with γH2A.X similar to that of human MDC1. Although residues that are involved are not fully conserved between BCP3/4 and human MDC1 we obtain in vitro data supporting that the interaction of BCP4 is mediated by a comparable pocket of three key residues that contact the phosphate group of γH2A.X. See also answers to comments of Referee #2 and new Figures 3 and 4, corresponding description on page 8-9, and Supporting figure 3.

      Line 183, "On an unrooted phylogenetic tree, these two proteins clustered with MDC1 and PAXIP1 (Figure 1B).". In Figure 1B, MDC1 is closer to BCP3 and BCP4 than PAXIP1 and PAXIP1 is closer to BCP2 than MDC1. If the authors want to include PAXIP1 in Figure 1C, the authors should include BCP2 as well. In the γH2A.X binding assays, I do not understand why the authors tested BCP1 instead of BCP2. In Figure 2D, why bcp2 was not included?

      Response: We created a new alignment for Figure 1C including BCP2 tBRCT domain and the tree that includes all BCP BRCT domain (Figure 1D) does support a close relation between MDC1 and BCP3 and 4 and PAXIP1 and BCP1. As we stated on page 5-6 lines 175-178, BCP2, also contains acetyltransferase domain, which is unique for plant BCP2 protein. Based on its domain organization, BCP2 was not considered as a candidate for MDC1 homolog, and we did not perform mutant complementation. This is why after our initial analysis of bcp mutants (DNA damage sensitivity, formation of gammaH2A.X, and phylogeny), and based on similarities with MDC1 and PAXIP1 we focused on bcp1, 3, and 4 mutants and the corresponding proteins. The function of BCP2 remains to be investigated, but this is out of the scope of this manuscript that is primarily dedicated to find the functional counterparts of MDC1 and PAXIP1.

      The expression level of BCP1-4 in the mutants need to be examined using qRT-PCR. Especially, for the bcp3 mutant, which is a weak allele.

      Response: We did not perform this experiment, because it was done in Vladejic et al., 2022 and expression data are available from various genomic dataset on TAIR.

      The authors used "bleomycin" or "zeocin" in different parts. Please be consistent.

      Response: We consistently use Bleomycin for treatment of seedlings followed by western blotting and Zeocin for true leaf assay. These two agents produce DNA double strand brakes in similar ways, and we could show previously that levels of γH2A.X and γH2A.W.7 are similar when using these two agents (Rosa M, Mittelsten Scheid O Bio. Protoc. 4:e1093. doi: 10.21769/BioProtoc.1093: Lorkovic et al., Curr Biol. 2017, doi: 10.1016/j.cub.2017.03.002). Zeocin was chosen for true leaf assays because we observe lower variation between batches and biological repeats compared with bleomycin.

      1. Figure 3E and 3F, please indicate the treatments of the upper and lower panels.

      Response: Thank you for pointing this out. This has been indicated in the corresponding legend of the new Figure 3 A - C.

      Line 338, "bcp1 mutants show reduced homologous recombination rates (Fan et al., 2022; Vladejić et al., 2022; Yu et al., 2023)". The bcp1 mutant was not reported in Fan et al. paper.

      Response: This sentence has been changed to accurately describe data in each of the mentioned papers.

      Line 40, please add a comma after "In ". Line 331, please add a comma after "In mammals". animal

      Response: This has been corrected.

      Line 123, "only BRCA1 and BARD1 were described in plant lineage". Additional BRCT proteins were described in plants, including XIP1 (Nat. Commun. 13:7942), BCP1/DDRM2 (New Phytol. 238:1073-1084; Front. Plant Sci. 13:1023358), and DDRM1 (PNAS, 119: e2202970119).

      Response: This sentence refers to known BRCT domain mediator/effector proteins. From the published data about XIP1, BCP1/DDRM2, and DDRM1, it is not possible to assign these functions to proteins in question. Nevertheless, we changed this sentence to avoid ambiguous interpretation and we later in the text introduce XIP1, BCP1/DDRM2, and DDRM1 proteins as needed.

      Reviewer #1 (Significance (Required)):

      This study identified BCP4 as a functional counterpart of MDC1, which filled the gap of plant DDR signaling.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __ In this study, Frédéric Berger and colleagues identified BCP4 in Arabidopsis thaliana as a potential plant orthologue of vertebrate MDC1. The conclusions are based on both in silico analysis (phylogenetic analysis) and in vitro biochemical and cell biological experiments. BCP4 loss causes sensitivity of DNA damage. Moreover, BCP4 binds to a phosphopeptide derived from the C-terminus of H2AX, via its C-terminal BRCT domains and forms foci in cells exposed to DNA damage, which co-localize with gammaH2AX foci.

      Major comments: The conclusions are generally supported by the data, but the evidence presented is still quite limited. For example, it is still possible that BCP4 recruitment to sites of DNA damage is mediated by another protein and not by direct interaction with gammaH2AX. To firmly conclude that BCP4 is an MDC1 orthologue, it is in my opinion essential to perform a (limited) mutagenesis analysis. The key amino acids in the BRCT domains that recognize gammaH2AX need to be mutated and it has to be shown that these mutants are defective for H2AX phosphopeptide binding and are not recruited to sites of DNA damage. Such residues may be tricky to identify, but one obvious candidate would be the Ser residue in beta1 (VLFS motif). In vertebrates, this is a Thr that directly interacts with the phosphate in gammaH2AX. Another possible critical site may be shortly before alpha2 (RTRN motif). In vertebrates, it is RTVK, and the K makes direct contacts with the phosphate in gammaH2AX. This function is perhaps carried out by an R. Structure prediction with alphafold may help to identify the most critical residues

      Response: We thank the reviewer for these suggestions. We used AphaFold to predict structures of tBRCT domains of all BCPproteins and compared them with structure of human MDC1 in complex with gamaH2A.X peptide. Based on these analyses we performed mutagenesis of critical amino acids in BCP4 based on their predicted interaction and their conservation. We showed that mutations of critical residues reduced or almost completely abolished binding of BCP4 to γH2A.X. These data are now part of the new Figure 4. See also corresponding description on page 8-9. In addition we provide genetic data that show that the foci formation of BCP4 depends on H2A.X (new Fig 3B and C). We did not attempt genetic complementation experiments with these mutants because it would take nine months to obtain stable transgenic plant lines expressing various mutant versions of BCP4 and the limitation of Arabidopsis transgenesis does not allow to control precisely the expression of transgenes, which could cause a difficult interpretation in this particular case.

      Another critical issue is the introduction of the study. This needs to be revised, because the literature is not correctly cited in several places. For example, the cited paper by Salguero et al., 2019 did not show that the PST repeats of MDC1 constitute a docking site of TP53BP1, but instead, that the PST repeats can bind to chromatin independently of gammaH2AX.

      Response: We thank the reviewer for spotting this mistake. We carefully checked all references and corrected all wrongly associated ones or used original reports instead of reviews.

      Also, we did re-write some parts of the Introduction as referee #1 also asked for some clarification.

      The data are generally well presented and convincing. The only thing that needs to be added is a quantification of the microscopic analysis (e.g. number of foci per cell, or similar).

      Response: We quantified the foci number in all mutants reported in Figure 2C. These data are now included in the new Figure 2D. Optional: it would be interesting to address the question why plants seem to have two MDC1 orthologues. The longer BCP4 and the shorter BCP3. What is the functional difference between those? Do they perhaps distribute functions that are combined in one protein in vertebrate MDC1 on two different proteins? Response: Thank you for prompting us to address this outstanding question. We now provide evidence supporting that only BCP4 is a functional counterpart of MDC1. We show that a specific region of BCP4 but not BCP3 is able to interact with NBS1 of the MRN complex (see new Figure 6). Also, BCP3 is missing the N-terminal TQxϕ repeats present in BCP4. Although the function of these repeats is unknown at this point, these data together suggest some functional diversification between BCP3 and BCP4. We mention this on page 11, lines 372-374.

      Reviewer #2 (Significance (Required)):

      The strength of the study is the detailed phylogenetic analysis. Also, the biochemistry and cell biology is well done.

      Limitations are the lack of evidence that BCP4 carries out functions in the cell (beyond recognising gammaH2AX) that are carried out by MDC1 in vertebrate cells

      Response: We thank the reviewer for pointing out this important point. To address it we performed pull-down assays with TQxϕ and SQ/DWD regions of BCP4 with NBS1 and found that Arabidopsis NBS1 interacts with the SQ/DWD region, and that this interaction is mediated by FHA+tBRCT of NBS1. Based on Alphafold prediction, we performed further deletion and point mutation analysis of the SQ/DWD region and defined that the binding of NBS1 requires an alpha-helix comprising sequence that is not conserved in BCP3. So, we concluded that a sequence specific of BCP4 (not in BCP3) is capable of recruiting the MRN subunit NBS1.

      At this point we could not demonstrate this in vivo by analyzing NBS1 foci in BCP4 mutant background. Unfortunately, commercial antibodies for plant NBS1 or other subunits of the MRN complex are not available, and to get transgenic plants expressing fluorescent protein tagged NBS1 would require a period much longer than the time for reasonable revisions of a manuscript. Nevertheless, our in vitro interaction data strongly argue for BCP4 having function in binding MRN complex as human MDC1, although the mode of interaction of BCP4 with NBS1 is different from that of human MDC1 and NBS1.

      Please see the new Figure 6 and corresponding description on page 11-12.

      The study is of great interest to readers working on chromatin responses to DNA damage in plants.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary The authors set out to find proteins containing BRCT domain to isolate the readers of phosphorylated H2A.X in plants. Using systematic analysis of the BRCT domain proteome, they discovered 21 proteins. Further analysis showed that BCP3 and BCP4 are the ortholog of animal MDC1 and BCP1 is the animal ortholog of PAXIP1. They also extended their work to an evolutionary perspective, finding that BCP1 and BCP4 in plants and PAXIP1 and MDC1 in metazoans evolved independently form a common ancestor. However, this manuscript raises some concerns. Checkout the comments and questions below.

      Major comments: 1) If you think that BCP3 and BCP4 work as a mediator of DDR, can you show us that those mutants have a defect of DDR? The authors only assessed true leaf developing. Leaf developing is affected by not only DNA damage but also other factor. Therefore, authors should show us additional data showing the BCP mutant lines show defective of DNA damage response.

      Response: The “true leaf assay” is a classical assay for testing plant mutants for DNA damage sensitivity (Rosa M, Mittelsten Scheid O Bio. Protoc. 4:e1093. doi: 10.21769/BioProtoc.1093). If DNA damage occurs and is not efficiently repaired, meristematic cells in shoot meristem are arrested and do not divide, hence plants do not produce the first pair of “true” leaves after cotyledons expand. In this assay cotyledons open and grow normally as they are already fully determined and do not undergo any cell division after seed germination.

      In this assay the treated WT seedlings also show a reduction of the number of plants with true leaves as compared with untreated WT (100%). Furthermore, WT and mutant seedlings develop normally and comparably without Zeocin induced DNA damage.

      2) Do you have DNA damage sensitivity data for bcp3 bcp4 double mutants?

      Response: We obtained bcp3bcp4 double mutant and tested it for DNA damage sensitivity. The double mutant is slightly more sensitive than bcp4 single mutant, but not as sensitive as H2A.X mutant. The reason for this is presumably the nature of the bcp3 mutant allele available, with a T-DNA insertion located in the 5’-UTR with some residual expression of BCP3 protein as reported by Vladejic et al., 2022. We did not feel that this would improve the manuscript, so we did not include this data. To obtain a new mutant allele would take time and work beyond the reasonable time required for revision. In addition, since we show that the functional counterpart of MDC1 is BCP4, we did not think that it is relevant to pursue further the characterization of the function of BCP3 in the context of this manuscript.

      3) Some red algae have H2A.X but don't have BCP4 and BCP1 (Figure 4). In this case, how do they read the phosphorylated H2A.X? Can you discuss the point?

      Response: Actually, most red algae do not even have H2A.X. At this point we do not have data that could answer this question and it is difficult to make any prediction about this. Analysis of DDR system in red algae is totally beyond the scope of the current manuscript. See also answer to comment #5.

      4) L307-L312: I thought that the timing of the appearance of SQEF motif in H2A.X differ from the appearance of BCP4 from Figure 4. Why do you say that the evolution of BCP4 and H2A.X coincides?

      Response: we thank the Reviewer for pointing out the need for clarification.

      Histone H2A with a C-terminal SQEF/Y motif is categorized as H2A.X that distinguishes this variant from H2A.Z (not discussed here) and H2A itself. In Archaeplastida many algal species possess either H2A or H2A.X. Only in streptophytes the ancestral gene duplicated leading to neofunctionalization of both H2A and H2A.X and in this case H2A.X form a monophyletic clade. The evolution of BPC1 and 4 are slightly posterior or coincident with this neofunctionalized H2A.X variant, suggesting co-evolution in streptophytes.

      5) Some red algae don't have BCP1, BCP4 and H2A.X. How do they transfer the signal to downstream? Do you have any idea about this?

      Response: To address this interesting question we re-analyzed BRCT domain proteome of the red algae and again could not find any protein containing features of BCP4 present in green algae and land plants or in Opistokont MDC1.

      We did find that red algae without MDC1 do encode MRE11, RAD50 but not NBS1. Also, components of non-homologous end joining DNA repair pathway, Ku70 and Ku80 are conserved in these organisms. So, how some red algae cope with DNA damage remains enigmatic. Similarly unicellular red algae do not have the classical autophagy pathway. This is the result of the very strong genome reduction (Response: Thanks for this comment. We did change title of the manuscript to avoid ambiguity.

      Minor comments: 6) I think you should show us a schematic representation of BAP1 and PAXIP1 to compare both protein features.

      Response: We added schematic presentation of PAXIP1 to Supporting Figure 2B.

      7) L176-L178: Which data support this sentence? Response: The sentence in question: “BCP1 has two tBRCT domains positioned at the N- and C-terminus and a so far unrecognized C-terminal PHD finger which is present in all plant lineages except Brassicaceae (Supporting Figure S1A and S2A).”

      Response: Our data presented on Supporting Figure S1A (schematic presentation of BCP1 protein with indicated PHD finger consensus sequence) and S2A and Source data (alignment of PHD fingers in BCP1 in flowering plants, non-flowering land plants and multicellular green algae) clearly demonstrate the presence of a C-terminal PHD finger in BCP1 except in Brassicaceae. These can also be seen in the full complement of BCP1 sequences that are available in Source data.

      8) L271-L279: There are unreadable characters at "TQx_".

      Response: This very likely appeared during conversion into PDF file. We fixed this now.

      Reviewer #3 (Significance (Required)):

      Significance: General assessment: This study give us an idea how organisms have evolved the upstream system of DDR.

      Advances: This study extend the knowledge of DNA damage response in plants.

      Audience: broad and basic research

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The authors set out to find proteins containing BRCT domain to isolate the readers of phosphorylated H2A.X in plants. Using systematic analysis of the BRCT domain proteome, they discovered 21 proteins. Further analysis showed that BCP3 and BCP4 are the ortholog of animal MDC1 and BCP1 is the animal ortholog of PAXIP1. They also extended their work to an evolutionary perspective, finding that BCP1 and BCP4 in plants and PAXIP1 and MDC1 in metazoans evolved independently form a common ancestor. However, this manuscript raises some concerns. Checkout the comments and questions below.

      Major comments:

      1. If you think that BCP3 and BCP4 work as a mediator of DDR, can you show us that those mutants have a defect of DDR? The authors only assessed true leaf developing. Leaf developing is affected by not only DNA damage but also other factor. Therefore, authors should show us additional data showing the BCP mutant lines show defective of DNA damage response.
      2. Do you have DNA damage sensitivity data for bcp3 bcp4 double mutants?
      3. Some red algae have H2A.X but don't have BCP4 and BCP1 (Figure 4). In this case, how do they read the phosphorylated H2A.X? Can you discuss the point?
      4. L307-L312: I thought that the timing of the appearance of SQEF motif in H2A.X differ from the appearance of BCP4 from Figure 4. Why do you say that the evolution of BCP4 and H2A.X coincides?
      5. Some red algae don't have BCP1, BCP4 and H2A.X. How do they transfer the signal to downstream? Do you have any idea about this?
      6. Title is not clear to understand. Please change it more suitable one.

      Minor comments:

      1. I think you should show us a schematic representation of BAP1 and PAXIP1 to compare both protein features.
      2. L176-L178: Which data support this sentence?
      3. L271-L279: There are unreadable characters at "TQx_".

      Significance

      General assessment:

      This study give us an idea how organisms have evolved the upstream system of DDR.

      Advances:

      This study extend the knowledge of DNA damage response in plants.

      Audience:

      broad and basic research

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Frédéric Berger and colleagues identified BCP4 in Arabidopsis thaliana as a potential plant orthologue of vertebrate MDC1. The conclusions are based on both in silico analysis (phylogenetic analysis) and in vitro biochemical and cell biological experiments. BCP4 loss causes sensitivity of DNA damage. Moreover, BCP4 binds to a phosphopeptide derived from the C-terminus of H2AX, via its C-terminal BRCT domains and forms foci in cells exposed to DNA damage, which co-localize with gammaH2AX foci.

      Major comments:

      The conclusions are generally supported by the data, but the evidence presented is still quite limited. For example, it is still possible that BCP4 recruitment to sites of DNA damage is mediated by another protein and not by direct interaction with gammaH2AX. To firmly conclude that BCP4 is an MDC1 orthologue, it is in my opinion essential to perform a (limited) mutagenesis analysis. The key amino acids in the BRCT domains that recognize gammaH2AX need to be mutated and it has to be shown that these mutants are defective for H2AX phosphopeptide binding and are not recruited to sites of DNA damage. Such residues may be tricky to identify, but one obvious candidate would be the Ser residue in beta1 (VLFS motif). In vertebrates, this is a Thr that directly interacts with the phosphate in gammaH2AX. Another possible critical site may be shortly before alpha2 (RTRN motif). In vertebrates, it is RTVK, and the K makes direct contacts with the phosphate in gammaH2AX. This function is perhaps carried out by an R. Structure prediction with alphafold may help to identify the most critical residues

      Another critical issue is the introduction of the study. This needs to be revised, because the literature is not correctly cited in several places. For example, the cited paper by Salguero et al., 2019 did not show that the PST repeats of MDC1 constitute a docking site of TP53BP1, but instead, that the PST repeats can bind to chromatin independently of gammaH2AX.

      The data are generally well presented and convincing. The only thing that needs to be added is a quantification of the microscopic analysis (e.g. number of foci per cell, or similar).

      Optional: it would be interesting to address the question why plants seem to have two MDC1 orthologues. The longer BCP4 and the shorter BCP3. What is the functional difference between those? Do they perhaps distribute functions that are combined in one protein in vertebrate MDC1 on two different proteins?

      Significance

      The strength of the study is the detailed phylogenetic analysis. Also, the biochemistry and cell biology is well done.

      Limitations are the lack of evidence that BCP4 carries out functions in the cell (beyond recognising gammaH2AX) that are carried out by MDC1 in vertebrate cells

      The study is of great interest to readers working on chromatin responses to DNA damage in plants.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      MDC1 is a key regulator of DNA damage responses (DDR) in animals. MDC1 has multiple protein domains, in which the BRCT domain binds γH2A.X. However, plants lack the homolog of MDC1. In this study, the authors found that BCP4 binds γH2A.X and proposed that BCP4 is a functional counterpart of MDC1, which will greatly enhance our understanding of plant DDR pathway. I have the following concerns.

      1. The relationship between BCP3 and BCP4 needs to be clarified. Line 255, the authors mentioned that"we conclude that BCP3 and BCP4 have functional properties as human MDC1". In the Abstract, the authors mentioned that "we identified BCP4 as a candidate ortholog of human MDC1". I am confused about the conclusion. Both BCP3 and BCP4 are or only BCP4 is MDC1? In addition, in BCP3 and BCP4, only their BRCT domains share homology with MDC1. They lack other domains of MDC1. Therefore, "ortholog" may not be an appropriate term. I think "functional counterpart" may be a better term.
      2. BCP1-4 all contains tandem BRCT domains. I am wondering whether it is possible to figure out why only BCP3 and BCP4 bindγH2A.X through sequence analysis. Are there any key residues essential for γH2A.X binding?
      3. Line 183, "On an unrooted phylogenetic tree, these two proteins clustered with MDC1 and PAXIP1 (Figure 1B).". In Figure 1B, MDC1 is closer to BCP3 and BCP4 than PAXIP1 and PAXIP1 is closer to BCP2 than MDC1. If the authors want to include PAXIP1 in Figure 1C, the authors should include BCP2 as well. In the γH2A.X binding assays, I do not understand why the authors tested BCP1 instead of BCP2.
      4. The expression level of BCP1-4 in the mutants need to be examined using qRT-PCR. Especially, for the bcp3 mutant, which is a weak allele.
      5. The authors used "bleomycin" or "zeocin" in different parts. Please be consistent.
      6. In Figure 2D, why bcp2 was not included?
      7. Figure 3E and 3F, please indicate the treatments of the upper and lower panels.
      8. Line 338, "bcp1 mutants show reduced homologous recombination rates (Fan et al., 2022; Vladejić et al., 2022; Yu et al., 2023)". The bcp1 mutant was not reported in Fan et al. paper.
      9. Line 40, please add a comma after "In animal". Line 331, please add a comma after "In mammals".
      10. Line 123, "only BRCA1 and BARD1 were described in plant lineage". Additional BRCT proteins were described in plants, including XIP1 (Nat. Commun. 13:7942), BCP1/DDRM2 (New Phytol. 238:1073-1084; Front. Plant Sci. 13:1023358), and DDRM1 (PNAS, 119: e2202970119).

      Significance

      This study identified BCP4 as a functional counterpart of MDC1, which filled the gap of plant DDR signaling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript addresses how the first mitotic spindle is centered in the ascidian zygote to promote a symmetric cell division. This is a universal problem in animals because the sperm centrosome, which will form the poles of the mitotic spindle, is initially at the edge of the zygote because it comes in from the outside. The authors hypothesize that cortical pulling is turned off so that a combination of microtubule pushing and cytoplasmic pulling can center the mitotic spindle. Experimental methods include live imaging of ascidian zygotes injected with mRNAs and proteins as well as computational modeling. Results support the hypothesis that cortical pulling is turned off at the same time that the spindle centers.

      Major comments:

      Inhibition of entry into mitosis by p21 injection prevents centrosome centering. This results supports the idea that CDK activity is required for centrosome centering but does not specifically support the inhibition of cortical pulling model.

      Data in figure 3 is used to support the conclusion that cytoplasmic pulling does not change between interphase and mitosis and therefore an increase in cytoplasmic pulling during mitosis cannot be responsible for the centering of the spindle. This interpretation needs to be more carefully qualified in the text of the results and discussion. The only "pulling proxy" that is quantitatively compared between interphase and mitosis is movement of Cell Mask Red-labeled endosomes toward or away from the centrosome. (1) Only endosomes are tracked and it is possible that yolk granules or mitochondria could exhibit different results.

      (2) If endosomes were the only contributors to cytoplasmic pulling, the ratio of anterograde vs retrograde transport would be significant. Data in Fig. S2C looks like this might change between interphase and mitosis but no statistical result is shown testing for a change in this ratio.

      (3) Single plane imaging is justifiably used for this high speed analysis which means that vesicles are likely to leave the focal plane frequently. Leaving the focal plane would artificially reduce "track persistence" which is strangely reported in "n.u." units with very small values. N.u. units are not defined. The units should by um like total transport. A more accurate statement might be: "We could not detect a significant change in centrosome-direct endosome transport in our experiments but we cannot exclude the possibility that a significant difference would be detected with different cargo or different methods." The data presented in figure 4 provides very strong evidence that cortical pulling is reduced in mitosis relative to interphase, supporting the overall conclusion of the paper.<br /> The interpretation of fig. 5 is not strongly supported. Latrunculin is not necessarily going to inhibit cortical pulling and cortical pushing without affecting cytoplasmic pulling. In C. elegans, depletion of GPR/LGN would be the appropriate experiment. The interpretation should be qualified. The presentation of figure 6 could be improved. The protrusions indicative of cortical pushing are not quantified interphase vs mitosis. Qualitatively, there are more protrusions during mitosis than interphase which could support an increase in cortical pushing as a mechanism promoting centration. The interpretation of this result should be clarified. Whether cortical pushing is regulated in the model should also be clarified. The cytosim results presented in figure 7 lend support to he overall conclusions of the manuscript. The legend for figure 7B should state the number of simulations run for each condition.

      Minor comments:

      The presentation of figure 1 could be improved. 1A and 1D are redundant, and C and D are cited in the text before B. The most important data is figure 1B (quantification of the distance of the paternal DNA from the cell center) but no images of DNA are shown. Only a cartoon of DNA localization is shown. The results text states that the data in fig. 1B was derived from time-lapse sequences of zygotes

      (1) expressing histone h2b::tomato,

      (2) histone h2b::venus, or

      (3) stained with Hoechst. If the zygotes all had microtubules and DNA labelled, Fig. 1A could be 2 color time-lapse images showing both DNA and microtubules. 1A could have the larger number of time points currently in 1D, then 1D could be deleted. The authors should then take care to cite the sub figures in order in the text. Given the number of DNA labeling methods, it would also be appropriate in the methods to state how the authors know that the labeling methods are non-toxic (especially the live cell Hoechst labeling). Fig. S1B needs statistically significant differences marked since the text states that the significant differences in Fig. 1B were reproduced in fixed images in S1B.

      The legend for fig. 2B needs to be clarified. It states that the dark shaded bar is "mitosis entry" but the p21 injected zygotes are not entering mitosis at this timepoint.

      The legend to figure 4B could spell out Cell Mask Orange.

      Referees cross-commenting

      I agree with most of reviewer 1's comments. The author's should validate their membrane ingression assay for cortical pulling by providing quantification of cortical actin after latrunculin treatment in each condition. Regarding the expectation that the sperm aster should move toward the cortex during interphase when cortical pulling is active, Fig. 1B shows significant movement toward the cortex during meiosis but movement away (with no statistical significance marked) during interphase. This might be due to a balance with microtubule pushing on the membrane. However, this question raises the need for better presentation of the data in figure 1B. The sperm DNA should start at 0 um from the cortex at fertilization. Is the huge variability in sperm DNA position at the first time point due to variability in egg diameter? If so, an additional plot of distance from the cortex or better a plot with %radius instead of um would be helpful. Or, is the huge variability due to significant movement of the sperm DNA during meiosis before the first time point? If this is the cases, the authors might present a scatter plot of the net displacement toward or away from the cortex for each individual zygote. This improved analysis might address some of reviewer 1's concerns.

      Significance

      Because inhibition of cortical pulling during M phase has been reported in multiple species before, to have a really high impact, the authors would need to identify the relevant phosphorylation sites on dynein/dynactin/LGN and show that non-phosphorlatable mutants retain invaginations in M phase and the aster fails to center. The work could have a moderate impact on the field with just the controls already suggested.

      Significance

      Significance

      Centering of the first mitotic spindle is an important biological process that has been previously addressed in C. elegans and mammalian zygotes. This manuscript provides a high quality description of centering in ascidian zygotes with appropriate comparisons to mammals and C. elegans. While the quality of the data is strong, no new molecular mechanisms are identified which limits the significance. Because the authors cite primate papers showing that ascidian centering follows the pattern of primates, it is not clear how the current study adds new medical significance to what is already known. Perhaps the authors could highlight what was not shown in the primate papers that is shown in the current manuscript.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Proper mitotic spindle orientation is decisive for asymmetric cell division and cell volume regulation after cell division. Different cellular systems utilize many ways to orient the spindle in space and time. In this work, the authors investigate the mechanisms regulating sperm aster centration in the zygote of ascidian Phallusia mammillata. The authors show that the sperm aster stays close to the cell cortex at the vegetal pole during interphase and migrates to the cell centre during mitosis. Their data reveal that cortical pulling forces are active during interphase, and these forces are off during mitosis. The strength of the work is that the authors are analyzing the aster positioning in eggs of ascidians, where the aster behaviour is similar to primates. The work's limitation is that most important conclusions are made using inhibitors that can impact multiple processes in the zygote of Phallusia mammillata (please see below).

      Major Comments:

      • In Fig. 4, the authors claim that the cortical forces are strong in interphase and weak in mitosis. This experiment was performed in conditions that disturb the actin cytoskeleton, and the cell membrane invaginations were monitored as a proxy for cortical force generation. Here the authors observed that the number of invaginations are decreased in mitosis, compared to the interphase. This led them to conclude cortical pulling forces are higher in interphase than in mitosis. The change in behaviour of the cell membrane invaginations could be because of a difference in actomyosin-based cytoskeleton thickness/dynamics between interphase and mitosis. Do the authors know if the cortical actomyosin meshwork between interphase and mitosis remains the same?

      • Similarly, did the authors test if the injection of p21 or cyclinBdelta90 does not change the actin cytoskeleton in the injected cell versus the non-injected cell (Fig. 4)?

      • If the cortical pulling forces are acting on the sperm asters in interphase, as the authors concluded, I wonder why do the authors not observe a significantly more number of invaginations close to the sperm asters because of the high density of microtubules in that region?

      • The authors mentioned that these membrane invaginations are microtubule-dependent and cited Godard at al., 2021. This point is vital; thus, the authors should include the nocodazole experiment in their data. Since the dynamic nature of microtubules is critical for aster positioning in C. elegans zygote, the authors should further test if dynamic microtubules regulate sperm aster position in interphase by treating these cells with taxol.

      • Also, the authors should analyze if the membrane invaginations during interphase are dynein/dynactin-dependent.

      • If the cortical pulling forces are the chief reason to keep sperm asters close to the cell membrane during interphase, then over time, the sperm aster distance from its geometric centre to the cell membrane should decrease. Do the authors observe this?

      • The authors should quantify the number of invaginations at the two-cell stage in p21 or cyclinBdelta90 injected cells.

      • In Fig. 5, the authors inhibit the actin cytoskeleton for characterizing if the cortical pulling forces are critical to prevent aster migration. The impact of the actin cytoskeleton on aster migration is quite indirect and does not affirmatively support their conclusions that it is via impacting cortical pulling forces. Can the authors show that cortical force generators (dynein/dynactin complexes) are localized at the membrane in their system, and if the actin inhibitors impact their localization?

      • It would be important for the readers to see both the DNA and the asters in Fig. 1 as the authors have injected Ensconsin-GFP and H2B-Tom mRNAs. Also, why did the authors choose to measure the distance between male DNA and the cell centre? They could measure the geometric centre of the male aster to the cell centre before the meeting and the centre of the mitotic spindle to the cell centre after spindle assembly, which would be more appropriate for studying spindle behaviour.

      Minor points:

      1. Do the authors know if the microtubule dynamics remain unaltered in p21 injected zygote (Fig.2)? It could simply be that the impact of p21 injection on aster migration is because of the change in microtubule dynamics

      2. In line 149, the authors write, 'During meiosis I, the aster is in the egg cortex' I guess that the authors would like to say that it is juxtaposed to the cell cortex.

      Significance

      The strength of the work is that the authors are analyzing the aster positioning in eggs of ascidians, where the aster behaviour is similar to primates. The work's limitation is that most important conclusions are made using inhibitors that can impact multiple processes in the zygote of Phallusia mammillata (please see the comments).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Evidence reproducibility and clarity The authors report that the human-specific KLF factor KLF7 can induce pluripotency in humans and can improve the reset toward naïve pluripotency when cells are cultured the PXGL medium. KLF7 falls behind KLF4 in reprogramming efficiency but might have a unique role in naïve reset (10-20 fold less efficient in iPSC colony yield). The topic of the study is interesting and adds important insights into the roles of KLF factors along the pluripotency continuum and pinpoints differences between mice and human. There are implications for stem cell engineering and boosting the developmental potency of stem cells (blastoid formation potential, interspecies chimera formation). However, some of the claims as to the unique role of KLF7 are unconvincing in the absence of comparison with other KLF factors, especially the Yamanaka factor KLF4. The flow and coherence of the text can be improved - at times reasoning and motivation of experiments are hard to follow

      Major comments - Why would a pan-pluripotency factor KLF7 which is expressed in both primed and naïve cells more potently trigger the naïve reset than the naïve specific factors KLF4/5/17? Such a comparison could widen the scope and interest of their work.. I would find it interesting if authors would compare the ability of key KLF factors to induce naivety. This is of particular interest as the overexpression of engineered Sox along with KLF4 was reported to improve the quality and developmental potential of PSC in multiple species (MacCarthy et al bioarxiv). Such an analysis could reveal unique features of KLF family members and lead to advanced stem cell models. They actually claim the SK naïve reset does not require naïve medium but the expression of SK alone is sufficient to induce this state. What do the authors think about this claim? Overall I feel the potential role of KLF7 in naïve reset is interesting but underdeveloped.

      We thank the reviewer for the useful comments.

      It has been shown in murine PSCs, that the pluripotency factors Nanog and Oct4 are expressed both at the naive and primed state and their forced expression, in combination with a medium supporting naive pluripotency, efficiently resets primed murine PSCs to naive (Radzisheuskaya A. et al., 2013; Theunissen T. et al., 2011). It is therefore not surprising that a similar regulation might also be conserved in human and that the general pluripotency factor KLF7 is expressed in both states and drives efficient resetting.

      Moreover, we agree that a direct comparison with another KLF factor could improve our work, so thanks to the reviewer’s suggestions, we will generate conventional/primed hPSCs with exogenous KLF4 expression in order to assess the efficiency of chemical resetting compared to hPSCs with overexpression of KLF7.

      Minor comments

      • P3, line 52: "Surprisingly, however, KLF4 is also routinely used to generate conventional human iPSCs." Why is this surprising? KLF4 (and SOX2) are the most potent iPSC factors whilst MYC and OCT4 can be omitted (at least in mouse).

      Thank you for pointing this out. We have rephrased the text accordingly (line 51).

      • It would be nice if the demonstration of pluripotency and quality of KLF7 iPSC go beyond transcriptome profiling and included some further assays common in the field.

      We assessed the quality of our OSK7M iPSCs by performing an EBs differentiation assay (Fig. 3d). We rephrased the text to further highlight this experiment (line 106). Of note, in vivo assays like teratoma formation are not allowed in Italy due to official regulations on animal testing.

      • Fig 1A-B: color coding (of dots) is very confusing- which ones are PSCs and which ones are iPSCs? Another colour palette might fix. What is meant by "interrogating previously published data" (line 67)? Are these public RNA-seq data that were re-analyzed? I

      We will highlight in the figures which cells are PSCs or iPSCs using different colours and shapes.

      We rephrase the text to clarify that available RNA-seq data were reanalysed (line 67).

      • Fig 2b: how were the colony numbers obtained? By morphology, or using live cell staining? So form of staining is recommended colony counting (i.e. TRA-1-60).

      We scored colonies both based on their morphology and after OCT4/NANOG staining. Actually, we observed that the counting based on morphology underestimated the number of iPSC colonies, so it is a more stringent method to score reprogrammed cells.

      • Fig 2e: Also, they say that "[t]hree technical replicates were carried out for all quantitative PCR". Unless I'm mistaken, it seems that only two technical replicates were performed for these qPCR reactions (two dots visible per bar).

      In figure 2e dots refer to two independent experiments. In each experiment we carried out three technical replicates for each sample.

      • Fig 3c: "colture"; change to "culture" (and the title: "bone fide" should be "bona fide")

      Thank you. We amended the typos in the figure and in the text (line 111).

      • For Fig 2/3: since the paper is on KLF4/7, I'm surprised that expression levels of OCT4 and SOX2 were analysed but not KLF4. Given that the main finding was that KLF4 was not upregulated in PSCs, I would be interested to see what the KLF4 levels are like in the iPSCs. RNA-seq analysis/qPCR would be best; but if the authors would like to use other methods, that's fine too.

      This is a good suggestion, we will add to Figure 3b the KLF4 expression levels.

      • Fig 4: The explanatory text is too sparse. Readers should be reminded of the differences between of naïve and primed PSCs and the known roles of KLF4 (this could also be improved in the introduction). List names of naïve media used on top of author names (5iLA, PXGL, EPSCM etc). Why was HENSM by Hanna excluded?

      We will amend the text explaining the main differences between naive and primed PSCs and the role of KLF4.

      We will add PSCs derived in the HENSM medium in the analyses shown in figure 4.

      • Fig 5: KLF7 is classified as a general pluripotency marker, but KLF4/KLF17 are classified as naïve markers. In that case, wouldn't it make more sense to overexpress a naïve specific marker in order to achieve naïve iPSCs at least as a control? What was the motivation here? I think the authors need to provide a more compelling reasoning why only KLF7 was studied or add more data for other KLFs (especially since it seems that the reprogramming efficiency of KLF4 is higher than that of KLF7 for conventional reprogramming (see Fig 2B)...)

      We will perform resetting experiments using KLF4, as suggested, in order to compare the efficiency of KLF7 to a known naive factor.

      o Fig 5B: the text currently says that the cells on the left side of Fig 5B are from Day7; but it says the cells are from Day0 in the actual figure. Which one is it? Also, based on how the text is written, do the cells on the left also contain EOS, or are they the wild-type variety?

      We agree that the text was confusing. Colonies appeared at day 7, but we showed them at day 12, when they were larger and easier to see. We amended the text accordingly. Moreover, the images at day 0 are simply the cell lines at the beginning of the resetting, which also contain EOS, as quantified on the right panels of Fig. 5b.

      o Fig 5c: not all markers in this figure are naïve markers (as stated in the text); would suggest separating the markers and labelling them accordingly AND rewriting the text to reflect that.

      We labelled the markers in the Fig. 5c as suggested by the reviewer and rephrased the text (line 136-137).

      o Life cell reporters for naivety (CD75,SUSD2) could enrich this study.

      We believe that the combination of bulk RNAseq and immunostaining for functional regulators of naive pluripotency (i.e. KLF17 and OCT4 (Lea et al., 2021 Development; Theunissen et al., 2014 Cell Stem Cell) are sufficient to described the acquisition of naive pluripotency.

      • Schemes in 5A/6A could indicate when transgenes were added

      For our chemical resetting experiments we used conventional hiPSCs (KiPS) with stable expression of KLF7 or an EMPTY vector (lines 126-127). We have also added this detail in the figure legends (line 291).

      • Fig 7: the claim regard mouse pluripotency is a little outside of the scope of this paper; would recommend de-emphasizing the claim .

      We will streamline the discussion and put less emphasis on murine PSCs.

      We thank the reviewer for the good suggestion that will be included in the revised manuscript.

      • Similarly, are there features outside the DBD that might suggest a unique activity (IDR, TAD,PTM)? It seems KLF7 generates iPSCs much less efficiently than KLF4. Given the high similarity between their DBDs I wonder why this is so.

      As above, this is an excellent point for discussion that will be added to revised manuscript.

      Reviewer #1 (Significance (Required)): Significance • General assessment: The strength of the study is that the authors provide a potentially new way for the naïve reset in humans. This could improve human stem cell and embryo models. A limitation is that evidence is solely based on molecular (not functional) profiling and the uniqueness of KLF7 versus other KLF's (first and foremost KLF4) was not established. • Advance: Findings on the human-specific role of KLF7 are novel and interesting especially the ability to facilitate the naïve reset. Yet, in the absence of a more systematic comparison with other methods (and KLF factors), the claim that KLF7 is essential for this feat is unconvincing. • Audience: It's of interest to basic researchers in the broader stem cell community and those interested in early embryo development. I work on cellular reprogramming, sequence-structure-function analysis of reprogramming factors and pluripotency.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)): The naïve pluripotency is established in the inner cell mass (ICM) of blastocysts. After implantation, the naïve epiblast becomes primed for lineage specification. Pluripotent stem cells (PSCs) have been successfully derived from early embryos at different stages. In mice, stem cell derivations from ICM yield naïve ESCs. Primed PSCs derived from E5.5-7.5 epiblast are epiblast stem cells (EpiSCs). In humans, stem cell derivations from human embryos have yielded PSCs with features distinct from mouse ESCs and more like EpiSCs. Recently, naïve human PSCs have been directly isolated from pre-implantation epiblast or transformed from primed PSCs. Derivation of naïve hPSCs contributes to studying the molecular events of early lineage specification and accelerates the development of the generation of humanized organs in animal models from naïve hPSCs, opening an exciting avenue for regenerative medicine.

      In this manuscript, the authors found that OSK7M could enable the reprogramming of human primary somatic cells. KLF7 is highly expressed in naive PSCs and its forced expression in conventional hPSCs induces upregulation of naive markers and boosts the efficiency of chemical resetting to naive PSCs, suggesting that KLF7 is a general human pluripotency factor and an inducer of pluripotency. The new findings extend KLF7 function in naïve PSC generation and also provide references for the efficient generation of naive PSCs. The people who focus on studying pluripotency and early embryo development might be interested in and influenced by the findings. The data are in general convincing. However, there are some issues that need to be resolved and improved.

      Major comments:

      1. Line 90: The authors showed that colonies derived from OSKM and OSK7M cocktails could be readily propagated for at least 10 passages. How many passages can OSK7M-iPSCs maintain in vitro prolonged culture?

      And how about the pluripotency and developmental potential of OSK7M-iPSCs for a long-time culture? For example, pluripotency gene expression and teratoma formation.

      We culture OSK7M-iPSCs up to 10 passages without noticing any abnormalities in the morphologies and duplication rate. However, we could extend such cultures for 5-10 more passages (i.e. a total of 2 months from iPSC generation) and perform staining for pluripotency markers or molecular analyses (by qPCR) and EBs differentiation assay to assess their developmental potential.

      In vivo assays like teratoma formation are not allowed in Italy due to official regulations on animal testing.

      1. Overexpression of KLF7 promotes the derivation of naïve PSCs. Are they different from naïve PSCs derived only by chemical resetting? For example, the pluripotency, the in vitro or in vivo developmental potential, and the efficiency of human blastoid generation.

      A key feature of naive PSCs is the potential to differentiate towards the trophoblast lineage in addition to the 3 germ layers. We will perform in vitro differentiation and EB formation assay to gauge the effect of KLF7 on differentiation potential.

      However, establishing a human blastoid generation protocol would be beyond the scope of the current study.

      As the manuscript mentioned, KLF7 is a general human pluripotency factor and an inducer of pluripotency. How does KLF7 knock-out affect the biological characteristics of hESCs? And whether KLF17 KO affects the derivation of naïve PSCs?

      We agree that it would be informative to study the requirement of KLF7 for the maintenance of primed pluripotency and during resetting. We plan to do so either by knockdown or CRISPRi, depending on which technique allows efficient and controllable depletion of KLF7. It might be the case that a straight KO of KLF7 induces the collapse of primed PSCs, making resetting experiments not feasible.

      1. Can naïve PSCs be directly reprogrammed from somatic cells with OSK7M under the PXGL medium? If so, how is the efficiency?

      We believe that studying the role of KLF7 in the context of direct reprogramming of somatic cells to naive pluripotency would go beyond the scope of this manuscript, as it would require substantial work for optimisation and generation of reagents.

      Moreover, we think that both by over-expression and inhibition of KLF7 during resetting, we will be able to investigate its involvement in naive pluripotency acquisition.

      1. Figure 6d: The data showed that in PXGL medium, KiPS (EMPTY) contained about 66% of KLF17+ cells on day 7 and declined to 30% of KLF17+ cells on day 12. Why do KLF17+ cells (naïve PSCs) decline in PXGL medium? Cells overexpressing KLF7 contained about 62% of KLF17+ cells on day 7 and increased to 89% of KLF17+ cells on day 12. Whether KLF7 function at this stage?

      The reviewer raised an intriguing point, concerning the maintenance of naive markers during resetting. Chemical resetting seems to induce transiently >60% of KLF17+/OCT4+ positive cells by day 7, however only a fraction of these cells is stabilised until day 12 (30%). In the presence of KLF7 overexpression, we observed a similar induction at day 7, which is maintained, or increased, up to day 12.

      This would indicate that KLF7 is important for the maintenance of a population of naive cells, rather than only for their induction.

      We will add this important point to the discussion.

      1. Figure 6e: The authors showed transcriptome analysis of KiPS KLF7 cells compared to KiPS16 EMPTY cells in standard culture conditions and found that trophoblast markers were not significantly changed. How is the gene expression during primed to naive transition or TSC differentiation?

      We have already investigated this aspect, showing that at day 12 during primed to naive transition there is a strong induction of TSC markers, which is ablated by KLF7 expression (Fig. 5d). Quantitative immunostaining for GATA3 (TSC marker) confirmed this lack of activation in the presence of KLF7 (Fig. 6c).

      Minor comments:

      1. KLF7 is expressed in both primed and naive PSCs and when overexpressed in conventional PSCs, it enhances chemical resetting to naive PSCs. During primed to naïve transition, how does the KLF7 gene expression pattern change?

      This is a good suggestion, we will analyse the expression patter of KLF7 during resetting.

      1. Line 52: The reference should be added.

      Thank you, we will add the reference.

      1. Line 210-212: The reference should be added.

      Thank you, we will add the reference.

      Reviewer #2 (Significance (Required)): The people who focus on studying pluripotency and early embryo development might be interested in and influenced by the findings. The data are in general convincing. However, there are some issues that need to be resolved and improved.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)): Summary: In this manuscript, the authors found that KLF7 is generally expressed in both prime and naïve human pluripotent stem cells. They showed that KLF7 could replace KLF4 to induce human iPS cells in the microfluidic reprogramming system. The authors then found that overexpression of KLF7 in human prime iPSCs can facilitate the generation of naïve iPS cells. They also showed that KLF7 is a repressor of trophoblast markers. Collectively, these findings indicated that KLF7 is a general pluripotency inducer for human iPS and naïve iPS induction.

      Major comments:

      1. In Figure 2, as the reprogramming efficiency of OSK7M is much lower than that of OSKM, the authors should provide an OSM control to show whether the cells can be reprogrammed without KLF4 and KLF7.

      We have performed the requested experiment (reprogramming with OSM only) as part of a manuscript in preparation. We observed an efficiency of reprogramming significantly lower than OSK7M, yet primary iPS colonies could be obtained.

      We believe that this is due to the expression of KLF4 and KLF7 in human fibroblasts, as shown in Figure 4a.

      1. It will be more convincing to perform a teratoma assay of OSK7M-iPSCs to demonstrate their multilineage differentiation potential.

      In vivo assays like teratoma formation cannot be performed in Italy due to official regulations on animal testing.

      However, we could extend such cultures for 5-10 more passages (i.e. a total of 2 months from iPSC generation) and perform staining for pluripotency markers or molecular analyses (by qPCR) and EBs differentiation assay to assess their multilineage differentiation potential.

      1. Since KLF7 is also expressed in primed human iPS cells, the authors should show the expression level of KLF7 in the established KLF7-iPSC and EMPTY-iPS.

      Good suggestion, we will add it to Figure 3b.

      Minor comments:

      The author claimed that KLF7 is a direct repressor of trophoblast markers, but the data in the manuscript cannot support this conclusion. The author can only claim that KLF7 can inhibit the expression of trophoblast markers.

      We agree with the reviewer, and we believe that there was a misunderstanding. On pages 8-9 line 182-190 we also concluded that KLF7 regulates naive pluripotency markers, rather than trophoblast markers. We will rephrase the text to make it clearer.

      Reviewer #3 (Significance (Required)): KLF family proteins such as KLF4 and KLF17 have been identified as pluripotent inducers. In this study, the authors demonstrated that KLF7 is a novel pluripotent inducer of human IPS and naïve iPS cells, providing new insights into the functions of KLF family proteins in human pluripotency induction.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, the authors found that KLF7 is generally expressed in both prime and naïve human pluripotent stem cells. They showed that KLF7 could replace KLF4 to induce human iPS cells in the microfluidic reprogramming system. The authors then found that overexpression of KLF7 in human prime iPSCs can facilitate the generation of naïve iPS cells. They also showed that KLF7 is a repressor of trophoblast markers. Collectively, these findings indicated that KLF7 is a general pluripotency inducer for human iPS and naïve iPS induction.

      Major comments:

      1. In Figure 2, as the reprogramming efficiency of OSK7M is much lower than that of OSKM, the authors should provide an OSM control to show whether the cells can be reprogrammed without KLF4 and KLF7.
      2. It will be more convincing to perform a teratoma assay of OSK7M-iPSCs to demonstrate their multilineage differentiation potential.
      3. Since KLF7 is also expressed in primed human iPS cells, the authors should show the expression level of KLF7 in the established KLF7-iPSC and EMPTY-iPS.

      Minor comments:

      The author claimed that KLF7 is a direct repressor of trophoblast markers, but the data in the manuscript cannot support this conclusion. The author can only claim that KLF7 can inhibit the expression of trophoblast markers.

      Significance

      KLF family proteins such as KLF4 and KLF17 have been identified as pluripotent inducers. In this study, the authors demonstrated that KLF7 is a novel pluripotent inducer of human IPS and naïve iPS cells, providing new insights into the functions of KLF family proteins in human pluripotency induction.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The naïve pluripotency is established in the inner cell mass (ICM) of blastocysts. After implantation, the naïve epiblast becomes primed for lineage specification. Pluripotent stem cells (PSCs) have been successfully derived from early embryos at different stages. In mice, stem cell derivations from ICM yield naïve ESCs. Primed PSCs derived from E5.5-7.5 epiblast are epiblast stem cells (EpiSCs). In humans, stem cell derivations from human embryos have yielded PSCs with features distinct from mouse ESCs and more like EpiSCs. Recently, naïve human PSCs have been directly isolated from pre-implantation epiblast or transformed from primed PSCs. Derivation of naïve hPSCs contributes to studying the molecular events of early lineage specification and accelerates the development of the generation of humanized organs in animal models from naïve hPSCs, opening an exciting avenue for regenerative medicine.

      In this manuscript, the authors found that OSK7M could enable the reprogramming of human primary somatic cells. KLF7 is highly expressed in naive PSCs and its forced expression in conventional hPSCs induces upregulation of naive markers and boosts the efficiency of chemical resetting to naive PSCs, suggesting that KLF7 is a general human pluripotency factor and an inducer of pluripotency. The new findings extend KLF7 function in naïve PSC generation and also provide references for the efficient generation of naive PSCs. The people who focus on studying pluripotency and early embryo development might be interested in and influenced by the findings. The data are in general convincing. However, there are some issues that need to be resolved and improved.

      Major comments:

      1. Line 90: The authors showed that colonies derived from OSKM and OSK7M cocktails could be readily propagated for at least 10 passages. How many passages can OSK7M-iPSCs maintain in vitro prolonged culture? And how about the pluripotency and developmental potential of OSK7M-iPSCs for a long-time culture? For example, pluripotency gene expression and teratoma formation.
      2. Overexpression of KLF7 promotes the derivation of naïve PSCs. Are they different from naïve PSCs derived only by chemical resetting? For example, the pluripotency, the in vitro or in vivo developmental potential, and the efficiency of human blastoid generation. As the manuscript mentioned, KLF7 is a general human pluripotency factor and an inducer of pluripotency. How does KLF7 knock-out affect the biological characteristics of hESCs? And whether KLF17 KO affects the derivation of naïve PSCs?
      3. Can naïve PSCs be directly reprogrammed from somatic cells with OSK7M under the PXGL medium? If so, how is the efficiency?
      4. Figure 6d: The data showed that in PXGL medium, KiPS (EMPTY) contained about 66% of KLF17+ cells on day 7 and declined to 30% of KLF17+ cells on day 12. Why do KLF17+ cells (naïve PSCs) decline in PXGL medium? Cells overexpressing KLF7 contained about 62% of KLF17+ cells on day 7 and increased to 89% of KLF17+ cells on day 12. Whether KLF7 function at this stage?
      5. Figure 6e: The authors showed transcriptome analysis of KiPS KLF7 cells compared to KiPS16 EMPTY cells in standard culture conditions and found that trophoblast markers were not significantly changed. How is the gene expression during primed to naive transition or TSC differentiation?

      Minor comments:

      1. KLF7 is expressed in both primed and naive PSCs and when overexpressed in conventional PSCs, it enhances chemical resetting to naive PSCs. During primed to naïve transition, how does the KLF7 gene expression pattern change?
      2. Line 52: The reference should be added.
      3. Line 210-212: The reference should be added.

      Significance

      The people who focus on studying pluripotency and early embryo development might be interested in and influenced by the findings.

      The data are in general convincing. However, there are some issues that need to be resolved and improved.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors report that the human-specific KLF factor KLF7 can induce pluripotency in humans and can improve the reset toward naïve pluripotency when cells are cultured the PXGL medium. KLF7 falls behind KLF4 in reprogramming efficiency but might have a unique role in naïve reset (10-20 fold less efficient in iPSC colony yield). The topic of the study is interesting and adds important insights into the roles of KLF factors along the pluripotency continuum and pinpoints differences between mice and human. There are implications for stem cell engineering and boosting the developmental potency of stem cells (blastoid formation potential, interspecies chimera formation). However, some of the claims as to the unique role of KLF7 are unconvincing in the absence of comparison with other KLF factors, especially the Yamanaka factor KLF4. The flow and coherence of the text can be improved - at times reasoning and motivation of experiments are hard to follow

      Major comments

      • Why would a pan-pluripotency factor KLF7 which is expressed in both primed and naïve cells more potently trigger the naïve reset than the naïve specific factors KLF4/5/17? Such a comparison could widen the scope and interest of their work.. I would find it interesting if authors would compare the ability of key KLF factors to induce naivety. This is of particular interest as the overexpression of engineered Sox along with KLF4 was reported to improve the quality and developmental potential of PSC in multiple species (MacCarthy et al bioarxiv). Such an analysis could reveal unique features of KLF family members and lead to advanced stem cell models. They actually claim the SK naïve reset does not require naïve medium but the expression of SK alone is sufficient to induce this state. What do the authors think about this claim? Overall I feel the potential role of KLF7 in naïve reset is interesting but underdeveloped.

      Minor comments

      • P3, line 52: "Surprisingly, however, KLF4 is also routinely used to generate conventional human iPSCs." Why is this surprising? KLF4 (and SOX2) are the most potent iPSC factors whilst MYC and OCT4 can be omitted (at least in mouse).
      • It would be nice if the demonstration of pluripotency and quality of KLF7 iPSC go beyond transcriptome profiling and included some further assays common in the field.
      • Fig 1A-B: color coding (of dots) is very confusing- which ones are PSCs and which ones are iPSCs? Another colour palette might fix What is meant by "interrogating previously published data" (line 67)? Are these public RNA-seq data that were re-analyzed? I
      • Fig 2b: how were the colony numbers obtained? By morphology, or using live cell staining? So form of staining is recommended colony counting (i.e. TRA-1-60).
      • Fig 2e: Also, they say that "[t]hree technical replicates were carried out for all quantitative PCR". Unless I'm mistaken, it seems that only two technical replicates were performed for these qPCR reactions (two dots visible per bar).
      • Fig 3c: "colture"; change to "culture" (and the title: "bone fide" should be "bona fide")
      • For Fig 2/3: since the paper is on KLF4/7, I'm surprised that expression levels of OCT4 and SOX2 were analysed but not KLF4. Given that the main finding was that KLF4 was not upregulated in PSCs, I would be interested to see what the KLF4 levels are like in the iPSCs. RNA-seq analysis/qPCR would be best; but if the authors would like to use other methods, that's fine too.
      • Fig 4: The explanatory text is too sparse. Readers should be reminded of the differences between of naïve and primed PSCs and the known roles of KLF4 (this could also be improved in the introduction). List names of naïve media used on top of author names (5iLA, PXGL, EPSCM etc). Why was HENSM by Hanna excluded?
      • Fig 5: KLF7 is classified as a general pluripotency marker, but KLF4/KLF17 are classified as naïve markers. In that case, wouldn't it make more sense to overexpress a naïve specific marker in order to achieve naïve iPSCs at least as a control? What was the motivation here? I think the authors need to provide a more compelling reasoning why only KLF7 was studied or add more data for other KLFs (especially since it seems that the reprogramming efficiency of KLF4 is higher than that of KLF7 for conventional reprogramming (see Fig 2B)...)
        • Fig 5B: the text currently says that the cells on the left side of Fig 5B are from Day7; but it says the cells are from Day0 in the actual figure. Which one is it? Also, based on how the text is written, do the cells on the left also contain EOS, or are they the wild-type variety?
        • Fig 5c: not all markers in this figure are naïve markers (as stated in the text); would suggest separating the markers and labelling them accordingly AND rewriting the text to reflect that.
        • Life cell reporters for naivety (CD75,SUSD2) could enrich this study.
      • Schemes in 5A/6A could indicate when transgenes were added
      • Fig 7: the claim regard mouse pluripotency is a little outside of the scope of this paper; would recommend de-emphasizing the claim .
      • Could authors comment on the molecular features and whether there might be any non-redundant biochemical of KLF7 compared to other stemness-related KLFs? Looking at the conservation of the amino acids mediating base readout (-1,2,3,6) I expect specificity for DNA to be identical between KLF7 and KLF4 i.e. Figure S1A as reference for the C2H2numbering convention: https://www.cell.com/cms/10.1016/j.stemcr.2018.07.002/attachment/51171b7f-e644-4b0e-93c9-837632fd5d10/mmc1.pdf
      • Similarly, are there features outside the DBD that might suggest a unique activity (IDR, TAD,PTM)? It seems KLF7 generates iPSCs much less efficiently than KLF4. Given the high similarity between their DBDs I wonder why this is so.

      Significance

      • General assessment: The strength of the study is that the authors provide a potentially new way for the naïve reset in humans. This could improve human stem cell and embryo models. A limitation is that evidence is solely based on molecular (not functional) profiling and the uniqueness of KLF7 versus other KLF's (first and foremost KLF4) was not established.
      • Advance: Findings on the human-specific role of KLF7 are novel and interesting especially the ability to facilitate the naïve reset. Yet, in the absence of a more systematic comparison with other methods (and KLF factors), the claim that KLF7 is essential for this feat is unconvincing.
      • Audience: It's of interest to basic researchers in the broader stem cell community and those interested in early embryo development.

      I work on cellular reprogramming, sequence-structure-function analysis of reprogramming factors and pluripotency.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, it is shown that cofilin severs actin filaments slowly when fascin is present. Authors show that this is due to slower cluster nucleation of cofilin on fascin-induced actin bundles. Interestingly, the authors show that cofilin binding promotes helicity in actin filament bundles which in turn promotes fascin exclusion and more cofilin clustering in adjacent filament bundles; thus, inducing local transmission of structural changes.

      The authors use an elegant approach, and the data is nicely presented. Overall, I

      consider that this manuscript is in good shape to be published. It might benefit from language editing, though.

      We thank the reviewer for their positive comments. We have edited the manuscript to improve its readability (changes are in blue in the manuscript).

      Reviewer #1 (Significance (Required)):

      According to me the significance of this manuscript is that elegantly shows the molecular details of the cofilin severing effect of fascin-induced actin filament bundles. The authors show that cofilin binding promotes helicity in actin filament bundles which in turn promotes fascin exclusion and more cofilin clustering in adjacent filament bundles; thus, inducing local transmission of structural changes.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this study, Chikireddy et al. perform a series of experiments in which they compare the efficiency of cofilin-mediated severing and actin filament disassembly on individual filaments versus bundles of different sizes from by the actin-bundling protein fascin. The key outcome, quite distinct from previously published conclusions by the authors themselves and other authors, is that fascin bundling actually reduces cofilin-mediated severing mostly because of much slower "nucleation" of cofilin clusters on fascin-bound filament bundles. Cofilin cluster formation is followed by local fascin removal, and the nucleation of a cofilin cluster on an adjacent bundle in the absence of fascin is strongly enhanced. The reason for the latter surprising observation is not entirely clear, but proposed to arise from cofilin-mediated changes in filament helicity of neighboring filaments. To my understanding, the main reason why fascin protects from cofilin severing here rather than enhancing it (as reported previously) is due to the lack of constraining of the induced, cofilin-mediated twist, because if this twist is constrained e.g. by anchoring of the bundles to the surface chamber, then severing by cofilin is accelerated.

      We thank the reviewer for their positive feedback on the manuscript. We have substantially edited the manuscript in light of the insightful comments of the reviewer (changes are in blue).

      Major comments:

      I think the study is very well done, most experiments are super-elegant and controlled; I really don't have any objections against the conclusions drawn, as most of what I have seen is totally justified and reasonable. So from a scientific point of view, I can easily agree with all the major conclusions drawn, and so in my view, this should be published fast.

      Minor comments:

      There are two minor points that could be addressed:

      1) I am not entirely convinced by the conclusions drawn from the EM images shown in Figure 6A, and in particular by the filaments in two-filament bundles locally twisting around each other (without breaking) at spatial sites lacking fascin and decorated by cofilin. This is hard to imagine for me, and the evidence for something like this happening is not very strong, as in the EM, only larger bundles could be observed. In addition, I am not sure that the braiding of filaments seen in the presence of cofilin is really occurring just locally on cofilin-decorated bundle segments and thus indeed coincides with loss of fascin as proposed in the scheme in Fig. 6B.

      Can the authors exclude that the braiding is not caused by some experimental artefact, as induced perhaps by sample preparation for negative staining?

      We thank the reviewer for raising this point. We have repeated the negative staining EM experiments several times and now show new images and quantification (new Supp Fig. 13). In our new series of experiments, the braiding that was previously shown in Fig. 6 proved difficult to reproduce and to quantify. We therefore decided to remove EM observations from the main Fig. 6, and we no longer present them as evidence supporting the mechanism that we propose for inter-filament cooperativity.

      From EM images, we now quantify the frequency of fragmentation of large actin filament bundles. We observed that bundles often terminate with the ends of their filaments in close proximity, consistent with sharp breaks due to co-localized cofilin clusters.

      We have rewritten this part of the result section in the manuscript which now reads : ‘To further investigate larger bundles, we imaged them using negative staining electron microscopy. In the absence of cofilin, filaments in bundles are arranged in a parallel manner, as previously reported in vitro (Jansen et al, 2011). Compared with the control, filament bundles exposed to cofilin show numerous sharp breaks (65 breaks per 122 µm of bundles, versus 4 breaks per 68 µm in the control. Supp. Fig. 13). This is consistent with bundle fragmentation occurring at boundaries of co-localized cofilin clusters.’

      Did the authors quantify the occurrence of such braided bundle segments with and without cofilin?

      How large are these braided segments on average when you quantify them? Would you also see them if you prepared the bundles for an alternative EM-technique, such as Cryo-EM, for instance?

      As mentioned in the answer to the previous point, the braided segments proved difficult to reproduce and quantify, and we have removed EM experiments from the main figure 6. Instead of the braided segments, we now quantify the severing of the bundles, and the distribution of filament ends at the extremities of the bundles (new Supp. Fig. 13).

      We have not tried Cryo-EM due to limited access to such experimental tools within the timeframe of the study.

      This may admittedly all be experimentally challenging, but would it be possible to combine the negative staining of filaments with staining for cofilin and/or fascin using immunogold technology, to prove that the braided segments do indeed correlate with high cofilin and low fascin concentrations? In the absence of such data, and in particular in the absence of a clear quantification, the proposal is too strong in my view. Finally, it would be nice (albeit not essential I guess) to also look at two-filament bundles. The authors stated these can not be easily generated due to the tendency of fascin to promote the formation of larger bundles, but can this not be titrated/tuned somehow by lowering fascin concentrations, to come closer in reality to what is proposed to occur in the scheme in Figure 6B? In any case, the way the data are presented right now appears to constitute a pretty large gap between experimental evidence and theoretical model.

      We agree with the reviewer that EM observations are limited and, alone, do not provide strong evidence in favor of braiding/super-twisting being the mechanism responsible for inter-filament cooperativity (please see our answers to the points above). We have performed negative staining EM assays at higher cofilin-1 concentration (500 nM) compared to microfluidics assays, in order for cofilin to quickly bind to filaments, even in large bundles, so that our chances to capture bundles targeted by cofilin would be high.

      Nevertheless, both microfluidics and EM observations point in the same direction : bundle fragmentation by cofilin is caused by the co-localized cooperative nucleation of cofilin clusters.

      2) I think that the proposal of cofilin-decorated filaments to "transfer" the resulting cofilin-induced changes in filament helicity onto neighboring filaments in the bundle, which is proposed to occur locally and in the absence of fascin is a bit vague, and difficult to understand mechanistically. Can the authors speculate, at least, how they think this would occur? Are there no alternative possibilities for explaining obtained results? Maybe I am missing something here, but with considering cofilin to be monomeric and only harboring one actin-binding site, this proposal of helicity transfer onto neighboring filaments seems inconclusive.

      On single actin filaments, the change of helicity induced by cofilin binding has been observed by many groups using EM and cryoEM (e.g. McGough et al, JBC 1997 10.1083/jcb.138.4.771; Egelman et al, PNAS 2011 10.1073/pnas.1110109108 ; Huehn et al, JBC 2018 10.1074/jbc.AC118.001843). These studies have revealed that actin subunits get ‘tilted’ relative to their original orientation along the filament long axis. This leads to the shortening of the helical pitch for cofilin-saturated actin filament segments.

      In our assays, the progressive binding of cofilin along a single filament creates a cluster where all actin subunits are tilted and the helical pitch of the filaments within the cluster is shortened (from a half pitch of 36 nm down to 27 nm). This change of helicity in a cluster induces the rotation of one end of the filament relative to the other (as we have shown previously in Wioland et al, PNAS 2019). Therefore, if two parallel filaments are stapled together, the local twisting of one filament causes the twisting of the other in the overlapping region.

      We have rephrased this point to more clearly explain this in the last paragraph of the results section:

      “From our kinetic analysis, we propose the following model that recapitulates the binding of cofilin to fascin-induced 2-filament bundles (Fig. 6D). Initially, actin filaments in fascin-induced bundles are in conformations that are less favorable for cofilin binding than isolated actin filaments. Once a cofilin cluster has nucleated, its expansion locally triggers fascin unbinding and prevents it from rebinding. The increase of filament helicity induced by cofilin causes a local twisting of the entire bundle, thereby changing the helicity of the adjacent filament in the fascin-free region facing the cofilin cluster. In this region, the increase in filament helicity enhances cofilin affinity, and thus locally promotes the nucleation of a cofilin cluster (inter-filament cooperativity).”

      We have tried to think of other alternative scenarios that might explain our observations, but none appeared to be valid.

      Reviewer #2 (Significance (Required)):

      General assessment:

      The strength of this study is that owing, at least in part, to the microfluidics devices employed and the careful biochemistry, the experimental setups are super-controlled and clean, and they are used in a highly innovative and elegant fashion. The simulations are also nice! A limitation is that it is not entirely clear how precisely the main observations can be translated to what's happening in vivo. The results are largely dependent on the bundles not being constrained I understand, so to what extent would bundles be unconstrained in vivo? Perhaps this is not so important, because the experimental setup allows the authors to dissect specific biochemical behaviors and inter-dependencies between distinct actin binding proteins, but the latter view (if correct) could be stated more clearly!

      We thank the reviewer for their remarks. We have updated the part where we discuss the biological implications of our in vitro observations to better explain how the twist-constraints expected for fascin bundles in cells would accelerate cofilin bundle disassembly.

      Advance:

      As stated above, the results are opposite to the proposed synergistic activities of fascin and cofilin observed for bundles previously, perhaps because they were not constrained. So although touched in part and in a very polite fashion in the discussion, the authors could specify more clearly what the differences between the studies are, and which of the distinct activities observed either here or in previous literature will be dominant or more relevant to consider in the future? This will be hard to discern as is now, in particular for non-experts.

      We agree with the reviewer that the manuscript will benefit from discussing more in depth the plausible reasons why our experimental observations are in disagreement with the earlier interpretation by Breitsprecher and colleagues. We have extended our discussion on this point, which now reads: “Previously, using pyrene-actin bulk experiments, Breitsprecher and colleagues observed a diminished cofilin binding to fascin-induced filament bundles (Breitsprecher et al, 2011). In spite of this, their observation of fluorescently labeled actin filament bundles seemed to indicate an efficient severing activity. Since cofilin was not fluorescently labeled, they could not observe cofilin clusters, and they proposed that severing was enhanced because fascin served as anchors along filaments and impeded cofilin-induced changes in filament helicity”

      Audience:

      This manuscript will be most influential for a specialized audience interested in the complexities of biochemical activities of specific actin binding proteins when looking at them in combination. Although specialized, this is still a quite relevant audience though, since prominent actin binding proteins like cofilin are highly important in virtually any cell type and various actin structures, hence of broad relevance again in this respect.

      Expertise:

      I am a cell biologist and geneticist interested in actin dynamics and actin-based, motile processes.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      My only major concern that is that although the authors provide data that strongly supports interfilament cooperativity in two filament bundles for cofilin binding, the evidence to support that this induces filament twist on the opposing filament is not strong enough to conclusively establish this as the mechanism for the observed interfilament cooperativity. This is stated as such in the results section as a proposed model, but stated with more certainty than the presented data supports in the discussion. It might be better, based on the data presented, to state this as one possible mechanism for the observed cooperativity.

      We thank the reviewer for their remark. We have edited our discussion section to clearly say that inter-filament cooperativity arises from cofilin-induced filament twisting is a proposed model that would best account for what we observed: “Indeed, we report here the exclusion of fascin from within cofilin clusters, and a strong increase in the nucleation of cofilin clusters on adjacent filaments. This inter-filament cooperativity mechanism leads to the co-localized nucleation of cofilin clusters, and permits bundle fragmentation faster than if the nucleation of cofilin clusters on adjacent filaments were purely random. To our knowledge, this is the first time such inter-filament cooperativity is ever reported. To explain this mechanism, we propose that the cofilin-induced change of helicity produced locally on one filament can be transmitted to the adjacent filaments within the bundle (Fig. 6D).”

      So far, we have been unable to propose alternative mechanisms that could explain our observations in light of what is known for cofilin at the single filament level (a similar point was raised by reviewer #2, please see above).

      Areas within the paper, if addressed, will improve the arguments presented as well as the readability of the paper.

      (1) The authors use both the terms cofilin binding (in section I of the results) as well as cofilin nucleation (in section III of the results). It is unclear if these terms are meant to indicate the same, or different, processes. The manuscript would benefit from a clear explanation of the steps of cofilin-mediated disassembly measured and quantified in the experiments, namely nucleation (or binding), cluster growth, and filament or bundle fragmentation. A clear description of these steps would also allow the reader to follow the logic of the experiments from Figure 3 to Figure 5.

      We have edited the introduction to better describe the different steps of cofilin activity, and to remove any ambiguity whereas we are referring to cofilin binding or cofilin nucleation.

      2) Throughout the paper, the authors move from single filaments, to 2-filament bundles, to multifilament bundles, using different concentrations of fascin and cofilin. Given the biphasic behavior of cofilin, namely that low concentrations favor severing and high concentrations can favor coating and filament stabilization, I think it is important that concentrations for the components are consistent across experiments, and if changes of concentrations of important components (such as cofilin and fascin) are changed, a clear explanation as to why is included.

      As explained in the beginning of the result section, most of our experiments and quantification of cofilin activity using the microfluidics assay were done using 200 nM fascin and 200 nM cofilin as a standard. This is the case, in particular, for all the data shown in Fig 2, 3 and 4, where we compare the behavior of single filaments, 2-filament bundles, and larger bundles, exposed to the same protein concentrations.

      We have also explored higher fascin and cofilin concentrations to document their respective impact, always mentioning any change in concentration. We agree with the reviewer that cofilin activity is biphasic at the single filament level (in the range of 0 to 1 µM for mammalian ADF/cofilin, at physiological pH 7.4). In the case of fascin-induced bundles (already for two-filament bundles), filament saturation by cofilin, and thus their stabilization, will occur at higher cofilin concentration. This is mainly due to the lower nucleation activity of cofilin on fascin-induced bundles, preventing the nucleation of numerous cofilin clusters that will eventually fuse together, thus preventing saturation of filament bundles by cofilin before bundle fragmentation.

      (3) In Figure 2, it is mentioned that for the spectrin seeds with the microfluidics, the filaments consisting of larger bundles were not analyzed along with the single filament and 2-filament bundles. Instead, a different experiment with seeds attached to beads is used to assess larger filament bundles. Why were larger bundles not analyzed in the microfluidic experiment?

      We appreciate the insightful observation by the reviewer. When elongating actin filaments from spectrin-actin seeds, the seeds are randomly located on the glass coverslip of the microfluidics chamber. Upon exposure to fascin, only a subsection of any filament will be in contact with one or multiple filaments, ultimately forming a bundle due to the presence of fascin. In the case of high filament densities leading to large bundles, it is very difficult to identify the exact subsection of each filament which is engaged in a bundle or not. Despite our attempts to image individual filaments before and after exposure to fascin for enhanced clarity, the inherent difficulty persisted.

      This limitation hindered our ability to quantify cofilin activity on large bundles when using spectrin-actin seeds randomly distributed on glass. To address this, we opted for an alternative approach involving micron-sized beads coated with spectrin-actin seeds. This modification not only circumvents the aforementioned limitation but also aids in the formation of larger bundles (up to 10 filaments per bundle). This adjustment significantly enhances our ability to study and quantify cofilin activity on larger bundles, contributing to a more robust and comprehensive understanding of cofilin activity on bundles.

      And conversely, why were 2 filament bundles not assessed with the beads? Comparing the findings on two filament bundles with the findings on multifilament bundles would be easier for the reader if the small and large bundles were evaluated in the same experiments. If this is not experimentally feasible, the authors need to provide clearer explanation as to why this analysis is not included.

      Actually, we did assess 2-filament bundles in the bead assay. The cofilin activity on 2-filament bundles from beads are reported, along with larger bundles, in figure 3E-F for nucleation, and in figure 4C for cofilin cluster growth rates.

      (4) The authors indicate that at increased fascin concentration (1uM) that single filaments decrease the nucleation rate of cofilin clusters. The authors should comment on the mechanism for fascin (at 1uM concentration) for affecting cofilin binding.

      We thank the review for this comment. We now comment on this mechanism in the result section:

      “This observation is consistent with the low affinity of fascin for the side of single actin filaments. Furthermore, this indicates that cofilin and fascin may have overlapping binding sites, or that a more complex competition may exist between the two proteins, where the binding of one protein would induce conformational changes on neighboring actin subunits affecting the binding of the other protein.”

      (5) The authors should determine and include the dissociation rate for the labeled cofilin used in this study, especially given the proposed mechanism for cofilin excluding fascin within the bundles.

      • If the reviewer means that we need to characterize the behavior of the labeled cofilin: in Wioland et al 2017, we have previously reported that cofilin dissociates slowly from cluster boundaries (at 0.7 s-1 for cofilin-1 on alpha-skeletal rabbit actin, as used in the present study) and extremely slowly from inside a cofilin cluster (~2.10-5 s-1).

      • If the reviewer means that we should investigate the competition between fascin and cofilin along bundles: we agree that this is indeed an interesting question. However this is quite complex because many unknown parameters are involved. In addition to the on/off-rates of each protein and how it is affected by the presence or the proximity of the other protein, we need to consider that fascin has fewer binding sites than cofilin, and that their accessibility changes as the helicity of the filament evolves as cofilin binds. Investigating this question would require many experiments, which we would need to confront to a model. We believe that this is out of the scope of this manuscript.

      (6) For Figure 4, D and E, what do the dynamics of fascin and cofilin signal look like on a larger filament bundle? It would be informative to provide the cofilin cluster nucleation rate on larger filament bundles with a range of fascin concentrations (as in 3D for a two filament bundle).

      It would be interesting indeed to investigate the dynamics of fascin and cofilin on larger bundles. However, this experiment is quite challenging due to the fluorescence background of fluorescently-labeled fascin in our microfluidics assay (regardless of bundle size). We have been unable to perform this assay with success on large bundles. Moreover, it is difficult for us to carry out more of these experiments now that the first author of the study has left the lab.

      However, based on our results, we would expect that, for large bundles, increasing fascin concentration would also have a limited impact on the reduction of cofilin nucleation. Indeed, for 2-filament bundles, we can note that the increase of fascin concentration has a more limited impact on the nucleation of cofilin clusters (fig. 3D, roughly ~2 fold decrease for fascin from 100 to 500 nM), than the number of filaments per bundle (fig. 3F, a 10-fold decrease when increasing the size of a bundle from 2 to 10 filaments).

      (7) Additionally, it would be useful to report the cofilin severing rate at a range of cofilin concentrations, at least for the 2 filament bundles.

      Cofilin severing rate is not dependent on cofilin concentration in solution. This has been reported previously by several groups, including ours (e.g. Suarez et al, Current Biology 2011 ; Gressin et al, Current Biology 2015; Wioland et al, Current Biology 2017).

      Below is the comparison of cofilin cluster severing at 100 and 200 nM cofilin, on single actin filaments, which we added to supplementary figure 10.

      At 100 nM cofilin, we measured a similar cofilin cluster severing rate on 2-filament bundles, by measuring the survival fraction of overlapping cofilin clusters that lead to 2-filament bundle fragmentation over time. The figure pasted below is new Supp. Fig. 11.

      When the severing occurs in the two filament bundles, does this severing occur mostly at boundaries with cofilin-actin and bare actin or does this severing occur at cofilin-actin/fascin-actin boundaries?

      This is an interesting point. In the presence of a saturating amount of fascin, on 2-filament bundles, one fascin protein is bound every 13 actin subunits along each filament of a bundle. Most of the time, a cofilin boundary will not be in contact with a fasin-bound actin subunit. The limited spatial resolution of optical microscopy does not allow to say whether fascin was present at the boundary of a cofilin cluster or not when severing occurred. Nonetheless, we show that cofilin cluster severing is unaffected by fascin-bundling (i.e. severing rates per cofilin cluster boundary are similar on single filaments and on 2-filament bundles). Overall, bundling by fascin probably does not change the way cofilin severs, i.e. it occurs at the boundary between cofilin-decorated and bare actin regions.

      (8) For the images of large bundles appearing braided in figure 6A, the lower left panel the braided appearance is not obvious. Additionally, what is the number of filaments in the bundles shown? Finally, given that in Figure 3F it is indicated that cofilin cluster nucleation events are rare on large bundles, and the cluster growth rate is reduced on large bundles (Figure 4C), the authors need to indicate how frequently this braided appearance is observed as well as what the nucleation rate, growth rate and severing rate is for 500nM cofilin on bundles.

      We have repeated the negative staining EM experiments several times and now show new images and quantification (new Supp. Fig. 13). In our new series of experiments, the braiding that was previously shown in Fig. 6 proved difficult to reproduce and to quantify. We therefore decided to remove EM observations from the main fig 6, and we no longer present them as evidence supporting the mechanism that we propose for inter-filament cooperativity.

      As stated in point (7) above, the severing rate is independent of cofilin concentration. We’ve used 500 nM cofilin, which is a rather high cofilin concentration, to investigate bundle fragmentation in EM, as in solution we mostly form large bundles and they are more slowly targeted by cofilin than individual or 2-filament bundles (figure 3F & 4C). At the single filament and 2-filament bundle level, the nucleation of cofilin clusters is extremely fast at 500 nM cofilin (> 10-4 s-1 per binding site).

      (9) The authors indicate that the rapid fragmentation of twist constrained 2-filament bundles prevented them from directly quantifying the nucleation rate of the subsequent cofilin clusters that overlapped the initial ones. I'm unclear why this is the case, and if this is the case, I don't understand how the authors can be sure that a second nucleation event occurred in the twist constrained bundles. From the experimental data in 7C, it appears that the fragmentation rate for two filament bundles is similar to the fragmentation rate for twist constrained single filaments. The authors need to clearly state what they were able to observe and quantify as well as include the timing for this severing. If the authors could not observe a second nucleation event prior to severing, this should be clearly stated.

      Fragmentation of a 2-filament bundle requires the severing of two co-localized cofilin clusters, one on each filament. When 2-filament bundles are twist-constrained the sequence of events leading to bundle fragmentation is fast, thus it is difficult to separate the events within the resolution of our experiment. In this case, cofilin clusters sever quickly, thus the size of the clusters is small, which translates into a low fluorescence intensity. Therefore, the quantification of the increase of cofilin fluorescence intensity along a bundle did not allow us to unambiguously identify the ‘cooperative’ nucleation of two-overlapping cofilin clusters before the bundle is fragmented. So, apart from the quantification of the nucleation of cofilin clusters, which we show is unaffected by twist-constraining the bundles, we were unable to measure the growth rate nor the severing rate of cofilin clusters.

      Numerical simulations, using similar severing rates for cofilin clusters on both twist-constrained single filaments and 2-filament bundles, satisfactorily reproduce our experimental observations (dashed lines in Fig. 3C).

      We have edited the ‘Twist-constrained bundle fragmentation’ section to clearly say what we measured and what could not be measured : “We observed that the nucleation rate of cofilin clusters was similar for both twist-constrained and twist-unconstrained fascin bundles (Supp. Fig. 15), in agreement with observations on single actin filaments (Wioland et al, 2019b).

      The rapid fragmentation of twist-constrained 2-filament bundles prevented us from directly quantifying the nucleation rate of the subsequent cofilin clusters that overlapped with the initial ones, as well as cluster growth and severing rates.”

      This could be due to the rapid fragmentation, but it could also be due to severing occurring in the absence of a second cofilin nucleation event. It would be informative to compare the time from cofilin nucleation to severing event for two filament bundles in twist constrained and unconstrained. Clarification of the dynamics of nucleation and spreading of cofilin and the timing of fragmentation of the twist constrained filament bundles is needed.

      As explained in the previous point, cofilin-induced severing occurs significantly faster on twist-constrained single actin filaments compared to unconstrained filaments.

      For twist-unconstrained filament bundles, we never observed bundle fragmentation that originated from only one cofilin cluster. For twist-constrained bundles, while our observation is limited by the rapid fragmentation of the bundles, it is hard to imagine that a single cofilin cluster on one filament would induce the fragmentation of the neighboring filament. Recently, Bibeau et al, PNAS 2023, using magnetic tweezers to twist single actin filaments, showed that, without cofilin, applying up to 1 rotation/µm to an actin filament does not cause its fragmentation. It is thus reasonable to say that cofilin binding is required to fragment twist-constrained filaments.

      Moreover, in our numerical simulations (without inter-filament cooperativity, faithfully reproducing the kinetic of 2-filament fragmentation observed in microfluidics), 75% of bundle fragmentation resulted from a sequential nucleation of cofilin clusters, with the nucleation of the second cofilin cluster occurring after the first cofilin cluster has already severed one filament of the bundle.

      (10) Discussion of how twist constrained fragmentation dynamics might affect the dynamics of larger bundles in structures such as filopodia would be useful.

      We had substantially edited the discussion section of the manuscript, attempting to better discuss the physiological implications of our in vitro observations (bundle size & twist-constraints).

      Minor changes that would improve the paper:

      (11) In Figure 1C, Figure 2B and Figure 2E, the indication, on the graph, of the fold-change between the rates is confusing as it is not clear from the labeling on the graph that the x15 is referring to the slope of the lines, keeping this information in the legend is appropriate, but if it is to be included on the graph, perhaps adding in the linear fit on the graph is also needed.

      We have edited the figures accordingly, and included fit lines in figure 1.

      (12) Figure 7A, lining up the diagram with the kymographs below would help improve interpretation of the diagram and simulation. Alternatively, if the diagram (upper) in A does not diagram the kymographs below, this needs to be clearly stated, and it would be preferable that the diagram above matches the kymographs below.

      We have edited the figure layout accordingly.

      (13) Despite referencing the Breitsprecher, 2011 paper in the introduction, the authors do not explain how their results showing that cofilin fragments filament bundles slower than single actin filaments correspond with the Breitsprecher findings that fascin bundles favors cofilin filament severing. While the authors do not need to explain the Breitsprecher data, if they reference these findings that run counter to their results, an explanation for the discrepancy would be reasonable to include in the discussion.

      We agree with the reviewer comments, which was also a comment made by reviewer #2. We now more directly discuss possible discrepancies between Breitsprecher and our studies : “Previously, using pyrene-actin bulk experiments, Breitsprecher and colleagues reported a diminished cofilin binding to fascin-induced filament bundles (Breitsprecher et al, 2011). In spite of this, their observation of fluorescently labeled actin filament bundles seemed to indicate an efficient severing activity. Since cofilin was not fluorescently labeled, they could not observe cofilin clusters, and they proposed that severing was enhanced because fascin served as anchors along filaments and impeded cofilin-induced changes in filament helicity. This proposed mechanism bears resemblance to our previously reported findings for artificially twist-constrained single actin filaments (Wioland et al, 2019b). Here, we show that this mechanism does not occur in fascin-induced bundles.”

      Reviewer #3 (Significance (Required)):

      The research presented in "Fascin-induced bundling protects actin filament from disassembly by cofilin" is relevant and of interest to the field as it directly addresses a limitation in our understanding of how cofilin-induced severing occurs in F-actin bundles. Bundled F-actin may constitute the majority of linear F-actin within the cell and is specifically important in F-actin-based structures such as filopodia and stress-fibers. The data supports a model for interfilament cooperativity that provides a molecular mechanism for cofilin-mediated severing of fascin-bundled filaments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      My only major concern that is that although the authors provide data that strongly supports interfilament cooperativity in two filament bundles for cofilin binding, the evidence to support that this induces filament twist on the opposing filament is not strong enough to conclusively establish this as the mechanism for the observed interfilament cooperativity. This is stated as such in the results section as a proposed model, but stated with more certainty than the presented data supports in the discussion. It might be better, based on the data presented, to state this as one possible mechanism for the observed cooperativity.

      Areas within the paper, if addressed, will improve the arguments presented as well as the readability of the paper.

      1. The authors use both the terms cofilin binding (in section I of the results) as well as cofilin nucleation (in section III of the results). It is unclear if these terms are meant to indicate the same, or different, processes. The manuscript would benefit from a clear explanation of the steps of cofilin-mediated disassembly measured and quantified in the experiments, namely nucleation (or binding), cluster growth, and filament or bundle fragmentation. A clear description of these steps would also allow the reader to follow the logic of the experiments from Figure 3 to Figure 5.
      2. Throughout the paper, the authors move from single filaments, to 2-filament bundles, to multifilament bundles, using different concentrations of fascin and cofilin. Given the biphasic behavior of cofilin, namely that low concentrations favor severing and high concentrations can favor coating and filament stabilization, I think it is important that concentrations for the components are consistent across experiments, and if changes of concentrations of important components (such as cofilin and fascin) are changed, a clear explanation as to why is included.
      3. In Figure 2, it is mentioned that for the spectrin seeds with the microfluidics, the filaments consisting of larger bundles were not analyzed along with the single filament and 2-filament bundles. Instead, a different experiment with seeds attached to beads is used to assess larger filament bundles. Why were larger bundles not analyzed in the microfluidic experiment? And conversely, why were 2 filament bundles not assessed with the beads? Comparing the findings on two filament bundles with the findings on multifilament bundles would be easier for the reader if the small and large bundles were evaluated in the same experiments. If this is not experimentally feasible, the authors need to provide clearer explanation as to why this analysis is not included.
      4. The authors indicate that at increased fascin concentration (1uM) that single filaments decrease the nucleation rate of cofilin clusters. The authors should comment on the mechanism for fascin (at 1uM concentration) for affecting cofilin binding.
      5. The authors should determine and include the dissociation rate for the labeled cofilin used in this study, especially given the proposed mechanism for cofilin excluding fascin within the bundles.
      6. For Figure 4, D and E, what do the dynamics of fascin and cofilin signal look like on a larger filament bundle? It would be informative to provide the cofilin cluster nucleation rate on larger filament bundles with a range of fascin concentrations (as in 3D for a two filament bundle).
      7. Additionally, it would be useful to report the cofilin severing rate at a range of cofilin concentrations, at least for the 2 filament bundles. When the severing occurs in the two filament bundles, does this severing occur mostly at boundaries with cofilin-actin and bare actin or does this severing occur at cofilin-actin/fascin-actin boundaries?
      8. For the images of large bundles appearing braided in figure 6A, the lower left panel the braided appearance is not obvious. Additionally, what is the number of filaments in the bundles shown? Finally, given that in Figure 3F it is indicated that cofilin cluster nucleation events are rare on large bundles, and the cluster growth rate is reduced on large bundles (Figure 4C), the authors need to indicate how frequently this braided appearance is observed as well as what the nucleation rate, growth rate and severing rate is for 500nM cofilin on bundles.
      9. The authors indicate that the rapid fragmentation of twist constrained 2-filament bundles prevented them from directly quantifying the nucleation rate of the subsequent cofilin clusters that overlapped the initial ones. I'm unclear why this is the case, and if this is the case, I don't understand how the authors can be sure that a second nucleation event occurred in the twist constrained bundles. From the experimental data in 7C, it appears that the fragmentation rate for two filament bundles is similar to the fragmentation rate for twist constrained single filaments. The authors need to clearly state what they were able to observe and quantify as well as include the timing for this severing. If the authors could not observe a second nucleation event prior to severing, this should be clearly stated. This could be due to the rapid fragmentation, but it could also be due to severing occurring in the absence of a second cofilin nucleation event. It would be informative to compare the time from cofilin nucleation to severing event for two filament bundles in twist constrained and unconstrained. Clarification of the dynamics of nucleation and spreading of cofilin and the timing of fragmentation of the twist constrained filament bundles is needed.
      10. Discussion of how twist constrained fragmentation dynamics might affect the dynamics of larger bundles in structures such as filopodia would be useful.

      Minor changes that would improve the paper:

      1. In Figure 1C, Figure 2B and Figure 2E, the indication, on the graph, of the fold-change between the rates is confusing as it is not clear from the labeling on the graph that the x15 is referring to the slope of the lines, keeping this information in the legend is appropriate, but if it is to be included on the graph, perhaps adding in the linear fit on the graph is also needed.
      2. Figure 7A, lining up the diagram with the kymographs below would help improve interpretation of the diagram and simulation. Alternatively, if the diagram (upper) in A does not diagram the kymographs below, this needs to be clearly stated, and it would be preferable that the diagram above matches the kymographs below.
      3. Despite referencing the Breitsprecher, 2011 paper in the introduction, the authors do not explain how their results showing that cofilin fragments filament bundles slower than single actin filaments correspond with the Breitsprecher findings that fascin bundles favors cofilin filament severing. While the authors do not need to explain the Breitsprecher data, if they reference these findings that run counter to their results, an explanation for the discrepancy would be reasonable to include in the discussion.

      Significance

      The research presented in "Fascin-induced bundling protects actin filament from disassembly by cofilin" is relevant and of interest to the field as it directly addresses a limitation in our understanding of how cofilin-induced severing occurs in F-actin bundles. Bundled F-actin may constitute the majority of linear F-actin within the cell and is specifically important in F-actin-based structures such as filopodia and stress-fibers. The data supports a model for interfilament cooperativity that provides a molecular mechanism for cofilin-mediated severing of fascin-bundled filaments.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this study, Chikireddy et al. perform a series of experiments in which they compare the efficiency fo cofilin-mediated severing and actin filament disassembly on individual filaments versus bundles of different sizes from by the actin-bundling protein fascin. The key outcome, quite distinct from previously published conclusions by the authors themselves and other authors, is that fascin bundling actually reduces cofilin-mediated severing mostly because of much slower "nucleation" of cofilin clusters on fascin-bound filament bundles. Cofilin cluster formation is followed by local fascin removal, and the nucleation of a cofilin cluster on an adjacent bundle in the absence of fascin is strongly enhanced. The reason for the latter surprising observation is not entirely clear, but proposed to arise from cofilin-mediated changes in filament helicity of neighboring filaments. To my understanding, the main reason why fascin protects from cofilin severing here rather than enhancing it (as reported previously) is due to the lack of constraining of the induced, cofilin-mediated twist, because if this twist is constrained e.g. by anchoring of the bundles to the surface chamber, then severing by cofilin is accelerated.

      Major comments:

      I think the study is very well done, most experiments are super-elegant and controlled; I really don't have any objections against the conclusions drawn, as most of what I have seen is totally justified and reasonable. So from a scientific point of view, I can easily agree with all the major conclusions drawn, and so in my view, this should be published fast.

      Minor comments:

      There are two minor points that could be addressed:

      1. I am not entirely convinced by the conclusions drawn from the EM images shown in Figure 6A, and in particular by the filaments in two-filament bundles locally twisting around each other (without breaking) at spatial sites lacking fascin and decorated by cofilin. This is hard to imagine for me, and the evidence for something like this happening is not very strong, as in the EM, only larger bundles could be observed. In addition, I am not sure that the braiding of filaments seen in the presence of cofilin is really occurring just locally on cofilin-decorated bundle segments and thus indeed coincides with loss of fascin as proposed in the scheme in Fig. 6B. Can the authors exclude that the braiding is not caused by some experimental artefact, as induced perhaps by sample preparation for negative staining? Did the authors quantify the occurrence of such braided bundle segments with and without cofilin? How large are these braided segments on average when you quantify them? Would you also see them if you prepared the bundles for an alternative EM-technique, such as Cryo-EM, for instance? This may admittedly all be experimentally challenging, but would it be possible to combine the negative staining of filaments with staining for cofilin and/or fascin using immunogold technology, to prove that the braided segments do indeed correlate with high cofilin and low fascin concentrations? In the absence of such data, and in particular in the absence of a clear quantification, the proposal is too strong in my view. Finally, it would be nice (albeit not essential I guess) to also look at two-filament bundles. The authors stated these can not be easily generated due to the tendency of fascin to promote the formation of larger bundles, but can this not be titrated/tuned somehow by lowering fascin concentrations, to come closer in reality to what is proposed to occur in the scheme in Figure 6B? In any case, the way the data are presented right now appears to constitute a pretty large gap between experimental evidence and theoretical model.
      2. I think that the proposal of cofilin-decorated filaments to "transfer" the resulting cofilin-induced changes in filament helicity onto neighboring filaments in the bundle, which is proposed to occur locally and in the absence of fascin is a bit vague, and difficult to understand mechanistically. Can the authors speculate, at least, how they think this would occur? Are there no alternative possibilities for explaining obtained results? Maybe I am missing something here, but with considering cofilin to be monomeric and only harboring one actin-binding site, this proposal of helicity transfer onto neighboring filaments seems inconclusive.

      Significance

      General assessment:

      The strength of this study is that owing, at least in part, to the microfluidics devices employed and the careful biochemistry, the experimental setups are super-controlled and clean, and they are used in a highly innovative and elegant fashion. The simulations are also nice! A limitation is that it is not entirely clear how precisely the main observations can be translated to what's happening in vivo. The results are largely dependent on the bundles not being constrained I understand, so to what extent would bundles be unconstrained in vivo? Perhaps this is not so important, because the experimental setup allows the authors to dissect specific biochemical behaviors and inter-dependencies between distinct actin binding proteins, but the latter view (if correct) could be stated more clearly!

      Advance:

      As stated above, the results are opposite to the proposed synergistic activities of fascin and cofilin observed for bundles previously, perhaps because they were not constrained. So although touched in part and in a very polite fashion in the discussion, the authors could specify more clearly what the differences between the studies are, and which of the distinct activities observed either here or in previous literature will be dominant or more relevant to consider in the future? This will be hard to discern as is now, in particular for non-experts.

      Audience:

      This manuscript will be most influential for a specialized audience interested in the complexities of biochemical activities of specific actin binding proteins when looking at them in combination. Although specialized, this is still a quite relevant audience though, since prominent actin binding proteins like cofilin are highly important in virtually any cell type and various actin structures, hence of broad relevance again in this respect.

      Expertise:

      I am a cell biologist and geneticist interested in actin dynamics and actin-based, motile processes.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, it is shown that cofilin severs actin filaments slowly when fascin is present. Authors show that this is due to slower cluster nucleation of cofilin on fascin-induced actin bundles. Interestingly, the authors show that cofilin binding promotes helicity in actin filament bundles which in turn promotes facin exclusion and more cofilin clustering in adjacent filament bundles; thus, inducing local transmission of structural changes.

      The authors use an elegant approach, and the data is nicely presented. Overall, I consider that this manuscript is in good shape to be published. It might benefit from language editing, though.

      Significance

      According to me the significance of this manuscript is that elegantly shows the molecular details of the cofilin severing effect of fascin-induced actin filament bundles. The authors show that cofilin binding promotes helicity in actin filament bundles which in turn promotes facin exclusion and more cofilin clustering in adjacent filament bundles; thus, inducing local transmission of structural changes.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      ReviewCommons Reviews Point-by-Point

      Manuscript number: RC-2023-02131

      Corresponding author(s): Holger, Gerhardt

      Reviewer #1

      Evidence, reproducibility and clarity

      Summary: Giese et al. use genetic lineage tracing techniques and novel computational analysis methods to quantify and predict how the spatial distribution of ECs changes over time in the developing mouse retina from P5 to P9. They also develop mathematical models to describe and predict the response of ECs to the hypothesized dual force-field formed by the chemoattractant VEGFA, and the flow-induced shear stress.

      Given that the mouse retina cannot be live imaged, these new methods are essential to infer cell dynamics from static images. With these methods the authors confirm previous findings that arteries are formed by endothelial cells derived from veins, or tip cells, or capillaries. They then combine their genetic system with Cdc42 and Rac1 floxed alleles to understand the function of these genes in EC mobilization. They find that Cdc42 has a very strong role in EC mobilization to arteries, not so much to the sprouting front, whereas the loss of Rac1 has relatively minor effects in vivo. Loss of Rac1 slows the cells but they maintain their directionality towards arteries. The discussion section and integration of these paper findings with previous work in the field is excellent.

      Overall, this work provides a much higher level of quantitative analysis of endothelial cell dynamics in the developing vasculature of the mouse retina. It also provides mathematical models that can be useful to explain and predict the impact of other genetic mutations or pharmacological interventions during vascular development.

      Reviewer #1, Major comment 1: This work provides one of the finest examples of quantitative biology in the vascular biology field. The conclusion on cell dynamics is however largely based on static images measurements and pre-defined mathematical models given also previous work and the proposed model of a dual-force field. The authors conclude that sprouting front cells mainly migrate away towards VEGF, whereas remodelling plexus cells migrate towards arteries. However, this is based on the entire EC population measurements/displacement and averaging, and does not account for the possibility of a few ECs having a different behaviour from most of their neighbours. This comment is related with the fact that arteries are formed mainly by tip-derived ECs, the cells closest to the VEGF source and further away from the flow/shear stress force. It seems the authors model presented here would not allow this to happen. According to the model presented, it seems that an EC close to the VEGF source, and subjected to low flow (shear stress), would always migrate to the front and never turn back towards arteries. Can a more complex model enable the consideration of the stochastic loss of VEGF signalling (or gain of shear stress sensing) by some ECs at the sprouting front ? And their subsequent formation of arteries ?

      Response: We would like to thank the reviewer for these insightful views and for raising these questions. The aim of the computational model is to provide the simplest possible model that can be used to obtain estimates of EC migration patterns. This computational model can be used to introduce other aspects of endothelial cell behaviour, including proliferation and subpopulations with different migration sensitivities. However, the stochastic loss of VEGF signalling or gain of shear stress sensing is modelled by a stochastic term. For example, the coupling strength for control is 0.36, meaning that 0.36 is explained by directed migration along the force field, while the remaining 0.64 of the movement is stochastic. We agree that this could also be modelled differently to disentangle this stochastic term if more data on subpopulations were given. This will certainly be a fruitful direction for future research. As we do not have explicit proliferation data or data on subpopulations, direct inclusion of these extensions is difficult to validate and justify, or must be based on further assumptions or speculation.

      We agree with the reviewer that there are likely distinct subpopulations, given also that ECs can compensate for higher sensitivity to shear stress or VEGF-A with higher migration speed (see Figure 2 F). Currently, the shear stress cue is overwritten by the VEGF-A cue in the sprouting front. Furthermore, both cues cancel each other out in the transition region from remodelling plexus to sprouting front, this was also suggested in (Barbacena et al., 2022) though this behaviour can be explained by heterogeneity or ECs being randomly polarised due to conflicting cues, which is not directly resolved by the data.

      Nevertheless, random migration is a very inefficient strategy as shown in our theoretical investigation. Therefore, at the system level, it seems to be a much more efficient strategy to have EC subpopulations, since in this case ECs would migrate directly along one of the cues, rather than behaving randomly due to conflicting cues. We will add these considerations to the discussion of the mathematical model.

      *Reviewer #1, Major comment 2: Previous work by Laviña et al showed that Cdc42 is required for the migration of ECs to the sprouting front. The authors' data suggest that Cdc42 is not necessary for this process. *

      __Response: __We thank the reviewer for highlighting these points. We report a significant phenotype for Cdc42 depleted cells in the sprouting front: There is a significant shift of labelled cells from arterial to venous direction from P5 to P9 and in particular from P8 to P9. Therefore, the region of the sprouting front above the artery lacks Cdc42-depleted ECs, which can be clearly seen in the KDE plots in Figure 3.

      A very important difference in our study is the definition of the sprouting front. In our study, the sprouting front is a whole region that extends to the tip of the vein, see Figure 2 and explanation on page 7: “Additionally, each retina was divided into a remodelled region containing mature veins and arteries (in the following called remodelling plexus) and a region lacking mature vessels (called sprouting front)”. However, in Lavina et al., as well as in some other studies, for example Barbacena et al., the sprouting front is the very end of the retinal vasculature. In Barbacena et al. the sprouting front is VEGFA dependent on the 100-200 μm from the very edge of the vasculature, whereas our region of interest extends much further for about 500 μm and we may lose definition for a tip cell related Cdc42 phenotype in our analysis. Our KDE plots extending from 0 (optic nerve) to 2000 μm (end of the vasculature) show a clear sprouting front Cdc42 phenotype, indicating that there is an accumulation of Cdc42 KO ECs at the end of the veins. However, these plots lose definition in the region analysed in Lavina et al. and Barbacena et al. Therefore, we will extend our analysis and explicitly report cell number proportions in the sprouting front that are compatible with (Lavina et al., 2018) and (Barbacena et al., 2022).

      We will add a quantification of EC proportions in the sprouting front to make our study more comparable.

      *Reviewer #1, Major comment 2 (continued): Could the difference between previous and the authors results be technical and related with the different stage of induction/analysis or the extent of Cdc42 deletion ? Did the authors tried inducing at P1 and collecting at P6/P7 ? *

      Response: Lavina et al. used earlier induction time points at P0, P1 and P2. We do not know whether a retina at P5 requires less Cdc42 for sprouting front development. At this time point (P5), there is a distinct vasculature of veins, arteries forming a vascular plexus, and a sprouting front in contrast to P1, which is mainly a small vascular plexus. We will add these important points to the discussion of our manuscript.

      In order to understand the role Cdc42 and Rac1 play in sprouting angiogenesis and migration to the sprouting front that is not influenced by different Cre lines and induction regimen, we will co-culture control and Cdc42/Rac1-deficient HUVECs in 3D microfluidic devices and analyse their sprouting over a number of days . With this assay we will be able to quantify the proportion of siCdc42/siRAC1 cells in tip and stalk positions compared to control, as well as do junctional and cytoskeletal analysis via immunofluorescent staining. See preliminary data in Additional Fig. 1 .

      Reviewer #1, Major comment 2 (continued): The reporters used were also different and they may have different sensitivities to tamoxifen (and hence report Cdc42 deletion differently).

      __Response: __This is correct as different Cre reporters have been shown to exhibit different sensitivities to recombine, including some with tamoxifen independent activity. The mTmG reporter we used however has been shown to reliably report tamoxifen induced recombination. Nevertheless, given that the reporter allele and the floxed gene alleles can recombine independent from each other, we cannot exclude the possibility that some GFP expressing cells still express Cdc42 or Rac1, or that some cells that have lost expression of Cdc42 or Rac1 due to recombination remain GFP negative. Statistically, these will however be rare events. The fact that we track all GFP positive cells allows us to draw conclusions on population behaviour, but not necessarily on the validity of any specific cell. The power of our analysis lies in the ability to draw conclusions on many randomly labelled populations across multiple time points without the necessity to validate each individual cell. The fact that the GFP population that carries floxed alleles for Cdc42 or Rac1 behave differently from those that do not provides strong evidence for successful loss of function for most of the cells. Importantly, unlike conventional full KO, this altered population behaviour occurs in the absence of an overt overall tissue phenotype, as we only lose gene function in a subpopulation of endothelial cells. The fact that we observe distinct deficiencies for migration towards the artery but not towards the sprouting front is therefore likely a true reflection of distinct functional importance and not evidence for a technical problem. Orthogonal evidence for such a selective role stems further from our in vitro cell culture experiments.

      In the revised manuscript, we will include new mosaic flow-migration microfluidic studies as well as the mosaic vessel-on-chip assays mentioned above to independently verify the selective role for Cdc42 in flow-migration coupling versus sprouting (Additional Fig. 1).

      Reviewer #1, Major comment 3: In the last section, some of the junctional/polarity/actin markers and analysis done in vitro could be also done in vivo.

      Response: We agree with the reviewer that these experiments could be very informative to investigate junctional, polarity and actin markers. However, we believe that without a specific question these experiments would be rather explorative, do not add significant information to the current message of the study and can therefore not justify further animal experiments. This should in our opinion be the subject of future work.

      We will however, quantify junctional markers in a sprouting 3D assay, see response to Reviewer #1, Major comment 2.

      Reviewer #1, Major comment 4: The extent of Rac1 deletion in the mosaic experiments (done with suboptimal doses of

      tamoxifen) could be analysed. This is especially relevant since minor effects for Rac1 were observed in these in vivo experiments.

      Response: Lavina et al. used earlier induction time points at P0, P1 and P2. We do not know whether a retina at P5 requires less Cdc42 for sprouting front development. At this time point (P5), there is a distinct vasculature of veins, arteries forming a vascular plexus, and a sprouting front in contrast to P1, which is mainly a small vascular plexus. We will add these important points to the discussion of our manuscript.

      In order to understand the role Cdc42 and Rac1 play in sprouting angiogenesis and migration to the sprouting front that is not influenced by different Cre lines and induction regimen, we will co-culture control and Cdc42/Rac1-deficient HUVECs in 3D microfluidic devices and analyse their sprouting over a number of days . With this assay we will be able to quantify the proportion of siCdc42/siRAC1 cells in tip and stalk positions compared to control, as well as do junctional and cytoskeletal analysis via immunofluorescent staining. See preliminary data in Additional Fig. 1 .

      Reviewer #1, Minor comment 1: Lee et al., 2022 is a review. Better cite the original papers if possible: Some examples: Xu et al., 2014, Pitulescu et al 2017 and Lee et al., 2021.

      Response: We would like to thank the reviewer for this suggestion and have added the citations to original publications where appropriate (see page 2 in the “Introduction” section).

      Reviewer #1, Minor comment 2: Figure 1A: Stage of induction with tamoxifen is missing. Likely P5.

      Response: We added this information to the caption of Figure 1A, Figure 3A and Figure 4A.

      Reviewer #1, Minor comment 3: Figure 3 and 4 data would be easier to compare/understand by readers if part of the Wt data in Figure 1 was also plotted here. Or at least a Wt trend/average line on top of the mutant data, for us to see how much Cdc42 or Rac1 deletion changes the behaviour of the mutant cells versus the Wt cells.

      Response: We agree with this suggestion and will add the wild type data for comparison in Figure 3 and 4.

      Reviewer #1, Minor comment 4: Overall, for all dot plots and heatmaps, would be better to indicate the total number of cells analysed/plotted since the power of the analysis is related with cell number rather than number of retinas.

      Response: We would like to thank the reviewer for this comment and agree that indicating the number of cells analysed allows the reader to contextualise the results more easily. We will therefore add estimated numbers of cells analysed to the respective figures. However, it is important to note that we used labelled pixels (GFP and ERG positive) as a proxy for the EC distribution, but did not segment out single cells. We always used the same number of 10,000 randomly selected pixels by bootstrapping to quantify the endothelial cell distributions. The heat map plots summarise the single cell behaviour pooled together for all retina samples. Therefore, each retina contributed equally to the analysis. This way, we could provide statistics on independent biological replicated samples for a thorough analysis.

      Significance

      General assessment: This paper is very strong on the quantitative analysis and mathematical modelling. Methods used and the model proposed may be of broad relevance for the field of vascular biology. It is however based on certain author-defined parameters and assumptions. EC dynamics in vivo can be much more complex than what can be modelled by equations. For example, heterogenous single cell genetics and signalling inputs can induce changes in cells that override the normal/average behaviour of the cells that are modelled. Despite the high level of quantitative analysis and modelling, the main findings here presented are not entirely novel, given previous work. For example, it was previously known that arteries are formed by vein/tip/capillary cells. It was also known that Cdc42 was required for proper EC migration away from veins (Laviña et al., 2018). However, the better quantitative analysis here presented does provide a higher level of detail and reliability.

      The mosaic genetics used to delete Cdc42 is in general clear since few reporter positive cells can make arteries, suggesting efficient deletion of this gene. The data also goes in line with previous work. However for Rac1, given that a much weaker phenotype was observed, is not possible to be sure that all GFP+ ECs had deletion of Rac1. This is especially important in mosaic genetic experiments, using a suboptimal dose of tamoxifen. The extent of Rac1 deletion in GFP+ cells was not analysed. Which leaves the open question if Rac1 is really dispensable for EC migration and arterialization. Embryos lacking Rac1 in endothelial cells die early during development. Therefore ECs fully lacking Rac1 may have stronger defects than the ones shown here. All this data was obtained in the postnatal retina angiogenesis system. Other organ vessels may develop differently. Future work will tell if the models proposed can explain the dynamics of ECs during the growth of other vascular beds.

      *Audience: Vascular/Cell biology researchers and bioinformaticians developing tools for image analysis and/or cell migration/dynamics modelling. *

      Expertise of Reviewer: Vascular Biology. I do not have sufficient expertise to evaluate the mathematical modelling part of this paper.

      Reviewer #2

      Evidence, reproducibility and clarity

      Summary: Giese et al., develop a computational model to track the migration of tip and venous endothelial cells (ECs) into developing arteries in the mouse retina. They validate prior studies which have identified important roles of shear stress and VEGFA gradient in driving EC migration and argue through their model that these dual forces shift in their relative influence along the vein-artery axis. Through in vivo lineage tracing in combination with Cre-inducible mouse knockout models, Giese et al., explore the role of Cdc42 and Rac1 in flow-migration coupling. They conclude that Cdc42, and not Rac1, is the primary driver of polarized flow-migration, further confirming this result with in vitro flow experiments involving siRNA depletion of Cdc42. Finally, the authors show that Cdc42 also plays a role in EC migration independent of flow, highlighting that its role in migration is universal and directly related to both actin regulation and cell junction dynamics.

      Reviewer #2, Major comment 1: In Figure 1 the authors show retinas from a Vegfr3CreERT2 mouse model and make the claim that Vegfr3 is expressed in ECs in developing veins and tip cells, but not in arteries. Ralf Adam's group has previously shown that while Vegfr3 is more abundant in veins and capillaries than in arteries, expression is not exclusive to these EC subtypes (Ehling 2013). They, along with studies from Christer Betsholtz and Kari Alitalo's groups, show that Vegfr3 is observed in postnatal arteries (Tammela 2008, Ehling 2013). Indeed, from the panels in figure 1 (specifically P6 and P7) and supplemental figure 1, it appears that there may also be low levels of Vegfr3 expression in the arterial branches of postnatal Vegfr3CreERT2 mouse retinas. The authors should consider revising their statement that "ECs in arteries, however, do not express Vegfr3" and provide quantification of the number of GFP-labeled cells in arteries, veins and capillaries for each postnatal time point. Additional lineage tracing from later stages when migration into arteries is halted would be a good control for demonstrating that Vegfr3CreERT2 is not expressed in arteries, further support the authors' argument and conclusions.

      Response: We would like to thank the reviewer for this comment. The percentage of labelled ECs in arteries of retinas for mice injected at P5 and collection at P6 is close to zero in all conditions (0.38 % +- 0.08 % (SEM) for control, 0.35 % +- 0.05 % (SEM) for Cdc42 depleted and 0.36 % +- 0.07% (SEM) for Rac1 depleted ECs). The same is true for mice injected at P8 and collected at P9 (0.22 % +- 0.06 % (SEM) for control, 0.11 % +- 0.03 % (SEM) for Cdc42 depleted and 0.11 % +- 0.01 % (SEM) for Rac1 depleted ECs).

      As suggested by the reviewer, we will therefore revise the statement "ECs in arteries, however, do not express Vegfr3" and instead present the exact numbers in the revised manuscript, as this claim cannot be rejected or supported by our data. We will also add a discussion with references to (Tammela et al. 2008, Ehling et al. 2013) in our revised manuscript.

      We would like to point out that this does not change the results of our study, since for us the Cre line is primarily a tool to label ECs of venous and microvascular origin to follow the change in EC distributions over time, and any pan endothelial Cre line would be suitable for our analysis. This was demonstrated for example in (Jin et al. 2022), where ECs with Cdh5Cre-induced expression of iSureCre+ MbTomato were used.

      In addition, we will provide percentages as well as total cell numbers for labelled ECs in veins, arteries and capillaries in the revision of our manuscript for all time points and conditions.

      Reviewer #2, Major comment 2: In their computational simulations, the authors investigate three models: random walking of ECs (M1), VEGFA gradient driven migration (M2), and integrated VEGFA /shear-stress driven migration (M3). The reader naturally wonders why a model considering only shear-stress driven migration is not presented as a control simulation. The absence of this model reduces the strength of the claims that only the M3 model captures observed EC movement rates, the mean population shift in vein-artery distance, and the arterial proportion of ECs.

      __Response: __We agree with the reviewer and will certainly add a simulation of a shear stress only model and introduce it as model M4 in the revised manuscript.

      Reviewer #2, Major comment 3: Much of the terminology in this study needs more detailed explanation and carefully usage. It would be more friendly to readers if they were consolidated into a box figure or table. (For example, how is coupling strength related or different from coupling rate?) The reported numbers of these factors are noted separately in text here and there. It would be helpful to put them together to highlight the difference between different models and mutant strains, as this is one of the novel findings for this study.

      Response: Thank you for pointing this out. The word 'coupling rate' was incorrectly used in the introduction on page 3. It has been replaced by 'coupling strength', which is used throughout the text. We will add a table with quantitative information and explanations of parameters.

      Reviewer #2, Major comment 4: The lack of labelled cells in the P9 retinas of Cdc42iECKO mice is striking (Figure 3), and strong evidence for the importance of Cdc42 in the migration of ECs towards arteries. The authors cite Lavina et al, 2018 when they note that Cdc42 depleted ECs proliferate at normal rates, but independent verification of this observation through EdU quantification would allow the authors to distinguish between the two possibilities outlined on page 15 of the manuscript: local proliferation vs enhanced migration of non-recombined ECs. These experiments and analyses are expected to be quick (depending on the availability of mice) and low cost. An independent EC proliferation analysis would also give the authors insight into the degree to which localized proliferation likely impacts vein-artery migration, a parameter which is currently unaccounted for in their computational model. The authors recognize that the lack of EC proliferation parameters is a limitation of their current model and speculate that this makes their estimates of vein to artery migration slightly too low. Independent EC proliferation analysis would thus be informative and may allow the authors to remove this speculation from their discussion.

      __Response: __We do not see any reduction in the labelled Cdc42 population (Supplementary Figure 2) at P9. for all conditions we observe an increase of the labelled population from earlier to later stages (P6, P7, P8, P9). Our analysis is robust with respect to EC number, meaning that changes in EC number does not change the distribution. Our computational model would only slightly be affected by the difference in proliferation between arterial and venous beds, but not by the total number of proliferation events.

      It is known from the literature that endothelial activation by shear stress is associated with inhibition of EC proliferation (Dejana et al. 2004; Bogorad et al. 2015). We used immunostaining to label phospho-histone H3 (pHH3) in perfused monolayers after 12 hours of flow exposure to uncover the effects that downregulation of Rho GTPases might have on the proliferation of ECs after exposure to flow. The biomarker pHH3 is a well-established standard for detecting the late G2 phase of mitosis. As shown in Additional Fig. 5, and in agreement with the literature, proliferation of ECs under flow conditions decreased by 27.38% compared to control static conditions. Following Rho GTPase depletion, siCdc42 cells further decreased their proliferation by 22.48% compared to control flow conditions, respectively. No significant difference was observed in siRac1 cells. It has been shown that the absence of Cdc42 increases EC apoptosis both in vivo and in vitro (Barry et al. 2015; Jin et al., 2013), but the role of Cdc42 in endothelial proliferation remains unclear in the literature. The data presented here suggest that Cdc42 has a modest effect on endothelial proliferation under flow in vitro. As shown in Additional Fig. 5, none of the loss-of-function conditions appeared to drastically alter the effect of flow on reducing proliferation in confluent monolayers. This suggests that RhoGTPases may not play a major role in the regulation of proliferation under the influence of flow.

      We do not expect any further insight from EdU staining since also with EdU staining we could only quantify ECs entering the S phase in static images and how this translates into proliferation would still introduce further speculation. We therefore addressed this question in our discussion, as a quantification of proliferation would go beyond the scope of this study.

      Of note, any requests for additional in vivo experiments would require new animal licence application and therefore a considerable time delay which we would only suggest to accept if substantial additional insight was to be expected. As it stands, all our claims are supported by several independent observations and can where necessary and as detailed in our revision plan, be addressed by orthogonal means.

      Reviewer #2, Major comment 5: In the legends of Figure 1B & C, no information about the x or y-axis were mentioned. Even though the authors explained it elsewhere in the text or other figures, it is the first time that these parameters show up in the paper, which would be a proper place to note the details of these KDE plots.

      Response: We added this information in the caption of Figure 1B,C, Figure 3B,C, and Figure 4B,C.

      Reviewer #2, Major comment 6: In the legends of Figure 2B & C, it mentioned the results of coupling strength and fraction of shear stress, which was not obvious by just looking at the figures provided. It is also not mentioned whether the numbers are calculated by collected or predicted data.

      __Response: __We agree with the reviewer that this information should be more prominent in the figure and will add it accordingly. These are predicted data, where we systematically tested different values of key parameters. We will therefore also change the caption of the figure from “Simulation of EC migration in the retinal vasculature” to “Prediction of EC distributions from computational model simulations” to make this clear.

      Reviewer #2, Major comment 7: Please direct the reader to the supplemental figure containing the tamoxifen injection strategy when the Vegfr3CreERT2 mouse model is introduced in the text and when Cdc42 and Rac1 EC knockout is discussed in the methods.

      __Response: __We also moved Supplementary Figure 1 and 2 to the beginning of the supplement as they are now referenced first in the main text and hope this will erase the confusion.

      Reviewer #2, Major comment 8: There appears to be a significant degree of variance in the KDE plots from Figures 1, 3, and 4. It would therefore be helpful if the authors could include plots of the remodeling plexus and sprouting front that show the median for each dataset (similar to Figure 1 D and E), since this is less likely to be skewed by extrema. Additionally, for clarity and ease to read, it would be nice if author can consolidate the results of the mean from WT, Cdc42 and Rac 1 iECKO side by side. The images of the retina are striking, but the quantification is not as well-communicated.

      Response: We plotted the median (including 10th and 90th percentile) already in Figure 1 Supplement 1, Figure 3 Supplement 1 and Figure 4 Supplement 1 and found significant

      Reviewer #2, Major comment 8 (continued): If the argument is that Cdc42 only disrupts the migration, but not the differentiation, of artery EC, the authors should see a significant difference between WT and Cdc42 iECKO only in P9, but not early stages.

      Response: We are not sure that we have fully understood the reviewer's reasoning, but our data confirm the reviewer's prediction of disrupted Cdc42 KO EC migration: The timeline for the percentage of labelled arterial ECs relative to the total labelled EC population is shown in Additional Fig. 2. Here, a significant increase was observed for both control and Rac1 iECKO ECs between time points P8 and P9, but not at earlier stages. For Cdc42 iECKO no significant increase was found at all stages. A direct comparison of all conditions is shown in Additional Fig. 3. Similarly, the proportion of Cdc42 iECKO compared to control is significantly different at P9 for mice injected at P5, but not at earlier stages.

      Reviewer #2, Minor comment 1: For KDE plots in Figure 1B and Figure 3B, the authors labeled the x-axis differently (r vs dr). No description about the x-axis is noted in the legends. Please consider keeping the label consistent so the readers can relate and compare them.

      Response: Thank you for spotting this mislabelling. We changed the axis annotation to d_r in Figure 1 as it is used in Figure 3,4 and throughout the text. We added furthermore information on the x and y axis in the caption of Figure 1B,C, Figure 3B,C, and Figure 4B,C. See also major comment 5.

      Reviewer #2, Minor comment 2: For the in vitro study, the cell line used is not mentioned in the results, even though there is a method section about HUVEC. Authors should note this information in the main text.

      Response: We agree with the reviewer and have added the following information in the main text on page 18 in the section “Cdc42, but not Rac1, drives polarised flow-migration in vitro” stating:

      “To assess EC migration under coupling to shear forces in vitro, siScrambled (siScr) control, siCdc42 and siRac1 treated human umbilical venous endothelial cell (HUVEC) monolayers were exposed to 20 dynes/cm2 of flow using the Ibidi perfusion system and observed for up to 17 hours using a non-toxic fluorescent DNA dye for nuclear tracking.”

      *Reviewer #2, Minor comment 3: Please note the flow direction in Figure 5B. *

      __Response: __We have added an additional indication of flow direction for Figure 5B to improve the clarity of the figure. Note that the direction of flow is already indicated in Figure 5A and is the same throughout the figure.

      Reviewer #2, Minor comment 4: The naming of supplemental figures requires revisiting. Currently there are two figures labeled with variations of supplementary Figure 1, which makes identifying the correct data challenging.

      Response: We regret that the reviewer found the labelling of supplementary figures ambiguous and thank the reviewer for spotting this mislabelling. We corrected the label Figure S1 to Supplementary Figure 1. Furthermore, we also moved Supplementary Figure 1 and 2 to the beginning of the supplement as they are now referenced first in the main text and hope this will erase the confusion (also see Reviewer #2, major comment 7).

      Reviewer #2, Minor comment 5: There appears to be a small typo on page 16 in second paragraph in the sentence that currently reads "Nevertheless, only few cells reached the arteries by P9"

      Response: We changed this sentence: “Nevertheless, the accumulation of Rac1-deficient ECs in the artery was less pronounced compared to control.” Also we provide actual numbers, see Additional Fig. 2 and Additional Fig. 3 and our response to Reviewer #1, Major comment 4.

      Reviewer #2, Minor comment 6: The axis labels for the plots in Figure 5, Figure 2 Supplement 1, and Supplementary Figure 2 are currently very tiny and difficult to read.

      Response: We agree with this suggestion and will increase the size of the axis labels.

      Reviewer #2, Minor comment 7: Additional information is required to describe the trajectory plots in Figure 2 Supplement 1, Figure 3 Supplement 3 and Figure 4 Supplement 3. I assume that blue trajectories move in the positive x direction while orange trajectories move in the negative x direction, but this is currently not specified in any of the legends.

      Response: We added a specification in the figure legend.

      Significance

      Giese et al advance current understanding of the molecular mechanisms that guide vein to artery migration by convincingly demonstrating (both in-vivo and in-vitro) that Cdc42 is essential for this process. In addition, they present a computational model that captures the dual force field of VEGFA and shear stress gradients to simulate and quantify this migratory process in the mouse retina: a tool which will likely be useful for future mechanistic studies of vascular remodeling and EC migration. As far as I am aware, no standard coordinate system exists in the field for the quantification and modeling of this migratory process, so the introduction of this method alone serves as a useful innovation for the field of vascular biology.

      It should be noted by authors and the editors that the mathematical details of the computational model and its statistical accuracies are not evaluated in this review, which instead focuses on the study's findings as they relate to EC biology and vascular development.

      This study complements previous work that has identified populations of ECs that appear to be primed for incorporation into arteries, termed "pre-arteries" (Su et al 2018, Phansalkar 2021, Luo 2021). The authors' observation of heterogeneous EC movements during migration supports the intriguing suggestion that actin regulation through Cdc42 may be related to the establishment or phenotype of this pre-artery identity, though further mechanistic work will be required to validate this hypothesis, as noted by Giese et al in their thoughtful discussion.

      Reviewer #3

      Evidence, reproducibility and clarity

      Summary: this study uses VEGFR3cre as a lineage tracking system to track venous endothelial cells within the maturing retina postpartum in order to investigate the cues related to sprouting and remodeling of the vascular network. They use complex computation modeling to predict outcome and come with in vivo/vitro observations.

      They identify that venous EC move towards the arterial bed with flow and oxygen tensions as critical parameter. It is an impressive study that in principle confirms earlier studies on the role of EC population in sprouting and vascular remodeling however utilizes an interesting cell population based computational approach. Since these finding are pretty new, confirmation with different technologies and approaches is important.

      However for more "traditional" vascular biologist it is very difficult to understand and follow particular if not familiar with the data presentation and mathematical modeling. This is a major shortcoming of this manuscript because I feel that if the authors would put more effort to better communicate their findings to a broader community not only enhance the value of their findings but also disseminate their computational approaches to a broader vascular scientist community.

      __Response: __We thank the reviewer for this valuable critique, we will add a table where all parameters are described. Furthermore, we will add an additional subpanel in Figure 2 to explain the computational model.

      I do have a couple of model related comments. The authors are using different models without adequate description.

      The VEGFR3 is a lymphatic EC tracking model that potential can be used to track venous tip cells in the retina. But this model is not characterized within the retina in the study the author cite. I believe it would be necessary to characterize this model in traditional way before it can be used as described.

      __Response: __We appreciate this comment and generally agree with the wish to see Cre lines well characterised. This is particularly relevant in studies where researchers delete a gene of interest using a Cre line and then make claims about the role of the gene in certain tissues with assumed recombination specificity. In our study however, we are not using the Cre line as a lymphatic or venous specific line to make such claims. Instead we use it in combination with the mTmG reporter, to quantify population distributions. Therefore every sample is its own “characterization”.

      We could cite literature that support our claim of lack of arterial expression (Ehling et al., 2013, Tammela et al., 2008) of Vegfr3 in postnatal retina, but these studies did not use this Cre-line. For the purpose of our study, and in line with previous comments by referee 2, we feel it is best to moderate the claim, as very rarely single arterial cells can be found to have recombined 24 h after tamoxifen injection, see Additional Fig. 2 and Additional Fig. 3. The revised manuscript therefore has the claim toned down to better reflect this. Nevertheless, the utility of the Cre-line is not dependent on whether or not single arterial cells can be labelled, as the coordinate system and population quantification shows the population shift. This would even be valid using Cre-lines with random endothelial recombination in all vascular segments (Jin et al., 2022).

      If deemed necessary, we have reporter expression from various stages of recombination in the postnatal retina, as well as in the developing brain, as well as comparison with arterial Cre-lines such a BMX cre. A rather complete characterization could be provided in supplements. However, we would argue that this is not relevant for the present study.

      Distance to artery /veins is one parameter the authors are using in their modeling. I'm not sure I understand how they determine it since ECs can original form any place with the venous network, then again they might not.

      __Response: __In the computational model we assume an initial distribution that is derived from the distribution at P6, which is shown in Figure 2A (middle panel). In summary, we do not hypothesise any origin of the ECs but start with the experimentally observed distribution at P6. From this starting distribution and for the following time steps we can then compute the exact distances to the closest vein and artery.

      SiRNA experiment do not show knock down efficiency, which is probably also heterogenous. I'm not sure if this affects the modeling. In the last set they use HUVECs which is a very specialized "venous" ECs which I would not use for their modelsystem.

      __Response: __The knockdown efficiency is shown in the supplementary data, see Figure 5 Supplementary data 2: qPCR knockdown validation. We do not use the in vitro data for the computational modelling, only the distribution in the in vivo data. Vegfr3 is expressed in endothelial tip cells and ECs in the developing vein, as well as in scattered ECs throughout the primitive vascular plexus. Therefore, despite the general limitations of in vitro systems, HUVECs are very similar to the in vivo situation shown in our study. HUVECs, despite being of venous origin, are a very versatile tool for endothelial studies. They express both venous and arterial genes, including dll4 and many components of the notch signalling cascade. Importantly, they are heterogenous, but adapt to media and flow conditions. The medium we use stimulates a microvascular growth pattern, and exposing HUVECs to flow results in transcriptional and proteomic changes that fit well with microvascular responses. Using fully differentiated arterial endothelial cells would not be useful as we are modelling endothelial responses that set in venous and microvascular regions of the vascular plexus in vivo, and stimulate a response that leads to movement towards arteries. We have therefore purposefully chosen and validated this model system.

      Significance

      I already commented in the above paragraph on this topics

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      this study uses VEGFR3cre as a lineage tracking system to track venous endothelial cells within the maturing retina postpartum in order to investigate the cues related to sprouting and remodeling of the vascular network. They use complex computation modeling to predict outcome and come with in vivo/vitro observations.

      They identify that venous EC move towards the arterial bed with flow and oxygen tensions as critical parameter. It is an impressive study that in principle confirms earlier studies on the role of EC population in sprouting and vascular remodeling however utilizes an interesting cell population based computational approach. Since these finding are pretty new, confirmation with different technologies and approaches is important.

      However for more "traditional" vascular biologist it is very difficult to understand and follow particular if not familiar with the data presentation and mathematical modeling. This is a major shortcoming of this manuscript because I feel that if the authors would put more effort to better communicate their findings to a broader community not only enhance the value of their findings but also disseminate their computational approaches to a broader vascular scientist community. I do have a couple of model related comments. The authors are using different models without adequate description. The VEGFR3 is a lymphatic EC tracking model that potential can be used to track venous tip cells in the retina. But this model is not characterized within the retina in the study the author cite. I believe it would be necessary to characterize this model in traditional way before it can be used as described. Distance to artery /veins is one parameter the authors are using in their modeling. I'm not sure I understand how they determine it since ECs can original form any place with the venous network, then again they might not. SiRNA experiment do not show knock down efficiency, which is probably also heterogenous. I'm not sure if this affects the modeling. In the last set they use HUVECs which is a very specialized "venous" ECs which I would not use for their modelsystem

      Significance

      The VEGFR3 is a lymphatic EC tracking model that potential can be used to track venous tip cells in the retina. But this model is not characterized within the retina in the study the author cite. I believe it would be necessary to characterize this model in traditional way before it can be used as described.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Giese et al., develop a computational model to track the migration of tip and venous endothelial cells (ECs) into developing arteries in the mouse retina. They validate prior studies which have identified important roles of shear stress and VEGFA gradient in driving EC migration and argue through their model that these dual forces shift in their relative influence along the vein-artery axis. Through in vivo lineage tracing in combination with Cre-inducible mouse knockout models, Giese et al., explore the role of Cdc42 and Rac1 in flow-migration coupling. They conclude that Cdc42, and not Rac1, is the primary driver of polarized flow-migration, further confirming this result with in vitro flow experiments involving siRNA depletion of Cdc42. Finally, the authors show that Cdc42 also plays a role in EC migration independent of flow, highlighting that its role in migration is universal and directly related to both actin regulation and cell junction dynamics.

      Major Comments:

      1. In Figure 1 the authors show retinas from a Vegfr3CreERT2 mouse model and make the claim that Vegfr3 is expressed in ECs in developing veins and tip cells, but not in arteries. Ralf Adam's group has previously shown that while Vegfr3 is more abundant in veins and capillaries than in arteries, expression is not exclusive to these EC subtypes (Ehling 2013). They, along with studies from Christer Betsholtz and Kari Alitalo's groups, show that Vegfr3 is observed in postnatal arteries (Tammela 2008, Ehling 2013). Indeed, from the panels in figure 1 (specifically P6 and P7) and supplemental figure 1, it appears that there may also be low levels of Vegfr3 expression in the arterial branches of postnatal Vegfr3CreERT2 mouse retinas. The authors should consider revising their statement that "ECs in arteries, however, do not express Vegfr3" and provide quantification of the number of GFP-labeled cells in arteries, veins and capillaries for each postnatal time point. Additional lineage tracing from later stages when migration into arteries is halted would be a good control for demonstrating that Vegfr3CreERT2 is not expressed in arteries, further support the authors' argument and conclusions.

      2. In their computational simulations, the authors investigate three models: random walking of ECs (M1), VEGFA gradient driven migration (M2), and integrated VEGFA /shear-stress driven migration (M3). The reader naturally wonders why a model considering only shear-stress driven migration is not presented as a control simulation. The absence of this model reduces the strength of the claims that only the M3 model captures observed EC movement rates, the mean population shift in vein-artery distance, and the arterial proportion of ECs.

      3. Much of the terminology in this study needs more detailed explanation and carefully usage. It would be more friendly to readers if they were consolidated into a box figure or table. (For example, how is coupling strength related or different from coupling rate?) The reported numbers of these factors are noted separately in text here and there. It would be helpful to put them together to highlight the difference between different models and mutant strains, as this is one of the novel findings for this study.

      4. The lack of labelled cells in the P9 retinas of Cdc42iECKO mice is striking (Figure 3), and strong evidence for the importance of Cdc42 in the migration of ECs towards arteries. The authors cite Lavina et al, 2018 when they note that Cdc42 depleted ECs proliferate at normal rates, but independent verification of this observation through EdU quantification would allow the authors to distinguish between the two possibilities outlined on page 15 of the manuscript: local proliferation vs enhanced migration of non-recombined ECs. These experiments and analyses are expected to be quick (depending on the availability of mice) and low cost. An independent EC proliferation analysis would also give the authors insight into the degree to which localized proliferation likely impacts vein-artery migration, a parameter which is currently unaccounted for in their computational model. The authors recognize that the lack of EC proliferation parameters is a limitation of their current model and speculate that this makes their estimates of vein to artery migration slightly too low. Independent EC proliferation analysis would thus be informative and may allow the authors to remove this speculation from their discussion.

      5. In the legends of Figure 1B & C, no information about the x or y-axis were mentioned. Even though the authors explained it elsewhere in the text or other figures, it is the first time that these parameters show up in the paper, which would be a proper place to note the details of these KDE plots.

      6. In the legends of Figure 2B & C, it mentioned the results of coupling strength and fraction of shear stress, which was not obvious by just looking at the figures provided. It is also not mentioned whether the numbers are calculated by collected or predicted data.

      7. Please direct the reader to the supplemental figure containing the tamoxifen injection strategy when the Vegfr3CreERT2 mouse model is introduced in the text and when Cdc42 and Rac1 EC knockout is discussed in the methods.

      8. There appears to be a significant degree of variance in the KDE plots from Figures 1, 3, and 4. It would therefore be helpful if the authors could include plots of the remodeling plexus and sprouting front that show the median for each dataset (similar to Figure 1 D and E), since this is less likely to be skewed by extrema. Additionally, for clarity and ease to read, it would be nice if author can consolidate the results of the mean from WT, Cdc42 and Rac 1 iECKO side by side. The images of the retina are striking, but the quantification is not as well-communicated. If the argument is that Cdc42 only disrupts the migration, but not the differentiation, of artery EC, the authors should see a significant difference between WT and Cdc42 iECKO only in P9, but not early stages.

      Minor Comments:

      1. For KDE plots in Figure 1B and Figure 3B, the authors labeled the x-axis differently (r vs dr). No description about the x-axis is noted in the legends. Please consider keeping the label consistent so the readers can relate and compare them.

      2. For the in vitro study, the cell line used is not mentioned in the results, even though there is a method section about HUVEC. Authors should note this information in the main text.

      3. Please note the flow direction in Figure 5B.

      4. The naming of supplemental figures requires revisiting. Currently there are two figures labeled with variations of supplementary Figure 1, which makes identifying the correct data challenging.

      5. There appears to be a small typo on page 16 in second paragraph in the sentence that currently reads "Nevertheless, only few cells reached the arteries by P9"

      6. The axis labels for the plots in Figure 5, Figure 2 Supplement 1, and Supplementary Figure 2 are currently very tiny and difficult to read.

      7. Additional information is required to describe the trajectory plots in Figure 2 Supplement 1, Figure 3 Supplement 3 and Figure 4 Supplement 3. I assume that blue trajectories move in the positive x direction while orange trajectories move in the negative x direction, but this is currently not specified in any of the legends.

      Significance

      Giese et al advance current understanding of the molecular mechanisms that guide vein to artery migration by convincingly demonstrating (both in-vivo and in-vitro) that Cdc42 is essential for this process. In addition, they present a computational model that captures the dual force field of VEGFA and shear stress gradients to simulate and quantify this migratory process in the mouse retina: a tool which will likely be useful for future mechanistic studies of vascular remodeling and EC migration. As far as I am aware, no standard coordinate system exists in the field for the quantification and modeling of this migratory process, so the introduction of this method alone serves as a useful innovation for the field of vascular biology.

      • It should be noted by authors and the editors that the mathematical details of the computational model and its statistical accuracies are not evaluated in this review, which instead focuses on the study's findings as they relate to EC biology and vascular development.

      • This study complements previous work that has identified populations of ECs that appear to be primed for incorporation into arteries, termed "pre-arteries" (Su et al 2018, Phansalkar 2021, Luo 2021). The authors' observation of heterogeneous EC movements during migration supports the intriguing suggestion that actin regulation through Cdc42 may be related to the establishment or phenotype of this pre-artery identity, though further mechanistic work will be required to validate this hypothesis, as noted by Giese et al in their thoughtful discussion.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Giese et al. use genetic lineage tracing techniques and novel computational analysis methods to quantify and predict how the spatial distribution of ECs changes over time in the developing mouse retina from P5 to P9. They also develop mathematical models to describe and predict the response of ECs to the hypothesized dual force-field formed by the chemoattractant VEGFA, and the flow-induced shear stress. Given that the mouse retina cannot be live imaged, these new methods are essential to infer cell dynamics from static images. With these methods the authors confirm previous findings that arteries are formed by endothelial cells derived from veins, or tip cells, or capillaries. They then combine their genetic system with Cdc42 and Rac1 floxed alleles to understand the function of these genes in EC mobilization. They find that Cdc42 has a very strong role in EC mobilization to arteries, not so much to the sprouting front, whereas the loss of Rac1 has relatively minor effects in vivo. Loss of Rac1 slows the cells but they maintain their directionality towards arteries. The discussion section and integration of these paper findings with previous work in the field is excellent. Overall, this work provides a much higher level of quantitative analysis of endothelial cell dynamics in the developing vasculature of the mouse retina. It also provides mathematical models that can be useful to explain and predict the impact of other genetic mutations or pharmacological interventions during vascular development.

      Major comments:

      1 - This work provides one of the finest examples of quantitative biology in the vascular biology field. The conclusion on cell dynamics is however largely based on static images measurements and pre-defined mathematical models given also previous work and the proposed model of a dual-force field. The authors conclude that sprouting front cells mainly migrate away towards VEGF, whereas remodelling plexus cells migrate towards arteries. However, this is based on the entire EC population measurements/displacement and averaging, and does not account for the possibility of a few ECs having a different behaviour from most of their neighbours. This comment is related with the fact that arteries are formed mainly by tip-derived ECs, the cells closest to the VEGF source and further away from the flow/shear stress force. It seems the authors model presented here would not allow this to happen. According to the model presented, it seems that an EC close to the VEGF source, and subjected to low flow (shear stress), would always migrate to the front and never turn back towards arteries. Can a more complex model enable the consideration of the stochastic loss of VEGF signalling (or gain of shear stress sensing) by some ECs at the sprouting front ? And their subsequent formation of arteries ?

      2 - Previous work by Laviña et al showed that Cdc42 is required for the migration of ECs to the sprouting front. The authors data suggest that Cdc42 is not necessary for this process. Could the difference between previous and the authors results be technical and related with the different stage of induction/analysis or the extent of Cdc42 deletion ? Did the authors tried inducing at P1 and collecting at P6/P7 ? The reporters used were also different and they may have different sensitivities to tamoxifen (and hence report Cdc42 deletion differently).

      3 - In the last section, some of the junctional/polarity/actin markers and analysis done in vitro could be also done in vivo.

      4 - The extent of Rac1 deletion in the mosaic experiments (done with suboptimal doses of tamoxifen) could be analysed. This is especially relevant since minor effects for Rac1 were observed in these in vivo experiments.

      Minor comments:

      1 - Lee et al., 2022 is a review. Better cite the original papers if possible: Some examples: Xu et al., 2014, Pitulescu et al 2017 and Lee et al., 2021.

      2 - Figure 1A: Stage of induction with tamoxifen is missing. Likely P5.

      3 - Figure 3 and 4 data would be easier to compare/understand by readers if part of the Wt data in Figure 1 was also plotted here. Or at least a Wt trend/average line on top of the mutant data, for us to see how much Cdc42 or Rac1 deletion changes the behaviour of the mutant cells versus the Wt cells.

      4 - Overall, for all dot plots and heatmaps, would be better to indicate the total number of cells analysed/plotted since the power of the analysis is related with cell number rather than number of retinas.

      Significance

      Significance

      General assessment: This paper is very strong on the quantitative analysis and mathematical modelling. Methods used and the model proposed may be of broad relevance for the field of vascular biology. It is however based on certain author-defined parameters and assumptions. EC dynamics in vivo can be much more complex than what can be modelled by equations. For example, heterogenous single cell genetics and signalling inputs can induce changes in cells that override the normal/average behaviour of the cells that are modelled. Despite the high level of quantitative analysis and modelling, the main findings here presented are not entirely novel, given previous work. For example, it was previously known that arteries are formed by vein/tip/capillary cells. It was also known that Cdc42 was required for proper EC migration away from veins (Laviña et al., 2018). However, the better quantitative analysis here presented does provide a higher level of detail and reliability. The mosaic genetics used to delete Cdc42 is in general clear since few reporter positive cells can make arteries, suggesting efficient deletion of this gene. The data also goes in line with previous work. However for Rac1, given that a much weaker phenotype was observed, is not possible to be sure that all GFP+ ECs had deletion of Rac1. This is especially important in mosaic genetic experiments, using a suboptimal dose of tamoxifen. The extent of Rac1 deletion in GFP+ cells was not analysed. Which leaves the open question if Rac1 is really dispensable for EC migration and arterialization. Embryos lacking Rac1 in endothelial cells die early during development. Therefore ECs fully lacking Rac1 may have stronger defects than the ones shown here. All this data was obtained in the postnatal retina angiogenesis system. Other organ vessels may develop differently. Future work will tell if the models proposed can explain the dynamics of ECs during the growth of other vascular beds.

      Audience: Vascular/Cell biology researchers and bioinformaticians developing tools for image analysis and/or cell migration/dynamics modelling.

      Expertise of Reviewer: Vascular Biology. I do not have sufficient expertise to evaluate the mathematical modelling part of this paper.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Planned Revisions based on comments from Reviewer #1

      • The introductory material and the title of the paper emphasize the ring canal scaling question. This problem is somewhat obscured in the text by the side problem of nuclear scaling, which comes up frequently even though the results are not as thoroughly explored. Could the authors think about moving these data into a different, single figure for the sake of coherence? This is not a required revision. Just a thought.
      • *We have moved the nuclear scaling data from Fig. 5 into Fig. S3, and once we have analyzed the data from the planned experiments (over-expressing either HtsRC or the active form of myosin), then we will have a better idea of whether we should move the rest of the nuclear scaling data out of the main part of the paper, consolidate it into a single figure (as Reviewer #1 suggests), or keep some of it in the main figures. *

      Planned Revisions based on comments from Reviewer #3

      • I cannot see differences in RC size in the panel A images. More importantly, this method altering ring canal size is limited. A more direct way is overexpression of HtsRC (https://doi.org/10.1534/genetics.120.303629).
      • We have requested and just recently received the line to over-express HtsRC in the germline. We plan to cross this UAS line to the mataTub-GAL4 which expresses GAL4 beginning around stage 3 of oogenesis. Because crossing this UAS line with this GAL4 line produced egg chambers with larger ring canals in the original study2*, we do not anticipate any technical issues with this experiment. We will incorporate the results from analysis of these egg chambers in the revised manuscript. *
      • To further explore the effect of ring canal size on scaling, we will also be testing a condition that we hope will have the opposite effect on ring canal size; expression of a phosphomimetic version of the non-muscle myosin II regulatory light chain, encoded by spaghetti squash (Sqh)(UAS-sqhE20E21). We plan to cross this UAS line to two different GAL4 drivers (nos-GAL4, which expresses GAL4 in a pulse during early oogenesis and then in another pulse in mid-oogenesis and the mataTub-GAL4 which expresses GAL4 beginning around stage 3 of oogenesis). We know that expression of sqhE20E21will reduce the size of the ring canals that connect the nurse cells to each other, but it is possible that the posterior ring canals will not show a strong phenotype. In a study that looked at egg chambers homozygous for a mutation in the myosin binding subunit of the myosin phosphatase, DMYPT, which should also increase sqh phosphorylation, it was shown that the posterior ring canals were larger than those connecting nurse cells 1*. Therefore, it is possible that this condition may not allow us to consistently reduce the size of all ring canal types; however, if we do see a significant reduction in posterior ring canal size in these egg chambers, we will include these data in the revised manuscript. *

      • In panel 2E, it would be helpful to plot the y-intercepts separately, too.

      • Based on the analysis of the data from the proposed experiments, we will consider plotting the y-intercepts separately for the various conditions.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      Revisions made based on comments from Reviewer #1

      • One way to think about the dhc-64C experiments presented in Figure 2 is that they are meant to test the hypothesis that ring canal size impacts scaling in such a way that transport across the four ring canals tends towards equilibrium over time. One possibility would therefore be that ring canals aren't programmed to grow to a particular final size but rather they grow at different rates until their diameters are the same. This seems to me an important distinction. It might be made by analysis of the arpC2-RNAi cells, since those ring canals are meant to be initially larger. Unfortunately, I can't see the answer.
      • *Reviewer #3 suggested determining the ratio of the diameter of the M1 ring canal to the M4 ring canal. If ring canals grow toward equilibrium (to achieve a similar final size), then we would expect to see this ratio approach 1; when we performed this analysis, we saw that the ratio did decrease as the egg chambers increased in volume, but it never quite reaches a ratio of 1. We have added a supplemental figure (Fig. S1) showing these data and incorporated this idea into the text within the results and discussion sections. *
      • *Although it would be informative to determine whether ring canals that all started with a similar diameter would grow at the same rate, we have not found a condition that would provide the opportunity to test this hypothesis. We hope that the planned experiments will provide us with a way to test this hypothesis; we will determine the M1/M4 ratio in egg chambers over-expressing either HtsRC or sqhE20E21 and see whether this ratio still decreases as egg chamber volume increases. *
      • *Once we perform the planned experiments to either increase or decrease ring canal size, then we can determine whether we need to further modify Fig. 3 to highlight these size differences between ring canals in the arpC2-RNAi egg chambers or whether we will instead focus more on the results of the planned experiments. *

      • The authors write that arpC2-RNAi "ring canals tended to be larger than those in similarly-sized control egg chambers," but that conclusion isn't obvious to me from the data in Figure 3B. The only difference I can see is that the M4 ring canals look to be consistently smaller in the experimental versus control egg chambers, especially at the final timepoint.

      • *To further clarify the difference in ring canal size between the control and the arpC2-RNAi egg chambers, we have added additional explanation to the results section to highlight that the y-intercepts of the lines of best fit are significantly higher in the arpC2-RNAi egg chambers at each stage. This demonstrates that given an egg chamber volume, the ring canals will be larger in egg chambers depleted of ArpC2 than in the controls. *

      • The authors write that "there was a consistent, but not significant decrease in the scaling exponents for the arpC2-RNAi egg chambers compared to controls," but I don't see this in the M1 (identical) or M2 (almost the same) ring canals. The scaling decrease is most pronounced at M4. All the other ring canals seem to reach a final size that's equivalent to controls. What does this tell us about scaling? Is the M4 more sensitive to the effect of arpC2-RNAi? I note and appreciate that the data for M4 show a wide distribution and might have been impacted by outliers, which could be discussed.

      • *We have separated the arpC2-RNAI ring canal scaling data by lineage (Fig. S2), and we have color-coded the data in Fig. 3B (as suggested by Reviewer #3). *
      • We have expanded the discussion of these results and their implications, and we have added a line in the results section to address this wide distribution of the M4 ring canal sizes.

      • The possibility that ring canal scaling "could generate eggs of different sizes" could use some elaboration (at least) as it does not seem to be especially well supported:

      • Only one of the small egg lines had lower scaling exponents than the big egg lines, and it's a struggle for me to understand the extent of that difference based on the data shown. (Is it significant?).
      • *We have restructured this section of the results and modified Fig. 5 to highlight similarities and differences between the four lines. In the results section (and in the figure legend), we have stated that when we compared the slopes of the regression lines for all four lines, there was a significant difference for M1, M2, and M4 (Fig. 5C, D, and F). We have also modified the results section to highlight that although the slopes for line 9.31.4 was not different from the two big egg lines, the intercepts were significantly different for M1, M2, M3, and M4 ring canals. We moved the nuclear scaling data to Fig. S3 to simplify the figure. *

      • The authors conclude that "the effect of lineage on ring canal scaling is conserved, and it suggests that at least in one line, reducing posterior ring canal scaling could provide a mechanism to produce a smaller mature egg." The first part of this sentence is confusing for me since I don't know what is meant here by "conserved." The second part of the sentence is technically correct but disguises what I would consider the more meaningful and exciting finding. The 9.31.4 line produces the smallest eggs but does not demonstrate scaling differences in comparison to the big egg lines examined (1.40.1 and 3.34.1). The authors have therefore avoided/solved a "chicken and egg" ("fruit fly and egg"?) problem by showing that scaling and egg size can be decoupled!

      • We have modified the first part of the sentence to clarify our point. We appreciate this suggestion and have modified the text in the results section to further elaborate on the results.

      • This point is not made very clearly in the discussion, which concludes with the suggestion that scaling could help explain why some insects produce much larger or much smaller eggs that fruit flies. I can only understand this to be the case if - as the authors point out - scaling "affect the directed transfer of materials into the oocyte." That argument seems predicated on the possibility that these insects make the same amount of initial material then regulate how much is transferred. Seems like a costly way to go about it.

      • *We have modified this section of the discussion. *

      • I really had to look very closely to distinguish the little blue boxes from the little blue circles in panels 2C and especially 2D. I suggest using a different color instead of a different shape, or maybe splitting the graphs up.

      • *We have made the shapes larger in Fig. 2C (nuclear sizes), and we have split the ring canal size data into Fig. 2D, E and made the shapes larger. The legend has been modified to reflect this change. *

      • "Depletion of the linker protein, Short stop (Shot), or dynein heavy chain (Dhc64C), significantly reduced the biased transport at the posterior, which reduced oocyte size (Lu et al., 2021)." I suggest this sentence might be clearer if it was rewritten as "Depletion of either dynein heavy chain (Dhc64C) or the linker protein Short stop (Shot) significantly reduces biased transport at the posterior, in turn reducing oocyte size (Lu et al., 2021).

      • We have made this change.

      • "Because nuclear growth has been shown to be tightly coupled to cell growth (Diegmiller et al., 2021), we can use nuclear size as a proxy for nurse cell size." I think it would help the reader to know that the Diegmiller study was performed using germline cysts in the Drosophila ovary; I paused when I got to this sentence because I initially read it as overly broad. I suggest "Recent work in demonstrates that nuclear growth is tightly coupled to cell growth in this system (Diegmiller et al., 2021), and we can therefore use nuclear size as a proxy for nurse cell size" or similar. This is certainly not a required revision, just a suggestion.

      • We have made this change.

      Planned Revisions based on comments from Reviewer #3

      • Reviewer #3 asked: Does the ratio of the diameter of M1 to M4 stay the same?
      • *We have performed this analysis in the control egg chambers (from Fig. 1), and we found that the ratio does not stay the same, but that it tends to decrease as the egg chamber increases in volume. We plotted the log of egg chamber volume versus this ratio, and the equation for the regression line was y = -0.166x + 2.32, which was significantly different from a slope of 0 (included in Fig. S1). *

      • It would be helpful to explain that the log-log plots were used to derive a line equation (y=mx + b) and why that is useful in this context. In the case of a log-log plot, what does the y-intercept mean biologically? Is it simply a way to compare two things or does it indicate real measurements such as volume or ring canal size? Also, the slope of the line is being used as a scaling value. Be careful to define the terms "scaling" and "scaling exponent".

      • We have added additional explanation in the results section.

      • Are four significant digits called for in calculating slope? The figure has 4 significant digits, the text has three.

      • *We have modified all figures and text to include only 3 significant digits. *

      • Why is isometric scaling 0.66 - is that microns squared over microns cubed? Please explain.

      • We have added additional explanation to the results section.

      • Were all four posterior nuclei measured? The figure indicates just M1 and M4.

      • We apologize that it was not clear that all four posterior nuclei were measured in Fig. 1. For the sake of space, we only showed images of the M1, M4, and Anterior ring canals and nuclei (in Fig. 1A), but all four nuclear measurements were included in the graph in Fig. 1B. We have added M1-M4 to the legend to clarify and revised the text of the legend.

      • It is hard to explain why all four posterior nuclei are bigger than anterior when one of the four is the same age as the anterior nucleus.

      • The posterior nuclei are larger than the anterior nuclei due to their proximity to the oocyte. Multiple recent studies have described this hierarchical nurse cell size relationship in which the nurse cells closest to the oocyte are larger than those separated from the oocyte by additional intercellular bridges 3–5*. *

      • In panel D, a conclusion is, "Further, the scaling exponent [slope] for the anterior ring canals, which are also formed during the fourth mitotic division, was not significantly different from that of the posterior M4 ring canals". Anterior is 0.23, M4 is 0.25. These seem different to me. How is significance determined? Were any of the scaling exponents in M1, M2, M3, M4 or Anterior significantly different?

      • *Significance was determined within the Prism software using a method equivalent to an ANCOVA. If the slopes are compared, M1 is significantly different from M2, M3, and M4, and M2 is significantly different from M4. M4 is not significantly different from the slope for the anterior ring canals, which supports the correlation between scaling and lineage. *

      • References are needed for the statements about biased transport to the oocyte.

      • *There was a reference to the Lu (2021) paper in that paragraph, but we have added an additional reference to that paper to this part of the results section. *

      • In panel 2C, why are the scaling exponents (slopes) of the controls bigger than in Figure 1B? The controls look hyper allometric in Fig. 2.

      • *This experiment was done with a different GAL4 driver, so it is possible that there are some differences in scaling based on genetic background. *

      • In panel 2D it is impossible to pick out the control posterior vs anterior lines - use different colors as in Figure 1. Why do the control lines for posterior and anterior merge?

      • *We have split the ring canal scaling data from Fig. 2D into different separate panels (Fig. 2D,E), as suggested by Reviewer #1. *
      • These lines likely approach each other because the slope of the line for the anterior ring canals (M4 type) is always larger than the slope for the combined posterior ring canals.

      • Re: Fig. 3: Scaling of what? RC size?

      • *We assume that this comment is related to the heading for this section of the results, so we have added “ring canal to the end of this title, so that it now reads: “Increasing initial ring canal size does not dramatically alter ring canal scaling” *

      • Since there was no effect, "dramatically" should be deleted from the section title.

      • This change has been made.

      • Clarify this sentence: If ring canal size inversely correlates with scaling, then increasing initial ring canal diameter should reduce the scaling exponent.

      • We have made this change in the text.

      • How does panel B show that RCs are larger in arpC2 KD? Fig. S1A has smaller y-intercept for control. Again, it is impossible to see which lines go with which M and which genotype.

      • *As mentioned above, we have modified Fig. 3 to highlight these differences and added additional explanation to the results section. *

      • Panels 4D & 4G are clear - should include significance indications.

      • *We have added asterisks to indicate significant differences. *

      • The conclusion from panels 5B and 5C that reducing RC scaling could lead to smaller mature eggs is a stretch. Without looking at the rest of the lines these data are preliminary and detract from the rest of the paper.

      • *As suggested by Reviewer #1, we have modified the results and discussion sections, and we have added a statement about the need for analysis of additional lines. *

      2. Description of analyses that authors prefer not to carry out

      Comment from Reviewer #2

      • I am surprised that the author has not considered controlling the impact of cell cycle regulation on this scaling process, especially as the work of Dorherty et al. has shown that this type of regulation is essential for regulating the size of nurse cell nuclei. The authors should test the impact of at least dacapo and cyclin E in this process.
      • We have attempted to deplete Dacapo from the germline by crossing two different RNAi lines to multiple germline drivers; however, we have been unable to see a consistent effect on nurse cell nuclear size, which suggests that these RNAi lines may not effectively reduce Dacapo protein in the germline. Although we agree with the reviewer that this is an obvious mechanism that should be explored, we believe that it is not necessary for it to be included in this manuscript, because altering Dacapo levels in the germline would not provide a mechanism to explain our model that ring canal lineage impacts ring canal scaling. Dacapo has been shown to contribute to the hierarchical pattern of nurse cell size observed in the germline. Dacapo mRNA produced in the nurse cells is transported into the oocyte, where it is translated. Then, the Dacapo protein diffuses back into the nurse cells, producing a posterior to anterior gradient 4. Doherty (2021) showed that reducing the levels of the Dacapo protein using the deGradFP system eliminated the nurse cell size hierarchy. If our data had supported a model in which proximity to the oocyte was a strong predictor of ring canal size and scaling (as shown for the nurse cells and their nuclei3,5*), then this would have been an excellent way to dig further into the mechanism. Instead, our data supported a role for ring canal lineage in predicting ring canal growth, since the M4 ring canals at the posterior and anterior showed similar scaling with egg chamber volume. *
      • We believe that performing the proposed experiments (over-expressing HtsRC to increase ring canal size or expressing the phosphomimetic form of the myosin regulatory light chain, sqhE20,E21 to reduce ring canal size) will allow us to determine how ring canal size affects scaling, which will provide additional mechanistic insight into this scaling behavior.*

      *

      Comment from Reviewer #3

      • Panel 3E is interesting and would fit better in Figure 1.
      • *This panel is from a different genetic background than the data in Fig. 1. Therefore, we do not think it would be appropriate to move it to Fig. 1. *
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Coordinating subcellular size of organelles is important for normal cell function. This research investigates how the size of intercellular bridges called ring canals in Drosophila egg chambers is regulated. The work builds on previous research in Change Tan's lab that reported ring canal size varies by lineage - the ring canal from the first germ cell mitotic division (M1) is larger than those resulting in subsequent divisions during egg chamber formation. Shaikh et al. probe this linkage between lineage and ring canal size during egg chamber development by testing the relationship with flow through the ring canals, initial ring canal size, and number of mitotic divisions. They also investigate whether ring canal size per se affects egg size. The quantification is carefully done.

      The results strongly reinforce the overall conclusion that lineage is the main driver of ring canal size. They report that growth of younger ring canals is slightly faster than old ones, suggesting a "catch-up" mechanism. However, neither the flow through ring canals nor their initial size has an apparent effect on the relationship with lineage.

      General comments:

      The authors derive scaling relationships between nurse cell or ring canal size and egg chamber volume to address their hypotheses. The most interesting observation is the relatively accelerated growth of young versus old ring canals attaching the oocyte to nurse cells, each from a different mitosis. Another perhaps simpler way to do this is to determine the ratios of the diameters of M1 to M4 ring canals as egg chambers develop. Does the ratio stay the same?

      Specific comments:

      Fig. 1:

      1. For those who forgot their algebra, it would be helpful to explain that the log-log plots were used to derive a line equation (y=mx + b) and why that is useful in this context. In the case of a log-log plot, what does the y-intercept mean biologically? Is it simply a way to compare two things or does it indicate real measurements such as volume or ring canal size? Also, the slope of the line is being used as a scaling value. Be careful to define the terms "scaling" and "scaling exponent".
      2. Are four significant digits called for in calculating slope? The figure has 4 significant digits, the text has three.
      3. Why is isometric scaling 0.66 - is that microns squared over microns cubed? Please explain.
      4. Were all four posterior nuclei measured? The figure indicates just M1 and M4. It is hard to explain why all four posterior nuclei are bigger than anterior when one of the four is the same age as the anterior nucleus.
      5. In panel D, a conclusion is, "Further, the scaling exponent [slope] for the anterior ring canals, which are also formed during the fourth mitotic division, was not significantly different from that of the posterior M4 ring canals". Anterior is 0.23, M4 is 0.25. These seem different to me. How is significance determined? Were any of the scaling exponents in M1, M2, M3, M4 or Anterior significantly different? Fig. 2: Less flow through M4 drives faster RC growth? No.
      6. References are needed for the statements about biased transport to the oocyte.
      7. In panel C, why are the scaling exponents (slopes) of the controls bigger than in Figure 1B? The controls look hyper allometric in Fig. 2.
      8. In panel D it is impossible to pick out the control posterior vs anterior lines - use different colors as in Figure 1. Why do the control lines for posterior and anterior merge?
      9. In pane E, it would be helpful to plot the y-intercepts separately, too. Fig. 3: increasing initial RC size does not dramatically alter scaling.
      10. Scaling of what? RC size?
      11. Since there was no effect, "dramatically" should be deleted from the section title.
      12. Clarify this sentence: If ring canal size inversely correlates with scaling, then increasing initial ring canal diameter should reduce the scaling exponent.
      13. I cannot see differences in RC size in the panel A images. More importantly, this method altering ring canal size is limited. A more direct way is overexpression of HtsRC (https://doi.org/10.1534/genetics.120.303629).
      14. How does panel B show that RCs are larger in arpC2 KD? Fig. S1A has smaller y-intercept for control. Again, it is impossible to see which lines go with which M and which genotype.
      15. Panel E is interesting and would fit better in Figure 1.

      Fig. 4: Additional mitotic division doesn't affect RC or nuclear scaling. 16. Panels D & G are clear - should include significance indications.

      Fig. 5: small and big lines 17. The conclusion from panels B and C that reducing RC scaling could lead to smaller mature eggs is a stretch. Without looking at the rest of the lines these data are preliminary and detract from the rest of the paper.

      Significance

      Overall, this work is an extension and reinforcement of information previously available rather than providing significant new insight. Researchers in Drosophila oogenesis will be interested.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Umayr Shaikh and colleagues study the evolution of the size of subcellular structures during tissue growth. To do this, the authors use the Drosophila egg chamber as a model system, studying the growth rate of the intercellular bridges, also known as ring canals, connecting the oocyte to the nurse cells and the growth rate of the nurse cell nuclei during the development of the egg chamber. In particular, they focused their study on the ring canals and the nuclei of the 4 nurse cells, both of which are in direct contact with the oocyte but have a different lineage history. They show that first born ring canal ring grow more slowly than smaller ring canal that are the result of subsequent mitotic divisions. This scaling process is maintained when polarised transport between the nurse cells and the oocyte is reduced by decreasing the level of dynein. They demonstrate that manipulation of the size of the ring canals by arpC2 RNAi does not radically alter scaling. Furthermore, by inactivating an uncharacterised gene CG34200, they show that additional mitotic division does not affect the scaling of the annular canal and nucleus.

      Significance

      Major comment

      These results are based on new and original observations. The results are clear and well documented. However, this work is very descriptive in its current state and in the absence of a mechanism for this lineage-based scaling process.

      I am surprised that the author has not considered controlling the impact of cell cycle regulation on this scaling process, especially as the work of Dorherty et al. has shown that this type of regulation is essential for regulating the size of nurse cell nuclei. The authors should test the impact of at least dacapo and cyclin E in this process.

      Without a mechanism for the scaling process, this manuscript is more suitable for a specialize journal

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The relationship between nuclear size and cell size has received a lot of attention. Less well-recognized is the problem of how other structures within the cell scale with size. This is a good question and the authors have a nice system - Drosophila female germline cysts - in which to study it. Here the authors show that lineage impacts ring canal scaling. This is an interesting finding that makes for neat biology; a simple way to think about it is that older ring canals have more time to mature (grow) than younger ones. The manuscript is beautifully and carefully written! It was fun to read. The experiments are straightforward, performed to a high standard, and generally well-presented.

      Comments:

      One way to think about the dhc-64C experiments presented in Figure 2 is that they are meant to test the hypothesis that ring canal size impacts scaling in such a way that transport across the four ring canals tends towards equilibrium over time. One possibility would therefore be that ring canals aren't programmed to grow to a particular final size but rather they grow at different rates until their diameters are the same. This seems to me an important distinction. It might be made by analysis of the arpC2-RNAi cells, since those ring canals are meant to be initially larger. Unfortunately I can't see the answer. The authors write that arpC2-RNAi "ring canals tended to be larger than those in similarly-sized control egg chambers," but that conclusion isn't obvious to me from the data in Figure 3B. The only difference I can see is that the M4 ring canals look to be consistently smaller in the experimental versus control egg chambers, especially at the final timepoint. Related to this concern, the authors write that "there was a consistent, but not significant decrease in the scaling exponents for the arpC2-RNAi egg chambers compared to controls," but I don't see this in the M1 (identical) or M2 (almost the same) ring canals. The scaling decrease is most pronounced at M4. All of the other ring canals seem to reach a final size that's equivalent to controls. What does this tell us about scaling? Is the M4 more sensitive to the effect of arpC2-RNAi? I note and appreciate that the data for M4 show a wide distribution and might have been impacted by outliers, which could be discussed.

      The possibility that ring canal scaling "could generate eggs of different sizes" could use some elaboration (at least) as it does not seem to be especially well supported:

      • Only one of the small egg lines had lower scaling exponents than the big egg lines, and it's a struggle for me to understand the extent of that difference based on the data shown. (Is it significant?).
      • The authors conclude that "the effect of lineage on ring canal scaling is conserved, and it suggests that at least in one line, reducing posterior ring canal scaling could provide a mechanism to produce a smaller mature egg." The first part of this sentence is confusing for me since I don't know what is meant here by "conserved." The second part of the sentence is technically correct but disguises what I would consider the more meaningful and exciting finding. The 9.31.4 line produces the smallest eggs but does not demonstrate scaling differences in comparison to the big egg lines examined (91.40.1 and 3.34.1). The authors have therefore avoided/solved a "chicken and egg" ("fruit fly and egg"?) problem by showing that scaling and egg size can be decoupled!
      • This point is not made very clearly in the discussion, which concludes with the suggestion that scaling could help explain why some insects produce much larger or much smaller eggs that fruit flies. I can only understand this to be the case if - as the authors point out - scaling "affect[s] the directed transfer of materials into the oocyte." That argument seems predicated on the possibility that these insects make the same amount of initial material then regulate how much is transferred. Seems like a costly way to go about it.

      Minor comments:

      I really had to look very closely to distinguish the little blue boxes from the little blue circles in panels 2C and especially 2D. I suggest using a different color instead of a different shape, or maybe splitting the graphs up.

      The introductory material and the title of the paper emphasize the ring canal scaling question. This problem is somewhat obscured in the text by the side problem of nuclear scaling, which comes up frequently even though the results are not as thoroughly explored. Could the authors think about moving these data into a different, single figure for the sake of coherence? This is not a required revision. Just a thought.

      I have two trivial comments regarding sentence structure in the text: "Depletion of the linker protein, Short stop (Shot), or dynein heavy chain (Dhc64C), significantly reduced the biased transport at the posterior, which reduced oocyte size (Lu et al., 2021)." I suggest this sentence might be clearer if it was rewritten as "Depletion of either dynein heavy chain (Dhc64C) or the linker protein Short stop (Shot) significantly reduces biased transport at the posterior, in turn reducing oocyte size (Lu et al., 2021).

      "Because nuclear growth has been shown to be tightly coupled to cell growth (Diegmiller et al., 2021), we can use nuclear size as a proxy for nurse cell size." I think it would help the reader to know that the Diegmiller study was performed using germline cysts in the Drosophila ovary; I paused when I got to this sentence because I initially read it as overly broad. I suggest "Recent work in demonstrates that nuclear growth is tightly coupled to cell growth in this system (Diegmiller et al., 2021), and we can therefore use nuclear size as a proxy for nurse cell size" or similar. This is certainly not a required revision, just a suggestion.

      Significance

      The manuscript is beautifully and carefully written! It was fun to read. The experiments are straightforward, performed to a high standard, and generally well-presented. The problem it addresses is an important/useful complement to other studies on the relationship between nuclear size and cell size. The paper identifies and characterizes differential scaling of ring canals, raising exciting mechanistic questions that can be addressed in future studies. I think it will be of interest to an audience cell and developmental biologists.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      *

      *Major comments: 1. Mirc56_2 and 4 showed lower integration rates, and the authors suggest that this could be due to sgRNA pool imbalance. The authors should validate this by performing sequencing of the input sgRNA and cassettes. *

      →Thank you very much for your comment, and we agree with your suggestion.

      We are going to confirm sgRNA pool imbalance in donor vector library by amplicon short-read NGS.

      In addition, to confirm another possibility that we raised, we re-sequenced sgRNA donor vector for Mirc56_2 and 4, and will add the following sentences:

      “We firstly doubted that their low integration frequencies were caused by any mutations on PB transposon of sgRNA donor vector, on especially ITR or ID that are important for integration efficiency [PMID: 15663772]. Therefore, we sequenced PB transposons for Mirc56_2 and 4 again. However, we could not find any mutations on their PB transposon.” following to “…efficiency or cell growth.” in the Discussion (page14, line 346)

      *2. Clonal analysis in Figure 5c is unclear a. Figure 5c indicates that all changes were homozygous (e.g. both alleles were deleted). Was this the case in all clones? Or were some mutations heterozygous? *

      →Thank you very much for your comment.

      We apologize for the misleading context.

      We targeted mono allele on X chromosome in male mES cells so that all mutations should be hemizygous as mentioned in the Result (page11, line 259-260)

      To enhance our study is monoallelic assessment, we will add the following sentence:

      “This study targeted mono allele on X chromosome in male mES cells so that all genotype on Mirc56 should be hemizygous and these mutations induced might be cis-mutation.” following to“…owing to six tandem repeats [37]” in the Result (page12, line 302)

      *b. Many clones in Figure 5c show that the entire region was deleted (all black dots). Could this be due to some experimental error or misinterpretation of the sequencing data, or could it be validated using some orthogonal method? This is especially surprising for clones in which the final guide (Mirc56_13) was not detected yet the final site (Mirc56_13) was reported as "Regional deletion". *

      →Thank you very much for your comment.

      We apologize for the misleading context.

      Firstly, we just confirmed and sequenced the mature-miRNA genomic regions by amplifying approximately 200 bp around the target sites. Therefore, we defined unamplified regions as “miRNA deletion”. In addition, to make the Figure 5C easy to understand, we added “predicted regional deletion” and each name of clones as attached.

      In fact, only 4 clones harbored entire Mirc56_X deletions on all analysed Mirc56_X genomic region (Mirc56_1 to 13). Besides, these clones could be PCR-amplified by sgRNA cassettes and Sry on Y chromosome so that these results suggested we could successfully obtain their genomic DNA and at least mature-miRNA genomic regions were deleted.

      Moreover, Mirc56_13 deletions without target sites on Mirc56_13 are always within predicted regional deletions that are induced from upstream and downstream of sgRNA target sites. Therefore, it could be estimated that these deletions were induced from the target sites on Mirc56_14, 15, 16, or 17 and upstream of Mirc56_13.

      To clarify them, we will add the following sentences:

      • “Four clones (#2_066, #1_021, #1_029 and #1_046) harboured entire Mirc56_X deletions on all analysed Mirc56_X genomic region. In addition to these clones, only 3 pairs (#2_019 and #2_084, #2_038 and #1_023, #1_016 and #1_027) harboured same combination of mutations.” following to “…combinations of mutations (Figure 5C).” in the Result (page16, line 378-380)
      • “Meanwhile, focusing on relationship between mutations and target sites that targeted by sgRNA cassettes in each clone, all Mirc56_X genomic regions harbouring Indel mutations were target Micr56_X In addition, if sequential Mirc56_Xs on the genome were deleted, the most upstream and downstream of Mirc56_Xs deleted were always on the target Mirc56_X sites except for #2_025 and #1_41.” following to “…combinations of mutations (Figure 5C).” in the Result (page12, line 304)
      • “Genotyping PCR amplified approximately 200 bp around the mature-miRNA genomic region. Unamplified region is defined as miRNA deletion (Black circle) and amplified region was determined as Indel mutation (Gray circle) or Intact by short-NGS. If sequential Mirc56_Xs on the genome were deleted, black translucent square indicates predicted regional deletion assumed that the genomic region flanked by miRNA deletions was also deleted. Besides, if miRNA deletion was induced in Mirc56_13 and the clone have target Mirc56_X on Mirc56_14, 15, 16, or 17” following to “…in each PB mES clone.” in the Figure legend (page23, line 575) Moreover, because we defined “miRNA deletion”, we will change ”regional deletion” to “miRNA deletion” where I mean “deletion of the mature-miRNA genomic regions” in the Result (page13, line 312) and the Discussion (page14, line 363)

      *3. Next-generation targeted sequencing of clones should be made publicly accessible. *

      →Thank you very much for your comment. We apologize for the inconvenience.

      We already informed Review commons that we made publicly available.

      We already described BioProject ID PRJNA996747 in the Data Availability (page16, line 383-384)

      4. OPTIONAL - Cassette integration number is understudied. One important aspect of tiling mutagenesis is the control over how many guides are present in each cell. The authors report an average of 4.7 cassettes/cell. This could be modulated by the amount of donor vector added, and indeed the authors performed titration experiments, but only with a fluorescent reporter readout. It would be very useful to know how the concentration of donor vector corresponds to the number of cassettes/cell - perhaps genotyping of clones from one or two additional experiments would be sufficient.

      → Thank you very much for your suggestion.

      We agree that cassette integration number is one important aspect of tiling mutagenesis.

      To investigate how many copies our concentration of donor vector could integrate, we are going to check actual copy numbers in several clones by qPCR.

      We think that it is other research to confirm “how the concentration of donor vector corresponds to the number of cassettes/cell”. The correlation might not be liner due to transposase overproduction inhibition (OPI) so that it would require huge amounts of experiments to confirm it. Our research is how CTRL-Mutations induce diverse mutations but not how property PB system have.

      Minor comments: 1. The background fails to acknowledge the work of CRISPR-Cas tiling screens (e.g. https://doi.org/10.1038/nbt.3450) or CRISPR-Cas in creating mutagenesis in cell lines (e.g. https://doi.org/10.1007/978-1-0716-0247-8_29*) *

      → Thank you very much for your suggestion, and we agree with your suggestion.

      We will add to acknowledge previous studies for CRISPR-Cas tilling screens.

      • “Recently, targeted mutagenesis combined forward genetics and reverse genetics has been developed such as saturating mutagenesis and tiling mutagenesis that induce random mutation within target gene(s) [PMID: 25141179, 31586052, 27260157, 28118392]. This targeted mutagenesis can construct a mutant library harbouring subtly different mutations within a target gene(s) so that comparative analysis through the mutant library can screen out critical mutation(s) for biological processes. These random mutagenesises have also revealed the function of numerous coding genes” following to “…list of coding genes [6–8].” in the Introduction (page3, line 55-56)
      • “In addition, the saturating mutagenesis are limited in the length of target region due to an approach basing homology-directed repair although it could introduce random mutations on donor template library harbouring any combination and variety of mutations [PMID: 25141179]. On the other hand, the tiling mutagenesis could expand target length in principle because the length depends on multiplex guide RNA (gRNA) designed to target genomic region. Therefore, tiling mutagenesis has been employed to identify critical regions embedded in cis-elements [PMID: 26375006, 30612741, 26751173, 27708057, 28416141, 31784727]. Tiling mutagenesis requires editor such as Cas9 or epigenetic modifier fused to catalytically dead Cas9 (e.g. KRAB-dCas9), and a library containing multiplex gRNA tiling across target genomic region. In general, random single guide RNA (sgRNA) expression cassette are integrated into chromosomes of host cells by retrotransposon system.” following to “…within a narrow region.” in the Introduction (page4, line 82) However, we do not agree that we have to acknowledge previous report about KI or KO by single or double cut in cell lines (as you suggested that https://doi.org/10.1007/978-1-0716-0247-8_29) because it is obvious knowledge. Therefore, we will not add this paper.

      2. Figure 1 left 'ROI random mutant PB mES cell' should be horizontally aligned so Mir_1, Mir_2 and MirX align with the upper figure.

      → Thank you very much for your kind comment, and we agree with your suggestion.

      Therefore, we changed it in the Figure 1.

      *3. It is interesting and unexpected that some guides never induce indels, even in the absence of a regional deletion (e.g. Mirc56_3, Mirc56_7). Why might this be? Was there perhaps an error in the assignment of these guides to these cells? *

      → Thank you very much for your comment.

      As you mentioned, Mirc56_3, 4 and 7 had no indel. We appreciate that we can correct our mistakes by your suggestion. We corrected Figure 5D as attached. In addition, we will correct average Mirc56_X site as 22.6 from 22.7.

      These sgRNA also induced miRNA deletion with low frequency (Mirc56_3: 38.9%, Mirc56_4: 25.0% and Mirc56_7: 68.0%, Figure 5D). Moreover, every deleted Mirc56_3, 4 and 7 was within predicted regional deletion except for target Mirc56_3 of PB mES clone #2_080 (revised Figure 5C).

      Therefore, we raised why some guides never induce indels even in the absence of a regional deletion, as “In addition to low frequencies, Indel mutation might disappear due to regional deletion if these sgRNAs could induce Indel mutation”.

      To clarify them, we will add the following sentences:

      • “In particularly, middle target sites such as Mirc56_3, 4 and 7 were induced only miRNA deletion or Intact (Figure 5D)” following to “…in our mutant library (Figure 5C, D).” in the Discussion (page14, line 364)
      • “In fact, every deleted Mirc56_3, 4 and 7 was within predicted regional deletion except for target Mirc56_3 of PB mES clone #2_080 (Figure 5C). In addition, these sgRNA induced mutation with low frequency (Mirc56_3: 38.9%, Mirc56_4: 25.0% and Mirc56_7: 68.0%, Figure S6). Therefore, we suspected that regional deletion and their low mutation introduction rate facilitated to disappear Indel mutation.” following to “…induced at target sites.” in the Discussion (page14, line 366)

        *4. Regarding Mirc56_2 and 4 integration, on line 34 the authors suggest that "We suspect this was caused by a technical error, such as an unequal amount of sgRNA donor vector or the sequence in sgRNA cassettes affecting integration efficiency or cell growth." sgRNA library imbalance would be a technical error, but integration affecting cell growth is not a technical error. This sentence should be reworded. *

      → Thank you very much for your comment.

      We apologize for the misleading sentence even though this paper was already English-reviewed by English language editor.

      We will reword that “We suspect this was caused by the sequence in sgRNA cassettes affecting integration efficiency or cell growth, or a technical error such as an unequal amount of sgRNA donor vector.” following to “…PB mES clones via FACS..” in the Discussion (page14, line 344)

      *5. Line 540 "ration" is the incorrect word - perhaps "ratio"? *

      → Thank you very much for your kind comment, and we are sorry for the typo.

      We will correct it in the Figure legend (page22, line 540).

      6. Plot 5b should be shown as a histogram rather than a swarm plot to show how many clones were in each category.

      → Thank you very much for your suggestion.

      In Figure 5B, we aimed to indicate the number of sgRNA cassette varieties in each clone but not distribution of the number of integrated sgRNA cassettes. Distribution of the number of integrated sgRNA cassettes in clone library matched with the frequency of target sites in Figure 5D.

      We already described the distribution data as “In addition, an average of 22.7 Mirc56_X sites … the same frequency except for the Mirc56_2- and 4-targeting cassettes.” in the Result (page13, line 312-315)

      *Reviewer #1 (Significance (Required)):

      1. General assessment: The authors are successful in creating clonal cell lines bearing a variety of mutations. Unfortunately, the cell lines also have transposase-mediated insertion events of the sgRNA cassettes at unknown positions in the genome, which will hamper the interpretability of any experiment using these cell lines. The authors fail to justify the use of the transposase and integration of the sgRNA, especially compared to lentiviral transfection or RNPs which would produce edits at the region of interest. Alternately, integrated sgRNA cassettes could have been excised with Flp recombinase as in https://doi.org/10.1007/978-1-0716-0247-8_29. *

      → Thank you very much for your suggestion.

      We agree that we did not mention why we choose PiggyBac system compared to lentiviral delivery.

      Therefore, we will add the following sentences:

      • “In general, random single guide RNA (sgRNA) expression cassette are integrated into chromosomes of host cells by retrotransposon system. To identify combinations of critical region embedded in target regions, it would require diverse combinations of mutations or inactivation sites. To induce multiple mutations or inactivated sites, it requires multiple sgRNA cassettes integration. However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis [PMID: 23435812].” in the Introduction.
      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction. However, we are not going to mention comparison to RNPs because it is obvious that random sgRNA expressions is important key for random mutagenesis and design of random sgRNA treatments by RNP is difficult. The reason is that the target region might be cleaved by almost all sgRNA incorporated into cells. On the other hand, it is easier to design the number of sgRNA expression variety using the delivery system via integration into the chromosome because only integrate sgRNA are expressed.

      In addition, we could not agree that “integrated sgRNA cassettes could have been excised with Flp recombinase as in https://doi.org/10.1007/978-1-0716-0247-8_29.”

      This paper reports the concept that one EM7>neoR expression cassette flanked by Frt within KI allele could select intended-KI clone and then the cassette could remove by Flp recombinase. However, this approach is not suitable for our method because it causes structural mutation by recombination of multi Frt cassettes that are integrated into nearby genomic regions. Therefore, we will not mention it.

      *2. Additionally, the genotyping analysis is unclear, and seems to indicate that each clone bears homozygous mutations, with several clones showing deletions of the entire region. *

      → Thank you very much for your suggestion.

      We will revise them in Reviewer #1 Major comment 2a and b.

      3. Advance: The authors are motivated to create clones using tiling mutagenesis. Tiling mutagenesis has already been performed without transposases (e.g. https://doi.org/10.1038/nbt.3450, https://doi.org/10.1371/journal.pone.0170445, https://doi.org/10.1038/s41467-019-12489-8*) in the context of a screen, and clones have already been created using CRISPR/Cas9 mutagenesis so the advance presented in this manuscript over previous published work is unclear. *

      →Thank you very much for your suggestion, and we agree with your suggestion.

      We will add to acknowledge previous studies for CRISPR-Cas tilling screens.

      We will add the following sentences:

      • “Recently, targeted mutagenesis combined forward genetics and reverse genetics has been developed such as saturating mutagenesis and tiling mutagenesis that induce random mutation within target gene(s) [PMID: 25141179, 31586052, 27260157, 28118392]. This targeted mutagenesis can construct a mutant library harbouring subtly different mutations within a target gene(s) so that comparative analysis through the mutant library can screen out critical mutation(s) for biological processes. These random mutagenesises have also revealed the function of numerous coding genes” following to “…list of coding genes [6–8].” in the Introduction (page3, line 55-56)
      • “In addition, the saturating mutagenesis are limited in the length of target region due to an approach basing homology-directed repair although it could introduce random mutations on donor template library harbouring any combination and variety of mutations [PMID: 25141179]. On the other hand, the tiling mutagenesis could expand target length in principle because the length depends on multiplex guide RNA (gRNA) designed to target genomic region. Therefore, tiling mutagenesis has been employed to identify critical regions embedded in cis-elements [PMID: 26375006, 30612741, 26751173, 27708057, 28416141, 31784727]. Tiling mutagenesis requires editor such as Cas9 or epigenetic modifier fused to catalytically dead Cas9 (e.g. KRAB-dCas9), and a library containing multiplex gRNA tiling across target genomic region. In general, random single guide RNA (sgRNA) expression cassette are integrated into chromosomes of host cells by retrotransposon system.” following to “…within a narrow region.” in the Introduction (page4, line 82) The paper you raised as DOI: https://doi.org/10.1038/nbt.3450 applied CRISPRko tiling mutagenesis to find out critical region embedded 2 kb of p53 binding enhancer region by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions. In addition, we expand the length of target region to more than 50 kb. This is one of the advances.

      The paper you raised as DOI: https://doi.org/10.1371/journal.pone.0170445 applied CRISPRko tiling mutagenesis to find out critical mutation on MAP2K1 and BRAF protein coding sequence by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions.

      The paper you raised as DOI: https://doi.org/10.1038/s41467-019-12489-8 applied CRISPRko tiling mutagenesis for to find out critical domain from protein coding sequence by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions.

      To clarify the advantages, we will add the following sentences:

      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction.
      • “CRISPRko tiling mutagenesis is conducted for less than 15 kb target genomic region so far [PMID: 26375006, 30612741], while CRISPRi tiling mutagenesis can target more than 70 kb [PMID: 27708057], it is reported that. Hence, it remains unknown unclear how length CRISPRko tiling mutagenesis could expand” in the Introduction. In addition, we are going to conduct transposon removal by exicision-only-PBase treatment with several PB mES clones, for the proof of concept that CTRL-Mutagenesis can generate mutant library with no sgRNA cassettes.

      *4. Audience: The manuscript is written for the basic research audience, and the method could be applied to the study of regions of interest in many diseases. However, the unexcised use of transposases make the method less desirable than other methods. *

      → Thank you very much for your suggestion.

      We do not agree that the PiggyBac make the method less desirable than other methods.

      As mentioned in our response for reviewer #1 Significance 3, only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis. However, sgRNA cassettes by lentiviral delivery is never removed from the genome. In addition, other approaches such as Flp recombinase that reviewer #1 proposed in Significance 1 is not better than PiggyBac because Flp recombinase causes stratal mutation by recombination of multi Frt cassettes that are integrated into nearby genomic regions.

      To clarify them, we will add the following sentences:

      • “However, multiple integration of single guide RNA (sgRNA) cassettes has higher risk of non-targeted endogenous gene disruptions and may impair functional analysis.” in the Abstract.
      • “However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis [PMID: 23435812]. To eliminate the risk that integrated sgRNA cassettes disrupt non-targeted endogenous genes, it is best way to remove the sgRNA cassettes from the chromosome.” in the Introduction.
      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction. In addition, we are going to conduct transposon removal by exicision-only-PBase treatment with several PB mES clones, for the proof of concept that CTRL-Mutagenesis can generate mutant library with no sgRNA cassettes.

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Major concerns:

      1) Concern about the Novelty of Functional Analysis Platforms: The authors claim that there are no established platforms for the study of cis-elements or microRNA clusters. This assertion seems inaccurate, as previous studies have utilized Cas9 tiling screens to investigate cis-regulatory elements (CREs) and large-scale screens to probe microRNA functions, as exemplified by the works of Canver et al. in Nature 2015, Gasperini et al. in Cell 2019, and others. *

      → Thank you very much for your suggestion, and we agree with your suggestion.

      We apologize our false claim so that we will delete the following sentences:

      • “In contrast, no functional analysis platforms have been established for the study of cis-elements or microRNA cluster regions consisting of multiple microRNAs with functional overlap” in the Abstract (page2, line 28-30)
      • “While loss-of-function analysis has been conducted for numerous coding genes, very limited progress has been made on non-coding genes and cis-elements.” in the Introduction (page3, line 47-49) The paper you raised as DOI: https://doi.org/10.1038/nature15521 (Canver et al. in Nature 2015) applied CRISPRko tiling mutagenesis to find out critical region embedded 12 kb of BCL11A enhancer region by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions. In addition, we expand the length of target region to more than 50 kb. This is one of the advances.

      The paper you raised as DOI: https://doi.org/10.1016/j.cell.2018.11.029 (Gasperini et al. in Cell 2019) applied CRISPRko tiling mutagenesis to find out critical region embedded maximum 12 kb enhancer candidates, in addition to CRISPRi tilling candidate screening through one sgRNA by one candidate enhancer, by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions. In addition, we expand the length of target region to more than 50 kb. This is one of the advances. Additionally, to identify combinations of critical region embedded in target regions, it would require diverse combinations of mutations or inactivation sites. To induce multiple mutations or inactivated sites, it requires multiple sgRNA cassettes integration. However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis. To eliminate the risk that integrated sgRNA cassettes disrupt non-targeted endogenous genes, it is best way to remove the sgRNA cassettes from the chromosome. Thus, to identify combinations of critical region embedded in target regions with no artifact owing to no footprint by removal of sgRNA cassettes, CRISPRko tiling mutagenesis rather than CRISPRi is better method because CRISPRi requires integrated cassettes that stably expressed sgRNA and epigenetic modifier fused to dCas9. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions to find out combinations of critical region embedded in target regions.

      Therefore, to add to acknowledge previous studies and clarify the advantages, we will add the following sentences:

      • “In addition, the saturating mutagenesis are limited in the length of target region due to an approach basing homology-directed repair although it could introduce random mutations on donor template library harbouring any combination and variety of mutations [PMID: 25141179]. On the other hand, the tiling mutagenesis could expand target length in principle because the length depends on multiplex guide RNA (gRNA) designed to target genomic region. Therefore, tiling mutagenesis has been employed to identify critical regions embedded in cis-elements [PMID: 26375006, 30612741, 26751173, 27708057, 28416141, 31784727]. Tiling mutagenesis requires editor such as Cas9 or epigenetic modifier fused to catalytically dead Cas9 (e.g. KRAB-dCas9), and a library containing multiplex gRNA tiling across target genomic region. In general, random single guide RNA (sgRNA) expression cassette are integrated into chromosomes of host cells by retrotransposon system.” following to “…within a narrow region.” in the Introduction (page4, line 82)
      • “To identify combinations of critical region embedded in target regions, it would require diverse combinations of mutations or inactivation sites. To induce multiple mutations or inactivated sites, it requires multiple sgRNA cassettes integration. However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis [PMID: 23435812]. To eliminate the risk that integrated sgRNA cassettes disrupt non-targeted endogenous genes, it is best way to remove the sgRNA cassettes from the chromosome. Thus, to identify combinations of critical region embedded in target regions with no artifact owing to no footprint by removal of sgRNA cassettes, CRISPRko tiling mutagenesis rather than CRISPRi is better method because CRISPRi requires integrated cassettes that stably expressed sgRNA and epigenetic modifier fused to dCas9.” in the Introduction.
      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction.
      • “CRISPRko tiling mutagenesis is conducted for less than 15 kb target genomic region so far [PMID: 26375006, 30612741], while CRISPRi tiling mutagenesis can target more than 70 kb [PMID: 27708057], it is reported that. Hence, it remains unknown unclear how length CRISPRko tiling mutagenesis could expand” in the Introduction. On the other hand, we could not find previous studies employing Cas9 tiling mutagenesis to investigate miRNA functions. The application for miRNA cluster is also one of the advances.

      2) Advantages of PiggyBack System Over Lentiviral Integration: The paper does not clearly articulate the advantages of their proposed PiggyBack-based system for sgRNA integration over traditional lentiviral integration. Both methods facilitate the random integration of multiple gRNAs, but the paper lacks a comparative analysis or justification for choosing the PiggyBack system.

      → Thank you very much for your suggestion, and we agree with your suggestion.

      We agree that we did not mention why we choose PiggyBac system compared to lentiviral delivery.

      Therefore, we will add the following sentences:

      • “In general, random single guide RNA (sgRNA) expression cassette are integrated into chromosomes of host cells by retrotransposon system. To identify combinations of critical region embedded in target regions, it would require diverse combinations of mutations or inactivation sites. To induce multiple mutations or inactivated sites, it requires multiple sgRNA cassettes integration. However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis [PMID: 23435812].” in the Introduction.
      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction.

        *3) Lack of Comparative Analysis with Alternative Methods: The authors did not provide a comparison of CTRL-Mutagenesis with other existing screening methods. Such a comparison is crucial for understanding the effectiveness and efficiency of the new method in relation to established techniques. *

      → Thank you very much for your suggestion.

      We agree with the comparison is one of important experiments.

      However, our main claim is validation of tiling mutagenesis using PiggyBac that is only integration system with no footprint. Therefore, we propose our novelty without the comparison and not argue higher / lower efficiency of CTRL-Mutagenesis compared to exiting methods.

      *4) Limitations in Library Resolution: The paper acknowledges the limited resolution of their proposed library. The authors might have explored the use of base editors for enhanced resolution in such screens, as base editing could potentially offer more precise and controlled mutagenesis as briefly mentioned in the discussion. *

      → Thank you very much for your suggestion.

      We agree with your suggestion.

      Base editing is occurred within only editing window. In addition, a major limitation of prime editing is low efficiency (https://doi.org/10.1016/j.tibtech.2023.03.004). Therefore, design of sgRNA for base editor or pegRNA and its editing efficiency requires huge amounts of experiments.

      Our study is proof of concept to validate PiggyBac system for CRISPRko tilling mutagenesis and expanded the length of target regions. Thus, we just discussed limited resolution of our mutant library and proposed the use of base editors for enhanced resolution in the Discussion (page14, line 366-370).

      5) Absence of Functional Data Post-Mutagenesis: A significant limitation of the study is the absence of functional data following the creation of cells with different mutations. While the authors speculate about using differentiation systems or organoids for practical applications, they do not provide empirical data to demonstrate the utility of the CTRL-Mutagenesis approach. This lack of functional validation raises questions about the practical applicability of the method.

      → Thank you very much for your suggestion.

      We agree with your suggestion.

      We would make functional analysis future research.

      In this paper, we just validated PiggyBac system for CRISPRko tilling mutagenesis and expanded the length of target regions.

      To change our tone that claiming usability of our method for functional analysis, we will change the following sentences:

      • Change “to identify functionally important elements in non-coding regions” to “to induce diverse combination and variety of mutations within more than 50 kb non-coding region” in the Title.
      • Add “However, not much loss-of-function screens of non-coding regulatory elements has been conducted due to ambiguous annotations compared with protein-coding genes. Tiling mutagenesis has been employed to identify critical regions embedded in non-coding regulatory elements by comparative analysis through a mutant library harbouring subtly different regions mutated within less than 15 kb region. Conventional tiling mutagenesis construct a mutant library integrated multiple sgRNA cassettes by retroviral delivery. However, multiple integration of single guide RNA (sgRNA) cassettes has higher risk of non-targeted endogenous gene disruptions and may impair functional analysis. Herein, combining tiling mutagenesis and PiggyBac transposon that can be removed with no footprint on integrated sites, we established an expanded tilling mutagenesis method named CRISPR- & Transposase-based RegionaL Mutagenesis (CTRL-Mutagenesis). We demonstrated that PiggyBac system could integrated diverse combinations and varieties of sgRNA cassettes.and then CTRL-Mutagenesis randomly induces diverse combination and variety of mutations within more than 50 kb non-coding region in murine embryonic stem cells. CTRL-Mutagenesis would apply for wider non-coding regulatory elements with no risk of non-targeted endogenous gene disruptions.” in the Abstract.
      • Delete “Comparative analysis of mutants harbouring subtly different mutations within the same region would facilitate the further study of cis-element and microRNA clusters.” in the Abstract (page2, line 38-40).
      • Change “The generated random mutant mES clone library could facilitate further functional analyses of non-coding regulatory elements within the genome.” to “The generated random mutant mES clone library could develop to investigate critical regions of non-coding regulatory elements within the genome.” In the Introduction (page4, line 88-90)

        *Reviewer #2 (Significance (Required)):

        1. In summary, while the idea to integrate sgRNA in the genome by the PiggyBack system is interesting the claim of novelty is questionable due to existing methods in the field. The advantages of their system over existing technologies are not clearly articulated, and a lack of comparative analysis with other methods leaves the efficiency of CTRL-Mutagenesis uncertain. *

      → Thank you very much for your suggestion.

      Previous studies about CRISPRko and CRISPRi tiling mutagenesis employ lentiviral delivery of sgRNA cassettes into the genome. However, multi sgRNA cassette integrations have higher risk to disrupt non-targeted endogenous functions. To eliminate the risk that integrated sgRNA cassettes disrupt non-targeted endogenous genes, it is best way to remove the sgRNA cassettes from the chromosome. Nevertheless, lentiviral transposon, one of retrotransposon, cannot be removed from the chromosome. On the other hand, only PiggyBac transposon can be removed with no footprint. Therefore, we aimed to validate PiggyBac system for tiling mutagenesis. Moreover, there is no report that CRISPRko tiling mutagenesis apply for more than 15 kb genomic region. Therefore, we aimed to expand the length of target region.

      Therefore, we will change our claim that our method could expand CRISPRko tiling mutagenesis to more than 50 kb with no risk of non-targeted endogenous gene disruption.

      We will add the novelty and advantage of our method.

      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction.
      • “CRISPRko tiling mutagenesis is conducted for less than 15 kb target genomic region so far [PMID: 26375006, 30612741], while CRISPRi tiling mutagenesis can target more than 70 kb [PMID: 27708057], it is reported that. Hence, it remains unknown unclear how length CRISPRko tiling mutagenesis could expand” in the Introduction. However, our main claim is validation of tiling mutagenesis using PiggyBac that is only integration system with no footprint. Therefore, we will propose our novelty without the comparison and not argue higher / lower efficiency of CTRL-Mutagenesis compared to exiting methods.

      In addition, we are going to conduct transposon removal by exicision-only-PBase treatment with several PB mES clones, for the proof of concept that CTRL-Mutagenesis can generate mutant library with no sgRNA cassettes.

      2. Moreover, the limited resolution of their library and the absence of functional data post-mutagenesis are significant drawbacks that need to be addressed in future research to ascertain the method's practical utility.

      → Thank you very much for your suggestion.

      We agree with your suggestion.

      We would make functional analysis future research.

      Base editing is occurred within only editing window. In addition, a major limitation of prime editing is low efficiency (https://doi.org/10.1016/j.tibtech.2023.03.004). Therefore, design of sgRNA for base editor or pegRNA and its editing efficiency requires huge amounts of experiments.

      Our study is proof of concept to validate PiggyBac system for CRISPRko tilling mutagenesis and expanded the length of target regions. Thus, we just discussed limited resolution of our mutant library and proposed the use of base editors for enhanced resolution in the Discussion (page14, line 366-370).

      Therefore, we just claimed that we validated PiggyBac system for CRISPRko tilling mutagenesis and expanded the length of target regions.

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Major comments: 1. Authors claim that "CTRL-mutagenesis randomly induces diverse mutations only within the targeted regions in murine embryonic stem (mES) cells.", however, the outcome of mutations is not entirely random since most of the mutations are regional deletions. For example, despite the random distribution of gRNAs per cell, the inner regions like Mirc56_5 or Mirc56_8 are mutated with >80% efficiency.*

      → Thank you very much for your comment.

      We agree that middle regions are tending to be deleted and mutation type induced is not entirely random. However, we do not agree that “the outcome of mutations is not entirely random since most of the mutations are regional deletions.” Focusing on the combinations of mutations as mentioned in the Result (page12, line 302-304), CTRL-Mutagenesis could induce diverse mutation combinations randomly at a moderate degree. In fact, 79.2% of clones harboring multiple mutations were induced different combinations of mutations. In addition, to confirm how mutations occurred within Mirc56 by CTRL-Mutagenesis, we constructed only 87 mutant clones though single cloning. Therefore, it is not completely understanded due to fewer clones compared with conventional CRISPRko tiling mutant library. Of course, we should improve the randomness of mutation combinations, but we already discussed it and proposed solutions in the Discussion (page14, line 366-370).

      Certainly, CTRL-Mutagenesis would be difficult to identify necessary and sufficient genomic region due to incomplete randomness. Nevertheless, there is no report to induce diverse combination and variety of mutations within more than 50 kb genomic region. Hence, CTRL-Mutagenesis should be worth screening out critical regions within more than 50 kb regions.

      To clarify them, will add the following sentences:

      • “CRISPRko tiling mutagenesis is conducted for less than 15 kb target genomic region so far [PMID: 26375006, 30612741], while CRISPRi tiling mutagenesis can target more than 70 kb [PMID: 27708057], it is reported that. Hence, it remains unknown unclear how length CRISPRko tiling mutagenesis could expand” in the Introduction.
      • “Four clones (#2_066, #1_021, #1_029 and #1_046) harboured entire Mirc56_X deletions on all analysed Mirc56_X genomic region. In addition to these clones, only 3 pairs (#2_019 and #2_084, #2_038 and #1_023, #1_016 and #1_027) were induced same combination of mutations. Besides, 26 clones had only one mutation from Mirc56_1 to Mirc56_13. On the other hand, there was no mutation on Mirc56_1 to 13 in 11 clones including 5 clones (#2_012, #2_015, #2_054, #2_092 and #2_102) carried no sgRNA cassette for Mirc56 _1 to 13 and 6 clones (#2_017, #2_053, #2_098, #1_003, #1_012 and #1_044) even carried any one of sgRNA cassettes for Mirc56 _1 to 13. Among 48 clones carrying multiple mutations except for clones carrying only one mutation or Intact, 38 clones (79.2%) harboured different combinations of mutations. These results suggested that CTRL-Mutagenesis could induce diverse combinations of mutations.” following to “…different combinations of mutations (Figure 5C).” in the Result (page12, line 304)
      • “Note that CTRL-Mutagenesis would be difficult to identify necessary and sufficient genomic region due to incomplete randomness. Nevertheless, CTRL-Mutagenesis should be worth screening out critical regions within more than 50 kb regions” following to “…to induce regional deletions.” in the Discussion (page15, line 378)
      • Change “diverse mutations” to “diverse combination and variety of mutations” in the Title, Abstract (page2, line 37), Introduction (page4, line 87), Result (page13, line 318), Discussion (page13, line 325), (page14, line 363) Additionally, we do not agree with your suggestions that “the inner regions like Mirc56_5 or Mirc56_8 are mutated with >80% efficiency”. We apologize for the misleading context. These high mutation rates were calculated on only the target sites. Actually, maximum mutation rate on all MIrc56_X genomic regions are 44.8% on Mirc56_10, minimum is 14.9% on Mirc56_2 and an average is 30.9% (attached Figure).

      We appreciate that we can recognize our misleading context by your suggestion. It is more important that the analysis focusing all Mirc56_X genomic regions rather than target Mirc56_X. Therefore, we newly made figure about event occurrence in Mirc56_X genomic regions (attached Figure) as Figure 5D and replaced previous Figure 5D about event occurrence in target Mirc56_X to Supplemental Figure S6.

      To clarify them, we will add the following sentences:

      • “As for event occurrences on each Mirc56_X genomic region, miRNA deletions were dominant and an average of 26.7 Mirc56_X genomic region were induced mutations in 87 clones (Figure 5D). Maximum mutation rate on all MIrc56_X genomic regions was 44.8% (39/87) on Mirc56_10, minimum was 14.9% (13/87) on Mirc56_2” following to “…on the same strand” in the Result (page12, line 309)
      • “__D, __Mutations in 87 Mirc56 random mutant clones. The target sites do not include Mirc56_14, 15, 16, and The vertical axis and bar graphs show event occurrence on each Mirc56 genomic region in 87 Mirc56 random mutant clones. The bar colour indicates each event (Black: Regional deletion, Gray: Indel mutation, White: Intact).” in the Figure legend.

        2. Also, although the authors discuss that the lower mutation frequency observed for Mirc56_2 and 4 may be due to a technical error, confirming this by repeating the experiment would be important to prove the usability of this method.

      →Thank you very much for your comment, and we agree with your suggestion.

      We had already constructed bulk PB mES cells twice and showed Figure 4B combined these experimental replicates.

      To clarify that we constructed bulk PB mES cells twice, we changed Figure 4B as attached and will add the following sentences:

      • “Even though these bulk PB mES cells were constructed twice, it seemed that sgRNA cassettes for Mirc56_2 and 4 were difficult to integrate into the genome.” following to “…were rarely detected” in the Result (page11, line 273)
      • “In addition, we suspected technical errors so that we constructed bulk PB mES cells twice. Unfortunately, their low integration frequencies were not improved.” following to “…efficiency or cell growth.” in the Discussion (page14, line 346)
      • “Bulk1 and Bulk2 indicate the experimental replicate.” following to “…next-generation sequencing (NGS).”in the Figure legend (page22 line 563) In addition, we re-sequenced sgRNA donor vector for Mirc56_2 and 4, and will add the following sentences:

      “We firstly doubted that their low integration frequencies were caused by any mutations on PB transposon of sgRNA donor vector, on especially ITR or ID that are important for integration efficiency [PMID: 15663772]. Therefore, we sequenced PB transposons for Mirc56_2 and 4 again. However, we could not find any mutations on their PB transposon.” following to “…efficiency or cell growth.” in the Discussion (page14, line 346)

      Moreover, to confirm the technical error, we are going to confirm sgRNA pool imbalance in donor vector library by amplicon short-read NGS.

      *3. Additionally, the experiments were performed on the haploid X chromosome of a male cell line. It is questionable whether this method can be generalized to other regions located in the other chromosomes. Clarifying These points would be essential especially because the focus of this manuscript is to describe the efficiency of this novel methodology. *

      →Thank you very much for your comment.

      We expect that CTRL-Mutagenesis could be valid on other biallelic locus.

      Therefore, we raised predicted issue such as complex genotyping and proposed one solution.

      When we target other biallelic locus, we must determine whether the combination of mutations induced are cis- or trans-mutations. Haplotype phasing, combined long-read sequencing with SNP markers within ROI on maternal/paternal chromosome, assembles each allele via SNP markers on each read [PMID: 35710642]. Therefore, combining CTRL-Mutagenesis on heterozygotic alleles of cells derived from such as human or murine hybrid with haplotype phasing might simplify genotyping.

      We will add the following sentences:

      “In this study, CTRL-Mutagenesis was validated by genotyping on mono allele in male mES cells to avoid investigating whether the combination of mutations induced are cis- or trans-mutations. All genotypes on Mirc56 should be hemizygous and these mutations induced might be cis-mutations so that we determined the genotypes by amplifying approximately 200 bp around the target sites. However, we did not confirm large mutations such as deletion of the genomic region between target sites and inversion. Long-read sequencing might capture their large mutations. Besides, we also expect that CTRL-Mutagenesis could be valid for ROI on biallelic autosome and X chromosome in female. Therefore, it is required to determine whether the combination of mutations induced are cis- or trans-mutation. Haplotype phasing, combined long-read sequencing with SNP markers within ROI on maternal/paternal chromosome, assembles each allele via SNP markers on each read [PMID: 35710642]. Therefore, combining CTRL-Mutagenesis on heterozygotic alleles of cells derived from such as human or murine hybrid with haplotype phasing might simplify genotyping.” in the Discussion.

      *4. The limitations of the methods seem not to be fully described in the manuscript and must be clarified. Compared to the previous studies (see "significance" section for details), this method is inferior in that (1) it is time-consuming because it requires clonal expansion of single cells and (2) it has low throughput because it requires genome sequencing due to the occurrence of deletions. These points should be described for the potential users of this methodology. For example, it may be useful to detail the time consumption in each experimental step in Fig. 4A. *

      →Thank you very much for your comment.

      We do not agree that (1) it is time-consuming because it requires clonal expansion of single cells.

      To confirm the mutations that CTRL-Mutagenesis induced, we did not conduct phenotyping screening such as dropout screening in this study. For further high-throughput screening, CTRL-Mutagenesis could apply bulk mutant mES cells, that is treated with Cas9 and EGFP-positive, for phenotyping screening.

      Additionally, we do not agree that (2) it has low throughput because it requires genome sequencing due to the occurrence of deletions. In this study, to prove our concept that CTRL-Mutagenesis could induce diverse combinations and varieties of mutations such as Indel and regional deletion, we conducted genotyping in all random mutant clones. On the other hand, there are alternative comparative method to improve throughput without genotyping. Combination of phenotyping screening and gene expression assay for target miRNAs or transcript regulated by target cis-element help us obtain clones harboring mutations on functionally critical regions within target region. Finally, we should conduct genotyping to identify critical regions embedded in non-coding regulatory elements.

      Even so, we will add the time consumption in Figure 4A as attached because the information may be useful for potential users as you mentioned.

      *Minor comments: 1. Data and methods are well-presented for reproducibility. The EGFP-positive ratio may be added to Fig. 4C for clarity. *

      →Thank you very much for your kind comment.

      We added the EGFP-positive ratio to Figure 4C and will add the following sentence:

      “The percentage above the box indicates the EGFP-positive ratio.” following to “…the gates of the EGFP filter.” in the Figure legends (page23, line 567)

      2. Enhance referencing accuracy, rectify DOI format in ref 21, and ensure consistency in citation formatting, e.g., ref 32.

      →Thank you very much for your kind comment.

      Along with the transfer, we will modify the style of references and have already confirmed the referencing accuracy in the Reference.

      3. It seems that the experimental condition (e.g. The amount of vectors used for transfection) should be re-considered every time the researcher wants to set up an experiment changing target genomic regions, cell types etc. If so, this also should be described in the text for potential users of this method.

      →Thank you very much for your comment, and we agree with your suggestion.

      We will add the following sentences:

      “This study just validated CTRL-Mutagenesis for 17 target sites in mES cells. Therefore, it might be better to adjust the number of integrated sgRNA cassettes according to the number of target sites and cell types.” following to “…sgRNA cassettes to be integrated.” in the Discussion (page14, line 355)

      *Reviewer #3 (Significance (Required)):

      There were various methods described in the late 2010's which aimed to screen for the functional non-coding regions using approaches such as KO-based, HDR-based, and epigenetic silencing using dCas9 (for example, PMID: 25141179, 26751173, 27708057, 28416141, 31784727). The authors should summarize what would be the strength of their method compared to these previously described methodologies. The strength of this methodology seems to be moderate complexity and cost-effectiveness compared to these previous techniques. It may be difficult for this methodology to become a state-of-the-art method to evaluate cis-element combinations, but it can be beneficial to researchers wanting to set up a low-cost system that can produce moderately complex cell libraries.*

      →Thank you very much for your suggestion.

      We will add to acknowledge previous studies for CRISPR-Cas tilling screens.

      • “Recently, targeted mutagenesis combined forward genetics and reverse genetics has been developed such as saturating mutagenesis and tiling mutagenesis that induce random mutation within target gene(s) [PMID: 25141179, 31586052, 27260157, 28118392]. This targeted mutagenesis can construct a mutant library harbouring subtly different mutations within a target gene(s) so that comparative analysis through the mutant library can screen out critical mutation(s) for biological processes. These random mutagenesises have also revealed the function of numerous coding genes” following to “…list of coding genes [6–8].” in the Introduction (page3, line 55-56)
      • “In addition, the saturating mutagenesis are limited in the length of target region due to an approach basing homology-directed repair although it could introduce random mutations on donor template library harbouring any combination and variety of mutations [PMID: 25141179]. On the other hand, the tiling mutagenesis could expand target length in principle because the length depends on multiplex guide RNA (gRNA) designed to target genomic region. Therefore, tiling mutagenesis has been employed to identify critical regions embedded in cis-elements [PMID: 26375006, 30612741, 26751173, 27708057, 28416141, 31784727]. Tiling mutagenesis requires editor such as Cas9 or epigenetic modifier fused to catalytically dead Cas9 (e.g. KRAB-dCas9), and a library containing multiplex gRNA tiling across target genomic region. In general, random single guide RNA (sgRNA) expression cassette are integrated into chromosomes of host cells by retrotransposon system.” following to “…within a narrow region.” in the Introduction (page4, line 82) The paper you raised as DOI: https://doi.org/10.1038/nature13695 (PMID: 25141179) applied saturation mutagenesis to find out critical mutation on BRCA1 and DBR1 protein coding sequence by HDR-based strategy using donor template library. This method based homologous recombination repair, so that the length of target region is limited. Our method employs tiling mutagenesis whose target length depends on sgRNA designed. We expand the length of target region to more than 50 kb from less than 15 kb previously reported. This is our strength compared with this report.

      The paper you raised as DOI: https://doi.org/10.1038/nbt.3450 (PMID: 25141179) applied CRISPRko tiling mutagenesis to find out critical region from 2 kb of p53 binding enhancer region by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions. In addition, we expand the length of target region to more than 50 kb. This is one of the advances.

      The paper you raised as DOI: https://doi.org/10.1126/science.aag2445 (PMID: 27708057) applied CRISPRi tiling mutagenesis to find out critical region from 74 kb genomic region around GATA1 and MYC by lentiviral delivery of sgRNA cassettes. Our method employs CRISPRko and PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. To identify combinations of critical region embedded in target regions, it would require diverse combinations of mutations or inactivation sites. To induce multiple mutations or inactivated sites, it requires multiple sgRNA cassettes integration. However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis. To eliminate the risk that integrated sgRNA cassettes disrupt non-targeted endogenous genes, it is best way to remove the sgRNA cassettes from the chromosome. Thus, to identify combinations of critical region embedded in target regions with no artifact owing to no footprint by removal of sgRNA cassettes, CRISPRko tiling mutagenesis rather than CRISPRi is better method because CRISPRi requires integrated cassettes that stably expressed sgRNA and epigenetic modifier fused to dCas9. PiggyBac system can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions to find out combinations of critical region embedded in target regions.

      The paper you raised as DOI: https://doi.org/10.1016/j.molcel.2017.03.007 (PMID: 28416141) reported applied CRISPRi tiling mutagenesis to find out critical region from TAD scale (about 200 kb) with low magnification by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions.

      The paper you raised as DOI: https://doi.org/10.1038/s41588-019-0538-0 (PMID: 31784727) reported applied CRISPRi tiling mutagenesis to develop method that can find out novel regulatory element around protein coding by lentiviral delivery of sgRNA cassettes. Our method employs PiggyBac system that can remove the sgRNA cassettes from the chromosome with no footprint. Therefore, our method should be novel method that generates mutant library with no risk of non-targeted endogenous gene disruptions.

      To clarify the advantages, we will add the following sentences:

      • “To identify combinations of critical region embedded in target regions, it would require diverse combinations of mutations or inactivation sites. To induce multiple mutations or inactivated sites, it requires multiple sgRNA cassettes integration. However, multiple integration of sgRNA cassettes have higher risk of non-targeted endogenous gene disruptions and may impair functional analysis [PMID: 23435812]. To eliminate the risk that integrated sgRNA cassettes disrupt non-targeted endogenous genes, it is best way to remove the sgRNA cassettes from the chromosome. Thus, to identify combinations of critical region embedded in target regions with no artifact owing to no footprint by removal of sgRNA cassettes, CRISPRko tiling mutagenesis rather than CRISPRi is better method because CRISPRi requires integrated cassettes that stably expressed sgRNA and epigenetic modifier fused to dCas9.” in the Introduction.
      • “Here, we proposed that DNA transposon system rather than retrotransposon system is more suitable to remove sgRNA cassettes from a mutant library. Transposons are genetic elements that can relocate between genomic sites and there are two types of transposons: (1) DNA transposon is transferred by a "cut and paste" mechanism in which the transposon sequence is cut directly from the genome, and (2) retrotransposon is transferred by a "copy and paste" mechanism in which the transposon sequence is transcribed into RNA and then integrated by reverse transcribed [PMID: 21958341]. Therefore, retrotransposon is never removed from the genome. DNA transposon such as PiggyBac, Sleeping Beauty and Tol2 systems are also used as gene transfer tools in vertebrates [PMID: 26481584]. Especially, PiggyBac leaves no footprint on integrated sites after transposons relocated while other DNA transposon system leaves small insertion on integrated sites [PMID: 34064900]. In addition, excision-only-PiggyBac transposase that can remove transposons but not integrate them, is developed [PMID: 27929521]. Only PiggyBac system can remove transposons carrying sgRNA cassettes from mutant library with no footprint. Therefore, we aimed to validate PiggyBac system for CRISPRko tilling mutagenesis.” in the Introduction.
      • “CRISPRko tiling mutagenesis is conducted for less than 15 kb target genomic region so far [PMID: 26375006, 30612741], while CRISPRi tiling mutagenesis can target more than 70 kb [PMID: 27708057], it is reported that. Hence, it remains unknown unclear how length CRISPRko tiling mutagenesis could expand” in the Introduction. In addition, we are going to conduct transposon removal by exicision-only-PBase treatment with several PB mES clones, for the proof of concept that CTRL-Mutagenesis can generate mutant library with no sgRNA cassettes.

      *Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Major concerns, 1, Authors claim "to identify functionally important elements in non-coding regions in the title but there is no evidence of any functional analysis in the manuscript.*

      → Thank you very much for your suggestion, and we agree with your suggestion.

      In this paper, we just validated PiggyBac system for CRISPRko tilling mutagenesis and expanded the length of target regions.

      To change our tone that claiming usability of our method for functional analysis, we will change the following sentences:

      • Change “to identify functionally important elements in non-coding regions” to “to induce diverse combination and variety of mutations within more than 50 kb non-coding region” in the Title.
      • Add “However, not much loss-of-function screens of non-coding regulatory elements has been conducted due to ambiguous annotations compared with protein-coding genes. Tiling mutagenesis has been employed to identify critical regions embedded in non-coding regulatory elements by comparative analysis through a mutant library harbouring subtly different regions mutated within less than 15 kb region. Conventional tiling mutagenesis construct a mutant library integrated multiple sgRNA cassettes by retroviral delivery. However, multiple integration of single guide RNA (sgRNA) cassettes has higher risk of non-targeted endogenous gene disruptions and may impair functional analysis. Herein, combining tiling mutagenesis and PiggyBac transposon that can be removed with no footprint on integrated sites, we established an expanded tilling mutagenesis method named CRISPR- & Transposase-based RegionaL Mutagenesis (CTRL-Mutagenesis). We demonstrated that PiggyBac system could integrated diverse combinations and varieties of sgRNA cassettes.and then CTRL-Mutagenesis randomly induces diverse combination and variety of mutations within more than 50 kb non-coding region in murine embryonic stem cells. CTRL-Mutagenesis would apply for wider non-coding regulatory elements with no risk of non-targeted endogenous gene disruptions.” in the Abstract.
      • Delete “Comparative analysis of mutants harbouring subtly different mutations within the same region would facilitate the further study of cis-element and microRNA clusters.” in the Abstract (page2, line 38-40).
      • Change “The generated random mutant mES clone library could facilitate further functional analyses of non-coding regulatory elements within the genome.” to “The generated random mutant mES clone library could develop to investigate critical regions of non-coding regulatory elements within the genome.” In the Introduction (page4, line 88-90)

        2, Genotypes of mutant library, especially Mirc56, 14,15, 16, 17 were not determined due to six tandem repeats. Thus, analysis of the relationship between genotype and biological functions is not possible. Moreover, the authors did not show any phenotypic analysis.

      → Thank you very much for your suggestion.

      The 6 tandem repeats consisted of each approximately 3.3 kb are hard to determine mutations and are uncommon.

      Therefore, we skipped genotyping Mirc56_14, 15, 16, and 17

      Certainly, it is drawback that we did not determine all mutations induced by CRTL-mutagenesis.

      Even so, we could determine the properties of mutant library within 37 kb genomic region from Mirc56_1 to Mirc56_13.

      Therefore, we could conclude that CTRL-mutagenesis could induce diverse combinations and variations of mutations into more than 50 kb.

      3, Multiple gRNA may cause deletion and inversion to targeted loci. With local PCR based amplification, detection of large deletion and inversion can be very difficult. I think the authors should examine and address this possibility more carefully. The definition of indel in Fig 5C should be explained in more detail.

      → Thank you very much for your comment, and we agree with your suggestion.

      We did not confirm inversion and large deletion.

      To confirm whether inversions were happened, we are going to perform PCR walking in several clones and long-read sequencing.

      4, Although the authors showed a variety of PB cassettes (Max is 17), more importantly would be to determine the actual copy number of PB cassettes. Difference between the highest and the lowest EGFP intensities in Fig 2C (Donor 300ng Effector 350ng) is approximately ~100 fold, thus ES clone bearing highest PB vector may contain ~100 copies of PB vector. PB transposon prefers insertion in active genes compared to other transposon system such as Sleeping Beauty and Tol2 transposon. (Yoshida J et al Sci Rep. 2017 Mar 2;7:43613. doi: 10.1038/srep43613.). Higher integration rates of PB vectors have a higher chance of endogenous gene disruptions and may impair functional analysis.

      → Thank you very much for your suggestion.

      We agree that cassette integration number is one important aspect of tiling mutagenesis. To determine actual copy number of PB transposon is useful information when potential user consider optimizing our method for own target region. However, to confirm whether the relationship between mutations induced and sgRNA cassettes integrated, the number of integrated cassette variety is more important because the diversity of sgRNAs variety expressed is more related to the diversity of mutations induced. Therefore, we identified the number of integrated cassette variety.

      To clarify this point, we will add we the following sentences:

      “rather than the copy number of sgRNA cassettes because the diversity of sgRNAs variety expressed is more related to the diversity of mutations induced” following to “…the number of sgRNA cassette varieties.” in the Result (page12, line 297)

      Certainly, we apologize that it is not accurate that “EGFP signal intensity correlated with the copy number of EGFP cassettes integrated into genomes[23]” in the Result (page11, line 249-250). EGFP expression levels are affected by cell cycle so that the paper reported that “Median EGFP intensities correlated with the copy number of EGFP cassettes integrated into genomes”.

      Therefore, we will delete the following sentence:

      “EGFP signal intensity correlated with the copy number of EGFP cassettes integrated into genomes[23]” in the Result (page11, line 249-250).

      To investigate how many copies our concentration of donor vector could integrate, we are going to check actual copy numbers in several clones by qPCR.

      Besides, we agree with your suggestion that “PB transposon prefers insertion in active genes compared to other transposon system such as Sleeping Beauty and Tol2 transposon. (Yoshida J et al Sci Rep. 2017 Mar 2;7:43613. doi: 10.1038/srep43613.)

      Therefore, we will change the following sentence:

      “random TTAA sites across genomes [24]” to “random TTAA sites of transcribed region rather than intergenic region [PMID: 28252665]” in the Discussion (page14, line 357).

      However, Sleeping Beauty and Tol2 transposon remain footprint at integration sites when these transposons move [PMID: 15133768, 23143102]. Especially, SB transposon leaves canonical 5 bp insertion at integration sites so that the canonical 5bp insertion into coding sequence could disrupt the function of endogenous protein frequently. On the other hand, PB transposon remains no footprint. Therefore, excision-only-PBase can remove the PB transposon from mutant library clearly. Thus, it is no worry about that PB transposon disrupt non-targeted endogenous gene impair functional analysis if PB mutant library is treated with excision-only-PBase.

      In addition, we are going to conduct transposon removal by exicision-only-PBase treatment with several PB mES clones, for the proof of concept that CTRL-Mutagenesis can generate mutant library with no sgRNA cassettes.

      5, Most non-coding regions are located at autosomes. Genotyping would be very difficult or even impossible by the current PCR based strategy.

      → Thank you very much for your comment, and we agree with your suggestion.

      This is one of our issues.

      We expect that CTRL-Mutagenesis could be valid on other biallelic locus.

      Therefore, we raised predicted issue such as complex genotyping and proposed one solution.

      When we target other biallelic locus, we must determine whether the combination of mutations induced are cis- or trans-mutations. Haplotype phasing, combined long-read sequencing with SNP markers within ROI on maternal/paternal chromosome, assembles each allele via SNP markers on each read [PMID: 35710642]. Therefore, combining CTRL-Mutagenesis on heterozygotic alleles of cells derived from such as human or murine hybrid with haplotype phasing might simplify genotyping.

      We will add the following sentences:

      “In this study, CTRL-Mutagenesis was validated by genotyping on mono allele in male mES cells to avoid investigating whether the combination of mutations induced are cis- or trans-mutations. All genotypes on Mirc56 should be hemizygous and these mutations induced might be cis-mutations so that we determined the genotypes by amplifying approximately 200 bp around the target sites. However, we did not confirm large mutations such as deletion of the genomic region between target sites and inversion. Long-read sequencing might capture their large mutations. Besides, we also expect that CTRL-Mutagenesis could be valid for ROI on biallelic autosome and X chromosome in female. Therefore, it is required to determine whether the combination of mutations induced are cis- or trans-mutation. Haplotype phasing, combined long-read sequencing with SNP markers within ROI on maternal/paternal chromosome, assembles each allele via SNP markers on each read [PMID: 35710642]. Therefore, combining CTRL-Mutagenesis on heterozygotic alleles of cells derived from such as human or murine hybrid with haplotype phasing might simplify genotyping.” in the Discussion.

      Moreover, genome-wide NGS and nanopore Cas9-treated sequencing (nCATs) could also help us to read the mutations without PCR-amplification. However, both methods can obtain reads of target regions with low frequency. Therefore, it is difficult to perform multiplex samples for mutant library.

      *6, Fig 4C, large amounts of Cas9 independent EGFP positive cells suggest the current system is not efficient. *

      → Thank you very much for your comment.

      We cannot agree with your indication.

      In fact, by the cutoff set in Cas9-untreated cells, the EGxxFP system successfully selected at least 76 mutant clones (87.4%) harboring mutations within Mirc56_1 to Mirc56_13. Moreover, we could seed 180 single-cells for single cloning by FACS once.

      To enhance this point, we added the following sentences:

      “Moreover, at least 76 out of 87 PB mES clones have mutations within all analysed Mirc56_Xs (Figure 5C). Therefore, the EGxxFP system could selected ROI mutant mES clones efficiently.” following to “…depended on integrated sgRNA cassettes.” in the Discussion (page13, line 355)

      *Reviewer #4 (Significance (Required)):

      The authors claim "Functional analysis" in the manuscript title but there is no evidence of functional analysis in the manuscript.*

      → Thank you very much for your suggestion, and we agree with your suggestion.

      In this paper, we just validated PiggyBac system for CRISPRko tilling mutagenesis and expanded the length of target regions.

      To change our tone that claiming usability of our method for functional analysis, we will change the following sentences:

      • Change “to identify functionally important elements in non-coding regions” to “to induce diverse combination and variety of mutations within more than 50 kb non-coding region” in the Title.
      • Add “However, not much loss-of-function screens of non-coding regulatory elements has been conducted due to ambiguous annotations compared with protein-coding genes. Tiling mutagenesis has been employed to identify critical regions embedded in non-coding regulatory elements by comparative analysis through a mutant library harbouring subtly different regions mutated within less than 15 kb region. Conventional tiling mutagenesis construct a mutant library integrated multiple sgRNA cassettes by retroviral delivery. However, multiple integration of single guide RNA (sgRNA) cassettes has higher risk of non-targeted endogenous gene disruptions and may impair functional analysis. Herein, combining tiling mutagenesis and PiggyBac transposon that can be removed with no footprint on integrated sites, we established an expanded tilling mutagenesis method named CRISPR- & Transposase-based RegionaL Mutagenesis (CTRL-Mutagenesis). We demonstrated that PiggyBac system could integrated diverse combinations and varieties of sgRNA cassettes.and then CTRL-Mutagenesis randomly induces diverse combination and variety of mutations within more than 50 kb non-coding region in murine embryonic stem cells. CTRL-Mutagenesis would apply for wider non-coding regulatory elements with no risk of non-targeted endogenous gene disruptions.” in the Abstract.
      • Delete “Comparative analysis of mutants harbouring subtly different mutations within the same region would facilitate the further study of cis-element and microRNA clusters.” in the Abstract (page2, line 38-40).
      • Change “The generated random mutant mES clone library could facilitate further functional analyses of non-coding regulatory elements within the genome.” to “The generated random mutant mES clone library could develop to investigate critical regions of non-coding regulatory elements within the genome.” In the Introduction (page4, line 88-90).
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Morimoto et al. presented a regional random mutagenesis study using multiple sgRNAs in PiggyBac transposon vectors to analyze the relationship between genotype and biological functions. For proof of principle, authors chose the X-linked miRNA cluster Mirc56 and made its mutant library.

      Major concerns

      1. Authors claim "to identify functionally important elements in non-coding regions in the title but there is no evidence of any functional analysis in the manuscript.
      2. Genotypes of mutant library, especially Mirc56, 14,15, 16, 17 were not determined due to six tandem repeats. Thus, analysis of the relationship between genotype and biological functions is not possible. Moreover, the authors did not show any phenotypic analysis.
      3. Multiple gRNA may cause deletion and inversion to targeted loci. With local PCR based amplification, detection of large deletion and inversion can be very difficult. I think the authors should examine and address this possibility more carefully. The definition of indel in Fig 5C should be explained in more detail.
      4. Although the authors showed a variety of PB cassettes (Max is 17), more importantly would be to determine the actual copy number of PB cassettes. Difference between the highest and the lowest EGFP intensities in Fig 2C (Donor 300ng Effector 350ng) is approximately ~100 fold, thus ES clone bearing highest PB vector may contain ~100 copies of PB vector. PB transposon prefers insertion in active genes compared to other transposon system such as Sleeping Beauty and Tol2 transposon. (Yoshida J et al Sci Rep. 2017 Mar 2;7:43613. doi: 10.1038/srep43613.). Higher integration rates of PB vectors have a higher chance of endogenous gene disruptions and may impair functional analysis.
      5. Most non-coding regions are located at autosomes. Genotyping would be very difficult or even impossible by the current PCR based strategy.
      6. Fig 4C, large amounts of Cas9 independent EGFP positive cells suggest the current system is not efficient.

      Significance

      The authors claim "Functional analysis" in the manuscript title but there is no evidence of functional analysis in the manuscript.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The study proposes a method utilizing EGxxFP, piggybac, and CRISPR-Cas9 systems to generate a random mutation library in non-coding genomic regions, such as cis-element regions or microRNA clusters. As a proof-of-concept of this method, a random mutation library of Mirc56 microRNA cluster was generated. The manuscript highlights the creation of regional mutations using mES cells.

      Major comments:

      Authors claim that "CTRL-mutagenesis randomly induces diverse mutations only within the targeted regions in murine embryonic stem (mES) cells.", however, the outcome of mutations is not entirely random since most of the mutations are regional deletions. For example, despite the random distribution of gRNAs per cell, the inner regions like Mirc56_5 or Mirc56_8 are mutated with >80% efficiency. Also, although the authors discuss that the lower mutation frequency observed for Mirc56_2 and 4 may be due to a technical error, confirming this by repeating the experiment would be important to prove the usability of this method. Additionally, the experiments were performed on the haploid X chromosome of a male cell line. It is questionable whether this method can be generalized to other regions located in the other chromosomes. Clarifying These points would be essential especially because the focus of this manuscript is to describe the efficiency of this novel methodology.

      The limitations of the methods seem not to be fully described in the manuscript and must be clarified. Compared to the previous studies (see "significance" section for details), this method is inferior in that (1) it is time-consuming because it requires clonal expansion of single cells and (2) it has low throughput because it requires genome sequencing due to the occurrence of deletions. These points should be described for the potential users of this methodology. For example, it may be useful to detail the time consumption in each experimental step in Fig. 4A.

      Minor comments:

      Data and methods are well-presented for reproducibility. The EGFP-positive ratio may be added to Fig. 4C for clarity.

      Enhance referencing accuracy, rectify DOI format in ref 21, and ensure consistency in citation formatting, e.g., ref 32.

      It seems that the experimental condition (e.g. The amount of vectors used for transfection) should be re-considered every time the researcher wants to set up an experiment changing target genomic regions, cell types etc. If so, this also should be described in the text for potential users of this method.

      Referees cross-commenting

      Major concerns raised by the reviewers, including the non-random nature of mutations, challenges in library resolution, and unclear advantages over existing methodologies, all seem to be reasonable. These concerns should be addressed before publication, especially because this is a methodology paper that reports the usability of this novel methodology.

      Significance

      The method combines CRISPR screens with EGxxFP and PiggyBac systems for complex cell library generation, albeit with limitations in throughput and time consumption.

      There were various methods described in the late 2010's which aimed to screen for the functional non-coding regions using approaches such as KO-based, HDR-based, and epigenetic silencing using dCas9 (for example, PMID: 25141179, 26751173, 27708057, 28416141, 31784727). The authors should summarize what would be the strength of their method compared to these previously described methodologies. The strength of this methodology seems to be moderate complexity and cost-effectiveness compared to these previous techniques. It may be difficult for this methodology to become a state-of-the-art method to evaluate cis-element combinations, but it can be beneficial to researchers wanting to set up a low-cost system that can produce moderately complex cell libraries.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, Morimoto et al. present CTRL-Mutagenesis, a method for region-specific random mutagenesis, aiming to identify functionally important elements in non-coding regions of the genome. While the idea to integrate sgRNA in the genome by the PiggyBack system is interesting there are several major concerns that limit the novelty and impact of this study as outlined below.

      Major concerns:

      1. Concern about the Novelty of Functional Analysis Platforms: The authors claim that there are no established platforms for the study of cis-elements or microRNA clusters. This assertion seems inaccurate, as previous studies have utilized Cas9 tiling screens to investigate cis-regulatory elements (CREs) and large-scale screens to probe microRNA functions, as exemplified by the works of Canver et al. in Nature 2015, Gasperini et al. in Cell 2019, and others.
      2. Advantages of PiggyBack System Over Lentiviral Integration: The paper does not clearly articulate the advantages of their proposed PiggyBack-based system for sgRNA integration over traditional lentiviral integration. Both methods facilitate the random integration of multiple gRNAs, but the paper lacks a comparative analysis or justification for choosing the PiggyBack system.
      3. Lack of Comparative Analysis with Alternative Methods: The authors did not provide a comparison of CTRL-Mutagenesis with other existing screening methods. Such a comparison is crucial for understanding the effectiveness and efficiency of the new method in relation to established techniques.
      4. Limitations in Library Resolution: The paper acknowledges the limited resolution of their proposed library. The authors might have explored the use of base editors for enhanced resolution in such screens, as base editing could potentially offer more precise and controlled mutagenesis as briefly mentioned in the discussion.
      5. Absence of Functional Data Post-Mutagenesis: A significant limitation of the study is the absence of functional data following the creation of cells with different mutations. While the authors speculate about using differentiation systems or organoids for practical applications, they do not provide empirical data to demonstrate the utility of the CTRL-Mutagenesis approach. This lack of functional validation raises questions about the practical applicability of the method.

      Significance

      In summary, while the idea to integrate sgRNA in the genome by the PiggyBack system is interesting the claim of novelty is questionable due to existing methods in the field. The advantages of their system over existing technologies are not clearly articulated, and a lack of comparative analysis with other methods leaves the efficiency of CTRL-Mutagenesis uncertain. Moreover, the limited resolution of their library and the absence of functional data post-mutagenesis are significant drawbacks that need to be addressed in future research to ascertain the method's practical utility.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In "Regional random mutagenesis driven by multiple sgRNAs and diverse on-target genome editing events to identify functionally important elements in non-coding regions", Morimoto et al. detail a method for generating clonal cell lines containing mutations tiled across a region of interest. Briefly, they create a pool of sgRNAs tiling a region of interest, then utilize the PiggyBac transposase to insert random sgRNAs from this pool into cellular DNA. Cas9 cleavage directed by cellular sgRNAs creates small indels and large deletions in cells that can be grown out into clonal cell lines. They apply their method to study the Mirc56 microRNA cluster to generate 87 clones, each clone being edited by an average of 4.7/17 sgRNAs. The authors suggest that these cell lines could be used to elucidate the functional importance of non-coding regions.

      Major comments:

      1. Mirc56_2 and 4 showed lower integration rates, and the authors suggest that this could be due to sgRNA pool imbalance. The authors should validate this by performing sequencing of the input sgRNA and cassettes.
      2. Clonal analysis in Figure 5c is unclear
        • a. Figure 5c indicates that all changes were homozygous (e.g. both alleles were deleted). Was this the case in all clones? Or were some mutations heterozygous?
        • b. Many clones in Figure 5c show that the entire region was deleted (all black dots). Could this be due to some experimental error or misinterpretation of the sequencing data, or could it be validated using some orthogonal method? This is especially surprising for clones in which the final guide (Mirc56_13) was not detected yet the final site (Mirc56_13) was reported as "Regional deletion".
      3. Next-generation targeted sequencing of clones should be made publicly accessible.
      4. OPTIONAL - Cassette integration number is understudied. One important aspect of tiling mutagenesis is the control over how many guides are present in each cell. The authors report an average of 4.7 cassettes/cell. This could be modulated by the amount of donor vector added, and indeed the authors performed titration experiments, but only with a fluorescent reporter readout. It would be very useful to know how the concentration of donor vector corresponds to the number of cassettes/cell - perhaps genotyping of clones from one or two additional experiments would be sufficient.

      Minor comments:

      1. The background fails to acknowledge the work of CRISPR-Cas tiling screens (e.g. https://doi-org/10.1038/nbt.3450) or CRISPR-Cas in creating mutagenesis in cell lines (e.g. https://doi-org/10.1007/978-1-0716-0247-8_29)
      2. Figure 1 left 'ROI random mutant PB mES cell' should be horizontally aligned so Mir_1, Mir_2 and MirX align with the upper figure.
      3. It is interesting and unexpected that some guides never induce indels, even in the absence of a regional deletion (e.g. Mirc56_3, Mirc56_7). Why might this be? Was there perhaps an error in the assignment of these guides to these cells?
      4. Regarding Mirc56_2 and 4 integration, on line 34 the authors suggest that "We suspect this was caused by a technical error, such as an unequal amount of sgRNA donor vector or the sequence in sgRNA cassettes affecting integration efficiency or cell growth." sgRNA library imbalance would be a technical error, but integration affecting cell growth is not a technical error. This sentence should be reworded.
      5. Line 540 "ration" is the incorrect word - perhaps "ratio"?
      6. Plot 5b should be shown as a histogram rather than a swarm plot to show how many clones were in each category.

      Referees cross-commenting

      All reviewers note previous functional non-coding screens and the lack of justification of the author's PiggyBac system. The authors should consider strengthening the justification and adding comparisons to existing methods if future revisions are considered.

      Significance

      General assessment: The authors are successful in creating clonal cell lines bearing a variety of mutations. Unfortunately, the cell lines also have transposase-mediated insertion events of the sgRNA cassettes at unknown positions in the genome, which will hamper the interpretability of any experiment using these cell lines. The authors fail to justify the use of the transposase and integration of the sgRNA, especially compared to lentiviral transfection or RNPs which would produce edits at the region of interest. Alternately, integrated sgRNA cassettes could have been excised with Flp recombinase as in https://doi-org/10.1007/978-1-0716-0247-8_29. Additionally, the genotyping analysis is unclear, and seems to indicate that each clone bears homozygous mutations, with several clones showing deletions of the entire region.

      Advance: The authors are motivated to create clones using tiling mutagenesis. Tiling mutagenesis has already been performed without transposases (e.g. https://doi-org/10.1038/nbt.3450, https://doi.org/10.1371/journal.pone.0170445, https://doi.org/10.1038/s41467-019-12489-8) in the context of a screen, and clones have already been created using CRISPR/Cas9 mutagenesis so the advance presented in this manuscript over previous published work is unclear.

      Audience: The manuscript is written for the basic research audience, and the method could be applied to the study of regions of interest in many diseases. However, the unexcised use of transposases make the method less desirable than other methods.

      I am a computational biologist with experience in CRISPR tiling screens and CRISPR amplicon sequencing analysis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Review Commons - Revision Plan

      Manuscript number: RC-2023-02228

      Corresponding author(s): Gatfield, David

      1. General Statements

      We are grateful to the three Reviewers for their detailed assessment of our manuscript and are delighted about their very constructive and positive evaluations, highlighting the study’s novelty and rigor.

      Briefly, the main points raised by Reviewers 1 and 3 do not involve additional experiments and are mostly about rethinking manuscript structure (e.g. moving data/analyses to the supplement or removing them altogether, as they distract from the main thrust of the story) and making the text overall less dense and more readable.

      Reviewer 3 also raises a number of additional interesting points that we should discuss in our manuscript, which would allow us placing our findings more effectively into the context of the existing literature.

      All these points are very well taken and will be implemented (see below, under 2).

      Reviewer 2 is overall also rather positive – speaking of “a very careful and detailed study that addresses an important issue” and the study being “really rigorous and the logic […] very well explained”; moreover, this Reviewer also shares the view of both other Reviewers that parts of the manuscript (i.e., in particular its beginning) should be shortened.

      Importantly, this Reviewer remarks in addition under “Significance”: “Without additional mechanistic insights suggesting that there is something particular different about the regulation of these mRNAs the manuscript is not of extremely high significance.” – an important point of criticism that we wish to address in our revision, as detailed below.

      2. Description of the planned revisions

      In the following, we detail how we plan to address the points raised by the Reviewers. The order in which we treat the points follows their – in our view – relative importance according to the Reviewers’ feedback. In particular the first item below, under (A), is the main point of criticism that we feel we should address carefully for the future revised version.

      (A) Major point raised by Reviewer 2: “However, the study falls short on addressing the mechanism of this regulation and if it is different of other feeding regulated mRNA oscillations. This diminishes the significance of the study unless additional mechanistic details are provided.” , which is cross-commented both by Reviewer 1: “More importantly, clues to the mechanism (e.g. iron, heme) regulating the rhythmic translation of IRP1 and IRP2 IRE-mRNAs in liver would increase the significance of the work.” as well as by Reviewer 3: “Reading the comment from Reviewer #2 over the lack of a mechanism to explain why only four transcripts with IREs amongst a larger pool are subject to circadian regulation by IRPs somehow reduces the significance of the study, one has to agree that a discovery - likely another component in the system - is wanting. I remain of the view that the present work exposes this "weakness" of the entire field in a global as opposed to a partial manner and in doing so, makes a significant contribution, especially by further sub-classifying the IRE-containing transcripts according to their responsiveness in the diurnal occupancy of their IREs.”

      Our response and revision plan: Indeed, in the original version of our manuscript we established the link to feeding, yet we did not pinpoint the precise molecular cue that could underlie the rhythmic regulation observed on certain IRE-containing mRNAs. We did discuss the molecular candidates quite extensively in the Discussion section of the manuscript (Fe2+; oxygen; reactive oxygen species), and it remains quite obviously the main question whether the observed diurnal control could be mediated directly by changes in intracellular iron availability.

      Of note, the preprint by Bennett et al., for which we cite the initial biorXiv version in our manuscript, was updated very recently (https://doi.org/10.1101/2023.05.07.539729 – see version submitted December 18, 2023). It now includes new data that analyses around-the-clock iron levels also in liver. Briefly, the preprint shows, first, that serum iron is rhythmic with a peak during the dark phase at ZT16 (Figure 1D in Bennett et al.) yet loses rhythmicity when feeding is restricted to the light phase (Bennett et al., Figure 2E), indicating both feeding-dependence and circadian gating. Moreover, liver total non-heme iron – quantified using a method that measures both ferrous Fe(II) and ferric Fe(III) – shows low-amplitude diurnal variations which, however, do not meet the threshold for rhythmicity significance (Bennett et al., Figure 3G). Still, the difference between timepoints ZT4 (lower iron; light phase) and ZT16 (higher iron; dark phase) is reported as significant, with a fold-change that is not very pronounced (not compatible with the observed direction of regulation of Tfrc mRNA, whose higher abundance in the dark phase would rather be in line with lower *cytoplasmic iron levels, as pointed out by the authors.

      Thus, at first sight the analyses by Bennett et al. would appear to answer part of the Reviewer’s question and point towards other mechanisms of regulation than iron levels themselves. However, it should be pointed out that the particular methodology for iron measurements used by the authors includes the use of reducing reagents and hence quantifies the sum of Fe2+ and Fe3+ iron. Large amounts of iron are stored in the liver in the form of ferritin-bound Fe3+, yet the bioactive, low-complexity iron that is considered relevant for IRP regulation is in the Fe2+ form. Therefore, the question whether bioactive ferrous iron levels follow a daily rhythm, compatible with the observed IRP/IRE rhythms described in our manuscript, still remains an open question and warrants a dedicated set of experiments that we are proposing to conduct in response to the Reviewers’ comments.

      Briefly, for the revision we propose to use liver pieces from the two relevant timepoints of our study (i.e., ZT5 and ZT12) and apply a method that allows the separate quantification of Fe2+ and Fe3+ (Abcam iron assay ab83366; this assay can be adapted to liver iron measurements, see e.g. PMID31610175, Fig. 4A). This experiment will provide novel and decisive data on the molecular mechanism that may regulate the IRP/IRE system in a rhythmic fashion and therefore add to the significance of our findings, as requested by the reviewers.

      Moreover, we believe that the outcome of the experiment would be very interesting either way, i.e. if we find rhythms in Fe2+ that are compatible with rhythmic IRP/IRE regulation, we would be able to provide excellent evidence in term of likely molecular mechanism and rhythmicity cue. If, by contrast, we find that Fe2+ is not rhythmic, it will point towards a mechanism that is distinct from simple Fe2+ concentrations.

      In the latter case, collecting additional evidence on relevant alternative molecular cues would be beyond our capabilities for this particular manuscript, as it would require quite sophisticated methodological setup and preparation. For example, one could imagine that measuring around-the-clock liver oxygen levels in vivo – another candidate cue – would be highly interesting, yet we would not be able to conduct these experiments in a reasonable time frame (to start with, we would first need to request ethics authorisation from the Swiss veterinary authorities, which would in itself take ca. 4-6 months before we could even start an experiment). Thus, in the case of non-rhythmic iron levels, we would leave the question of other responsible cues open, but still think that with a balanced discussion of the resulting hypotheses we could provide significant added value to our work.

      (B) Major comment raised by Reviewer 1: “Alas2 is expressed mainly in erythroid cells and not liver, whereas Alas1 is ubiquitously expressed. Therefore, it is possible that Alas2 in this study may originate from red cells/reticulocytes in the liver, and not from hepatocytes.”

      Our response and revision plan: We would like to thank the Reviewer for the comment that is indeed pertinent. It is well established that Alas1 is the main transcript encoding delta-aminolevulinate synthase activity in hepatocytes, and Alas2 is about 10-fold less abundant in total liver RNA-seq data (quantified form own RNA-seq data, not shown).

      We are nevertheless relatively sure that the Alas2 signal comes from low expression in hepatocytes; the best argument in support of this hypothesis is the analysis of single-cell RNA-seq data, as shown in the following Revision Plan Figure 1, which we would be happy to include in a revised version of the manuscript if the reviewers wish:

      (C) Minor comment raised by Reviewer 1: “The paper is dense and not easy to read. For example, the section on Tfrc regulation and NMD regulation is lengthy and perhaps not necessary for the paper and the section on "Previous observations in IRE-IRP regulation...." could be included in the discussion rather in than in the Results section. Some figures could be included in a supplement.” continued in Referee cross-commenting “I agree with Reviewer 2 that the first sections in the manuscript are lengthy and not needed.”; moreover, Reviewer 2: “Also, the manuscript first sections (which mainly describe negative results) seem too long and descriptive.”

      Our response and revision plan: We shall reorganize the paper accordingly, with the aim of making it an easier, shorter, clearer read. Many thanks for the input.


      (D) Minor comment raised by Reviewer 1: “A description of the new anti-IREB2 antibody is needed. What IRP2 sequence was used to generate antibodies?”

      Our response and revision plan: The following information will be included in the manuscript: “Rat monoclonal antibodies against ACO1/IRP1 and IREB2/IRP2 were generated at the Antibodies Core Facility of the DKFZ. Briefly, full-length murine ACO1/IRP1 and IREB2/IRP2 proteins, fused to a poly-histidine tag, were expressed in E. coli and purified on Ni-NTA columns using standard protocols. Purified His-tagged proteins were used to immunize rats and generate hybridomas. Hybridoma supernatants were first screened by ELISA against His-tagged ACO1/IRP1 and His-tagged IREB2/IRP2. As an additional control, supernatants were tested against full-length His-tagged murine ACO2 (mitochondrial aconitase), which shares 27 and 26% identity with ACO1/IRP1 and IREB2/IRP2, respectively. Supernatants reacting specifically with ACO1 or IREB2 were validated by western blotting using extracts from wild-type versus ACO1- or IREB2-null mice.”

      (E) Minor comment raised by Reviewer 1: “A model summarizing the data would be useful.”

      • *Our response and revision plan: Thank you for the suggestion – this will be done.

      (F) “Optional” idea raised by Reviewer 3: “One nuance in the field of circadian biology is that a rhythm is deemed to be genuinely "circadian" when it continues in the absence of zeitgebers. In this sense, although all experiments are valuable, the "collapse" of the rhythm in the paradigms where dietary rhythms have been disrupted makes the phenomenology a candidate "epiphenomenon" rather than being closer related to the biological clock(s). Likewise, in the manuscript we never learn how the liver IRE-binding activity behaves in constant darkness.”

      Our response and revision plan: This is an important aspect that we can clarify more specifically in our manuscript. It is true that constant (darkness) conditions are used to call a phenomenon circadian. We would nevertheless argue that for a rhythmic feature that is specifically found in liver, the constant darkness definition to distinguish circadian from non-circadian is not fully valid because even in constant darkness, the liver clocks are not in a free-running state but continue to be entrained by the SCN clock (it is only the latter that is free-running under these conditions).

      In our manuscript, we actually suggest that the observed rhythms are not a core output of the circadian machinery (Fig. 6 of our manuscript), but indirectly engendered through feeding rhythms, which are coupled to sleep-wake cycles and thus connect in an indirect way to the central circadian clock activity in the SCN.

      In wild-type mice we would therefore expect that irrespective of constant darkness or light-dark entrainment (and assuming ad libitum feeding), the hepatic rhythms of the relevant IRE-containing transcripts would persist in a similar fashion.

      (G) “Optional” idea raised by Reviewer 3: “Where the authors mention in a parenthesis "moreover, there are documented links between iron and the circadian timekeeping mechanism itself", I invite them to take a closer look to the paper Konstantinos Mandilaras and I coauthored in 2012 "Genes for iron metabolism influence circadian rhythms in Drosophila melanogaster". In that work, we showed that RNA interference of genes that are required for iron sulfur cluster formation (including on IRP1) in the central clock neurons of the fly result in loss of the circadian rhythm when flies were kept at constant darkness (not so when they were kept under light:dark oscillation). So this point should probably remain open..”

      Our response and revision plan: We would like to thank the Reviewer for pointing out this interesting connection that would fit well into the context of our manuscript. It should be cited in the context of our current Figure 3, where we measure in vivo and in tissue explants whether IRP-deficiency affects the clock itself.

      To follow Reviewer 3’s idea, we have gone a little further in our analyses of around-the-clock expression data to see if any of the components of the Fe-S assembly machinery is rhythmic itself, which could have the potential to add novel information.

      Briefly, we have used for this purpose our around-the-clock RNA-seq and ribo-seq data from PMID 26486724. In summary, we find that the expression at RNA and/or footprint level is non-rhythmic for the vast majority of genes involved in FeS biogenesis, assembly or transport, with the exception of low-amplitude rhythms for Glrx5 and Iba57 (Revision Plan Figure 2).

      By contrast, all of the following other genes are non-rhythmic throughout (list of Fe-S-relevant genes from PMID34660592): Cytoplasmic/nuclear, all non-rhythmic: Cfd1=Nubp2, Nbp35=Nubp1 , Ciapin1, Ndor1, Iop1=Ciao3=Narfl, Ciao1, Ciao2b=Fam96b, Mms19, Ciao2a=Fam96a; mitochondrial, all non-rhythmic: Iscu, Nfs1, Isd11=Lyrm4, Acpm=Ndufab1, Fdx1, Fdx2=Fdx1l, Fxn, Hspa9 Hsc20=Hscb, Abcb7, Alr=Gfer, Isca1, Isca2, Nfu1

      As these are mainly “negative results”, and as we are also unable to propose a solid possible mechanistic connection between the Glrx5 and/or Iba57 rhythms and the rest of the story of our manuscript, we do not intend to include such data in our manuscript, but are only putting it for the record into this rebuttal.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      NONE

      4. Description of analyses that authors prefer not to carry out

      NONE – we think we can address all points as described above.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The present manuscript highlights the previously neglected component of diurnal rhythms into the study of iron regulation in the liver, a key organ in the systemic regulation of the metal. A major, well substantiated finding is that IRP1 takes over from IRP2 as a highly relevant regulator during a time of maximal use of the combined system. The presence of a "dietary signal" that sustains the cycling of cellular IRE binding activity in the liver, although undisputable, is perhaps a lesser claim until these or other investigators can confirm or refute the possibility that iron itself (i.e., the best-established factor affecting cellular IRE-binding) is such a signal. If iron does the job, then the interest lies in showing diurnal rhythmicity of its availability in the circulatory system, presumably linked to dietary iron absorption. There is plenty of clinical evidence of this in humans, but the present study along with the cited preprint by Bennett et al. appear to be the first demonstrations of this diurnal variation in mice.

      The manuscript has other strengths not immediately evident from the above claims made in the abstract. It contains an elegant balance of reanalyzing and reassessing "big data" produced by the same laboratory in the past in light of new experimental findings that have appeared in the meantime in the literature together with important new additions of such data from combined RNAseq and ribo-seq collections using IRP1 and IRP2 knockout animals and, importantly, by making use of published material from other studies that have provided relevant comparators from other knockouts that studied aspects of liver circadian biology. The approach, besides providing robust testing of the ideas presented, opens the field to questions that remain unanswered but seem highly relevant (I come to these below). The authors write in a very open manner not only about the new findings but also about aspects we do not understand, and their systems biology approach is likely to generate new hypotheses to address incognita.

      Let me therefore be clear upfront that the response that follows is written on the premise that I evaluate the work presented as ready for publication: The figures have been constructed with care summarizing a lot of careful investigations and the main conclusions derive seamlessly from the experimental data. Rather than taken as potential criticism to the authors, I would ask that the counterviews or limitations that may arise from my response to the paper are better taken as a celebration of the work - and views - presented. Such a discussion is only possible due to the open style of this communication mentioned above and is meant to provoke a dialogue or even drive further questioning of the datasets and the design of future experimental approaches. If the authors find any of these comments useful for their revision, they are welcome to take them onboard, but everything that follows should be read under the term "optional".

      One nuance in the field of circadian biology is that a rhythm is deemed to be genuinely "circadian" when it continues in the absence of zeitgebers. In this sense, although all experiments are valuable, the "collapse" of the rhythm in the paradigms where dietary rhythms have been disrupted makes the phenomenology a candidate "epiphenomenon" rather than being closer related to the biological clock(s). Likewise, in the manuscript we never learn how the liver IRE-binding activity behaves in constant darkness. Where the authors mention in a parenthesis "moreover, there are documented links between iron and the circadian timekeeping mechanism itself", I invite them to take a closer look to the paper Konstantinos Mandilaras and I coauthored in 2012 "Genes for iron metabolism influence circadian rhythms in Drosophila melanogaster". In that work, we showed that RNA interference of genes that are required for iron sulfur cluster formation (including on IRP1) in the central clock neurons of the fly result in loss of the circadian rhythm when flies were kept at constant darkness (not so when they were kept under light:dark oscillation). So this point should probably remain open.

      Given the phenotypes collected from RNAi in different cell types, we became sensitive to the notion that different cell types may work with different sets of what we might call collectively "iron metabolism genes". Thus, while reading the present differences between the relative contributions of IRP1 and IRP2 in mouse liver at different times during the day (and night), I kept wondering if both function in the hepatocytes or whether the macrophages or other cell types in the liver may have their own particular contributions.

      Another issue raised by the authors early on relates to the differential effects of IRP1/2 "activation" on different IRE-containing transcripts. This is a fascinating problem, not answered by the six transcripts shown in figure 1G, but I consider that the present paper offers a great service to the field in figure 5I, where an even more comprehensive grouping of IRE-containing transcripts is provided in terms of their "regulation" by IRPs. Future research should attempt to discover features that correlate within the sets of transcripts, as grouped in here.

      Thus, another point made repeatedly by the authors that despite four decades of work on the IRPs we still have open questions about how they "regulate" is well taken. In the same tone, their results in relation to IRP2 degradation show that the story of how the presence of iron leads to the degradation of IRP2 has not been fully elucidated, either.

      Referees cross-commenting

      Reading the comment from Reviewer #2 over the lack of a mechanism to explain why only four transcripts with IREs amongst a larger pool are subject to circadian regulation by IRPs somehow reduces the significance of the study, one has to agree that a discovery - likely another component in the system - is wanting. I remain of the view that the present work exposes this "weakness" of the entire field in a global as opposed to a partial manner and in doing so, makes a significant contribution, especially by further sub-classifying the IRE-containing transcripts according to their responsiveness in the diurnal occupancy of their IREs.

      I would like to reinforce the comment of reviewer 1 with respect to the antibodies used in this study that should be made available to the community, given the specificity described. My congratulations to the authors.

      Significance

      A lesson, perhaps, for the field is that sometimes more than one mechanism may be at play in different cellular or physiological contexts, while vigorous testing requires time and resources and we should value examples of such care and openness, an example of which is offered, in my view, by the present study.

      Fanis Missirlis

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript entitled "Diurnal control of iron responsive element (IRE)-containing mRNAS through iron regular proteins IRP1 and IRP2 is mediated by feeding rhythms", Nadimpalli et al. uncover and mechanistically dissect how the circadian clock and feeding regulates the expression of proteins involved in iron homeostasis in mice. The authors first utilized RNAseq and ribose data and found that a subset of mRNAs containing IREs display rhythmic translation in the liver and/or kidney. The authors then utilized previously published or newly generated datasets to study the origin of these oscillations. After a careful and thoughtful examination, they determine that the oscillations of those mRNAs in the liver are mainly driven by feeding-associated signals, although they are influenced by other factors. This is a very careful and detailed study that addresses an important issue. The study is really rigorous and the logic is very well explained. So overall this study is very solid and the main conclusion of the study (that the oscillations of those mRNAs are driven by feeding) is solidly established. However, the study falls short on addressing the mechanism of this regulation and if it is different of other feeding regulated mRNA oscillations. This diminishes the significance of the study unless additional mechanistic details are provided. Also, the manuscript first sections (which mainly describe negative results) seem too long and descriptive. Still this is an important and solid study.

      Significance

      The main issue this reviewer has with the manuscript is the significance. Without additional mechanistic insights suggesting that there is something particular different about the regulation of these mRNAs the manuscript is not of extremely high significance.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This paper provides evidence for the diurnal regulation of specific subset of iron regulatory elements (IREs)-containing mRNAs in liver by iron regulatory proteins 1 and 2 (IRP1 and IRP2) in mice. The authors show that IRP2 oscillates over 24 h period to regulate IRE-containing mRNAs in the light phase, and collaborates with IRP1 to regulate IRE-mRNAs in the dark phase.

      Major Comments

      The authors have carefully performed experiments, and convincingly show that 5'-IRE containing transcripts (Fth1, Ftl1, Fpn and Alas2) display significant amplitude rhythms in ribosome occupancy in liver. Tfrc mRNA, which harbors a 3' IRE, also showed a rhythmic pattern in both liver and kidney. The changes in IRE-containing mRNAs correlated with IRP2 protein abundance. Further studies performed using Aco1 and Ireb2 knockout mice showed that both IRP1 and IRP2 are required for rhythmic regulation of IRE-containing mRNAs. Overall, the findings in this paper are interesting and novel, and show for the first time that IRE-containing mRNAs required for maintenance of cellular iron metabolism and IRP2 are subjected to rhythmic regulation. Alas2 is expressed mainly in erythroid cells and not liver, whereas Alas1 is ubiquitously expressed. Therefore, it is possible that Alas2 in this study may originate from red cells/reticulocytes in the liver, and not from hepatocytes.

      Minor Comments

      The paper is dense and not easy to read. For example, the section on Tfrc regulation and NMD regulation is lengthy and perhaps not necessary for the paper and the section on "Previous observations in IRE-IRP regulation...." could be included in the discussion rather in than in the Results section. Some figures could be included in a supplement. A description of the new anti-IREB2 antibody is needed. What IRP2 sequence was used to generate antibodies? A model summarizing the data would be useful.

      Referees cross-commenting

      I agree with Reviewer 2 that the first sections in the manuscript are lengthy and not needed. More importantly, clues to the mechanism (e.g. iron, heme) regulating the rhythmic translation of IRP1 and IRP2 IRE-mRNAs in liver would increase the significance of the work. Overall, the findings are novel, and would be of interest to the iron metabolism and circadian rhythm fields.

      Significance

      Previous studies have reported a role for iron in altering gene expression and circadian rhythms in mice. The current manuscript extends these studies to show that several IRE-containing mRNAs in liver and IRP2 are subjected to rhythmic regulation. These findings will be interest to researchers in circadian rhythm and iron metabolism fields.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank our reviewers for their constructive criticism and for their appreciation and enthusiasm for our study. Some reviewers expressed opposing views, particularly when it came to the function and identity of the Cdt1-related protein in Toxoplasma gondii. To avoid redundancy in our response, we would like to make a brief statement. Toxoplasma gondii and other apicomplexan parasites utilize unique and highly unusual modes of cell division; numerous studies suggest that multiple phases can run concurrently in apicomplexan cell cycles. The best-known examples include the asynchronous S/M cycles in schizogony and concurrent mitosis and budding in Toxoplasma endodyogeny. These overlapping phases are not a feature exclusive to apicomplexans, since in budding yeast, cytokinesis initiates in G1 phase by marking the location of budding on the surface of the mother. Based on years of previous research and from our experience, we adjusted our approach by focusing on the processes that are associated with each cell cycle phase rather than on their temporal order. While the model of a conventional cell cycle guides our studies, we “follow the breadcrumbs” that we discover and the published studies to create a more accurate model of apicomplexan cell cycle instead of relying on the traditional cell cycle map employed by distantly related eukaryotes. Below are point-to-point responses to reviewers’ comments.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: Hawkins et al. employ a reverse genetic approach to analyze the molecular function of the Toxoplasma gondii kinase Crk4 and the Toxoplasma gondii cyclin 4. The authors combine inducible depletion with imaging, (phospho-)proteomics, molecular modeling, and protein-protein interaction studies.

      Major comments: - The major conclusion of the manuscript is that TgCrk4/TgCyc4 regulate entry into mitosis and that the primary role of TgCrk4 is to suppress DNA re-replication and chromosome re-duplication (lines 105-106). The authors also provide evidence that TgCrk4 interacts with TgCdt1, a DNA licensing factor ("TgCdt1" is missing in line 107). (had been corrected) By sequence homology, the authors found homologues of TgCrk4 only in apicomplexan parasites with binary division and concluded that the dominant division mode, presumably schizogony, is repressed in these organisms in favor of binary division. Indeed, internal budding and daughter cell formation is defective in the inducible depletion mutants of TgCrk4 and most experiments focus on this developmental stage. However, the analysis of preceding events, such as DNA replication is rather brief. If G2 is indeed regulated by TgCrk4/TgCyc4, one would assume that the parasites are post-S phase and the nucleus contains two copies of the genome, as indicated in Fig. 2C. The data shown in Fig. 3H and 7A, however, show that the TgCrk4 and TgCTD1 depletion induces a developmental arrest pre-S phase. This contradicts the main conclusions of the manuscript.

      *We agree that the G2 location is odd for a conventional cell cycle model. Given the high possibility that cell cycle phases can overlap in apicomplexans, we determined the relative position of G2 phase in Toxoplasma endodyogeny by instead focusing solely on the processes that are attributed to a specific cell cycle phase (such as DNA replication for S phase, DNA re-replication for G2 phase, DNA segregation for mitosis). Our approach shows that Toxoplasma G2/M checkpoint operates upstream of SAC, which led to enrichment of parasites with replicated DNA (Fig. 3H and Fig. 7A), which places G2 at the end of S-phase. Our focus in the present study is on the G2 functions, the control of centrosome and chromosome reduplication, but we appreciate the suggestion to examine DNA replication in Toxoplasma, which could be investigated in future studies. *

      Indeed, many data of this manuscript could support an alternative conclusion, i.e., that TgCrk4 regulates entry into S-phase (similar to Plasmodium falciparum Crk4: PMID: 28211852). This alternative conclusion is supported by the data showing that TgCyc4 is in the nucleus during S-phase (Fig. 1H) and that TgCrk4 interacts with TgCdt1, which has a well-known role in origin of replication licensing and loading of the MCM complex. MCM subunits were less phosphorylated in absence of TgCrk4, which could also suggest a role for TgCrk4 in S phase. Together, it seems more parsimonious to interpret the data as a DNA replication phenotype rather than a phenotype in G2.

      *We understand some confusion from prior data, but PfCrk4 is not orthologous to TgCrk4 (Alvarez & Suvorova, 2017); The true TgCrk4 ortholog had not been found in Plasmodium genomes. Our understanding is that nuclear accumulation of TgCyc4 in S-phase activates TgCrk4, which leads to repression of the DNA reduplication. One of the possible mechanisms involves interfering with loading of the MCM complex on chromatin mediated by hyper-phosphorylated TgiRD1 (former TgCdt1), which has been reported in other eukaryotes. We also believe that increased MCM phosphorylation indicates entry into or active S-phase, while the reduced phosphorylation that was detected in Crk4-depleted cells supports a block at the end of S-phase (G2). *

      • *

      The currently provided data on the DNA content are, however, clearly insufficient to draw firm conclusions. The gating strategy (dotted lines in Figs. 3H, 7A) is unclear. Why are populations, e.g., not separated at the lowest part of the depression in the histogram, but shifted towards lower DNA content? This seems to overestimate the percentage of cells that have a higher DNA content and the statement in lines 269-271, i.e., that TgCrk4 deficient parasites break the "once and only once" rule, is not supported by data.

      *We corrected the gating of the FACScan plots to separate G1, S, G2+M, and parasites with over-duplicated DNA. Please note that, in general, the cell cycle gating of FACScan data is relative and somewhat subjective when it comes to the gaussian curve. Independent of the chosen gates, our data show that removal of either TgCrk4 or TgiRD1 led to substantial decrease of the G1 population (reduction of 1N peak) accompanied by increase of parasites in the process of replication, completed replication (increase of 1.8 N peak), as well as undergoing DNA re-replication, which supports our claim in lines 269-271. In the case of TgiRD1, the number of parasites with re-duplicated DNA nearly doubled upon 8h of factor deficiency. *

      • *

      It is also unclear how may biological replicates are represented by these data (Figs. 3H, 7A), a critical wild type control at t = 4 h is missing, as well as a statistical analysis. Alternatively, the authors could use microscopy to quantify the DNA content of individual nuclei, which would yield a direct read out on whether a nucleus is in pre-S phase, S-phase or post-S phase. Defining the onset of S-phase indirectly by the number of centrosomes per cell seems imprecise, given the small size of the structure and the resolution of the microscope. Without solving these issues, the major conclusions and several minor statements throughout the manuscript are in question.

      *Thank you for your point, we performed a minimum of three independent experiments to evaluate the DNA content of TgCrk4- or TgiRD1- (former TgCdt1) depleted tachyzoites and have now indicated this in the figure legends. The 0h time point is a “wild type” control, since the parasites that expressed factors were incubated without auxin (mock treated) for 4h. The DNA content of Toxoplasma has been thoroughly studied and we are thus confident our 0h data is a good representation of asynchronous healthy populations. Although the parental strain had been examined, due to the data density mentioned in the reviews, we included only relative results (control and two experimental points) for clarity. Our concern with using microscopy to analyze DNA content is that it can be highly subjective, hinging on the quality of staining and imaging, while flow cytometry produces more unbiased datasets. We have considered the concern that the start of centrosome duplication can be difficult to identify, but the centrin-positive centrosomes move apart by the middle of S-phase. The independent structures are then distinct and easy to resolve, providing a popular means of marking G1/S transition in Toxoplasma. *

      • Lines 187-189: The mentioned checkpoint is unclear and so is the "specific cell cycle population". Fig. 2B analyses budding, but as the final step in the cell cycle, the knock down parasites may have arrested at various other stages of the cell cycle. In addition, it is unclear on which primary data Fig. 2B is based. It appears these may be at least partially shown in Fig. 3. If so, please reorganize as this is highly misleading.

      *“A checkpoint” in the indicated lines refers to G2/M and SAC, which are regulated by TgCrk4 and TgCrk6, respectively. We refer to “specific cell cycle population” since each transgenic parasite that is subject to G2/M or SAC arrest can allow us to isolate very different cell cycle stages. TgCrk6-dependent arrest had been confirmed by the presence of unresolved centrocone (not shown but was previously reported in Hawkins et al., 2022), while we thoroughly examined the novel TgCrk4-dependent block by focusing on many parameters, such as joint centrosomes, single-bud assembly, or unresolved apicoplast. Fig. 2 and Fig. S2 summarize our rigorous quantifications of these phenotypes. For convenience, we used budding efficiency as a readout to compare arrest and release of G2/M and SAC, which was incorporated in Fig. 2B. Table S4 contains the primary data used in all figures in the manuscript, including Fig. 2B. *

      • Line 246-254: It is unclear how many biological replicates were performed and how many cells were analyzed to conclude that TgCrk4 deficient parasites cannot form a bipolar spindle (Fig. 2H, S3B). This, together with the possibility that the developmental arrest occurs pre-S phase (Fig. 3H), does not support the statement, that the G2/M transition is regulated by the novel TgCrk4-TgCyc4 complex.

      We have indicated our replicates in the M&M. As addressed for Fig. 3H above, these IFA experiments were performed in at least three independent experiments.

      * * Minor comments: - Throughout the manuscript, please reorganize and present the figures in order of appearance in the text. Also, Fig. 1G summarizes data that are only presented in Fig. 1H. Please reorder. Similarly, Fig. 2C appears to summarize data that are only presented later.

      *Thank you for the suggestion, however we must abide by the standards of the publishers. The order of the figures must be maintained, but there is a substantial degree of freedom in organizing panels within figures. Fig. 1G summarizes data shown in Fig. 1F, H, while Fig. 2C summarizes many panels including preceding Fig. 2B and Fig. S2. Most of our schematics are placed at the top of figures to provide guidance for the relevant experiments. *

      • Why was only the "G1" timepoint quantified in Fig. 1H? Do the other images shown in F and H represent the majority of cells analyzed?

      *You are correct, we indicated the percentage of factor-positive parasites only when the factor emerges during a specific cell cycle phase. For example, the TgCyc4-positive parasites with 1 centrin dot were quantified to show that TgCyc4 emerges in the middle of G1 phase. The lack of a number indicates that the image represents all the parasites progressing through this phase; we have added this explanation to the figure legends. *

      • Several micrographs lack scale bars (Fig. 1B, D; 2E, F, H, I; 6D; 7F, H and S2G, S3A, B; S5A, B, D).

      *Thank you, we have added the scale bars to indicated images.

      *

      • Lines 83-85 and 93-95: Recently several publications investigated the cell cycle of the apicomplexan parasite Plasmodium and data are accumulating, showing that there may be a gap between the last S phase and segmentation (e.g., PMID: 35731838; PMID: 35353560), which may be interpreted as a G2 phase. Thus, these statements could be revised to reflect the current literature.

      *The studies mentioned provide very valuable insights into S-phase dynamics; the gap that was detected between S-phase and segmentation includes mitotic events such as prophase, metaphase, and anaphase prior to telophase (karyokinesis to segmentation). However, studies using means like stage-specific markers could help resolve the composition and order of events in the apicomplexan cell cycle. We used processes specific to G2 (repression of DNA and centrosome reduplication) and identified TgCrk4/TgCyc4 as the first G2 markers in apicomplexans. *

      • Fig. 4 shows the effect on protein abundance and phosphorylation upon TgCrk4 depletion. Fig. 4B seems somewhat redundant as a more detailed analysis with two timepoints is shown in the rest of the figure.

      *Fig. 4B is provided in contrast to the plot in Fig. 4A. It demonstrates that TgCrk4 depletion results in a far more pronounced effect on global phosphorylation rather than on proteolysis. While Fig. 4B highlights the checkpoint arrest, panels C and D are dedicated to the search for TgCrk4 substrates: the phospho-sites that immediately lost intensity of phosphorylation and remained low during the 4h block. *

      *

      *

      • Lines 146-148: This statement is confusing in light of the expression data in Fig.1 F and H. If they stabilize each other, how is TgCrk4 stabilized in G1, when TgCyc4 is absent?

      We believe that multiple mechanisms contribute to the stability and function of TgCrk4. We tested one and found that depleting the cyclin partner led to reduced expression of TgCrk4, and were able to conclude that the complex is stable when both subunits are expressed. Please note that we probed the mixed cell cycle populations by WB, and our proteomics data show that TgCrk4 interacts with many partners (Fig. 1E). Thus, it is likely that G1 stability may have been mediated by other partners, or by a higher transcription/translation rate, which could be evaluated in further experiments that focus on the regulation of TgCrk4/TgCyc4 complex.

      • *

      • Fig. 2D, and G: Please provide representative images of what has been quantified, as E/F and H/I are apparently UxEM images.

      The corresponding images are included in Fig. S2.

      • Line 236-243: This statement seems to be based on a single IFA shown in Fig. 2K. If so, the manuscript would benefit from clearly stating that this is a singular observation.

      *Thank you, we have provided clarification as described in previous points. *

      *

      *

      • Lines 301-304: In the cited publication, the TgOTUD3A knockout could not be complemented, which raises the possibility that other factors are involved. Thus, this statement would benefit from revision.

      *The lack of TgOTUD3A KO complementation is an example of the unappreciated complexity of apicomplexan cell cycle regulation by controlled proteolysis. We highlighted the similarity of TgCrk4 and TgOTUD3A deficiencies, which indirectly confirms their partnerships in the G2 network. Fig. 8A shows that, in addition to TgOTUD3A, the G2 network contains numerous factors. *

      *

      *

      • Lines 421-422: PfCdt1 was annotated in PlasmoDB some time ago and this statement needs to be revised.

      *Please see our response to comments made by Reviewer 2. Briefly, we agree with Reviewer 2 comment that TgCdt1 does not function as conventional DNA replication licensing factor CDT1. Therefore, we named TGME49_247040 TgiRD1 – inhibitor of DNA and centrosome ReDuplication 1. *

      • *

      • Lines 448-450 and Fig. 6F: Are these data from a single biological replicate and how many cells were analyzed for the different time points? Given the insufficient data on the DNA content, the paper would benefit form more conservative conclusions on the role of TgCdt1. The numbers of biological replicates were added throughout the text, also please refer to our response to Reviewer 2 and the comment above.

      Reviewer #1 (Significance (Required)):

      • This manuscript investigates the role of TgCrk3, TgCyc4 and TgCdt1s and provides a large amount of data.
      • These data will contribute to our understanding of the unusual division modes of Apicomplexa, a field of research that recently gained momentum.
      • These data will be interesting to the community of cell and molecular biologist, which work on the fundamental biology of eukaryotic microorganisms.
      • My field of expertise is the cell biology of Apicomplexa.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In this Manuscript, Hawkins et. al. describe advances in the apicomplexan parasite cell cycle, which is reminiscent but distinct from mammalian cell cycle regulation. These differences include a presumed lack of G2 phase and the ability to replicate in either a multinuclear (schizogony) or binary (endodyogeny) manner. Using Toxoplasma gondii (TG) as a model, the authors seek to expand the current understanding of how these highly variable parasitic cell cycles are regulated by describing a previously unreported G2 phase. Building on the authors earlier work, this manuscript defines the function of TgCrk4 and identifies a novel binding partner, TgCyc4. Crk4 and Cyc4 control a G2/M checkpoint by regulating centrosome duplication and separation.

      The authors also identify 247040, a protein with previously no known function, as a binding partner and substrate of TgCrk4/TgCyc4 and several replication fork proteins such as MCM and PCNA. Results indicate that the protein negatively regulates replication and centrosome duplication. The authors propose to rename this protein TgCDT1 despite "low sequence similarity" and having a completely opposite function to eukaryotic CDT1. Using Swiss-Prot modeling the authors claim 247040 bears a "partial resemblance" to mammalian CDT1. Indeed, both of these proteins show high intrinsic disorder and have 2 folded domains. While 247040, like hCDT1, does contain cyclin interacting motifs (Cy), a collection degrons (not all shared with other CDT1 orthologs), and an NLS, the list of nuclear cell cycle proteins that also contain Cy and degron motifs would be very long. Further, 247040 is regulated in an opposite manner to all other CDT1 orthologs because it is absent in TG G1 and present in TG S phase; eukaryotic CDT1 is either degraded or relocalized to the cytoplasm in S phase, and evidence for degradation via APC/C is minimal. Crucially, loss of 247040 resulted in inappropriate replication ("re-replication"), whereas all other eukaryotic CDT1 orthologs are essential for replication. Re-replication in eukaryotic cells can be caused by excess or hyper-active CDT1, not by loss of CDT1 activity as shown here for 247040. Clearly 247040 is a negative regulator of DNA replication, and as such, is not a candidate for the TgCDT1 ortholog. If anything, it is functionally analogous to metazoan geminin, the negative regulator of metazoan CDT1; of note, geminin also has centrosome-related phenotypes. We cannot support naming 247040 TgCDT1 because it will cause confusion in the field.

      Aside from this major issue, the study is well-executed, rigorous, quantitative, and thorough; it has many strengths from the unbiased interaction screens. The authors' sequence analysis also suggests broader possibilities for cyclin structures than had previously been appreciated. We appreciate the legend in Figure 2 to the organism-specific terminology.

      Major comments: The spatiotemporal dynamics of 247040, its role in repressing TG DNA replication, lack of PIP motif and winged helix domain indicate that some other nomenclature, other than TgCdt1 will be a better name for this protein of previous unknown function.

      We would like to thank Reviewer 2 for this highly insightful comment. We agree that TGME49_247040 functions as a CDT1 inhibitor rather than as CDT1 itself, so conserving the name would produce confusion in the cell cycle field. Based on TGME49_247040 protein function we decided to name this factor TgiRD1 – inhibitor of DNA and centrosome ReDuplication 1. We revisited our data, looked deeper into the protein structure, and adjusted our conclusions. Our new Figure S5 shows differences in the predicted folding of HsCDT1 and TgiRD1. We could not ignore the fact that TgiRD1 is phylogenetically related to CDT1 in ancestral branches and metazoans (Fig. 6B), but we identified substantial differences that may indicate a selective loss (or inheritance) of protein features. For example, TgiRD1 does not interact with ORCs that are critical for the licensing step, but TgiRD1 retained an MCM binding domain (winged helix-turn-helix) that plays a role in licensing and firing. Rather than CRL4Cdt2 degrons, TgiRD1 contains APC/C degrons that would be activated late in mitosis (similar to regulation of Geminin). Together with the lack of DNA licensing control in G1 and its opposing expression profile, we concluded that TgiRD1 represents a Cdt1-related protein that controls DNA and centrosome reduplication in S and G2 phases.

      Minor comments:

      1. For clarity, please include the number of replicates in the figure legends where appropriate. We added the requested information.

      For microscopy/imaging, how were representative cells/images chosen? The representative images constituted the most common phenotype of the feature we aimed to highlight, and most are accompanied by quantifications.

      In addition to the ELM analysis, the authors could also employ fold recognition software (such as Promal) to analyze 247040 structural models to show similarity to known protein structures.

      We use a variety of folding prediction software, including AlphaFold2, PyMol, and template-based SWISS-PRO module to examine protein structures in our study, indicated in the text and figure legends. Our new TgiRD1 (former TgCdt1) analysis is based on an AlphaFold2 prediction (Fig. S5). All the software we used is listed in the M&M section.

      Line 107: missing words "TgCdt1"

      *We corrected the sentence.

      *

      Line 141: the interpretation that the C terminus is "unstable" is misleading if it is simply that the protein cannot tolerate a fusion to the C-terminus.

      *We successfully incorporated a tag at the C-terminus (confirmed by sequencing across the recombinant gene) but could not detect protein expression. If our protein could not tolerate a recombinant tag, the transgenic parasites would not survive because TgCyc4 is essential protein. Therefore, since the parasites survived, we concluded that the lack of TgCyc4-AID-HA expression was due to native truncation at the C-tail (instability). *

      Line 221: word choice "reminisced" We have changed the wording.

      Line 348 refers to Orc4 expression in Figure 4A, but the data point is not labelled. Fig. 4A references GO group (DNA replication/licensing factors), and the raw data is included in Table S6, which is now indicated in the text.

      Lines 407-8 and 510-11: Reference Fig 1E We added the reference.

      Line 408: please define what is meant by "dominant interactor" We meant that TgiRD1 is the most prominent interactor of TgCrk4 and TgCyc4. To clarify the confusion, we changed the wording to “primary interactor”.

      Reviewer #2 (Significance (Required)):

      This manuscript makes great strides in defining apicomplexan cell cycle control and genome replication. These strides include defining a previously unrecognized G2/M checkpoint controlled by TgCrk4 and the novel TgCyc4. Further, the authors identify a binding partner and substrate of the novel Crk4/Cyc4 kinase complex, 247040 that acts as a repressor of replication.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary The present study Hawkins et al have described the important role of Cyclin-CDK complex in an apicomplexan parasite Toxoplasma(Tg) which exhibit binary mode of cell division like many other eukaryotes. In the apicomplexan field it is generally shown that G2 phase of cel cycle is either absent or has very little role. The authors here demonstrate that the combination of Tg CRK4 and Tg Cyclin4 works during the G2 phase of cell cycle such as chromosome rereplication and centrosome reduplication. In order to show the function of Cyclin-CRK function they used Auxin degradation system to down regulate or deplete the protein and study parasite growth during cell cycle as well as they used tagged parasite to identify the protein complex with these two molecules. In the study they showed that these two molecules Cyc4 and cRK4 formed the complex in protein pulldown method and show identical function in the cell cycle. In addition to thiese two proteins they also found another interacting partner Cdt1 that was further analysed to be involved in controlling Chromosome rereplication and centrosome. So overall the study is nicely performed and three molecules of Cyclin4-CRk4-Cdt1 and their role is illustrated in the binary mode of cell division in Toxoplasma.

      Comments 1.Though no new experiments need to be performed but it will be good if some details are given as to which stage of tachyzoite cycle the protein complex were performed and if there is difference in the various phases of cell cycle especially the s phase and the M phase. Are these period changed. Since G2 is suppose to be absent in many apicomplexan do the authors suggest that G2 phase is only coupled to binary mode of cell division. Please discuss how it is then linked to the other part of cell cycle.

      *You are correct, we propose that the presence of G2 phase is linked to binary division in apicomplexans and our hypothesis is supported by the overall evolution of the cell cycle (see Discussion section). We also entertained the hypothesis that G2 operates in multinuclear division since all apicomplexans encode TgiRD1 orthologs (please, see the Discussion section). For the first time, we identified the major functions of G2 functions (repression of the DNA and centrosome reduplication) in the apicomplexan cell cycle. However, given the unresolved organization of the Toxoplasma (or any apicomplexan) cell cycle, it is currently impossible to define the boundaries of G2. According to our study, TgCrk4 and TgCyc4 control G2/M transition or the end of G2 phase, and we still lack markers of G2 entry. In our comparative synchronization study (Fig 2), we uncovered the temporal link between G2/M and SAC regulatory points, which is discussed in the results section. *

      Ganter et al have studied CRK4 in Plasmodium previously and they do find in their phosphoproteome study the similar association with the DNA replication machinery with CRK4 but no cyclin was identified in their study. In the cyclin study by Roques et al it has been shown that no cell cycle cyclins are found in Apicomplexan so can the author discuss more how these complex can be different in two apicomplexan species. They describe that Crk4 is novel cell cycle kinase though this has been studied earlier. Authors have almost not discussed these previous finding with respect to their in this study.

      *We would like to clarify this confusion. We have not discussed Ganter et al. studies because PfCRK4 is not orthologous to TgCrk4, but rather it is related to TgCrk6. Unfortunately, the Plasmodium and Toxoplasma Crk nomenclature was published almost concurrently. Our previous (Alvarez & Suvorova, 2017) and current study show that Plasmodium and other apicomplexans that divide by multinuclear division do not encode TgCrk4 orthologs (and/or TgCyc4). Additionally, the mentioned studies by Roques and Ganter were released prior to newer genome annotations that include additional cyclin-domain proteins, including 10 Toxoplasma cyclins (5 new) that we categorized in our recent publication (Hawkins et al., 2022). Although the newly annotated cyclins are not related to conventional cell cycle cyclins, we had proven empirically that TgCyc1 together with TgCrk6 controls SAC, and now, the specific interaction of TgCyc4 with TgCrk4 controls G2 processes. Lastly, we call TgCrk4 “a novel” kinase only in the meaning that it is a novel cyclin-dependent kinase that is not related to known CDKs in other eukaryotes. The identification of TgCrk4 in our previous study (Alvarez & Suvorova, 2017) is described in the Introduction section and at the opening of the Results. *

      The manuscript is too dense, in terms of both figures and text. At times loses the focus and hence can be organised with most important finding in the figure and text. Especially Fig2, Fig4 and Fig7. Fig5 does not give too much in terms of the real finding an in fact take away from the focus. Some parts of these figures can be simplified or moved to supplementary. Some of the figures in Fig2 and 7 are missing the scale bars.

      We respectfully disagree with some conclusions made by the Reviewer. Our study contains ample material that is intended to guide the reader through the complexity of the Toxoplasma cell cycle and the intricate structures contained in the parasite. We have also introduced a few novel approaches that require additional schematics and dedicated discussions.

      • Fig 2*. The G2/M block, as well as the G2 phase, had never been detected in apicomplexans. We created a new approach to determine the timing of the G2/M checkpoint, which involves comparison to a known cell cycle block. Panels A, B, and C provide visuals and summarize our findings. The main events are highlighted with arrows (Panel C), while graphs (panel B) show differences in responses. The rest of the figure is devoted to quantification of the primary events caused by TgCrk4 deficiency, since the G2 block had never been examined. While the U-ExM images of the entire vacuole (2-4 parasites) may seem overwhelming, they represent that the deficiency is consistent. *
      • Fig 7* is devoted to the major Crk4/Cyc4 interactor TgiRD1 (former TgCdt1). This is one of the first mechanistic studies of central cell cycle regulators in Toxoplasma. This Cdt1-related protein was examined at the molecular level to support the main claims of its control of G2 Nevertheless, we moved two panels from Fig. 7 into the supplement. *
      • 4* is organized as follows. Top row: panels A, B visualize the G2/M checkpoint block at the protein level. Middle row: panels C, D, and E represent the workflow to find TgCrk4 substrates. Bottom row: panels F, G highlight TgCrk4 substrates of interest that are discussed in the paper. *
      • 5* is an in-depth analysis of the central cell cycle regulators across Apicomplexa phylum, a key figure of the study. Its comparative nature supports our main message: binary division is regulated by TgCrk4/TgCyc4, which are only expressed in a subgroup of apicomplexans that divide in a binary mode. *

      May be bit more discussion of ORC in relation to their Cyclin-CRK complex as they did find upregulation of the ORC in their genome profiling. So may be instead of CDT1 these are more important in the licencing of DNA replication.

      *Our choice to focus on Cdt1-related protein was driven by the fact this protein is a major component of the TgCrk4/TgCyc4 complex, while the ORCs act downstream (as TgCrk4 substrates). Shifting focus to ORCs opens an entire new project, which will be explored in the future. *

      5 The model in Fig8B does not take Cyc4 into consideration and I feel is bit oversimplified as there are many factors that may be responsible for centrosome non separation. The S and G2 are no separated in the Cell cycle as given in this Fig.

      Referring to comment 3, we focused on empirically supported, central findings and created the first model of centrosome cycle regulation in T. gondii. We intentionally drew focus to TgCrk4, which was extensively studied, while TgCyc4 received less attention due to difficulties in modulating its expression. We have used transcriptional downregulation to evaluate TgCyc4 (tet-OFF model), which is unfavorable for cell cycle studies because it exceeds the duration of the cell cycle. The unclear cell cycle borders are addressed in the introduction to this response. Briefly, the organization of apicomplexan cell cycle is currently unclear, thus most of the schematics are approximate.

      It is not clear from the data with CDt1 if this linking the inner and outer centrocone or its down regulation breaks the bipartite centrosome. May be some reflection it will be useful.

      *Our model suggests that both TgCrk4 and TgiRD1 (former TgCdt1) affect only the inner core of the centrosome, which we propose is comprised of two types of linkers. The arrows in Fig. 8 point specifically to the linkers whose stability depends on the expression of TgCrk4 or TgiRD1. *

      Minor comments

      I what is SAINT analysis as it is not described in methods.

      *We added the description of our SAINT analysis to M&M.

      *

      How was budding quantified

      *We supplemented the figure legend with the required information. *

      Western blot can have predicted size

      *Due to density of the figures, we did not supply the predicted MW of the proteins when they display the proper PAGE motility. *

      what does red star mean in Blot 1C

      *We added the description to the figure legend.

      *

      What does the number in Fig1H means please explain in the legends and same for Fig6F. In fig 1, removing the inhibition for 5 hours led to very less budding, but in fig 3, removing inhibition showed increased budding (50% in 2 hours). Please explain

      *Please see our response to the reviewer 1 minor comment regarding Fig. 1H and 6F. *

      *We presume that there is some confusion regarding figure numbers. Perhaps the Reviewer refers to Fig. 2B. Indeed, the 4h block at G2/M led to reduced budding (Fig. 2B), while release from the block for 2 hours (Fig. 3C, post-recovery) allows parasites to continue cell cycle progression and reach the next stage –budding. The numbers over the Fig. 3A, B, and C panels are from the plots in Fig. 2B to help give a comprehensive representation of the analyzed timepoint. *

      Fig2 has no scale bars -please add- this figure is too dense. May be fig2A, B,C can be in supplementary, legend in the figure can be in the figure legend.

      Please see our response to comment 3. We have included scale bars.

      Also this figure2 H and I in not quoted in line 231. Also this figure2 has no panel J but goes directly from I to K

      *The alphabetical order was corrected, and the reference added. *

      Fig3 the FigG can be more relevant in the Figure 8 while describing about the Crk4 and Cyc4 and CDt1 in binary mode of cell division. Also please define what stars mean either in legend or methods section in terms of significance.

      *Thank you for the suggestion. The Fig. 3G schematics summarize the overall findings of the Figure and acts as an intermediate conclusion in this study. We added the meaning of the stars in the M&M section. *

      Line 107 the sentence is incomplete

      We have corrected the sentence.

      Line 217 may be the figure could be referred as then it is not cleat about the description.

      Due to the density of the figures and well-established dynamics of the centrocone and basal rings, we included the reference to a publication rather than as a figure panel.

      **Referees cross-commenting**

      The study is quite rigrous and with analyses of CRK4-CYC4 and CDT. However it will be better if authors please revisit their conclusions on G2 phase of cell cycle in Toxoplasma based on their findings. The study will have important bearing on the community studying apicomplexan parasites and DNA replication as well as who work on eukaryotic cell cycle.

      Reviewer #3 (Significance (Required)):

      Significance In the manuscript by Hawkins etal have illustrated that in the apicomplexan parasite that have binary mode of cell division present a Cyclin-Crk complex with detailed analysis of Tg Crk4-Cyc4 that are novel in these group pf parasite infect humans and animal alike like malaria parasite and ones affecting cattle and chicken. So these finding are novel as very little is known about this interaction. The significant finding is to show how the G2 phase of cell cycle may be regulated in these parasites and how DNA licencing factor Cdt1 is highly divergent but part of this CRK-Cyclin complex.

      So though it discusses more on the Toxoplasma but it may be of interest to the scientist working on eukaryotes with divergent mode of cell cycle.

      General Assessment - The findings are novel but the manuscript is too dense and at time loses the focus. May be both text and Figures could be made less dense so that important finding are revealed in better way.

      Advance - It does give important insight into the cell cycle in apicomplexan parasite and how even though there are no cell cycle cyclin in Apicomplexa. The findings here suggest how different complexes can substitute for the function. It does extend the knowledge in the field of Cell division in divergent parasites both in terms of mechanistic, functional and technical way.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The present study Hawkins et al have described the important role of Cyclin-CDK complex in an apicomplexan parasite Toxoplasma(Tg) which exhibit binary mode of cell division like many other eukaryotes. In the apicomplexan field it is generally shown that G2 phase of cel cycle is either absent or has very little role. The authors here demonstrate that the combination of Tg CRK4 and Tg Cyclin4 works during the G2 phase of cell cycle such as chromosome rereplication and centrosome reduplication. In order to show the function of Cyclin-CRK function they used Auxin degradation system to down regulate or deplete the protein and study parasite growth during cell cycle as well as they used tagged parasite to identify the protein complex with these two molecules. In the study they showed that these two molecules Cyc4 and cRK4 formed the complex in protein pulldown method and show identical function in the cell cycle. In addition to thiese two proteins they also found another interacting partner Cdt1 that was further analysed to be involved in controlling Chromosome rereplication and centrosome. So overall the study is nicely performed and three molecules of Cyclin4-CRk4-Cdt1 and their role is illustrated in the binary mode of cell division in Toxoplasma.

      Comments

      1.Though no new experiments need to be performed but it will be good if some details are given as to which stage of tachyzoite cycle the protein complex were performed and if there is difference in the various phases of cell cycle especially the s phase and the M phase. Are these period changed. Since G2 is suppose to be absent in many apicomplexan do the authors suggest that G2 phase is only coupled to binary mode of cell division. Please discuss how it is then linked to the other part of cell cycle.<br /> 2. Ganter et al have studied CRK4 in Plasmodium previously and they do find in their phosphoproteome study the similar association with the DNA replication machinery with CRK4 but no cyclin was identified in their study. In the cyclin study by Roques et al it has been shown that no cell cycle cyclins are found in Apicomplexan so can the author discuss more how these complex can be different in two apicomplexan species. They describe that Crk4 is novel cell cycle kinase though this has been studied earlier. Authors have almost not discussed these previous finding with respect to their in this study. 3. The manuscript is too dense, in terms of both figures and text. At times loses the focus and hence can be organised with most important finding in the figure and text. Especially Fig2, Fig4 and Fig7. Fig5 does not give too much in terms of the real finding an in fact take away from the focus. Some parts of these figures can be simplified or moved to supplementary. Some of the figures in Fig2 and 7 are missing the scale bars. 4. May be bit more discussion of ORC in relation to their Cyclin-CRK complex as they did find upregulation of the ORC in their genome profiling. So may be instead of CDT1 these are more important in the licencing of DNA replication. 5 The model in Fig8B does not take Cyc4 into consideration and I feel is bit oversimplified as there are many factors that may be responsible for centrosome non separation. The S and G2 are no separated in the Cell cycle as given in this Fig. 6. It is not clear from the data with CDt1 if this linking the inner and outer centrocone or its down regulation breaks the bipartite centrosome. May be some reflection it will be useful.

      Minor comments

      I what is SAINT analysis as it is not described in methods. 2. How was budding quantified 3. Western blot can have predicted size 4. what does red star mean in Blot 1C 5. What does the number in Fig1H means please explain in the legends and same for Fig6F. In fig 1, removing the inhibition for 5 hours led to very less budding, but in fig 3, removing inhibition showed increased budding (50% in 2 hours). Please explain 6. Fig2 has no scale bars -please add- this figure is too dense. May be fig2A, B,C can be in supplementary, legend in the figure can be in the figure legend.<br /> 7. Also this figure2 H and I in not quoted in line 231. Also this figure2 has no panel J but goes directly from I to K 8. Fig3 the FigG can be more relevant in the Figure 8 while describing about the Crk4 and Cyc4 and CDt1 in binary mode of cell division. Also please define what stars mean either in legend or methods section in terms of significance.

      Line 107 the sentence is incomplete Line 217 may be the figure could be referred as then it is not cleat about the description.

      Referees cross-commenting

      The study is quite rigrous and with analyses of CRK4-CYC4 and CDT. However it will be better if authors please revisit their conclusions on G2 phase of cell cycle in Toxoplasma based on their findings. The study will have important bearing on the community studying apicomplexan parasites and DNA replication as well as who work on eukaryotic cell cycle.

      Significance

      In the manuscript by Hawkins etal have illustrated that in the apicomplexan parasite that have binary mode of cell division present a Cyclin-Crk complex with detailed analysis of Tg Crk4-Cyc4 that are novel in these group pf parasite infect humans and animal alike like malaria parasite and ones affecting cattle and chicken. So these finding are novel as very little is known about this interaction. The significant finding is to show how the G2 phase of cell cycle may be regulated in these parasites and how DNA licencing factor Cdt1 is highly divergent but part of this CRK-Cyclin complex.

      So though it discusses more on the Toxoplasma but it may be of interest to the scientist working on eukaryotes with divergent mode of cell cycle.

      General Assessment - The findings are novel but the manuscript is too dense and at time loses the focus. May be both text and Figures could be made less dense so that important finding are revealed in better way.

      Advance - It does give important insight into the cell cycle in apicomplexan parasite and how even though there are no cell cycle cyclin in Apicomplexa. The findings here suggest how different complexes can substitute for the function. It does extend the knowledge in the field of Cell division in divergent parasites both in terms of mechanistic, functional and technical way.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this Manuscript, Hawkins et. al. describe advances in the apicomplexan parasite cell cycle, which is reminiscent but distinct from mammalian cell cycle regulation. These differences include a presumed lack of G2 phase and the ability to replicate in either a multinuclear (schizogony) or binary (endodyogeny) manner. Using Toxoplasma gondii (TG) as a model, the authors seek to expand the current understanding of how these highly variable parasitic cell cycles are regulated by describing a previously unreported G2 phase. Building on the authors earlier work, this manuscript defines the function of TgCrk4 and identifies a novel binding partner, TgCyc4. Crk4 and Cyc4 control a G2/M checkpoint by regulating centrosome duplication and separation.

      The authors also identify 247040, a protein with previously no known function, as a binding partner and substrate of TgCrk4/TgCyc4 and several replication fork proteins such as MCM and PCNA. Results indicate that the protein negatively regulates replication and centrosome duplication. The authors propose to rename this protein TgCDT1 despite "low sequence similarity" and having a completely opposite function to eukaryotic CDT1. Using Swiss-Prot modeling the authors claim 247040 bears a "partial resemblance" to mammalian CDT1. Indeed, both of these proteins show high intrinsic disorder and have 2 folded domains. While 247040, like hCDT1, does contain cyclin interacting motifs (Cy), a collection degrons (not all shared with other CDT1 orthologs), and an NLS, the list of nuclear cell cycle proteins that also contain Cy and degron motifs would be very long. Further, 247040 is regulated in an opposite manner to all other CDT1 orthologs because it is absent in TG G1 and present in TG S phase; eukaryotic CDT1 is either degraded or relocalized to the cytoplasm in S phase, and evidence for degradation via APC/C is minimal. Crucially, loss of 247040 resulted in inappropriate replication ("re-replication"), whereas all other eukaryotic CDT1 orthologs are essential for replication. Re-replication in eukaryotic cells can be caused by excess or hyper-active CDT1, not by loss of CDT1 activity as shown here for 247040. Clearly 247040 is a negative regulator of DNA replication, and as such, is not a candidate for the TgCDT1 ortholog. If anything, it is functionally analogous to metazoan geminin, the negative regulator of metazoan CDT1; of note, geminin also has centrosome-related phenotypes. We cannot support naming 247040 TgCDT1 because it will cause confusion in the field.

      Aside from this major issue, the study is well-executed, rigorous, quantitative, and thorough; it has many strengths from the unbiased interaction screens. The authors' sequence analysis also suggests broader possibilities for cyclin structures than had previously been appreciated. We appreciate the legend in Figure 2 to the organism-specific terminology.

      Major comments:

      The spatiotemporal dynamics of 247040, its role in repressing TG DNA replication, lack of PIP motif and winged helix domain indicate that some other nomenclature, other than TgCdt1 will be a better name for this protein of previous unknown function.

      Minor comments:

      1. For clarity, please include the number of replicates in the figure legends where appropriate.
      2. For microscopy/imaging, how were representative cells/images chosen?
      3. In addition to the ELM analysis, the authors could also employ fold recognition software (such as Promal) to analyze 247040 structural models to show similarity to known protein structures.
      4. Line 107: missing words "TgCdt1"
      5. Line 141: the interpretation that the C terminus is "unstable" is misleading if it is simply that the protein cannot tolerate a fusion to the C-terminus.
      6. Line 221: word choice "reminisced"
      7. Line 348 refers to Orc4 expression in Figure 4A, but the data point is not labelled.
      8. Lines 407-8 and 510-11: Reference Fig 1E Line 408: please define what is meant by "dominant interactor"

      Significance

      This manuscript makes great strides in defining apicomplexan cell cycle control and genome replication. These strides include defining a previously unrecognized G2/M checkpoint controlled by TgCrk4 and the novel TgCyc4. Further, the authors identify a binding partner and substrate of the novel Crk4/Cyc4 kinase complex, 247040 that acts as a repressor of replication.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Hawkins et al. employ a reverse genetic approach to analyze the molecular function of the Toxoplasma gondii kinase Crk4 and the Toxoplasma gondii cyclin 4. The authors combine inducible depletion with imaging, (phospho-)proteomics, molecular modeling, and protein-protein interaction studies.

      Major comments:

      • The major conclusion of the manuscript is that TgCrk4/TgCyc4 regulate entry into mitosis and that the primary role of TgCrk4 is to suppress DNA re-replication and chromosome re-duplication (lines 105-106). The authors also provide evidence that TgCrk4 interacts with TgCdt1, a DNA licensing factor ("TgCdt1" is missing in line 107). By sequence homology, the authors found homologues of TgCrk4 only in apicomplexan parasites with binary division and concluded that the dominant division mode, presumably schizogony, is repressed in these organisms in favor of binary division. Indeed, internal budding and daughter cell formation is defective in the inducible depletion mutants of TgCrk4 and most experiments focus on this developmental stage. However, the analysis of preceding events, such as DNA replication is rather brief. If G2 is indeed regulated by TgCrk4/TgCyc4, one would assume that the parasites are post-S phase and the nucleus contains two copies of the genome, as indicated in Fig. 2C. The data shown in Fig. 3H and 7A, however, show that the TgCrk4 and TgCTD1 depletion induces a developmental arrest pre-S phase. This contradicts the main conclusions of the manuscript. Indeed, many data of this manuscript could support an alternative conclusion, i.e., that TgCrk4 regulates entry into S-phase (similar to Plasmodium falciparum Crk4: PMID: 28211852). This alternative conclusion is supported by the data showing that TgCyc4 is in the nucleus during S-phase (Fig. 1H) and that TgCrk4 interacts with TgCdt1, which has a well-known role in origin of replication licensing and loading of the MCM complex. MCM subunits were less phosphorylated in absence of TgCrk4, which could also suggest a role for TgCrk4 in S phase. Together, it seems more parsimonious to interpret the data as a DNA replication phenotype rather than a phenotype in G2. The currently provided data on the DNA content are, however, clearly insufficient to draw firm conclusions. The gating strategy (dotted lines in Figs. 3H, 7A) is unclear. Why are populations, e.g., not separated at the lowest part of the depression in the histogram, but shifted towards lower DNA content? This seems to overestimate the percentage of cells that have a higher DNA content and the statement in lines 269-271, i.e., that TgCrk4 deficient parasites break the "once and only once" rule, is not supported by data. It is also unclear how may biological replicates are represented by these data (Figs. 3H, 7A), a critical wild type control at t = 4 h is missing, as well as a statistical analysis. Alternatively, the authors could use microscopy to quantify the DNA content of individual nuclei, which would yield a direct read out on whether a nucleus is in pre-S phase, S-phase or post-S phase. Defining the onset of S-phase indirectly by the number of centrosomes per cell seems imprecise, given the small size of the structure and the resolution of the microscope. Without solving these issues, the major conclusions and several minor statements throughout the manuscript are in question.
      • Lines 187-189: The mentioned checkpoint is unclear and so is the "specific cell cycle population". Fig. 2B analyses budding, but as the final step in the cell cycle, the knock down parasites may have arrested at various other stages of the cell cycle. In addition, it is unclear on which primary data Fig. 2B is based. It appears these may be at least partially shown in Fig. 3. If so, please reorganize as this is highly misleading.
      • Line 246-254: It is unclear how many biological replicates were performed and how many cells were analyzed to conclude that TgCrk4 deficient parasites cannot form a bipolar spindle (Fig. 2H, S3B). This, together with the possibility that the developmental arrest occurs pre-S phase (Fig. 3H), does not support the statement, that the G2/M transition is regulated by the novel TgCrk4-TgCyc4 complex.

      Minor comments:

      • Throughout the manuscript, please reorganize and present the figures in order of appearance in the text. Also, Fig. 1G summarizes data that are only presented in Fig. 1H. Please reorder. Similarly, Fig. 2C appears to summarize data that are only presented later.
      • Why was only the "G1" timepoint quantified in Fig. 1H? Do the other images shown in F and H represent the majority of cells analyzed?
      • Several micrographs lack scale bars (Fig. 1B, D; 2E, F, H, I; 6D; 7F, H and S2G, S3A, B; S5A, B, D).
      • Lines 83-85 and 93-95: Recently several publications investigated the cell cycle of the apicomplexan parasite Plasmodium and data are accumulating, showing that there may be a gap between the last S phase and segmentation (e.g., PMID: 35731838; PMID: 35353560), which may be interpreted as a G2 phase. Thus, these statements could be revised to reflect the current literature.
      • Fig. 4 shows the effect on protein abundance and phosphorylation upon TgCrk4 depletion. Fig. 4B seems somewhat redundant as a more detailed analysis with two timepoints is shown in the rest of the figure.
      • Lines 146-148: This statement is confusing in light of the expression data in Fig.1 F and H. If they stabilize each other, how is TgCrk4 stabilized in G1, when TgCyc4 is absent?
      • Fig. 2D, and G: Please provide representative images of what has been quantified, as E/F and H/I are apparently UxEM images.
      • Line 236-243: This statement seems to be based on a single IFA shown in Fig. 2K. If so, the manuscript would benefit from clearly stating that this is a singular observation.
      • Lines 301-304: In the cited publication, the TgOTUD3A knockout could not be complemented, which raises the possibility that other factors are involved. Thus, this statement would benefit from revision.
      • Lines 421-422: PfCdt1 was annotated in PlasmoDB some time ago and this statement needs to be revised.
      • Lines 448-450 and Fig. 6F: Are these data from a single biological replicate and how many cells were analyzed for the different time points? Given the insufficient data on the DNA content, the paper would benefit form more conservative conclusions on the role of TgCdt1.

      Significance

      • This manuscript investigates the role of TgCrk3, TgCyc4 and TgCdt1s and provides a large amount of data.
      • These data will contribute to our understanding of the unusual division modes of Apicomplexa, a field of research that recently gained momentum.
      • These data will be interesting to the community of cell and molecular biologist, which work on the fundamental biology of eukaryotic microorganisms.
      • My field of expertise is the cell biology of Apicomplexa.
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Maaßen and colleagues followed up on their observation that infection with HCMV mutants lacking pUS2 and 3 impaired HLA-DP expression of IFN-gamma treated MRC-5 cells. This phenomenon is interesting because pUS2 and 3 are viral components that have been shown earlier to degrade HLA-DR alpha and HLA-DM alpha and thus inhibit the formation of HLA-DR alpha/beta heterodimers. These data indicated that in addition to pUS2 and 3 some other MHC II inhibitor must be encoded by HCMV. The authors tested this hypothesis by analyzing a HCMV gene expression library for the presence of a new HLA-DP antagonist. Indeed, the data revealed pUS28 as a new MHC II inhibitor that exhibited a posttranscriptional effect on CIITA, which is the key regulator of MHC II expression. In in vitro stimulation experiments, pUS28 impaired activation of antigen-specific CD4+ T cells.

      The study provides new and important information on how HCMV evades human immunity. The shown data support the main conclusions. Nevertheless, inclusion of some additional controls would facilitate understanding the overall concept.

      Minor points:

      A key element of this study is that IFN-gamma induces MHC II expression on MRC-5 fibroblasts. Nevertheless, professional antigen presenting cells such as dendritic cells express MHC II independent of any stimulation. Therefore, in Fig. 1 in addition to IFN-gamma induced DP expression on MRC-5 fibroblasts, DP and DR expression of dendritic cells should be shown. This could be easily done by analysis of dendritic cells in PBMC. Furthermore, the authors should show DP and DR expression of dendritic cells with and without IFN-gamma stimulation. Most likely, DP and DR expression of untreated dendritic cells is significantly higher than DP expression of IFN-gamma treated MRC-5 cells. It is important to include such controls to avoid the impression that IFN-gamma treated fibroblasts can have similar functions as professional antigen presenting cells.

      In Fig. 6b the proportion of CD137-positive T cells normalized to T cells activated by HeLa cells transfected with CIITA and pulsed with HCMV lysate is shown. In this kind of data presentation, the magnitude of the original effect, i.e., the percentage of CD4+ T cells that after 24 h of stimulation is CD137-positive, remains unclear. Therefore, it is recommended to first show actual data and then relative values. In the figure legend it is stated n = 4-10. Does this mean that T cells from different donors of the corresponding numbers have been tested? Or have T cells from some donors been tested more than once? More precise information should be given here.

      Considering the higher MHC II levels expressed by dendritic cells, it would be interesting to see to which extent pUS28 expression reduces MHC II expression of dendritic cells. Such experiments can be performed by lentiviral pUS28 expression for example in monocyte-derived dendritic cells.

      The conclusion in the last paragraph of the discussion that NKG2C+ memory NK cells might have activated antigen-specific CD4+ T cells is confusing. In the end, professional antigen presenting cells that have taken up viral proteins most probably stimulated antigen-specific CD4+ T cells. And since antigen presentation on MHC II is independent of infection of the antigen presenting cell, it is difficult to understand why under such conditions a red-queen race should have been taken place.

      Significance

      This is a highly relevant study. It adds important new information about pUS28 and how different viral components interact to evade human immunology. My expertise is on HCMV infection of human dendritic cells and the impact on antigen presentation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Maaßen and colleagues investigate how HCMV US28 functions to antagonize the class II Transactivator (CIITA) transcription factor and subsequent HLA class II expression. Through physical interaction with CIITA, US28 triggered a post-transcriptional decline in CIITA protein, leading to reduced cell surface expression of HLA class II molecules, including HLA-DR, HLA-DQ, HLA-DM, CD74, and HLA-DP. Moreover, the authors show that US28-mediated degradation of CIITA hindered the activation of HCMV-specific CD4+ T cells.

      Strengths of this article include the rigorous methodologies and analysis, clear and concise writing, and relevance within a clinical context. Despite the strengths, a few deficiencies were identified. Many figures lack statistical analysis to support the claims made by the authors. Where applicable, the authors should include these or explicitly state that only significant comparisons are shown. There is a lack of information regarding how the expression library screen was performed and how hits were determined/chosen for further analysis. While the observation that US28 antagonizes CIITA is well supported, the mechanism behind the antagonism is somewhat lacking.

      These findings have major implications for understanding the immune response to HCMV, particularly in immunocompromised patients where impaired HLA-II presentation poses clinical risks. The authors suggest that a comprehensive understanding of the molecular mechanisms governing HCMV immune evasion could guide the development of tailored protocols for risk protection, such as vaccination, cellular therapies, or drugs targeting US28-mediated CIITA degradation.

      Significance

      Major Comments:

      • Figure 1: Lacks statistical analysis and the number of replicate experiments that were performed.
      • Figure 2: Lacks information regarding how hits from the expression screen were determined. It would be helpful to understand the selection criteria.
      • Figure 3: While semi-quantitative RT-PCR is a useful method for determining the levels of mRNA, it would be beneficial to conduct RT-qPCR experiments to support the claim that US28 does not affect CIITA mRNA levels.
      • Figure 3: While the blots here support the authors claim that US28 antagonize CIITA at the post transcriptional level, it would be beneficial to make these observations within the context of viral infection.
      • Figure 5B: Labeling of this panel is confusing. The authors should attempt to relabel, or perform the experiment again and run samples on one gel.
      • Figure 5: data for the claim that US28-mediated antagonism of CIITA is independent of neddylation, proteasomal degradation, etc. should be shown if the authors wish to make this claim.

      Minor Comments:

      • Labeling for immunoblots should be clearer regarding transfection conditions. The terms "empty" and "vector" is ambiguous and is not descriptive enough for the conclusions the authors are drawing.
      • Many figure legends lack the information regarding the number of replicate experiments that were performed.
      • Many figures, lack statistical analysis supporting the claims being made. If observations do not reach statistical significance, these should be explicitly stated within the legends. (i.e. comparisons are shown where statically significant).
      • The authors should move away from making conclusions without showing any data substantiating their claims.
      • Minor grammatical and citation errors were identified throughout the manuscript. The authors should carefully read and fix any errors prior to publication.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study entitled "The human cytomegalovirus-encoded pUS28 antagonizes CD4+ T-cell recognition by targeting CIITA", Maassen et al. tried to identify viral genes/proteins downregulating the HLA class II molecule HLA-DP. In a transfection-based screening assay, they identified US28 as a viral gene downregulating CIITA-dependent HLA-DP expression in transfected HeLa cells. Other HLA class II molecules and other proteins expressed in a CIITA-dependent manner were downregulated as well. The authors went on to show that CIITA transcripts were not reduced in the presence of pUS28, but CIITA protein levels were massively reduced, suggesting that US28 downregulates CIITA on the post-transcriptional level. A signaling-deficient US28 mutant, but not a mutant lacking the cytoplasmic carboxy terminus, was capable of reducing CIITA levels. In a final set of T cell activation experiments, the authors showed that US28-expression in HeLa cells reduced HLA-II-dependent restimulation of CD4+ T cells.

      The data presented in the paper are generally very clean and convincing. However, not all conclusions are sufficiently supported by the data. A major weakness of the study is the fact that most experiments were done with transfected HeLa cells. Whether the proposed US28-mediated HLA-II downregulation occurs in HCMV-infected cells remains unclear. Moreover, the proposed pUS28-CIITA interaction was demonstrated in a single co-IP experiment, and the mechanism of CIITA downregulation remains obscure. Hence, the conclusion that they have identified "a mechanism employed by HCMV to evade HLA-II-mediated recognition by CD4+ T cells" (abstract) is not justified.

      Major comments:

      1. The mechanism of US28-dependent CIITA downregulation remains unresolved. The authors have made several attempts to clarify the mechanism, but these experiments have not met with success. Therefore, conclusions on the underlying mechanism should be toned down or removed.
      2. The claim that US28-dependent CIITA downregulation occurs by pUS28 interacting with CIITA is based on a single co-IP experiment. The result would be more convincing if the authors could show the same interaction in a reverse IP, and ideally also in HCMV-infected cells. Is the US28 C-terminus required for this interaction?
      3. The authors could not demonstrate US28-dependent HLA-II downregulation in HCMV-infected cells. Hence, they cannot conclude that HCMV employs this mechanism. Transfected HeLa cells are a somewhat artificial system. This does not invalidate the data, but one has to be careful when interpreting the data. Others have shown that HCMV downregulates CIITA transcript levels in myeloid cells (PMID 21458073 and 31915281). This apparent discrepancy could either be explained by several redundant mechanisms (as proposed by the authors of this manuscript) or by differences and limitations of the respective experimental systems.
      4. The possibility that US28 might downregulate CIITA in latently infected cells is intriguing. Have the authors tested this in an HCMV latency system, e.g. in infected THP-1 or Kasumi-3 cells? I acknowledge that such experiments are not trivial and may be beyond the scope of the present study. However, as latently infected cells express US28 but not the other viral genes previously shown to affect HLA-II expression (US2, US3, IE1 and 2), the latency model might a way to demonstrate biological significance in virus-infected cells.

      Minor comments:

      1. Figure 2. Why was US29 used as a control and not US27 as in other experiments. The authors pointed out themselves that US27 is probably an ideal control for US28 as both genes encode related GPCRs.
      2. Figure 2. The use of "empty" for mock-transfected cells is confusing, particularly as empty vector-transfected cells are labeled "vector".

      Significance

      Recognition of virus-infected cells by CD4+ T cells is an important immune defense mechanism. Viruses like HCMV have evolved numerous immune evasion mechanisms. Previous studies have identified HCMV proteins targeting HLA-II for degradation (US2, US3) or downregulating CIITA transcription (probably IE1+IE2). The findings of the present manuscript now demonstrate that US28 is capable of contributing to HLA-II downregulation. This is potentially of great significance as US28 is expressed in latently infected cells. However, the significance of the present study is limited by the fact that the studies were done in transfected HeLa cells, not in HCMV-infected cells, and that the mechanism of CIITA post-transcriptional downregulation remains unknown.

      In its present form, the study should be of interest for virologists and immunologists interested in new viral immune evasion strategies. The significance and appeal to a wider audience would be massively increased if the authors could clarify the mechanism or show its importance in virus-infected cells (or both).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Major comments:

      1. I don't understand the meaning of the sentence beginning on line 61. There are published structures of taste receptors and many papers have looked at activation mechanisms.

      2. The results are extremely difficult to follow. There is far too much information in here about methods and without sub-headings is impossible to follow. I'd suggest moving a lot of the methods information to the Materials and Methods and then adding more sub-headings.

      3. In Figure 1, the conclusion seems to be that the authors identified GPRC6A and taste receptors as most closely related. However, they stated in the Introduction that these were known to be most closely related, so this just confirms what is known. The authors should acknowledge this and shorten this section significantly.

      4. On line 349 the authors state that 'any substitution on the receptor disrupts the function of the receptor'. This is not true. There are several known benign mutations that have no effect on CaSR function.

      5. I found the section based around Table 2 very difficult to follow. Initially I presumed these were variants that have not been functionally characterized that the authors would predict, then test in vitro. However, this is not the case as several have been functionally assessed (e.g. I857X, T186N, T699N, R701G, T808P). The authors should add another column to state which have been functionally assessed and what this showed. This is important as their predictions are clearly wrong for some residues (e.g. T699N has been functionally assessed and shown to be LOF). This makes it difficult to understand what the point of the tool is. The authors should expand out their analysis to look at many more residues that are known to cause disease to really assess how useful the tool is (e.g. those reported in multiple families or those that have been functionally assessed). They should also test on residues with both GOF/LOF mutations.

      6. It was unclear why the authors focussed on one cryo-EM model. There are multiple models that have been published that implicate different residues in receptor activation. The authors should look at these models too.

      7. The authors state that mutations in the TM domain result in GOF. There are many examples of known inactivating mutations in the TMD and several switch residues (with LOF/GOF). This statement needs revising.

      8. The discussion largely re-states the results and doesn't place the research within the context of the current literature. This needs extensive re-writes.

      Minor comments:

      1. Abstract - The first sentence doesn't seem to fit with the rest of the abstract. I suggest removing.

      2. The authors should define 'clade' on its first usage as it is not a common word.

      3. The authors italicise some sentences for unknown reasons. This needs removing.

      4. Figure 2 and Figure 7 need revising as they are too small and/or illegible.

      Significance

      Mutations in the CaSR cause diseases of calcium homeostasis. Specifically inactivating mutations cause disorders of hypercalcemia, while activating mutations result in hypocalcemia. Bircan et al explore the evolutionary conservation of CaSR and try to use these findings to predict whether residues would be associated with hyper/hypocalcemia. This could be useful to researchers focussed on CaSR, particularly clinical geneticists or practising clinicians that may identify genetic variants in the receptor and require tools to predict pathogenicity. However, the manuscript does not fulfil these aims in its current form.

      It is very difficult to follow what the authors have done and what the purpose of the research is. The results section needs more sub-headings as at the moment it is too long, has too many methodological details and is very difficult to follow. The discussion needs completely re-writing as it doesn't really discuss the findings in the context of the current literature. I have tried to outline the areas that need improving the most.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Bircan et al. employ phylogeny-based methods and machine learning to determine positions in the Calcium Sensing Receptor (CaSR) that are specific for this receptor compared to those residues that are important for CaSR and related subfamilies. Using machine learning, the authors predict whether selected mutations in CaSR will lead to loss- or gain-of-function and compare this with experimental results from literature.

      Minor comments:

      • line 13/14: 'there are still gaps in our understanding of its specific residues' - possibly change to 'there are still gaps in our understanding of the specific function of its residues'?

      • line 17/18: 'The analysis revealed exceptional conservation of the CaSR subfamily, with high SDP scores being critical in receptor activation and pathogenicity' - are the SDP scores critical or some aspect of the receptor, i.e. the residues with high SDP scores?

      • lines 42-44: 'L-amino acid binding site at the interdomain cleft of LB1-LB2 and multiple Ca2+ amino acid binding sites on the VFT domain' - Should this be 'Ca2+ binding sites' instead?

      • lines 45-47 'While Ca2+ is the composite agonist for the CaSR, L-amino acids promote receptor activation along with Ca2+, but they are not able to activate the receptor alone'. - Unclear

      • lines 139, 146 'a ML tree' - should be 'an ML tree'

      • line 153 'γ-aminobutyric acid-B receptorsreceptors' - remove 'receptors'

      • line 162-164 'Comparison analysis of branch lengths (Patil, 2021) among common species between CaSR, GPRC6A and taste receptors shows that the CaSR subfamily is significantly more conserved than its closest subfamilies' - could you please give a very short explanation here for the non-specialists?

      • Fig 2A is unfortunately mostly unreadable. I would suggest replacing panel A with (an) alternative panel(s) clearly showing the stated results and moving the tree into the supplementary and/or making it available in a format that can be studied more closely.

      • Fig. 4A, right side. Both the x-axis and the bar colour are labelled 'SDP scores', but they don't agree with each other. Please clarify what is what.

      • Fig. 5 the numbers associated with the colour scales are unfortunately not readable

      • lines 394/5: 'Because CaSR is a highly conserved subfamily, any substitution on the receptor disrupts the function of the receptor and causes either GoF or LoF mutations.' - do you mean that no mutation in CaSR may be neutral?

      • Fig 7 is mentioned earlier than Fig 6.

      • line 516/7 and 532/3: ' we repeated the train-validation-test splitting procedure fifty times' - repetitive

      • Fig 6: what are the features in the bottom panel of 6B?

      Significance

      General assessment: The study uses computational methods to assess the importance of residues in the CaSR for function. The results are compared with the literature, as far as data are available. The study could be made more accessible to non-experts by putting results in context, more explanations in the figure legends and by making sure that the results mentioned in the text can easily be followed by looking at the figures. Another option could be to change subtitles in the results section to summarise the main findings of the section.

      Advance: This study uses phylogeny-based methods to advance our understanding of the role of residues in a GPCR and adds to our pool of techniques available for addressing such questions.

      Audience: The described research should be of interest for researchers working on CaSR, those interested in the evolution of GPCRs, and those studying the impact of point mutations in GPCRs on function and/or human health. I do not have sufficient expertise to evaluate the phylogeny-based methods used in this manuscript. At present the manuscript seems more likely to be of interest to a specialised audience, which could very likely be changed by making the manuscript more accessible to GPCR researchers that don't have a background in phylogeny-based methods.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This study aims at identifying key positions in the CaSR responsible for its specific properties. For that aim, they aligned all class C GPCR sequences and classified them according to their sequence identity and degree of conservation among orthologs. The final aim was to be able to predict the functional consequences of point mutations, as based on the degree of conservation of each residue within the orthologs, and within homologs. They include some considerations to predict whether the mutations of some of the conserved residues may lead to gain or loss of function. I definitively believe much can be learned from the sequence evolution of a protein, as highly conserved residues mean something. As such, a study like this one is of interest. However, I am far from convinced on their final approach to predict LoF and GoF mutations. Indeed, as far as I understood, they did consider the residue position, and its conservation either in the orthologs only, or also in some homologous sequences. However, they did not consider the type of mutation. In my opinion, a mutation at a given position may well be either LoF or GoF depending on the new residues. One can easily understand that a mutation into Gly or Trp may have very different effect. They also considered that mutations in the 7TM core domain are more prone to generate GoF. This is a statistical view of what could be going on, but by no way this can be included as a criteria to decide on the consequence of the mutation, as both LoF and GoF mutations can be found in this domain. Lastly, there exist a very long list of mutation of the CaSR with known functional consequences. These must be used to validate the authors' approach. In my opinion, validating theyr approach would mean making a long list of what their analysis can prediction with a large number of positions of the CaSR, not considering our actual knowledge along these lines, and then compare they prediction with what is already known. Eventually, for a few predictions for which there is no data supporting either their LoF or GoF effect, these should simply be tested to give the readers an expectation on the viability of their approach.

      Significance

      I must clearly state that I not a specialist of the bioinformatic approaches used in this study, and as such cannot judge all these aspects of the work presented in this story. However, I am also far from convinced with the analysis and the conclusions, that, in my opinion, are not in line with my views on this topic. One key aspect is that the authors only considered mutations at specific positions in the CaSR, with the aim to predict their loss of gain of function effect. However, in my opinion, such a functional consequence not only depends on the residue mutated, but also into which residue it is converted. I cannot see that a mutation into Ala, Gly or TRP could have the same effect. As such, I cannot recommend acceptance of this paper.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to the three reviewers:

      We thank the reviewers for the time they spent reading and evaluating our work, and for their comments and constructive criticisms.

      The three reviewers contested the novelty and significance of our findings. Their main arguments were that the role of plakin/desmosomes in the regulation of epithelial polarity is already known and that our work does not provide any novel mechanistic link between them and the process of cell polarization.

      To the first point we would like to argue that although a general relationship between plakins and cytoskeletal filaments networks, and notably cytokeratin, has been involved in the regulation of both intercellular junction strength and cell migration, their involvement in the asymmetric positioning of organelles in polarized cells has not yet been proposed nor demonstrated. However, in the light of reviewers’ comments, we admit that our wording has been misleading and that we have used the term “epithelial polarity” when it would have been more rigorous to use the term “asymmetric centrosome position in polarized epithelial cells” to describe our observations. We have modified our text to make this clearer and to streamline our descriptions and conclusions on the regulation of centrosome position and the associated asymmetry of the microtubule network. Considering this, we would like to stress out that our discovery about the specific involvement of three plakins (epiplakin, periplakin and desmoplakin) in the regulation of centrosome position in epithelial cells is novel (see our more detailed argumentation below) and fully demonstrated with our data. We insist that these discoveries are significant since we identified these plakins thanks to the changes of their expression levels in two set of cell lines representing progressive stages of mammary breast cancer. Finally, it is important to stress out that our experimental approach is also original since we used a cellular metric, the centrosome position, to interpret and sort transcriptomic data sets. This strategy of mixing cell biology and bioinformatics has proved fruitful and is thus likely to also become influential.

              To support our argumentation that the identification of the role of plakins in the regulation of epithelial cell polarity is novel, we searched for the words “polarity” and “(epi/peri/desmo)plakin” in PubMed.
      
      • “Polarity and epiplakin” returned 1 review (PMID 24352042)
      • PMID 24352042: It is a review that we cited, in which it is argued that plakins contribute to cell polarity as they bind to all cytoskeleton filaments and connect them with intercellular junctions. The section dedicated to polarity referred to two specific studies: one about BPAG1e, a member of the plectin family of plakin which is involved in the front-rear polarity of migrating keratinocytes, and one about the spetraplakin MACF1, which crosslinks actin and microtubules and is involved in the polarization of epidermal stem cells. The review also refered to the role of plectin in the regulation of centrosome position by attaching it to intermediate filaments.
      • “Polarity and periplakin” returned 1 review, the same as above, and 2 experimental papers (PMID 23777851 and 18823282)
      • PMID 23777851: It is a study on the protein expression profiles of skin cells derived from patients with Atopic dermatis. Authors found that TH17 cytokines, a inflamatory pathway involved in the differentiation and polarization of naive lymphocytes, was activated and the expression of TH17-related molecules was negatively correlated with periplakin.
      • PMID 18823282: It is a characterization of the ubinuclein, which is known to be essentially nuclear but was could be localized to lateral cell borders in differentiated keratinocytes characterized by the expression of involucrin and periplakin.
      • “Polarity and desmoplakin” returned 87 references, most of them related to the role of desmosomes in the establishment of the apical pole of epithelial cells, but only two of those references are specifically related to the centrosome. They showed that CSSP1 and ninein, two centrosomal proteins, can bind to desmosomes via desmoplakin (PMID 26241740, 17227889). But they are not related to centrosome positioning. The two papers are now cited in our discussion anyway. Based on this search, it seems to us that the establishment of the causal role of these three plakins in the role of centrosome position in polarized epithelial cells is clearly novel.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary The manuscript by Geay and colleagues examine potential regulators of centrosome positioning in an immortalised breast cell line in vitro on micropatterns that promote cell doublet formation. The authors mine expression data from breast cancer cell lines in vitro to identify microtubule-related transcripts that are potentially downregulated in cells with a mesenchymal phenotype. The authors identify some Plakin proteins, which upon depletion, are reported to change centrosome positioning relative to junctions. The authors propose that plakins are involved in the maintenance of epithelial polarity.

      Major comments I applaud the authors for attempting to identify novel regulation of epithelial polarity. However, I am sorry to say that this manuscript is overtly preliminary. It is a collection of observations without any mechanistic insight (described below). Despite what I write below, I apologise in that these shortcomings as so extensive that I cannot recommend experiments that would 'fix holes', without essentially writing an entirely new project. Even after addressing the points below, I think it unlikely that the observations would make a coherent, mechanistic contribution to the field of epithelial polarity. I do not like to give reviews like this, but unfortunately, the submission of such preliminary works puts us in this position.*

      Authors: It is correct that we did not investigate the underlying mechanism, and thus our work is preliminary from this point of view, but we provided the first set of evidence that the three plakins (epiplakins, periplakins and desmoplakins) are involved in the regulation of centrosome position and the associated asymmetry of the microtubule network in polarized epithelial cells. This identification was far from obvious, and relied on an unusual way to exploit transcriptomic. Data, which we think is quite valuable. We correlated the level of transcripts to a quantitative measurement of cell organisation (the distribution of nucleus-centrosome vectors). This strategy is novel and proved useful since we identified novel regulators of centrosome positioning.

        • 'Epithelial polarity' Throughout manuscript the authors refer to a 'polarity score' and the term 'epithelial polarity' when what they have actually measured is a specific angle of orientation of centrosomes in cell doublets in vitro. This is an overstatement and adds confusion. The term 'epithelial polarity' has overtones of a polarised epithelium, which such doublets do not model. There is no mechanistic investigation into how this polarity score relates to the ability to form a polarised epithelial monolayer, with apical-basal polarity orientations, either a monolayer on a substrate or a monolayer surrounding a single central lumen, such as these MCF10A cells are often used for in 3-dimensional culture. I suggest that the authors simply mention what they actually measure (and in their own words): "coordination of the centrosome along the nucleus-junction axis." *

      Authors: This is correct and we apologize for the confusion. We have now corrected the text and refer specifically to the “position of the centrosome in polarized epithelial cells” instead of “epithelial polarity”. However, it should be noted that we and others already showed that this position is relevant to the establishment of polarity in vitro in 3D culture, and in vivo in developing mouse embryo (Rodriguez-Fraticeeli et al., J Cell Biol, 2012) (Burute et al., Dev Cell, 2017). We have now added a paragraph at the end of the introduction to clarify this point and justify our experimental approach.

      • In Figure 1A-C, cell doublets are reported and apparently quantified to measure a 'polarity score', which is the angle of orientation of centrosomes in cell doublets. Yet, there is no clear information that explains how the cutoff for what defines this polarity score is generated (e.g. why is the cutoff point chosen to be where it is?), or what it means for epithelial polarity (e.g. why is this cutoff point important to be at that site?). Moreover, there is no indication that these cells actually form connected doublets. Labelling and quantitation of potentially connected cells is absent. Do these actually form junctions to the same extent, such that any differences have been exhaustively excluded to be only from the centrosome orientation, rather than cell spreading and cell-cell contact differences (that would alter geometry)? In addition, statistical analysis for part C is missing. *

      Authors: First, it is true that the geometrical sectioning of cells in order to define a region where centrosomes are considered as polarized toward the junction is arbitrary. But isn’t it the case for most thresholds in image analysis? This is how it has been done in all studies of the polarity of migrating cells during would healing for example. What we think is key here, is that the chosen angular sector for polarized centrosomes, is the same for all conditions, so it allowed us to compare the frequency of polarized centrosome based on this criterium.

      Second, it is also true that for the sake of conciseness we did not show too many data about the characterization of the doublets in order to focus on the criteria that we used for our study. But we analyzed the shape of the doublets. In this example below, we measured how “pinched” were the doublet as compared to a a fully convex envelope. Small intercellular junctions lead to high difference between the area of the convex hull and the area of the doublets. However, we did not find that doublets of comparable cell lines with distinct polarity index, such as HCC1937 and HCC1143, had distinct junction length:

      Finally, there is no statistical analysis for in the histogram shown in Figure 1C since we did not compare the polarity index of the different cell lines. We related them to their transcriptomic profiles (Figure 1D).

      3.

      *Fig 1D, 2A,B present select example genes correlated with either polarity score or EMT score (Fig 1D, 2B). It is unclear what insight providing select genes from many that are changed provides. In Fig 2A, an apparent EMT score (seemingly derived from mining of existing expression data not from this laboratory) is provided, ranked by an EMT. No description is provided for what these alterations are (e.g. what is a 'HME_Ras_Twist1E12_TGFb' sample?). Further, what this is supposed to indicate as a mechanistic insight is unclear. *

      Authors: These panels illustrate examples of protein for which the level of transcripts was well correlated (negatively or positively) to the polarity or EMT scores of the various cell lines we tested. We did not describe again these cell lines and referred to the study where they have been described in details since the conditions leading to their phenotypes were less relevant than the consequence on gene expression and EMT progression, which were described in our text and data. There was no specific value in the chosen proteins in Figure 1D and 2B, they simply illustrate how various transcript levels can be compared, and potentially correlated, to geometrical measurement (in the case of the polarity score, 1D) or to a identity measurement (in the case of the EMT score, 2B). There is no mechanistic insight at this stage. Figure 1 and 2 only illustrate the novel method we proposed to extract information about cell architecture or cell identity from transcriptional data sets. The most valuable information will come in Figure 3 in which we will cross these two pieces of information.

      • Figure 3 is highly preliminary. The entirety of Figure 3 is a correlation plot between EMT score and polarity score for microtubule-related transcripts. *

      Authors: We respectfully disagree. Some misunderstanding might explain reviewers’ comment. This is not a “correlation plot between EMT score and polarity score for microtubule-related transcripts”. The values are not the scores but the correlation between the transcript level and the scores (which were described in Figure 1D and 2B that we discussed in the two previous comments of the reviewer). This is much more informative and definitely not preliminary, since it revealed potential structural (polarity score) and functional (EMT score) implications of proteins that were not known before. Proteins on the top-left or bottom right of the graph are all candidate to influence EMT by acting on the structural polarisation of cells.

      *The authors state: "The graph showed an overall negative trend, which means that many genes were positively correlated with an EMT in HME were instead negatively correlated with epithelial polarity of TNBCs (Figure 3). This was expected and confirmed that the progression along EMT is associated with a loss of epithelial polarity." No statistical analysis is presented, no correlation scores and indication of robustness is provided. It is unclear how this provides any mechanistic insight. The authors themselves state that this association is expected. *

      Authors: We apologize for this. We now reported the statistical analysis that confirmed our previous description of a negative trend in this graph. Pearson correlation coefficient is -0.35 (p-value = 0.00023, 95% confidence interval: [-0.50, -0.17]. We have added these details in the main text and Material and Methods. It is correct that this tendency could be expected by considering various studies together, but it is still better when rigorously demonstrated.

      *Moreover, the authors state "Interestingly, three plakins, namely epiplakin (EPPK1), desmoplakin (DSP) and periplakin (PPL) all appeared as clear outliers (Figure 3)." How is an outlier defined & why is this clear? Is the association of these key cell-adhesion molecules with an epithelial cell state novel or known? *

      Authors: We define classically outliers as genes with a score higher than the 75th Percentile + 1.5 times the InterQuartile Range (IQR). 7 genes were outliers, including the three plakins. We have now detailed the procedure in a dedicated section in Material and Methods.

      • Figure 4. The authors perform siRNA-mediated depletion of Desmoplakin, Epiplakin and Periplakin in MCF10A cells. The authors report, "Interestingly, knocked-down cells in culture displayed abnormal shapes, being more elongated and less cohesive (Figure 4C)." No quantitation of such changes are provided. Moreover, cells with KD appeared to be at lower density. Can the authors exclude that these are not merely density-dependent effects.*

      Authors: This is really just a description of the images. The densities didn't seem that different to us, but it is true that we can't rule out a density effect. We didn't do a detailed quantitative description of these phenotypes because they were not central to the argument about centrosome position. However, we thought these images of knock-down cells were worth showing.

      • Throughout the work, the polarity index is reported from plakin depletion conditions with data from a reported 3 independent experiments seemingly pooled (no indication of graph of which independent experiment each data point comes from). Is the statistical analysis performed (missing in Fig 4E, present in Fig 5A-C, S2, S3) from pooled data? If so, this is in appropriate and should be from the averages of independent experiments, to understand batch effects. If not from pooled data, please alter graphs to display this appropriately. *

      Authors: We showed only one data set per conditions to avoid graph over-crowding. We know show these 3 different experiments with distinct colors in the graphs (SuperPlots). Noteworthy, the exact same experiments were performed with another set of siRNA for each of the three plakins and they show exactly the same effect (see Figure S2).

      • Figure 5A. It is unclear how F-actin is measure in the images. Is F-actin labelling a truly representative proxy for junction length? *

      Authors: This is correct, we assumed that the frontier between the two cells, which could be seen with F-actin, corresponded to the intercellular junction.

      • Fig 5C. Why are images of vimentin now provided not on micropatterns? The labelling of vimentin in siPeriplakin cells does not look appropriately controlled for by the other cell conditions. siPeriplakin is clearly at the edge of a colon, whereas this is not clear whether an appropriate region is labelled in the other conditions. *

      Authors: We performed experiments on micropattern when the aim was to characterize the localization of a protein or a compartment, since micropattern normalize cell shape and orient cell architecture. Here our aim was to visualize the global amount of a protein, so micropatterns were not needed. Actually, western blots would have been better suited for these experiments. But these were the data we had. Pictures that were shown have all been taken at the edge of a colony. The boundary is visible on all images.

      Reviewer #1 (Significance (Required)):

      *In the literature, there is a rich understanding of the molecular mechanisms of the cross-talk between cell-cell junctions, cell polarity complexes, and the organisation of the cytoskeleton. The authors are applauded for efforts to investigate whether there are common transcriptional downregulations of microtubule-related proteins that could potentially be key regulators of cellular polarisation. It is unfortunate that the work, as presented, is a series of modest observations, often the insight of which is overstated. Despite the analysis of plakins as a potential regulators of centrosome positioning between cell doublets, there is no mechanistic insight into a) how these plakins contribute to centrosome alignment asymmetry, or b) whether this is any way has an effect on true epithelial polarisation (beyond potential doublets on a micropattern). If significant development of mechanistic insight was added (requiring extensive additional experimentation, expected: 1-2 years of work), then the manuscript might be of interest to the cell biology community. *

      Authors: Our observations can be considered as modest but they are solid and novel and, to our point of view, significant. We acknowledge that the use of the term “epithelial polarity” instead of “asymmetric centrosome positioning” is an overstatement the impact of our observations and corrected it (including in the title). However, we think, based on previous works that are now better described in the revised version of our introduction, that the position of the centrosome in micropatterned cell doublets is a meaningful readout of the polarization of the organization of epithelial cells.

      It is true that we don’t provide the molecular and physical mechanism by which plakins affect centrosome position. And this would indeed deserve another complete study. However, since no previous work reported that plakins were involved in centrosome position, we think our work will be a valuable contribution to the field of cell polarity and to the recently rapidly growing field of plakins.

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary The new paper by Geay and colleagues studies epithelial cell polarity of triple negative breast cancer (TNBC) cell lines using a special H-shaped microculture device and confocal microscopy of centrosomes. Using a quantification method that calculates a polarity score, the polarity phenotype of each breast cancer cell line is associated to the corresponding transcriptome analyzed using an Affymetrix microarray platform. In this manner, expression of specific genes is correlated to the polarity score.

      In its second part, the study shifts its interest to the transdifferentiation process of EMT (epithelial-mesenchymal transition) and uses a published transcriptomic dataset based on human mammary epithelial cells that overexpress a series of oncogenic (Ras, TGF-beta) and EMT (ZEB1, ZEB2, TWIST1, E12) factors, and calculates an EMT score that is correlated to the expression of different genes identified in the published dataset. The interest in EMT is logical as EMT often correlates with the loss of epithelial cell polarity.

      Based on the two gene lists and their respective phenotypic correlation, known regulatory components of microtubule dynamics that can potentially regulate centrosomal position and thus epithelial cell polarity, are selected. Among these are genes encoding for components of desmosomes, the plakin family, that link membrane-based intercellular adhesions intracellularly to intermediate filaments, mainly cytokeratins, and indirectly with microtubules. Using traditional siRNA-based technology and an immortalized, non-malignant breast epithelial MCF10A cell line, silencing of specific plakin family mRNAs is shown to lead to polarity defects that correlate with concomitant high expression of the intermediate filament vimentin, the latter often used as a molecular marker of the EMT process. In other words, silencing of plakins leads to loss of breast epithelial cell polarity and gain of vimentin, a sign of enhanced EMT. This is the central observation of this study that is also captured in the title and it is not carried any further towards a mechanistic or deeper analysis. *

      Authors: We thank the reviewer for this fair and accurate description of our work

      *Major comments: I have two general or conceptual comments and one major technical comment: 1) Unfortunately, the study does not provide an advance in terms of understanding the action of plakins as regulators of cell polarity or EMT. Both cellular processes are well characterized, and in the case of EMT, specific guidelines have been published that dictate the large number of complementary assays required for a proper assessment of EMT (see Yang, J., et al. Nat Rev Mol Cell Biol. 2020 Jun;21(6):341-352). *

      Authors: This is correct. We did not elucidate the underlying mechanism but we identified the involvement of plakins in the regulation of centrosome position in polarized epithelial cells. Note that our aim was not to reveal new EMT regulator, but to reveal new regulator of centrosome positioning. We only took advantage of EMT as a natural mechanism that involves the destabilisation of epithelial polarity and the reversal of centrosome position. The focus and the experimental strategy of our study are now better explained in the revised version of our introduction.

      *2) The plakins studied make the intracellular adaptor interface that links desmosomes to intermediate filaments, primarily cytokeratins. The paper does not even mention at all these two important epithelial protein networks, which I believe should have been studied in both TNBC and HME-EMT cell models. Furthermore, the paper tries to emphasize the regulation of microtubular networks because of their established importance in organizing proper centrosomal positioning. Yet, the presentation of results and the discussion appears rather confusing and unclear as to whether the data present any real effects of plakin expression manipulation on microtubules. It appears that such effects were not scored, which leaves the central aim of the project incomplete and raises issues that demand further and deeper analysis of the regulation of centrosome positioning by plakins. Can the centrosomal effects be completely indirect or bypasser effects due to the overall architectural change that epithelial cells undergo when their desmosomes lose their rigid coupling to cytokeratins? *

      Authors: It is true that we did not study at all the mechanism by which plakins affect centrosome position. And cytokeratins would definitely by on the top list for such a study. We we focused our transcriptomic analysis to microtubule-binding protein, since they are likely involved in the regulation of centrosome positioning. But we did not investigate in details the role of microtubules in the mispositioning of centrosome we found in plakin mutants. As stated by the reviewer, centrosome mispositioning might not result from a direct role of plakins on microtubules. The mechanism could definitely involve desmosomes, cytokeratins or many other cytoskeleton components. The exploration of all these possibilities should be the focus of future studies. Our work simply provides multiple and solid evidences for the implication of plakins in the regulation of centrosome positioning and open the way for interesting follow-up studies.

      *3) The classic siRNA-based method is used to silence plakin family mRNAs. This well-established technology today demands the use of multiple independent siRNAs per mRNA and also rescue experiments in order to confirm the absence of so-called off-target effects. *

      Authors: This is correct. We used two distinct siRNAs per targeted proteins. The effect on centrosome mispositioning were quite similar with both sequences (see Figure S2). We did not have time to confirm the absence of off-target effects by rescue experiment. This is missing indeed and unfortunately, we don’t have the human resources to perform those experiments. But we would like to stress out that a direct correlation between the level of expression of the tree plakins we tested and centrosome mispositioning was established in the first part of the study that was based on the natural variation of their expressions in 12 cells lines.

      Specific comments: I also enlist here some specific comments in the order of the figure presentation:

      *3) Fig. 1A lacks the images of Hs-578T and MCF10A cells. *

      Authors: This is correct. But MCF10A were already shown in our previous publication (Burute et al., Dev Cell, 2017). We did not want to insist on something already shown. Then we decided to show only 10 images as it would be odd to organise a panel with 11 images. Individual images are not so informative in this case, they are more illustrative and they are not so different from each other.

      *4) The data of Fig. 1C demonstrate score of 15-30% for the TNBC and "normal" epithelial cells. These data must be discussed in the context of the established literature on cell polarity. Is a 30% score anticipated for a polarized cell type? Is the difference between 30 and 20% significant in terms of the polarity of cells within a tissue? What would such scores be if one studied highly polarized cell monolayers on transwell filters? Is the H-shaped microsystem reliable? *

      Authors: This is a good remark and a fair concern. We can’t compare directly this “polarity score”, which is a metric about the position of centrosome, to the complete polarization of structures and signalling pathways in actual epithelial tissues in vivo. But we already studied the polarity of MDCK and MCF10A doublets (Burute et al, Dev Cell, 2017), which showed similar level of asymmetry in their centrosome position.

      It is also fair to doubt of the reliability of H-shaped micropatterned. In our revised introduction (see last paragraph), we have now listed all the features that made us believe that the polarized organisation of intercellular junctions and associated components in micropatterned cell doublets is relevant to the establishment of polarity in polarized epithelial tissues in vivo. The list of polarized components was based on two independent works (Rodriguez-Fraticeeli et al., J Cell Biol, 2012) (Burute et al., Dev Cell, 2017). In addition, it should be noted that we also previously reported the inversion of centrosome position in epithelial cell doublets during TGF-beta-induced EMT in MCF10A, and that we also observed this repositionging in vitro in 3D mammary gland cultured cells and in vivo in vivo, in mouse mammary gland epithelia and in developing mouse embryo at gastrulation (Burute et al., Dev Cell, 2017).

      *5) The gene expression data of the TNBCs or publicly available data for the same cell lines from TCGA should be used to generate a heat map that illustrates the positioning of the examined cells in the spectrum of luminal epithelial to claudin-low, mesenchymal breast cancer cells. *

      Authors: Such an analysis would be interesting indeed. Actually, a lot of information about the role of plakin in the maintenance of epithelial polarity could be extracted from the comparison of transcriptomic profiles of these various stages of EMT. But this is a bit beyond the scope of our study which was more focused on the consequences of these changes on centrosome position.

      *6) The 13 HME cell models used in Fig. 2A should be described in detail despite their earlier publication 11 years ago. This is important because the derived EMT scores are slightly counterintuitive: the parental HME cells are plotted as having a higher EMT score than the transformed HMEs expressing Ras or Twist1. How can this be explained? P53 is well established as an epithelial differentiation factor that counteracts EMT. Why does shp53 and especially combined with Ras overexpression not lead to EMT? I note that this cell model is listed as having epi and mes varieties. What are these and why are these important phenotypes not presented in the results? TGF-beta is presented in the results as a transcription factor, yet it is a secreted growth factor. What does TGF-beta mean? HME cells overexpress the cDNA for TGF-beta (which one? There are 3 TGF-beta genes)or were the cell cultured in the presence of this cytokine? *

      Authors: These are interesting comments. Actually, one of the important observations of this earlier study was that mice over-expressing Ras alone or Twist alone in mammary tissues, either during embryonic development or later during mammary development induced by lactation, did not form invasive tumours. The expression of Ras induced low grade splenic lymphomas as well as anal and oral papillomas but they never progressed to the malignant stage. However, the combination of Ras and Twist expression had dramatic effects on the reduction of mice survival due to the formation of multifocal breast carcinomas with metaplastic features. So the absence of increase of the EMT score upon the overexpression of Ras or Twist alone is not so counterintuitive. But we can’t really explain how cells became “more epithelial” though. We think that it would be long and not so conclusive to enter into those details in the main text.

              Cells silenced for p53, to resist from oncogene-induced senescence and apoptosis, and over-expressing Ras could express or not EpCAM and thus were sorted in EpCAM positive (epi) or EpCAM negative (mes). In some conditions, TGF-beta was added to cells to induce EMT. The combinations of these various treatments induced more or less aggressive transformations that are described in this earlier study but we think it would take too long to describe them here.
      

      In the end, what mattered for our study, was that this set of cell lines allowed us to explore a broad range of EMT scores, which we could correlate to variations of transcriptomic profiles.

      *7) Minor semantic comment: does Fig.2B show collagen V or collagen XV? Related to this, the article has abundant typographical errors. *

      Authors: It was collagen V. We checked for other typos and hope to have corrected them.

      *8) Fig. 4: based on the major comment, this experiment requires analysis of rescue clones. *

      Authors: We fully agree, these experiments are missing indeed. Unfortunately, we don’t have the human resources to perform those experiments. However, it should be noted that the specificity of the target is somehow supported by the observation of the exact same phenotypes upon the use of another siRNA sequence for each plakins (Figure S2). In addition, we would like to stress out that a direct correlation between the level of expression of the tree plakins we tested and centrosome mispositioning was established in the first part of the study that was based on the natural variation of their expressions in 12 cells lines (Figure 1).

      *9) Fig. 5C: the vimentin microscopy needs to be complemented with full EMT analysis using both microscopic and protein expression assays (see major comment). More importantly, desmosomal and cytokeratin organization analysis is missing. *

      • *

      Authors: We agree that an immunostaining of vimentin is way too preliminary to conclude about an actual induction of EMT. Hence our tempered conclusion about the “suggestion that cells might be engaged in a form of EMT”. As also mentionned by reviewer #1 a full characgterization of the EMT state of these cells would require a long list of measurements, including the quantification of EMT-related transcripts and other structural analysis like desmosome and cytokeratins. But we don’t have the manpower to perform these experiments. Considering the broad interest of our community for the induction of EMT we thought that these observations were sufficiently interesting to be reported although they were somehow distant from the focus of our study on centrosome positioning.

      *10) Fig. 5C: In the desmoplakin and periplakin knock-down experiments the cells stained for vimentin appear to have their vimentin "baskets" rather well polarized. Is this true or my impression based on the few cells illustrated in the images? If the cytoskeleton is polarized, what does one mean with loss of cell polarity? Is centrosomal polarity change associated with mesenchymal (back to front) polarity gain? If this is true can polarity be established by studying only 2 cells in the H-shaped microcultures? Is it not more relevant to allow cells build a cohort of inter-adherent cells? *

      Authors: This is an interesting observation and a thoughtful analysis. And indeed, it can be seen in the quantification of polarity indices of periplakin knocked-down cells (see Figure 4E and figure S2) that the distribution seems to contain two distributions, one of epithelial-like orientation (values closed to +1) and one of reversed orientation (values closed to -1), toward the ECM, similar to our previous description of polarity reversal during EMT (Burute et al, Dev Cell, 2017). This suggest that indeed the knockdown of periplakin might not only impair the epithelial apico-basal polarity but also promote mesenchymal front-back polarity. Although interesting, we found it a bit too speculative to be stated in our revised version.

      *Reviewer #2 (Significance (Required)): *

      * Significance: The paper provides a quantitative analysis of single cell polarity and gene expression-based EMT and identifies plakin gene family members as potential regulators of cell polarity. If this finding can be substantiated via mechanistic work it will make an important contribution to epithelial cell biology. *

      * General assessment: As explained above, the paper is in a preliminary state, as it describes an observation that demands further analysis. Key cell biological constituents (desmosomes, cytokeratins) have not been included in the analysis. Specific key figure data are presented without explanation for the non-specialist, especially those data that have been generated based on older publications by some of the authors. *

      * Advance: At the present state, the paper does not make any advance but describes potentially interesting observations. *

      * Audience: This paper can stimulate interest in the broader field of cell biology and definitely more to the cell polarity and EMT sub-fields. *

      Authors: It is true that we don’t provide the molecular and physical mechanism by which plakins affect centrosome position. And this deserves indeed further characterisation. However, our work provides multiple and solid evidences for the implication of plakins in the regulation of centrosome positioning in epithelial cells and thus opens the way for interesting follow-up studies. Since no previous work reported this role of plakins, we think our work will be a valuable contribution to the field of cell polarity and to the recently rapidly growing field of plakins.

      *My field of expertise: I study signal transduction and transcriptional mechanisms that regulate the EMT and its association with cell proliferation in cancer cells. I have also specialized on studying dynamics of cytoskeletal assembly and architectural cell organization.

      *

      *Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this article the authors have analyzed the genes related to epithelial cell polarity and report the relevance of the desmosomal proteins epiplakin, desmoplakin and periplakin in this process. These genes are downregulated in cells that have lost cell polarity and their lack of expression correlates with the emergence of an EMT program. Moreover, interference in the expression of these proteins increase vimentin and likely other mesenchymal markers. The experiments are very neatly done, with all the appropriate controls; the methodology is adequate, the figures are well designed and are reader-friendly and the results have some interest. Therefore, I do not have objections related to these issues, other than two minor question indicated below. *

      Authors: We thank the reviewer for this positive assessment of our work

      *- Other EMT markers should be easily assessed in the cells transfected with the plakins shRNAs to analyze the extent of EMT in these cells. *

      Authors: We agree that an immunostaining of vimentin is way too preliminary to conclude about an actual induction of EMT. Hence our tempered conclusion about the “suggestion that cells might be engaged in a form of EMT”. As also mentionned by reviewer #1 a full characterization of the EMT state of these cells would require a long list of measurements, including the quantification of EMT-related transcripts and other structural analysis like desmosome and cytokeratins. But we don’t have the manpower to perform these experiments. Considering the broad interest of our community for the induction of EMT we thought that these observations were sufficiently interesting to be reported although they were somehow distant from the focus of our study on centrosome positioning.

      *- It would be interesting if the article is reviewed by a scientist with a deeper knowledge in EMT because the text contains some inaccuracies related to this process and the main references are outdated. *

      Authors: It is unfortunate that the reviewer was not more specific in his/her assessment. There are lots of references in the field of EMT. Not so many are related to the polarized organization of cells and we tried to cited those we found significant. We have added more recent references in the revised version of our introduction. We hope this will be satisfactory but we would be happy to complement this list.

      *Reviewer #3 (Significance (Required)):

      However, the significance of the conclusions is very limited. The relevance of desmosomes in cell polarity was described time ago by the Fuchs' group (see Lechler T, Fuchs E, J Cell Biol 2007, 176, 147-154); since then, this topic has been investigated by many other labs. For a more recent work see "Desmosomes polarize and integrate chemical and mechanical signaling to govern epidermal tissue form and function" Broussard et al, Curr Biol 2021, 31, 3275-3291. *

      Authors: This is correct. The role of desmosome in the establishment and maintenance of the apical pole of epithelial has been well established. However, their role in the positioning of centrosome is much less clear.

      Please note that the paper by Lechler and Fuchs is not about epithelial polarity. It describes the loss of astral organisation of microtubules in differentiating epidermal cells forming desmosomes thanks to the recruitment of ninein to desmosoems by desmoplakin.

      Please also note that the other study by the group of Kathleen Green is about the role of desmoplakin in ensuring distinct mechanical states in the apical and basal pole of epidermal cells. It is not related to the organisation of the microtubules, nor is it related to the position of the centrosome. So it is unclear to us how these works limit the significance of our findings about the role of plakins in the control of centrosome position and the establishment of apico-basal polarity. We were happy to include them in the revised version of our discussion anyway.

      Furthermore, our work is about three distinct plakins: periplakin, epiplakin and desmoplakin. Although they all localise to desmosomes their specific roles in the establishment and maintenance of epithelial polarity has not yet been established (as detailed in the general comments we wrote in the opening of this letter and in the revised version of our discussion). In addition, their specific roles should be distinguished from the multiple roles of desmosome in cell polarity, which involve inter-cellular junctions and connections to various inner cytoskeleton networks.

      So, although we acknowledge that a mechanistic understanding would significantly increase the strength of our study, we still believe that the demonstration of the involvement of these three plakins in the regulation of centrosome position in polarized epithelial cells is novel and significant.

      *Therefore, the authors need to analyze the mechanism with a greater detail if they want to contribute to the advance of this field. As a possible suggestion, they might use their plakin shRNA-transfected cells to investigate the signaling pathways that are altered, to transfect different desmoplakin mutants and describe their effects. *

      * Related to desmosome alterations and EMT, this has also been indirectly concluded in an article quoted by the authors (Chun and Hanahan). This might be also studied by the authors assessing if the main transcriptional factors related to EMT are altered in these cells. *

      Authors: We thank the reviewer for these constructive suggestions to deepen our investigation. The identification of these pathways could definitely highlight the mechanisms involved in the regulation of centrosome positioning. However, this is somehow beyond the scope of this study which was focused on the identification of regulators of centrosome asymmetric positioning in polarized epithelial cells. Counterintuitively, molecular motors did not seem to be involved. But several plakins were revealed. Further studies are now required to understand how they impact centrosome position.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this article the authors have analyzed the genes related to epithelial cell polarity and report the relevance of the desmosomal proteins epiplakin, desmoplakin and periplakin in this process. These genes are downregulated in cells that have lost cell polarity and their lack of expression correlates with the emergence of a n EMT program. Moreover, interference in the expression of these proteins increase vimentin and likely other mesenchymal markers. The experiments are very neatly done, with all the appropriate controls; the methodology is adequate, the figures are well designed and are reader-friendly and the results have some interest. Therefore, I do not have objections related to these issues, other than two minor question indicated below.

      • Other EMT markers should be easily assessed in the cells transfected with the plakins shRNAs to analyze the extent of EMT in these cells.

      • It would be interesting if the article is reviewed by a scientist with a deeper knowledge in EMT because the text contains some inaccuracies related to this process and the main references are outdated.

      Significance

      However, the significance of the conclusions is very limited. The relevance of desmosomes in cell polarity was described time ago by the Fuchs' group (see Lechler T, Fuchs E, J Cell Biol 2007, 176, 147-154); since then, this topic has been investigated by many other labs. For a more recent work see "Desmosomes polarize and integrate chemical and mechanical signaling to govern epidermal tissue form and function" Broussard et al, Curr Biol 2021, 31, 3275-3291.

      • Therefore, the authors need to analyze the mechanism with a greater detail if they want to contribute to the advance of this field. As a possible suggestion, they might use their plakin shRNA-transfected cells to investigate the signaling pathways that are altered, to transfect different desmoplakin mutants and describe their effects.

      • Related to desmosome alterations and EMT, this has also been indirectly concluded in an article quoted by the authors (Chun and Hanahan). This might be also studied by the authors assessing if the main transcriptional factors related to EMT are altered in these cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The new paper by Geay and colleagues studies epithelial cell polarity of triple negative breast cancer (TNBC) cell lines using a special H-shaped microculture device and confocal microscopy of centrosomes. Using a quantification method that calculates a polarity score, the polarity phenotype of each breast cancer cell line is associated to the corresponding transcriptome analyzed using an Affymetrix microarray platform. In this manner, expression of specific genes is correlated to the polarity score.

      In its second part, the study shifts its interest to the transdifferentiation process of EMT (epithelial-mesenchymal transition) and uses a published transcriptomic dataset based on human mammary epithelial cells that overexpress a series of oncogenic (Ras, TGF-beta) and EMT (ZEB1, ZEB2, TWIST1, E12) factors, and calculates an EMT score that is correlated to the expression of different genes identified in the published dataset. The interest in EMT is logical as EMT often correlates with the loss of epithelial cell polarity.

      Based on the two gene lists and their respective phenotypic correlation, known regulatory components of microtubule dynamics that can potentially regulate centrosomal position and thus epithelial cell polarity, are selected. Among these are genes encoding for components of desmosomes, the plakin family, that link membrane-based intercellular adhesions intracellularly to intermediate filaments, mainly cytokeratins, and indirectly with microtubules. Using traditional siRNA-based technology and an immortalized, non-malignant breast epithelial MCF10A cell line, silencing of specific plakin family mRNAs is shown to lead to polarity defects that correlate with concomitant high expression of the intermediate filament vimentin, the latter often used as a molecular marker of the EMT process. In other words, silencing of plakins leads to loss of breast epithelial cell polarity and gain of vimentin, a sign of enhanced EMT. This is the central observation of this study that is also captured in the title and it is not carried any further towards a mechanistic or deeper analysis.

      Major comments:

      I have two general or conceptual comments and one major technical comment:

      1) Unfortunately, the study does not provide an advance in terms of understanding the action of plakins as regulators of cell polarity or EMT. Both cellular processes are well characterized, and in the case of EMT, specific guidelines have been published that dictate the large number of complementary assays required for a proper assessment of EMT (see Yang, J., et al. Nat Rev Mol Cell Biol. 2020 Jun;21(6):341-352).

      2) The plakins studied make the intracellular adaptor interface that links desmosomes to intermediate filaments, primarily cytokeratins. The paper does not even mention at all these two important epithelial protein networks, which I believe should have been studied in both TNBC and HME-EMT cell models. Furthermore, the paper tries to emphasize the regulation of microtubular networks because of their established importance in organizing proper centrosomal positioning. Yet, the presentation of results and the discussion appears rather confusing and unclear as to whether the data present any real effects of plakin expression manipulation on microtubules. It appears that such effects were not scored, which leaves the central aim of the project incomplete and raises issues that demand further and deeper analysis of the regulation of centrosome positioning by plakins. Can the centrosomal effects be completely indirect or bypasser effects due to the overall architectural change that epithelial cells undergo when their desmosomes lose their rigid coupling to cytokeratins?

      3) The classic siRNA-based method is used to silence plakin family mRNAs. This well-established technology today demands the use of multiple independent siRNAs per mRNA and also rescue experiments in order to confirm the absence of so-called off-target effects.

      Specific comments:

      I also enlist here some specific comments in the order of the figure presentation:

      1) Fig. 1A lacks the images of Hs-578T and MCF10A cells.

      2) The data of Fig. 1C demonstrate score of 15-30% for the TNBC and "normal" epithelial cells. These data must be discussed in the context of the established literature on cell polarity. Is a 30% score anticipated for a polarized cell type? Is the difference between 30 and 20% significant in terms of the polarity of cells within a tissue? What would such scores be if one studied highly polarized cell monolayers on transwell filters? Is the H-shaped microsystem reliable?

      3) The gene expression data of the TNBCs or publicly available data for the same cell lines from TCGA should be used to generate a heat map that illustrates the positioning of the examined cells in the spectrum of luminal epithelial to claudin-low, mesenchymal breast cancer cells.

      4) The 13 HME cell models used in Fig. 2A should be described in detail despite their earlier publication 11 years ago. This is important because the derived EMT scores are slightly counterintuitive: the parental HME cells are plotted as having a higher EMT score than the transformed HMEs expressing Ras or Twist1. How can this be explained? P53 is well established as an epithelial differentiation factor that counteracts EMT. Why does shp53 and especially combined with Ras overexpression not lead to EMT? I note that this cell model is listed as having epi and mes varieties. What are these and why are these important phenotypes not presented in the results? TGF-beta is presented in the results as a transcription factor, yet it is a secreted growth factor. What does TGF-beta mean? HME cells overexpress the cDNA for TGF-beta (which one? There are 3 TGF-beta genes)or were the cell cultured in the presence of this cytokine?

      5) Minor semantic comment: does Fig.2B show collagen V or collagen XV? Related to this, the article has abundant typographical errors.

      6) Fig. 4: based on the major comment, this experiment requires analysis of rescue clones.

      7) Fig. 5C: the vimentin microscopy needs to be complemented with full EMT analysis using both microscopic and protein expression assays (see major comment). More importantly, desmosomal and cytokeratin organization analysis is missing.

      8) Fig. 5C: In the desmoplakin and periplakin knock-down experiments the cells stained for vimentin appear to have their vimentin "baskets" rather well polarized. Is this true or my impression based on the few cells illustrated in the images? If the cytoskeleton is polarized, what does one mean with loss of cell polarity? Is centrosomal polarity change associated with mesenchymal (back to front) polarity gain? If this is true can polarity be established by studying only 2 cells in the H-shaped microcultures? Is it not more relevant to allow cells build a cohort of inter-adherent cells?

      Significance

      Significance: The paper provides a quantitative analysis of single cell polarity and gene expression-based EMT and identifies plakin gene family members as potential regulators of cell polarity. If this finding can be substantiated via mechanistic work it will make an important contribution to epithelial cell biology.

      General assessment: As explained above, the paper is in a preliminary state, as it describes an observation that demands further analysis. Key cell biological constituents (desmosomes, cytokeratins) have not been included in the analysis. Specific key figure data are presented without explanation for the non-specialist, especially those data that have been generated based on older publications by some of the authors.

      Advance: At the present state, the paper does not make any advance but describes potentially interesting observations.

      Audience: This paper can stimulate interest in the broader field of cell biology and definitely more to the cell polarity and EMT sub-fields.

      My field of expertise: I study signal transduction and transcriptional mechanisms that regulate the EMT and its association with cell proliferation in cancer cells. I have also specialized on studying dynamics of cytoskeletal assembly and architectural cell organization.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Geay and colleagues examine potential regulators of centrosome positioning in an immortalised breast cell line in vitro on micropatterns that promote cell doublet formation. The authors mine expression data from breast cancer cell lines in vitro to identify microtubule-related transcripts that are potentially downregulated in cells with a mesenchymal phenotype. The authors identify some Plakin proteins, which upon depletion, are reported to change centrosome positioning relative to junctions. The authors propose that plakins are involved in the maintenance of epithelial polarity.

      Major comments:

      I applaud the authors for attempting to identify novel regulation of epithelial polarity. However, I am sorry to say that this manuscript is overtly preliminary. It is a collection of observations without any mechanistic insight (described below). Despite what I write below, I apologise in that these shortcomings as so extensive that I cannot recommend experiments that would 'fix holes', without essentially writing an entirely new project. Even after addressing the points below, I think it unlikely that the observations would make a coherent, mechanistic contribution to the field of epithelial polarity. I do not like to give reviews like this, but unfortunately, the submission of such preliminary works puts us in this position.

      1. 'Epithelial polarity' Throughout manuscript the authors refer to a 'polarity score' and the term 'epithelial polarity' when what they have actually measured is a specific angle of orientation of centrosomes in cell doublets in vitro. This is an overstatement and adds confusion. The term 'epithelial polarity' has overtones of a polarised epithelium, which such doublets do not model. There is no mechanistic investigation into how this polarity score relates to the ability to form a polarised epithelial monolayer, with apical-basal polarity orientations, either a monolayer on a substrate or a monolayer surrounding a single central lumen, such as these MCF10A cells are often used for in 3-dimensional culture. I suggest that the authors simply mention what they actually measure (and in their own words): "coordination of the centrosome along the nucleus-junction axis."

      2. In Figure 1A-C, cell doublets are reported and apparently quantified to measure a 'polarity score', which is the angle of orientation of centrosomes in cell doublets. Yet, there is no clear information that explains how the cutoff for what defines this polarity score is generated (e.g. why is the cutoff point chosen to be where it is?), or what it means for epithelial polarity (e.g. why is this cutoff point important to be at that site?). Moreover, there is no indication that these cells actually form connected doublets. Labelling and quantitation of potentially connected cells is absent. Do these actually form junctions to the same extent, such that any differences have been exhaustively excluded to be only from the centrosome orientation, rather than cell spreading and cell-cell contact differences (that would alter geometry)? In addition, statistical analysis for part C is missing.

      3. Fig 1D, 2A,B present select example genes correlated with either polarity score or EMT score (Fig 1D, 2B). It is unclear what insight providing select genes from many that are changed provides. In Fig 2A, an apparent EMT score (seemingly derived from mining of existing expression data not from this laboratory) is provided, ranked by an EMT. No description is provided for what these alterations are (e.g. what is a 'HME_Ras_Twist1E12_TGFb' sample?). Further, what this is supposed to indicate as a mechanistic insight is unclear.

      4. Figure 3 is highly preliminary. The entirety of Figure 3 is a correlation plot between EMT score and polarity score for microtubule-related transcripts. The authors state:

      "The graph showed an overall negative trend, which means that many genes were positively correlated with an EMT in HME were instead negatively correlated with epithelial polarity of TNBCs (Figure 3). This was expected and confirmed that the progression along EMT is associated with a loss of epithelial polarity."

      No statistical analysis is presented, no correlation scores and indication of robustness is provided. It is unclear how this provides any mechanistic insight. The authors themselves state that this association is expected.

      Moreover, the authors state "Interestingly, three plakins, namely epiplakin (EPPK1), desmoplakin (DSP) and periplakin (PPL) all appeared as clear outliers (Figure 3)."

      How is an outlier defined & why is this clear? Is the association of these key cell-adhesion molecules with an epithelial cell state novel or known?

      1. Figure 4. The authors perform siRNA-mediated depletion of Desmoplakin, Epiplakin and Periplakin in MCF10A cells. The authors report, "Interestingly, knocked-down cells in culture displayed abnormal shapes, being more elongated and less cohesive (Figure 4C)."

      No quantitation of such changes are provided. Moreover, cells with KD appeared to be at lower density. Can the authors exclude that these are not merely density-dependent effects.

      1. Throughout the work, the polarity index is reported from plakin depletion conditions with data from a reported 3 independent experiments seemingly pooled (no indication of graph of which independent experiment each data point comes from). Is the statistical analysis performed (missing in Fig 4E, present in Fig 5A-C, S2, S3) from pooled data? If so, this is in appropriate and should be from the averages of independent experiments, to understand batch effects. If not from pooled data, please alter graphs to display this appropriately.

      2. Figure 5A. It is unclear how F-actin is measure in the images. Is F-actin labelling a truly representative proxy for junction length?

      3. Fig 5C. Why are images of vimentin now provided not on micropatterns? The labelling of vimentin in siPeriplakin cells does not look appropriately controlled for by the other cell conditions. siPeriplakin is clearly at the edge of a colon, whereas this is not clear whether an appropriate region is labelled in the other conditions.

      Significance

      In the literature, there is a rich understanding of the molecular mechanisms of the cross-talk between cell-cell junctions, cell polarity complexes, and the organisation of the cytoskeleton. The authors are applauded for efforts to investigate whether there are common transcriptional downregulations of microtubule-related proteins that could potentially be key regulators of cellular polarisation. It is unfortunate that the work, as presented, is a series of modest observations, often the insight of which is overstated. Despite the analysis of plakins as a potential regulators of centrosome positioning between cell doublets, there is no mechanistic insight into a) how these plakins contribute to centrosome alignment asymmetry, or b) whether this is any way has an effect on true epithelial polarisation (beyond potential doublets on a micropattern). If significant development of mechanistic insight was added (requiring extensive additional experimentation, expected: 1-2 years of work), then the manuscript might be of interest to the cell biology community.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02247

      Corresponding author(s): Heinz Jacobs

      Description of the planned revisions

      Reviewer #1

      This manuscript by de Groot et al. is focused on investigating the role of the DNA damage tolerance (DDT) pathway for maintaining genomic stability in mammalian cells. All experiments are well designed and executed, and the conclusions are strongly supported by the experimental data. The authors generate a pair of congenic T cell lymphoma cell lines with either WT or PcnaK164R/-Rev1-/- genotype (the latter are referred to as double-mutant [DM] cells throughout the proposal). The DDT-deficient DM cells are surprisingly normal under standard growth conditions but show a strongly increased sensitivity to DNA damaging agents. Thus, the authors conclude that in the absence of exogenous stress a backup pathway exists to allow for normal growth, and a CRISPR screen reveals the CDC13/CTC1, STN1, and TEN1 (CST) complex as central for cell survival and cell cycle progression. Subsequent DNA fiber assays reveal that this survival relies on increased repriming of replication, and that exogenous stress overwhelms this backup pathway. Finally, the consequences of DDT deficiency were tested by whole genome sequencing of the WT and DM cells were exposed after a single round of UV stress. and subsequent whole genome sequencing was used to identify DNA alterations. The absence of DDT led to a striking increase in the number of deletions ranging in size from 0.4 to 4.0 kbp (called to a type 3 deletions). Importantly, such mutations are also present in many human tumor genomes and their level appears to be linked to alterations in DNA repair pathways (but no clear causal relationship was shown). The main take home message is that repriming after the lesion is the last resort when a replication forks stalls at a DNA lesion as this leads to the loss of 400-4000 bp of genome information at every such event. The DDT pathway serves to channel the responses towards less deleterious (or even error-free) replication outcomes.

      The authors like to thank this reviewer for the very positive summary of our study.


      Major points:


      1) The authors rely on a genetically modifed cell line in which the Pcna and Rev1 genes are altered (the latter by CRISPR/Cas9 technology). To rule out that any additional inadvertent genetic changes occurred that may influence the phenotypes see here, it would be important to show for at least a subset of the experiments that ectopic re-expression of Rev1 and WT PCNA can rescue the survival defects seen here.

      To rule out any inadvertent genetic changes, we opted for an isogenic system and used two independent clones which provided consistent phenotypes. Furthermore, the double mutant model has been often and independently published, showing very similar phenotypes as observed in this study (PMID: 36669105, PMID: 18498753, PMID: 37498746, PMID: 17105346). Additionally, in a p53-WT setting, Rev1-deficient and PCNA-K164R mutant mice are viable, develop normally, and are born at the expected mendelian frequencies, indicating a rescue in the absence of stress would not show an effect. In our published NAR paper (PMID: 35819193), we do not observe sensitivity of REV1-ko lymphomas (which is an isogenic SM clone of the DM and WT lymphoma), indicating that a rescue experiment would likely not help us much; and in the context of a single PCNA-K164R mutant, these become sensitive to genotoxic agents, in line with widely available literature. These aspects will be clarified by textual amendments.

      2) It is unclear whether all experiments were conducted with a single clone of each genotype or if different clones were tested. This should be clarified.

      A valid point, to exclude inter-clonal variations, two different isogenic clones were used for both DM and WT. This important aspect will be clarified in the results section.

      3) The increase in the type 3 deletions in human cancers is very obvious, and as the authors clearly demonstrate DDT-deficiency results in the very same type 3 deletions. Although there is no data for this shown here, I'm assuming that a single deficiency in Rev1 would show a distinctly different mutation pattern. Given alterations of DNA repair/DNA damage tolerance gene mutations in the human tumors, it appears very unlikely to me that all tumors lack the DDT in its entirety. So why would the type 3 deletions then emerge? The authors should provide a clearer model of how this might work that could be tested in the future.

      Indeed, the human tumors analyzed in the manuscript are unlikely to have DDT defects but do have the indicated alteration in other DNA repair genes. This led us to hypothesize that these type 3 deletions are not specific for DDT defects but more a general phenotype that results from replication stress, a hallmark of tumors and especially those suffering from specific DDR defects. We consider that future studies should address this important aspect in more detail and will extend the discussion section accordingly.

      4) The authors only assess the genome alterations after a single dose of UV irradiation. Do the type 3 deletions also accumulate (albeit at a much lower rate) when these cells are grown for an extended period of time under normal conditions and do such cultures ultimately undergo senescence once too many deletions have been acquired?

      We did culture these cells for an extended period of 5 months and compared the mutation profiles of pre-cultured and post cultured WT and DM cells. On the genomic level no major differences appeared between the pre- and post-cultured samples, indicating that these deletions likely do not accumulate easily over time. Furthermore, DM grow indefinitely and do not display any signs of senescence. We will clarify this relevant point and further extend on hypothesis that replication stress is likely to underlie the generation of type 3 deletions.

      Minor points:

      1) In the methods section the description of how the cell lines that are central to this work were generated is not clear. The authors start with a p53-/-PcnaK164R/loxP Rev1wt/wt background. Then the Rev1 was inactivated using CRISPR technology, but how the Pcna wt/- genotype in the WT was restored is unclear. It would be helpful to provide a schematic drawing of the gene targeting strategies as a supplementary figure.

      An important point, we will add a schematic figure and legend to clarify the generation of the isogenic cell lines.

      2) The authors should describe whether (and how many) independent clones of the DM (and may be WT) cell lines were tested and used in the experiments.

      As stated above, two independent clones were studied to exclude inter-clonal variation, and this info will be added to the result and material and methods sections.

      3) The approach by which the genome-wide mutation load was assessed for each genotype is not described in sufficient detail. Did the authors compare WT before and after UV exposure and DM before and after UV exposure separately or were just the genomes of WT and DM after UV exposure compared.

      We extensively analyzed the data in both pre- and post-UVC exposures. Based on these analyses we chose to display our data as revealed in figure 4, where 4B indicates the deletions prior to UVC exposure, and figure 4C the deletions acquired upon UVC exposure. Additional analyses can be provided upon request.

      Reviewer #2

      The manuscript by de Groot and colleagues investigates the cellular and mutational phenotypes of mouse cells that are mutant for REV1 and also carry a PCNA-K164R mutation that prevents post-translational modifications at this residue. This double mutant (DM) likely removes all mechanisms for the recruitment of canonical translesion synthesis polymerases (Y family and pol zeta), thus the authors use it as a general DNA damage tolerance (DDT) deficient model. Using the cell line, they find signs of increased replication stress and a reliance on repriming. A whole genome CRISPR screen revealed a genetic dependence of the DM cells on the CST complex. Sequencing the genome of DM cells showed a specific increase in a distinct category of large deletions, which were also shown to be present in cancer genomes. While the study raises interesting points and contains much valuable data, I find major issues with both the study design and especially with the methodology, which appear to make it unsuitable for publication in its present form.

      We like to thank this reviewer for the careful analyses of our data. Remarkably, while reviewer 1 praised the study design and methodology, this reviewer raised some concerns which feel are addressed and clarified appropriately as outlined below.


      Major points:

      __ __Study design:

      The paper focuses on the double mutant PCNAK164R/- REV1-/- cells throughout, without testing the single mutants. This is a major drawback. It is unclear whether such single mutant cell lines were available to the authors. A PCNA-K164R appears to have been published previously (Ref.46) but do they also have a REV1 mutant lymphoma in a tp53 muntant background? By comparing a double mutant to the wild type the authors miss the opportunity to assign phenotypes to either mutation. For example, large deletions very similar to those found here have been recently reported in human cells (Ref79, noted at the end of this manuscript). That paper shows that these are due to the loss of REV1 or REV3, and the concurrent loss of PCNA ubiquitination does not contribute to this phenotype (partially?).

      We do have the WGS data of single mutants, but as this data did not show significant mutational differences, we felt like it would distract from the main story and decided to leave these data out. The major difference of our findings compared to Gyüre et al. (PMID: 37498746) is lack of a specific deletion phenotype in REV1 single mutant clones. With this independent study, we consider the overlap of our findings as most relevant.

      A second example is the interesting observation that the DM cells rely on repriming even during unperturbed DNA replication. However, this could also potentially be the consequence of the inactivation of REV1. Again, single REV1 mutants should be assayed, and REV1-related literature discussed.

      The role of REV1 in repriming and replication fidelity has been studied extensively in multiple systems (PMID: 31178121, PMID: 3797129, PMID: 31607544, PMID: 32330130, PMID: 32577513, PMID: 36669105, PMID: 34508659, PMID: 34624216, and others). Given the fact that this has been firmly elucidated, we decided to focus on the DM. However, we agree that this important aspect deserves to be discussed in detail and will add this to the discussion section.

      Mutation detection methodology:

      The analysis of small scale mutations shows some unexpected results. Not only is there no effect of the DDT mutations, there is also no effect of UV irradiation (Fig. S6E). Several papers have described in vitro experiments with UV treatment showing the clear mutation spectra that are also seen in cancers (SBS7). UV induces these spectra in mice even in vivo (PMID: 34210801 - though this paper used UVB). So it is difficult to believe that there would be no mutagenic effect in the cells used in this manuscript. Could there be an analysis problem instead?

      The lack of UV signature has also come to our attention, but we clearly see an effect of the UV in the large deletions and cell viability, indicating these cells were exposed to UV. Additionally, we also provided the data to several independent bioinformaticians confirming our results. This excluded an experimental and analytical bias. Given these assurances, we theorized that the lack of a UV-mutation signature relates to the very low UVC dose these cells were exposed to. This is an experimental limitation caused by the marked UVC sensitivity of the DM cells. Of note, other published data employing DDT deficient systems also accumulated very low numbers of de novo mutations (PMID: 29323295, PMID: 37498746, PMID: 32330130). We agree that the surprising observation regarding the lack of a UVC signature deserves detailed explanation, which will be argued in the discussion section.

      The mutagenesis experiment and the mutation calling are incompletely described. Precisely how many clones were sequenced? A table should be provided with such data, and sequencing data must be uploaded to an accessible database.

      As mentioned for reviewer one, for the mutagenesis analyses two independent clones have been used for both genotypes, these provided very similar results. This relevant aspect will be indicated in the revised manuscript. The sequencing data have been uploaded and the link will be provided upon acceptance. Furthermore, we will extend the method section to clarify mutation calling.

      Most importantly, how was the mutation calling done? Did the authors sequence an initial cell clone, to which the post-treatment clones could be compared? Without doing that, detected 'mutations' include many heterozygous SNPs which are differently called in different samples due to stochastic read count differences. Indeed, the mutation spectra in Fig. S6D and E look precisely like standard SNP spectra: flat in the C>T and T>C segment. If this is indeed the issue, mutation calling can be improved somewhat by filtering against mouse SNP databases, but the experiment cannot be fully rescued.

      The mutation calling has been performed with the use of a standard (MM10, from a C57BL/6 mouse, the same background as our cell lines) mouse reference genome for all samples. We opted for this method as it is widely used (similar method as used by Gyüre et al., PMID: 37498746, but for human). Additionally, we used alternative filtering strategies to call mutations with high confidence, such as joint genotype calling. Importantly, we also used the untreated WT lymphoma as a reference, all of these methods provided very similar results that did not change the interpretation of our results. We agree that the original mouse genome sample would have served as the most ideal reference genome, however given the above outline of steps taken, we are confident in our conclusions. We will address these points in an extended discussion and method section.

      The detection of large deletions is equally problematic. Fig. S5 suggests that the deletions are found in the same locations in the WT and the mutant cells. The probable reason for this is that the authors are finding the exact same deletions in both cell lines, which pre-existed even the making of the mutant cells, and are simply differences compared to whatever reference genome they are using! The DM appears to produce enough extra deletions to be detectable, but the real difference between the WT and the DM is likely much stronger than found here.

      We thank this reviewer for pointing out this relevant comment. In accordance with the data gathered, we hypothesize that replication stress favors the formation of type 3 deletions. Consequently, our p53 deficient WT cell lines experience replication stress and thus will generate type 3 deletions. Using this cell line to generate the DM cell lines, we agree that these pre-existing type 3 deletions will be present in subsequent sequencing analyses. However, due to the enormous increase of replication in the DM additional type 3 deletions will accumulate. This aspect was the intended message of figure S5 A&B. We have also figures that only depict the differences between the type 3 deletions in WT and DM in predefined genomic regions (bins of 1 million bp).

      Figure:

      (A) Genome wide distribution of the difference in the number of type 3 deletions comparing DM minus WT in untreated conditions.

      (B) Genome wide distribution of the difference in the number of type 3 deletions comparing DM minus WT after 0.4 J/m2 UV-C exposure.

      (A)

      Figure could not be uploaded in this portal.

      (B)

      Figure could not be uploaded in this portal.

      We feel that generally, the issue that is put forward is that the use of a widely used standard reference would increase the background and thereby prevent the detection of small mutational changes. This however does not subtract from the mutational changes we did detect, leaving the core of the story and results unchanged. We agree that the effect size is likely to be stronger and we will address this aspect in the results and discussion section.

      Some specific comments:

      The CRISPR screen in the DM cells no doubt provided very valuable data, and CST is an interesting hit. The authors found that the STN1 gene could not be knocked out in the DM, but it could be when apoptosis was inhibited by Bcl2 overexpression. Unexpectedly, Bcl2 overexpression reduced the increased replication speed in the DM, thus interfering with the very effect the authors were trying to measure. Without understanding the mechanism of this effect, it is difficult to draw conclusions from this cell line. And again, the effect of Bcl2 and Stn1 should have also been assayed in a WT background as controls, not just in triple/quadruple mutant combinations.

      Would single CST mutants also affect the cell cycle profile? The authors conclude that CST appears to have a role in tolerance of endogenous replication impediments, but without seeing the effect of the single mutant on the cell cycle they can only conclude about such impediments that are created in the absence of REV1 and PCNA-Ub. These may well be the breaks that result in the large deletions shown later.

      Our prime interest in CST is only in the context of DDT. This means that we conclude that in the absence of DDT, CST seems to have a role in damage tolerance of endogenous replication impediments. We will highlight this better in the revised text to prevent confusion. Indeed, we initially speculated that CST would have a role in forming these deletions but due to lack of evidence we decided not to make this connection.

      Additionally previous studies reported extensively on the role of CST in telomere maintenance (PMID: 28934486 and many others), DSB repair (PMID: 29768208), maintenance of genetically unstable regions (PMID: 34520548, PMID: 29481669) and its role in DDT (PMID: 35150303, PMID: 37590191).

      The analysis of deletion size distribution in tumors is interesting and does appear to show that the 'type 3' deletions are a general phenomenon. However, the last point seems tautological: those tumors with a higher proportion of type 3 deletions have 'a sizeable increase of type 3 deletions'? (Fig. 5C). The fact that these deletions were also abundant in the WT cell line (Fig. 4B), where they are likely pre-existing genomic variations, suggests that such deletions can arise as part of spontaneous mutagenic processes even in normal cells. Their presence in all tumor types to similar degrees agrees with this.

      Indeed, we agree that the process that leads to these types of deletions would be present in WT or normal cells. The increase in replication stress is likely underlying the formation of type 3 deletions. Their accumulation in the DDT deficient system is likely because this system generates replication stress similarly as in the presented human tumors, which in both cases appears to favor the formation of type 3 deletions.

      Figure 5C is meant to give an overview of 5B using the density profiles similarly as shown in the previous figures. Additionally, this figure provides a direct comparison between the mouse and human density profiles of large deletions. Tumors with the cutoff of 25% of deletions that fall in type 3 range, have density profile similar to those of DM cells. We will alter the text accordingly to explain this relevant issue more clearly.

      Minor comments:

      __ __- The model organism for the DM cells (mouse) should be mentioned in the abstract.

      Will be done.

      • In the introduction, 4 modes of DDT are described including template switching without the formation of a post-replicative gap, but there is little evidence for this. Ref18 is the authors' own review, which cites further reviews.

      We agree that this specific mode of DDT is less well documented, we will adjust the text to clarify this point and provide additional references.

      • In the first Methods item, the plasmid used for transfection is not specified. "To obtain WT and PcnaK164R/-;Rev1-/- lymphoma cell lines, 10 x 106 lymphoma cells from a p53-KO, PcnaK164R/loxP mouse(46) were nucleofected."

      The plasmid used was pX333. We will provide the info and reference.

      • The y axis label of Figure 4F appears to be wrong (frequency vs. percentage)

      We will change this legend to percentage.

      Reviewer #3

      The manuscript by De Groot et al investigates the impact of the PCNA(K164R)-REV1 double inactivation on genome integrity in lymphoma cells. The group had previously demonstrated that the double mutant is lethal in mice (papers in PNAS and NAR) but here, in lymphoma cells, additional mechanistic work could be performed. Chiefly, they were able to conduct a CRISPR screen to investigate backup mechanisms in these double mutant lymphoma cells and they identified a specific complex (CST). Mutating one of the CST complex proteins within double mutant cells led to lethality that was rescued by Bcl2 overexpression, allowing for further mechanistic studies. WGS on such cells identified specific types of structural variants that would normally kill cells (mostly large deletions). Finally, they identify similar type deletions in databases of human tumours, with specific preferences with regard to treatment modality (more deletions with chemo, immunotherapy and hormonal therapy, and fewer in tumours treated with tamoxifen, imatinib and some other small molecule inhibitors).

      The authors like to thank the reviewer for the time invested and the appreciation of our novel insights.

      Please find below our responses to the remaining specific comments.

      Specific comments:

      1) can the authors comment on the physiological relevance of the screen, considering that the double mutation is lethal in normal tissues?

      The screen was performed to understand how cancer cells can survive in a DDT deficient setting. This increases our understanding of the general function of DDT and identify alternative pathways that enable cancerous as well as normal cells to cope with DNA damage. Furthermore, tumors can have defects in the DDT system, knowledge on DDT function may help to target these tumors with specific inhibitors and chemotherapeutics. A relevant aspect, that we will elaborate on in the revised discussion.

      2) can the authors suggest a mechanism regarding how CST would work to maintain the viability of the double knockout lymphoma cells?

      An important point, our insights gathered so far favor a model where the CST prevents the formation of single stranded DNA gaps. As the double mutant DDT deficient cells already accumulate a high number of post-replicative gaps, the lack of CST complex further increases those, leading to genomic instability and eventually cell death. To clarify our model, we will extend our discussion section.

      3) it is implied that STN1 deletion would only kill double mutant lymphoma cells but is this actually the case? (a similar deletion in wildtype and single mutant cells is a necessary control).

      The role of CST, and its ablation, has been extensively studied by many others. Our main interest in CST is in the context of DDT and how CST maintains cells in a DDT deficient setting. Because the single DDT mutants have been studied in detail, we here focused on the role of CST in our double mutant DDT deficient setting. This setting enabled us to identify CST as a potential novel back up mechanisms to cope with replication impediments.

      4) I really liked the data from human tumour databases (figure 5) but are the deletions there correlated with the same DDT profiles as investigated here?

      The human tumors discussed in the manuscript do not contain similar specific DDT defects as in the lymphomas. However, we do see that human tumors with varied DDR defects have an increase in these deletions. This led us to speculate that the type 3 deletions arise due to general replication stress, present in both the human tumors and the DM lymphomas.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by De Groot et al investigates the impact of the PCNA(K164R)-REV1 double inactivation on genome integrity in lymphoma cells. The group had previously demonstrated that the double mutant is lethal in mice (papers in PNAS and NAR) but here, in lymphoma cells, additional mechanistic work could be performed. Chiefly, they were able to conduct a CRISPR screen to investidate backup mechanisms in these double mutant lymphoma cells and they identified a specific complex (CST). Mutating one of the CST complex proteins within double mutant cells led to lethality that was rescued by Bcl2 overexpression, allowing for futher mechanistic studies. WGS on such cells identified specific types of structural variants that would normally kill cells (mostly large deletions). Finally, they identify similar type deletions in databases of human tumours, with specific preferences with regard to treatment modality (more deletions with chemo, immunotherapy and hormonal therapy, and fewer in tumours treated with tamoxifen, imatinib and some other small molecule inhibitors).

      Specific comments:

      1. can the authors comment on the physiological relevance of the screen, considering that the double mutation is lethal in normal tissues?
      2. can the authors suggest a mechanism regarding how CST would work to maintain the viability of the double knockout lymphoma cells?
      3. it is implied that STN1 deletion would only kill double mutant lymphoma cells but is this actually the case? (a similar deletion in wildtype and single mutant cells is a necessary control).
      4. I really liked the data from human tumour databases (figure 5) but are the deletions there correlated with the same DDT profiles as investigated here?

      Referees cross-commenting

      Nice to see that most of the comments are aligned.

      Significance

      Reasonable significance, potentially ablated by the (lack of) physiological relevance of the screen (see comments above).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by de Groot and colleagues investigates the cellular and mutational phenotypes of mouse cells that are mutant for REV1 and also carry a PCNA-K164R mutation that prevents post-translational modifications at this residue. This double mutant (DM) likely removes all mechanisms for the recruitment of canonical translesion synthesis polymerases (Y family and pol zeta), thus the authors use it as a general DNA damage tolerance (DDT) deficient model. Using the cell line, they find signs of increased replication stress and a reliance on repriming. A whole genome CRISPR screen revealed a genetic dependence of the DM cells on the CST complex. Sequencing the genome of DM cells showed a specific increase in a distinct category of large deletions, which were also shown to be present in cancer genomes. While the study raises interesting points and contains much valuable data, I find major issues with both the study design and especially with the methodology, which appear to make it unsuitable for publication in its present form.

      Study design:

      The paper focuses on the double mutant PCNAK164R/- REV1-/- cells throughout, without testing the single mutants. This is a major drawback. It is unclear whether such single mutant cell lines were available to the authors. A PCNA-K164R appears to have been published previously (Ref.46) but do they also have a REV1 mutant lymphoma in a tp53 muntant background? By comparing a double mutant to the wild type the authors miss the opportunity to assign phenotypes to either mutation. For example, large deletions very similar to those found here have been recently reported in human cells (Ref79, noted at the end of this manuscript). That paper shows that these are due to the loss of REV1 or REV3, and the concurrent loss of PCNA ubiquitination does not contribute to this phenotype. A second example is the interesting observation that the DM cells rely on repriming even during unperturbed DNA replication. However, this could also potentially be the consequence of the inactivation of REV1. Again, single REV1 mutants should be assayed, and REV1-related literature discussed.

      Mutation detection methodology:

      The analysis of small scale mutations shows some unexpected results. Not only is there no effect of the DDT mutations, there is also no effect of UV irradiation (Fig. S6E). Several papers have described in vitro experiments with UV treatment showing the clear mutation spectra that are also seen in cancers (SBS7). UV induces these spectra in mice even in vivo (PMID: 34210801 - though this paper used UVB). So it is difficult to believe that there would be no mutagenic effect in the cells used in this manuscript. Could there be an analysis problem instead? The mutagenesis experiment and the mutation calling are incompletely described. Precisely how many clones were sequenced? A table should be provided with such data, and sequencing data must be uploaded to an accessible database. Most importantly, how was the mutation calling done? Did the authors sequence an initial cell clone, to which the post-treatment clones could be compared? Without doing that, detected 'mutations' include many heterozygous SNPs which are differently called in different samples due to stochastic read count differences. Indeed, the mutation spectra in Fig. S6D and E look precisely like standard SNP spectra: flat in the C>T and T>C segment. If this is indeed the issue, mutation calling can be improved somewhat by filtering against mouse SNP databases, but the experiment cannot be fully rescued. The detection of large deletions is equally problematic. Fig. S5 suggests that the deletions are found in the same locations in the WT and the mutant cells. The probable reason for this is that the authors are finding the exact same deletions in both cell lines, which pre-existed even the making of the mutant cells, and are simply differences compared to whatever reference genome they are using! The DM appears to produce enough extra deletions to be detectable, but the real difference between the WT and the DM is likely much stronger than found here.

      Some specific comments:

      The CRISPR screen in the DM cells no doubt provided very valuable data, and CST is an interesting hit. The authors found that the STN1 gene could not be knocked out in the DM, but it could be when apoptosis was inhibited by Bcl2 overexpression. Unexpectedly, Bcl2 overexpression reduced the increased replication speed in the DM, thus interfering with the very effect the authors were trying to measure. Without understanding the mechanism of this effect, it is difficult to draw conclusions from this cell line. And again, the effect of Bcl2 and Stn1 should have also been assayed in a WT background as controls, not just in triple/quadruple mutant combinations. Would single CST mutants also affect the cell cycle profile? The authors conclude that CST appears to have a role in tolerance of endogenous replication impediments, but without seeing the effect of the single mutant on the cell cycle they can only conclude about such impediments that are created in the absence of REV1 and PCNA-Ub. These may well be the breaks that result in the large deletions shown later.

      The analysis of deletion size distribution in tumours is interesting, and does appear to show that the 'type 3' deletions are a general phenomenon. However, the last point seem tautological: those tumours with a higher proportion of type 3 deletions have 'a sizeable increase of type 3 deletions'? (Fig. 5C). The fact that these deletions were also abundant in the WT cell line (Fig. 4B), where they are likely pre-existing genomic variations, suggests that such deletions can arise as part of spontaneous mutagenic processes even in normal cells. Their presence in all tumour types to similar degrees agrees with this.

      Minor comments:

      • The model organism for the DM cells (mouse) should be mentioned in the abstract.
      • In the introduction, 4 modes of DDT are described including template switching without the formation of a post-replicative gap, but there is little evidence for this. Ref18 is the authors' own review, which cites further reviews.
      • In the first Methods item, the plasmid used for transfection is not specified. "To obtain WT and PcnaK164R/-;Rev1-/- lymphoma cell lines, 10 x 106 lymphoma cells from a p53-KO, PcnaK164R/loxP mouse(46) were nucleofected."
      • The y axis label of Figure 4F appears to be wrong (frequency vs. percentage)

      Referees cross-commenting

      I agree with comments by the other reviewers.

      Significance

      General assessment:

      My assessment is provided above, the manuscript is not suitable for publication in its present form. If I am correct about the faults of the experimental design, it would be advisable to repeat the entire mutagenesis experiment starting from newly isolated and sequenced single cell clones. Ideally, single mutants should also be included. Even if this is done, the novelty of the expected results is unfortunately compromised by the very similar recent data published from human cell lines. Alternatively, the mutation data could be left out, and the CST-based data could be expanded with more controls and hopefully more mechanistic insight.

      Advance:

      The CST-dependence of PCNAK164R/- REV1-/- cells and the general presence of 400-4000 bp deletions in tumours are both significant findings, but limited mechanistic insight is provided.

      Audience:

      DNA repair field.

      I have expertise in the field of DNA repair and mutagenesis.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by de Groot et al. is focused on investigating the role of the DNA damage tolerance (DDT) pathway for maintaining genomic stability in mammalian cells. All experiments are well designed and executed, and the conclusions are strongly supported by the experimental data. The authors generate a pair of congenic T cell lymphoma cell lines with either WT or PcnaK164R/-Rev1-/- genotype (the latter are referred to as double-mutant [DM] cells throughout the proposal). The DDT-deficient DM cells are surprisingly normal under standard growth conditions but show a strongly increased sensitivity to DNA damaging agents. Thus the authors conclude that in the absence of exogenous stress a backup pathway exists to allow for normal growth, and a CRISPR screen reveals the CDC13/CTC1, STN1, and TEN1 (CST) complex as central for cell survival and cell cycle progression. Subsequent DNA fiber assays reveal that this survival relies on increased repriming of replication, and that exogenous stress overwhelms this backup pathway. Finally, the consequences of DDT deficiency were tested by whole genome sequencing of the WT and DM cells were exposed after a single round of UV stress. and subsequent whole genome sequencing was used to identify DNA alterations. The absence of DDT led to a striking increase in the number of deletions ranging in size from 0.4 to 4.0 kbp (called to a type 3 deletions). Importantly, such mutations are also present in many human tumor genomes and their level appears to be linked to alterations in DNA repair pathways (but no clear causal relationship was shown). The main take home message is that repriming after the lesion is the last resort when a replication forks stalls at a DNA lesion as this leads to the loss of 400-4000 bp of genome information at every such event. The DDT pathway serves to channel the responses towards less deleterious (or even error-free) replication outcomes.

      Major points:

      1. The authors rely on a genetically modifed cell line in which the Pcna and Rev1 genes are altered (the latter by CRISPR/Cas9 technology). To rule out that any additional inadvertent genetic changes occurred that may influence the phenotypes see here, it would be important to show for at least a subset of the experiments that ectopic re-expression of Rev1 and WT PCNA can rescue the survival defects seen here.
      2. It is unclear whether all experiments were conducted with a single clone of each genotype or if different clones were tested. This should be clarified.
      3. The increase in the type 3 deletions in human cancers is very obvious, and as the authors clearly demonstrate DDT-deficiency results in the very same type 3 deletions. Although there is no data for this shown here, I'm assuming that a single deficiency in Rev1 would show a distinctly different mutation pattern. Given alterations of DNA repair/DNA damage tolerance gene mutations in the human tumors, it appears very unlikely to me that all tumors lack the DDT in its entirety. So why would the type 3 deletions then emerge? The authors should provide a clearer model of how this might work that could be tested in the future.
      4. The authors only assess the genome alterations after a single dose of UV irradiation. Do the type 3 deletions also accumulate (albeit at a much lower rate) when these cells are grown for an extended period of time under normal conditions and do such cultures ultimately undergo senescence once too many deletions have been acquired?

      Minor points:

      1. In the methods section the description of how the cell lines that are central to this work were generated is not clear. The authors start with a p53-/-PcnaK164R/loxP Rev1wt/wt background. Then the Rev1 was inactivated using CRISPR technology, but how the Pcnawt/- genotype in the WT was restored is unclear. It would be helpful to provide a schematic drawing of the gene targeting strategies as a supplementary figure.
      2. The authors should describe whether (and how many) independent clones of the DM (and may be WT) cell lines were tested and used in the experiments.
      3. The approach by which the genome-wide mutation load was assessed for each genotype is not described in sufficient detail. Did the authors compare WT before and after UV exposure and DM before and after UV exposure separately or were just the genomes of WT and DM after UV exposure compared.

      Referees cross-commenting

      It appears that our comments are mostly overlapping.

      Significance

      This manuscript is a continuation of the very systematic work by the Jacobs lab to dissect the molecular mechanisms by which DDT factors act and the role of DDT in DNA damage responses. Here the authors demonstrate that in complete absence of DDT factors is not lethal, but reveals the priming of replication as the last resort response to avoid cell death. The analyses of the human tumor genome sequences suggest that dysfunctional DDT response are likely intimately involved in the generation of a distinct set of genetic lesions found in many tumors. What remains unclear is how the DDT pathway is inactivated in tumors as there is no consistent pattern of DDT factors being mutated. Overall this manuscript is of broad general interest far beyond the DNA repair community. The novelty of this manuscript is how reduced by the fact that another group published similar data in the human RPE cells 2023 (see reference 79 as acknowledged by the authors), but observing the same phenomena in two distinct system provides additional weight to these discoveries.<br /> Expertise: My expertise is in the gene diversification processes that assemble and alter TCR and immunoglobulin genes. They involve a broad range of DNA repair factors and DDT plays a unique role in somatic hypermutation that allows for the generation of high affinity antibodies.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility, and clarity):

      Summary: In this work, Kant and co-workers describe a two drugs regimen for therapeutics treatment of SARS-CoV-2 infection. SARS-CoV-2 infection of cells is dependent on the cleavage of the spike S protein by cellular proteases that prime S allowing the envelop protein to fuse of host membrane during entry and delivery of the viral genome to the target cell. The most important cellular protease is TMPRSS2 located at the surface of the cell. However, in cells with low TMPRSS2 levels, Cathepsins, located in endosomes have been shown to be able to also prime S. The therapeutic strategy of the authors relies on the combined usage of an inhibitor of TMPRSS2 (nafamostat) together with a compound that impairs endosomal maturation (apilimod) which is a key step for the activation of cathepsin. The rationale is that a dual regimen would be more effective to inhibit SARS-COV-2 infection. Using cell lines and a combination of SARS-CoV2 infection and pseudotyped VSV particles (VSV virus where the glycoprotein has been replaced by the SARS-CoV-2 spike proteins), the authors could show that a two-drug regimen was more efficient in preventing SARS-CoV-2 infection compared to single drug regimen. The authors next employed a mouse model of SARS-CoV-2 infection and similarly could show that bi-therapy was more efficient in preventing infection. Importantly, the authors describe a new formulation of the drugs that improve stability of the compounds and shelve life which could be of great benefit with respect to storage needs in therapeutic setting of the population.

      While the reviewer thinks the work is potentially very relevant, some of the conclusions are not fully supported by the data and additional experiments/quantifications should be performed to improve rigor and fully support the author conclusions.

      Major comments:

      • Throughout the paper, statistical analysis of the results should be performed to support the conclusion of the authors. Currently many experiments do not have statistical analysis and P values or statical significance are missing in most of the figures: Figure 1B, 1D, 4A, 5B, and S2. RESPONSE: As requested by the reviewer, the results of the statistical analysis of the differences are now reported for Figures 1B, 1D, 4A, 5B, and S2. There is no change in our conclusions as first reported in the original manuscript.

      • Quantification of the various pathology observed in mice should be quantified and scored. In the current version, the authors provided a supplementary table describing the pathology observed in individual mice upon SARS-CoV-2 infection. Adapted scoring of the different pathologies should be performed to obtain a statistical view of the pathology induced by SARS-CoV-2 and how this is prevented by the mono and bi-therapy approaches. RESPONSE: The mouse model employed in the present study, i.e. SARS-CoV-2 Beta infection in BALB/c mice, is characterised by a limited and short-lived viral infection of the lungs and rather subtle pathological changes, as described in detail in our previous publication (Kant et al., 2021. Viruses).

      We chose this model because it better mimics the typical (short-lived) respiratory infection observed in human patients than the K18-hACE2 model where infection is detected in nasal mucosa and lung parenchyma, generally sparing the respiratory epithelium, but also spreads to the brain (Seehusen et al., Viruses, 2022; De Neck et al., Viruses, 2023).

      In our model, infection of the lungs (i.e., alveoli) occurs strictly in association with infection of the airways, including the tracheal, bronchial, and bronchiolar epithelium, like the in hamster model. Pulmonary infection is, however, short-lived and wanes off around day 4. The histopathological changes, i.e. degenerative changes, and an inflammatory response, are at best mild in the untreated mice and not observed at all in successfully treated mice. (as summarized for each individual animal in Supplementary Table 2). . For these reasons, this information cannot be quantified by morphometry (which would be the most objective, hence best approach) or scored (a more subjective approach that would only be valid with distinct quantitative differences).

      Nevertheless, and in agreement with the reviewer that a quantitative approach is useful where possible we provide results from morphometry and to confirm the reduction in the degree of tissue damage (i.e., the extent of apoptotic death of infected respiratory epithelial cells; see comment below).

      • Additionally, table 1, is very difficult to read as mice are classified in 3 experiments but this does not match with the individual figures, making it very hard to look for the phenotypes. Is it an order issue within the table or are murine infection experiments performed in the order described in table 1? In this case, can the data be compared between the experiments as some conditions belong to experiment 2 and other to experiment 3? Given the low number of mice, do the experiments have statistical power? RESPONSE: We agree with the reviewer’s assessment of the figure and have therefore modified the graphs in Figure 2 B, to specifically relate experiments and data, by using circles for Experiment set 1 and squares for Experiment set 2.

      We can confirm the reported results have statistical power, particularly important given the constrain due to the low number of animals we were limited to use. As noted in the figure legends, that now includes the results from the statistical analysis, each of the three experiments included at least three control infected mice treated with vehicle. The infection levels in all the control vehicle treated infected mice are very similar in all three experiments.

      • To show that treatment of mice at 3 or 6 hpi indeed reduce the number of clv-capsase3 positive cells, the authors should perform a complete quantification and not limit their analysis to one representative tissue section from one animal. RESPONSE: Following the reviewer’s recommendation, we have now taken a quantitative approach in addition to illustrating the difference in cleaved caspase-3 expression. We have kept the images that illustrate the effect in tissue sections (Figure 4C).

      Briefly, we compared the extent of viral NP and cleaved caspase-3 expression between lungs of vehicle treated mice and mice treated with the drugs from 6 hpi onwards (3 mice per condition), using morphometry. Indeed, there was no significant difference in the extent of viral antigen NP expression in the lungs of the two groups of mice (Figure 4 B and C), which supports the PCR results representing viral RNA levels (Fig. Figure 4 A). However, there was a significant difference in the extent of cleaved caspase-3 expression in the consecutive sections. The results are shown in the new Figure 4D.

      • the authors insist on the new formulation that improves drug stability. To make this statement, this will need to be actively tested both in cell culture and in animal models: currently, the authors test the drugs stored 3 months at 25c or -20c and show that they remain active, but in this experiment freshly made drug was not directly tested in parallel. RESPONSE: As requested by the reviewer, we have extended our tests, and confirm our original view that the new formulation improves drug stability. Now shown in revised Figure 1C and D, we found equivalent inhibition in the cell infection assay using freshly made drugs and drugs stored at room temperature for 2 months.

      • Additionally, to make such a statement, different concentration of the drugs should be tested to calculate a IC50 for freshly prepared drug and stored drugs (as the current concentration tested might be at saturating concentration). RESPONSE: As requested by the reviewer, we have determined the IC50 for infection in cells of the drugs freshly prepared or stored. As reported in the revised Figure 1D, there were no differences detected.

      • Finally, the mouse experiments are performed with freshly made compounds and if the authors want to highlight the new formulation and increased stability, experiments in mice should be performed also with stored compounds. RESPONSE: We respectfully disagree with the reviewer on the need to perform additional in vivo experiments. We find no differences in the IC50 antiviral activity of the drugs prepared with our formulation and tested with cells in culture, whether fresh or kept for up to 2 months at room temperature. Given these observations, we feel that we cannot justify further animal experiments, neither ethically nor financially, using the same drugs with the same ab initio antiviral activity.

      • Alternatively, statement on drug stability should be removed or strongly tuned down from text. RESPONSE: We believe that the updated information included in the revised manuscript showing no difference in the IC50s of the compounds freshly prepared and stored at room temperature fully supports our original statement.

      • Statistical analysis on figure 2b should be done between Nafamostat alone and dual treatment to show that both drugs are cooperative in term of antiviral activities RESPONSE: We have carried the requested statistical analysis (Figure 2 B and C) and confirm that dual treatment is not only cooperative, but it also shows synergy, as we originally showed in our published work (Kreutzberger et al., Journal of Virology, 2021).

      • The authors state "A quantitative assessment of the in vivo synergy is shown here by the enhanced decrease of viral RNA in lungs of mice treated with both drugs at very low concentrations (Figure 2 B, compare using 2 mg/Kg apilimod dimesylate and 4 mg/Kg nafamostat mesylate alone, and in combination)." I guess, the authors want to comment on the fact that 0.2 mg/kg of apilimod and 0.4 mg/kg of nafamostat are as potent as 2 and 4 mg/kg. is that correct? If YES, to make this statement, bi therapy should be compared to mono therapy at the same concentration. RESPONSE: We apologise for not being clearer in the way we presented the information in our original version of the manuscript.

      Briefly, we compared the effect of high and low bi-therapy doses to the effect of Apilimod or Nafamostat used as single drugs at the highest concentrations. When administered alone, high dose Apilimod did not reduce infection. Nafamostat alone, even at 4 mg/Kg, decreases but does not completely block infection. When combined, even at low doses, the two drugs have a stronger antiviral effect than Nafamostat alone (and of course Apilimod, which was ineffective). Importantly, if the combined effect of the two drugs was merely additive, i.e. the arithmetic sum of the single effects, the addition of Apilimod, which alone has no in vivo antiviral activity, would not have improved the effect of Nafamostat. Instead, even at 10 times lower doses, the bi therapy significantly outperformed the single drug Nafamostat. Thus, the effect is synergistic (i.e. the effect of combined drugs is stronger than the mere sum of effects of each single drug).

      • when drugs are injected after infection (Fig 4), the drugs are not active. In fact, unless the reviewer mis-understood the plot, the mouse are even more infected compared to vehicle. The authors wrote that both regimes (3 and 6hpi) are equally less effective compared to drug administered during infection. The authors should write that both regimes are equally non protective. RESPONSE: We thank the reviewer for pointing out this imprecision. The modified text now reads “Both regimes, compared to drug administration at the time of virus inoculation, were equally ineffective in reducing the viral RNA load and NP expression in lungs as determined at 48 h.p.i. (Figure 4A, B).” (Line 236-238).

      • If drugs are not active after infection, does this approach really represent a therapeutic solution. The authors suggest that it does by limiting pathologies, but this needs to be better quantified (see comment above). RESPONSE: Our results suggest that application of the drugs post infection reduced the cytopathic effect of the virus in the respiratory epithelium in the lungs, reflected by a reduced extent of apoptotic cell death in association with infection. The finding is supported by quantitative morphometric analysis as shown in the new Figure 4D (see also comment above).

      • In the rebound experiment: unless the reviewer misunderstood, it appears that no conclusion can be driven from this experiment. Q-PCR data for vehicle animal a 4dpi show no sign of infection, so the experiment is not really interpretable since control animals are no longer positive. The authors suggest that there is less pathologies but this needs to be better quantified (see comment above). RESPONSE: We have tried to better word the rationale and interpretations of this experiment in the text. Following our drug treatments, viral antigen is still present in epithelial cells within the nasal mucosa, we also surmised that a small number of intact virions could have remained attached to the epithelial cells, trapped within their endosomes, or still within the environment surrounding the cells, any of them capable of triggering infection after removal of the drugs. Thus, the rationale behind the rebound experiment was to ask whether such remaining potentially intact virions could lead to a full reinfection of the lung two days after the treatment was stopped - which we found did not.

      We found that the virus did not regain full infectivity once the drug treatment was interrupted, resulting in undetectable lung PCR signal and very limited, sporadic antigen signal in the lung tissue.

      Minor comments:

      • I__t will make reading easier if the authors always mentioned which drugs inhibit what. For example: addition of the TMPRSS2 inhibitor nafamostat etc.... or addition of apilimod to block cathepsins activities..... __RESPONSE: Done

      • Figure 1: make a comment in the text that cells with low TMPRSS2 are more sensitive to the cathepsin inhibitor apilimod and vice versa, cells with high TMPRSS2 are more sensitive to nafamostat. This is expected and it could be highlighted. RESPONSE: Done

      • Figure 2B: how are the data normalized? should not RdRp, E and SubE all have a mean at 100% for the vehicle? RESPONSE: Done. Data are now normalized to the mean of RdRp measurements (which is indicated as 100%).

      • Line 211: something is missing here "when (Fig 2...) RESPONSE: Corrected

      • Line 221 should figure 4c RESPONSE: Corrected

      • Figure legends should only contain the details of the experimental design but should not contain description and interpretation of data. This is very minor and maybe a question of taste. __RESPONSE: __ Our figure legends are descriptive for some results and are in accordance with the style of PLOS Pathogens, the journal we are aiming this study.

      Editorial note:

      Referees cross-commenting: The other reviewers have highlighted the same limitations concerning the lack of quantifications of the immunochemistry and also the lack of robust statistical analyses. This should be highlighted to the authors as it appears to be the minimum to do prior publication. This should not take too much time as the data are in principle already available

      Reviewer #1 (Significance):

      The work by Kant and co-workers is potentially very significant but some limitations (as highlighted above) impair the impact of the work in his current version. The approach employing a two-drug regimen to combat SARS-COV-2 infection by targeting both TMPRSS2 and cathepsin activities is not new and was described before by the authors themselves. Employing this approach in an animal model is new and the new formulation improving drug stability and facilitating storage could be a game changer in therapeutic setting of patients. As such, this work could be highly significant and of broad interest. However, additional experiments and clarifications are needs to elevate this work to high impact standards. The reviewer believes that the requested experiments are easily achievable by the research teams of this project and think that the project will ultimately have a strong impact in the field.


      Reviewer #2 (Evidence, reproducibility and clarity):

      In this paper, the authors tested the antiviral activity of a combination of compounds by intranasal instillation in a mouse model of SARS-CoV-2. The two compounds used are PIKfyve Kinase inhibitor apilimod dimesylate, which inhibits endosomal maturation, and TMPRSS2 protease inhibitor nafamostat mesylate. The authors have previously shown that a combination of these two inhibitors acts synergistically to prevent entry and infection of SARS-CoV-2 in cell culture. Here, they further investigated the anti-SARS-CoV-2 activity of their combination of compounds by in vivo testing. They used Balb/c mice intranasally inoculated with the Beta variant of SARS-CoV-2. Their data show that concurrent administration of the combo together with the virus prevented lung infection without blocking nasal replication. Delayed administration of the compounds did not reduce replication in the lungs. The only effect was a decrease in bronchiolar cell death. Furthermore, they also tested the stability of the combo at room temperature and their data indicate that these compounds can be kept at room temperature for at least 3 months without losing antiviral activity, at least when resuspended in water. These data are potentially interesting, but they need to be consolidated by additional experiments.

      Major comments:

      • The authors only present immunohistochemistry to investigate viral replication in the nose. A quantitative analysis of replication would allow for better conclusions concerning viral replication in this organ. RESPONSE: We appreciate the reviewer’s comment and the wish to see viral antigen expression quantified in the nasal mucosa. As described below, however, practicalities associated with sample preparation prevented us from performing morphometric analysis. The complementary quantification of viral replication requires viral RNA by PCR. Unfortunately, we had not planned this aspect of the study and therefore did not collect the required fresh samples from nasal turbinates required for this analysis. Although interesting to investigate, we feel this is not vital for reaching the interpretation and conclusions derived from the current study. We thereby don’t think that this would be sufficient reason to undertake another round of infections, particularly taking into consideration that it would require sacrificing another significant number of animals.

      We could extend our morphometric analysis used in the lung and adapt it to the nasal mucosa. However, we are of the opinion that this would not provide trustworthy results. The main reason for this limitation is due to a problem that occurs during decalcification and paraffin embedding of the heads, which results in large variations in the area of the nasal mucosa as well as the olfactory epithelium in each section in different animals (Figure 3C provides some evidence of this).

      Briefly, we cut the entire heads longitudinally in the midline with a diamond saw and then gently decalcify the two halves of the head. This is followed by paraffin embedding. At some point during the process some of the thin and soft bits of nasal mucosa can become twisted and distorted, moving away from the cut surface exposed to the microtome blade. Therefore, the paraffin sections (appr. 3 µm thick) will in their majority not comprise full sections of the nasal mucosa. An objective comparative quantification of the extent of NP expression in the nasal mucosa would require (nearly) the entire mucosa to be assessed.

      • Complementary investigation on a potential anti-inflammatory effect of the drugs would also be welcome. Furthermore, it is surprising that the authors did not report potential weight changes. RESPONSE: Our mouse model, i.e. SARS-CoV-2 Beta infection in BALB/C mice, is characterised by the limited and short-lived viral infection of the lungs, rather subtle pathological changes and a limited inflammatory response strictly associated with the presence of viral antigens, as we previously described (Kant et al., 2021. Viruses). Hence, other animal models (for example the hamster model) would be more appropriate. Though potentially interesting, such investigations are beyond the current scope of our studies.

      In our study, the animal weight did not change during infection, in agreement with our earlier published work with the same animal model (Kant et al., 2021. Viruses). These data is now included in this manuscript.

      • It would have been interesting to complete the experiments with a demonstration that the compounds block viral transmission. RESPONSE: While it would be interesting to see whether the combined drugs also block viral transmission, such an experiment would require the use of a different animal model (possibly hamsters), an endeavour that is beyond the scope of our study. In our experience BALB/C mice infected with SARS-CoV-2 Beta variant do not transmit the virus. We have co-housed naïve BALB/C mice for 4 days with BALB/C mice intranasally challenged with 6 x 10^4 PFU SARS-CoV-2 Beta and have no evidence of virus transmission to the naïve mice (unpublished results). Similar results with C57BL/6 WT mice were obtained by Pan et al., Signal Transduction and Targeted Therapy, 2021).

      Minor comments:

      • The second paragraph of the introduction is not clear. It needs to be re-written. Furthermore, there is no evidence that Calu3 cells do not express cathepsins. RESPONSE: We have clarified this section of the introduction as follows:

      “It has been shown previously that SARS-CoV-2 infection can be blocked by serine protease inhibitors such as nafamostat mesylate in cells that express high levels of TMPRSS2 but very low or undetectable levels of cathepsin B/L (e.g. Calu-3 cells)5-7. In cells that instead express cathepsins but not TMPRSS2 (e.g. VeroE6 or A549 cells), infection depends on the delivery of endocytosed viruses to endo/lysosomes, a process that can be efficiently inhibited by drugs that interfere with endosome maturation and acidification such as Bafilomycin A1, chloroquine or ammonium chloride”.

      • Figure 4C: Is there any explanation for the lack of apoptosis? The authors should at least provide some hypotheses. Furthermore, this figure is quoted as Figure 4B in the text instead of Figure 4C. RESPONSE: For the revised manuscript, we have quantified the extent of apoptosis by a morphometric analysis of cleaved caspase-3 expression in the lung sections (now provided in new Figure 4C).

      We presently do not have an explanation for the reduction in the cytopathic effect of the virus, particularly in respiratory epithelial cells. This is an area of research we plan to continue investigating in future. We have commented on this in the Discussion session of the revised manuscript (Line 301-307).

      • Line 199: The authors claim that the effect of their combo is synergistic. However, this cannot be clearly concluded without appropriate additional experiments where they vary the concentration of the compounds. RESPONSE: The work we report here with mice is a follow up of our earlier work demonstrating the antiviral synergy of nafamostat and apilimod with cells in culture (Kreutzberger et al., Journal of virology, 2021). See comments to Reviewer 1.

      • Line 211: The sentence is incomplete RESPONSE: Fixed.

      • The lettering in the panels needs to be doublechecked. RESPONSE: Done.

      Reviewer #2 (Significance):

      __General assessment: __Finding new antiviral against SARS-CoV-2 remains a priority to fight against COVID-19. The validation of a combination of two molecules showing a partial antiviral activity in vivo is therefore of interest. However, this combo does not block viral replication in the nose and is inefficient when the treatment is added after infection, limiting the use of these molecules to prevent people in contact with COVID-19 patient of being infected. However, the authors should demonstrate that their molecules block viral transmission.

      __ Advance:__ The number of antivirals used in the clinics to treat COVID-19 patients remains extremely limited. Increasing the number of drugs available is still sorely needed. Audience: This paper potentially of large interest since the general population has been well informed of and/or have experienced COVID-19. Therefore, it is of interest beyond the virology and infectiology fields.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary: In manuscript reference RC-2023-02113, the authors addressed the impact of inhibitors of cell host factors as therapeutics against SARS-CoV-2 infection. They tested the combined inhibition of the enzymatic activities of the endosomal PIKfyve phosphoinositide kinase and the serine protease TMPRSS2, known as essential to meditate viral entry pathways: Conclusion: They showed a reduction, as assessed in vitro experiment (cell line) and in lung infection in mice intranasally- infected with SARS-CoV-2 beta. Moreover, the reduced viral infection is, as expected, associated to lower cell damage.

      Reviewer #3 (Significance (Required)):

      Positive points:

      • The topic is of interest.
      • Robust impact of the treatment although kinetic analysis post infection/symptoms are missing. Limitations:

      • Such a robust level of infection in this model (female BALB/c mice) is surprising, owing that the ACE is not the appropriate homologue. RESPONSE: We respectfully disagree with this concern. The BALB/c strain employed in the current study can be infected by the natural Beta variant, with mutations in the viral spike that allow it to bind to the murine ACE2 receptor and hence can efficiently infect the mice, as we previously described (Kant et al., 2021. Viruses).

      We chose the wt BalB/c model as it better mimics natural respiratory infection in human patients, while the transgene K18-hACE2 model also results in strong infection of the brain. As discussed above, while infection with the Beta variant is efficient, it is not associated with clinical signs, it has only limited pathological effects (mild tissue damage and very limited inflammatory response) and is naturally cleared after 4 days. The ancestral Wuhan strain of SARS-CoV-2 as well as most other variants, in contrast, are unable to bind murine ACE, hence would require the use of transgenic mouse models expressing the human ACE receptor.

      • It would have been interesting to complete the experiments with a demonstration that the compounds block viral transmission. RESPONSE: We apologize for our oversight of not including the statistical analyses in the original version of the manuscript. As requested, it is now included. We are pleased to confirm that in all cases, the differences were statistically significant between presence and absence of combined drugs, and fully support our original conclusions.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In manuscript reference RC-2023-02113, the authors addressed the impact of inhibitors of cell host factors as therapeutics against SARS-CoVé infection. They tested the combined inhibition of the enzymatic activities of the endosomal PIKfyve phosphoinositide kinase and the serine protease TMPRSS2, known as essential to meditate viral entry pathways: Conclusion: They showed a reduction, as assessed in vitro experiment (cell line) and in lung infection in mice intranasally- infected with SARS-CoV-2 beta. Moreover, the reduced viral infection is, as expected, associated to level cell damage.

      Positive points:

      • The topic is of interest
      • Robust impact of the treatment although kinetic analysis post infection/symptoms are missing

      Significance

      Positive points:

      • The topic is of interest
      • Robust impact of the treatment although kinetic analysis post infection/symptoms are missing

      Limitation

      • Such a robust levels of infection in this model (female BALB/c mice) is surprising, owing that the ACE is not the appropriate homologue.
      • Statistical analysis are missing for most of the results

      Should be improved to support the strong conclusions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, the authors tested the antiviral activity of a combination of compounds by intranasal instillation in a mouse model of SARS-CoV-2. The two compounds used are PIKfyve Kinase inhibitor apilimod dimesylate, which inhibits endosomal maturation, and TMPRSS2 protease inhibitor nafamostat mesylate. The authors have previously shown that a combination of these two inhibitors acts synergistically to prevent entry and infection of SARS-CoV-2 in cell culture. Here, they further investigated the anti-SARS-CoV-2 activity of their combination of compounds by in vivo testing. They used Balb/c mice intranasally inoculated with the Beta variant of SARS-CoV-2. Their data show that concurrent administration of the combo together with the virus prevented lung infection without blocking nasal replication. Delayed administration of the compounds did not reduce replication in the lungs. The only effect was a decrease in bronchiolar cell death. Furthermore, they also tested the stability of the combo at room temperature and their data indicate that these compounds can be kept at room temperature for at least 3 months without losing antiviral activity, at least when resuspended in water. These data are potentially interesting but they need to be consolidated by additional experiments.

      Major comments:

      1. The authors only present immunohistochemistry to investigate viral replication in the nose. A quantitative analysis of replication would allow for better conclusions concerning viral replication in this organ.
      2. Complementary investigation on a potential anti-inflammatory effect of the drugs would also be welcome. Furthermore, it is surprising that the authors did not report potential weight changes.
      3. It would have been interesting to complete the experiments with a demonstration that the compounds block viral transmission.

      Minor comments:

      1. The second paragraph of the introduction is not clear. It needs to be re-written. Furthermore, there is no evidence that Calu3 cells do not express cathepsins.
      2. Figure 4C: Is there any explanation for the lack of apoptosis? The authors should at least provide some hypotheses. Furthermore, this figure is quoted as Figure 4B in the text instead of Figure 4C.
      3. Line 199: The authors claim that the effect of their combo is synergistic. However, this cannot be clearly concluded without appropriate additional experiments where they vary the concentration of the compounds.
      4. Line 211: The sentence is incomplete
      5. The lettering in the panels needs to be doublechecked.

      Significance

      General assessment:

      Finding new antiviral against SARS-CoV-2 remains a priority to fight against COVID-19. The validation of a combination of two molecules showing a partial antiviral activity in vivo is therefore of interest. However, this combo does not block viral replication in the nose and is inefficient when the treatment is added after infection, limiting the use of these molecules to prevent people in contact with COVID-19 patient of being infected. However, the authors should demonstrate that their molecules block viral transmission.

      Advance:

      The number of antivirals used in the clinics to treat COVID-19 patients remains extremely limited. Increasing the number of drugs available is still sorely needed.

      Audience:

      This paper potentially of large interest since the general population has been well informed of and/or have experienced COVID-19. Therefore, it is of interest beyond the virology and infectiology fields.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this work, Kant and co-workers describe a two drugs regimen for therapeutics treatment of SARS-CoV-2 infection. SARS-CoV-2 infection of cells is dependent on the cleavage of the spike S protein by cellular proteases that prime S allowing the envelop protein to fuse of host membrane during entry and delivery of the viral genome to the target cell. The most important cellular protease is TMPRSS2 located at the surface of the cell. However, in cells with TMPRSS2 levels, Cathepsins, located in endosomes have been shown to be able to also prime S. The therapeutic strategy of the authors relies on the combined usage of an inhibitor of TMPRSS2 (nafamostat) together with a compound that impairs endosomal maturation (apilimod) which is a key step for the activation of cathepsin. The rational is that a dual regimen would be more effective to inhibit SARS-COV-2 infection. Using cell lines and a combination of SARS-CoV2 infection and pseudotyped VSV particles (VSV virus where the glycoprotein has been replaced by the SARS-CoV-2 spike proteins), the authors could show that a two drug regimen was more efficient in preventing SARS-CoV-2 infection compare to single drug regimen. The authors next employed a mouse model of SARS-CoV-2 infection and similarly could show that bi-therapy was more efficient in preventing infection. Importantly, the authors describe a new formulation of the drugs that improve stability of the compounds and shelve life which could be of great benefit with respect to storage needs in therapeutic setting of the population. While the reviewer think the work is potentially very relevant, some of the conclusions are not fully supported by the data and additional experiments/quantifications should be performed to improve rigor and fully support the author conclusions.

      Major comments

      • Throughout the paper, statistical analysis of the results should be performed to support the conclusion of the authors. Currently many experiments do not have statistical analysis and P values or statical significance are missing in most of the figures: Figure 1B, 1D, 4A, 5B,and S2.
      • Quantification of the various pathology observed in mice should be quantified and scored. In the current version, the authors provided a supplementary table describing the pathology observed in individual mice upon SARS-CoV-2 infection. Adapted scoring of the different pathologies should be performed to obtain a statistical view of the pathology induced by SARS-CoV-2 and how this I prevented by the mono and bi-therapy approaches. Additionally, table 1, is very difficult to read as mice are classified in 3 experiments but this does not match with the individual figures, making it very hard to look for the phenotypes. Is it an order issue within the table or are murine infection experiments performed in the order described in table 1?. In this case, can the data be compared between the experiments as some conditions belong to experiment 2 and other to experiment 3? Given the low number of mice, do the experiments have statistical power? To show that treatment of mouse at 3 or 6hpi indeed reduce the number of capsase positive cells, the authors should perform a complete quantification and not limit there analysis to one representative tissue section from one animal
      • the authors insist on the new formulation that improves drug stability. To make this statement, this will need to be actively tested both in cell culture and in animal models. Currently, the authors test the drugs stored 3 months at 25c or -20c and show that they remain active, but in this experiment freshly made drug was not directly tested in parallel. Additionally, to make such a statement, different concentration of the drugs should be tested to calculate a IC50 for freshly prepared drug and stored drugs (as the current concentration tested might be at saturating concentration). Finally, the mouse experiments are performed with freshly made compounds and if the authors want to highlight the new formulation and increased stability, experiments in mice should be performed also with stored compounds. Alternatively, statement on drug stability should be removed or strongly tuned down from text.
      • Statistical analysis on figure 2b should be done between nafamostat alone and dual treatment to show that both drugs are cooperative in term of antiviral activities
      • The authors state "A quantitative assessment of the in vivo synergy is shown here by the enhanced decrease of viral RNA in lungs of mice treated with both drugs at very low concentrations (Figure 2 B, compare using 2 mg/Kg apilimod dimesylate and 4 mg/Kg nafamostat mesylate alone, and in combination)." I guess, the authors want to comment on the fact that 0.2 mg/kg of apilimod and 0.4 mg/kg of nafamostat are as potent as 2 and 4 mg/kg. is that correct? If YES, to make this statement, bi-therapy should be compared to mono-therapy at the same concentration.
      • when drugs are injected after infection (Fig 4), the drugs are not active. In fact, unless the reviewer mis-understood the plot, the mouse are even more infected compared to vehicle. The authors wrote that both regimes are equally less effective compared to drug administered during infection. The authors should write that both regimes are equally none protective. If drugs are not active after infection, does this approach really represent a therapeutic solution. The authors suggest that it does by limiting pathologies but this needs to be better quantified (see comment above)
      • In the rebound experiment: unless the reviewer misunderstood, it appears that no conclusion can be driven from this experiment. Q-PCR data for vehicle animal a 4dpi show no sign of infection, so the experiment is not really interpretable since control animals are no longer positive. The authors suggest that there is less pathologies but this needs to be better quantified (see comment above)

      Minor comments

      • It will make reading easier if the authors always mentioned which drugs inhibit what. For example: addition of the TMPRSS2 inhibitor nafamostat etc.... or addition of apilimod to block cathepsins activities.....
      • Figure 1: make a comment in the text that cells with low TMPRSS2 are more sensitive to the cathepsin inhibitor apilimod and vice versa, cells with high TMPRSS2 are more sensitive to nafamostat. This is expected and it could be highlighted.
      • Figure 2B: how are the data normalized?. should not RdRp, E and SubE all have a mean at 100% for the vehicle?
      • Line 211: something is missing here "when (Fig 2...)
      • Line 221 should figure 4c
      • Figure legends should only contain the details of the experimental design but should not contain description and interpretation of data. This is very minor and maybe a question of taste.

      Referees cross-commenting

      the other reviewers have highlighted the same limitations concerning the lack of quantifications of the immunochemistry and also the lack of robust statistical analyses.

      this should be highlighted to the authors as it appears to be the minimum to do prior publication. this should not take too much time as the data are in principle already available

      Significance

      The work by Kant and co-workers is potentially very significant but some limitations (as highlighted above) impair the impact of the work in his current version. The approach employing a two-drug regimen to combat SARS-COV-2 infection by targeting both TMPRSS2 and cathepsin activities is not new and was described before by the authors themselves. Employing this approach in an animal model is new and the new formulation improving drug stability and facilitating storage could be a game changer in therapeutic setting of patients. As such, this work could be highly significant and of broad interest. However, additional experiments and clarifications are needs to elevate this work to high impact standards. The reviewer believes that the requested experiments are easily achievable by the research teams of this project and think that the project will ultimately have a strong impact in the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02224R

      Corresponding author(s): Austin Smith

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We thank the reviewers for constructive comments and helpful suggestions which we have adopted to clarify and improve the manuscript. In addition, we have added a link to a web portal that will allow readers to visualise gene expression profiles and create their own plots using our early human embryo UMAP embedding (https://bioinformatics.crick.ac.uk/shiny/users/boeings/radley2024umap_app/). Stefan Boeing created this tool and is added to the author list with agreement of other authors.

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary In this manuscript, Arthur Radley and Austin Smith designed a new feature selection method for scRNA-Seq, which is a successor to ESFW previously proposed by the same authors. As an evolution of this earlier framework, cESFW is also based on the idea that informative genes share information with other genes, whereas non-informative genes have a more random relative expression. The authors emphasize the key importance of feature selection in the scRNA-Seq workflow and assess the current state of the art for this step. They also propose that better feature selection leads to less data transformation. They show that cESFW outperforms Scran and Seurat feature selection in most cases of synthetic datasets. cESFW is then used in the context of early human development, re-analysing data from several published datasets where they show that they do not require batch correction. They also further strengthen the conclusion that a "2-step" model for TE-ICM and EPI-Hyp differentiation is also present in human embyros. Finally, they map several types of in vitro pluripotent stem cells, in particular primed and naive, to their manifold and study the evolution of the gene signatures during early human development. Overall, the manuscript is well written and presents a solid methodology. The re-analysis of human early development is convincing and justified. The main critic is that the quality of figures can be greatly improved: their resolution is too low and they are hard to read. For instance, more contrasted color schemes could be used to improve clarity, and given the high number of clusters for some UMAPs, indicating the name of some cluster near their centroids should improve clarity.

      We agree that the resolution of the figures should be improved. We had to compress the images to satisfy the size limit for uploaded documents to bioRxiv. Our final submission will be of higher quality (original figures are at 900dpi). With regards to colour schemes, this is a surprisingly difficult problem. We tried multiple colour palettes but could not achieve greater contrast. The suggestion to add key cluster names near to their centroids on the UMAPs is an excellent idea, which we have implemented.

      Comments: Page 2 I think the criticism of PCA is unfair because it is not a true feature selection method, and it is mainly used for computational purposes. I believe that for most workflows, between 30 and 50 PCs are retained, which do not significantly change the results in the downstream analyses. The citation (Yeung and Ruzzo 2001) does not seem appropriate, as they examine cases where only a small number of PCs are retained, outside the context of scRNA-seq.

      We agree that the criticism of PCA is insufficiently justified by the citation. We thank the reviewer for pointing this out and have removed the comment.

      "Furthermore, HVG selection has been found to be biased toward selecting highly expressed genes over low expressed genes." Could the author justify or remove this statement, as the Seurat and Scran methods are specifically designed to consider average expression to determine HVG? The cited article (Yip, Sham, and Wang 2019) raises this issue for methods other than Seurat and scran.

      The reviewer is correct that the provided citation highlights Seurat and Scran HVG selection as relatively insensitive to the average gene expression levels compared with other HVG selection methods. We again thank the reviewer and have deleted the comment.

      More generally, we have shortened the introduction, focusing on cESFW as a new approach to feature selection rather than critiquing alternative methods.

      Page 6 I might have missed it, but I do not understand the number of cells in the early human development dataset also shown in Figure S2B. The Petropoulos et al. dataset alone is larger than the sum of cells from different cell types. Is there some filtering step that is not described?

      We have added text in the data availability section to clarify the cells used in our analysis:

      “The pre-implantation raw counts scRNA-seq data from Yan et al. 2013, Petropoulos et al. 2016, Fogarty et al. 2017, and Meistermann et al. 2021, were compiled into a single gene expression matrix by Meistermann et al. 2021. For information regarding quality control and cell filtering of these 4 datasets, please refer to Meistermann et al. 2021.”

      The unsupervised clustering used to annotate cell types is unconventional (especially with the high number of clusters chosen), which is not a problem, but should be clarified. Improving the figure 3D to make it clearer and providing a cell cluster correlation plot might help to better appreciate the relationship between cell types.

      We agree that the gene expression heatmap in figure 3D contributed little to the interpretation of the data/results. As suggested, we have replaced this heatmap with a cell cluster correlation plot to help appreciate cell state similarities. (Changes in figure 3.)

      It could be emphasized that the ICM/TE branch cell type is a major difference with the mouse topology, as the readers might not be aware that the ICM/TE is an unspecified blastocyst state that only exists in humans.

      There appears to be some misunderstanding around the use of “ICM/TE branch”. The cluster comprises an uncommitted population at the branching point from morula to either ICM or TE, as also described in the mouse embryo. We have adjusted the discussion to make more clear that the two branching point clusters are heterogeneous populations, not unitary cell types or states:

      “The branching populations reside at critical junctures in blastocyst formation, the partitioning of extraembryonic and embryonic lineages. These branchpoint clusters do not define unitary states. On the contrary, cells in these clusters are heterogeneous and may become specified to alternative fates. For example, PDGFRA, a hypoblast marker (Corujo-Simon et al. 2023), and NANOG, an epiblast marker (Allegre et al. 2022), are heterogeneously distributed in the Epi/Hyp branching population. Furthermore, branch cluster boundaries extend beyond the topological bifurcation, potentially indicating that cells remain plastic and may be redirected. This would be consistent with the demonstration in mouse embryos that cells expressing ICM genes remain capable of generating TE up to the late 32-cell stage (Posfai et al. 2017).”

      Page 9 To further substantiate the stepwise ICM/TE and EPI/PrE specification events, authors could project cells from each embryo on the UMAP, and analyze what are the co-occurrence of cells (as performed for instance in Meistermann et al 2021). This should show as reported (and cited by the authors) that some GATA3 positive cells (TE fated) start appearing from late morula stage and that ICM cells almost never co-exist with EPI nor Hyp in embryos.

      We appreciate this suggestion. We have generated the requested plots showing where cells from individual embryos at different developmental timepoints are positioned on our UMAP embedding. (new supplemental figure (New figure, Figure S6). We present a summary heatmap of cell co-occurrence in revised Figure 4. These results offer greater insight than the RNA velocity analysis, which we have moved to supplemental Figure S6. We have added discussion of these analyses in the “Lineage branching blastocyst development” Results section.

      Reviewer #1 (Significance (Required)):

      The presented methodology shows significant value especially in the field of scRNA-Seq, where the critical step of feature selection is often inadequately addressed. Furthermore, this field is characterized by a limited set of feature selection methodologies. cESFW appears to be an important alternative to HVG methods that could improve scRNA-Seq analysis in certain contexts.

      The new findings on early human development are somehow incremental, but a welcome addition to solidify the two-step model and refine the concept of reject cells. The audience for this early development context is specialized, but cESFW will most likely have an impact to the entire field of scRNA-Seq analysis.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Here, Radley and Austin present a novel approach for feature weighting in scRNAseq data based on entropy sorting. Feature selection is a central part of scRNAseq analysis, and it is most likely the case that there is no single approach that outperforms all others across all datasets. Hence, innovation in this space is needed for the field. The cESFW method presented here has several appealing properties from a theoretical point of view, and it also performs well on the synthetic and real datasets considered. Nevertheless, there are several major issues that need to be addressed before I can recommend the manuscript for publication:

      1 The original entropy sorting (eq 1 in SI 1) is based on only two discrete states. However, calculating entropy for continuous distributions can be more tricky and it is unclear to me what assumptions are made regarding the gene expression. Could the authors clarify what properties of the distribution are required for the updated ESE equation to be valid? Is the only assumption that values are drawn from the [0, 1] interval? What happens if values are highly skewed, ie forming a bimodal or power-law distribution rather than something close to a uniform distribution?

      We agree that it is beneficial to clarify these points. We have added a section titled “Assumed properties of underlying sample distributions” to the supplemental information. Briefly, we show that the ESS correlation metric is directly linked to the commonly used correlation metric, Mutual Information (MI). A desirable properly of MI is that it is able to capture non-linear/skewed relationships between features. The ES framework and ESS share this property with MI, allowing the ES framework to be relatively robust to presence of non-uniform distributions.

      The main assumption for applying ES is that the features can be meaningfully scaled between values of 0 and 1. For gene expression, an intuitive way of achieving this is to inspect each gene and designate 0 count values as having 0 expression activity, and the maximum counts as having activities of 1, and all values in between existing within the [0,1] interval. A useful property of ES is that we do not need to assume a particular shape or distribution of the samples within the [0, 1] interval. The ES framework is non-parametric and does not require an assumed distribution to calculate the conditional entropy (CE), even in the continuous form. This is possible because the ES framework is formulated by turning the probabilistic form of CE into an ordinary differential equation (ODE), where the only dependent variable, x, is the overlap between the minority state activities of each individual sample. This calculation is explicitly identifiable/calculable, and is permutation invariant, meaning the shape of the distributions of a reference feature (RF) and query feature (QF) does not need to be assumed/defined. In other words, the ES framework quantifies to what degree active expression states enrich/overlap with one another in a manner that is robust to different distribution shapes.

      2 How robust is the procedure for the choice of percentile for normalizing the gene expression scores? Does one get roughly the same results for 90-99th percentile or is it sensitive to this choice?

      We have carried out a sensitivity analysis on the choice of percentile for each of the synthetic datasets and added it to the manuscript. (New figure, Figure S11). We find that on each of our 4 synthetic datasets the final results of cESFW are robust to a wide range of normalisation percentiles.

      3 Similarly, I am concerned about the procedure for how to choose the number of significant genes. How robust is this process? Also, it is not altogether clear how to generalize the procedure outlined on p19. Most potential users would benefit from more quantitative guidelines. In particular, having to rely on interpretation of GO terms typically requires a considerable amount of understanding about the system at hand which could make it challenging to apply the procedure for others. For most users it would be helpful to know how robust the procedure is to this step and also if there could be more stringent guidelines for how to decide which genes to include.

      We understand the reviewers concern regarding the robustness of feature selection on real scRNA-seq datasets. We have now applied our cESFW workflow to peripheral blood mononuclear cells (PBMC) scRNA-seq data, and found cESFW feature selection to be comparable, and by one metric more robust, than Seurat and Scran HVG selection (New Figure S2).

      As cESFW is applied to more scRNA-seq data, we will learn more about how results compare to highly variable gene selection, and how workflows may be adapted to optimise results in different scenarios. For example, we have found that supervising the selection of gene clusters using a small set of markers known to be important in the system of study can help identify which clusters of genes should be retained during gene selection. We have added this to the materials and methods with the following paragraph:

      “Furthermore, we suggest supervising the selection of gene clusters using a small set of markers known to be important in the system of study. In this work, we found that genes known to be important during early human embryo development (FigS4) are enriched in the dark blue cluster of genes, further suggesting that this cluster of genes is more likely to separate cell type identities in downstream analysis.”

      While gene cluster selection supervision in this manner requires a degree of domain expertise, we believe this is not unreasonable for most applications, and is the case for many scRNA-seq analysis pipelines.

      Our primary software contribution is the cESFW algorithm which calculates the ESS and EP matrices. With this manuscript we provide 6 commented workflows for applying cESFW to different datasets (4 synthetic data, human embryo data, PBMC data). We believe these workflows provide a good balance of documented use cases and user flexibility for cESFW usage. This is important because it is advantageous to be able easily to adapt workflows to incorporate domain expertise and different methodologies. Although workflows such as Seurat and Scran are user-friendly, their rigidity can be difficult when wanting to deviate from their standard workflows. In summary, we believe that our provided workflows are suitable for users to implement cESFW, while providing the flexibility to apply adapted pipelines.

      4 The comparison of the clusterings on p6 is not really fair is it? If I understand it correctly, the 3,012 genes identified by cESFW was used to define clusters in fig 3c through unsupervised clustering. The authors then use HVG methods to identify 3,012 genes and then carries out clustering based on those. To evaluate the methods the silhouette score is used, but the labels from the cESFW clustering is used as ground truth. This does not sound like a fair way to compare. Could the authors please clarify, and if needed come up with an approach where the three methods have a more level playing field if needed.

      The reviewer raises a fair point regarding the comparison of cluster identities and ranked gene lists. This issue is a chicken and egg problem, in that we require a baseline to benchmark different methodologies but lack an explicitly defined ground truth. For that reason we used synthetic datasets for initial comparison.

      For the human embryo data, we have presented substantial evidence that our cluster annotations are biologically coherent and consistent with prior knowledge. We therefore consider it legitimate to compare the ranked lists of Seurat, Scran and cESFW. However, we acknowledge the potential bias and have mentioned this in the “Limitations of the study” section.

      In addition, we have now analysed the peripheral blood mononuclear cells (PBMC) scRNA-seq dataset that is used in the tutorial workflows of Seurat and Scran. This PBMC dataset is arguably better defined since it has more discrete populations of cells, and by using the Seurat generated cell type labels we bias the analysis towards Seurat rather than cESFW. The results show that cESFW performs comparably to Seurat and Scran, and that the cESFW ranked gene list may be more stable than Seurat and Scran. These results suggest that cESFW can be widely applicable as a suitable alternative for feature selection. We have included this analysis in the Results and as a supplemental figure (New figure, Figure S2).

      5 The main cESFW.py file in the github repository is clearly well structured and commented. However, I would like to see a much better documentation so that one does not have to go through the source code to understand what functions there are and what they do. In particular, I would like to see a vignette to make it easier for others to incorporate cESFW into their workflows.

      We thank the reviewer for the positive comments regarding our cESFW.py commenting. We accept that our initial submission failed to point the reader directly towards our example workflows that provide step by step, well commented vignettes for using cESFW to analyse scRNA-seq data. In our initial submission we provided 5 workflows (4 synthetic data and the human embryo data), and in the re-submission we have added a workflow for analysing PBMC data. We have updated our cESFW Github to guide users to these example workflows (https://github.com/aradley/cESFW/tree/main).

      Please note, the embryo workflow will be easily accessible through GitHub, whereas the synthetic data and PBMC workflows will be provided through a Mendeley data link (referenced in the manuscript and on our GitHub). However, the content of the Mendeley link cannot be made public until the paper is finalised, as it cannot be changed after publication. We provide a temporary public Dropbox link for the reviewers so that they may access the additional workflows (https://www.dropbox.com/scl/fo/xr5o9xm6490ftjsa55wxg/h?rlkey=maindrxwdqnirsw1en3my5qsr&dl=0).

      Minor:

      Why are the figures not always in order? For example, fig S10 is mentioned before fig S2 on p 6

      Thank you for pointing this out; we have amended the text.

      I am not sure if the indexing in eq 1 (p 18) is correct. j is both on the LHS and it is also being summed over on the RHS. Should one of these be i instead?

      The indexing is correct. Each column j of a matrix refers to gene/feature on the RHS, and in the calculation on the RHS we take the column averages, leading to vector on the LHS that is still indexed by genes/features j. We have clarified this in the text.

      Reviewer #2 (Significance (Required)):

      The work presents a new method for feature selection in scRNAseq. Feature selection is a very important step and can have a big impact on findings. The method presented here is theoretically sound and it seems to provide interesting result when applied to early embryo development. However, as cESFW is only tested for one dataset it is unclear how well the method generalizes to other problems and datasets.

      Appreciation of the utility of cESFW will grow as it is applied to more datasets. However, we would like to highlight that the human embryo dataset consists of 6 independent scRNA-seq datasets from different laboratories, and that cESFW was able to identify common and differing structure between them without any batch correction, smoothing or feature extraction. We have added to our summary that we propose cESFW may be best suited to analysis of transcriptome trajectories in time course and developmental data. However, we have also now performed comparison of Seurat, Scran and cESFW feature selection in a different context, using a reference PMBC scRNA-seq dataset. The results demonstrate that cESFW is a viable alternative for feature selection in that static system also (New figure, Figure S2).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Here, Radley and Austin present a novel approach for feature weighting in scRNAseq data based on entropy sorting. Feature selection is a central part of scRNAseq analysis, and it is most likely the case that there is no single approach that outperforms all others across all datasets. Hence, innovation in this space is needed for the field. The cESFW method presented here has several appealing properties from a theoretical point of view, and it also performs well on the synthetic and real datasets considered. Nevertheless, there are several major issues that need to be addressed before I can recommend the manuscript for publication:

      1. The original entropy sorting (eq 1 in SI 1) is based on only two discrete states. However, calculating entropy for continuous distributions can be more tricky and it is unclear to me what assumptions are made regarding the gene expression. Could the authors clarify what properties of the distribution are required for the updated ESE equation to be valid? Is the only assumption that values are drawn from the [0, 1] interval? What happens if values are highly skewed, ie forming a bimodal or power-law distribution rather than something close to a uniform distribution?
      2. How robust is the procedure for the choice of percentile for normalizing the gene expression scores? Does one get roughly the same results for 90-99th percentile or is it sensitive to this choice?
      3. Similarly, I am concerned about the procedure for how to choose the number of significant genes. How robust is this process? Also, it is not altogether clear how to generalize the procedure outlined on p19. Most potential users would benefit from more quantitative guidelines. In particular, having to rely on interpretation of GO terms typically requires a considerable amount of understanding about the system at hand which could make it challenging to apply the procedure for others. For most users it would be helpful to know how robust the procedure is to this step and also if there could be more stringent guidelines for how to decide which genes to include.
      4. The comparison of the clusterings on p6 is not really fair is it? If I understand it correctly, the 3,012 genes identified by cESFW was used to define clusters in fig 3c through unsupervised clustering. The authors then use HVG methods to identify 3,012 genes and then carries out clustering based on those. To evaluate the methods the silhouette score is used, but the labels from the cESFW clustering is used as ground truth. This does not sound like a fair way to compare. Could the authors please clarify, and if needed come up with an approach where the three methods have a more level playing field if needed.
      5. The main cESFW.py file in the github repository is clearly well structured and commented. However, I would like to see a much better documentation so that one does not have to go through the source code to understand what functions there are and what they do. In particular, I would like to see a vignette to make it easier for others to incorporate cESFW into their workflows.

      Minor:

      Why are the figures not always in order? For example, fig S10 is mentioned before fig S2 on p 6

      I am not sure if the indexing in eq 1 (p 18) is correct. j is both on the LHS and it is also being summed over on the RHS. Should one of these be i instead?

      Significance

      The work presents a new method for feature selection in scRNAseq. Feature selection is a very important step and can have a big impact on findings. The method presented here is theoretically sound and it seems to provide interesting result when applied to early embryo development. However, as cESFW is only tested for one dataset it is unclear how well the method generalizes to other problems and datasets.

      My expertise is in computational genomics.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, Arthur Radley and Austin Smith designed a new feature selection method for scRNA-Seq, which is a successor to ESFW previously proposed by the same authors. As an evolution of this earlier framework, cESFW is also based on the idea that informative genes share information with other genes, whereas non-informative genes have a more random relative expression. The authors emphasize the key importance of feature selection in the scRNA-Seq workflow and assess the current state of the art for this step. They also propose that better feature selection leads to less data transformation. They show that cESFW outperforms Scran and Seurat feature selection in most cases of synthetic datasets. cESFW is then used in the context of early human development, re-analysing data from several published datasets where they show that they do not require batch correction. They also further strengthen the conclusion that a "2-step" model for TE-ICM and EPI-Hyp differentiation is also present in human embyros. Finally, they map several types of in vitro pluripotent stem cells, in particular primed and naive, to their manifold and study the evolution of the gene signatures during early human development. Overall, the manuscript is well written and presents a solid methodology. The re-analysis of human early development is convincing and justified. The main critic is that the quality of figures can be greatly improved: their resolution is too low and they are hard to read. For instance, more contrasted color schemes could be used to improve clarity, and given the high number of clusters for some UMAPs, indicating the name of some cluster near their centroids should improve clarity.

      Comments:

      Page 2 I think the criticism of PCA is unfair because it is not a true feature selection method, and it is mainly used for computational purposes. I believe that for most workflows, between 30 and 50 PCs are retained, which do not significantly change the results in the downstream analyses. The citation (Yeung and Ruzzo 2001) does not seem appropriate, as they examine cases where only a small number of PCs are retained, outside the context of scRNA-seq. "Furthermore, HVG selection has been found to be biased toward selecting highly expressed genes over low expressed genes." Could the author justify or remove this statement, as the Seurat and Scran methods are specifically designed to consider average expression to determine HVG? The cited article (Yip, Sham, and Wang 2019) raises this issue for methods other than Seurat and scran.

      Page 6 I might have missed it, but I do not understand the number of cells in the early human development dataset also shown in Figure S2B. The Petropoulos et al. dataset alone is larger than the sum of cells from different cell types. Is there some filtering step that is not described? The unsupervised clustering used to annotate cell types is unconventional (especially with the high number of clusters chosen), which is not a problem, but should be clarified. Improving the figure 3D to make it clearer and providing a cell cluster correlation plot might help to better appreciate the relationship between cell types. It could be emphasized that the ICM/TE branch cell type is a major difference with the mouse topology, as the readers might not be aware that the ICM/TE is an unspecified blastocyst state that only exists in humans.

      Page 9 To further substantiate the stepwise ICM/TE and EPI/PrE specification events, authors could project cells from each embryo on the UMAP, and analyze what are the co-occurrence of cells (as performed for instance in Meistermann et al 2021). This should show as reported (and cited by the authors) that some GATA3 positive cells (TE fated) start appearing from late morula stage and that ICM cells almost never co-exist with EPI nor Hyp in embryos.

      Significance

      The presented methodology shows significant value especially in the field of scRNA-Seq, where the critical step of feature selection is often inadequately addressed. Furthermore, this field is characterized by a limited set of feature selection methodologies. cESFW appears to be an important alternative to HVG methods that could improve scRNA-Seq analysis in certain contexts.

      The new findings on early human development are somehow incremental, but a welcome addition to solidify the two-step model and refine the concept of reject cells. The audience for this early development context is specialized, but cESFW will most likely have an impact to the entire field of scRNA-Seq analysis.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This is an excellent paper experimentally exploring variations in carboxylation rates of form I rubisco.

      We thank the Reviewer for this positive feedback on our study.

      I have three comments

      Q. Did the authors also measure the oxygenate activity of the enzyme? This is relevant to the evolution of carboxysomes and CCMs in general.

      We agree that this is relevant but there are technical limitations for achieving this with the current framework. This pipeline’s emphasis on the carboxylation rate allows for the screening of a high number of rubisco variants covering a wide genetic diversity. It provides a way to approach the complexity of the kinetic constraints of this enzyme with a realizable and reproducible method. However, we agree that the measurements of oxygenase activity, as well as affinities to CO2 and O2 , would enrich our understanding of the evolution of rubiscos in the context of CCMs, and thus expanding our pipeline in the future to cover the other kinetic dimension would be worthwhile but cannot be achieved currently due to various technical constraints (other dimensions to explore, like the temperature effect, could also be included). In line also with the comment of reviewer #3, we have expanded our discussion in the manuscript to clarify this matter more thoroughly:

      “By centering our analysis on the carboxylation rate, this pipeline systematically shows the particularity of carboxysome-associated rubiscos which are characterized by a poor affinity to CO2 (Badger et al, 1998; Falkowski & Raven, 2007) alongside a relatively high carboxylation rate (our data). This likely reflects the aforementioned catalytic tradeoff, suggesting that higher local concentrations of CO2 within CCMs probably allowed rubiscos to evolve towards higher kcat and KM. High-throughput measurements of other kinetic parameters beyond what was achieved here, such as the KM for both gasses or the oxygenation rate, would be valuable. It could provide values of the carboxylation efficiency, or even the enzyme specificity, which would enrich our understanding of this enzyme and of its adaptation to the atmospheric composition over geological timescales.”

      1. How do predicted structures (e.g., using Alpha fold) vary with catalytic efficient?

      We generated Alpha Fold structures of the 98 active variants revealed in this study and performed preliminary structural analysis of the active site. There was no strong correlation between measured rates and the active site structure. The RMSD of generated structures has a median RMSD of 2.7 Å for the large and small subunit together, and 1.3 Å for the active site, suggesting high conservation of the structure of rubisco, and probably explaining the difficulty to associate rate variation with any clear structural feature (especially coming from prediction algorithm already showing uncertainties on the order of magnitude of an angstrom (Jumper et al, 2021; Terwilliger et al, 2024)).

      We added a paragraph about this new analysis in the Results and in the Materials and Methods sections of the manuscript, and we present the results in new Supplementary Fig. 12. We made these structures available on the gitlab folder associated with this paper.

      1. The authors should note the paper by Tortell (not this reviewer) https://aslopubs.onlinelibrary.wiley.com/doi/pdfdirect/10.4319/lo.2000.45.3.0744

      Thank you. We added a reference to this paper in the following sentence: “Another strategy consists of the evolution of rubisco towards stronger affinity for CO2 (Tortell 2000).”

      Reviewer #1 (Significance (Required)):

      This is an excellent paper experimentally exploring variations in carboxylation rates of form I rubisco.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      I very much enjoyed reviewing this manuscript. de Pins et al provide a timely report on the catalytic turnover rate of a large number of Rubisco enzymes within the FormI group. These data provide novel insights into generalities of Rubisco function, specifically within certain phylogenies, and extend our understanding of carbon acquisition in these systems. In particular, the data presented by de Pins et al provide new insights into the relative carbon fixation rates of alpha-cyanobacteria, for which there are very few studies reporting catalytic turnover. It is apparent that the CO2 concentrating mechanisms (CCM) of cyanobacteria, especially the alpha-cyanobacteria (containing FormIA Rubisco) are a globally important contributor to CO2 capture into the biosphere via their carboxysomal Rubisco enzymes. This report provides then first broad selection of FormI Rubiscos to enable comparisons of catalytic turnover across this dominant enzyme family and shows that FormIA Rubiscos from phototrophic systems, and encapsulated in carboxysomes, are on average the fastest enzymes.

      We thank the Reviewer for the positive remark on the novelty of the study.

      de Pins et al use a high throughput screening technique that provides a highly correlative estimate of Rubisco turnover compared with traditional assays. This screen is based upon bulk expression of enzymes within E. coli from synthesised genes, and in some cases the co-expression of chaperonin factors to boost expression and solubility of holoenzymes. The assay process is sound and of high quality and the interpretations clear and uncomplicated.

      The conclusions are sound and I only have a number of minor issues for consideration.

      Minor comments: Temperature effects on 'true' Rubisco turnover rates. The authors quite reasonably note that a single measurement temperature was used in the assay and that this may not necessarily reflect the catalytic turnover of Rubiscos from thermophiles. Suppl. Fig 5b indicates a relatively large number of 'hot spring' species that have, generally, a low median kcat compared with, for example, both cyanobacterial classes. Can the authors comment on whether or not the thermophile set is not highly represented by one group (e.g. phototrophic alpha-cyanobacteria). Does this thermophile dataset have the potential to influence the generalities presented? Fig 2 would suggest this is not the case but it is not possible for the reader to know if all or any thermophiles are represented in Fig2 (as opposed to Suppl. Fig 4).

      We indeed identify 3 groups of rubiscos that are either expressed by thermophilic bacteria (Supplementary Figure 10), and/or are associated with hot environments (hot spring and hydrothermal vent; see Supplementary Figure 5). These rubiscos show relatively lower rates. We grouped them together and reproduced our main analysis (from Figure 2) with and without rubiscos from this group to create a new Figure (Supplementary Fig. 11). Interestingly, alpha-cyanobacteria had no representatives among this group of thermophilic and hot-environments-associated rubsicos. However, this is unlikely to explain the result observed as removing them does not influence the obtained tendencies. We add the following sentences in the Results section:

      “Additionally, the slightly lower carboxylation rate of rubiscos originating from thermophilic bacteria and isolated from hot environments (Supplementary Fig. 5 and 10) aligns with expectations, considering that these rubiscos naturally work at higher temperatures than in our in vitro assay (30°C). However, this is unlikely to explain the observed trends as the main results of this study are not affected by the removal of these rubiscos (Supplementary Fig. 11).”

      We also noticed, while reviewing the study, that the code generating Supplementary Fig. 8 and 10 incorrectly duplicated some dots for a few rubiscos (less than 5), although this did not affect the overall results. We have fixed this issue and have now updated the figures accordingly.

      Line 98: "in spite of" should be "despite"

      Done.

      Lines 171-173: There is an additional alpha carboxysomal Rubisco for which there are catalytic parameters described (Chapter 11 Engineering Photosynthetic CO2 Assimilation to Develop New Crop Varieties to Cope with Future Climates. RE Sharwood, BM Long - Photosynthesis, respiration, and climate change, 2021). This book chapter reports a kcat of 11.9 s-1 for the alpha carboxysomal Rubisco from Synechococcus WH8102, very much in line with the authors conclusions.

      We added this rubisco to the list of previously characterized rubisco and updated Figure 1 accordingly. We also updated the following sentence:

      “However, such statements were made based on scarce measurements, with only three kcat,C values currently available for both rubisco groups (Shih et al, 2016; Long et al, 2018; Sharwood & Long, 2021; Wilson et al, 2018; Aguiló-Nicolau et al, 2023).”

      Lines 232-234: I note that ref 30 posits that low CO2 was the more likely driver of carboxysome evolution than high O2.

      We agree that our phrasing did not point clearly enough that CO2 was more likely the driver of carboxysome evolution, rather than high O2. We rephrase the sentence to: “Carboxysomes likely evolved during the Proterozoic eon - in the context of the continuous decrease of carbon dioxide in Earth’s atmosphere (Flamholz & Shih, 2020)”

      Line 235: The preferred term is either "CO2 concentrating mechanism" or "inorganic carbon concentrating mechanism"

      We rephrased into “CO2 concentrating mechanisms”.

      Lines 254-256: The relative saturation of carboxysomes with Rubisco is still somewhat undecided, although relatively new datasets enable more accurate comparisons. A number of papers from the Liu Lab (Liverpool) enable estimates of Rubisco active site concentrations for alpha and beta carboxysomes in the range of 2-6 mM. It appears at this stage that Rubisco active site concentrations may be highest in alpha-carboxysomes.

      We thank the reviewer for drawing our attention to the work of the Liu lab which, while acknowledging potential inaccuracies in the estimates, tends to show a higher concentration of rubiscos in alpha-carboxysomes than in beta-carboxysomes (Sun et al, 2019; Sun et al, 2022). We removed this statement from the text.

      Lines 312-318: That genes were codon optimized for E. coli expression raises an interesting question about the effect of Raf1 on Rubisco solubility. Assuming expression rates were not constrained, can any conclusions be made as to the amino acid sequence differences that led to lower solubility? One assumes that the Rubisco sequences had a high degree of identity?

      We indeed note that the fact that every rubisco gene from this study was codon optimized for E. coli expression suggests that the solubility issues met here were post-translational. This suggests a post-translational role for Raf1 in rubisco folding/assembly that is in line with the mechanism proposed by Xia et al. of an interaction of Raf1 with rubisco large subunits dimer, further mediating the assembly of an octameric core and the recruitment of rubisco small subunits (Xia et al, 2020). We also note that insoluble rubiscos were met in all form I clades, suggesting that this property was not linked to a specific group of rubiscos sequences with a high identity degree.

      We add that we also tested the effect of not performing codon optimization on 8 rubiscos that were insoluble following codon optimization (to assess whether the codon optimization could negatively affect the folding, for instance by making the translation too fast). This did not yield any improvement in solubility.

      Lines 333-344: Was there an attempt to use acRAF (Raf2?) for FormIA Rubiscos that did not fold successfully in E. coli?

      While only 33% of β-carboxysome-associated (IB) rubiscos were originally soluble (i. e. without the coexpression of Raf1 from E. natronophila), 85% of the tested α-carboxysome-associated (IAc) rubiscos were soluble in our experimental conditions. We therefore decided that testing for the effect of co-expressing them with the chaperone acRAF would be less cost-effective.

      Reviewer #2 (Significance (Required)):

      This manuscript presents a significant advance in our broader understanding of the major enzyme involved in carbon input into the biosphere, Rubisco. It will be of key interest to those studying carbon biogeochemistry, global CO2 modelling, cyanobacterial and proteobacterial CCMs, and those interested in using these systems to improve plant-based carbon capture for food security and global carbon abatement systems. It provides, for the first time, a large dataset of hitherto unknown Rubisco kinetics in a globally important group of organisms. The study is extremely well carried out and will likely form the basis of future Rubisco screens to provide greater clarity to our knowledge base of this globally important enzyme.

      My expertise is in the study and application of CCMs as CO2 acquisition systems that can be used for Synbio applications.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors have presented a tremendous study into the diversity of carboxylation speed (kcatc) from bacterial Form I Rubisco enzymes. The authors identified some nice diversity in kcatc which resulted in the finding that Rubisco's originating from within a CCM were faster, which confirms what has been previously observed in the literature. The authors provided information on a pipeline to screen large numbers of Rubisco variants. In this manuscript, the authors tested 144 different enzymes with 112 of these successfully expressed in E.coli and of these 98 showed substantial catalytic activity. The authors showed that alpha cyanobacterial Rubisco possessed the fastest kcatc when compared to beta cyanobacterial counterparts which is contrary to that published in the literature so far. The authors have provided some nice insight into how they improved expression of soluble Rubisco with expressing bacterial chaperonin and Rubisco assembly factors such as Raf1 and rbcX. All which have been previously discovered plant and cyanobacteria. The authors also presented some nice correlations as shown in figure 2 and some weaker and non-correlations to various environmental parameters in the supplementary data.

      Overall, the field will learn something from this large body of work that has characterized only one Rubisco catalytic parameter.

      We thank the Reviewer for these positive comments.

      Major points: 1) The authors only measured carboxylation speed using a spec assay. The Michaelis constant for CO2 measured in N2 and 21% Oxygen is also valuable to understand the diversity in Rubisco catalysis. The authors should perhaps mention this and that the carboxylation efficiency is also an important measure for comparing Rubisco enzymes.

      In line with this and with the comment of reviewer #1, we have expanded the Discussion to include the following:

      “By centering our analysis on the carboxylation rate, this pipeline systematically shows the particularity of carboxysome-associated rubiscos which are characterized by a poor affinity to CO2 (Badger et al, 1998; Falkowski & Raven, 2007) alongside a relatively high carboxylation rate (our data). This likely reflects the aforementioned catalytic tradeoff, suggesting that higher local concentrations of CO2 within CCMs probably allowed rubiscos to evolve towards higher kcat and KM. High-throughput measurements of other kinetic parameters beyond what was achieved here, such as the KM for both gasses or the oxygenation rate, would be valuable. It could provide values of the carboxylation efficiency, or even the enzyme specificity, which would enrich our understanding of this enzyme and of its adaptation to the atmospheric composition over geological timescales.”

      2) The authors mentioned that they used E.coli lysates. Did the authors test for background activity due NADH dehydrogenases which are present in bacterial lysates? This could impact the catalytic rates measured.

      The background activity due to dehydrogenases from the lysate is negligible compared to the measured reaction (see as an example, in Supplementary Fig. 16A, the gray curve, corresponding to a CABP concentration of 90 nM, fully inhibiting rubisco in the reaction). Moreover, the use of CABP in the spectroscopic assay allows to take into account any “background activity” that would come from an element independent of rubisco because this background would be identical at every CABP concentration. We modified Supplementary Note 1 to make this point clearer:

      “We note that any NADH dehydrogenation due to other native E. coli proteins, while low (see the [CABP] = 90 nM gray curve of Supplementary Fig. 16A), is not influencing the measurement of the carboxylation rate which relies on the differential Vmax values at changing CABP concentrations (which should not affect the rates of these dehydrogenases).”

      3) For the microtitre plate assay, did the authors correct for the different pathlength? This is crucial for the Beer-Lambert law which is used to calculate the consumption of NADH.

      Because our assay is performed in a microwell plate instead of a standard 1 cm quartz cuvette, we indeed calibrated the assay to empirically determine the optical path length in our conditions. We add the following sentence in the Material and Methods section to better explain the measurement of NADH concentration in our assay:

      “Knowing the NADH extinction coefficient at 340 nm (ε340 = 6220 M-1cm-1), and after measuring the optical path length (𝑙 = 0.26 cm) with an NADH calibration curve in our setting, we used Beer-Lambert law (𝐴340 = ε340 . 𝑙 . 𝑐) to measure the NADH concentration 𝑐.”

      4) Did the authors consider studying the temperature response of kcatc for these enzymes? This could also reveal some interesting insight into their data.

      We indeed considered this. With a high-throughput pipeline involving >100 enzymes to express and test in parallel, we had to limit the experimental testing conditions to be realistic both in terms of time and budget. However, the finding that rubisco variants from thermophiles tend to have lower carboxylation rates in our standardized conditions (30°C) suggest that they will probably show faster rates at temperatures closer to their optimal growth temperature. We therefore agree that the study of the temperature response of the carboxylation rate in further works could validate these hypotheses and bring more insight into the catalytic characteristics of rubisco. We further emphasize this point in the following modified sentence:

      “Investigating the temperature response of rubisco carboxylation rate in further work could shed light on the importance of this parameter, especially among thermophilic or psychrophilic associated enzymes.”

      5) With this new catalytic knowledge, what can the field now do with this data to inform new research directions?

      We believe this new knowledge can inform new research directions in the field of microbial ecology, metabolic engineering, and machine learning in the context of kinetic parameters prediction. We modified a paragraph in the discussion to elaborate on this point:

      “This study provides a systematic exploration of bacterial form I rubisco maximal rates and its relationship with various contextual factors that could have shaped the evolution of this most abundant enzyme on Earth. It holds potential for future metabolic and ecological studies about specific bacterial species – for instance among cyanobacteria for which 40 rubiscos have been characterized here. By enriching our knowledge on carboxylation rates and their connection to environmental factors, it can also contribute to more accurately modeling global carbon fluxes. Additionally, this dataset of rubisco sequences and their associated rates can facilitate linking sequence motifs to catalytic function. Ultimately, it can improve our understanding, and possible harnessing, of bacterial CCMs for the potential development of plant-based carbon capture strategies, the increase of agricultural yields and the support of sustainable food production in the face of a changing climate.”

      Minor comments: The figures are of outstanding quality and easy to follow. This will set the bar high in the literature. I have no other minor comments.

      We thank the Reviewer for noting the care taken in the graphic presentation of our results.

      Reviewer #3 (Significance (Required)):

      Overall, the authors have presented an excellent study into bacterial Form I Rubisco's that will further enhance our understanding of Rubisco evolution. The pipeline for expression of bacterial Rubisco's in E.coli is developed nicely by the authors and the next step will be to determine how other important catalytic parameters can be determined to have more detailed understanding of Rubisco catalysis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors have presented a tremendous study into the diversity of carboxylation speed (kcatc) from bacterial Form I Rubisco enzymes. The authors identified some nice diversity in kcatc which resulted in the finding that Rubisco's originating from within a CCM were faster, which confirms what has been previously observed in the literature. The authors provided information on a pipeline to screen large numbers of Rubisco variants. In this manuscript, the authors tested 144 different enzymes with 112 of these successfully expressed in E.coli and of these 98 showed substantial catalytic activity. The authors showed that alpha cyanobacterial Rubisco possessed the fastest kcatc when compared to beta cyanobacterial counterparts which is contrary to that published in the literature so far. The authors have provided some nice insight into how they improved expression of soluble Rubisco with expressing bacterial chaperonin and Rubisco assembly factors such as Raf1 and rbcX. All which have been previously discovered plant and cyanobacteria. The authors also presented some nice correlations as shown in figure 2 and some weaker and non-correlations to various environmental parameters in the supplementary data. Overall, the field will learn something from this large body of work that has characterized only one Rubisco catalytic parameter.

      Major points:

      1) The authors only measured carboxylation speed using a spec assay. The Michaelis constant for CO2 measured in N2 and 21% Oxygen is also valuable to understand the diversity in Rubisco catalysis. The authors should perhaps mention this and that the carboxylation efficiency is also an important measure for comparing Rubisco enzymes.

      2) The authors mentioned that they used E.coli lysates. Did the authors test for background activity due NADH dehydrogenases which are present in bacterial lysates? This could impact the catalytic rates measured.

      3) For the microtitre plate assay, did the authors correct for the different pathlength? This is crucial for the Beer-Lambert law which is used to calculate the consumption of NADH.

      4) Did the authors consider studying the temperature response of kcatc for these enzymes? This could also reveal some interesting insight into their data.

      5) With this new catalytic knowledge, what can the field now do with this data to inform new research directions?

      Minor comments:

      The figures are of outstanding quality and easy to follow. This will set the bar high in the literature. I have no other minor comments.

      Significance

      Overall, the authors have presented an excellent study into bacterial Form I Rubisco's that will further enhance our understanding of Rubisco evolution. The pipeline for expression of bacterial Rubisco's in E.coli is developed nicely by the authors and the next step will be to determine how other important catalytic parameters can be determined to have more detailed understanding of Rubisco catalysis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      • I very much enjoyed reviewing this manuscript. de Pins et al provide a timely report on the catalytic turnover rate of a large number of Rubisco enzymes within the FormI group. These data provide novel insights into generalities of Rubisco function, specifically within certain phylogenies, and extend our understanding of carbon acquisition in these systems. In particular, the data presented by de Pins et al provide new insights into the relative carbon fixation rates of alpha-cyanobacteria, for which there are very few studies reporting catalytic turnover. It is apparent that the CO2 concentrating mechanisms (CCM) of cyanobacteria, especially the alpha-cyanobacteria (containing FormIA Rubisco) are a globally important contributor to CO2 capture into the biosphere via their carboxysomal Rubisco enzymes. This report provides then first broad selection of FormI Rubiscos to enable comparisons of catalytic turnover across this dominant enzyme family and shows that FormIA Rubiscos from phototrophic systems, and encapsulated in carboxysomes, are on average the fastest enzymes.

      • de Pins et al use a high throughput screening technique that provides a highly correlative estimate of Rubisco turnover compared with traditional assays. This screen is based upon bulk expression of enzymes within E. coli from synthesised genes, and in some cases the co-expression of chaperonin factors to boost expression and solubility of holoenzymes. The assay process is sound and of high quality and the interpretations clear and uncomplicated.

      • The conclusions are sound and I only have a number of minor issues for consideration.

      Minor comments:

      • Temperature effects on 'true' Rubisco turnover rates. The authors quite reasonably note that a single measurement temperature was used in the assay and that this may not necessarily reflect the catalytic turnover of Rubiscos from thermophiles. Suppl. Fig 5b indicates a relatively large number of 'hot spring' species that have, generally, a low median kcat compared with, for example, both cyanobacterial classes. Can the authors comment on whether or not the thermophile set is not highly represented by one group (e.g. phototrophic alpha-cyanobacteria). Does this thermophile dataset have the potential to influence the generalities presented? Fig 2 would suggest this is not the case but it is not possible for the reader to know if all or any thermophiles are represented in Fig2 (as opposed to Suppl. Fig 4).

      • Line 98: "in spite of" should be "despite"

      • Lines 171-173: There is an additional alpha carboxysomal Rubisco for which there are catalytic parameters described (Chapter 11 Engineering Photosynthetic CO2 Assimilation to Develop New Crop Varieties to Cope with Future Climates. RE Sharwood, BM Long - Photosynthesis, respiration, and climate change, 2021). This book chapter reports a kcat of 11.9 s-1 for the alpha carboxysomal Rubisco from Synechococcus WH8102, very much in line with the authors conclusions.

      • Lines 232-234: I note that ref 30 posits that low CO2 was the more likely driver of carboxysome evolution than high O2.

      • Line 235: The preferred term is either "CO2 concentrating mechanism" or "inorganic carbon concentrating mechanism"

      • Lines 254-256: The relative saturation of carboxysomes with Rubisco is still somewhat undecided, although relatively new datasets enable more accurate comparisons. A number of papers from the Liu Lab (Liverpool) enable estimates of Rubisco active site concentrations for alpha and beta carboxysomes in the range of 2-6 mM. It appears at this stage that Rubisco active site concentrations may be highest in alpha-carboxysomes.

      • Lines 312-318: That genes were codon optimized for E. coli expression raises an interesting question about the effect of Raf1 on Rubisco solubility. Assuming expression rates were not constrained, can any conclusions be made as to the amino acid sequence differences that led to lower solubility? One assumes that the Rubisco sequences had a high degree of identity?

      • Lines 333-344: Was there an attempt to use acRAF (Raf2?) for FormIA Rubiscos that did not fold successfully in E. coli?

      Significance

      This manuscript presents a significant advance in our broader understanding of the major enzyme involved in carbon input into the biosphere, Rubisco. It will be of key interest to those studying carbon biogeochemistry, global CO2 modelling, cyanobacterial and proteobacterial CCMs, and those interested in using these systems to improve plant-based carbon capture for food security and global carbon abatement systems. It provides, for the first time, a large dataset of hitherto unknown Rubisco kinetics in a globally important group of organisms. The study is extremely well carried out and will likely form the basis of future Rubisco screens to provide greater clarity to our knowledge base of this globally important enzyme.

      My expertise is in the study and application of CCMs as CO2 acquisition systems that can be used for Synbio applications.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an excellent paper experimentally exploring variations in carboxylation rates of form I rubisco.

      I have three comments

      1. Q. Did the authors also measure the oxygenate activity of the enzyme? This is relevant to the evolution of carboxysomes and CCMs in general.

      2. How do predicted structures (e.g., using Alpha fold) vary with catalytic efficient?

      3. The authors should note the paper by Tortell (not this reviewer) https://aslopubs.onlinelibrary.wiley.com/doi/pdfdirect/10.4319/lo.2000.45.3.0744

      Significance

      This is an excellent paper experimentally exploring variations in carboxylation rates of form I rubisco.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the three reviewers for their thoughtful and constructive comments. The changes to the text and figures made in response to the questions raised have made this a clearer and stronger manuscript. The additional citations suggested by the reviewers helped to further anchor our study within the growing literature on facultative parthenogenesis. Below we have responded to each comment in blue. We have added new data to the manuscript (Fig. 4C, Fig. S10B and Fig. S10D).

      Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      1. Summary: Here Ho et al. provide strong molecular evidence for the production of facultatively parthenogenetic whiptail lizards, through a gametic duplication. As evidenced through multiple routes, including microsatellites, WGS, RADseq, and RBC ploidy, and lines of evidence from multiple specimens, this study is timely in furthering our understanding of the mechanisms underlying FP. The findings are conclusive.

      That said, I have several comments that should be addressed prior to publication. The introduction which addresses FP in other systems fails to cite several key studies that provide strongly molecular support for terminal fusion automixis. Similarly, the study pushes the idea that this is an adaptive trait, however without proving that the parthenogens can themselves reproduce, this is a moot point at this stage.

      That said, my comments are minor. I found this to be an excellent study, well written, comprehensive in methodology, and one that I strongly advocate for publication.

      We thank reviewer 1 for referring to our manuscript as an excellent study and strongly advocating for its publication. We concur with his/her points that evidence for automixis in other systems was not sufficiently referenced and that the adaptive trait hypothesis for FP is somewhat speculative. The text has been modified accordingly (see below).

      Major comments - None.

      Minor Comments: Should be addressed.

      Line 36 - However, data that supports terminal fusion are no longer restricted to microsat data. Studies utilizing RADseq and whole-genome sequencing in snakes and crocodiles have now provided further evidence supporting terminal fusion.

      See: Booth et al. 2023. Discovery of facultative parthenogenesis in a new world crocodile. Biology Letters. 19, 20230129.

      Card et al. 2021. Genome-wide data implicate terminal fusion automixis in king cobra facultative parthenogenesis. Scientific Reports. 11, 1-9

      Allen et al. 2018. Molecular evidence for the first records of facultative parthenogenesis in elapid snakes. R. Soc. Open. Sci. 5, 171901.

      We have now included that automixis in other systems is supported by both microsatellite and NGS data in the abstract of our manuscript. The references have been included in the main text.

      Ln 42 - Evidence suggesting that isolation from males was not a pre-requisite for FP has previously been reported in snakes.

      See: Booth et al. 2011. Evidence for viable, non-clonal but fatherless Boa constrictors. Biology Letters. 7, 253-256.

      Booth et al. Facultative parthenogenesis discovered in wild vertebrates. Biology Letters. 8, 983-985.

      Booth et al. 2014. New insights on facultative parthenogenesis in pythons. Biol J Linn Soc. 112, 461-468.

      Despite the prior evidence to the contrary cited by the reviewer, it is still a commonly held belief among scientists and science journalists that isolation from males promotes or triggers FP. We have placed our findings in the context of other studies, including those mentioned above, that came to the same conclusion that isolation from mating partners is not a requirement for FP. We thank the reviewer for the additional citations, which are now included in the discussion section.

      Ln 48 - Is this really an argument. While an immediate transition to homozygosity will purge some deleterious alleles, given the genome-wide nature of this, there will also conversely have been strong selection for mildly deleterious alleles.

      Even though many FP animals have congenital defects, our data, combined with that of others, show that seemingly healthy animals arise as well. Even if these healthy animals harbor slightly deleterious alleles, the most detrimental alleles would have therefore been purged especially for subsequent generations. We have modified the abstract to be clearer: “Conversely, for animals that develop normally, FP exerts strong purifying selection as all lethal recessive alleles are purged in one generation.”

      Ln 56 - I would recommend the inclusion of both Allen et al. 2018. R. Soc. Open Sci, and Card et al. 2021. Sci Reports, here, as they are members of the elapids, not represented in the other examples.

      These two citations have been added.

      Ln 60 - Recent studies have highlighted the significance of sperm storage in reptiles. For example, Levine et al. 2021. Exceptional long-term sperm storage by a female vertebrate. PLos ONE. 16(6).e0252049, describe the storage of sperm by a female rattlesnake for ~70 months, with two instances of its utilization to produce healthy offspring during that period. Clearly, molecular tools are providing both support for long-term sperm storage, and an understanding of its utilization.

      Recent work has indeed provided new evidence for instances of long-term sperm storage and the two mechanisms are no longer competing hypotheses, but it is clear that both mechanisms exist in nature. We have modified the text accordingly to include “Nevertheless, clear examples of long-term sperm storage have also been documented in the recent literature (29), underscoring the need for molecular methods such as MS analysis or sequencing data to elucidate the underlying mechanisms.”

      Ln 68 - American Crocodile would also be suitable to include here.

      This has now been included in the list of examples of endangered species.

      Ln71 - The problem with this hypothesis is that parthenogens produced through FP tend to have very low viability. For example, Adams et al. 2023. Endangered Species Research, follow a cohort of sharks produced through FP and all survive. Similarly low levels of survival are reported across other systems for which FP was reported. More likely, FP is simply a neutral trait. The mother is not negatively impacted through producing parthenogens and can go on to produce sexual offspring. Few instances report successful reproduction of a parthenogen. See pers. Comm in Card et al. 2021. And Straube et al. 2016.

      We thank the reviewer for the comment and agree that more data on the successful reproduction of parthenotes are needed to claim that FP is an adaptive trait. We have modified the text to include that studies on “the successful reproduction by FP offspring” are needed to support this hypothesis and have included the Straube et al. 2016 citation. We decided to omit the Card et al. 2021 citation as the reports of second-generation FP was through personal communication mentioned in this study and the results themselves have not yet been published.

      Ln 79 - I doubt that there is a desperate need for this for conservation. However, I think there is a need to simply further our understanding of basic biological function, given that it is not uncommon, and is phylogenetically widespread in species lacking genomic imprinting.

      We agree that understanding FP as a basic biological function is important in light of the realization that it occurs more commonly than previously thought. We have added this aspect to the text: “A better understanding of the triggers and molecular mechanisms underlying FP and the fitness of the resulting offspring are therefore needed in a variety of contexts. These include: to understand a fundamental biological mechanism and its significance in vertebrate evolution, to aid in conservation efforts including captive breeding programs, and to possibly harness FP in an agricultural context (28).”

      Ln 85 - It would be worth citing Card et al. 2021., here given that they used genome-wide ddRAD markers to show support for terminal fusion.

      The citation has been added.

      Ln 91 - Better citations here are Card et al. 2021. Allen et al. 2018, and Booth et al. 2023, which all utilize either RADseq or WGS.

      These citations have been added.

      Ln 95 - The conclusion of genome duplication here was supported only by a small number of microsatellite loci. As such, given that terminal fusion has been supported through genome-wide markers in other species of snakes and crocodiles, the conclusion of genome duplication is likely incorrect.

      In light of the other examples that show terminal fusion in snakes, we have removed this sentence.

      Ln 96 - I would strongly disagree with this statement. Allen et al. 2018, Card et al. 2021, Booth et al. 2023, all provide evidence of heterozygous loci and thus support terminal fusion. While no species-specific chromosome level reference genome is available for any of these species, the fact that levels of heterozygosity are below 33% percent supports terminal fusion. Rates over 33% support central fusion, but have not been reported in any vertebrate to date. AS such, I would recommend the removal of this statement.

      We agree that the studies listed by the reviewer all support terminal fusion in snakes and crocodiles and therefore, we have removed the statement.

      Ln 121 - Recent work in Drosophila mercatorum and D. melanogaster suggest that three genes play a role in the activation of FP in unfertilized eggs. In this case, through the fusion of meiotic products. That said, it is plausible to assume that FP in these lizards has an underlying genomic mechanism that is not related to isolation from males. See Sperling et al. 2023. Current Biology. 33, P3545-P3560.E13.

      Clearly isolation from males is not a key trigger in FP in whiptail lizards and other vertebrate species. With recent work from Sperling et al. 2023 and the fact that selection has led to increases in parthenogenesis in birds, an underlying genetic mechanism may well be at play. We have cited and addressed this in the discussion and propose identifying the genetic basis for FP in whiptail lizards in future studies.

      “Recent work identifying key cell cycle genes inducing FP in two species of Drosophila (71) and selection resulting in higher incidences of parthenogenesis in birds (24, 33) suggest a genetic basis for the initiation of FP. [...] Additional whole-genome sequencing data for species with documented FP will aid in the understanding the genetic basis, propensity, and evolutionary significance of FP.”

      Ln 126 - While these data strongly support FP of the two unusual A. marmoratus appearing offspring, can long term sperm storage be ruled out. Either through captive history or allelic exclusion of other males in the group?

      We have added the following sentence to the text: “Given that all of these offspring are female, inherited only maternal alleles, and animal 122 had no history of being housed with a conspecific male during its lifetime, both interspecific hybridization and long-term sperm storage are all but ruled out and FP is strongly supported.”

      Ln 171 - 191 - Given that the topic of this manuscript is the genomic mechanism underlying FP in this species, are these data necessary? These are not discussed later and as such I would recommend that they are moved supplemental material. Otherwise, they simply clutter that manuscript and detract from the key question. Indeed, they are important to show that the genome constructed is of high quality, but online Supp Mat is the place for that here.

      We chose to keep this section in the main text for the following reasons: There is still a lack of published reference quality genomes for many reptile species and therefore we want to highlight that this A. marmoratus reference adds not only to the understanding of FP, but also expands the small list of reptile genomes and makes the first Aspidoscelis genome available to the community. The high quality and contiguity of the genome (as indicated by the high N50 value and BUSCO score) is important to emphasize in the main text because the absence of any heterozygous regions in FP animals supports a mechanism of post-meiotic genome duplication. We would not want to bury these key points in the supplement.

      Ln 296 - Comparable estimates were made for parthenogenetic production in wild populations of two North American pitviper species. See Booth et al. 2012. Biology Letters.

      In Booth et al. 2012, 2 out of 59 litters of the two pitvipers (3.39%) were identified to contain FP offspring and these results are very similar to our reported rate of FP in whiptail lizards. We have now included this similarity in our discussion. “Interestingly, these rates are similar to what has been reported for wild populations of two North American pitviper species (10)”.

      Ln 312 - Again, can this really be suggested? Above, the authors state that most FP animals that hatched had congenital defects, and a large number failed to hatch. This does not sound like strong support for generating individuals that counter the effects of population bottlenecks and inbreeding depression. The authors need to take this study further and monitor the long-term viability of the FP individuals that survive.

      We agree with the reviewer that the adaptive advantages of FP reproduction are dependent on the fitness and reproductive potential of FP offspring and present data is insufficient to clearly support this notion. We have modified the text to include that long-term studies are needed to support or refute this hypothesis: “However, support for this hypothesis is predicated on the fitness and reproduction of FP offspring and therefore more long-term studies on seemingly healthy individuals of FP origin are needed.”

      Ln 348 - To be able to provide support for this, you need to track animals long term to understand their reproductive competence, and that of their offspring.

      We have added the text: “To assess whether the co-occurrence of sexual and FP reproduction in vertebrates can indeed be considered a reproductive strategy rather than biological noise will require further studies to assess the reproductive competence and fecundity of offspring produced by either mode of reproduction.”

      Ln 358 - But, the caveat is that the parthenogens must themselves reproduce. This must me stated.

      The statement that parthenogens must be able to reproduce to support a hypothesis of FP as an adaptive trait has been added: “One must now consider the possibility that FP is an adaptive trait and that low rates of successful FP could contribute significantly to genome purification. Such a role for FP hinges on further studies demonstrating the ability of parthenogens to reproduce themselves either through further FP or sexually.”

      Ln 359 - Note that FP can also fix mildly deleterious alleles. Only if it is strongly deleterious will it be lost.

      We now make it clearer that selection only applies to strongly deleterious alleles.

      Ln 361 - See above comments.

      We have modified the text to include that “FP offspring will have low genetic load and only pass on neutral and mildly-deleterious alleles to the next generation.”

      Reviewer #1 (Significance (Required)):

      1. Significance:

      While reports of parthenogenesis have been reported as far back as the early 1900's, it has only been over the last decade that reports are become common. Such that facultative parthenogenesis is no longer considered a rarity, but is recognized now as being relatively common and phylogenetically widespread in species that lack genomic imprinting - particularly reptiles, birds, and sharks. Reasons for this are both an increased understanding that the trait can occur, hence recognizing it as an alternative mechanism to long-term sperm storage, and the ease of using molecular approaches.

      The fundamental questions of recent times have been understanding the mechanisms driving FP. Recent papers utilizing whole genome sequencing and ddRADseq have provided support for terminal fusion automixis in snakes and sharks. Here, this study provides evidence of gametic duplication in whiptails, a mechanism with an alternative outcome in regards to the levels of retained heterozygosity. As such, this study compares to the recent work of Card et al. 2021 (Scientific Reports), and Booth et al. 2023 (Biology Letters), in providing substantive advances in the field.

      The audience for this will be broad. Parthenogenesis is a fascinating topic that attracts significant media attention. See the Altmetric score of recent papers on the topic, particularly Booth et al. 2023 (Altmetric score - ~3100). As such, the study will be of interest to both a broad readership, but will also be of great significance to a specialized group working on parthenogenesis. All round, an excellent paper that has promise to advance the field.

      We thank reviewer 1 for this positive assessment and for putting our work into context.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The researchers bring together microsatellite and whole-genome sequencing data from long-term laboratory cultures of lizards to discover occasional production of parthenogenetic offspring by several species of otherwise sexually producing whiptail lizards ("facultative parthenogenesis, "FP") and to show that these FP-produced lizards have patterns of genomic homozygosity that are incompatible with currently held assumptions about mechanisms of FP. Instead, the FP lizards seem to have been produced by a mechanism that results in almost complete homozygosity, likely a consequence of post-meiotic duplication of genomes from haploid unfertilized oocytes. They also show that FP offspring were produced by females housed with males and along with sexually produced offspring, counter to prevailing assumptions that FP offspring are only produced in situations where mates are not available. Many of the FP-produced offspring did not survive to hatching or had major abnormalities, consistent with a situation where this high homozygosity exposes harmful alleles. Finally, the authors used reduced-representation sequencing (RAD-seq) to survey heterozygosity in 321 wild-collected whiptail lizards from 15 species, showing evidence for strikingly low homozygosity in at least one individual and perhaps up to 5, consistent with the potential for FP in nature. These data are of broad interest in demonstrating several exciting new possibilities. Most importantly, the data hint at a different mechanism of FP than previously assumed, and one that causes immediate near-complete homozygosity. This scenario would likely lead to immediate purging of harmful recessive alleles. If the selective load of this purging wasn't insurmountably high, a lineage with a history of purging could produce FP offspring of relatively high fitness. Other exciting possibilities suggested by the data include the existence of FP even in a setting where mating occurs and in natural populations, versus just captivity.

      Major Comments:

      I found it difficult to impossible to sort out exactly what the researchers did and with what lizards. For example, in line 107, they refer to a "systematic MS analysis" for all individuals of gonochoristic species in their laboratory, but where are these data? Indeed, at this early spot in the paper, the introduction from here on out suddenly reads like a discussion. What would be better here would be to summarize what was known and wasn't known about the system and questions involved, why gaps in knowledge were important, and what the researchers actually did for this paper. In my opinion, the paper would be a much easier read if the researchers left the results and interpretation for later in the paper.

      As a consequence of the reviewers’ comments, the text of the manuscript has undergone major revision, and we trust that reviewer 2 will find this new version far more accessible. The MS data collection of more than 1000 individuals is the subject of another ongoing study and was only mentioned peripherally here to put the identification of FP into context. As most of the MS data relates to gonochoristic reproduction and interspecific hybridization, we are only presenting the data that are directly relevant to this manuscript as part of this study. To our knowledge, there is no common repository to upload raw MS data, but we have provided the data for the FP animals and controls discussed in this paper in the Github repository (see section “Data availability”).

      Even with this suggested fix, however, the data are still too inaccessible and analyses too opaque. For example, in line 202, a critical definition is laid out regarding heterozygous sites as those having "equal support" for two alleles. What do the researchers mean by "equal support"? My presumption is that this is something about equal or close to equal numbers of reads, but this definition needs to be spelled out and justified because it underpins much of the downstream analyses. A similar problem occurs in line 208-209, where the authors make a statement about limiting further analysis to positions in the genome where the coverage is "equal" to the mean sequencing depth.

      We have changed the text to “we defined heterozygous sites as those having two alleles supported by an equal number of reads. This stringent requirement was chosen to limit the search to apparent heterozygous sites with strong support, decreasing the chance of false positives.”. We further look at only sites where the coverage is equal to the average sequencing depth to exclude regions where over-assembly and collapse of repetitive elements would artificially increase the coverage.

      Another data/analysis issue emerges with the components of the manuscript that deal with mixoploidy. As far as I can tell, these data come from one sexually produced lizard, one FP A. marmoratus, and one FP A. arizonae. While the reports of bimodality of nuclear size are certainly interesting, the data and discussion are no more than an anecdotal case study in the absence of careful replication across multiple FP lizards and comparison to sexually produced lizards. Without these data, the conclusion that “Animals produced by facultative parthenogenesis are characterized by mixoploidy” (Figure 4 caption; also see lines 324-331) is far too strong.

      We have added animal IDs to figure legends 4 and S10 to clarify that these erythrocyte staining come from two FP A. marmoratus, and one FP A. arizonae. In addition, imaging from two sexually produced control animals (1 A. marmoratus and 1 A. arizonae) have now been included in S10 (as S10B and S10D). We also have included an extra panel of flow cytometry data (new Figure 4C) as a complementary methodology for ploidy determination. Both imaging and flow cytometry support similar amounts of haploid cells. With the additional data and clarification, we hope that the reviewer agrees that the observations of mixoploidy are well beyond “anecdotal”. Nevertheless, we have changed the title for Figure 4 to “Detection of mixoploidy associated with facultative parthenogenesis.” We hope that our observations here will indeed inspire future studies to see if mixoploidy is a widespread phenomenon in FP outside of whiptails as indicated by earlier work in birds.

      I had a similar reaction to the discussion of developmental abnormalities and embryonic lethality of embryos of FP origin presented in lines 263-281 (also lines 307-309). What is the baseline level of such abnormalities and the frequency of lethality in sexually produced eggs/embryos/hatchlings, and especially those produced via inbreeding? These comparisons are needed to interpret the significance of the patterns observed in the FP eggs/embryos/hatchings. Analogously, the comparison of the ovaries and germinal vesicles from one FP individual relative to one sexual individual do not tell us anything nearly so definitive as the text in lines 279-281 (also see Fig. S12 title, which is too broad of a conclusion for N = 1). This overly ambitious conclusion also underpins the discussion regarding the potentially adaptive nature of FP with respect to genome purification (lines 341-363; also see lines 47-50). If FP does not actually increase the rate of purging in FP lizards relative to inbred sexual counterparts (sounds like inbreeding is common from line 339), it seems less likely that we can view FP as adaptive at least from this perspective.

      We have now included a comparison between defects seen in sexually produced animals vs FP animals: “six out of 16 FP animals (37.5%) hatched with no discernable developmental defects (Fig. S11A-B). This is in stark contrast to sexually produced animals, where over 98% of hatchlings showed no abnormalities. Additionally, most of the defects noted in sexually produced animals were less severe than in FP animals including bulges in tails or truncated digits.”

      We agree that our statement on the lack of differences between sexually produced and FP animals was too general. We have modified the title of Fig. S12 from “No differences between ovaries and germinal vesicles of Aspidoscelis marmoratus produced by facultative parthenogenesis or fertilization” to "Ovaries of Aspidoscelis marmoratus FP animal 8450 and germinal vesicles of FP sister 8449 revealed no differences in structure and anatomy compared to fertile sexually reproducing animals.” Due to instant complete homozygosity, FP would indeed have a higher rate of purging than inbreeding. While one hypothesis is that FP is adaptive (in large enough populations), our intentions were to highlight the alternative that FP could be detrimental in smaller populations (that already would likely experience high inbreeding rates). We would expect inbreeding to not be common in whiptails relative to other lizards given that they tend to have large population sizes and actively range across generalist habitats.

      A final data concern is with the use of liver tissue for whole-genome sequencing and reference genome assembly (lines 389-390) and then using these data and the reference genome to make conclusions about ploidy/coverage. Liver tissue is very commonly endopolyploid, meaning that coverage could be artificially high for animals for which liver (vs. tail) tissue was used for DNA extraction. In particular, it would be helpful if the researchers consider whether endopolyploidy could have affected their ability to make accurate estimation of coverage and thus, heterozygosity, when libraries generated from diploid (tail) tissues are aligned to a reference genome generated from a polyploid tissue as was done here.

      This is an interesting point and indeed hepatic cells in various organisms have been documented to be polyploid. The proportion of polyploid cells though vary and as far as we are aware, all published studies on polyploid hepatocytes are in mammals (DOI: 10.1016/j.tcb.2013.06.002). Reference genomes have been generated from a variety of tissue sources and liver is commonly used. As most assemblies are for haploid genomes, polyploidy (unlike aneuploidy) does not impact the assembly quality. The reference genome was also from an animal of FP origin and therefore has genome-wide homozygosity that aids in a more contiguous genome assembly by eliminating the phasing problem. For the 10 animals sequenced, genomic DNA was derived from liver for three animals and the rest from tail tissue. The sequencing data generated from either liver or tail resulted in similar coverage levels (Figure S6) and similar levels of heterozygosity (Figure 2A). Minor Comments:

      Line 410: Please explain why the BLAST cutoff was changed from the default.

      The BLAST cutoff was changed from the default 1e-03 to 1e-06 to be more stringent and thereby increase confidence in the BUSCO results.

      Lines 441-443: Please explain why this dataset was seemingly larger than expected.

      Animal 122 was sequenced on one flow cell without any multiplexing with other samples and therefore yielded more reads than other animals sequenced. We subsampled the reads from this animal for analysis, so it is directly comparable with the other WGS data.

      Line 510: The link to the Github repository was broken, so I was unable to access the code and data denoted as available here.

      We apologize for the unavailability of the link at the time of review. Review Commons did not request a reviewer token. The repository will be made public upon journal acceptance. We would be happy to provide a reviewer token in the meantime upon request by Review Commons.

      Figure 1, and other figures featuring comparisons of MS data across parents and offspring: The authors need to engage here with the alleles that do not match either parent here (e.g., allele 282 at MS7), explaining the likelihood that these alleles indeed represent a binning error (or, perhaps, stepwise mutation from parental allele), and these alleles should be flagged. Instead, they bin these unique alleles with the most similar parental allele without any explanation or flagged. The authors do bring this point up in Figure S1, but this issue needs to be addressed in the main text (related point: the mix of red/green in MS16 offspring appear more green than red. Is this meant to denote a probability different than 50:50? If not, the authors should adjust the shading so that this shape is half green, half red).

      We have added to the figure legend that single nucleotide differences are most likely binning errors and are therefore not considered “de novo” alleles. Instead, they are assigned it to the most similar parental allele, consistent with Figure S1. The shading at MS16 has been removed so that it is consistent with Figure 3.

      Figure 3: Indicate that white background for alleles means that allelic inheritance is not determinable, or use the mix of colors applied in Fig. 1 to indicate as such. Unique offspring alleles should be flagged rather than just automatically assigned to the most similar parental allele. Finally, it would be helpful if the alleles were presented within loci from the shorter to the longer alleles.

      We have included in the figure legend that non-shaded alleles are those for which multiple potential parents share the same allele and the inheritance therefore remains ambiguous for this locus. Single nucleotide differences are also now addressed, and sizes are ordered from smallest to largest.

      Figure S7. Indicate visually which panels indicate FP animals.

      We have now indicated which animals are FP and included this in Figure S6 as well.

      Fig. S13. The 5 animals that had especially low heterozygosity should be flagged. The title of this figure should be toned down in light of the tentative nature of the conclusions regarding FP in nature: low heterozygosity could instead reflect, for example, a long history of inbreeding. My reaction to the data is also that the % heterozygosity distribution for many of the species looks continuous rather than the bimodality one might expect under FP vs. sexual reproduction.

      Since FP has not been further confirmed in these animals, unlike those examples from our captive colony, there could indeed be other reasons for low heterozygosity. We have changed the title of the figure from “Facultative parthenogenesis in whiptail lizards collected in nature” to the more neutral “Heterozygosity estimates of whiptail lizards collected in nature.” Since there are so relatively few animals, one would not necessarily expect a bimodal distribution to be apparent in the current data. We did show that the animal with the lowest calculated level of heterozygosity (deppii LDOR30) was a statistical outlier when compared to other individuals of the same species though. Since these animals were sampled across different locations and habitats, the effective population sizes would be assumed to be different as well, reflecting the range of heterozygosity estimates seen here. This has been made clear in the text.

      Reviewer #2 (Significance (Required)):

      General assessment: strengths and limitations. The paper's strengths include the combination of data from lab and natural populations, the characterization of an unexpected means of achieving FP, with dramatic genetic consequences, and the data suggesting that this type of FP is fairly common and occurs even in the context of mating.

      Audience: The biological questions of relevance to these discoveries are of broad interest, and the paper is likely to garner some attention from the life sciences community as whole and the popular press.

      Advance: These data fill an important knowledge gap regarding the mechanisms potentially driving FP in vertebrates, how often FP is likely to occur, and its genetic consequences. The discoveries are potentially conceptual/fundamental, though the extent to which they are ground breaking is not clear in the absence of functional characterization of how FP occurs as well as the need for more rigorous comparisons and replication that I outlined above.

      We thank reviewer 2 for summarizing the strengths of this manuscript, pointing out the broad interest and stating that this work fills an important knowledge gap.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The occurrence of facultative parthenogenesis has been described in a number of vertebrate lineages but the underlying cytological mechanism(s) have remained largely speculative due to sparsity of data. Here, Ho & Tormey et al. provide a detailed analysis of facultative parthenogenesis in gonochoristic species of the lizard genus Aspidoscelis. They show that parthenogenesis leads to a complete loss of heterozygosity (LOH) within a single generation. They attribute the LOH to diploidization through duplication of the oocytes haploid genome after completion of meiosis. This mechanism is consistent with their finding of mixoploidy in erythrocytes of asexually produced offspring. Based on LOH the authors additionally show that facultative parthenogenesis in Aspidoscelis is not condition dependent (no developmental switch): it can occur in the presence of males, alongside with sexual reproduction in the same clutch, and both in captivity and the wild. Finally, the authors show that facultative parthenogenesis is associated with developmental aberrations, likely caused by expression of homozygous recessive deleterious mutations.

      Major comments: In my opinion, this study presents a very comprehensive, careful documentation of mechanistic aspects and consequences of facultative parthenogenesis in a vertebrate. The genomic and microsatellite results leave little to no doubt that facultative parthenogenesis has led to complete LOH in Aspidoscelis. I am particularly impressed by the meticulous analysis of genomic coverage to exclude e.g. false positive heterozygosity due to merged paralogs in the assembly. I also follow the authors conclusion that a post-meiotic "gamete duplication"-like mechanism is likely causative for the LOH (and the mixoploidy of erythrocytes; but I am no expert on that). I was wondering if terminal fusion automixis together with a complete absence of recombination would be worth mentioning as an (probably very unlikely) alternative in the discussion. It would be exciting to corroborate the conclusion of diploidization by genome duplication in the future, e.g. via early embryonic DNA stainings to show the duplication "in action" (if that is practically possible)...? As for this manuscript, I suggest emphasizing the indirect nature of the evidence for the mechanism of parthenogenesis a little bit more.

      We thank the reviewer for highlighting the effort that went into the genomic analysis that led us to our conclusions. In terms of terminal fusion without recombination, we argue that this is not an obvious alternative explanation as a large body of work has established that at least one crossover per homologous chromosome pair is required to advance into meiosis I in many organisms (e.g. see https://doi.org/10.3389/fcell.2021.681123) and therefore the absence of recombination would likely not produce the polar bodies necessary for automixis.

      We have added to the text: “In whiptail lizards, we have not been able to examine post-meiotic oocytes as locating the post-meiotic nucleus within a large yolked egg is inherently difficult. The difficulty is compounded by the unpredictability of which eggs will undergo FP development and the need to sacrifice animals to remove eggs.”

      While the genome duplication mechanism we propose is indeed indirect because we are unable to visualize developing FP embryos, the most parsimonious explanation from the whole-genome sequencing analysis is genome duplication because of the lack of heterozygous regions associated with automixis. In the text, we have made sure to state genome-wide homozygosity as the basis for our conclusion.

      I agree that facultative parthenogenesis in the presence of males hints at a baseline rate of parthenogenesis without requiring a developmental switch. However, this makes it difficult to rule out that sperm played a role in activation of embryonal development (gynogenesis; however I am only aware of gynogenesis in fishes and amphibians)... maybe, the authors want to take this up in the discussion. Were the five parthenogenetic individuals for whole genome sequencing actually produced in the presence of males, too?

      FP has been reported to occur in isolated females for other reptile and bird species, suggesting that sperm activation is at least not a general requirement in FP of amniotes. (Watts, et al. 2006, W. W. Olsen, S. J. Marsden 1954). In all cases in this study, the female mothers were housed with conspecific or heterospecific males. While we cannot completely rule out a non-genetic contribution of sperm in these cases, it would seem to be an unlikely explanation in light of the sperm-independent reproduction by obligate parthenogenesis in other species of whiptail lizards (unlike the sperm-dependence of all unisexual reproduction in amphibians and fish). We decided to not include speculation on sperm-dependence in this manuscript as we have no evidence in favor of it, nor is there any evidence for this in the literature relating to other amniotes. In fact, most examples of FP were reported from isolated females, most likely because offspring were not expected in those cases and prompted further analysis as to their origin.

      I agree with the interpretation of the LOH in the RADseq data as a likely case of facultative parthenogenesis in the wild. However, when looking at figure S13 I noticed some bimodal looking distributions (e.g. in A. guttatus). It may be interesting for future studies to look into what factors influence heterozygosity in natural populations of Aspidoscelis (e.g. inbreeding vs parthenogenesis). Could there be different mechanisms of facultative parthenogenesis in different Aspidoscelis species explaining different LOH intensities?

      The continuous nature of the data may reflect natural variation between individuals and collection at various locations with possibly different effective population sizes and levels of hybridization. Low levels of heterozygosity could be indicative of inbreeding or FP in some cases. This is important to note in future studies and we have added this to the manuscript (“Further fieldwork and analysis will be required to assess the level of FP in natural populations of gonochoristic Aspidoscelis species (and other factors that could influence the observed heterozygosity such as population size, levels of hybridization, and inbreeding) …”). While there are different mechanisms of FP in other vertebrate groups, the most parsimonious hypothesis is that within a genus, the mechanism would be the same.

      The manuscript is well written, the introduction nicely explains the significance of the study, the methods are fully appropriate and the results (and supplementary results) displayed comprehensibly and in great detail. The discussion might benefit from going a bit more generally into the occurrence and mechanism of obligate asexuality in Aspidoscelis. One might e.g. speculate on whether the ability for facultative parthenogenesis in gonochoristic species has facilitated the transitions to obligate parthenogenesis in the hybrid lineages and what peculiarities might predispose Aspidoscelis to parthenogenesis (e.g. are centrioles contributed by sperm required?). In addition, I think the occurrence of LOH due to gamete duplication (facultative and obligate) in invertebrates (e.g. due to Wolbachia) is worth mentioning in the discussion: e.g. there is a similar case in facultative asexual Bacillus rossius stick insects, where the early dividing cells are haploid. Some of them diploidize via duplication later and form the embryo.

      Thank you for complimenting each section of the manuscript and referring to it as well-written. Our lab has a long-standing interest in obligate parthenogenesis. While it is interesting that both obligate and facultative parthenogenesis occur alongside each other in this genus, the mechanisms appear to be fundamentally different, and we would like to focus the discussion on FP in a variety of systems and its potential implications in conservation and evolution. Parthenogenesis in general is a fascinating topic for a broad audience and not discussing another form of parthenogenesis (obligate in this case), the focus remains on FP and keeps the manuscript more accessible for non-specialists. We have included the stick insect as another example of diploid restoration through genome duplication in the discussion.

      Minor comments:

      39-41: I am a bit puzzled by the usage of the term "post-meiotic" to contrast the diploidization through duplication with automixis. Wouldn't one consider polar body fusion after completion of meiosis II also post-meiotic? Maybe I am just not aware of how the term is usually used in this context here...

      We use the term “post-meiotic” because the restoration of an entirely homozygous diploid cell can only occur after the completion of both meiotic divisions. It is our understanding that polar body fusion and meiotic restitution after meiosis I or meiosis II are generally considered meiotic mechanisms in the specialized literature, even though polar body fusion would also occur after the meiotic divisions.

      65: isn't that gynogenesis (sperm-dependent parthenogenesis) in the amazon molly?

      While sperm is required for parthenogenesis in the Amazon Molly, it is an all-female species that exclusively reproduces through gynogenesis. In this case, it is considered an example of obligate parthenogenesis rather than FP.

      78: the term "economically viable" may be a bit puzzling for a biologist's audience. "Economically sustainable" could be an alternative.

      This has been changed.

      129: the Arizona male was referred to as ID 4272 above. Here it is ID 4238?

      This has been corrected. The correct ID is 4272.

      218: please define over-assembly (see line 207)

      The definition of “over-assembly” is collapsing paralogous loci into a single representative sequence. This is now explained in the text.

      263-281: please, indicate a hatching rate/ rate of malformations of sexually produced offspring for comparison.

      A comparison has been added: “This is in stark contrast to sexually produced animals, where over 98% of hatchlings had no abnormalities noted.”

      333: in the haploid cells recessive deleterious mutations would be exposed in the hemizygous state but in the diploid cells in the homozygous state.

      The text has been modified to reflect the difference between haploid and diploid cells.

      470: please, provide more detail for the RADseq analyses (variant calling, calculation of heterozygosity etc.)

      We have elaborated on the analysis in the methods.

      Figure 1B: please, mention in the legend that the shown mechanisms are not exhaustive, e.g. first polar body fusion could occur right after meiosis 1 or polar body formation could be skipped completely.

      This has been added.

      Figure 1C: it may be interesting for non-specialists to name the distinctive morphological characters setting apart the three species in the figure legend and highlight them e.g. with arrows in the figure.

      We have now included in the figure legend characteristic color patterns for each species: “(C) Photographs of Aspidoscelis arizonae with characteristic blue ventral coloration (top), A. gularis with light spots in dark fields that separate light stripes on dorsum (middle), and A. marmoratus with light and dark reticulated pattern on dorsum (bottom).” Since the descriptions are specific and apparent, we did not add arrows to the pictures.

      Reviewer #3 (Significance (Required)):

      Significance: The study by Ho & Tormey et al. substantially enhances the understanding of (facultative) asexuality in vertebrates. In particular, while most reports of facultative parthenogenesis in vertebrates have been attributed to a form of automixis, the authors conclusively show an instance of diploidization through genome duplication, a mechanism functionally similar to "gamete duplication". The study is novel, very comprehensive and of interest for a general audience within the field of evolutionary biology.

      We thank reviewer 3 for pointing out that our study substantially enhances the understanding of asexuality in vertebrates, is very comprehensive and of interest for a general audience within

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The occurrence of facultative parthenogenesis has been described in a number of vertebrate lineages but the underlying cytological mechanism(s) have remained largely speculative due to sparsity of data. Here, Ho & Tormey et al. provide a detailed analysis of facultative parthenogenesis in gonochoristic species of the lizard genus Aspidoscelis. They show that parthenogenesis leads to a complete loss of heterozygosity (LOH) within a single generation. They attribute the LOH to diploidization through duplication of the oocytes haploid genome after completion of meiosis. This mechanism is consistent with their finding of mixoploidy in erythrocytes of asexually produced offspring. Based on LOH the authors additionally show that facultative parthenogenesis in Aspidoscelis is not condition dependent (no developmental switch): it can occur in the presence of males, alongside with sexual reproduction in the same clutch, and both in captivity and the wild. Finally, the authors show that facultative parthenogenesis is associated with developmental aberrations, likely caused by expression of homozygous recessive deleterious mutations.

      Major comments:

      • In my opinion, this study presents a very comprehensive, careful documentation of mechanistic aspects and consequences of facultative parthenogenesis in a vertebrate. The genomic and microsatellite results leave little to no doubt that facultative parthenogenesis has led to complete LOH in Aspidoscelis. I am particularly impressed by the meticulous analysis of genomic coverage to exclude e.g. false positive heterozygosity due to merged paralogs in the assembly. I also follow the authors conclusion that a post-meiotic "gamete duplication"-like mechanism is likely causative for the LOH (and the mixoploidy of erythrocytes; but I am no expert on that). I was wondering if terminal fusion automixis together with a complete absence of recombination would be worth mentioning as an (probably very unlikely) alternative in the discussion. It would be exciting to corroborate the conclusion of diploidization by genome duplication in the future, e.g. via early embryonic DNA stainings to show the duplication "in action" (if that is practically possible)...? As for this manuscript, I suggest emphasizing the indirect nature of the evidence for the mechanism of parthenogenesis a little bit more.

      • I agree that facultative parthenogenesis in the presence of males hints at a baseline rate of parthenogenesis without requiring a developmental switch. However, this makes it difficult to rule out that sperm played a role in activation of embryonal development (gynogenesis; however I am only aware of gynogenesis in fishes and amphibians)... maybe, the authors want to take this up in the discussion. Were the five parthenogenetic individuals for whole genome sequencing actually produced in the presence of males, too?

      • I agree with the interpretation of the LOH in the RADseq data as a likely case of facultative parthenogenesis in the wild. However, when looking at figure S13 I noticed some bimodal looking distributions (e.g. in A. guttatus). It may be interesting for future studies to look into what factors influence heterozygosity in natural populations of Aspidoscelis (e.g. inbreeding vs parthenogenesis). Could there be different mechanisms of facultative parthenogenesis in different Aspidoscelis species explaining different LOH intensities?

      • The manuscript is well written, the introduction nicely explains the significance of the study, the methods are fully appropriate and the results (and supplementary results) displayed comprehensibly and in great detail. The discussion might benefit from going a bit more generally into the occurrence and mechanism of obligate asexuality in Aspidoscelis. One might e.g. speculate on whether the ability for facultative parthenogenesis in gonochoristic species has facilitated the transitions to obligate parthenogenesis in the hybrid lineages and what peculiarities might predispose Aspidoscelis to parthenogenesis (e.g. are centrioles contributed by sperm required?). In addition, I think the occurrence of LOH due to gamete duplication (facultative and obligate) in invertebrates (e.g. due to Wolbachia) is worth mentioning in the discussion: e.g. there is a similar case in facultative asexual Bacillus rossius stick insects, where the early dividing cells are haploid. Some of them diploidize via duplication later and form the embryo.

      Minor comments:

      • 39-41: I am a bit puzzled by the usage of the term "post-meiotic" to contrast the diploidization through duplication with automixis. Wouldn't one consider polar body fusion after completion of meiosis II also post-meiotic? Maybe I am just not aware of how the term is usually used in this context here...

      • 65: isn't that gynogenesis (sperm-dependent parthenogenesis) in the amazon molly?

      • 78: the term "economically viable" may be a bit puzzling for a biologist's audience. "Economically sustainable" could be an alternative.

      • 129: the Arizona male was referred to as ID 4272 above. Here it is ID 4238?

      • 218: please define over-assembly (see line 207)

      • 263-281: please, indicate a hatching rate/ rate of malformations of sexually produced offspring for comparison.

      • 333: in the haploid cells recessive deleterious mutations would be exposed in the hemizygous state but in the diploid cells in the homozygous state.

      • 470: please, provide more detail for the RADseq analyses (variant calling, calculation of heterozygosity etc.)

      • Figure 1B: please, mention in the legend that the shown mechanisms are not exhaustive, e.g. first polar body fusion could occur right after meiosis 1 or polar body formation could be skipped completely.

      • Figure 1C: it may be interesting for non-specialists to name the distinctive morphological characters setting apart the three species in the figure legend and highlight them e.g. with arrows in the figure.

      Significance

      Significance: The study by Ho & Tormey et al. substantially enhances the understanding of (facultative) asexuality in vertebrates. In particular, while most reports of facultative parthenogenesis in vertebrates have been attributed to a form of automixis, the authors conclusively show an instance of diploidization through genome duplication, a mechanism functionally similar to "gamete duplication". The study is novel, very comprehensive and of interest for a general audience within the field of evolutionary biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The researchers bring together microsatellite and whole-genome sequencing data from long-term laboratory cultures of lizards to discover occasional production of parthenogenetic offspring by several species of otherwise sexually producing whiptail lizards ("facultative parthenogenesis, "FP") and to show that these FP-produced lizards have patterns of genomic homozygosity that are incompatible with currently held assumptions about mechanisms of FP. Instead, the FP lizards seem to have been produced by a mechanism that results in almost complete homozygosity, likely a consequence of post-meiotic duplication of genomes from haploid unfertilized oocytes. They also show that FP offspring were produced by females housed with males and along with sexually produced offspring, counter to prevailing assumptions that FP offspring are only produced in situations where mates are not available. Many of the FP-produced offspring did not survive to hatching or had major abnormalities, consistent with a situation where this high homozygosity exposes harmful alleles. Finally, the authors used reduced-representation sequencing (RAD-seq) to survey heterozygosity in 321 wild-collected whiptail lizards from 15 species, showing evidence for strikingly low homozygosity in at least one individual and perhaps up to 5, consistent with the potential for FP in nature. These data are of broad interest in demonstrating several exciting new possibilities. Most importantly, the data hint at a different mechanism of FP than previously assumed, and one that causes immediate near-complete homozygosity. This scenario would likely lead to immediate purging of harmful recessive alleles. If the selective load of this purging wasn't insurmountably high, a lineage with a history of purging could produce FP offspring of relatively high fitness. Other exciting possibilities suggested by the data include the existence of FP even in a setting where mating occurs and in natural populations, versus just captivity.

      Major Comments:

      • I found it difficult to impossible to sort out exactly what the researchers did and with what lizards. For example, in line 107, they refer to a "systematic MS analysis" for all individuals of gonochoristic species in their laboratory, but where are these data? Indeed, at this early spot in the paper, the introduction from here on out suddenly reads like a discussion. What would be better here would be to summarize what was known and wasn't known about the system and questions involved, why gaps in knowledge were important, and what the researchers actually did for this paper. In my opinion, the paper would be a much easier read if the researchers left the results and interpretation for later in the paper.

      • Even with this suggested fix, however, the data are still too inaccessible and analyses too opaque. For example, in line 202, a critical definition is laid out regarding heterozygous sites as those having "equal support" for two alleles. What do the researchers mean by "equal support"? My presumption is that this is something about equal or close to equal numbers of reads, but this definition needs to be spelled out and justified because it underpins much of the downstream analyses. A similar problem occurs in line 208-209, where the authors make a statement about limiting further analysis to positions in the genome where the coverage is "equal" to the mean sequencing depth.

      • Another data/analysis issue emerges with the components of the manuscript that deal with mixoploidy. As far as I can tell, these data come from one sexually produced lizard, one FP A. marmoratus, and one FP A. arizonae. While the reports of bimodality of nuclear size are certainly interesting, the data and discussion are no more than an anecdotal case study in the absence of careful replication across multiple FP lizards and comparison to sexually produced lizards. Without these data, the conclusion that "Animals produced by facultative parthenogenesis are characterized by mixoploidy" (Figure 4 caption; also see lines 324-331) is far too strong.

      • I had a similar reaction to the discussion of developmental abnormalities and embryonic lethality of embryos of FP origin presented in lines 263-281 (also lines 307-309). What is the baseline level of such abnormalities and the frequency of lethality in sexually produced eggs/embryos/hatchlings, and especially those produced via inbreeding? These comparisons are needed to interpret the significance of the patterns observed in the FP eggs/embryos/hatchings. Analogously, the comparison of the ovaries and germinal vesicles from one FP individual relative to one sexual individual do not tell us anything nearly so definitive as the text in lines 279-281 (also see Fig. S12 title, which is too broad of a conclusion for N = 1). This overly ambitious conclusion also underpins the discussion regarding the potentially adaptive nature of FP with respect to genome purification (lines 341-363; also see lines 47-50). If FP does not actually increase the rate of purging in FP lizards relative to inbred sexual counterparts (sounds like inbreeding is common from line 339), it seems less likely that we can view FP as adaptive at least from this perspective.

      • A final data concern is with the use of liver tissue for whole-genome sequencing and reference genome assembly (lines 389-390) and then using these data and the reference genome to make conclusions about ploidy/coverage. Liver tissue is very commonly endopolyploid, meaning that coverage could be artificially high for animals for which liver (vs. tail) tissue was used for DNA extraction. In particular, it would be helpful if the researchers consider whether endopolyploidy could have affected their ability to make accurate estimation of coverage and thus, heterozygosity, when libraries generated from diploid (tail) tissues are aligned to a reference genome generated from a polyploid tissue as was done here.

      Minor Comments:

      • Line 410: Please explain why the BLAST cutoff was changed from the default.

      • Lines 441-443: Please explain why this dataset was seemingly larger than expected.

      • Line 510: The link to the Github repository was broken, so I was unable to access the code and data denoted as available here.

      • Figure 1, and other figures featuring comparisons of MS data across parents and offspring: The authors need to engage here with the alleles that do not match either parent here (e.g., allele 282 at MS7), explaining the likelihood that these alleles indeed represent a binning error (or, perhaps, stepwise mutation from parental allele), and these alleles should be flagged. Instead, they bin these unique alleles with the most similar parental allele without any explanation or flagged. The authors do bring this point up in Figure S1, but this issue needs to be addressed in the main text (related point: the mix of red/green in MS16 offspring appear more green than red. Is this meant to denote a probability different than 50:50? If not, the authors should adjust the shading so that this shape is half green, half red).

      • Figure 3: Indicate that white background for alleles means that allelic inheritance is not determinable, or use the mix of colors applied in Fig. 1 to indicate as such. Unique offspring alleles should be flagged rather than just automatically assigned to the most similar parental allele. Finally, it would be helpful if the alleles were presented within loci from the shorter to the longer alleles.

      • Figure S7. Indicate visually which panels indicate FP animals.

      • Fig. S13. The 5 animals that had especially low heterozygosity should be flagged. The title of this figure should be toned down in light of the tentative nature of the conclusions regarding FP in nature: low heterozygosity could instead reflect, for example, a long history of inbreeding. My reaction to the data is also that the % heterozygosity distribution for many of the species looks continuous rather than the bimodality one might expect under FP vs. sexual reproduction.

      Significance

      General assessment: strengths and limitations. The paper's strengths include the combination of data from lab and natural populations, the characterization of an unexpected means of achieving FP, with dramatic genetic consequences, and the data suggesting that this type of FP is fairly common and occurs even in the context of mating.

      Audience: The biological questions of relevance to these discoveries are of broad interest, and the paper is likely to garner some attention from the life sciences community as whole and the popular press.

      Advance: These data fill an important knowledge gap regarding the mechanisms potentially driving FP in vertebrates, how often FP is likely to occur, and its genetic consequences. The discoveries are potentially conceptual/fundamental, though the extent to which they are ground breaking is not clear in the absence of functional characterization of how FP occurs as well as the need for more rigorous comparisons and replication that I outlined above.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Here Ho et al. provide strong molecular evidence for the production of facultatively parthenogenetic whiptail lizards, through a gametic duplication. As evidenced through multiple routes, including microsatellites, WGS, RADseq, and RBC ploidy, and lines of evidence from multiple specimens, this study is timely in furthering our understanding of the mechanisms underlying FP. The fundings are conclusive.

      That said, I have several comments that should be addressed prior to publication. The introduction which addresses FP in other systems fails to cite several key studies that provide strongly molecular support for terminal fusion automixis. Similarly, the study pushes the idea that this is an adaptive trait, however without proving that the parthenogens can themselves reproduce, this isa moot point at this stage.

      That said, my comments are minor. I found this to be an excellent study, well written, comprehensive in methodology, and one that I strongly advocate for publication.

      Major comments: None.

      Minor Comments: Should be addressed.

      • Line 36 - However, data that supports terminal fusion are no longer restricted to microsat data. Studies utilizing RADseq and whole-genome sequencing in snakes and crocodiles have now provided further evidence supporting terminal fusion.

      See: Booth et al. 2023. Discovery of facultative parthenogenesis in a new world crocodile. Biology Letters. 19, 20230129.

      Card et al. 2021. Genome-wide data implicate terminal fusion automixis in king cobra facultative parthenogenesis. Scientific Reports. 11, 1-9

      Allen et al. 2018. Molecular evidence for the first records of facultative parthenogenesis in elapid snakes. R. Soc. Open. Sci. 5, 171901.

      • Ln 42 - Evidence suggesting that isolation from males was not a pre-requisite for FP has previously been reported in snakes.

      See: Booth et al. 2011. Evidence for viable, non-clonal but fatherless Boa constrictors. Biology Letters. 7, 253-256.

      Booth et al. Facultative parthenogenesis discovered in wild vertebrates. Biology Letters. 8, 983-985.

      Booth et al. 2014. New insights on facultative parthenogenesis in pythons. Biol J Linn Soc. 112, 461-468.

      • Ln 48 - Is this really an argument. While an immediate transition to homozygosity will purge some deleterious alleles, given the genome-wide nature of this, there will also conversely have been strong selection for mildly deleterious alleles.

      • Ln 56 - I would recommend the inclusion of both Allen et al. 2018. R. Soc. Open Sci, and Card et al. 2021. Sci Reports, here, as they are members of the elapids, not represented in the other examples.

      • Ln 60 - Recent studies have highlighted the significance of sperm storage in reptiles. For example, Levine et al. 2021. Exceptional long-term sperm storage by a female vertebrate. PLos ONE. 16(6).e0252049, describe the storage of sperm by a female rattlesnake for ~70 months, with two instances of its utilization to produce healthy offspring during that period. Clearly, molecular tools are providing both support for long-term sperm storage, and an understanding of its utilization.

      • Ln 68 - American Crocodile would also be suitable to include here.

      • Ln71 - The problem with this hypothesis is that parthenogens produced through FP tend to have very low viability. For example, Adams et al. 2023. Endangered Species Research, follow a cohort of sharks produced through FP and all survive. Similarly low levels of survival are reported across other systems for which FP was reported. More likely, FP is simply a neutral trait. The mother is not negatively impacted through producing parthenogens and can go on to produce sexual offspring. Few instances report successful reproduction of a parthenogen. See pers. Comm in Card et al. 2021. And Straube et al. 2016.

      • Ln 79 - I doubt that there is a desperate need for this for conservation. However, I think there is a need to simply further our understanding of basic biological function, given that it is not uncommon, and is phylogenetically widespread in species lacking genomic imprinting.

      • Ln 85 - It would be worth citing Card et al. 2021., here given that they used genome-wide ddRAD markers to show support for terminal fusion.

      • Ln 91 - Better citations here are Card et al. 2021. Allen et al. 2018, and Booth et al. 2023, which all utilize either RADseq or WGS.

      • Ln 95 - The conclusion of genome duplication here was supported only by a small number of microsatellite loci. As such, given that terminal fusion has been supported through genome-wide markers in other species of snakes and crocodiles, the conclusion of genome duplication is likely incorrect.

      • Ln 96 - I would strongly disagree with this statement. Allen et al. 2018, Card et al. 2021, Booth et al. 2023, all provide evidence of heterozygous loci and thus support terminal fusion. While no species-specific chromosome level reference genome is available for any of these species, the fact that levels of heterozygosity are below 33% percent supports terminal fusion. Rates over 33% support central fusion, but have not been reported in any vertebrate to date. AS such, I would recommend the removal of this statement.

      • Ln 121 - Recent work in Drosophila mercatorum and D. melanogaster suggest that three genes play a role in the activation of FP in unfertilized eggs. In this case, through the fusion of meiotic products. That said, it is plausible to assume that FP in these lizards has an underlying genomic mechanism that is not related to isolation from males. See Sperling et al. 2023. Current Biology. 33, P3545-P3560.E13.

      • Ln 126 - While these data strongly support FP of the two unusual A. marmoratus appearing offspring, can long term sperm storage be ruled out. Either through captive history or allelic exclusion of other males in the group?

      • Ln 171 - 191 - Given that the topic of this manuscript is the genomic mechanism underlying FP in this species, are these data necessary? These are not discussed later and as such I would recommend that they are moved supplemental material. Otherwise, they simply clutter that manuscript and detract from the key question. Indeed, they are important to show that the genome constructed is of high quality, but online Supp Mat is the place for that here.

      • Ln 296 - Comparable estimates were made for parthenogenetic production in wild populations of two North American pitviper species. See Booth et al. 2012. Biology Letters.

      • Ln 312 - Again, can this really be suggested? Above, the authors state that most FP animals that hatched had congenital defects, and a large number failed to hatch. This does not sound like strong support for generating individuals that counter the effects of population bottlenecks and inbreeding depression. The authors need to take this study further and monitor the long-term viability of the FP individuals that survive.

      • Ln 348 - To be able to provide support for this, you need to track animals long term to understand their reproductive competence, and that of their offspring.

      • Ln 358 - But, the caveat is that the parthenogens must themselves reproduce. This must me stated.

      • Ln 359 - Note that FP can also fix mildly deleterious alleles. Only if it is strongly deleterious will it be lost.

      • Ln 361 - See above comments.

      Significance

      Significance:

      • While reports of parthenogenesis have been reported as far back as the early 1900's, it has only been over the last decade that reports are become common. Such that facultative parthenogenesis is no longer considered a rarity, but is recognized now as being relatively common and phylogenetically widespread in species that lack genomic imprinting - particularly reptiles, birds, and sharks. Reasons for this are both an increased understanding that the trait can occur, hence recognizing it as an alternative mechanism to long-term sperm storage, and the ease of using molecular approaches.

      • The fundamental questions of recent times have been understanding the mechanisms driving FP. Recent papers utilizing whole genome sequencing and ddRADseq have provided support for terminal fusion automixis in snakes and sharks. Here, this study provides evidence of gametic duplication in whiptails, a mechanism with an alternative outcome in regards to the levels of retained heterozygosity. As such, this study compares to the recent work of Card et al. 2021 (Scientific Reports), and Booth et al. 2023 (Biology Letters), in providing substantive advances in the field.

      • The audience for this will be broad. Parthenogenesis is a fascinating topic that attracts significant media attention. See the Altmetric score of recent papers on the topic, particularly Booth et al. 2023 (Altmetric score - ~3100). As such, the study will be of interest to both a broad readership, but will also be of great significance to a specialized group working on parthenogenesis. All round, an excellent paper that has promise to advance the field.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We appreciate the thoughtful comments of the reviewers. We have revised the manuscript according to these comments as detailed below.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Efficient proteostasis in cells demands efficient clearing of damaged or misfolded proteins, and an important pathway involved in such clearance is the ubiquitin-proteasome pathway. In this system, proteins are tagged with ubiquitin to target them for degradation by the 26S proteasome complex. The conventional 26S proteasome complex consists of a core particle (CP or 20S proteasome) and one or two regulatory particles (RP, or 19S proteasome) to form the singly or doubly-capped proteasome, respectively. Proteasome assembly is a well-orchestrated process that requires proper stoichiometry of proteasome subunits and dedicated proteasome assembly chaperones. This is maintained by fine-tuning their transcriptional and translational regulation.

      This manuscript elucidates an important aspect of how the different proteasome components are transcriptionally regulated upon denervation in mouse muscles for timely and efficiently assembling 26S proteasome. The authors present data that point out towards the model whereby a two-phase transcriptional program (early: day 3-7 and late: day 10-14) activates genes encoding proteasome subunits and assembly chaperones to boost an increase in proteasome content. This involves the coordinated functions of two transcription factors, PAX4 and alpha-PAL(Nrf1) which were important for both early and late phase of the transcriptional program. Their roles were not redundant as loss of one transcription factor was sufficient to prevent induction of various proteasome genes in muscle after denervation.

      In summary, the authors report a novel bi-phasic mechanism elevating proteasome production in vivo, which involves the coordinated functions of two transcription factors, PAX4 and alpha-PAL(Nrf1).

      Major points: 1) It is not clear why PAX4 and alpha-PAL(Nrf1) are both fully required for the transcriptional induction of some proteasome genes upon denervation (with good overlap), while only PAX4 is important for increased proteasome assembly. The authors speculate that this could be due to a stoichiometry problem but an alternative scenario where translation is increased upon alpha-PAL(Nrf1) inhibition would also be possible. This would explain why, for example, the induction of PSMC1 gene expression upon denervation is abolished upon alpha-PAL(Nrf1) inhibition (Fig. 5C) while the protein level is still increased (Fig. 6H). Is that also true for PSMD5 and Rpn9? Could it also be that the loss of function of alpha-PAL(Nrf1) is too detrimental for the muscle so that they induce an alternative stress response pathway increasing proteasome subunit translation?

      We thank the reviewer for this comment. To better clarify this important point, we conducted further experiments to examine the differential effects between PAX4 vs. α-PALNRF1 on proteasome assembly chaperons (Fig. S4b). Our new data show that PAX4 promotes the induction of the assembly chaperone, PSMD5 (S5b) at 3 days after denervation (Fig. S4B). This induction is critical for the increase in PSMD5 protein levels because PAX4 knockout results in decreased PSMD5 protein levels at both 3 and 10 days after denervation (Fig. 4K). α-PALNRF1, however, does not affect the mRNA levels of this chaperone (Fig. S4A). This new result strengthens our conclusion that induced expression of assembly chaperones by PAX4 is key to raising proteasome levels after denervation.

      We cannot rule out an indirect effect of α-PALNRF1 knock-down on protein synthesis, and therefore this potential alternative mechanism is now discussed in the text. It appears unlikely, however, that α-PALNRF1 knock-down is too detrimental to muscle as we do not find any evidence phenotypically for any type of stress or abnormalities.

      2) Pax4 controls Rpt1-2 transcription and these two Rpt proteins form a pair. As Rpt4 is also regulated by Pax4, is Rpt5 also controlled by Pax4?

      We believe the reviewer meant to request the data for Rpt4, because the data for Rpt5 was already included in original Fig. 4G-H. Therefore, we repeated the RT-PCR analysis of PAX4 KO mouse muscles for Rpt4 and now show that its induction requires PAX4 at 10 d after denervation, just when proteasome content is increased (Fig. 4G). At 3 d after denervation, Rpt4 induction is probably regulated by other transcription factors because its mRNA levels at this early phase were similar in muscles from WT and PAX4 KO mice (Fig. 4H). These data, strengthen our conclusions that coordinated functions of multiple transcription factors control proteasome gene expression in vivo. In future studies, we will investigate the specific mode of cooperation and mechanisms by which various transcription factors and co-factors collaborate to enhance the expression of proteasome genes in the early and delayed stages of gene expression within a living organism.

      What about the assembly chaperone for these two pairs: PSMD5 and p27? It would be very interesting to know if there is a transcriptional coregulation based on proteasome assembly intermediates.

      The referee raises an important point, which we also discuss in the text. We now present data showing that PAX4 promotes the induction of the assembly chaperon PSMD5 at 3 d after denervation (Fig. S4B), correlating nicely with the observed changes in protein levels of this chaperon (Fig. 4K). The expression of PSMD9 (p27) however, does not require neither PAX4 nor α-PALNRF1 (Fig. S4). Consequently, we conclude that PAX4 promotes proteasome biogenesis by promoting PSMD5 induction, and in the absence of α-PALNRF1 proteasome subunits can still efficiently assemble into the proteasomes (even though their expression is reduced), due to the induced expression and increased action of the assembly chaperone PSMD5. Our data highlight the intricacy in controlling proteasome levels, through transcriptional regulation of proteasome genes and assembly chaperones during muscle atrophy. We now further document and discuss the regulation of proteasome biogenesis by these two transcription factors in the text and Discussion (p.28).

      3) Fig. 4J: PSMD5 and PSMD13 are not tested in Fig. 4A, G and H. This needs to be done if the authors want to draw the parallel mRNA-protein levels, as in their conclusion. Moreover, the protein levels seem to be much more induced than the mRNA levels, could that be due to increased translation? This could be discussed.

      We accepted this thoughtful suggestion and now present the mRNA levels for PSMD5 and PSMD13 in Figs. 4A, G and H and Fig. S4. The new data does not change our conclusion that protein abundance largely correlate with the transcript levels (Figs. 2 and 4K).

      The reviewer raises an important question that we hope to resolve in the future. As we point out in the revised Discussion section, “the substantial rise in protein levels compared to mRNA levels after denervation suggests potential increased protein translation due to PAX4 loss. Whether PAX4 regulates protein synthesis and thus can affect protein levels beyond gene expression are intriguing questions for future research”.

      4) The conclusion is not correct in this sentence: "Moreover, analysis of innervated and 10 d denervated muscle homogenates from WT, alpha-PAL(Nrf1) KD or PAX4/alpha-PAL(Nrf1) KD mice by native gels and immunoblotting or LLVY-cleavage indicated that loss of both transcription factors is necessary to effectively block accumulation of active assembled proteasomes on denervation (Fig. 6H)". This is not correct, as the loss of PAX4 is sufficient to block accumulation of active assembled proteasomes on denervation (Fig. 4K). So, it could just be that alpha-PAL(Nrf1) KD has no effect on the induction of proteasome assembly after denervation and that all the effect of the double mutant is due to PAX4 loss. This needs to be corrected.

      We thank the reviewer for this thoughtful comment. The text has been revised accordingly.

      Minor points:

      1) I would rephrase the sentence "baseline at 14 d after denervation and showed a sustained low mRNA levels until 28 d (Fig. 2A-F).", as the mRNA levels are still significantly higher that the basal levels for most proteasome genes. Same for the sentence: "RNA sequencing (RNA-Seq) analysis of TA muscles at 14 d after denervation indicated that expression of most proteasome genes is low at 14 d (Fig. S1)". Expression is low compared to what and not being induced doesn't mean they are low. This needs to be rephrased.

      We revised the text accordingly and thank the reviewer for these suggestions.

      2) Microscopy images need more explanation: define the green and red channel and what they are used for in the legend.

      The legends have been updated as requested.

      3) Columns have moved from the Table 2.

      The tables have now been submitted as separate files.

      4) Fig. S3: RT-PCR on NRF-1(NFE2L1) need to be performed to see the extent of inhibition by shRNA.

      We thank the reviewer for this important comment. The data, which was added as new Fig. S3A, shows an efficient knockdown of NRF-1NFE2L1 with shNFE2L1.

      5) In the sentence: "PAX4 maintaining subunit stoichiometry for increased proteasome assembly.", could it be due to the much higher levels of PSMB8, 9 and 10 immunoproteasome subunits upon alpha-PAL(Nrf1) KD (Fig. 6F)?

      We addressed this aspect in Major Point #1, regarding the difference between PAX4 and α-PALNRF1; please see our response. As for the Reviewer’s comment concerning Fig. 6F, we think that the increased expression of PSMB 8, 9, and 10 in α-PALNRF1-KD compared to the double KD or PAX4 KO further suggests a distinct cooperative interaction between these transcription factors in promoting proteasome expression, assembly, and function, which we plan to thoroughly investigate in future separate studies. However, the increased expression of PSMB 8, 9, and 10 can affect the composition of the CP (by replacing their normal ounterpart), but not the RP assembly. CP and RP are known to assemble separately with their own dedicated chaperones; RP and CP then associate to complete the assembly of proteasome holoenzyme (RP-CP complex). Thus, it is unlikely that increased CP assembly alone would increase overall RP-CP assembly.

      **Referees cross-commenting**

      All other comments are relevant.

      Reviewer #1 (Significance (Required)):

      Overall, the work is impactful and timely, reporting the participation of a novel transcription factor, alpha-PAL(Nrf1), along with PAX4, in regulating the transcription of proteasome genes and the subsequent assembly of conventional proteasomes in mouse muscle upon denervation. One limitation is that alpha-PAL(Nrf1) kockdown is only inhibiting proteasome genes expression but proteasome assembly, the reason being still unknown. Most of the conclusions drawn in the manuscript are supported by the experimental data. Better understanding how proteasome homeostasis is regulated upon stressful conditions is an important fundamental aspect of proteasome biology. I would support publication of this manuscript providing the more specific concerns listed are addressed.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The main limitation of this study is that is based on a single model of muscle atrophy: that induced by cut of the sciatic nerve. Another one will nicely complement the findings as fasting atrophy or cancer cachexia model, to see if the two phase is recapitulated with regard to proteasome modulation.

      The referee raises an interesting point, but as we explained throughout the manuscript, we did not use denervation in this study as a model for atrophy but rather as an in vivo model system to investigate mechanisms of protein degradation and proteasome homeostasis in a whole organism in vivo. The reason we selected denervation as an in vivo model for accelerated proteolysis is due to the gradual nature of muscle loss, which allows us to dissect the various phases of proteasome homeostasis effectively. Fasting, as an alternative model, is too rapid for addressing the specific questions that we asked in this study. In addition, in the rapid atrophy induced by fasting the primary physiological mechanism to increase protein degradation in vivo is believed to be through post-synthetic modification of proteasomes, rather than the production of new proteasomes (VerPlank et al., 2019). In future separate studies, we will thoroughly investigate whether the mechanisms discovered here are applicable to other types of atrophy (e.g. diabetes, aging, cancer). The obtained results will be published and fully discussed separately, in part because covering all types of atrophy within a single paper is impractical and goes beyond the scope of the current manuscript.

      Another major concern is that the author do not measure over time during denervation atrophy the mRNA and protein content expression of the two transcription factors that they found crucial in the proteasome induction and assembly.

      We agree with the reviewer that time course would strengthen our conclusions that the two transcription factors are important for proteasome gene induction and assembly. We have added these data showing that PAX4 (Fig. 4I) and α-PALNRF-1 (Fig. 6E) both accumulate in the nucleus at 7 d after denervation, just when proteasome content is maximal (Fig. 3A) and protein breakdown is accelerated (Cohen 2009; Volodin 2017; Aweida 2021). The mRNA levels of PAX4 were presented as original Fig. 4F and indicate that PAX4 is induced already at 3 d after denervation. We have added new RT-PCR data for α-PALNRF-1 showing that α-PALNRF-1 is induced at 7 d and 10 d after denervation (Fig. 6D).

      Major and minor concerns are as follows:

      Typos now and then are present all over the text, as holoemzyme shall be replaced with holoenzyme on page 9, on page 12 proteasome is misspelled on mid page, as well as cellls. By cotrast shall be corrected on page 19. References on page 22 shall be formatted.

      We have corrected the typographical errors.

      • reference 29 on page 7 seems out of context together with the sentences it is coupled with.

      The reference is appropriately located within the text in terms of context, and precisely aligns with the sentence to which it is associated. Reference 29 (Boos 2019) describes a cellular state in which all proteasome genes rise simultaneously.

      • muscle electroporation of plasmid shall be replaced by AAV9 injection that causes less inflammation and more expressing fibers

      We do not understand and see no basis for the referee’s assertion that the “muscle electroporation of plasmid shall be replaced by AAV9 injection”. On the contrary, the electroporation methodology is widely used by many labs because of its many advantages. This in vivo gene transfection approach is extremely useful to study transient gene (or shRNA) effects in adult muscles, while avoiding the developmental effects of genes (or shRNA) that are often seen in transgenic or knockout animals (e.g., the inducible knockout of α-PALNRF-1 caused lethality, see Fig. 6B-C).

      In addition, the electroporation technique offers great advantages from its speed and major cost savings. We have been using it routinely in our lab for in vivo studies, and articles using it from many laboratories worldwide have appeared in all major journals, e.g. see our papers in Nature Communications, J Cell Biol, PNAS, EMBO rep, and papers from late Alfred Goldberg (Harvard), Marco Sandri (Padova, Italy), Jeff Brault (Indiana Univ.) and others. In all studies included in this manuscript that involve electroporation, contrary to the reviewer’s impression, there was no damage or inflammation to the muscles, and we routinely examined histological sections. Finally, for our studies, we always use muscles that are at least ~70% transfected, which has proven adequate for observing gene effects in mouse muscle. In each experiment, transfected muscles are always compared and analyzed in parallel to control muscles (transfected with scrambled shLacz control). In fact, the validity of the in vivo electroporation technique is further confirmed herein by our investigations of transgenic inducible knock-down mice, showing similar effects on proteasome gene expression.

      • the shGankyrin data shall be complemented with overexpression of the same chaperone to see the effects of proteasome expression and assembly.

      We understand the reviewer’s concern but do not believe that such an experiment is necessary since it is well known and there is already extensive evidence in the literature showing that the chaperon Gankyrin is essential for proteasome assembly (Kaneko et al. Cell 137, 914–925, May 29, 2009 (DOI 10.1016/j.cell.2009.05.008). Thus, various Gankyrin mutants have often been used as an inactive control for proteasome assembly in vitro and in vivo (Kaneko et al. Cell 137, 914–925, May 29, 2009 (DOI 10.1016/j.cell.2009.05.008). In fact, Gankyrin’s known function in ensuring not only the proper subunit composition, but also proper conformation of the proteasome holoenzyme (Lu et al., Mol Cell. 2017 Jul 20;67(2):322-333.e6).

      • another important transcription factor driving MuRF1 expression is Twist and it is totally ignored in the discussion, please add it.

      We regret this oversight. We did not mean to slight any authors, although our major new discoveries and focus is on proteasome genes and not MuRF1. However, to satisfy the reviewer, we now discuss in the text Twist and other transcription factors (including SMAD2/3, glucocorticoid receptors and NFkB) capable of inducing the major atrophy-related genes (among them MuRF1).

      • WB in Fig 2 shall be complemented by one in the Supp with more replicates per timepoint

      We accepted this thoughtful suggestion and now present blots from additional normal and atrophying denervated mouse muscle samples as new Fig. S1B. This approach, however, does not change any of our conclusions.

      • please justify why only PSMD10 (gankyrin) has been silenced and not any of the others (POMP, PSMD5, PSMD9)

      We silenced PSMD10 (Gankyrin) as a representative RP assembly chaperone, since it is better characterized than the other RP assembly chaperones (PSMD5 and PSMD9). We kept POMP (a CP assembly chaperone) intact. Since the formation of one proteasome holoenzyme (RP2-CP) requires two RPs and one CP, increasing proteasome assembly is expected to be more demanding for RP assembly than CP. This led us to predict that disrupting RP assembly should be sufficient to block the induced proteasome assembly. This prediction is supported by our data (Fig. 3), and this justification was also added to the revised text to enhance clarity.

      The originality is limited by the fact that Pax4 was already shown to have a role in muscle atrophy and drives the expression of p97 by the same authors. I would be curious to see if treatments in vitro know to induce the proteasome as starvation etc acts through the biphase mechanism showed in this paper, to understand how extendable to other kinds of atrophy is.

      We respectfully disagree that the originally of the present findings is limited, because previously we validated a single proteasome subunit (Rpt1) as a target gene for PAX4 (Volodin 2017), and here we discover novel global coordination of proteasome gene expression by multiple transcription factors.

      As we mention above, muscle denervation was used here as an in vivo model system of catabolic conditions. Unlike prior reports that were limited to cultured cells, our studies focus on the physiological setting in vivo to reveal mechanisms of proteasome homeostasis. In any case, regulation of proteasome gene expression by multiple transcription factors in other types of atrophy has not been investigated but is possible because common transcriptional adaptations activate protein breakdown in different types of muscle atrophy, including a coordinated induction of numerous components of the ubiquitin proteasome system (Jagoe 2002; Lecker 2004; Gomes 2001). In future independent research, we intend to investigate if the two-phase mechanism reported here can in fact be generalized to other atrophy (or stress) conditions.

      Reviewer #2 (Significance (Required)):

      The authors Gilda and co-workers made a great attempt to dissect the induction of proteasome activity during denervation muscle atrophy and discovered a two-phase process which involves two transcription factors Pax4 and NRF1. The manuscript is clearly written and the experiments fully delineated.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Using denervated mouse muscle as a model, Gilda et al. demonstrated that a two-step transcriptional program operates in the process of muscle atrophy after denervation and that proteasome expression-induced enhancement of protein degradation is important. Gilda et al. clarified that the transcription factors PAX4 and PAL/NRF-1 act on this proteasome expression induction and that the induction of these transcription factors and the expression induction of the proteasome gene cluster after denervation are necessary for muscle atrophy using an in vivo mouse model. The experiments were logically designed, and the results presented are considered clear and reliable. However, some of the descriptions in the text lack accuracy and courtesy, and some experiments require additional data to support and strengthen the author's claims. In particular, it is unclear whether PAX4, FOXO3, and NRF-1 work together or whether they have distinct functions. Although the authors claim that there are two stages of proteasome expression induction after denervation, this remains unclear. The authors should clarify the differences in target sequence or target genes and the substitutability of each transcription factor.

      Major comments: 1: In Figure 3A, the results of the immunoblot of SDS-PAGE against 20S proteasome subunits should also be shown to confirm the increase in proteasome activity and amount.

      We would like to clarify this aspect. We show the increased levels of proteasome holoenzyme complex (RP2-CP) by immunoblotting of the native gel, rather than SDS-PAGE gel. This is because the blots of the native gel can assess the levels of the actual proteasome complex, not simply subunit levels in their denatured state as in SDS-PAGE; SDS-PAGE cannot distinguish between free subunits and ones that are incorporated into the proteasome.

      If proteasome activity was increased due to some other mechanisms, proteasome levels would remain relatively constant, while proteasome activity would have increased. However, this is not the case here since our data demonstrates that both RP2-CP activities and levels peak at day 7. Furthermore, the in-gel peptidase assay (Fig. 3A panel b) directly tests the 20S CP activity within the proteasome holoenzyme (RP2-CP complex) using the fluorogenic model substrate, LLVY-AMC. The 20S CP is activated for substrate degradation, only upon its association with RP (RP2-CP complex), since RP opens the substrate entry gate of the 20S. Free 20S itself is inactive, as its gate for substrate entry is closed; for this reason, free 20S can be detected, only after its substrate entry gate is artificially opened by SDS (see free 20S in panel b, but not in panel a).

      2: In Figure 3, the reviewer assumed the conflict between the results of peptidase activity and SDS-PAGE in 14d. Therefore, quantification and statistical analysis should be performed on the results of proteasome peptidase activity and immunoblots to clarify the relationships between proteasome activity and amounts. Immunoblotting against ubiquitin is also needed to confirm the requirement and efficiency of proteasome induction.

      As the reviewer pointed out, it might seem discrepant that peptidase activity at 14 d denervation is lower than its peak at 7d (Fig. 3A, panel a), but SDS-PAGE signal for proteasome subunits seems still high (Fig. 3A, panel d, Rpn2). SDS-PAGE detects total cellular content of proteasome subunits (free subunits as well as ones assembled within proteasomes). However, at any given moment, these subunits are not only in the proteasome holoenzyme complex, but also in different assembly intermediates. When proteasome subunits are transcriptionally induced as in this study, proteasome assembly process is also increased. However, proteasome assembly is a multi-step process, and the fold-induction for each specific subunit is different (Fig. 2A-B). This means that the rate of a certain assembly step would be differently affected for a given subunit, depending on their fold-induction. For this reason, some subunits seem to exist at a high level at 14d (e.g. Fig. 3A, panel d, Rpn2), but they are not yet incorporated into the proteasome complex, because they might be still undergoing assembly process.

      As for the ubiquitin blot, it can be a good indicator for proteasome activity, when proteasome activity is decreased than normal. In such situations, ubiquitinated proteins accumulate (i.e. their signals increase as compared to control), due to their deficient degradation. However, our present study pertains to the opposite situation, where proteasome activity is increased in degrading ubiquitinated proteins. In normal cells, ubiquitinated proteins are hardly detectable due to their rapid degradation. Thus, when proteasome activity is greater than normal, ubiquitinated protein levels will be further decreased than normal. Data become unreliable when the signals are below the detection threshold. For this reason, we provided functional readouts involving the number of muscle fibers (for example, Fig. 3D).

      3: In Figure 3C, the sample labels of shGankyrin and shLacZ are repeated. Would it be mislabeled? In addition, NATIVE PAGE immunoblot analysis against Gankyrin and proteasome subunits are needed to prove the knockdown efficiency and to reveal the assembly defect of proteasome by Gankyrin knockdown.

      To present our findings more clearly, we show one of each sample in the revised figure, rather than the duplicates as in the previous figure (Fig. 3C). We also included the immunoblot data to show that Gankyrin knockdown disrupts proteasome assembly, as seen by the reduced proteasome complex activity and level (Fig. 3C, panels a, b, c, lane 3, see RP2-CP). In Gankyrin knockdown samples, proteasome holoenzyme complex exhibited smeary appearance (Fig. 3C, panel c, see bracketed region in lane 3), as opposed to a discrete band in the controls (lanes 1, 2). This smeary appearance reflects more heterogeneous proteasome populations, due to defects in their composition and/or conformation. This is in line with Gankyrin’s known function in ensuring not only the proper subunit composition, but also proper conformation of the proteasome holoenzyme (Lu et al., Mol Cell. 2017 Jul 20;67(2):322-333.e6).

      4: In Figures 4A, 4G, 4H, 4J, and 4K, the results of shPAX4 against innervated muscle should be shown to estimate the contribution of PAX4 in steady-state conditions. To clarify the innervated muscle-specific function of PAX4, histological analysis and quantification of proteasome gene expression in multiple organs in PAX4 KO mice are needed.

      The reviewer raises an interesting point, but as we explained above, we concentrate here on the major new discovery that multiple transcription factors increase proteasome content in a catabolic condition in vivo, correlating directly with the accelerated protein loss. Regulation of the basal levels of proteasome in normal conditions in various types of cells and tissues is certainly an important issue meriting in depth study and will be the subject for future studies, but it is beyond the scope of this lengthy paper. This point is now discussed in the revised text.

      The tissue distribution of PAX4 and the detailed description of the phenotype of KO mice are also needed to understand and evaluate the role of PAX4 in muscle.

      We added the requested data about PAX4 distribution as Fig. 4I. These data shows that PAX4 accumulates in the nucleus already at 3 d after denervation. Furthermore, we are happy to add further information about the knock-out mouse model. The requested information and a detailed description of how PAX4 KO mice were generated were added to the text. The PAX4 KO mice showed no abnormalities and did not appear in any way different from the wild type littermates.

      5: In Figure 4C, immunoblot analysis against PAX4 is essential to confirm the PAX4 protein knockout.

      We agree and representative blots were added to Fig. 4C.

      6: In Figure 5, peptidase activity and immunoblotting in NATIVE PAGE are needed to reveal the contribution of FOXO3 and NRF-1 in denervated muscle as shown in Figure 4.

      The requested data for FOXO3 using FOXO3 dominant negative (as in Fig. 5A-B) were added as new Fig. 5C-D, showing no effect on proteasome content by FOXO3 inhibition. These new data are consistent with our findings that the expression of only two proteasome subunit genes was affected by FOXO3 inhibition at 10 d after denervation (Fig. 5B). The data for α-PALNRF-1 and the effects of its knockdown on proteasome content and activity were shown as original Fig. 6H (now Fig. 6J).

      The expression of FOXO3 and NRF-1 should also be shown by RT-PCR and immunoblotting as shown in Figure 4.

      We thank the reviewer for this thoughtful suggestion, and as requested, we now show representative blots of transfected muscles to support the graphical data (Figs. 5C-F). These data confirm the efficient expression of HA-FOXO3ΔC or FLAG-α-PALNRF-1 dominant negative inhibitors in transfected muscles. It is important to note that these inhibitors are mutant forms designed to interfere with the normal function of the wild-type endogenous FOXO3 or α-PALNRF-1 proteins, without affecting their transcript levels. Given this mechanism, we believe that Western blotting is a more appropriate technique for assessing their impact, as it provides direct insights into protein expression. In the revised main text and methods, we have now clarified this point.

      Similar to previous comments, the expression of the dominant negative form of Foxo3 and NRF-1 should be performed in innervated muscles to reveal the significance and specificity of Foxo3 and NRF-1 function in denervated muscles.

      As mentioned above, regulation of the normal basal levels of proteasomes is certainly an important issue meriting in depth study and will be the subject for future studies, but it is beyond the scope of this lengthy paper, which focuses on the mechanisms increasing protein content in catabolic conditions in vivo. With respect to FOXOs, there is a large literature on its regulation and roles in normal muscle (please see papers by late Alfred L Goldberg, Marco Sandri and others). Under normal conditions FOXO3 is largely inactive via phosphorylation by insulin-PI3K-AKT signaling (Stitt 2004; Latres 2005; Zhao 2007).

      7: In Figure 6D, the list of genes should be served especially about 27 genes and 69 genes that show common features between NRF-1 KD and PAX4 KO.

      The requested data is now presented as new Table 4.

      8: In Figure 6F, the list of genes that change expression in PAX4 and NRF-1 KD mice is needed.

      We agree and the requested data has now been added to table 5.

      9: In Figure 6H, immunoblotting against ubiquitin is needed to evaluate the contribution of proteasome induction to protein degradation.

      We clarified this aspect in the Major Point #2. Please see our response.

      10: This study lacks the detailed mechanisms by which PAX4, Foxo3, and NRF-1 regulate the expression of proteasome genes. The contribution of these transcription factors is revealed by experiments, but the specific sequence that these transcription factors bind and how transcription factors are induced in denervated muscles is not clarified. As shown in the figures, the ChIP assay provides convincing results, but the detailed sequence or map of the promoter region of proteasome genes must be shown in the figures to clarify the target sequences of NFE2L1 and PAX4, FOXO3, and NRF-1. In addition, the luciferase assay would support the results of the ChIP assay.

      Again, the reviewer raises an important question that we plan to resolve in the future. As mentioned, our findings strongly suggest a novel coordinated mechanism involving multiple transcription factors that control proteasome content in catabolic states in vivo. The enclosed revised manuscript primarily focuses on elucidating the contributions of individual transcription factors (α-PALNRF-1, PAX4, NRF-1NFE2L1 and FOXO3) to the induction of proteasome genes, revealing a significant overlap in genes regulated by multiple transcription factors. The specific mode of cooperation among these and other transcription factors and cofactors is certainly an important question for future studies, but it is beyond the scope of this lengthy paper. In the revised text we have now clarified this point (page 27). In addition, we agree that clarifying how the transcription factors are induced in denervated muscles merits some considerations and a paragraph was added to the Discussion (page 26) concerning possible mechanisms. For example, it is possible that the transcription factor STAT3 is involved in PAX4 induction because, based on previous microarray and ChIP data in cultured NIH3T3 cells, PAX4 was identified as a target gene of STAT3 (Snyder et al., 2008), and STAT3 becomes activated after denervation (Madaro et al., 2018).

      We are delighted that the reviewer found the results obtained through the ChIP assay convincing. Given the extensive scope of our investigation and rigorous analyses of dozens of genes, it is not feasible to generate luciferase-encoding plasmids for all of them. However, in response to the reviewer's request, we have carried out predictions of the binding sites of the 4 transcription factors within the minimal promoter regions (300 up- and 1000 down-stream to TSS) of the 64 proteasome sequences. The predicted binding sites are now listed in Table 2A-D. These new data further support our key findings that multiple transcription factors control proteasome gene expression in a catabolic physiological state in vivo.

      11: The results of the loss of transcription factors are well done, but the authors should also try to estimate the effect of overexpression of transcription factors in muscle. If the overexpressed transcription factors cause proteasome induction and muscle fiber mass reduction, these results strongly support the importance of transcription factor-mediated proteasome enhancement.

      We understand the reviewer’s comment but do not believe that such an experiment is necessary to support our key findings about proteasome gene induction by multiple transcription factors in vivo. In fact, we have specifically refrained from pursuing overexpression studies in this context due to the apparent coordination and some potential interdependence between the functions of PAX4 and α-PALNRF-1 transcription factors in inducing proteasome genes. Manipulating one specific gene through overexpression could potentially disrupt this delicate coordination and yield misleading results.

      In addition, there are several limitations of gene overexpression in mouse muscle, as it may not be as efficient and does not represent physiological conditions. Therefore, to validate gene functions in a physiological setting in vivo, we generated transgenic animals with the gene of interest specifically knocked-out or knocked-down. Utilizing transgenic mice lacking the gene of interest, though time-consuming, is a widely accepted and common approach that proves to be the most suitable method for specifically demonstrating the involvement of a particular gene in a physiological process, enabling a targeted and controlled investigation of its role and providing valuable insights into its contribution to the observed effects.

      Minor comments:

      12: The authors should describe the inducible KO mice more carefully and correctly. In the Results section on P12, the description of "whole body Cre+ mice" confuses the readers in understanding the mechanism of inducible Cre-mediated KO.

      We agree and have added the information requested about the KO mice to the main text and a detailed description in the methods section.

      13: In Figures 6B and 6C, the number of mice and the meaning of the asterisk should be described correctly. Is it statistically significant?

      We agree. By accident the number of mice and sign for statistical significance were omitted during processing. The correct sign was added to Fig. 6B-C, and the number of mice used, and the meaning of asterisks were added to the corresponding legend. N=10 mice per condition. **, P

      14: There is no description of Figure 6E in the manuscript. The authors should include it.

      In the original version of this paper, we refer to Fig. 6E in the text on pages 21 and 25. Also, the presented illustration is fully described in the corresponding legend.

      Reviewer #3 (Significance (Required)):

      This paper clarified a novel mechanism of proteasome induction by transcription factors in denervated muscles other than Nrf1 (NFE2L1), which has been shown to contribute to the induction of proteasome gene expression in cultured cells. This is an important paper for expanding the understanding of the field. It is also important because it has demonstrated the potential for new therapeutic targets in diseases such as type 2 diabetes and cancer.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Using denervated mouse muscle as a model, Gilda et al. demonstrated that a two-step transcriptional program operates in the process of muscle atrophy after denervation and that proteasome expression-induced enhancement of protein degradation is important. Gilda et al. clarified that the transcription factors PAX4 and PAL/NRF-1 act on this proteasome expression induction and that the induction of these transcription factors and the expression induction of the proteasome gene cluster after denervation are necessary for muscle atrophy using an in vivo mouse model. The experiments were logically designed, and the results presented are considered clear and reliable. However, some of the descriptions in the text lack accuracy and courtesy, and some experiments require additional data to support and strengthen the author's claims. In particular, it is unclear whether PAX4, FOXO3, and NRF-1 work together or whether they have distinct functions. Although the authors claim that there are two stages of proteasome expression induction after denervation, this remains unclear. The authors should clarify the differences in target sequence or target genes and the substitutability of each transcription factor.

      Major comments:

      1. In Figure 3A, the results of the immunoblot of SDS-PAGE against 20S proteasome subunits should also be shown to confirm the increase in proteasome activity and amount.
      2. In Figure 3, the reviewer assumed the conflict between the results of peptidase activity and SDS-PAGE in 14d. Therefore, quantification and statistical analysis should be performed on the results of proteasome peptidase activity and immunoblots to clarify the relationships between proteasome activity and amounts. Immunoblotting against ubiquitin is also needed to confirm the requirement and efficiency of proteasome induction.
      3. In Figure 3C, the sample labels of shGankyrin and shLacZ are repeated. Would it be mislabeled? In addition, NATIVE PAGE immunoblot analysis against Gankyrin and proteasome subunits are needed to prove the knockdown efficiency and to reveal the assembly defect of proteasome by Gankyrin knockdown.
      4. In Figures 4A, 4G, 4H, 4J, and 4K, the results of shPAX4 against innervated muscle should be shown to estimate the contribution of PAX4 in steady-state conditions. To clarify the innervated muscle-specific function of PAX4, histological analysis and quantification of proteasome gene expression in multiple organs in PAX4 KO mice are needed. The tissue distribution of PAX4 and the detailed description of the phenotype of KO mice are also needed to understand and evaluate the role of PAX4 in muscle.
      5. In Figure 4C, immunoblot analysis against PAX4 is essential to confirm the PAX4 protein knockout.
      6. In Figure 5, peptidase activity and immunoblotting in NATIVE PAGE are needed to reveal the contribution of FOXO3 and NRF-1 in denervated muscle as shown in Figure 4. The expression of FOXO3 and NRF-1 should also be shown by RT-PCR and immunoblotting as shown in Figure 4. Similar to previous comments, the expression of the dominant negative form of Foxo3 and NRF-1 should be performed in innervated muscles to reveal the significance and specificity of Foxo3 and NRF-1 function in denervated muscles.
      7. In Figure 6D, the list of genes should be served especially about 27 genes and 69 genes that show common features between NRF-1 KD and PAX4 KO.
      8. In Figure 6F, the list of genes that change expression in PAX4 and NRF-1 KD mice is needed.
      9. In Figure 6H, immunoblotting against ubiquitin is needed to evaluate the contribution of proteasome induction to protein degradation.
      10. This study lacks the detailed mechanisms by which PAX4, Foxo3, and NRF-1 regulate the expression of proteasome genes. The contribution of these transcription factors is revealed by experiments, but the specific sequence that these transcription factors bind and how transcription factors are induced in denervated muscles is not clarified. As shown in the figures, the ChIP assay provides convincing results, but the detailed sequence or map of the promoter region of proteasome genes must be shown in the figures to clarify the target sequences of NFE2L1 and PAX4, FOXO3, and NRF-1. In addition, the luciferase assay would support the results of the ChIP assay.
      11. The results of the loss of transcription factors are well done, but the authors should also try to estimate the effect of overexpression of transcription factors in muscle. If the overexpressed transcription factors cause proteasome induction and muscle fiber mass reduction, these results strongly support the importance of transcription factor-mediated proteasome enhancement.

      Minor comments:

      1. The authors should describe the inducible KO mice more carefully and correctly. In the Results section on P12, the description of "whole body Cre+ mice" confuses the readers in understanding the mechanism of inducible Cre-mediated KO.
      2. In Figures 6B and 6C, the number of mice and the meaning of the asterisk should be described correctly. Is it statistically significant?
      3. There is no description of Figure 6E in the manuscript. The authors should include it.

      Significance

      This paper clarified a novel mechanism of proteasome induction by transcription factors in denervated muscles other than Nrf1 (NFE2L1), which has been shown to contribute to the induction of proteasome gene expression in cultured cells. This is an important paper for expanding the understanding of the field. It is also important because it has demonstrated the potential for new therapeutic targets in diseases such as type 2 diabetes and cancer.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The main limitation of this study is that is based on a single model of muscle atrophy: that induced by cut of the sciatic nerve. Another one will nicely complement the findings as fasting atrophy or cancer cachexia model, to see if the two phase is recapitulated with regard to proteasome modulation.

      Another major concern is that the author do not measure over time during denervation atrophy the mRNA and protein content expression of the two transcription factors that they found crucial in the proteasome induction and assembly.

      Major and minor concerns are as follows:

      Typos now and then are present all over the text, as holoemzyme shall be replaced with holoenzyme on page 9, on page 12 proteasome is misspelled on mid page, as well as cellls. By cotrast shall be corrected on page 19. References on page 22 shall be formatted.

      • reference 29 on page 7 seems out of context together with the sentences it is coupled with.
      • muscle electroporation of plasmid shall be replaced by AAV9 injection that causes less inflammation and more expressing fibers
      • the shGankyrin data shall be complemented with overexpression of the same chaperone to see the effects of proteasome expression and assembly
      • another important transcription factor driving MuRF1 expression is Twist and it is totally ignored in the discussion, please add it.
      • WB in Fig 2 shall be complemented by one in the Supp with more replicates per timepoint
      • please justify why only PSMD10 (gankyrin) has been silenced and not any of the others (POMP, PSMD5, PSMD9)

      The originality is limited by the fact that Pax4 was already shown to have a role in muscle atrophy and drives the expression of p97 by the same authors. I would be curious to see if treatments in vitro know to induce the proteasome as starvation etc acts through the biphase mechanism showed in this paper, to understand how extendable to other kinds of atrophy is.

      Significance

      The authors Gilda and co-workers made a great attempt to dissect the induction of proteasome activity during denervation muscle atrophy and discovered a two-phase process which involves two transcription factors Pax4 and NRF1. The manuscript is clearly written and the experiments fully delineated.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Efficient proteostasis in cells demands efficient clearing of damaged or misfolded proteins, and an important pathway involved in such clearance is the ubiquitin-proteasome pathway. In this system, proteins are tagged with ubiquitin to target them for degradation by the 26S proteasome complex. The conventional 26S proteasome complex consists of a core particle (CP or 20S proteasome) and one or two regulatory particles (RP, or 19S proteasome) to form the singly or doubly-capped proteasome, respectively. Proteasome assembly is a well-orchestrated process that requires proper stoichiometry of proteasome subunits and dedicated proteasome assembly chaperones. This is maintained by fine-tuning their transcriptional and translational regulation.

      This manuscript elucidates an important aspect of how the different proteasome components are transcriptionally regulated upon denervation in mouse muscles for timely and efficiently assembling 26S proteasome. The authors present data that point out towards the model whereby a two-phase transcriptional program (early: day 3-7 and late: day 10-14) activates genes encoding proteasome subunits and assembly chaperones to boost an increase in proteasome content. This involves the coordinated functions of two transcription factors, PAX4 and alpha-PAL(Nrf1) which were important for both early and late phase of the transcriptional program. Their roles were not redundant as loss of one transcription factor was sufficient to prevent induction of various proteasome genes in muscle after denervation.

      In summary, the authors report a novel bi-phasic mechanism elevating proteasome production in vivo, which involves the coordinated functions of two transcription factors, PAX4 and alpha-PAL(Nrf1).

      Major points:

      1. It is not clear why PAX4 and alpha-PAL(Nrf1) are both fully required for the transcriptional induction of some proteasome genes upon denervation (with good overlap), while only PAX4 is important for increased proteasome assembly. The authors speculate that this could be due to a stoichiometry problem but an alternative scenario where translation is increased upon alpha-PAL(Nrf1) inhibition would also be possible. This would explain why, for example, the induction of PSMC1 gene expression upon denervation is abolished upon alpha-PAL(Nrf1) inhibition (Fig. 5C) while the protein level is still increased (Fig. 6H). Is that also true for PSMD5 and Rpn9? Could it also be that the loss of function of alpha-PAL(Nrf1) is too detrimental for the muscle so that they induce an alternative stress response pathway increasing proteasome subunit translation?
      2. Pax4 controls Rpt1-2 transcription and these two Rpt proteins form a pair. As Rpt4 is also regulated by Pax4, is Rpt5 also controlled by Pax4? What about the assembly chaperone for these two pairs: PSMD5 and p27? It would be very interesting to know if there is a transcriptional coregulation based on proteasome assembly intermediates.
      3. Fig. 4J: PSMD5 and PSMD13 are not tested in Fig. 4A, G and H. This needs to be done if the authors want to draw the parallel mRNA-protein levels, as in their conclusion. Moreover, the protein levels seem to be much more induced than the mRNA levels, could that be due to increased translation? This could be discussed.
      4. The conclusion is not correct in this sentence: "Moreover, analysis of innervated and 10 d denervated muscle homogenates from WT, alpha-PAL(Nrf1) KD or PAX4/alpha-PAL(Nrf1) KD mice by native gels and immunoblotting or LLVY-cleavage indicated that loss of both transcription factors is necessary to effectively block accumulation of active assembled proteasomes on denervation (Fig. 6H)". This is not correct, as the loss of PAX4 is sufficient to block accumulation of active assembled proteasomes on denervation (Fig. 4K). So, it could just be that alpha-PAL(Nrf1) KD has no effect on the induction of proteasome assembly after denervation and that all the effect of the double mutant is due to PAX4 loss. This needs to be corrected.

      Minor points:

      1. I would rephrase the sentence "baseline at 14 d after denervation and showed a sustained low mRNA levels until 28 d (Fig. 2A-F).", as the mRNA levels are still significantly higher that the basal levels for most proteasome genes. Same for the sentence: "RNA sequencing (RNA-Seq) analysis of TA muscles at 14 d after denervation indicated that expression of most proteasome genes is low at 14 d (Fig. S1)". Expression is low compared to what and not being induced doesn't mean they are low. This needs to be rephrased.
      2. Microscopy images need more explanation: define the green and red channel and what they are used for in the legend.
      3. Columns have moved from the Table 2.
      4. Fig. S3: RT-PCR on NRF-1(NFE2L1) need to be performed to see the extent of inhibition by shRNA.
      5. In the sentence: "PAX4 maintaining subunit stoichiometry for increased proteasome assembly.", could it be due to the much higher levels of PSMB8, 9 and 10 immunoproteasome subunits upon alpha-PAL(Nrf1) KD (Fig. 6F)?

      Referees cross-commenting

      All other comments are relevant.

      Significance

      Overall, the work is impactful and timely, reporting the participation of a novel transcription factor, alpha-PAL(Nrf1), along with PAX4, in regulating the transcription of proteasome genes and the subsequent assembly of conventional proteasomes in mouse muscle upon denervation. One limitation is that alpha-PAL(Nrf1) kockdown is only inhibiting proteasome genes expression but proteasome assembly, the reason being still unknown. Most of the conclusions drawn in the manuscript are supported by the experimental data. Better understanding how proteasome homeostasis is regulated upon stressful conditions is an important fundamental aspect of proteasome biology. I would support publication of this manuscript providing the more specific concerns listed are addressed.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-02199 Corresponding author(s): Cornelis, Calkhoven

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      • *

      We would like to thank the reviewers for their comments that will help to improve the manuscript and the editor(s) for taking care of the process.

      Text revisions and corrections in the manuscript are in red font.

      To keep oversight, see below the distribution of the comments and our replies per category:

      Reviewer 1

      Major comment 1: see 2. planned revisions

      Major comment 2: see 3. already incorporated

      Major comment 3: see 2. planned revisions

      Major comment 4: see 4. prefer not to carry out

      Minor comment 1: see 3. already incorporated

      Minor comment 2: see 4. prefer not to carry out

      Reviewer 2

      Major comment 1: see 4. prefer not to carry out

      Major comment 2: see 4. prefer not to carry out

      Major comment 3: see 3. already incorporated

      Major comment 4: see 4. prefer not to carry out

      Minor comment 1: see 3. already incorporated

      Minor comment 2: see 3. already incorporated

      Minor comment 3: see 3. already incorporated

      Minor comment 4: see 3. already incorporated

      Minor comment 5: see 3. already incorporated

      Reviewer 3

      Major comment 1: see 3. already incorporated

      Major comment 2: see 3. already incorporated

      Major comment 3: see 4. prefer not to carry out

      Minor comment 1: see 4. prefer not to carry out

      Minor comment 2: see 4. prefer not to carry out

      Significance comments by the three reviewers

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer #1

      *Major comment 1: The reviewer is not convinced that FTO-CEBPB regulation is direct (data Figure 5). *

      Reply: We offer to perform immunoprecipitation of FTO from wt cells to examine interaction with CEBPB mRNA to resolve this. HPRT mRNA will be used as negative control (like in the MeRIP-RT-qPCR experiments in the manuscript).

      Major comment 3: FTO effect on proliferation and migration in less aggressive / normal breast epithelial cells (data Figure 1) (reviewer writes “cancer cells” and “MCF-10A”, but these are untransformed epithelial cells).

      Reply: We offer to perform proliferation and migration assays in MCF-10A wt and FTO-knockdown cells.

      Reviewer #2 – see below.

      Reviewer #3 – see below.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer 1

      Major comment 2: The reviewer asks to discuss the relevance of the findings for m6Am biology.

      Reply: We now discuss this in the discussion session at page 18.

      Here we mention that Sun et al (https://doi.org/10.1038/s41467-021-25105-5) performed m6Am-seq in HEK293T cells and detected no m6Am for C/EBPβ (Supplementary Data 2). These experiments were well controlled using m6A-IP experiments with PCIF1-KO HEK293T cells and compared with m6Am-Exo-seq and miCLIP published data.

      • *

      Minor comment 1: Mistake in Figure 5 legend and beyond.

      Reply: Legend of Figure 5 has been updated (text in red) and data for Supplementary Figure 5 has been added, together showing that FTO-CEBPB regulation takes place in MDA-MB-231, MCF-7 and MEFs.

      Reviewer 2

      Major comment 3: The reviewer noticed duplication of figures in Figure 6A and Supplementary Figure 6A.

      Reply: The data presented in the bar graphs of Figure 6 and Supplementary Figure 6 are from two independent experiments. Unfortunately, the pictures of the cells at the bottom for supplementary Figure 6A were the wrong ones and now are replaced by the proper pictures belonging to this experiment.

      *Minor comment 1: Statistic analysis in several panels is missing. *

      Reply: All significant events are clearly marked by *: p0.05).

      Minor comments 2 and 3: Issues in Figure 2E.

      Reply: The “E” is removed. We have included the cell type information in the panels D (MDA-MB-231) and E (MDA-MB-231 – shFTO1).

      Minor comment 4: Textual description belonging by Figure 2E.

      Reply: We have re-written the text describing the data, at page 6.

      Minor comment 5: Missing explanation about WTAP.

      Reply: We have extended the explanation about WTAP function at page 13.

      Minor comment 5: Layout of images needs improvement.

      Reply: we have to guess here what the reviewer means, but we have included the requested information in panels 2E and D (see under Minor comments 2 and 3).

      For Figure 4 we matched colours for bars representing shFTO1 and shFTO2 throughout the figure.

      In Figure 1, panel D we now label the cells as MDA-MB-231 – shTFO (instead of FTO-kd), in line with labelling throughout the paper.

      In Figures 1D and 2E we now use +FTO (instead on just FTO) for clearness.

      Concerning the reviewers remark under “significance”:

      *From a translational point of view, it is unclear if the study is significant for the future of therapeutic applications since no experiments have been performed on normal breast cancer cells. *

      Reply: We do not understand what the reviewer means with “normal breast cancer cells”, maybe the reviewer means untransformed breast epithelial cells (MCF-10A), or really breast cancer cells derived from patients?

      We now added supplementary data FTO knockdown decreases C/EBPβ-LIP levels in MCF-7 (luminal A type breast cancer) and in MEFs (Supplementary Figure 5A and B). This indicates that the FTO- C/EBPβ regulation is conserved in different cell lines of different origin. Still, FTO- C/EBPβ could play a more prominent role in breast cancer with high expression of FTO correlated with high C/EBPβ-LIP levels, and the possibility to suppress this.

      Reviewer 3

      Major comment 1: Disconnect between findings presented in Figure 1 and Figure 5.

      Reply: We have improved the reasoning at page 12.

      Major comment 2: In vivo phenotype of FTO deficiency.

      Reply: We already discussed the paper by Niu et al (https://doi.org/10.1186/s12943-019-1004-4) in the discussion, page 18.

      We have now added information about FTO knockout and overexpression mice and how the phenotypes of FTO knockout mice and CEBPB uORF deficient mice relate to each other at page 19.

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      Major comment 4: To support the data on the FTO and WTAP regulation of C/EBPβ isoform expression analysis of the correlation in expression between FTO/ C/EBPβ and WTAP/ C/EBPβ could be performed using the cancer cell encyclopedia (CCLE) dependency map portal.

      Reply: The CCLE map uses transcript expression data. Our observation is that mRNA-stability is not affected, but the protein C/EBPβ-LIP/-LAP ratio is. Therefore, CCLE analysis is not informative since it does not provide information on C/EBPβ-LIP/LAP protein ratios.

      *Minor comment 2: For the C/EBPβ-LIP overexpression-mediated rescue of the migration phenotype of figure 6 an additional replicate for shFTO1 may be helpful as only one of the two replicates presented is statistically significant. *

      Reply: Taken together the results presented in the main and supplementary figure show:

      • significant downregulation of migration for all shFTO1 and shFTO2 knockdowns (EV vs. EV-shFTO1 or EV-shFTO2).

      • Significant upregulation of migration in FTO-knockdown cells by ectopic LIP expression in three of the four FTO knockdown samples (shFTO1/2-EV vs. LIP).

      • Significant further upregulation of migration in one of the control (scr) cells by ectopic LIP expression (scr cells EV vs. LIP).

      • Throughout the experiments the observed changes in migration are consistent with shFTO knockdown and LIP expression.

      Reviewer #2 *(Evidence, reproducibility and clarity (Required)): *

      Major comment 1: 1. The authors focused on FTO and its function of RNA demethylation but the m6A-seq is not used at all. It is not likely to find the direct down stream effector of FTO without m6A-seq if you believe demethylation is the key function of FTO.

      Reply: In Figure 5A we show collective m6A sites based on experiments of others by miCLIP, DART-seq, and SRAMP prediction software. In combination with MeRIP-RT-qPCR in response to FTO KD this shows regulation of CEBPB-mRNA m6A modification by FTO, but without knowledge of the exact m6A sites affected. This study aimed to examine the way FTO affects cell proliferation and migration in different breast cancer cell lines. We revealed the FTO-C/EBPβregulation and will further examine the FTO-C/EBPβ interaction as proposed under session 2, planned revisions. Determination of specific m6Am sites in relation to the translational mechanism involved would be subject of a future study.

      *Major comment 2: Related to above concerns, the authors claimed that FTO regulated C/EBPβ-LIP. But how FTO regulated the protein expression is missing. As is known to all, FTO mainly affect RNA m6A. There is a gap between C/EBPβ-LIP protein and the RNA of this gene. *

      Reply: It is true we do not understand the mechanism of how m6A modification affects the differential translation of the CEBPB mRNA in the different protein isoforms. Examining this is a major effort and would take considerable time, which is beyond this study, but would be subject of further study.

      *Major comment 4: The authors claimed that EMT related genes are affected upon FTO knock down, but Figure 2C do not seem to align with the EMT process. Also, if the authors believe EMT process is regulated, another GSEA enrichment panel like Figure 2B should be included. *

      Reply: we have difficulty in understanding this comment. Relevance of the EMT-process is clearly shown by GSEA in panel 2B. GSEA uses a ranked list of gene expression (including all genes in the RNA-seq data set, also the ones not significantly changed in expression) and is thereby a powerful and comprehensive technique to identify relevant pathways since all data in the dataset is used. The GO-term analysis in Figure 2C looks at significantly regulated genes and maps these to GO-terms for biological process (BP), molecular function (MF) or cellular component (CC). Here we used BP, and find many GO-terms involved in ECM/cell membrane/cell adhesion/cell migration (linked to cell migration and thereby EMT since the cells are of epithelial origin). GO-term analysis only requires a list of genes that were found to be significantly regulated (It doesn’t know where your list of genes has been derived from and does not take into account the p-value of a specific gene or the fold change in expression observed between SCR and shFTO cells, in contrast to GSEA). GSEA and GO-term analysis are therefore complementary analyses based on different principles to help indicate what biological processes are changed upon FTO knock-down; their results cannot be directly compared.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Major comment 3: The lack of clear impact of FTO KD on the translatome is surprising. Maybe the reverse experiment (over expression) would lead to more relevant findings. Likewise, use of the mutant FTO could differentiate between the demethylase function of FTO vs other non-enzymatic functions.

      Reply: Although we agree the proposed experiments are interesting, we believe them to be beyond the scope and time of this manuscript – Major point 3.

      Minor comment 1: baseline FTO level in MDA MB231 seems inconsistent (1A vs 1D--controls). It would be useful to see FTO expression across breast cancer cell lines and normal mammary cells to justify choice of cell lines in which experiments were carried out.

      Reply: It is due to unfortunate choice of blots with different exposures: The immunoblot in Figure 1A shows wild-type MDA-MB-231 cells with scr-shRNA with expected expression of FTO, compared to shFTO knockdowns with low expression of FTO. Figure 1D are MDA-MB-231 cells with knockdown of FTO and therefore very low FTO expression, compared to FTO knockdown cells with re-expression of FTO. The exposure was a bit unnecessary low, and we changed the blot with a higher exposure blot.

      Niu et all (https://doi.org/10.1186/s12943-019-1004-4) have analysed breast cancer samples form patients and breast cancer cell lines and showing a general upregulation of FTO in breast cancer compared to healthy tissue or other cancer cell lines, respectively.

      Minor comment 2: lack of data from human specimens.

      Reply: Not available. Very difficult to realize since immunostaining techniques cannot discriminate between C/EBPβ-LAP and -LIP isoforms because LIP shares all possible antibody epitopes with LAP. This means that one needs enough material for western blotting which is difficult to arrange.

      Significance

      Reviewer #1: Despite the interest in the finding that FTO knockdown influences cellular proliferation and migration, the authors do not have enough mechanistic insights on how FTO regulates this process, leaving uncertainty on the study's relevance.

      Reply: We agree, but we feel that performing detailed mechanistic study would take considerable time and therefore should be part of future work.

      *From a translational point of view, it is unclear if the study is significant for the future of therapeutic applications since no experiments have been performed on normal breast cancer cells. *

      Reply same as under section 3: We do not understand what the reviewer means with “normal breast cancer cells”, maybe the reviewer means untransformed breast epithelial cells (MCF-10A), or really breast cancer cells derived from patients?

      We now added supplementary data FTO knockdown decreases C/EBPβ-LIP levels in MCF-7 (luminal A type breast cancer) and in MEFs (Supplementary Figure 5A and B). This indicates that the FTO- C/EBPβ regulation is conserved in different cell lines of different origin. Still, FTO- C/EBPβ could play a more prominent role in breast cancer with high expression of FTO correlated with high C/EBPβ-LIP levels, and the possibility to suppress this.

      *From a molecular point of view, FTO can influence m6A and m6Am. The authors do not mention the relevance of their findings in terms of m6Am biology. *

      Reply: See above under 3. Description revisions already incorporated - Major comment 2.

      *Reviewer #2: The study's significance is somewhat unclear. The initial sections in the main body present primarily negative or statistically insignificant results. While the explanations in later sections are insightful, they may give the impression of an attempt to justify these negative findings, leaving the study's significance somewhat ambiguous. *

      Reply: This raises the issue whether we as a scientific community should present “negative” results. In our opinion this is important since it informs about what is probably not involved (regulation of translation efficiency by FTO) and it leads the way to other hypothesis and explanations of a biological phenomenon. How can – in the reviewers’ words - insightful explanations by labelled as just justifying? They are what they are; insightful and contributing to the topic. Currently the role of FTO in cancer and/or translation regulation is not clear and different views exist (see for example our review https://doi.org/10.1158/0008-5472.CAN-21-3710). In this manuscript we provide evidence that FTO might not be significantly involved in global regulation of mRNA translation efficiency in MDA-MB-231 cells, these results seem quite relevant to share. Furthermore, we provide a possible mechanism that could explain part of the effects observed, and already indicate ourselves future work is needed to further strengthen these findings.

      Reviewer #3: * Well written report on the functions of FTO in breast cancer pointing to an oncogenic role and regulation of transcripts related to EMT. Data are well done and well presented, 2 shRNA transfected cell lines are included in the experiments; methodology is clear.*

      Reply: we are grateful for this assessment and hope the reviewer will find the manuscript further improved after revision.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The strengths are the well written manuscript, robust and well presented data, study of the effects of FTO on both transcriptome and translatome.

      Major weaknesses:

      1. There is a big disconnect between the findings presented in Figures 1-3 and the focus on CEBP in Figure 5. This needs to be reconciled either by demonstrating a link between CEBP and the transcripts found to be dysregulated in Fig 2 or some other way.
      2. There are no in vivo data. Does the phenotype caused by FTO KD lead to an in vivo phenotype?
      3. The lack of clear impact of FTO KD on the translatome is surprising. Maybe the reverse experiment (over expression) would lead to more relevant findings.Likewise, use of the mutant FTO could differentiate between the demethylase function of FTO vs other non-enzymatic functions.

      Minor weaknesses:

      1. baseline FTO level in MDA MB231 seems inconsistent (1A vs 1D--controls). It would be useful to see FTO expression across breast cancer cell lines and normal mammary cells to justify choice of cell lines in which experiments were carried out.
      2. lack of data from human specimens.

      Significance

      Well written report on the functions of FTO in breast cancer pointing to an oncogenic role and regulation of transcripts related to EMT. Data are well done and well presented, 2 shRNA transfected cell lines are included in the experiments; methodology is clear.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors are trying to demonstrate the oncogenic role of FTO in breast cancer. RNA-seq was used to explore the effect of FTO on gene transcription and translation. And it turned out that the protein expression ratio of LIP/LAP is the most prominent effect of FTO on breast cancer development. While the oncogenic roles of FTO in breast cancer cell lines look well supported, several key questions remain to be answered before it can be considered to publish.

      major comment:

      1. The authors focused on FTO and its function of RNA demethylation but the m6A-seq is not used at all. It is not likely to find the direct down stream effector of FTO without m6A-seq if you believe demethylation is the key function of FTO.
      2. Related to above concerns, the authors claimed that FTO regulated C/EBPβ-LIP. But how FTO regulated the protein expression is missing. As is known to all, FTO mainly affect RNA m6A. There is a gap between C/EBPβ-LIP protein and the RNA of this gene.
      3. Data quality is not convincing, the transwell migration assay image in figure 6 and Supplementary Figure 6 is identical, which is unacceptable.
      4. The authors claimed that EMT related genes are affected upon FTO knock down, but Figure 2C do not seem to align with the EMT process. Also, if the authors believe EMT process is regulated, another GSEA enrichment panel like Figure 2B should be included.

      Minor concerns

      1. Statistic analysis in several panels is missing. Normally, every date should include statistic analysis, even its not significant.
      2. Issues in Figure 2E, there is a "E" above the first column, which is not supposed to be there.
      3. Also in Figure 2E, these results conducted in FTO-knocked down cells, but the panel did not show clearly.
      4. In Figure 2E, the textual description does not entirely correspond with the image results, and there is a lack of information about the expression levels of COL12A1, FN1, MMP1, and TNC.
      5. Furthermore, while the article conveys the meaning of WTAP, it lacks a thorough textual explanation. Additionally, the layout of the article's images needs improvement.

      Significance

      The study's significance is somewhat unclear. The initial sections in the main body present primarily negative or statistically insignificant results. While the explanations in later sections are insightful, they may give the impression of an attempt to justify these negative findings, leaving the study's significance somewhat ambiguous.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors investigated the effect of FTO knockdown in triple-negative breast cancer (TNBC) cells. They show that FTO knockdown reduces the proliferation and migration of two breast cancer cell lines using clonogenic and transwell migration assays. Through transcriptome analysis of FTO-depleted cells, they showed a negative enrichment of the epithelial-to-mesenchymal transition (EMT) signature and showed that this decrease is not mediated at a transcript stability level. Through ribosome profiling experiments they demonstrated that the translatome is only mildly affected by FTO depletion and using differential ribosomal codon reading (diricore) analysis showed that FTO knockdown does not cause specific tRNA or amino acid shortages. They show that FTO and WTAP knockdown reciprocally regulate the expression of C/EBPβ isoform. Finally, through rescue experiments, the authors showed that C/EBPβ-LIP overexpression rescues the decreased migration phenotype observed upon FTO knockdown.

      Major comments:

      The authors showed that upon FTO knockdown, the C/EBPβ-LIP protein levels decrease, and conversely, upon WTAP knockdown, the levels of C/EBPβ-LIP protein increase (figure 5b and 5d) and suggest that a reversible WTAP- and FTO-controlled m6A modification of CEBP mRNA alters its translation. Upon FTO knockdown, however, the stability of CEBP mRNA is not affected, and despite a clear trend in the increase of the CEBP transcript m6A levels from MeRIP-RT-qPCR (figure 5E), this is not statistically significant. It remains unclear whether the regulatory effect of FTO on the CEBP mRNA transcript is a direct, m6A-mediated effect or an indirect effect. For this reason, further experiments may help to strengthen this result. It would be helpful to clarify if m6A levels increase in the CEPB mRNA level with more replicates of the MeRIP-RT-qPCR or using other quantitative techniques. Moreover, to clarify whether this is directly mediated by FTO a RIP-qPCR for the FTO protein and CEPB mRNA in wild-type cells could be performed.

      Additionally, the authors never consider the possibility that FTO could regulate CEBP because of a possible m6Am site. Does CEBP have an m6Am site? Would PCIF1 knockdown cause the same effect that FTO knockdown? Also, the authors should consider classifying mRNAs based on the number of m6A sites or m6Am sites and determine if mRNAs with an m6Am site or an m6A site show a difference in their translation or expression levels compared to non-methylated sites.

      It would be interesting to see if the effect of FTO on proliferation and migration impact less aggressive form of cancer or normal breast cancer cells, such as MCF-10A.

      To support the data on the FTO and WTAP regulation of C/EBPβ isoform expression analysis of the correlation in expression between FTO/ C/EBPβ and WTAP/ C/EBPβ could be performed using the cancer cell encyclopedia (CCLE) dependency map portal.

      Minor comments:

      The figure 5 legend is missing the description of the MeRIP-RT-qPCR experiment.

      For the C/EBPβ-LIP overexpression-mediated rescue of the migration phenotype of figure 6 an additional replicate for shFTO1 may be helpful as only one of the two replicates presented is statistically significant.

      Significance

      Despite the interest in the finding that FTO knockdown influences cellular proliferation and migration, the authors do not have enough mechanistic insights on how FTO regulates this process, leaving uncertainty on the study's relevance.

      From a translational point of view, it is unclear if the study is significant for the future of therapeutic applications since no experiments have been performed on normal breast cancer cells.

      From a molecular point of view, FTO can influence m6A and m6Am. The authors do not mention the relevance of their findings in terms of m6Am biology.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We would like to thank all three reviewers for their careful and comprehensive reviews of our manuscript. We have taken on board all the comments and have made appropriate changes to improve the manuscript. The more substantive changes are to the structuring of the text in Introduction section, and to improving the clarity of Figure 2 after reviewers’ comments (we have added extra panels to A, F and G). Other minor changes are individually signposted in each paragraph of the point-by-point response attached below.

      We performed a number of pieces of additional analysis to address reviewer comments. To be as transparent as possible we make these and all other data analyses available in the form of .html files exported by Rmarkdown, hosted at https://joebowness.github.io/YY1-XCI-analysis/.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript uses differentiation of the highly informative inter-specific hybrid mouse ESC to follow features of genes that inactivate slowly. Resistance to silencing is reflected in reduced change in chromatin accessibility and the authors identify YY1 and CTCF as enriched amongst these 'slow' genes. This finding is provocative as these factors have been reported to enrich at both human and mouse escape genes. The authors go on to demonstrate that eviction of YY1 is slowly evicted from the X, and that removal of YY1 increases silencing.

      Minor Comments: Overall, the manuscript's conclusions are well supported; however, the brevity of the presentation in some places made it difficult to follow, and in other places seemed a missed opportunity to more fully examine or present their data.

      1. Introduction is only 2 paragraphs and half of the last is their new findings. First part of results/discussion is then forced to be very introductory. In addition, some discussion of escapees, even if predominantly human, seems warranted in the introduction. There are multiple studies that have tried to identify features enriched at genes that escape inactivation that could be mentioned.

      We have now written the introduction as 3 paragraphs instead of 2. In doing this, we have moved the sentence introducing chromatin accessibility from the results section to the introduction. Additionally, we now discuss the studies that focus on escapees (in mouse XCI) in the second introduction paragraph.

      Variation in silencing rates. 'Comparable rankings' cites multiple studies (oddly previous sentence cites only two) - how concurrent are they? Developing this further (perhaps a supplementary table) would inform whether the genes assessed are ones that routinely behave similarly across different studies/lines; and also serve as a resource for future studies.

      To avoid double-citing, we have made this one sentence and have cited at the end of the sentence 7 studies which describe gene-by-gene variability in rates of silencing. The majority of these studies include comparisons of their categories of fast and slow-silencing gene with previous classifications, and they all conclude that there is substantial concurrence. Some examples:

      • Marks et al, 2015, Table S3,
      • Loda et al, 2017, Figure 5,
      • Barros de Andrade E Sousa et al. 2019, Figure 2
      • Pacini et al. 2021, Figure 6e,i We believe this is sufficient evidence for our claim that these studies report “comparable categories” (“ranking” changed to “categories” as not all studies strictly rank). A comprehensive gene-by-gene comparison table would likely serve only to highlight differences due the various silencing assays/model systems/classification approaches used in the studies. If required, however, we would be willing to include a supplemental table which collates where gene silencing categories are discussed in each publication, and links to any supplemental files which provide full lists of X-linked genes.

      It would be helpful to give insight into informativity of cross - what proportion of ATAC-seq peaks were informative with allelic information (and similarly, what proportion of genes expressed had allelic information?

      Of the 2042 consensus ATAC-seq peaks we defined on ChrX via aggregating macs2 peaks over all time course samples, n = 821 passed our initial criteria for allelic analysis in the iXist-ChrX-Dom model line (ie they are proximal to the Xist locus in ChrX 0-103Mb, overlap SNPs, and contain sufficient allelic reads). A small number of peaks were additionally filtered out during fitting of the exponential decay model, leaving a final ATAC-seq peak set of n = 790 elements (38.6%) which we focus on in this study. We have added this information to the text (first Results paragraph).

      Our collections of ChrX genes amenable to allelic analysis were not redefined for this study. We used lists of genes defined in our previous ChrRNA-seq study (10.1016/j.celrep.2022.110830). In general, allelic analysis of gene expression is not as limited by the frequency of SNPs, because the sequence length of transcripts (including introns, which are a significant fraction of the reads in ChrRNA-seq data) is much greater than for ATAC-seq peaks. Only a few very lowly expressed genes are not amenable to allelic ChrRNA-seq analysis.

      P5: "can be influenced by Xist RNA via a variety of mechanisms" seems like it this sweeping statement could use expansion, or at least a reference. Authors could also clarify that 'distal elements assigned by linear genomic proximity is their definition of nearest gene.

      The statement that “both [chromatin accessibility and gene expression] can be influenced by Xist RNA via a variety of mechanisms” is intentionally broad to support a negative argument that we do not wish to mechanistically over-interpret the observation that Xi chromatin accessibility loss occurs slower than gene silencing. Nonetheless, we have added two references to studies which report mechanisms for how Xist may influence chromatin accessibility; via recruiting PRC1 (Pintacuda et al 2017) or antagonising BRG1 (Jegu et al 2019). That multiple molecular pathways simultaneously contribute towards the effect of Xist RNA on gene silencing is well established in the field (see reviews such as Brockdorff et al 2020, Boeren et al 2021, Loda et al 2022).

      We have clarified in the text that our definition of “distal” is all REs which do not overlap with promoter regions (TSS+/-500bp). We have also made it clearer that our definition of “nearest” gene refers to linear genomic proximity in both the Results and Methods sections.

      Figure S1 - there are 6-8 other regions that fail to become monoallelic - what are they?

      The regions which stand out most by the colour scheme of the heatmap in Figure S1 are those where accessibility increases on Xi, most notably the loci of Firre, Dxz4 and Xist, which are known to have unique features related to the 3D superstructure of the inactive X chromosome. A few other regions which do not become monoallelic harbour classic “escapee” genes. We have now labelled the locations of escapees Ddx3x, Slc25a5 and Eif2s3x in FigS1.

      The other regions noticeable in the heatmap have no obvious features which explain why they fail to become monoallelic. We have highlighted a region containing intragenic peaks within Bcor (a gene which is silenced in iXist-ChrX mESCs), but many other regions are not in the vicinity of genes. Some of the persistently Xi-accessible peaks within these regions contain strong YY1 or CTCF sites, although many others do not.

      It is also possible that some Xi-accessible peaks are artefacts of mismatches between the Castaneous or Domesticus/129Sv strain SNP databases and ground truth iXist-ChrX genome sequence. The number of these cases are small, and if a misannotated SNP is the only SNP present in a single peak, the peak is discarded by our allelic filtering criteria as it will appear monoallelic in uninduced mESCs.

      Is there any correlation between silencing speed and expression (as previously reported)? If yes, then is there also a correlation with YY1 presence - and is this correlation greater than or less than seen on autosomes?

      The data we present here pertaining to gene silencing kinetics is reused from our previous study. In that work we did indeed observe a significant association between silencing rate and initial gene expression levels (10.1016/j.celrep.2022.110830, Supplemental Information Figure S5F), which has also been reported by multiple groups previously.

      To correlate YY1 binding with gene expression levels, we calculated transcripts per million (TPM) for all genes from our genome-wide mRNA-seq data of uninduced iXist-ChrX-Dom cells (GSE185869). It is indeed true that, on average, X-linked genes classified as “direct” YY1-targets in our analysis have higher levels of initial expression (median TPM 70.8, n=64) compared to non-target genes (median TPM 30.7, n=346). Autosomal YY1 targets are also relatively higher expressed (median TPM 29.6, n=1882) than non-YY1 genes (median TPM 8.0, n=9983). Within the list of YY1-targets, there is no additional correlation between quantitative levels of YY1 ChIP enrichment (calculated in this study using BAMscale (Pongor et al, 2020)) and gene expression (R=-0.05, Spearman correlation).

      Therefore, we appreciate that this correlation between YY1-binding and gene expression levels may be a covariate in the correlation we report in this study between YY1-target genes and slow-silencing. This does not invalidate a potential functional role for YY1 in impeding silencing, as it could affect both variables via common or distinct mechanisms. Nevertheless, in an attempt to account for initial expression level as a covariate, we compared the silencing halftimes of YY1-targets versus non-targets within genes grouped by similar expression levels (low, medium and high-expressed genes). YY1-targets have slower halftimes in each comparison, and this difference is highly significant (p=1.9e-05, Wilcoxon test) for the “medium-expressed” gene group. This implies that YY1 contributes towards slower gene silencing kinetics independently of initial gene expression levels. We have added this panel to Fig2 with an associated sentence in the Results section.

      These new analyses are also appended to the documentation of the R scripts used to generate the main figures in this study (Figure2_YY1association.Rmd), which will all be published to Github.

      It is also important to note that this analysis approach is complicated by the methodology we use to classify YY1 target genes. In this study, we define YY1 targets based on the presence of ChIP-seq peaks overlapping the gene promoters, which is reasonable and widely accepted practice when defining targets of transcription factors. However, as briefly discussed in the Methods, in YY1 ChIP-seq data samples with very high signal:noise (eg Fig3), minor peaks of YY1 enrichment can be detected at almost every active promoter. As enrichment at these peaks is typically much less than at peaks with occurrences of the YY1 consensus DNA motif, we hypothesise that these small peaks result from secondary YY1 cofactors enriched at promoters (eg P300, BAF, Mediator) rather than direct sites of binding to DNA/chromatin. Therefore, for annotating genes as “direct” YY1 targets, we chose to use the YY1 peak set defined from lower signal:noise ChIP-seq data in iXist-ChrX produced with the endogenous YY1 Ab. Nevertheless, this behaviour is likely to confound any analysis correlating YY1 ChIP binding with gene expression.

      Figure 2: Have the authors considered using quartiles rather than an arbitrary division into depleted and persistent?

      We primarily chose this binary classification of REs as either Xi-“persistent” or Xi-“depleted” to maximise the numbers of sequences that could be used in each group as input for the HOMER motif enrichment software.

      It is also not trivial to separate REs into quartiles because our “Xi-persistent” classification includes peaks defined as “biallelically accessible in NPCs”, as well as peaks with slow accessibility halftimes. This is explained in both the Results and Methods but we now have edited Fig2A to make it clearer. Instead of quartiles, we have performed an analysis which keeps “biallelically accessible REs” as a separate category and subdivides the remaining peaks into three groups by halftimes (slow, intermediate and fast accessibility loss). The same trends are evident with this four-category approach as with the two-category approach.

      Importantly, our follow-up analyses which confirm the association between YY1 binding and slow Xi accessibility loss (Fig2E) and slow silencing (Fig 2F-H) are independent from categorisations of REs which rely on arbitrary thresholds.

      1. Could simplify secondary labels to solely YY1 and CTCF. D & F do not print in black and white. Overall the mESC versus NPC can be confusing, perhaps mESC (no diff) would be helpful?

      We have simplified the secondary labels in Fig2B and modified the colour scheme of FIg2D and Fig2F as suggested. “mESC” is now modified to “mESC no diff” in Fig2H, FigS2B, Fig3C and Fig3E to reduce the potential for confusion.

      The numbers appear to suggest YY1 is generally enriched on X, but not at promoters?? Is this true?

      The explanation for this is that clear peaks of YY1 ChIP are found at young LINE1 elements in iXist-ChrX mESCs (specifically over L1Md_T subfamilies). These elements are highly enriched (>2-fold) on the mouse X chromosome compared to autosomes (Waterston 2002), and the majority are not promoter-associated. We chose not to include a discussion of YY1 enrichment at repetitive LINE1 elements in this study primarily because of a) issues related to multiple-mapping reads, such as difficulties distinguishing ChrX vs autosomal reads, and b) the absence of strain-specific SNPs within annotated ChrX L1Md_Ts means that none of these elements are amenable to allelic analysis so we cannot compare Xi versus Xa. However, these LINE1 peaks are a significant fraction (262/521) of the numbers of YY1 ChIP-seq peaks in Fig2C.

      For Figure 2f, it might be helpful to show autosomal genes - are Fast depleted or Slow enriched for YY1 relative to autosomes?

      We have calculated these numbers as part of the analysis of gene expression on ChrX and autosomes above. Overall, the fraction of genes defined as YY1-targets is the same on ChrX as on autosomes (~0.16). Accordingly, fast-silencing genes are depleted for YY1 compared to autosomes, whereas slow-silencing genes are enriched for YY1 compared to autosomes. Fig2F is now redesigned to include the total numbers of YY1-target genes on ChrX and autosomes.

      More generally, is YY1 binding on the X lost more slowly than YY1 binding on autosomes, or is the slow loss a feature of YY1. While I agree YY1 could have direct up or down-regulatory roles, Figure S3 could also be reflecting a secondary impact.

      We agree that many of the differentially regulated genes after 52 hours of YY1 degradation could be secondary effects and have added a sentence on this to the relevant paragraph in the text.

      Figure 3, 4 and supplementary - the chromosome cartoon introduces the LOH in iXist, but this needs to be described in text. Describing the reciprocal as a biological replicate seems challenging given this LOH.

      It is true that the reciprocal lines iXist-ChrX-Dom and iXist-ChrX-Cast are not true biological replicates, and we try to avoid referring to them as such. Writing this in the legend of Fig3 was an error which we have corrected. We have now also mentioned the recombination event in the iXist-ChrX-Dom cell line at the point where data from this line is first discussed (paragraph 1 of Results section).

      For the latter parts this work (Figs 3 and 4), we made the conscious decision to proceed with two YY1-FKP12F36V cell lines from different reciprocal iXist-ChrX backgrounds (aF1 in iXist-ChrX-Dom, cC3 in iXist-ChrX-Cast), rather than “biological replicate” clones from either iXist-ChrX-Dom or iXist-ChrX-Cast. Our reasoning was to control against potential confounding effects of strain background on our experiments related to the role of YY1. Although there were some minor differences between the clones, aF1 and cC3 demonstrated essentially equivalent phenotypes in all analyses we performed.

      Could a panel of TFs be used rather than OCT4 which has its own unique properties to emphasize that YY1 is unique?

      This would indeed be worthwhile, and we did consider attempting to perform ChIP-seq for additional TFs other than OCT4 in order to collect more points of comparison for the slow rate of loss of YY1 binding to Xi. However, it is admittedly hard to identify appropriate candidate TFs in mESCs which a) have similar numbers of discrete peaks of binding in promoters and distal elements on ChrX and b) it is possible to reliably perform ChIP-seq for at sufficiently high signal:noise to allow for quantitative allelic analysis.

      We have changed the text to acknowledge that our comparison only to OCT4 limits the scope of the statements we can make about unique properties of YY1 binding.

      Figure 4 - by examining 'late' genes, a change in allelic ratio is observed, but what about escape genes (e.g. Kdm5c, Kdm6a)? Do they now become silent? It would be helpful to have all this data as a supplementary table so people could query their 'favourite' gene.

      YY1 degradation experiments performed for Figure 4 were performed on mESCs without cellular differentiation (YY1-ablated cells do not survive in our mESC to NPC differentiation protocol). In undifferentiated mESCs, silencing of the inactive X does not reach completion, and in fact all X-linked genes are residually expressed at a higher level than in equivalent timepoints of Xist induction with NPC differentiation (see Figure 4D, Bowness et al 2022). We write in the text “slow-silencing genes are residually expressed from Xi” because genes of this category account for the majority of expression under these conditions, and indeed almost all slow genes would all be classed as “escape genes” in this setting by a conventional definition of >10% residual expression from Xi (see also Figure 4D, Bowness et al 2022). Our analysis in Fig4D (of this study) includes all genes, and we share processed .txt files of allelic ratio and allelic fold changes in GEO, so querying the behaviour of a favourite gene would be easy (GSE240680).

      Incidentally, when we do perform NPC differentiation of iXist-ChrX NPC, at late stages very few genes show any expression from Xi (Ddx3x, Slc25a5, Eif2s3x and Kdm5c clearly escape, but even Kdm6a is entirely silenced). Unfortunately, with such a small number of “super” escapees it is hard to make any general conclusions, so in this study we can only make inferences about escape via the transitive property that many “slow-silencing” genes are facultative escapees in other settings without induced Xist overexpression. We now write about this consideration in the introduction and final paragraph of the main text.

      It seems surprising that loss of YY1 has no demonstrative impact on the Xa. Figure S3B suggests that over 1000 genes are significantly impacted - primarily down regulated. How many of those are X-linked? Perhaps they could be colored differently?

      For the broad-brush differential expression testing in FigS3B, we use all the ChrRNA-seq samples (6 x untreated, 6 x dTAG) as “pseudo-replicates”, disregarding any confounding effects related to induced Xist-silencing as effecting untreated and dTAG sample groups equivalently. We did specifically investigate the behaviour of X-linked genes in this volcano plot, however only a very small number of genes were differentially expressed (n=22 X-linked genes appeared significantly downregulated compared to n=4 genes upregulated). This can be seen in our analysis records uploaded to Github.

      Additionally, there is actually a minor effect of YY1 loss on expression of YY1-target genes on Xa. This can be seen in Fig4F, where the median lines of YY1-target boxes lie below the horizontal line of 0-fold change.

      Since XIST+/undifferentiated cells retain YY1, is YY1 binding sensitive to DNAme? Indeed, are X chromosome bound sites in islands that become methylated? Figure S4 shows YY1-targetted X genes in SMCHD1 knockout; can CTCF targets also be shown? While identified in Figure 2, CTCF was not examined the way YY1 was, although it has also been identified in somatic studies of genes that escape X inactivation.

      Binding of YY1 is indeed sensitive to DNA methylation; specifically it is reported to be blocked by CpG methylation (see refs (Kim et al, 2003; Makhlouf et al, 2014; Fang et al, 2019). Thus, crosstalk with the DNA methylation pathways, which deposit de novo CpG island methylation as a late event of XCI (Lock 1987, Gendrel 2012), did appeal to us as a potential mechanism of YY1 “eviction”. However, preliminary analysis we performed to investigate this revealed limited overlap between YY1 binding sites and de novo meythlated CpG islands in the iXist-ChrX model cell line.

      FigS4 presents ATAC-seq data from two iXist-ChrX SmcHD1 KO clonal cell lines, comparing the accessibility loss kinetics between YY1-binding and non-YY1 REs in these cells.

      Although FigS4 in this paper does not show genes, we have previously published ChrRNA-seq data from these SmcHD1 KO lines over a similar Xist induction + NPC differentiation time course (Figure 6, Bowness et al, 2022). A reanalysis of this ChrRNA-seq data by YY1-target vs non-target genes shows a similar trend to the accessibility data, although this is expected from the strong overlap of both “YY1-target” and “SmcHD1-dependent” genes with slow-silencing genes in our model.

      With respect to CTCF, we have performed a similar analysis of this data separating ATAC-seq peaks by CTCF-binding rather than YY1-binding. This shows a similar trend to YY1, but is overall less pronounced, and is now included in our analysis records. We have reported previously that loss of CTCF from many binding sites on Xi requires SmcHD1 (Gdula et al, 2019).

      When the authors use cf. do they simply mean see also, or as wikipedia suggests: "the cited source supports a different claim (proposition) than the one just made, that it is worthwhile to compare the two claims and assess the difference". Perhaps it would be worth spelling out to clarify for the audience.

      We used “cf.” in the text to mean “compare with”, when referring to a plot/observation/piece of data outside of the figure being immediately discussed (either in another study or different section of the paper). We were not aware of the recommendation to only use the cf abbreviation when the two items are intended to be contrasted. We do not believe this to be a universal grammatical convention, but nevertheless have changed incidences of cf. to “see also”.

      Reviewer #1 (Significance (Required)):

      General assessment: An important question in human biology is how much the sex chromosome contributes to sex differences in disease frequency. Genes that escape X inactivation in humans seem to have considerable impact on gene expression genome-wide. While there are not as many genes in mouse that escape inactivation, the use of the mESC cell differentiation approach allows detailed assessment of the timing of silencing during inactivation. The authors utilize an inter-specific cross and it would be interesting to know the limitations of such a system (in terms of informative DHS/genes that are informative).

      Advance: As the authors note, there are multiple studies of similar systems that have revealed differences in the speeds of silencing of genes. However, this is the first study to my knowledge that has then tried to assess timing with gene-specific factors. There are multiple studies in humans comparing escape and subject genes for TFs, but lacking the developmental timing that this study incorporates.

      Audience: While generally applicable to a basic research audience interested in gene regulation, the applicability to human genes that escape inactivation may interest cancer researchers or clinical audiences interested in sex differences.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The authors studied the molecular basis of a variation in the rate of individual gene silencing on the X undergoing inactivation. They took advantage of ATAC-seq to observe the kinetics of chromatin accessibility along the inactive X upon induction of Xist expression in mESCs. They demonstrated a clear correspondence between the decrease in chromatin accessibility and the silencing of nearby genes. Furthermore, they found that persistently accessible regulatory elements and slow-silencing were associated with binding of YY1. YY1 tended to associate longer with genes that required more time to be silenced than those that became silenced fast on the inactive X during XCI. The acute loss of YY1 facilitated silencing of slow genes in a shorter period. They suggest that whether or not the transcription factors stay associated longer is another factor that impacts the variation in the rate of gene silencing on the Xi.

      Reviewer #2 (Significance (Required)):

      It has been suggested that the rate of gene silencing during XCI is varies depending on the distance of individual genes from the Xist locus or the entry site of Xist RNA on the X, as well as their initial expression levels before silencing. This study provides another perspective on this issue. The persistent association of transcription factors during XCI affects the rate of gene silencing. Although the issued addressed here might draw attention from only the limited fields of specialists, their finding advances our understanding of how the efficiency of silencing is controlled during the process of XCI. The experimental data essentially support their conclusion, and the manuscript was easy to follow. However, I still have some comments, which I would like the authors to consider before further consideration.

      Major concerns 1. Based on the results shown in Figure 3E and F, the authors concluded that YY1 was more resistant than other TFs against the eviction from the X upon Xist induction. I am not still convinced with this. YY1 binds DNA via the zinc finger domain, while Oct4 binds DNA via the homeodomain. The difference in the binding module between them might affect their dissociation or the response to Xist RNA-mediated chromatin changes. In addition, given that YY1 has been reported to bind RNA, including Xist, as well, Oct4 might not be a good TF to compare.

      We acknowledge and agree that our singular comparison between YY1 and OCT4 is insufficient to support a general conclusion that YY1 is unique with respect to its binding properties on Xi. This was also alluded to by Reviewer #1 (see 10.), where in response we write about the difficulties of selecting other appropriate/feasible candidate TFs for ChIP-seq in order to widen the comparison beyond OCT4. In consideration of this concern, we have re-phrased our conclusions regarding this point in the text, both at the point where it is first presented (Fig3F) and in the first discussion paragraph.

      Furthermore, the difference in allelic ratio change between YY1 and OCT4 is admittedly not dramatic, and this metric can be influenced somewhat by the properties of the sets of peaks used (which is also why we have not tried to add statistical significance to this comparison in Fig3F). In order to make the comparison with OCT4 (a classic pluripotency factor), we were also limited to using mESC culture without differentiation conditions. It is possible that more pronounced differences between YY1 and other TFs would be observed under conditions where XCI is able to proceed further.

      Even so, we contend that our observation that YY1 binding is lost from the Xi relatively slowly likely stands without a requirement for a comparison with OCT4 or other transcription factors. The decrease in allelic ratio for YY1 ChIP occurs more slowly than overall loss of chromatin accessibility from REs, which is arguably a more general proxy for TF binding, and much slower than kinetics of gene silencing (Fig3D and FigS2C). In addition, no other TF motifs (except CTCF, which has its own unique properties) were found significantly enriched within persistently-accessible REs, which would be an expectation if a different factor had similar properties of late-retained Xi binding as YY1.

      Thus, overall we have tried to write the paper without overstating in isolation the importance of our claim that YY1 binding on Xi is relatively resistant to Xist-mediated inactivation, instead emphasising that it should be considered alongside the other pieces of data in the study.

      I don't think that Kinetics of YY1 eviction upon Xist induction in SmcHD1 KO cells during NSC differentiation fit the phenotype of Smchd1mutant cells. Although their previous study by Bowness et al (2022) showed that Smchd1-KO cells fail to establish complete silencing of SmcHD1-dependnet genes, their silencing still reached rather appreciable levels according to Figure 6 of Bowness et al (2022). This is, in fact, consistent with the idea that XCI initially takes place in the mutant embryos, at least to an extent that does not compromise early postimplantation development. On the other hand, a significant portion of YY1 appears to remain associated with the target genes on both active and inactive X (Figure S4), which I think suggests that the presence of YY1 is compatible with silencing of SmdHD1-dependent genes. This is contradictory to the proposed role of YY1 that sustains the expression of X-linked genes in this context.

      At any given timepoint of XCI, our data sets of gene silencing (ChrRNA-seq) consistently show a more pronounced allelic skew compared to chromatin accessibility (ATAC-seq). This behaviour is discussed in relation to Figure 1 in the text (see Results paragraph 2). We do not wish to overinterpret this quantitative difference because the assays are technically different and accessibility is not linearly correlated with gene expression. With this in consideration, we interpret the ATAC-seq data presented in Figure S4 to be fully consistent with the iXist-ChrX SmcHD1 KO ChrRNA-seq data in Figure 6 of our previous publication ie. a small increase in residual Xi gene expression from SmcHD1 KO NPCs is accompanied by a more appreciable increase in residual Xi chromatin accessibility. In line with this, it would not be contradictory for substantially increased Xi YY1 binding to sustain a quantitively small (but nonetheless meaningful) increase in residual gene expression from Xi.

      Additionally, the context in which we include this SmcHD1 KO ATAC-seq data in the current paper is to hypothesise a potential role for SmcHD1 in contributing towards the eventual removal of YY1 binding from Xi. This hypothesis is essentially based on two observations; 1.) There is substantially more residual YY1 binding to Xi in mESC no diff conditions (Figure 3) and 2.) One difference between no diff and diff conditions is absence of SmcHD1 recruitment in the former (Figure 5 in our previous study). The new SmcHD1 KO ATAC-seq data adds a third observation which supports the hypothesis - that YY1-bound REs are appreciably more accessible from Xi in SmcHD1 KO. However, none of these observations are direct evidence of a link between SmcHD1 and YY1, and more experiments would be required to substantiate this potential mechanism. If confirmed, it would be logically reasonable to suggest a role for YY1 in contributing towards the residual expression of X-linked in the context of SmcHD1 KO, but we do not yet claim this, and a potential link with SmcHD1 KO is not the main focus of the paper.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Bowness and colleagues describe the interesting finding that the transcription factor YY1 is associated with slow silencing genes in induced X-Chromosome Inactivation (XCI). The authors have conducted a comprehensive characterization of X-linked gene silencing and the loss of chromatin accessibility of regulatory elements in induced XCI in ESCs and during NPC differentiation. X-linked gene silencing was classified into four categories, ranging from fast-silenced genes to genes that escape silencing. Motif enrichment analysis of regulatory elements associated with slowly silenced genes identified YY1 as the transcription factor most significantly enriched. The separation of YY1-target and non-target genes confirmed that most genes bound by YY1 indeed exhibit slower silencing kinetics. A comparison of the binding kinetics of YY1 to another transcription factor, OCT4, during XCI revealed that YY1 is evicted more slowly compared to OCT4 on the inactive X, suggesting that slower eviction is a unique property of YY1. Conditional knock-outs of YY1 using protein degradation during induced XCI in mESCs demonstrated that the loss of YY1 at target genes enhances silencing. This supports the hypothesis that YY1 serves as a crucial barrier for slow-silenced genes during XCI. Finally, the authors propose a hypothesis regarding the mechanism of YY1 eviction, suggesting a potential connection to the role of SmcHD1 during XCI.

      The authors provide an in-depth analysis of the role of YY1 in gene silencing kinetics during induced XCI and believe this manuscript should be published if our comments are addressed.

      Major comment:

      Based on the allelic ratio in figure 3C only minor loss of YY1 binding occurs in induced XCI in mESCs on the Xi, while silencing is established properly as shown in figure 4C (left panel, red boxplots). This suggests that YY1 eviction is not necessarily required for these genes to be effectively silenced. Could the authors explain this discrepancy in the data regarding their manuscript conclusions? It seems this is true for XCI happening during differentiation towards NPCs, but not if cells are stuck in the pluripotency stage?

      Whilst indeed substantial, we do not consider the silencing seen for 6-day mESCs in Fig4C to be “established properly”. We refer to our previous publication (Figure 4 Bowness et al., 2022), which shows that silencing at equivalent timepoints under differentiation conditions (d5-d7) is significantly more pronounced (near-“complete”). Indeed, the level of silencing reached by YY1-FKBP mESCs (Xist induced but no dTAG treatment) aligns with the plateau of silencing in undifferentiated mESCs we describe in our previous study (median allelic ratio of approximately 0.1).

      We conclude that YY1 contributes somewhat to sustaining this residual expression in mESCs, because a) substantial YY1 binding remains on Xi at these timepoints in mESCs and b) silencing increases with degradation of YY1 (the latter is more direct evidence). Notably, silencing does not progress to completion (allelic ratio of 0) in the absence of YY1, so we do not claim that YY1 is the only factor sustaining residual Xi gene expression in mESCs.

      We interpret this comment to be a fundamentally similar concern to that raised by Reviewer #2 (2.), but in the context of undifferentiated mESCs rather than SmcHD1 KO. As stated above, we do not think it inherently contradictory for substantially increased Xi YY1 binding to sustain a quantitively small (but nonetheless meaningful) increase in residual gene expression from Xi.

      Minor comments:

      1. In the abstract lines 7-8, the authors state that the experiments were performed in mouse embryonic stem cell lines, but much of the data shown is acquired in NPC differentiations. Please adjust abstract.

      We have adjusted this sentence in the abstract to include that many of the experiments in the paper involved differentiation of iXist-ChrX mESCs.

      The last sentence of the abstract states that YY1 acts as a barrier to silencing but as stated in my major comment, that does seem to be the case in ESC differentiation towards NPCs, but not in ESCs themselves. Please tone down this sentence. Moreover, we do not fully understand where the 'is removed only at late stages' comes from? Is this because of the Smchd1 link? We find this link quite weak with the data presented. We would tone down that last abstract sentence.

      We have toned down the final sentence of the abstract accordingly. We agree that “removed only at late stages” is unsubstantiated since YY1 binding on Xi decreases over the entire time course (albeit slowly). However, we maintain that a connection between YY1 and late stages of the XCI process is reasonable to infer from the various pieces of evidence we provide in the study (egs YY1 is persistently enriched in accessible REs, it is associated with slow-silencing genes, and it remains bound to Xi in undifferentiated mESCs).

      Several comparisons to human XCI have been made in the article. We do agree that there are similarities between mouse and human XCI. However, there is insufficient data that substantiates that these genes are regulated in a similar manner in humans. We believe the comparisons should be removed altogether or attenuated.

      We agree that there is nothing in our data that directly pertains to human XCI. Comparisons to human are only made twice in the paper: Initially in the introduction to make a broad statement that many mechanisms of Xist function are conserved between species, and finally as speculation in the last discussion paragraph. We think it is relevant to acknowledge the parallels between our study, which links YY1 binding with resistance to Xist-silencing in a mouse ESC model, and literature describing a similar association between YY1 and XCI escape in humans.

      At bottom of page 4, the authors say that for any given gene, the allelic ration of accessibility at its promoter decreased more slowly than it silenced and then write Fig 1B. They probably mean S1C? Since 1B only shows 4 genes.

      The phrase “any given” was used colloquially (ie imprecisely), so we have replaced it with “individual”.

      Figure 1B shows the average allelic ratio of multiple clones for genes representing different silencing speeds. Each data point is the average of multiple clones for these representative genes, could the authors show the individual data points or the standard deviation?

      Fig1B predominantly shows the averages of only two replicate time-courses of Xist induction with NPC differentiation using the same parental clonal cell line, iXist-ChrX-Dom, but performed on different dates and passages. We regenerated the panel without merging the replicate data points, but this has little effect on the plot (see the Rmarkdown html file of Figure 1 on Github).

      Figure 1B. Loss of promoter accessibility lags behind loss of chromatin-associated RNA expression for these 4 genes. What about distal REs? Do the allelic ratios for the distal REs more closely follow chromatin-associated RNA expression? Could the authors show this in a supplemental figure?

      We comment from FigS1C on the general trend that accessibility decrease from Xi occurs slower than gene silencing (measured by ChrRNA-seq). We then find in FigS1D that distal elements lose accessibility slightly faster than promoters. Although overall the allelic ratio decrease of distal (non-CTCF) RE accessibility is slightly closer to the trajectory to that of gene silencing, it remains substantially slower (see again the Rmarkdown .html file of Figure 1 on Github).

      An equivalent plot to Fig1B showing distal REs would rely on our simplistic assignment of distal elements to their nearest genes. We believe this is reasonable generalisation for investigating chromosome-wide trends but unlikely to be sufficiently accurate at the level of specific genes.

      Figure 1B: gene silencing trajectory is depicted left while the legend says right. Same for promoter accessibility.

      The legend is now corrected.

      Figure S1A shows only part of the X chromosome. The area downstream of Xist is missing. Is this because the iXist-ChrXDom cell line is missing allelic resolution as shown in figure S2A? Could the authors explain in the figure legend that part of the X-Chromosome is missing?

      We have now included a reference to the recombination event in the iXist-ChrXDom cell line both when we present data from this background in the first paragraph of the Results section, and in the legend of FigS1A.

      Figure 2C shows that 94 TSSs bear a YY1 peak, yet Fig 2F shows 62 are targets of YY1. Is this because the rest are not properly silenced or are escapees?

      Fig2C shows the numbers of ChrX YY1 ATAC-seq peaks which overlap with “promoters” (ie regions +/- 500bp of a TSS). By contrast, Fig2F shows ChrX genes classified as direct YY1-targets for allelic silencing analysis. The discrepancy between these numbers is due to a number of reasons:

      1. It is possible for multiple YY1 peaks to overlap the same promoter (eg one peak overlaps 500bp upstream, a separate peak overlaps 500bp downstream).
      2. The count in Fig2C is not restrictive to one TSS per gene in cases where there are multiple transcript isoforms in the gene annotation, thus multiple YY1 peaks can overlap different promoters for the same gene.
      3. A few genes do not pass our filters for allelic silencing analysis (eg they are too lowly expressed). Some YY1 peaks may overlap these genes. We hope the revised version of Fig2F, which includes numbers of direct YY1 target genes on autosomes and ChrX, makes the distinction between these two numbers clearer.

      Moreover, YY1 has ~4-fold more peaks on the X chromosome on distal elements compared to promoters. Yet figure 2F exclusively shows the proportion of YY1 binding sites on TSSs. Would distal REs show similar proportions for the silencing categories? Could the authors show the differences in a Supplemental figure?

      As discussed in the response to Reviewer #1 (point 8.), a large fraction of distal YY1 peaks on ChrX are at LINE1 elements, which are not amenable to allelic analysis. Excluding these peaks results in a smaller number of distal elements bound by YY1. The application of our filters for allelic analysis reduces the number of distal YY1-bound REs even more, and our assignment of distal REs to their nearest gene is imprecise. For these reasons, we do not think a comparison of genes classified by whether they are putative targets of distal YY1-bound enhancers is informative.

      The authors switch between different model systems in the figures, which makes quite confusing which type of XCI is being discussed. We would like to see clearly stated above all panels which cell culture condition is being studied (mESCs or NPCs).

      We have tried to improve this potential source of confusion by modifying “mESC” to “mESC no diff” in the relevant figure panels (see response to Reviewer #1 comment 7B), and adding “in mESCs without differentiation” to the title of Figure 4.

      In Figure 3E and 3F the authors look at the binding retention of OCT4 during XCI in ESCs. However, it is not clear why the authors choose OCT4. Could the authors explain why specifically OCT4 was chosen for these analyses?

      In our responses to the other reviewers, we discuss the limitations of only having one other TF to compare to YY1. The choice of OCT4 was primarily dictated by our experience and confidence in being able to generate high quality ChIP-seq data of this factor.

      As it was essentially arbitrary for the purposes of this paper, we have added a comment to this effect in the text (“with that of a different arbitrary TF, OCT4”).

      What is the expression level of YY1 in NPCs compared to mESCS? In Supplemental S2A, it seems that YY1 protein levels decrease over time during NPC differentiation. Is part of the increased eviction a result of lower protein levels of YY1? Probably not since you calculate ratios between Xi and Xa. Can you please comment on this?

      We were similarly intrigued by this apparent decrease in YY1 protein levels in NPCs (there is no decrease on the RNA level) and initially considered if it could contribute to the relative.

      In FigS2A specifically, the d18 NPC band is probably just a poor quality sample extraction. Our ChIP-seq data generated from the same sample is similar poor compared to the others (FigS2B). In other YY1-FKBP12F36V clones we derived and characterised by Western (not described further in this study, but will likely be published as raw source data for the cropped blots we show in FigS2A), the apparent difference in YY1 protein levels in NPCs is less pronounced. Although a minor decrease in YY1 protein in NPCs seems to be robust, we do not think it relevant in the context of our analysis of YY1 and XCI, as we almost always use Xa as internal comparison for any observations made about Xi.

      On page 7 the authors state that degrading YY1 does not affect Xist spreading and/or localisation. Indeed, it has been previously shown by other groups that YY1 is required for Xist localisation during XCI. Could the authors elaborate further on the why their cells behave differently compared to the Jeon 2011 paper?

      We are working with a mouse ESC model of inducible Xist from its endogenous locus on ChrX and using the dTAG system to degrade YY1 protein. By contrast, Jeon 2011 worked with an Xist transgene integrated at random in the genome of mouse embryonic fibroblasts (MEFs) and siRNA knockdown of YY1. The difference in our observations could be linked to any of these 4 differences (ie cellular context, Xist genomic location, Xist introns, knockdown strategy), but we cannot identify a specific explanation.

      In figure 4G and figure S3D elevated levels of Xist are observed in the dTAG conditions. As the authors point out, this could then result in accelerated silencing of the X seen upon YY1 loss. Are these elevated Xist levels that result in enhanced silencing in figure 4 relevant for the kinetics of silencing? Moreover, YY1 could act as transcriptional regulator of those genes in the X and by removing YY1, one would expect decreased transcription, which would be read as accelerated silencing. The authors could see whether the genes that show accelerated silencing are regulated by YY1 in ESCs (+ dTAG, - Dox).

      We agree that these points are important to consider when interpreting the results of the YY1-FKBP12F36V ChrRNA-seq we present in Figure 4. However, we believe they are covered in the text during our discussion of the data.

      In relation to the final suggestion, the silencing of almost all X-linked genes is increased upon YY1 removal so separating a specific set of genes which show accelerated silencing would be difficult. Nevertheless, in Fig4F we report that the increases in Xi silencing are strongest for direct YY1 target genes. In fact, these genes also show a minor decrease in expression in the + dTAG - Dox condition (see response to Reviewer #1 point 12.). However, by-and-large the differences in Xa log2FCs between YY1-target and non-target genes are less statistically significant. Non-significant p-values are not shown on Fig4F, but can be found in our Rmarkdown analysis records.

      Can the authors explain why they decided to put the Smchd1 part after the conclusion? Before the conclusion would have been better? The probable link between YY1 and SmcHD1 is definitely something important to investigate.

      Supplemental FigS4 relating to SmcHD1 is more speculative and we lack direct mechanistic evidence linking YY1 and SmcHD1. It would require more experiments to substantiate this as a mechanism. We think these experiments could potentially be very interesting, but are beyond the scope of this study.

      In the paper the authors cite Bowness et al., 2022. In it, Figure 5F studies silencing times with respect to silencing dependency on SmcHD1. What is the overlap between SmcHD1 target genes and YY1 target genes? This would provide more data about the correlation between YY1 and SmcHD1.

      There is an association between YY1 target genes and our previous categories of genes based on SmcHD1 dependence (13/56 SmcHD1_dependent genes are YY1 targets compared to only 8/101 of SmchD1_not_dependent genes). However, this enrichment of YY1 targets in SmcHD1 dependent genes is not so striking to warrant inclusion into the (very short) discussion of SmcHD1 in this paper. This association is also expected from the fact that both YY1-target genes and SmcHD1-dependent genes associate with the set of slow-silencing genes.

      Of note, our categories of SmcHD1 dependency were in fact defined in a previous study (Gdula et al., 2019) from a different cellular model (SmcHD1 KO MEFs).

      The authors hypothesise that SmcHD1 might play a role in the eviction of YY1 in NPC differentiation. The current data shows impaired silencing of slow silencing genes and YY1-dependent genes in the SmcHD1 knock-out. However, it doesn't show SmcHD1 is required for YY1 eviction. Could the authors provide direct evidence for their hypothesis by performing NPC differentiation in wild type and SmcHD1 knock-out cells and investigate YY1 binding using ChIP-seq?

      The data we show in FigS4 is ATAC-seq data. It shows that YY1 target REs are particularly more accessible from the Xi in SmcHD1 KO, which is not direct evidence but does align with a potential role for SmcHD1 in mediating removal of YY1 binding from Xi (see our response to Reviewer #2’s comment 2.). We agree that YY1 ChIP-seq over the same time course would be an interesting experiment, but arguably this would also only be indirect evidence (ie increased Xi YY1 enrichment may be due to a confounding consequence of SmcHD1 KO). We therefore believe the full suite of experiments needed to rigorously test the hypothesis are beyond the scope of this paper.

      In figure S4A and S4B no significance is indicated among the different conditions across the different differentiation days. Could the authors add this?

      At all timepoints, differences of Xi accessibility between YY1-binding vs non-YY1 REs are significant. P values are now added to FigS4 and the statistical test is described in the legend.

      Finally, we would like the authors to elaborate in the conclusion about the order of events. As they correctly state at the top of page 5 (and we agree), delayed loss of promoter accessibility compared to gene silencing does not automatically mean that it is downstream of gene silencing. Can you elaborate on this? Also, in light of Fig S2C where loss of YY1 binding seems to happen after gene silencing.

      We mention in the text and in the above response to Reviewer #2 (point 2.) that we do not wish to overinterpret this quantitative difference because the assays are technically different and accessibility is not linearly correlated with gene expression.

      It is possible to speculate plausible biological explanations for this discrepancy in kinetics between accessibility loss, TF binding and gene silencing. For example, a change in the landscape of histone modifications at a promoter may have little effect on its accessibility to TFs but directly hinder RNA Polymerase II in initiation and/or elongation of transcription of the gene. However, we prefer to keep this speculation out of the main text of the paper.

      Reviewer #3 (Significance (Required)):

      This manuscript highlights a novel role for YY1 in XCI. The manuscript provides an analysis of the correlation and causation of YY1 in gene silencing during XCI. There is a clear correlation between YY1 and delayed silencing of genes on the Xi. To our knowledge, this is the first time such an analysis has been performed for YY1. It advances our conceptual and mechanistic understanding of gene silencing kinetics and what the factors involved in it are. We believe it is an important contribution to the XCI field and will be of great value to the XCI community.

      Strength:

      This study presents a comprehensive and in-depth characterization of X-linked gene silencing during XCI.

      Two different types of inducible XCI are studied and compared (ESCs vs differentiation towards NPCs), which we are grateful for.

      Systematic and stepwise analysis of the data is very strong.

      Many data points have been collected which provide stronger conclusions.

      Weakness:

      Some sentences in the abstract should be toned down.

      YY1 eviction on the inactive X doesn't seem crucial to establish X-linked gene silencing in mESCs.

      The mechanistic approach at the end of the manuscript with relation to SmcHD1 could be studied further.

      This paper will be suited for a specialised audience in XCI and transcription factor control of gene expression, i.e. basic research.

      Field of expertise: XCI, epigenetics, Xist, gene silencing, X chromosome biology.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, Bowness and colleagues describe the interesting finding that the transcription factor YY1 is associated with slow silencing genes in induced X-Chromosome Inactivation (XCI). The authors have conducted a comprehensive characterization of X-linked gene silencing and the loss of chromatin accessibility of regulatory elements in induced XCI in ESCs and during NPC differentiation. X-linked gene silencing was classified into four categories, ranging from fast-silenced genes to genes that escape silencing. Motif enrichment analysis of regulatory elements associated with slowly silenced genes identified YY1 as the transcription factor most significantly enriched. The separation of YY1-target and non-target genes confirmed that most genes bound by YY1 indeed exhibit slower silencing kinetics. A comparison of the binding kinetics of YY1 to another transcription factor, OCT4, during XCI revealed that YY1 is evicted more slowly compared to OCT4 on the inactive X, suggesting that slower eviction is a unique property of YY1. Conditional knock-outs of YY1 using protein degradation during induced XCI in mESCs demonstrated that the loss of YY1 at target genes enhances silencing. This supports the hypothesis that YY1 serves as a crucial barrier for slow-silenced genes during XCI. Finally, the authors propose a hypothesis regarding the mechanism of YY1 eviction, suggesting a potential connection to the role of SmcHD1 during XCI.

      The authors provide an in-depth analysis of the role of YY1 in gene silencing kinetics during induced XCI and believe this manuscript should be published if our comments are addressed.

      Major comment:

      Based on the allelic ratio in figure 3C only minor loss of YY1 binding occurs in induced XCI in mESCs on the Xi, while silencing is established properly as shown in figure 4C (left panel, red boxplots). This suggests that YY1 eviction is not necessarily required for these genes to be effectively silenced. Could the authors explain this discrepancy in the data regarding their manuscript conclusions? It seems this is true for XCI happening during differentiation towards NPCs, but not if cells are stuck in the pluripotency stage?

      Minor comments:

      In the abstract lines 7-8, the authors state that the experiments were performed in mouse embryonic stem cell lines, but much of the data shown is acquired in NPC differentiations. Please adjust abstract.

      The last sentence of the abstract states that YY1 acts as a barrier to silencing but as stated in my major comment, that does seem to be the case in ESC differentiation towards NPCs, but not in ESCs themselves. Please tone down this sentence. Moreover, we do not fully understand where the 'is removed only at late stages' comes from? Is this because of the Smchd1 link? We find this link quite weak with the data presented. We would tone down that last abstract sentence.

      Several comparisons to human XCI have been made in the article. We do agree that there are similarities between mouse and human XCI. However, there is insufficient data that substantiates that these genes are regulated in a similar manner in humans. We believe the comparisons should be removed altogether or attenuated.

      At bottom of page 4, the authors say that for any given gene, the allelic ration of accessibility at its promoter decreased more slowly than it silenced and then write Fig 1B. They probably mean S1C? Since 1B only shows 4 genes.

      Figure 1B shows the average allelic ratio of multiple clones for genes representing different silencing speeds. Each data point is the average of multiple clones for these representative genes, could the authors show the individual data points or the standard deviation?

      Figure 1B. Loss of promoter accessibility lags behind loss of chromatin-associated RNA expression for these 4 genes. What about distal REs? Do the allelic ratios for the distal REs more closely follow chromatin-associated RNA expression? Could the authors show this in a supplemental figure?

      Figure 1B: gene silencing trajectory is depicted left while the legend says right. Same for promoter accessibility.

      Figure S1A shows only part of the X chromosome. The area downstream of Xist is missing. Is this because the iXist-ChrXDom cell line is missing allelic resolution as shown in figure S2A? Could the authors explain in the figure legend that part of the X-Chromosome is missing?

      Figure 2C shows that 94 TSSs bear a YY1 peak, yet Fig 2F shows 62 are targets of YY1. Is this because the rest are not properly silenced or are escapees?

      Moreover, YY1 has ~4-fold more peaks on the X chromosome on distal elements compared to promoters. Yet figure 2F exclusively shows the proportion of YY1 binding sites on TSSs. Would distal REs show similar proportions for the silencing categories? Could the authors show the differences in a Supplemental figure?

      The authors switch between different model systems in the figures, which makes quite confusing which type of XCI is being discussed. We would like to see clearly stated above all panels which cell culture condition is being studied (mESCs or NPCs).

      In Figure 3E and 3F the authors look at the binding retention of OCT4 during XCI in ESCs. However, it is not clear why the authors choose OCT4. Could the authors explain why specifically OCT4 was chosen for these analyses?

      What is the expression level of YY1 in NPCs compared to mESCS? In Supplemental S2A, it seems that YY1 protein levels decrease over time during NPC differentiation. Is part of the increased eviction a result of lower protein levels of YY1? Probably not since you calculate ratios between Xi and Xa. Can you please comment on this?

      On page 7 the authors state that degrading YY1 does not affect Xist spreading and/or localisation. Indeed, it has been previously shown by other groups that YY1 is required for Xist localisation during XCI. Could the authors elaborate further on the why their cells behave differently compared to the Jeon 2011 paper?

      In figure 4G and figure S3D elevated levels of Xist are observed in the dTAG conditions. As the authors point out, this could then result in accelerated silencing of the X seen upon YY1 loss. Are these elevated Xist levels that result in enhanced silencing in figure 4 relevant for the kinetics of silencing? Moreover, YY1 could act as transcriptional regulator of those genes in the X and by removing YY1, one would expect decreased transcription, which would be read as accelerated silencing. The authors could see whether the genes that show accelerated silencing are regulated by YY1 in ESCs (+ dTAG, - Dox).

      Can the authors explain why they decided to put the Smchd1 part after the conclusion? Before the conclusion would have been better? The probable link between YY1 and SmcHD1 is definitely something important to investigate.

      In the paper the authors cite Bowness et al., 2022. In it, Figure 5F studies silencing times with respect to silencing dependency on SmcHD1. What is the overlap between SmcHD1 target genes and YY1 target genes? This would provide more data about the correlation between YY1 and SmcHD1.

      The authors hypothesise that SmcHD1 might play a role in the eviction of YY1 in NPC differentiation. The current data shows impaired silencing of slow silencing genes and YY1-dependent genes in the SmcHD1 knock-out. However, it doesn't show SmcHD1 is required for YY1 eviction. Could the authors provide direct evidence for their hypothesis by performing NPC differentiation in wild type and SmcHD1 knock-out cells and investigate YY1 binding using ChIP-seq?

      In figure S4A and S4B no significance is indicated among the different conditions across the different differentiation days. Could the authors add this?

      Finally, we would like the authors to elaborate in the conclusion about the order of events. As they correctly state at the top of page 5 (and we agree), delayed loss of promoter accessibility compared to gene silencing does not automatically mean that it is downstream of gene silencing. Can you elaborate on this? Also, in light of Fig S2C where loss of YY1 binding seems to happen after gene silencing.

      Significance

      This manuscript highlights a novel role for YY1 in XCI. The manuscript provides an analysis of the correlation and causation of YY1 in gene silencing during XCI. There is a clear correlation between YY1 and delayed silencing of genes on the Xi. To our knowledge, this is the first time such an analysis has been performed for YY1. It advances our conceptual and mechanistic understanding of gene silencing kinetics and what the factors involved in it are. We believe it is an important contribution to the XCI field and will be of great value to the XCI community.

      Strength:

      This study presents a comprehensive and in-depth characterization of X-linked gene silencing during XCI.

      Two different types of inducible XCI are studied and compared (ESCs vs differentiation towards NPCs), which we are grateful for.

      Systematic and stepwise analysis of the data is very strong.

      Many data points have been collected which provide stronger conclusions.

      Weakness:

      Some sentences in the abstract should be toned down.

      YY1 eviction on the inactive X doesn't seem crucial to establish X-linked gene silencing in mESCs.

      The mechanistic approach at the end of the manuscript with relation to SmcHD1 could be studied further.

      This paper will be suited for a specialised audience in XCI and transcription factor control of gene expression, i.e. basic research.

      Field of expertise: XCI, epigenetics, Xist, gene silencing, X chromosome biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors studied the molecular basis of a variation in the rate of individual gene silencing on the X undergoing inactivation. They took advantage of ATAC-seq to observe the kinetics of chromatin accessibility along the inactive X upon induction of Xist expression in mESCs. They demonstrated a clear correspondence between the decrease in chromatin accessibility and the silencing of nearby genes. Furthermore, they found that persistently accessible regulatory elements and slow-silencing were associated with binding of YY1. YY1 tended to associate longer with genes that required more time to be silenced than those that became silenced fast on the inactive X during XCI. The acute loss of YY1 facilitated silencing of slow genes in a shorter period. They suggest that whether or not the transcription factors stay associated longer is another factor that impacts the variation in the rate of gene silencing on the Xi.

      Significance

      It has been suggested that the rate of gene silencing during XCI is varies depending on the distance of individual genes from the Xist locus or the entry site of Xist RNA on the X, as well as their initial expression levels before silencing. This study provides another perspective on this issue. The persistent association of transcription factors during XCI affects the rate of gene silencing. Although the issued addressed here might draw attention from only the limited fields of specialists, their finding advances our understanding of how the efficiency of silencing is controlled during the process of XCI. The experimental data essentially support their conclusion, and the manuscript was easy to follow. However, I still have some comments, which I would like the authors to consider before further consideration.

      Major concerns

      Based on the results shown in Figure 3E and F, the authors concluded that YY1 was more resistant than other TFs against the eviction from the X upon Xist induction. I am not still convinced with this. YY1 binds DNA via the zinc finger domain, while Oct4 binds DNA via the homeodomain. The difference in the binding module between them might affect their dissociation or the response to Xist RNA-mediated chromatin changes. In addition, given that YY1 has been reported to bind RNA, including Xist, as well, Oct4 might not be a good TF to compare.

      I don't think that Kinetics of YY1 eviction upon Xist induction in SmcHD1 KO cells during NSC differentiation fit the phenotype of Smchd1mutant cells. Although their previous study by Bowness et al (2022) showed that Smchd1-KO cells fail to establish complete silencing of SmcHD1-dependnet genes, their silencing still reached rather appreciable levels according to Figure 6 of Bowness et al (2022). This is, in fact, consistent with the idea that XCI initially takes place in the mutant embryos, at least to an extent that does not compromise early postimplantation development. On the other hand, a significant portion of YY1 appears to remain associated with the target genes on both active and inactive X (Figure S4), which I think suggests that the presence of YY1 is compatible with silencing of SmdHD1-dependent genes. This is contradictory to the proposed role of YY1 that sustains the expression of X-linked genes in this context.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: This manuscript uses differentiation of the highly informative inter-specific hybrid mouse ESC to follow features of genes that inactivate slowly. Resistance to silencing is reflected in reduced change in chromatin accessibility and the authors identify YY1 and CTCF as enriched amongst these 'slow' genes. This finding is provocative as these factors have been reported to enrich at both human and mouse escape genes. The authors go on to demonstrate that eviction of YY1 is slowly evicted from the X, and that removal of YY1 increases silencing.

      Minor Comments:

      Overall, the manuscript's conclusions are well supported; however, the brevity of the presentation in some places made it difficult to follow, and in other places seemed a missed opportunity to more fully examine or present their data.

      1. Introduction is only 2 paragraphs and half of the last is their new findings. First part of results/discussion is then forced to be very introductory. In addition, some discussion of escapees, even if predominantly human, seems warranted in the introduction. There are multiple studies that have tried to identify features enriched at genes that escape inactivation that could be mentioned.
      2. Variation in silencing rates. 'Comparable rankings' cites multiple studies (oddly previous sentence cites only two) - how concurrent are they? Developing this further (perhaps a supplementary table) would inform whether the genes assessed are ones that routinely behave similarly across different studies/lines; and also serve as a resource for future studies.
      3. It would be helpful to give insight into informativity of cross - what proportion of ATAC-seq peaks were informative with allelic information (and similarly, what proportion of genes expressed had allelic information?
      4. P5: "can be influenced by Xist RNA via a variety of mechanisms" seems like it this sweeping statement could use expansion, or at least a reference. Authors could also clarify that 'distal elements assigned by linear genomic proximity is their definition of nearest gene.
      5. Figure S1 - there are 6-8 other regions that fail to become monoallelic - what are they?
      6. Is there any correlation between silencing speed and expression (as previously reported)? If yes, then is there also a correlation with YY1 presence - and is this correlation greater than or less than seen on autosomes?
      7. Figure 2: Have the authors considered using quartiles rather than an arbitrary division into depleted and persistent? B. Could simplify secondary labels to solely YY1 and CTCF. D & F do not print in black and white. Overall the mESC versus NPC can be confusing, perhaps mESC (no diff) would be helpful?
      8. The numbers appear to suggest YY1 is generally enriched on X, but not at promoters?? Is this true? For Figure 2f, it might be helpful to show autosomal genes - are Fast depleted or Slow enriched for YY1 relative to autosomes? More generally, is YY1 binding on the X lost more slowly than YY1 binding on autosomes, or is the slow loss a feature of YY1. While I agree YY1 could have direct up or down-regulatory roles, Figure S3 could also be reflecting a secondary impact.
      9. Figure 3, 4 and supplementary - the chromosome cartoon introduces the LOH in iXist, but this needs to be described in text. Describing the reciprocal as a biological replicate seems challenging given this LOH.
      10. Could a panel of TFs be used rather than OCT4 which has its own unique properties to emphasize that YY1 is unique?
      11. Figure 4 - by examining 'late' genes, a change in allelic ratio is observed, but what about escape genes (e.g. Kdm5c, Kdm6a)? Do they now become silent? It would be helpful to have all this data as a supplementary table so people could query their 'favorite' gene.
      12. It seems surprising that loss of YY1 has no demonstrative impact on the Xa. Figure S3B suggests that over 1000 genes are significantly impacted - primarily down regulated. How many of those are X-linked? Perhaps they could be colored differently?
      13. Since XIST+/undifferentiated cells retain YY1, is YY1 binding sensitive to DNAme? Indeed, are X chromosome bound sites in islands that become methylated? Figure S4 shows YY1-targetted X genes in SMCHD1 knockout; can CTCF targets also be shown? While identified in Figure 2, CTCF was not examined the way YY1 was, although it has also been identified in somatic studies of genes that escape X inactivation.
      14. When the authors use cf. do they simply mean see also, or as wikipedia suggests: "the cited source supports a different claim (proposition) than the one just made, that it is worthwhile to compare the two claims and assess the difference". Perhaps it would be worth spelling out to clarify for the audience.

      Significance

      General assessment: An important question in human biology is how much the sex chromosome contributes to sex differences in disease frequency. Genes that escape X inactivation in humans seem to have considerable impact on gene expression genome-wide. While there are not as many genes in mouse that escape inactivation, the use of the mESC cell differentiation approach allows detailed assessment of the timing of silencing during inactivation. The authors utilize an inter-specific cross and it would be interesting to know the limitations of such a system (in terms of informative DHS/genes that are informative).

      Advance: As the authors note, there are multiple studies of similar systems that have revealed differences in the speeds of silencing of genes. However, this is the first study to my knowledge that has then tried to assess timing with gene-specific factors. There are multiple studies in humans comparing escape and subject genes for TFs, but lacking the developmental timing that this study incorporates.

      Audience: While generally applicable to a basic research audience interested in gene regulation, the applicability to human genes that escape inactivation may interest cancer researchers or clinical audiences interested in sex differences.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      General Statement We very much appreciate the reviewers' thorough comments and are sincerely grateful for their kind remarks on the novelty and interest of our manuscript. We are confident to have addressed all the points that they have raised including new data, as well as revised figures and text.

      Point-by-point description of revisions All the revisions have been already carried out and included in the transferred manuscript.

      Reviewer #1

      Major comments:

      > The number of the replicates/animals for the experiments described in Figures 1 and 2 should be reported either in the figure legends or in the methods (statistical analysis). We have added the required numbers to the corresponding revised figures, as requested.

      > A relevant part of the discussion repeats what the authors have already said in the results. I would recommend to reorganize this section, emphasizing the importance of these results in the context of human brain tumors.

      Following our own style, we have written a very short (46 lines in length!) Discussion. We dedicate a few lines to highlighting two points: (1) the suggestion, derived from our allograft experiments, that the initial stages of tumour development and long-term tumour growth may be molecularly distinct events, and (2), the unique effect of the combined loss of TrxT and dhd on mbt tumour transcriptomics -unique because none of the suppressors of mbt reported before are as effective in erasing both the MBTS and SDS mbt signatures. Neither of these points are raised in Results. In the remaining few lines we put our results in the context of human Cancer/Testis and elaborate on the fact that the TrxT and dhd pair qualify as head-to-head, CT-X genes, like those reported in human oncology. This is as far as we are willing to go at this stage at emphasizing the importance of our results in the context of human tumours.

      Reviewer #2

      > 1. Figures should include information regarding the sex of the larvae, particularly as there has been a previously reported sex-linked effect in the phenotypes analysed. (e.g. in Figure 2 and Figure S1, where Indication of the sex of the animals should be provided in the figure OK and not just in the figure legend). We fully agree. Sex must always be taken into account as a biological variable. All the experiments reported in the manuscript were carried out with sexed samples, and were annotated accordingly in the original text. In compliance with the reviewer's request we have added this information also to the revised figure.

      *> 2. Data regarding fertility. Can this be shown in a table format? Are dhdKO females fully sterile? What are the fertility levels of Df(1)J5? * Please note that we are not discovering anything here but merely corroborating what has been published before: the lack of TrxT does not affect fertility in either sex; the lack of Dhd results in female sterility (Torres-Campana et al., 2022, Tirmarche et al., 2016, Svensson et al., 2003, Pellicena-Palle et al., 1997). Adding a table would not be justified. Moreover, it would be a rather simple table: all single-pair mating tests (n=10 for each genotype) with Trxt KO and Dhd KO males, and TrxT KO females were as fertile as control flies, while all single-pair mating tests (n=10) with Dhd KO females were sterile.

      > 3. Are dhd and TrxT the only genes affected by Df(1)J5? Is there transcriptional data from Df(1)J5 animals to suggest that nearby genes are not affected by the deficiency? Of particular interest would be to assess if snf is affected or not as it is a known regulator of gene expression and splicing. Yes dhd and TrxT are the only genes affected by Df(1)J5. That is the case according to Flybase (citing Svensson et al., 2003, and Salz et al., 1994) and confirmed by our own RNAseq data. No other transcripts, including snf, are affected by Df(1)J5.

      > 4. In Figure 1C, statistical test plus indication of significance is not presented. The requested statistical test and significance data have been added as required to the revised figure and figure legend.

      > 5. Related to Figure 1D. Additional neural markers could be assessed in dhdKO and TrxTKO flies. Whilst the gross morphology of the brain does not seem to be affected, there is a possibility that cell specification is affected. Specific markers for the NE, MED and CB could be used to assess this in more detail, particularly as the DE-cad images shown for dhdKO and TrxTKO flies seem to differ slightly from the control. We believe that there may be a small misunderstanding here. We have made this point clear in the revised version by referring to substantial published data showing that expression of these two genes is restricted to the germline and that, female fertility aside, TrxT and dhd deficient flies' development and life span are perfectly normal. If anything, Figure 1D is redundant. However, we would rather keep it as a control that our CRISPR KO mutants behave as expected.

      > 6. Related to Figure 2A, images from TrxTKO; l(3)mbtts1, dhdKO and l(3)mbtts1 should be added at the very least in a supplementary figure. Additionally, data for NE/BL ratio should be provided for dhdKO, TrxTKO and Df(1)J5 in the absence of l(3)mbtts1 tumours. Related to Figure S1, quantification of NE/BL ratio for female lobes should be added to the figure. All the requested images and data have been included in the revised version in new figures Figure S2B, Figure S2A, and Figure S1A.

      > 7. Related to Figure 2B and Figure S1, three rows of images are presented for each genotype. It is unclear whether these correspond to brain lobes from different larvae or different confocal planes from the same animal. This should be clarified in the figure and/or figure legend. This point has been clarified as requested in the revised figure legend. Each group of three rows correspond to brain lobes from different larvae of the same genotype.

      > 7 cont. Related to this, in addition to the anti-DE-cadherin data, it would be informative to include immunofluorescence data using antibodies such as anti-Dachshund (lamina), anti-Elav (medulla cortex) and anti-Prospero (central brain and boundary between central brain and medulla cortex) (as assessed in e.g. Zhou and Luo, J Neurosci 2013) in the mbt tumour situation to accurately describe regions disrupted by the tumours. There is no denying that taking advantage of the many cell-type specific markers that are readily available in Drosophila could be of interest. The same applies to cell cycle markers like PH3, FUCCI, and many others. However, we believe that interesting as they may be, none of this markers will give us the clue on the molecular basis of TrxT and Dhd tumour function that is, of course, the open burning question that we are trying to address now.

      > 8. Authors should clarify how the NE was defined when mbt tumours are generated, as it is severely affected. From the images provided, it is unclear which region corresponds to NE or how the NE/BL ratio was measured. It would be helpful to outline these regions in the images or, as mentioned above, use antibodies to define them. The figure has been modified to include the requested outlines defining the NE that indeed is correspond to the channel showing DE-Cadh staining.

      > 9. Figure 2C does not have indication of statistical significance for the comparisons stated in the text. Potential explanations for the different roles of Dhd and TrxT in long-term tumour development should be explored in the discussion. The requested statistical significance data for these comparisons were stated in the second last paragraph of that section. To make these data more prominent we have also added this information to revised Figure 2C.

      >9 cont. Related to this, does the analysis of the RNA-seq data from TrxTKO; l(3)mbtts1 and dhdKO; l(3)mbtts1 animals reveal why they have similar effect on mbt tumour development but do not synergistically contribute to long-term growth? Unfortunately our analysis of the RNA-seq data from TrxTKO; l(3)mbtts1 and dhdKO; l(3)mbtts1 animals does not give us any clue that could help us understand why they have similar effect on mbt tumour development, but not in long-term growth (allografts). To further explore this point, we have added new Figure S3 that includes a Venn diagramme showing the overlap between the affected mMBTS genes in TrxTKO; l(3)mbtts1 and dhdKO; l(3)mbtts1, together with the lists of enriched GOs among overlapping and non-overlapping genes. GO differences are tantalising, indeed, However, they do not immediately suggest any direct explanation for the different roles of Dhd and TrxT in long-term tumour development.

      > 10. Authors should clarify if there is any overlap between the affected M-tSDS and F-tSDS in the TrxTKO; l(3)mbtts1 and dhdKO; l(3)mbtts1 conditions. Would the limited overlap suggest that TrxT and dhd act in parallel rather than synergistically? This might also explain the differential effects on long-term tumour development. Additionally, the stronger effect observed in Df(1)J5 animals may be due to TrxT and dhd functional redundancy. Currently, there is limited evidence to suggest that TrxT and dhd act synergistically to regulate mbt tumour growth based on the presented data. See below.

      > 11. Authors should include a Venn diagram depicting affected genes (M-tSDS and F-tSDS) in the TrxTKO; l(3)mbtts1, dhdKO; l(3)mbtts1 and Df(1)J5; l(3)mbtts1 genotypes as this could clarify the percentage of overlap of gene signatures in these different conditions. Related to this point, authors could provide results from GO analysis to investigate whether specific functional clusters are altered in the different conditions. We have taken the liberty of fusing points 10 and 11 that are conceptually similar. The requested Venn diagrams showing the overlap between the affected M-tSDS and F-tSDS genes in the TrxTKO; l(3)mbtts1, dhdKO; l(3)mbtts1, and Df(1)J5; l(3)mbtts1 conditions, and GO analysis are now shown in new Figure S5. Unfortunately, these new data do not suggest any obvious explanation for the differential effects of these two genes, nor do they allow us to derive any further conclusions regarding the nature of the pathways through which TrxT and dhd cooperate to sustain mbt tumour growth. However, our analyses demonstrate that efficient suppression of mbt phenotypic traits (in larval brains) and transcriptome requires the combined elimination of both germline thioredoxins, while the effect of individual removal of either of them is only partial. These data demonstrate the synergistic nature of TrxT and dhd function in mbt tumour growth.

      > 12. In Figure 3E, authors should indicate more explicitly in the figure panel and/or figure legend which genes display significant differences in expression in the different samples. We apologise for not having made this point clear in the original version: All (21) genes shown in this Table are significantly downregulated in DfJ5;ts1 vs ts1. From these, nanos and Ocho are also significantly downregulated in TrxTKO;ts1 vs ts1, and Ocho, HP1D3csd, hlk, fj, Lcp9, CG43394, and CG14968 are significantly downregulated in dhdKO;ts1 vs ts1. These data have been included in the revised figure legend. Data on all other comparisons are included in Table S1.

      > 13. In Figure S2C-F it is not clear if the graphs represent data from all tissues or data from male and female tissues separately, as shown in Figure 4. Apologies for the confusion. All samples were from male tissues as indicated in the original figure legend. To make it more clear, we have labelled all four panels in the revised figure.

      > 14. Are TrxT and dhd also deregulated in other tumour types? Or is this specific for mbt tumours? This information could be provided to enhance the scope of the manuscript. Thank you for raising this point. TrxT and dhd are not dysregulated in the other tumour types that were analysed in Janic et al., 2010 (i.e pros, mira, brat, lgl and pins).

      > 15. Authors conclude that TrxT and dhd cooperate in controlling gene expression between wild-type and tumour samples and that they act synergistically in the regulation of sex-linked gene expression in male tumour tissue. However, the link between the two observations (if indeed there is a link) has not been well explained. Is the effect on gene expression in tumours simply a result of the regulation of sex-linked transcription? Our data show that TrxT and dhd synergistically contribute to the emergence of both the MBTS (i.e tumour versus wild type) and SDS (i.e. male tumour versus female tumour). The only certainty at this time regarding the interconnection between both signatures is that they overlap, but only partially, which answers one the questions raised by the reviewer: the effect on gene expression in tumours is not simply a result of the regulation of sex-linked transcription. Beyond that, the link (if indeed there is a link) between these two signatures has not been investigated. The lack of insight on this issue is not surprising taking into account that, in contrast to classical tumour signatures (tumour versus healthy tissue), the concept of sex-linked tumour signatures is relatively new and only a handful of such signatures have been published. Moreover, the vast majority of classical tumour signatures have not been worked out in a sex-dependent manner.

      Reviewer #3 Comments: > - In the first section of the results, as a first step to study the role of TrxT and dhd genes on mbt tumors the authors generate CRISPR knock outs of these genes and correctly validate them. However, afterwards, the experiment where the authors test the KO of these genes in a wild-type larva brain is not contextualized with the rest of the section. It might be best to first address the role of these genes in a tumor context and only then complement with the experiments in wild-type (in supplementary material). We do appreciate the reviewer's view, but respectfully disagree. In our opinion, the manuscript flows better by presenting the tools that we have generated in Figure 1, By corroborating published data showing that these two germline genes do not affect soma development (Torres-Campana et al., 2022, Tirmarche et al., 2016, Svensson et al., 2003, Pellicena-Palle et al., 1997) this first figure not only validates our CRISPR KO mutants, but also sets the stage to highlight their significant effect on a somatic tumour like mbt.

      > - Fig 2 B - To back up the quantifications in Fig 2A the authors could include images of l(3)mbt ts1 tumors with TrxT KO and dhd KO also. The requested images are shown in new figure Figure S2B.

      > Fig 2 B and C - Indeed, the results suggest that TrxT seems to be responsible for most tumor lethality upon l(3)mbt allografts, but not dhd. This is curious since l(3)mbt; dhd KO brain tumors have the same partial phenotype as l(3)mbt; TrxT KO (fig 1A). It would be interesting to further explore these phenotypes by staining l(3)mbt; TrxT KO and l(3)mbt; dhd KO brains with, for instance, PH3 to understand if the number of dividing cells of these tumors could be different. In addition, to back up this information, the authors could look at what happens to l(3)mbt tumors with TrxT KO and dhd KO at a later stage of development (or to larva or pupa lethality if that is the case) and compare it with l(3)mbt brains. We did explore the possibility of looking at later stages. Unfortunately, the onset of the lethality phase compounded by major tissue reshaping from larval to adult brain make these stages unsuitable to reach any meaningful conclusion. With regards to staining for PH3, we think that like FUCCI and a long list of other useful labels that could be explored, it is potentially interesting, but hardly likely to give us the clue on the molecular basis of TrxT and Dhd tumour function, that is of course the one important question that we are addressing now.

      > - Fig 2 B - What happens to the medulla in a l(3)mbt brain tumor? Although the ratio of NE/BL is the same for wild-type and D(1)J5; l(3)mbt, it still seems that the medulla in D(1)J5; l(3)mbt brains is substantially bigger, although quantifications would be required. Do the authors know if the NE in D(1)J5; l(3)mbt brains is either proliferating less or differentiating more? There are no significant differences in medulla/BL nor in CB/BL ratios. The corresponding quantifications have been added to the revised version. As for the question on proliferation versus differentiation, the simple answer is that we do not know.

      > Figure S1 - Although the effects of TrxT KO and dhd KO in male mbt tumors seem to be enhanced in relation to female tumors, the authors should include some form of tumor quantification for female tumors like in Fig 2 A. We have carried out the requested quantifications and added the results in a new panel in revised Figure S1A.

      Moreover in the 2nd section of the results, relative to Fig 1S in "...Df(1)J5; l(3)mbtts1 female larvae although given the much less severe phenotype of female mbt tumours, the effect caused by Df(1)J5 is quantitatively minor." to say "quantitatively" minor, the authors should include not only quantifications, but a form of comparison between female tumors vs. male tumors. The requested quantification was published in Molnar et al., 2019. However, we agree on the convenience of doing it again with our new samples. The new data, that confirm published results, are now shown as a new panel in revised Figure S1C.

      > - Fig 3D - The hierarchical clustering was done according to which parameters? A brief explanation could help a better interpretation of this results section. The requested information has been added to the Methods section. Hierarchical clustering was done using the function heatmap.2 in R to generates a plot in which samples (columns) are clustered (dendogram); genes (rows) are scaled by “rows"; distance = Euclidean; and hclust method = complete linkage. Expression levels are reported as Row Z-score.

      > - Fig 3D - It could be beneficial for the authors to include an analysis of the downregulated genes shared between TrxT KO mbt tumors and dhd KO mbt tumors, as well as the genes that are not shared (besides MBTS genes). Could be something like a Venn diagram. Thanks for pointing this out. New Figure S3 shows the requested Venn diagram, as well as the list of enriched GOs for each group.There are no enriched GOs in the list of overlapping genes. TrxTKO; l(3)mbtts1-specific genes are enriched for GOs related to game generation, sexual reproduction, germ cell development and simlar GOs. dhdKO; l(3)mbtts1 -specific genes are enriched for GOs related to chitin, molting and cuticle development. Tantalising as they are, these observations do not immediately suggest any direct explanation for the different roles of Dhd and TrxT in long-term tumour development. We are happy to add these supplemental information, but we do not deem it worth of any further discussion at this point.

      > - Results section 3 - "Expression of nanos is also significantly down-regulated upon TrxT loss, but remains unaffected by loss of dhd" - to corroborate the idea that TrxT and dhd work as a pair, but contribute to different functions within the tumor, it would be interesting for the authors to do an allograft experiment of dhd KO; l(3)mbt male tissue with nanos knock down in the brain, if genetically possible. The suggested experiment is published. The gene in question (nanos) is a suppressor of mbt tumour growth: In a nanos knock down background, l(3)mbt allografts do not grow (Janic 2010).

      Minor comments: * > - In the first section of the results, the authors claim that "Consistent with the reported phenotypes of Df(1)J5...", but then the study is not mentioned.* The corresponding references (Salz et al., 1994; Svensson et al., 2003; Tirmarche et al., 2016) have been added.

      > - Fig 1 B - It is a bit confusing to follow where TrxT and dhd are in the Genome browser view. I am guessing we should follow the TrxT-dhd locus from A, but the authors could make it clearer. Figure 1 has been changed to make this point more clear.

      > - In the same section, in the next sentence, the homozygous and hemizygous is a bit confusing. "...homozygous TrxTKO females, dhdKO males, and TrxTKO males", should be corrected. We appreciate the suggestion, but would rather stick to classical terminology and refer to KO/KO females as homozygous and to KO/Y males as hemizygous.

      >- In the same section (Fig 1C): "RNA-seq data also shows that TrxT is significantly upregulated in l(3)mbtts1 males compared to females (FC=7.06; FDR=1.10E-44) while dhd is not (FC=1.89; FDR=2.00E-14)." - But dhd is nevertheless upregulated, although less, in l3mbt males, right? The authors might need to rephrase. We refer to comparing males versus females, not wild type versus tumours. The text has been rephrased in the revised version to make this point clear.

      > - Fig 2 A (quantifications), should be after the confocal images (Fig 2 B). We respectfully disagree on this minor point. We initially organised this figure in the order recommended by the reviewer, but we eventually found it easier to write the article using the order shown in the submitted figure. We would rather stick to this version.

      > - Fig 2 B and Fig S1 - Please include an outline of at least neuroepithelia and, if possible, Central brain or medulla so that these regions can more clearly identified. Moreover, these results will be easier to interpret if you add a male symbol in this image and a female symbol in Figure S1, otherwise, it might seem like the same figure Outlines and symbols have been added to the revised figure, as required.

      > - In results, section 2, "Consequently, in spite of the strong sex dimorphism of mbt tumours, the phenotype of Df(1)J5; l(3)mbtts1 larval brains is not sexually dimorph" - to back this up, quantifications of Df(1)J5; l(3)mbtts1 female vs male tumor size, as well as statistical analysis are needed, like previously said. The requested the new data is now shown in revised Figure S1C.

      > - In results section 2 - "For allografts derived from, female larvae, we found that differences in lethality rate caused by TrxTKO; l(3)mbtts1, dhdKO; l(3)mbtts1, Df(1)J5; l(3)mbtts1, and l(3)mbtts1 tissues (7-23%) were not significant (Figure 2C)" - there is no statistical analysis to conclude that the lethality rate is not significant, from 7% to 23% still seems like a difference. Thanks for pointing this out. We did of course generate the requested statistical analysis data, but failed to include it in the manuscript. Chi-square statistical test gives a p value=0.2346. These data have been added to the revised version.

      > - Last paragraph of section 2 of results - very long and confusing sentence. Please rephrase text. We have rephrased this sentence to make it shorter and clearer.

      > - On section 3 of results: "The vas, piwi and CG15930 transcripts are not significantly down-regulated following either TrxT or dhd depletion alone." - in Fig 3E, not only these transcripts seem to suffer a slight downregulation, but there is also no statistical analysis supporting this. There seems to be a misunderstanding here. The requested statistical data for each gene were shown in Table S1

      > - First paragraph of section 3 results - the first sentence is written in a confusing way. Moreover, more context is needed in the sentence afterwards: "we first focused on transcripts that are up-regulated in male mbt tumour samples compared to male wild-type larval brains (mMBTS)." but using which data? The RNA seq data? Agreed; this paragraph has been amended in the revised version.

      > - Brief conclusion missing on the second paragraph of the last section of results. As far as the results presented in this paragraph are concerned, we can only mention the two potentially interesting observations, which were pointed out in the original version: (i) the suggestion that nanos upregulation could be critical for sustained mbt tumour growth upon allograft, and (ii) the fact that three genes (vas, piwi and CG15930), also known to be required for mbt tumour growth, are downregulated in Df(1)J5; l(3)mbtts1, but remain unaffected following either TrxT or dhd depletion alone. We are unable to derive any other conclusion from these observations.

      > - In the end of 3rd paragraph of last section of results: "...M-tSDS and F-tSDS genes is partially reduced in l(3)mbtts1 brains lacking either TrxT or dhd, but it is completely suppressed upon the lack of both." - "completely" might not be a correct word to use in this case, as there is still some small differences As requested, we have changed "completely" for "strongly".

      > - 4th paragraph of last section of results: Either mention the male results and then female (to be in order with the figure, as the female graphs come after the male graphs) or change the order in the figure. Also, this paragraph is not very clear, could benefit from a better explanation of the results and conclusions. Point taken. Figure 4 has been changed and female graphs come before male graphs. The paragraph is clearer now. The conclusion from this paragraph is included in the final paragraph of this section.

      > - Fig 4 C,D,E,F: to make it more clear, please write the name of the genotypes in question in the figure. At the reviewer's request, the genotypes in question are now written in each panel. Please note that we did not do so before because all four panels correspond to the same genotype: Df(J5); l(3)mbtts1 vs l(3)mbtts1, as we mentioned in the original figure legend.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      • In this manuscript, the authors address the role of the thioredoxins Dhd and TrxT in the development and growth of mbt tumors, a sexually dimorphic brain tumor that derives from the expansion of the neuroephitelium. To this end, the authors have successfully generated dhd and TrxT knock-out mutants using CRISPR-Cas9 and show that both dhd and TrxT individual knock-out partially reduces the mbt tumor-associated brain phenotype. Moreover, using Df(1)J5, a deficiency that affects both TrxT and dhd, the recovery of the phenotype is enhanced. However, although concomitant expression of dhd and TrxT is required for proper tumor development, they show that only TrxT is necessary for the growth of allografts derived from male l(3)mbt tumors. This is interesting, not only because TrxT and dhd are never co-expressed in physiological conditions, but also because this data suggests that the pathways leading to l(3)mbt tumor development are different from the ones that contribute to tumor proliferation and aggressiveness. Moreover, the authors show that TrxT and dhd contribute to the emergence of the mbt tumour signature (MBTS) and sex-dimorphic signature (SDS) of tumours by analysing transcriptomic data of TrxT KO; l(3)mbt, dhd KO; l(3)mbt and Df(1)J5; l(3)mbt. In fact, through hierarchical clustering, the authors show that male Df(1)J5; l(3)mbt brain transcriptomic profile becomes closer to wild-type brains than l(3)mbt ts1 tumors.

      • This study presents novelty to the cancer research field and both the model and methodology used were appropriate. Nonetheless, this study deals with mbt tumors which are sexually dimorphic, as well as male and female germline-specific genes that in a tumor can alter male and female sex-dimorphic signatures, making this study very easy to become confusing to non-experts in the field if not written in a very clear way. Therefore, the text, especially in the results and discussion section, could be revised in general to improve the comprehension and flow of the manuscript, given that some sentences and paragraphs are hard to follow. In particular, the results section could benefit with more contextualization and a more detailed explanation of experiments. Moreover, the study is lacking some quantifications and a few additional experiments. These issues can certainly be addressed by reviewing the text as well as reorganizing and including a few quantifications and experiments as described below. I am an expert in Drosophila brain development and tumorigenesis.

      Comments:

      • In the first section of the results, as a first step to study the role of TrxT and dhd genes on mbt tumors the authors generate CRISPR knock outs of these genes and correctly validate them. However, afterwards, the experiment where the authors test the KO of these genes in a wild-type larva brain is not contextualized with the rest of the section. It might be best to first address the role of these genes in a tumor context and only then complement with the experiments in wild-type (in supplementary material).

      • Fig 2 B - To back up the quantifications in Fig 2A the authors could include images of l(3)mbt ts1 tumors with TrxT KO and dhd KO also.

      Fig 2 B and C - Indeed, the results suggest that TrxT seems to be responsible for most tumor lethality upon l(3)mbt allografts, but not dhd. This is curious since l(3)mbt; dhd KO brain tumors have the same partial phenotype as l(3)mbt; TrxT KO (fig 1A). It would be interesting to further explore these phenotypes by staining l(3)mbt; TrxT KO and l(3)mbt; dhd KO brains with, for instance, PH3 to understand if the number of dividing cells of these tumors could be different. In addition, to back up this information, the authors could look at what happens to l(3)mbt tumors with TrxT KO and dhd KO at a later stage of development (or to larva or pupa lethality if that is the case) and compare it with l(3)mbt brains.

      • Fig 2 B - What happens to the medulla in a l(3)mbt brain tumor? Although the ratio of NE/BL is the same for wild-type and D(1)J5; l(3)mbt, it still seems that the medulla in D(1)J5; l(3)mbt brains is substantially bigger, although quantifications would be required. Do the authors know if the NE in D(1)J5; l(3)mbt brains is either proliferating less or differentiating more?

      • Figure S1 - Although the effects of TrxT KO and dhd KO in male mbt tumors seem to be enhanced in relation to female tumors, the authors should include some form of tumor quantification for female tumors like in Fig 2 A. Moreover in the 2nd section of the results, relative to Fig 1S in "...Df(1)J5; l(3)mbtts1 female larvae although given the much less severe phenotype of female mbt tumours, the effect caused by Df(1)J5 is quantitatively minor." to say "quantitatively" minor, the authors should include not only quantifications, but a form of comparison between female tumors vs. male tumors.

      • Fig 3D - The hierarchical clustering was done according to which parameters? A brief explanation could help a better interpretation of this results section.

      • Fig 3D - It could be beneficial for the authors to include an analysis of the downregulated genes shared between TrxT KO mbt tumors and dhd KO mbt tumors, as well as the genes that are not shared (besides MBTS genes). Could be something like a Venn diagram.

      • Results section 3 - "Expression of nanos is also significantly down-regulated upon TrxT loss, but remains unaffected by loss of dhd" - to corroborate the idea that TrxT and dhd work as a pair, but contribute to different functions within the tumor, it would be interesting for the authors to do an allograft experiment of dhd KO; l(3)mbt male tissue with nanos knock down in the brain, if genetically possible.

      Minor comments:

      • In the first section of the results, the authors claim that "Consistent with the reported phenotypes of Df(1)J5...", but then the study is not mentioned.

      • Fig 1 B - It is a bit confusing to follow where TrxT and dhd are in the Genome browser view. I am guessing we should follow the TrxT-dhd locus from A, but the authors could make it clearer.

      • In the same section, in the next sentence, the homozygous and hemizygous is a bit confusing. "...homozygous TrxTKO females, dhdKO males, and TrxTKO males", should be corrected.

      • In the same section (Fig 1C): "RNA-seq data also shows that TrxT is significantly upregulated in l(3)mbtts1 males compared to females (FC=7.06; FDR=1.10E-44) while dhd is not (FC=1.89; FDR=2.00E-14)." - But dhd is nevertheless upregulated, although less, in l3mbt males, right? The authors might need to rephrase.

      • Fig 2 A (quantifications), should be after the confocal images (Fig 2 B).

      • Fig 2 B and Fig S1 - Please include an outline of at least neuroepithelia and, if possible, Central brain or medulla so that these regions can more clearly identified. Moreover, these results will be easier to interpret if you add a male symbol in this image and a female symbol in Figure S1, otherwise, it might seem like the same figure if one does not properly read the legend.

      • In results, section 2, "Consequently, in spite of the strong sex dimorphism of mbt tumours, the phenotype of Df(1)J5; l(3)mbtts1 larval brains is not sexually dimorph" - to back this up, quantifications of Df(1)J5; l(3)mbtts1 female vs male tumor size, as well as statistical analysis are needed, like previously said.

      • In results section 2 - "For allografts derived from female larvae, we found that differences in lethality rate caused by TrxTKO; l(3)mbtts1, dhdKO; l(3)mbtts1, Df(1)J5; l(3)mbtts1, and l(3)mbtts1 tissues (7-23%) were not significant (Figure 2C)" - there is no statistical analysis to conclude that the lethality rate is not significant, from 7% to 23% still seems like a difference.

      • Last paragraph of section 2 of results - very long and confusing sentence. Please rephrase text.

      • On section 3 of results: "The vas, piwi and CG15930 transcripts are not significantly down-regulated following either TrxT or dhd depletion alone." - in Fig 3E, not only these transcripts seem to suffer a slight downregulation, but there is also no statistical analysis supporting this.

      • First paragraph of section 3 results - the first sentence is written in a confusing way. Moreover, more context is needed in the sentence afterwards: "we first focused on transcripts that are up-regulated in male mbt tumour samples compared to male wild-type larval brains (mMBTS)." but using which data? The RNA seq data?

      • Brief conclusion missing on the second paragraph of the last section of results.

      • In the end of 3rd paragraph of last section of results: "...M-tSDS and F-tSDS genes is partially reduced in l(3)mbtts1 brains lacking either TrxT or dhd, but it is completely suppressed upon the lack of both." - "completely" might not be a correct word to use in this case, as there is still some small differences.

      • 4th paragraph of last section of results: Either mention the male results and then female (to be in order with the figure, as the female graphs come after the male graphs) or change the order in the figure. Also, this paragraph is not very clear, could benefit from a better explanation of the results and conclusions.

      • Fig 4 C,D,E,F: to make it more clear, please write the name of the genotypes in question in the figure.

      Significance

      This study presents an interesting new concept for Drosophila tumors, the cancer germline genes, which to my knowledge has been a poorly explored field, although it has a lot of potential. It is particularly interesting since it addresses the role of two germline specific thioredoxins, that are dispensable for somatic cells, but have a critical role in somatic mbt tumors, exploring new tumor vulnerabilities. This manuscript will benefit researchers in the field of cancer biology, in particular, to better understand cancer-testis (CT) genes and how they promote tumorigenesis, since the biological function for the most part remains unclear.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Molnar and colleagues is a follow-up from the authors' previously published work (Molnar et al 2019), where they demonstrated differential sex-linked gene expression in Drosophila l(3)mbt brain tumours and identified the thioredoxin proteins TrxT and Dhd in the mbt tumour signature (MBTS). In the current manuscript, the authors generated genetic mutants for both genes using CRISPR TrxTKO and dhdKO and a deficiency that spans both genes (Df(1)J5) to understand the role played by these thioredoxins in normal larval brain development and l(3)mbt tumour development. Both genes were found to be largely dispensable for normal larval brain development, however the authors uncovered a role in l(3)mbt tumour growth and development. Interestingly, although both TrxT and Dhd were required for l(3)mbt tumour development, they appeared to have distinct roles in long-term tumour growth, as assessed by allograft-induced lethality. Furthermore, the authors also investigated the link between the sexually dimorphic signatures of l(3)mbt tumours and the thioredoxins and suggest that both genes are required for the differential gene expression observed between the sexes.

      Some of the conclusions are fairly well supported by the data presented. However, there are some aspects that are potentially not fully explored and that would provide more weight to the claims made by the authors. Some of these can be addressed by clarifications made in the text and/or figures and others would benefit from additional immunofluorescence staining experiments.

      Specific comments are provided below.

      1. Figures should include information regarding the sex of the larvae, particularly as there has been a previously reported sex-linked effect in the phenotypes analysed. (e.g. in Figure 2 and Figure S1, where Indication of the sex of the animals should be provided in the figure and not just in the figure legend).

      2. Data regarding fertility. Can this be shown in a table format? Are dhdKO females fully sterile? What are the fertility levels of Df(1)J5?

      3. Are dhd and TrxT the only genes affected by Df(1)J5? Is there transcriptional data from Df(1)J5 animals to suggest that nearby genes are not affected by the deficiency? Of particular interest would be to assess if snf is affected or not as it is a known regulator of gene expression and splicing.

      4. In Figure 1C, statistical test plus indication of significance is not presented.

      5. Related to Figure 1D. Additional neural markers could be assessed in dhdKO and TrxTKO flies. Whilst the gross morphology of the brain does not seem to be affected, there is a possibility that cell specification is affected. Specific markers for the NE, MED and CB could be used to assess this in more detail, particularly as the DE-cad images shown for dhdKO and TrxTKO flies seem to differ slightly from the control.

      6. Related to Figure 2A, images from TrxTKO; l(3)mbtts1, dhdKO and l(3)mbtts1 should be added at the very least in a supplementary figure. Additionally, data for NE/BL ratio should be provided for dhdKO, TrxTKO and Df(1)J5 in the absence of l(3)mbtts1 tumours. Related to Figure S1, quantification of NE/BL ratio for female lobes should be added to the figure.

      7. Related to Figure 2B and Figure S1, three rows of images are presented for each genotype. It is unclear whether these correspond to brain lobes from different larvae or different confocal planes from the same animal. This should be clarified in the figure and/or figure legend. Related to this, in addition to the anti-DE-cadherin data, it would be informative to include immunofluorescence data using antibodies such as anti-Dachshund (lamina), anti-Elav (medulla cortex) and anti-Prospero (central brain and boundary between central brain and medulla cortex) (as assessed in e.g. Zhou and Luo, J Neurosci 2013) in the mbt tumour situation to accurately describe regions disrupted by the tumours.

      8. Authors should clarify how the NE was defined when mbt tumours are generated, as it is severely affected. From the images provided, it is unclear which region corresponds to NE or how the NE/BL ratio was measured. It would be helpful to outline these regions in the images or, as mentioned above, use antibodies to define them.

      9. Figure 2C does not have indication of statistical significance for the comparisons stated in the text. Potential explanations for the different roles of Dhd and TrxT in long-term tumour development should be explored in the discussion. Related to this, does the analysis of the RNA-seq data from TrxTKO; l(3)mbtts1 and dhdKO; l(3)mbtts1 animals reveal why they have similar effect on mbt tumour development but do not synergistically contribute to long-term growth?

      10. Authors should clarify if there is any overlap between the affected M-tSDS and F-tSDS in the TrxTKO; l(3)mbtts1 and dhdKO; l(3)mbtts1 conditions. Would the limited overlap suggest that TrxT and dhd act in parallel rather than synergistically? This might also explain the differential effects on long-term tumour development. Additionally, the stronger effect observed in Df(1)J5 animals may be due to TrxT and dhd functional redundancy. Currently, there is limited evidence to suggest that TrxT and dhd act synergistically to regulate mbt tumour growth based on the presented data.

      11. Authors should include a Venn diagram depicting affected genes (M-tSDS and F-tSDS) in the TrxTKO; l(3)mbtts1, dhdKO; l(3)mbtts1 and Df(1)J5; l(3)mbtts1 genotypes as this could clarify the percentage of overlap of gene signatures in these different conditions. Related to this point, authors could provide results from GO analysis to investigate whether specific functional clusters are altered in the different conditions.

      12. In Figure 3E, authors should indicate more explicitly in the figure panel and/or figure legend which genes display significant differences in expression in the different samples.

      13. In Figure S2C-F it is not clear if the graphs represent data from all tissues or data from male and female tissues separately, as shown in Figure 4.

      14. Are TrxT and dhd also deregulated in other tumour types? Or is this specific for mbt tumours? This information could be provided to enhance the scope of the manuscript.

      15. Authors conclude that TrxT and dhd cooperate in controlling gene expression between wild-type and tumour samples and that they act synergistically in the regulation of sex-linked gene expression in male tumour tissue. However, the link between the two observations (if indeed there is a link) has not been well explained. Is the effect on gene expression in tumours simply a result of the regulation of sex-linked transcription?

      Significance

      The current manuscript provides additional information regarding the regulation of mbt tumours and establishes TrxT and Dhd as potential cancer-germline genes in Drosophila. This will be of interest to researchers studying basic mechanisms of tumourigenesis and could potentially lead to identification of genes that could serve as biomarkers of disease. However, for this, a more general role of TrxT and Dhd needs to be established, as well as their potential conserved role as cancer-germline genes needs to be established.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Two cancer-germline (CG) genes encoding the Drosophila thioredoxins Deadhead (Dhd) and Thioredoxin-T (TrxT) are located head-to-head in the X chromosome. Cristina Molnar and coworkers investigate the effects of Dhd and TrxT in brain tumours of either sex caused by mutations in l(3)malignant brain tumour (l(3)mbt). Using CRISPR/Cas9-mediated knock-out alleles and RNA-seq, they demonstrate that, although both TrxT and Dhd are not required for normal brain development, they have a significant but partial effect on l(3)mbt brain tumour development, that is stronger in male than in female larval brains. However, allograft experiments show that only TrxT plays a significant role in long-term, sustained tumour growth. TrxT and dhd play a synergistic contribution role in development of mbt tumours and in the emergence of l(3)mbt tumour-linked transcriptomic signatures.

      Major comments:

      Most of the work in this paper is well conducted and the key conclusions are convincing. I think that the number of the replicates/animals for the experiments described in Figures 1 and 2 should be reported either in the figure legends or in the methods (statistical analysis). A relevant part of the discussion repeats what the authors have already said in the results. I would recommend to reorganize this section, emphasizing the importance of these results in the context of human brain tumors.

      Significance

      This work provides the first instance of an X-linked, head-to-head cancer-germline gene pair in Drosophila showing that these genes are dispensable for somatic cell development but have a crucial role to prevent malignant growth. Importantly, in humans, cancer germline genes and cancer testis (CT) genes have been involved in a wide range of cancers and about half of CT genes are located on the X chromosome. Thus, findings in this paper would be of interest to a broad audience that includes all the scientists studying the molecular mechanisms leading to cancer development.

      The following keywords describe my expertise: Drosophila genetics, cell division, cancer genetics. I have less expertise to evaluate transcriptomics.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'The authors do not wish to provide a response at this time.'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Biddie et al. address the important question of what genomic annotations are relevant to the fine-mapping of trait-associated variation. They assessed data identifying DNase I hypersensitive sites, footprints, H3K27ac and other ChIP-seq peaks, ATAC-seq peaks, and eRNA locations using a benchmark set of allelically imbalanced DNase I hypersensitive sites and ChIP-seq peaks, and MPRA data. They find that a combination of DNase footprints and eRNA locations gives high enrichment for functional variants (albeit with low sensitivity) and demonstrate their FINDER on 53 traits from the GWAS catalog.

      I have some questions to address before publication, all of which relate to clarifying the description in the manuscript.

      1. I found this line in the abstract unclear: "This signature provides high precision, trading-off low recall".
      2. Figure 1C is missing a y-axis label and a colour legend.
      3. The authors note a high genomic coverage "by ATAC-seq (75.2%), [and] H3K27ac (61.5%)", which they attribute in part to "too low a threshold used in peak calling". However, both the benchmark datasets and the genomic predictors are utilized without consideration of the effect of thresholding. To what extent are DNase footprints and eRNA specifically informative, vs. representing datasets processed with highly selective cutoffs?
      4. The benchmark datasets are described as molQTL, bWTL, and caQTL. "QTL" implies a regression of a trait (e.g. accessibility) on a genotype. This is not accurate here: the Vierstra and Abramov datasets investigate allelic imbalance (a related but orthogonal approach), while van Arensbergen is an MPRA.
      5. QTLbase is cited alongside the Vierstra paper. I had some trouble searching in QTLbase but did not find the Vierstra dataset. It should be clarified whether QTLbase was used to download the Vierstra results, or to supplement it with other studies.
      6. Fig. 4 discusses the effect of variant centrality in a DHS for prioritization, but it isn't included in the FINDER schematic in Fig. 7. How come it wasn't employed in FINDER?
      7. The Discussion notes "Firstly, the identification of DNase footprints may be related to residency time of TFs on DNA, where rapidly exchanging factors impart poor footprints (Sung et al., 2014). Variants associated with altering binding of dynamic factors may therefore be missed. To overcome this, detection of footprints could be improved by enzymatic digestion bias correction". I don't see how enzymatic digestion bias is related to sensitivity to detect rapidly exchanging TFs.
      8. It looks like the eRNA data were obtained from GRO-seq or PRO-seq data. It would be helpful to note key details like this directly rather than leaving it to the reader to try to figure out what is in the PINTS database.
      9. The github link https://github.com/sbiddie/FINDER gives a 404 not found error. Is FINDER an actual tool implementation, or more generally describing the approach?

      Minor comments:

      1. Some of the figure text is quite small or blurry (e.g. Fig. 1A/B, Fig. S2).
      2. Typo in Figure 1A legend: "Heatpmap".
      3. The author of the last entry in the References is "Zhen Z" but should be "Zheng Z".

      Significance

      This work is highly relevant and well-done and provides practical information to guide future fine-mapping studies. The authors partially address the tradeoff between enrichment and recall, which is frequently swept under the rug. Their approach ought to be of high interest to the broad genetics and gene regulation communities.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: Disease and trait associated genetic variation is primarily localizes to non-coding regulatory DNA. A wide-variety methods and assays are routinely used to delineate putative regulatory elements. In this manuscript Bidde et al. sought to evaluate which of these assays is useful for identifying and prioritize trait-associated regulatory variation. To these ends the authors perform enrichment analysis on a genetic variation from different sources encompassing both molecular phenotypes (QTLs) and trait-associated variation via GWA-studies. The authors show that genetic variants localizing to DNase I footprints and within elements associated with RNA production (enhancer RNA) are maximally enriched for both molQTLs and trait-associated variation vs. other markers of regulatory DNA. Overall, I find that this manuscript is technically sound, and is consistent with prior studies (namely the enrichment of GWAS variants within DNase I footprints -- Vierstra et al. 2020). I suggest only a few additional analyses and edits to the presentation.

      Major comments:

      1. The authors should expand the QTL studies and the GWAS variants via LD and recompute the enrichments. For example, I would take all variants in high LD (r2 > 0.8 or 0.9) with either a QTL or GWA-variant. This will likely increase total variants overlapping an annotation, but reduce overall enrichment (odds-score), and possibly provide some information about which chromatin marks are more associated with "causal" variants.
      2. Can the authors comment on why eRNAs seem to be such a strong marker of functional variation? Are these just "strongest" (most accessible) distal elements? I would assume that these peaks have high overlap with chromatin accessibility peaks.
      3. Would the ATAC-seq enrichment increase if the authors stratified regions by signal rather than aggregating all peaks? The vast majority of chromatin accessibility peaks are very weak and could be false-positives. Lets imagine that in each dataset 1% of peaks are FPs and that the FP peaks are mostly randomly distributed accross the genome. As such, aggregating hundreds to thousands of samples would have many FP peaks and greatly affect the enrichment analysis. Conversely, DNase I footprints are found in high signal peaks that are less likely to be false-positives. One approach to deal with this is to select ATAC-seq peaks matched to the peak signal in DHS peaks with footprints.

      Minor comments:

      1. Figure 1c -- no legend is provided specifying what the bar colors represent.
      2. Pg. 10 -- "To overcome this, detection of footprints could be improved by enzymatic digestion bias correction (Calviello et al., 2019)." The DNase I footprinting dataset used in this paper performs extensive bias correction using a 6mer statistical model. Nevertheless, I completely agree the sentiment of the authors that low and variable sensitivity of footprinting is certainly driving a high false-negative rate with regards to comprehensively identifying function variants.
      3. Figure 3 is a little too complicated for its purpose, which is to show the enrichment of bQTLS, caQTLs and raQTLs.

      Significance

      This manuscript provides a rigorous analysis characterize how various markers of chromatin help aid in the interpretation of non-coding genetic variation. The findings are not entirely novel, however, the analyses and approaches described are nevertheless useful for variant prioritization. This manuscript is broadly applicable and useful to anyone interested studying non-coding genetic variation.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      The paper by Yammine et al addresses a major problem in peculiarities of genotype to phenotype manifestation in collagen II and chondrodysplasia. It is a lucid and comprehensive study detailing what they see as the fundamental mechanism of Gly1170 Ser mutated Col2a1 gene.

      At the heart of the matter is the debunking of the results from a mouse model generated by liang et al (Plos one 2014) paper in which the authors suggested that the phenotype seen only homozygous mice (heterozygous mice appear normal), was related to ER stress -UPR-apoptosis cascade resulting in the chondrodysplasia. Yammine et al paper uses a different model, a robust human iPSC-based tissue, with a CRISPRed variant show that despite the ability of the variant chondrocytes to deposit a Gly1170Ser-substituted collagen II in both the hetero- and homozygous models, is not accompanied by any substantive UPR. The authors of this current paper also argue that their model system is most closely resemble the human context, where heterozygous individual show pathology.

      We appreciate the Reviewer highlighting the significance of this manuscript addressing a major issue in the field.

      I have sieved all the data related to this topic and have gone back to examine the data and what struck me was the repeated use of the phrase "slow to fold" in the current paper and wondered whether the element of "TIME" is as important in the chondrogenesis of either models and it is this element that generate the difference between the two results? While iPSC-based tissue takes up to 44 days, a female mouse would have had two litters in this time and made many more growth plates. Could it be by "slowing" the chondrogenesis pathway, which is part of the procedure of differentiation of iPS cells into chondrocytes, the ER is not as "stressed" as in mouse development? I would like the authors to reflect and comment and put forward their view given UPR signaling pathways play a crucial role in chondrocytes in phases of high protein synthesis, e.g., during bone development by endochondral ossification (Journal of Bone Metabolism, vol. 24, no. 2, pp. 75-82, 2017).

      The Reviewer here emphasizes a likely benefit of the human model that we had not previously considered, as differentiation and growth in our model are indeed far more similar to humans than the rapid timeline in mice. We also note that the evidence that collagen is slow to fold comes from the gold standard assay in the field for collagen folding rate, a point discussed in greater detail in response to Reviewer 2’s query (see below).

      The literature evidence does indicate that transient UPR signaling is relevant for chondrogenesis. We selected UPR timepoints that do not interface with the differentiation process, but rather the tissue deposition process while chondrocytes are still actively depositing and maintaining the extracellular matrix, to be able to distinguish a physiological transient UPR during differentiation from a potential chronic and possibly pathologic one. We now clarify this point in the manuscript (see text below).

      “These timepoints were selected to reflect an early and a late stage of cartilage maturation, but with both timepoints harvested post-chondrogenesis so as not to interfere with the physiologic transient UPR activation that can be important in that process.”

      This is not withstanding the good argument given by the authors in defending their robust results, namely that the there is no evidence that the hydrophilic triple-helical domain of pro-collagen binds BiP, the main detector of accumulated misfolded proteins. What then do they make out of the immunostaining and qPCR with ER stress related genes in Liang et al paper?? I know that the data is not theirs but a comment on the indisputable data gives the reader a better understanding.

      It is critical to note that the evidence for ER stress that induces the UPR is, at best, exceptionally weak for heterozygotes in the Liang et al paper, and arguably also weak for homozygotes. Liang et al observed, via quantitative PCR, that the mRNA levels of just Chop (which is also a marker of the integrated stress response and not a good readout for UPR activity) and ATF6 (whose RNA-level upregulation is not a standard marker of the UPR) were significantly upregulated in the disease-relevant heterozygous mice – given that other (more valid) UPR markers were not altered, this is not so different from our observation of a lack of UPR in the disease-relevant heterozygotes.

      A somewhat more comprehensive set of UPR markers, including Chop, Xbp1(Total and Spliced), Grp78 (BiP), ATF4, and ATF6, was significantly upregulated only in homozygous mice compared to wild-type. PERK is one of many kinases upstream of ATF4 and Chop that can be activated by a variety of processes (the pathway is part of the integrated stress response, for example). Moreover, transcriptional upregulation of ATF4 (which is actually induced translationally, not transcriptionally) and ATF6 (which is actually induced proteolytically, not transcriptionally) are not normally used to read out UPR activation, so it is not so clear to us that a robust UPR was induced even in homozygotes. Moreover, there was not a substantial increase in Xbp1-S (S = spliced) relative to Xbp1-T (T= total) in the study of homozygous mice, which is the most appropriate measure of UPR activation – rather than change in Xbp1-T and Xbp1-S. The use of mostly non-standard genes to assess UPR induction, the weak upregulation of BiP (2.5-fold), and the unchanged ratio of Xbp1-S to Xbp1-T raise some questions regarding UPR induction even in the homozygotes. Regardless of these homozygote data, as noted above, the evidence for a UPR in heterozygotes is very weak, despite ER stress being the focus of the Liang et al paper.

      With respect to immunostaining, Liang et al observed that tissue from homozygous mice (but not heterozygotes) contained significantly more apoptotic cells. Apoptosis could be a result of chronic, unresolved UPR signaling, but it could also result from any number of other pathways and is certainly not direct evidence for UPR-inducing ER stress. Additionally, for the homozygote apoptosis assay, Liang et al do not note how many mice were analyzed for each genotype, a value they did report for their other assays. While examining multiple sections for each genotype is valuable (they state ≥10), the assessment of biological replicates (additional mice) seems critical to confidently reach a conclusion.

      Although I understand the choice of cell lines for overexpression, the transfection of the HT-1080 cells using wild-type and Gly1170Ser COL2A1encoding plasmids are not a match to the in vivo model (variation of efficiency, etc.) the appearance of BiP even at a lower fold increase does not negate ER stress, as the authors acknowledge but more important is what other paracrine signals which triggers the UPR signally pathway which is not linked to BiP? or an iPS system may lack? Is there anything else not only ATF6α (activating transcription factor 6 alpha), but IRE1α (inositol-requiring enzyme 1 alpha), and PERK (protein kinase RNA-like endoplasmic reticulum kinase).

      Our finding that the UPR is not activated is based on comprehensive RNA-sequencing performed in the physiologically more relevant iPSC-derived chondrocyte, as opposed to the tumor cell line HT-1080. Our interactomic finding that BiP interacts to the same extent with wild-type and Gly1170Ser procollagen-II (in HT-1080 cells) strongly supports our proposal that the reason the UPR is not activated is that BiP fails to recognize unfolded triple-helical domains.

      We note that, although HT-1080 cells are not a perfect match, they are the most accessible option for interactome-based studies. Because there is no MS-grade antibody for collagen-II IP, we need to IP a transfected, tagged collagen. We cannot do this in chondronoids, or in isolated chondrocytes that transfect poorly and rapidly dedifferentiate. Critically, Prockop and co-workers extensively validated HT-1080 cells as a platform for fibrillar collagen biochemical studies in Matrix 1993, 13, 399. Our own lab further characterized their capacity to properly handle fibrillar collagen variants in great molecular detail in ACS Chem Biol 2016, 11__, __1408.

      Since our chondronoid system contains only chondrocyte cells, as is the case in cartilage, the cells can receive paracrine signals from other chondrocytes, but not other cell types. In joints within a whole animal, it is true that paracrine crosstalk occurs between different cell types of different tissues, including inflammatory cells for example. The chondronoid is very useful for elucidating the defects that occur at the chondrocyte-level, without confounding secondary effects. At the chondrocyte-level, Gly1170Ser-substituted procollagen-II does not activate the UPR.

      The Reviewer’s comment regarding the absence of paracrine signals in an iPSC-based system is well-taken, and we added discussion as follows:

      “These observations indicate that the chondrocytes were not raising such stress responses, at least when examined in the absence of paracrine signals from other cell types in the joint.”

      The authors have given us plenty of alternatives that are relevant, and they prepared us for yet another paper on articular cartilage using iPS tissue model which I am looking forward to.

      We are also excited about the upcoming potential of this model system!

      Significance

      I think this paper is publishable and it is important in understanding the mechanism by which mutation in collagen type II affect chondrogenesis and therefore bone formation. This paper will appeal to musculoskeletal scientist especially those who are interested in bone and its pathology. It would be important for the authors to respond to the critique of "TIME" and speed of protein synthesis which create a duress in the ER pathway.

      We greatly appreciate the Reviewer’s comment again on the significance of this work, and their scholarly input which has substantially improved the paper. We hope they will agree that the manuscript is now ready for publication.

      Reviewer #2

      Evidence, reproducibility and clarity

      *System: The investigators have used a human iPSC chondrocyte model system to investigate the biochemistry of the Chondrodysplasia caused by the p.Gly1170Ser mutation in the type II collagen gene (COL2A1). They studied presumably homogeneous chondronoids formed by 3 cell lines they previously reported in which the chondrocytes were either homozygous wild type for the gene, homozygous for the Cas Crispr induced mutation or heterozygous for the two alleles (their refs 42-45). In addition, they utilized cultured HT1080 human fibrosarcoma cells transfected with wild type and mutant Col2A1 to study differences in the interactomes of the two proteins.

      *

      *Analytic Parameters: They investigated the extracellular matrix formed by the three cells using collagen and proteoglycan staining and TEM and the transcriptional responses in chondronoids expressing the wild type and mutant genes.

      *

      *Observations: matrix formation was defective in the two mutation bearing cell populations, reflecting defective fibril formation proportional to the abnormal gene dose. They found increased accumulations of post-translational modifications (hydroxylation, and O-glycosylation) on the mutant collagen extracted from the chondronoids and EM evidence of collagen retention in the ER. They studied the comparative transcriptional profiles in the three phenotypes and failed to find a profound UPR response late in culture and only a mild upregulation of UPR genes in the young cultures. They could not find evidence for activation of the ISR except in the homozygous mutant cells.

      *

      *Using transfected HT-1080 cells (previously shown by these investigators not to express endogenous pro-collagen II but able to synthesize transfected pro-collagen genes) they were able to study the comparative wt and mutant pro-collagen interactomes.

      *

      Conclusions: They conclude that the p.gly1170ser mutation in Col2A1 results in abnormal folding which results in trapping of the protein in the ER and some interaction with cellular elements of the proteostatic response. They concluded that the cellular proteostasis machinery can recognize slow-folding Gly1170Ser through increased interactions with certain ER network components but not in the same fashion that has been described for liver cells producing mutated versions of high volume secreted proteins.

      We appreciate this careful summary of our work.

      *Major comments:

      *

      Their first conclusion, stated in the abstract, "Biochemical characterization reveals that Gly1170Ser procollagen-II is notably slow to fold and secrete." that the mutant polypeptide chain is slower folding than the wild type chain is based on the premise that the longer the chains are in the ER the greater the degree of lysine hydroxylation and O-glycosylation. Although this may be true, they do not provide a reference and I could not find a definitive description of the phenomenon. Their reference 48 only discusses the occurrence of intracellular post-translational modification of the lysines and continuing modification extracellularly but does not relate these phenomena to the rate at which the peptides traverse the cell. I think the reader would benefit from seeing experiments in which the rate of folding and secretion of the wild type and mutant chains are measured and the degree of post-translational modification are compared. Cabral WA et al showed differences in collagen folding and secretion rates in cyclophilin wt, knockouts and heterozygotes osteoblasts and fibroblasts by western blots. (2014) Abnormal Type I Collagen Post-translational Modification and Crosslinking in a Cyclophilin B KO Mouse Model of Recessive Osteogenesis Imperfecta. PLoS Genet 10(6): e1004465. doi:10.1371 / journal.pgen. 1004465). Performing such experiments in their chondronoids would confirm the authors' interpretation that the increased post-translational modification portrayed in their figure 4 reflects slowed folding and secretion related to the mutation.

      We apologize for failing to provide essential background references and information to assess our assay for slow folding/secretion of procollagen. In fact, slow migration on SDS-PAGE is not only a widely used assay for comparing the rate of folding of procollagens, it has also remained the gold standard in the field for the past forty years. The studies cited below are some of the seminal papers in the field linking collagen’s rate of folding with its extent of posttranslational modifications and its electrophoretic mobility. We have now updated our citations accordingly.

      1. Bateman, J.F.; Mascara, T.; Chan, D.; Cole, W.G. “Abnormal type I collagen metabolism by cultured fibroblasts in lethal perinatal osteogenesis imperfecta” Biochem J 1984, 217, 103.
      2. Bonadio, J.; Holbrook, K.A.; Gelinas, R.E.; Jacob, J.; Byers, P.H. “Altered triple helical structure of type I procollagen in lethal perinatal osteogenesis imperfecta” J Biol Chem 1985, 260, 1734.
      3. Bateman, J.F.; Chan, D.; Mascara, T.; Rogers, J.G.; Cole, W.G. “Collagen defects in lethal perinatal osteogenesis imperfecta” Biochem J 1986, 240, 699.
      4. Godfrey, M.; Hollister, D.W. “Type II achondrogenesis-hypochondrogenesis: Identification of abnormal type II collagen” Am J Hum Genet 1988, 43, 904. The basis for this collagen-specific assay of folding rate is that the ER-localized procollagen proline and lysine hydroxylases require monomeric collagen strands as substrates, and cannot accommodate a folded triple helix in their active sites. Thus, accumulation of post-translational modifications on collagen depends on the procollagen triple-helical domain’s residence time as an unfolded monomeric region of the assembling triple-helical trimer within the ER. Some fraction of the hydroxylated lysines are later glycosylated, which slows migration on SDS-PAGE gels. We have now clarified our slow folding conclusion with more precise references and discussion in the manuscript.

      Pulse-chase experiments like those suggested by the Reviewer would indeed be beneficial if they were possible in this system, but they simply are not. Although it might be possible to soak in a radiolabeled amino acid over a short time period, the assay still relies on separating the cell fraction from the secreted fraction. This is possible in monolayer cultures, but in a chondronoid composed of complex cartilage and cells we have no way to do it. One could propose that we extract the chondrocytes and then do the pulse-chase in a monolayer culture, but this unfortunately is also not possible as chondrocytes do not behave well outside the tissue setting and rapidly differentiate into other cell types. Fortunately, the procollagen overmodification assay is a widely used and well-accepted measure of slow folding, and thus addresses the issue.

      I think Figure 4 needs more explanation for the reader. While, as expected, the homozygous mutant band is much slower than the homozygous wild type band, in the heterozygotes the band is intermediate rather than showing a discrete mixture of wild type and mutant proteins, reflecting different degrees of post-translational modification. Is this a function of mixed triple helices with heterogeneous degrees of post-translational modification? It deserves more comment, since the argument relating the degree of post-translational modification to the rate of folding is dependent on this observation. It would also be helpful to show the whole gel with collagen II markers.

      We modified Figure 4 __to show the whole gel (in the SI, see __Fig. S3) and molecular weight markers. It also shows the wild-type collagen-II band. Most of the procollagen produced by the heterozygote is heterotrimeric for the disease-causing substitution (>87% of trimers will contain at least one mutant chain and thus experience delayed folding) and, therefore, the diffuse banding structure is to be expected. Further, we would speculate that in these challenged ER, even the folding of wild-type only trimers is impaired. The Reviewer’s comment suggests there may be some basis for that speculation. We added a note to this effect.

      “The presence of a single broad, slow-migrating band as opposed to distinctive overmodified mutant versus normally modified wild-type strands is due to fact that the vast majority of trimers formed in heterozygotes (>85%) contain at least one Gly1170Ser strand that delays triple-helix folding.”

      Another approach to the question of intracellular accumulation due to a slow rate of folding of the mutant collagen would be to perform pulse chase labeling of the three types of chondronoids with radiolabeled amino acids and sugars and processing the media and lysates with analysis using antibodies specific for the two collagen chain types. Given the authors extensive experience in studying collagen biosynthesis (e.g. Chan et al J. Biochem. Biophys. Methods 36 (1997) 11-29), such a supporting study would firmly establish whether the rate of folding/secretion differs between the wt and the homozygous and heterozygous chondroidinomas. Until the slow folding can be directly demonstrated in a quantitative fashion rather than by monitoring the secondary phenomenon of post-translational modification the hypothesis remains unproven.

      Discussed above in response to the Reviewer’s earlier suggestion of pulse-chase and question regarding the post-translational modification assay, unfortunately the pulse-chase experiment is infeasible. Fortunately, the modification-based assay is already the gold standard in the collagen field.

      Another issue that does not appear to be addressed is the consequence of having misfolded collagen chains in the dilated ER. Liang et al, using mice transgenic for one or two copies of the mutant human gene showed apoptosis in the homozygotes but not in the hets a finding similar to that of Kimura et al using transgenics carrying a different human COL2A1 mutation. Okada et al, using chondrocytes converted from human fibroblasts with clinical collagenopathy (heterozygous), although not the same mutation as in the present study, showed dilated ER and some level of apoptosis in the cultured cells. Hintze et al, examining chondrocytes expressing different mutants associated with different forms of spondyloepiphyseal dysplasia, suggested that the degree of stability of the mutations might determine whether apoptosis occurred, i.e. the thermolabile p.R989C was associated with apoptosis while cells expressing the more thermostable mutants p.275C, P.719C and p.G853E did not reveal any evidence for ongoing apoptosis R989. Is it possible that the smaller size of the homozygous chondronoids reflect fewer cells rather than less matrix (or both) as result of apoptosis? Examination of the chondronoids with reagents for caspase 3 or Tunel staining. One could also measure by Col/DNA ratio in wt, hets and homos. It might also have been useful for these experiments been more quantitative, i.e. by cell sorting rather than by eye. Would ImageJ software been helpful?

      We greatly appreciate this suggestion. We now added results of TUNEL assays performed on sections of the chondronoids (see Fig. 8), including quantification of the results. Notably, we do not observe a significant difference in apoptosis between genotypes at the timepoint considered. This result is also supported by our transcriptional data, where we do not observe upregulation of apoptosis-related pathways, via the UPR or otherwise.

      It is also unclear as to the conformation of chains trapped in the ER. There are many examples in which the natural tendency of misfolded proteins is to aggregate. This is certainly true in the neurodegenerative diseases. While at the magnification used here in the TEM's the ER inclusions appear homogeneous and amorphous, perhaps at higher magnification/resolution a more discrete structure might be seen.

      From collagen-II immunohistochemistry confocal images, the intracellular collagen appears sometimes as aggregated puncta, and in other cases more diffuse and amorphous. Given this heterogeneity, we were not able to readily obtain clear additional structural characterization of the intracellular procollagen-II fraction.

      *While the choice of time points for the transcriptional analysis, i.e. early and late seems well thought out, the lack of a significant response may be due to the timing and it might have been useful to do earlier or later time points or intermediate time points in case the response was transient, particularly since other laboratories have reported UPR activation and abnormalities in the context of the silencing of Xbp1, the spliced form of which is a major driver of at least one arm of the UPR. *

      While our RNA-sequencing results at the specific timepoints we chose cannot rule out a transient activation of the UPR, they do indicate that chronic, unresolved UPR signaling is not the underlying cause of pathology, which is the main point we are making.

      The notion that pro-collagen is largely hydrophilic without the potential for exposure of hydrophobic regions that might engage BiP, thus is not sensitive to BiP sensing, is interesting. Is it possible that the tendency of the mutant polypeptides to form the triple helix which in itself acts as kind of a self chaperoning structure? Looking at the kinetics of assembly inside the cell, see suggestions above, might provide further insight into the process beyond that obtained by looking at the modified state of the lysines.

      We believe this notion is very strongly supported by the interactomic experiment showing that BiP fails to preferentially engage the poorly folding triple-helical variant. There are, however, many other chaperones and folding enzymes that assist collagen folding, including prolyl isomerases and Hsp47. Hence, it is not clear to us that substantial self-chaperoning occurs. Still, the self-chaperoning idea is intriguing, and we will note that prior work does indicate that triple-helical domains of individual procollagen polypeptides are strongly pre-organized for triple-helix formation (for a review, see Annu Rev Biochem 2009, 78, 929). That said, we hesitate to speculate here on the self-chaperoning idea without additional evidence.__

      __Minor comments:

      As I mentioned above, while the transcriptional interactome experiments are computationally sophisticated the cell biology and biochemistry would benefit from more and better quantitation.

      We have included quantitation of the extent of intracellular procollagen accumulation and the extent of apoptotic cells, which we hope helps to address this point.

      The paper is written in a style in which results and discussion are intermingled. Personally I prefer that the introductions are short, the results clearly and briefly presented and the discussion deals with the interpretation and conclusions. I thought that whole paragraphs could have been omitted. e.g. in the introduction *Omit paragraph "The fibrillar.........achondrogenesis type II" Omit paragraph "Conventional and... for example." Omit "Excitingly........in vitro and in vivo (36)." Results: First paragraph repeats last paragraph of introduction and not necessary in one place or the other, condense. *

      We appreciate this feedback and have accordingly edited the manuscript for clarity and brevity, which includes deleting or significantly shortening all the paragraphs indicated by the Reviewer. These improvements are indicated in the track-changes version of the manuscript we resubmitted.

      Figure 2 by eye MGP (Matrix gla protein inhibits vascular calcification of type II collagen) seems highly over-expressed in the homozygous mutants; MGP is supposedly an inhibitor of calcification, does its over-expression here reflect something about the adequacy of the matrix

      Overexpression of MGP could indeed reflect a defect in the matrix of the homozygous variants. It is also likely a reflection of the delayed hypertrophy and maturation observed in the homozygous variants, as matrix calcification is a step in the endochondral ossification process. We did not follow-up on this particular observation, as it is exclusively observed in the less clinically relevant homozygous variant. We added a note to the manuscript to capture the Reviewer’s point about MGP, as below:

      “The upregulation in the homozygous system of Matrix Gla Protein (MGP) (Fig. 2A), which inhibits vascular calcification of the matrix in vivo, further supports the delay in hypertrophy, and could lead to differences in the biomechanical properties of the matrix.”

      Figure 5 is good but can it be confirmed by quantitative biochemistry?

      We have included quantitation of the extent of intracellular procollagen accumulation and the extent of apoptotic cells.

      __ __Did you stain with antibodies to other ER resident chaperones other than calreticulin?

      Yes, we also stained the ER with PDI. However, the chondronoids require extensive optimization for immunostaining and we could obtain much better images using the ER marker for calreticulin, hence our choice of images to present in the manuscript.__

      __Do cells with large amounts of intracellular G1170S die?

      As indicated by the newly included TUNEL data, interestingly, even cells expressing exclusively the Gly1170Ser variant of procollagen-II do not seem to apoptose at a significantly higher rate than wild-type, at least at the timepoint considered. As mentioned above, we added these data as Fig. 8, and added discussion of these results and methodology in the relevant sections of the manuscript.__

      __Does higher magnification EM reveal any structure of the material within the dilated ER?

      We have so far not been able to use EM to obtain higher-resolution insight into intracellular procollagen structures, but we will work on this idea in future studies.__

      __Are there any inflammatory cells in the Chondronoids? To respond to aberrant proteins?

      There should not be any such cells present in the chondronoids, and we indeed do not observe any inflammatory response. As noted in the response to Reviewer 1, we added discussion regarding the absence of paracrine signals in this type of model system, which we do believe has major advantages for biochemical studies like those performed here.__

      __Paragraph

      * "Bypassing the UPR.......often do not" Is discussion not results*

      Corrected, thanks.

      Significance

      The experimental system described here is clearly the wave of the present. Generating human ipSC's of different lineages is now being exploited to study a variety of disorders, to achieve better understanding of pathogenesis at the molecular level to serve as appropriate models for drug development, particularly in the context of high throughput screening. In addition, as in this case, relatively rare autosomal dominant disorders with phenotypes that resemble more common sporadic disease, may allow the development of treatments that are relevant for the sporadic disorder. While it is likely that the osteoarthritis that develops in the carriers of the COL2A1 mutations is a function of the host response to the aberrant mechanics resulting from the defective extra-cellular matrix caused by the mutation, having a pure system in which the primary defect can be corrected and the predisposing matrix deficit reversed, could allow normal reparative processes to mitigate the functional joint disability. While the transgenic mice are useful as a disease model, they represent not only the expression of the primary defect but the host pathophysiologic response to that defect, i.e. in this case how the mouse responds to the defective matrix state and whether those responses add additional pathogenic factors to the disease course. Having a tool in which to relatively assess the pure chondrocyte effect should allow more granular analysis of the primary process.

      We appreciate the Reviewer’s careful and enthusiastic assessment of the significance of our work.__

      __

      Their findings reinforce the notion that involvement of the UPR as well as the other arms of the proteostatic response in chondrocytes expressing a variety of mutant collagens suggests a degree of heterogeneity, perhaps depending on the mutation involved. While I do not believe that their current data prove or rigorously test their proposed hypothesis, i.e. that "perhaps due to the pathologic substitution occurring within a triple-helical domain that lacks hydrophobic character, this ER protein accumulation is not recognized by cellular stress responses, such as the unfolded protein response", it is worth considering.

      We provide that hypothesis as a reasonable explanation for the absence of a UPR, and it is strongly supported by our interactomic studies. Furthermore, neither we nor others have found evidence for BiP binding the triple helical domain of procollagen in any other studies. Still, that hypothesis is not the core point of the paper and we do appreciate the Reviewer’s perspective.

      Given the fact that this is a relatively small field with a variety of observations concerning the role of proteostasis and the UPR in particular which seem to vary depending on the system, i.e. transgenic mice, transfected fibroblasts, the chondroidomas, these observations particularly with additional biochemistry to confirm their notions regarding folding rates etc, represent a useful technical addition to the field and should be interesting for people working on collagen biology, arthritis and protein folding.

      I am not a collagen biologist hence my knowledge of some of the nuances of collagen biology may not be extensive. My own areas of interest include the assembly of multi-peptide proteins (such as immunoglobulins) for secretion; the mechanisms that allow them to exit the cell and the aggregation of misfolded proteins as exemplified by the amyloidoses and other forms of clinically relevant protein aggregation. Hence, I am very familiar with tissue culture, transgenic animals as disease models, studies of protein aggregation, and as a former rheumatologist, osteoarthritis.

      We greatly appreciate the Reviewer providing such valuable and scholarly input from the perspective of a scientist with deep expertise in the secretory pathway and other diseases of protein misfolding, as well as from rheumatology. Specifically from the perspective of expertise in collagen biology/biochemistry, we hope that our detailed explanations of assays that are possible versus not possible with collagen in this system, the additional context for why our assessment of the modification of procollagen is correlated with folding/secretion rate, and the further analyses added to the paper, now make a convincing case that the improved manuscript is of high significance and is ready for publication.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      System: The investigators have used a human iPSC chondrocyte model system to investigate the biochemistry of the Chondrodysplasia caused by the p.Gly1170Ser mutation in the type II collagen gene (COL2A1). They studied presumably homogeneous chondronoids formed by 3 cell lines they previously reported in which the chondrocytes were either homozygous wild type for the gene, homozygous for the Cas Crispr induced mutation or heterozygous for the two alleles (their refs 42-45). In addition, they utilized cultured HT1080 human fibrosarcoma cells transfected with wild type and mutant Col2A1 to study differences in the interactomes of the two proteins.

      Analytic Parameters: They investigated the extracellular matrix formed by the three cells using collagen and proteoglycan staining and TEM and the transcriptional responses in chondronoids expressing the wild type and mutant genes.

      Observations: matrix formation was defective in the two mutation bearing cell populations, reflecting defective fibril formation proportional to the abnormal gene dose. They found increased accumulations of post-translational modifications (hydroxylation, and O-glycosylation) on the mutant collagen extracted from the chondronoids and EM evidence of collagen retention in the ER. They studied the comparative transcriptional profiles in the three phenotypes and failed to find a profound UPR response late in culture and only a mild upregulation of UPR genes in the young cultures. They could not find evidence for activation of the ISR except in the homozygous mutant cells. Using transfected HT-1080 cells (previously shown by these investigators not to express endogenous pro-collagen II but able to synthesize transfected pro-collagen genes) they were able to study the comparative wt and mutant pro-collagen interactomes.

      Conclusions: They conclude that the p.gly1170ser mutation in Col2A1 results in abnormal folding which results in trapping of the protein in the ER and some interaction with cellular elements of the proteostatic response. They concluded that the cellular proteostasis machinery can recognize slow-folding Gly1170Ser through increased interactions with certain ER network components but not in the same fashion that has been described for liver cells producing mutated versions of high volume secreted proteins.

      Major comments:

      Their first conclusion, stated in the abstract, "Biochemical characterization reveals that Gly1170Ser procollagen-II is notably slow to fold and secrete." that the mutant polypeptide chain is slower folding than the wild type chain is based on the premise that the longer the chains are in the ER the greater the degree of lysine hydroxylation and O-glycosylation. Although this may be true, they do not provide a reference and I could not find a definitive description of the phenomenon. Their reference 48 only discusses the occurrence of intracellular post-translational modification of the lysines and continuing modification extracellularly but does not relate these phenomena to the rate at which the peptides traverse the cell. I think the reader would benefit from seeing experiments in which the rate of folding and secretion of the wild type and mutant chains are measured and the degree of post-translational modification are compared. Cabral WA et al showed differences in collagen folding and secretion rates in cyclophilin wt, knockouts and heterozygotes osteoblasts and fibroblasts by western blots. (2014) Abnormal Type I Collagen Post-translational Modification and Crosslinking in a Cyclophilin B KO Mouse Model of Recessive Osteogenesis Imperfecta. PLoS Genet 10(6): e1004465. doi:10.1371 / journal.pgen. 1004465). Performing such experiments in their chondronoids would confirm the authors' interpretation that the increased post-translational modification portrayed in their figure 4 reflects slowed folding and secretion related to the mutation.

      I think Figure 4 needs more explanation for the reader. While, as expected, the homozygous mutant band is much slower than the homozygous wild type band, in the heterozygotes the band is intermediate rather than showing a discrete mixture of wild type and mutant proteins, reflecting different degrees of post-translational modification. Is this a function of mixed triple helices with heterogeneous degrees of post-translational modification? It deserves more comment, since the argument relating the degree of post-translational modification to the rate of folding is dependent on this observation. It would also be helpful to show the whole gel with collagen II markers.

      Another approach to the question of intracellular accumulation due to a slow rate of folding of the mutant collagen would be to perform pulse chase labeling of the three types of chondronoids with radiolabeled amino acids and sugars and processing the media and lysates with analysis using antibodies specific for the two collagen chain types. Given the authors extensive experience in studying collagen biosynthesis (e.g. Chan et al J. Biochem. Biophys. Methods 36 (1997) 11-29), such a supporting study would firmly establish whether the rate of folding/secretion differs between the wt and the homozygous and heterozygous chondroidinomas. Until the slow folding can be directly demonstrated in a quantitative fashion rather than by monitoring the secondary phenomenon of post-translational modification the hypothesis remains unproven.

      Another issue that does not appear to be addressed is the consequence of having misfolded collagen chains in the dilated ER. Liang et al, using mice transgenic for one or two copies of the mutant human gene showed apoptosis in the homozygotes but not in the hets a finding similar to that of Kimura et al using transgenics carrying a different human COL2A1 mutation. Okada et al, using chondrocytes converted from human fibroblasts with clinical collagenopathy (heterozygous), although not the same mutation as in the present study, showed dilated ER and some level of apoptosis in the cultured cells. Hintze et al, examining chondrocytes expressing different mutants associated with different forms of spondyloepiphyseal dysplasia, suggested that the degree of stability of the mutations might determine whether apoptosis occurred, i.e. the thermolabile p.R989C was associated with apoptosis while cells expressing the more thermostable mutants p.275C, P.719C and p.G853E did not reveal any evidence for ongoing apoptosis R989. Is it possible that the smaller size of the homozygous chondronoids reflect fewer cells rather than less matrix (or both) as result of apoptosis? Examination of the chondronoids with reagents for caspase 3 or Tunel staining. One could also measure by Col/DNA ratio in wt, hets and homos. It might also have been useful for these experiments been more quantitative, i.e. by cell sorting rather than by eye. Would ImageJ software been helpful? It is also unclear as to the conformation of chains trapped in the ER. There are many examples in which the natural tendency of misfolded proteins is to aggregate. This is certainly true in the neurodegenerative diseases. While at the magnification used here in the TEM's the ER inclusions appear homogeneous and amorphous, perhaps at higher magnification/resolution a more discrete structure might be seen.

      While the choice of time points for the transcriptional analysis, i.e. early and late seems well thought out, the lack of a significant response may be due to the timing and it might have been useful to do earlier or later time points or intermediate time points in case the response was transient, particularly since other laboratories have reported UPR activation and abnormalities in the context of the silencing of Xbp1, the spliced form of which is a major driver of at least one arm of the UPR. The notion that pro-collagen is largely hydrophilic without the potential for exposure of hydrophobic regions that might engage BiP, thus is not sensitive to BiP sensing, is interesting. Is it possible that the tendency of the mutant polypeptides to form the triple helix which in itself acts as kind of a self chaperoning structure? Looking at the kinetics of assembly inside the cell, see suggestions above, might provide further insight into the process beyond that obtained by looking at the modified state of the lysines.

      Minor comments:

      As I mentioned above, while the transcriptional interactome experiments are computationally sophisticated the cell biology and biochemistry would benefit from more and better quantitation.

      The paper is written in a style in which results and discussion are intermingled. Personally I prefer that the introductions are short, the results clearly and briefly presented and the discussion deals with the interpretation and conclusions. I thought that whole paragraphs could have been omitted. e.g. in the introduction

      Omit paragraph "The fibrillar.........achondrogenesis type II" Omit paragraph "Conventional and... for example." Omit "Excitingly........in vitro and in vivo (36)."

      Results:

      First paragraph repeats last paragraph of introduction and not necessary in one place or the other, condense. Figure 2 by eye MGP (Matrix gla protein inhibits vascular calcification of type II collagen) seems highly over-expressed in the homozygous mutants; MGP is supposedly an inhibitor of calcification, does its over-expression here reflect something about the adequacy of the matrix

      Figure 5 is good but can it be confirmed by quantitative biochemistry?

      Did you stain with antibodies to other ER resident chaperones other than calreticulin?

      Do cells with large amounts of intracellular G1170S die?

      Does higher magnification EM reveal any structure of the material within the dilated ER?

      Are there any inflammatory cells in the Chondronoids? To respond to aberrant proteins?

      Paragraph "Bypassing the UPR.......often do not" Is discussion not results

      Referees cross-commenting

      Are other reviewers concerned about the precise definition of slowed folding rather than utilizing the degree of post-translational modification as a surrogate?

      Significance

      The experimental system described here is clearly the wave of the present. Generating human ipSC's of different lineages is now being exploited to study a variety of disorders, to achieve better understanding of pathogenesis at the molecular level to serve as appropriate models for drug development, particularly in the context of high throughput screening. In addition, as in this case, relatively rare autosomal dominant disorders with phenotypes that resemble more common sporadic disease, may allow the development of treatments that are relevant for the sporadic disorder. While it is likely that the osteoarthritis that develops in the carriers of the COL2A1 mutations is a function of the host response to the aberrant mechanics resulting from the defective extra-cellular matrix caused by the mutation, having a pure system in which the primary defect can be corrected and the predisposing matrix deficit reversed, could allow normal reparative processes to mitigate the functional joint disability. While the transgenic mice are useful as a disease model, they represent not only the expression of the primary defect but the host pathophysiologic response to that defect, i.e. in this case how the mouse responds to the defective matrix state and whether those responses add additional pathogenic factors to the disease course. Having a tool in which to relatively assess the pure chondrocyte effect should allow more granular analysis of the primary process.

      Their findings reinforce the notion that involvement of the UPR as well as the other arms of the proteostatic response in chondrocytes expressing a variety of mutant collagens suggests a degree of heterogeneity, perhaps depending on the mutation involved. While I do not believe that their current data prove or rigorously test their proposed hypothesis, i.e. that "perhaps due to the pathologic substitution occurring within a triple-helical domain that lacks hydrophobic character, this ER protein accumulation is not recognized by cellular stress responses, such as the unfolded protein response", it is worth considering.

      Given the fact that this is a relatively small field with a variety of observations concerning the role of proteostasis and the UPR in particular which seem to vary depending on the system, i.e. transgenic mice, transfected fibroblasts, the chondroidomas, these observations particularly with additional biochemistry to confirm their notions regarding folding rates etc, represent a useful technical addition to the field and should be interesting for people working on collagen biology, arthritis and protein folding.

      I am not a collagen biologist hence my knowledge of some of the nuances of collagen biology may not be extensive. My own areas of interest include the assembly of multi-peptide proteins (such as immunoglobulins) for secretion; the mechanisms that allow them to exit the cell and the aggregation of misfolded proteins as exemplified by the amyloidoses and other forms of clinically relevant protein aggregation. Hence, I am very familiar with tissue culture, transgenic animals as disease models, studies of protein aggregation, and as a former rheumatologist, osteoarthritis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper by Yammine et al addresses a major problem in peculiarities of genotype to phenotype manifestation in collagen II and chondrodysplasia. It is a lucid and comprehensive study detailing what they see as the fundamental mechanism of Gly1170 Ser mutated Col2a1 gene.

      At the heart of the matter is the debunking of the results from a mouse model generated by liang et al (Plos one 2014) paper in which the authors suggested that the phenotype seen only homozygous mice (heterozygous mice appear normal), was related to ER stress -UPR-apoptosis cascade resulting in the chondrodysplasia. Yammine et al paper uses a different model, a robust human iPSC-based tissue, with a CRISPRed variant show that despite the ability of the variant chondrocytes to deposit a Gly1170Ser-substituted collagen II in both the hetero- and homozygous models, is not accompanied by any substantive UPR. The authors of this current paper also argue that their model system is most closely resemble the human context, where heterozygous individual show pathology.

      I have sieved all the data related to this topic and have gone back to examine the data and what struck me was the repeated use of the phrase "slow to fold" in the current paper and wondered whether the element of "TIME" is as important in the chondrogenesis of either models and it is this element that generate the difference between the two results?

      While iPSC-based tissue takes up to 44 days, a female mouse would have had two litters in this time and made many more growth plates. Could it be by "slowing" the chondrogenesis pathway, which is part of the procedure of differentiation of iPS cells into chondrocytes, the ER is not as "stressed" as in mouse development? I would like the authors to reflect and comment and put forward their view given UPR signaling pathways play a crucial role in chondrocytes in phases of high protein synthesis, e.g., during bone development by endochondral ossification (Journal of Bone Metabolism, vol. 24, no. 2, pp. 75-82, 2017).

      This is not withstanding the good argument given by the authors in defending their robust results, namely that the there is no evidence that the hydrophilic triple-helical domain of pro-collagen binds BiP, the main detector of accumulated misfolded proteins. What then do they make out of the immunostaining and qPCR with ER stress related genes in Liang et al paper?? I know that the data is not theirs but a comment on the indisputable data gives the reader a better understanding.

      Although I understand the choice of cell lines for overexpression, the transfection of the HT-1080 cells using wild-type and Gly1170Ser COL2A1encoding plasmids are not a match to the in vivo model (variation of efficiency, etc.) the appearance of BiP even at a lower fold increase does not negate ER stress, as the authors acknowledge but more important is what other paracrine signals which triggers the UPR signally pathway which is not linked to BiP ? or an iPS system may lack? Is there anything else not only ATF6α (activating transcription factor 6 alpha), but IRE1α (inositol-requiring enzyme 1 alpha), and PERK (protein kinase RNA-like endoplasmic reticulum kinase). The authors have given us plenty of alternatives that are relevant, and they prepared us for yet another paper on articular cartilage using iPS tissue model which I am looking forward to.

      Significance

      I think this paper is publishable and it is important in understanding the mechanism by which mutation in collagen type II affect chondrogenesis and therefore bone formation.

      This paper will appeal to musculoskeletal scientist especially those who are interested in bone and its pathology.

      It would be important for the authors to respond to the critique of "TIME" and speed of protein synthesis which create a duress in the ER pathway.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility, and clarity (Required)):

      1. In this manuscript, Imoto et al. analyze the specific role of the Dynamin1 splice variant Dyn1xA in so-called ultrafast endocytosis, an important mechanism of synaptic vesicle recycling at synapses. In a previous publication (Imoto et al. Neuron 2022), some of the authors had shown that Dyn1xA, and not the other splice variant Dyn1xB, is essential for ultrafast endocytosis. Moreover, Dyn1xA forms clusters around the active zone for exocytosis and interacts with Syndapin 1 in a phosphorylation dependent manner. However, it was unclear which molecular interactions underlie the specific role of Dyn1xA. Here, the authors provide convincing evidence with pull down assays and CSP that Dyn1xA PRR interacts with EndophilinA1/2 with two binding sites. The first binding site lies in the part common to xA and xB, was previously characterized. The second site was previously uncharacterized, is specific for Dyn1xA, and is regulated by phosphorylation (phosphobox 2). The location of these splice variants and mutated forms at presynaptic sites correlate with the prediction made by the biochemical assays. Finally, the authors perform rescue experiments ('flash and freeze' and VGLUT1-pHluorin imaging experiments) to show that Dyn1xA-EndophilinA1/2 binding is important for ultrafast endocytosis. I find the results interesting, providing an important step in the understanding of the interplay between dynamin and the endocytic proteins interacting with it (endophilin, syndapin, amphiphysin) in the context of synaptic vesicle recycling. The manuscript is clearly written and for the most part the data supports the authors' conclusions (see specific comments below). However, there are some issues which need to be clarified before this manuscript is fully suitable for publication.

      We thank the reviewer for noting the importance of our study. Indeed, our previous study has raised the question as to why only the Dyn1xA splice variant mediates ultrafast endocytosis, and our current manuscript now resolves this issue.

      Introduction: the dynx1B Calcineurin binding motif is written PxIxIT consensus but actual sequence is PRITISDP. Is this a typo?

      The sequence is correct. One thing we failed to mention is that the last amino acid in this motif can be either threonine or serine for calcineurin binding, as we demonstrated previously [Jing, et al., 2011 JBC; PMC3162388]. We have amended the text as follows.

      1. calcineurin-binding motif (PxIxI[T/S]) 19.

      Figure 1: the difference between the constructs used in panels C and D is not clear. In D, is it a truncation without residues 796 and 845? If so, it should be labelled clearly in the Western blots. In Panel E, Dyn1xA 746-798 should be labeled Dyn1x 746-798 because it is common to both splice variants.

      We thank the reviewer for pointing this out. Both C and D used the full-length PRRs of Dyn1xA-746 to 864 and xB-746 to 851. To make the labeling clear, we changed Dyn1xA PRR to “Dyn1xA PRR (746-864)” and Dyn1xB PRR to “Dyn1xB PRR 746-851” in Figure 1. In the main text, we made the following changes.

      1. 4: “To identify the potential isoform-selective binding partners, the full-length PRRs of Dyn1xA746-864 and xB746-851 (hereafter, Dyn1xA-PRR and Dyn1xB-PRR, respectively).”

      Figure 1: For amphiphysin binding the authors write that "No difference in binding to Amphiphysin 1 was observed among these peptides (Figure1D-F)." They should write that Dyn1x 746-798 does not bind Amphiphysin1 SH3 domain, confirming the specificity of binding to the 833-838 motif.

      We edited the sentence as suggested.

      1. “Dyn1x 746-798 does not bind Amphiphysin1 SH3 domain (Figure 1G), confirming the specificity of binding to the 833-838 motif as reported in previous studies 29,30. (Figure 1D-F).”

      Figure S2. The panels are way too small to see the shifts and the labelling. Please provide bigger panels

      As suggested, we have now provided bigger panels in Figure S2, and amended the text and Figure legend accordingly.

      We also removed Figure S2B as it was not referred to in the text in any way. (It was the reverse experiment – HSQCs of 15N-labelled SH3 titrated with unlabelled dynamin).l

      Figure 2 panel B. There is a typo in the connecting line between the sequence and the CSP peaks. It is 846 instead of 864 (after 839).

      Corrected.

      Figure 3 panel E. In the text, the authors write that "Western blotting of the bound proteins from the R838A pull-down experiment showed that R838A almost abolished both Endophilin and Amphiphysin binding in xA806-864 (Figure 3D), and reduced Endophilin binding to xA-PRR (Figure 3E)." I think they should write "only slightly reduced Endophilin binding..." it is more faithful to the result and consistent with the conclusion that Endophilin A1 has two binding sites on Dyn1xA PRR.

      We have now provided quantitative data for R838A and R846A (Fig. 3F and G). Endophilin binding is significantly reduced with R846A.

      It is unclear why the R846A mutant affects binding of Dyn1xA 806-864 but not Dyn1xA-PRR-.

      The reviewer asks why the R846A mutant affects binding of Dyn1xA 806-864, but not so much of Dyn1xA-PRR. The explanation is simply that there are two endophilin binding sites in Dyn1xA-PRR. The first is not present in the xA806-864 peptide, while both are present in Dyn1xA-PRR (the full length tail). When doing pull-down experiments, the binding tends to saturate – even when the second site is blocked by R846A. The first site is still able to bind, and the binding appears as normal. The same applies to the R838A mutant.

      Moreover, it affects binding to endophilin as well as amphiphysin, and therefore it is not specific. It is thus not correct to write that "R846 is the only residue found to specifically regulate the Dyn1 interaction with Endophilin as a part of an SDE". In the Discussion (page 11), the authors refer to the R846A mutation as specifically affecting Endophilin binding. This should be toned down, as it also affects Amphiphysin binding. For this important point, the data on quantification of Endophilin binding should be presented.

      The reviewer’s concern is about our claims of specificity of Endophilin A binding in Dyn1xA R846 mutation experiments. The reviewer is correct, and we have now defined specific parameters for those claims. Specifically, we have added new quantitative data from the Western blots in Fig 3E (full-length Dyn1aX-PRR) as Fig 3F-G. We used full-length Dyn1aX-PRR rather than the xA806-864 peptide because the subsequent transfection experiments use full length Dyn1xA. In the new figures 3F and 3G, we quantified Endophilin A, Amphiphysin and Syndapin1 amounts from the multiple Western blots such as Figure 3E (now n=14, 6 experiments, each in with 2-4 replicates for Dyn1xA PRR). R846A mutated in Dyn1xA-PRR significantly reduces the binding to Endophilin A, but it does not significantly affect the binding to Amphiphysin 1and Syndapin1 (Fig 3G). Therefore, this particular Dyn1xA-PRR mutation specifically affects Endophilin A binding, in the context of the full-length tail Dyn1aX-PRR. To make these results clear, we modified the text as below.

      P7. “R838A and R846A caused smaller reductions in Endophilin binding compared to wild-type Dyn1xA-PRR, (Figure 3E, 3F, R838A, median 68.5 ; Figure 3G, R846A, median 59.3 % : R838A reduced the Dyn1/Amphiphysin interaction (Figure 3E, 3F, median 14.2 % binding compared to wild-type Dyn1xA-PRR). By contrast, R846A did not affect Amphiphysin and Syndapin binding to Dyn1xA-PRR (Figure 3E, 3G). Therefore, R846, being part of an SDE, is the only residue we found to specifically regulate the Dyn1 interaction with Endophilin in the context of the full length tail (DynxA-PRR)”.

      Additionally, the reviewer notes that “the authors refer to the R846A mutation as specifically affecting Endophilin binding. This should be toned down, as it also affects Amphiphysin binding.” In the light of the above data and new quantitative analysis (Fig 3F-G), we have clarified the conclusion. However, to be clear that this statement is only correct in the context of the full-length DynxA-PRR, we amended texts as follows:

      P7. “By contrast, R846A did not affect Amphiphysin and Syndapin binding to Dyn1xA-PRR (Figure 3E, 3G). Therefore, R846, being part of an SDE, is the only residue we found to specifically regulate the Dyn1 interaction with Endophilin in the context of the full length tail (DynxA-PRR)”.

      New legends for Figure 3F and G have now been added as follows.

      “(F) The binding of Endophilin A, and Amphiphysin 1 and Syndapin1 to Dyn1xA-PRR (wild type) or R838A mutant quantified from Western blots in (E). n=14 (6 experiments with 2-4 replicates in each). Median and 95% confidential intervals are shown. Kruskal-Wallis with Dunn’s multiple comparisons test (**p (G) The binding of Endophilin A, and Amphiphysin 1 and Syndapin1 to Dyn1xA-PRR (wild type) or R846A mutant quantified from Western blots in (E). n=14 (6 experiments with 2-4 replicates in each). Median and 95% confidential intervals are shown. Kruskal-Wallis with Dunn’s multiple comparisons test was applied (*p

      Figure 3F-G (which are now 3H and 3I in the revised text): what do the star symbols represent in the graphs? I guess the abscissa represents retention time. Please write it clearly instead of a second ordinate for molecular mass, which does not make much sense if this reflects the estimate for the 3 conditions.

      The “stars” are crosses (x) and represent individual data points. The figure legends have been updated for clarity. The reviewer is correct that the X-axis is retention time (min). The second Y-axis is needed to define the points in the curve marked with crosses (x’s). The legends for Figure 3H and I are now changed as follows.

      “(H) SEC-MALS profiles for Dyn1xA alone (in green), Endophilin A SH3 alone (in red) and the complex of the two (in black) are plotted. The x-axis shows retention time. The left axis is the corresponding UV absorbance (280 nm) signals in solid lines, and the right axis shows the molar mass of each peak in crosses. The molecular weight of the complex was determined and tabulated in comparison with the predicted molecular weight. x represent individual data points.

      (I) SEC-MALS profiles for a high concentration of Dyn1xA-PRR/Endophilin A SH3 complex (0.5 mg) (in dark blue) and a low concentration of Dyn1xA-PRR/endophilin A SH3 complex (0.167 mg) (in blue). The x-axis shows retention time. The left axis is the corresponding UV absorbance (280 nm) signals in solid lines, and the right axis shows the molar mass of each peak in crosses. The molecular weight of the complex was determined and tabulated in the table. x represent individual data points.”

      Figure 4: The statement that "By contrast [to Dyn1xA], Endophilin A1 or A2 formed multiple clusters (1-5 clusters)" is not at all clear on the presented pictures. The authors should provide views of portions of axons with several varicosities, for the reader to appreciate the cases where there are more EndoA clusters than Dyn1 clusters.

      In the revised Figure S4, we added additional STED images for a region of axons with more EndoA1/2 clusters than Dyn1xA clusters. The locations of Dyn1xA and EndoA1/2 clusters are annotated in each image based on the local maximum of intensity, which is determined using our custom Matlab analysis scripts (Imoto, et al., Neuron 2022; for the description of the methods, please refer to the Point #14 below). We also added Figure S3 to describe our analysis pipelines. In the Dyn1xA channel, outer contour indicates 50% of local maxima (boundary of Dyn1xA cluster) while inner contour indicates 70% of local maxima of the clusters. In the EndoA1/2 channel, local maxima of the clusters are indicated as points. To reflect these changes, we modified text as below.

      P 9. “By contrast, Endophilin A1 or A2 formed multiple clusters (1-5 clusters) (Figure S4)”

      The legends for Figure S4 are now as follows.

      “Figure S4. Additional STED images for Figure 4.

      (A) The top image shows an axon containing multiple boutons. Signals show overexpression of GFP-tagged Dyn1xA (Dyn1xA) and mCherry-tagged Endophilin A1 (EndoA1). The bottom images show magnifications of four boutons in the top image. Red hot look-up table (LUT) images on the right side of Dyn1xA and EndoA1 images are enhanced contrast images. Outer and inner contours represent 50% and 70% of local maxima of the Dyn1xA, respectively. Black circles represent local maxima of Endophilin A1. In these boutons, multiple EndophilinA1 puncta are present.

      (B) The top image shows an axon congaing multiple boutons. Signals show overexpression of mCherry-tagged Dyn1xA (Dyn1xA) and GFP-tagged Endophilin A1 (EndoA1). The bottom images show magnifications of four boutons in the top image. Red hot LUT images on the right side of Dyn1xA and EndoA2 images are enhanced contrast images. Outer and inner contours represent 50% and 70% of local maxima of the Dyn1xA, respectively. Black circles represent local maxima of Endophilin A2. In these boutons, multiple EndophilinA2 puncta are present.

      (C) STED micrographs of the same synapses as in Figure 4E with an active zone marker Bassoon (magenta) visualized by antibody staining. GFP-tagged Dyn1xA, Dyn1xA S851D/857D or Dyn1xA R846A (green) are additionally stained with GFP-antibodies. Local maxima of Dyn1xA, Dyn1xA S851D/857D or Dyn1xA R846A signals and minimum distance to the active zone boundary are indicated by dark blue lines.”

      Moreover, overexpression of EndophilinA1/2-mCherry is not sufficient to assess its localization. Please consider either immunofluorescence or genome editing (e.g. Orange or TKIT techniques).

      We agree with the reviewer that overexpression obscures the endogenous localization of proteins. To address this point in our previous publication, we titrated the amount of plasmids for Dyn1xA-GFP and transfected neurons just for 20 hours – this protocol allowed us to uncover the endogenous localization of Dyn1xA despite the fact that it was overexpressed in wild-type neurons (Imoto, et al., 2022). We also confirmed this localization by ORANGE-based CRISPR knock-in of GFP-tag in the endogenous locus of Dyn1 just after the exon 23 and confirm the true endogenous localization of Dyn1xA (Imoto, et al., 2022). Similar approaches were taken by the Chapman lab to localize Synaptotagmin-1 and Synaptobrevin 2 in axons (Watson et al, 2023, eLife, PMID: 36729040). We did not emphasize this in the first submission, but we took the same approach for the EndoA1/2 localization. This does not mean that they also unmask the endogenous localization, and the reviewer is correct that additional evidence would strengthen the data here. Thus, as suggested, we have looked at the endogenous EndophilinA1 localization by antibody staining. As the reviewer is likely aware, EndophilinA1 also localizes to other places including dendrites and postsynaptic terminals, making it difficult to analyze the data. However, we observe colocalization of Dyn1xA with endogenous EndoA1. Thus, we believe that our major conclusion here drawn based on EndoA1/2-mCherry overexpression is valid (Reviewer’s Figure 1). Since the Endophilin signals in neighboring processes obscures its localization in synapses-of-interest, repeating this localization experiments with ORANGE-based knock-in would be ideal. However, with the lead author starting his own group and many validations needed to confirm the knock-in results, this experiment would require us at least 4-6 months, and thus, it is beyond the scope of our current study. We will follow up on this localization in the near future, but given that endophilin is required for ultrafast endocytosis (Watanabe, et al., Neuron 2018, PMID: 29953872) and these proteins need to be in condensates at the endocytic sites for accelerating the kinetics of endocytosis (Imoto, et al., Neuron 2022, PMID: 35809574), we are confident that endogenous

      EndoA1/2 are localized with Dyn1xA.

      The analysis of the confocal microscopy data is not explained. How is the number of clusters determined? How far apart are they? Confocal microscopy may not have the resolution to distinguish clusters within a synapse.

      We apologize for the insufficient description of the method. We had provided a more thorough description of the methods in our previous publication (Imoto, et al., Neuron 2022, PMID: 35809574). To make this more automated, we improved our custom Matlab scripts. Please note that all the analysis for the cluster location is performed on STED images, not on normal confocal images. To determine the cluster, first, presynaptic regions (based on Bassoon signals or Dyn1xA signals within boutons) in each STED image are cropped with 900 by 900 nm (regions-of-interest) ROIs. Then, our Matlab scripts calculate the local maxima of fluorescence intensity within the ROIs. To determine the distance between the active zone and the Dyn1xA or EndoA1/2 clusters, the Matlab scripts perform the same local maxima calculations in both channels and make contours at 50% intensity of the local maxima. The minimum distance reflects the shortest distance between the active zone and Dyn1xA/EndoA1/2 contours. To make these points clearer, we modified the main text and the Methods section. In addition, we have added workflow of these analysis as Figure S3.

      P9. Main. “Signals of these proteins are acquired by STED microscopy and analyzed by custom MATLAB scripts, similarly to our previous work23.”

      P20. Methods. “All the cluster distance measurements are performed on STED images. For the measurements, a custom MATLAB code package23 was modified using GPT-4 (OpenAI) to perform semi-automated image segmentation and analysis of the endocytic protein distribution relative to the active zone marked by Bassoon or relative to Dyn1xA cluster in STED images. First, the STED images were blurred with a Gaussian filter with radius of 1.2 pixels to reduce the Poisson noise and then deconvoluted twice using the built-in deconvblind function: the initial point spread function (PSF) input is measured from the unspecific antibodies in the STED images. The second PSF (enhanced PSF) input is chosen as the returned PSF from the initial run of blind deconvolution62. The enhanced PSF was used to deconvolute the STED images to be analyzed. Each time, 10 iterations were performed. All presynaptic boutons in each deconvoluted image were selected within 3030-pixel (0.81 mm2) ROIs based on the varicosity shape and bassoon or Dyn1xA signals. The boundary of active zone or Dyn1xA puncta was identified as the contour that represents half of the intensity of each local maxima in the Bassoon channel. The Dyn1xA clusters and Endophilin A clusters were picked by calculating pixels of local maxima. The distances between the Dyn1xA cluster and active zone boundary or Endophilin A clusters were automatically calculated correspondingly. For the distance measurement, MATLAB distance2curve function (John D'Errico 2024, MATLAB Central File Exchange) first calculated the distance between the local maxima pixel and all the points on the contour of the active zone or Dyn1xA cluster boundary. Next, the shortest distance was selected as the minimum distance. Signals over crossing the ROIs and the Bassoon signals outside of the transfected neurons were excluded from the analysis. The MATLAB scripts are available by request.”

      In the legend of Figure S3,

      “Protein localization in presynapses is determined by semi-automated MATLAB scripts (see Methods).

      (A) Series of deconvoluted STED images are segmented to obtain 50-100 presynapse ROIs in each condition.

      (B) Two representations of the MATLAB analysis interface are shown. The first channel (ch1, green) is processed to identify the pixels of local maxima within this channel. The second channel (ch2, magenta) is normally an active zone protein, Bassoon. Active zone boundary is determined by the contour generated at 50% intensity of the local maxima of ch2. The contours outside of the transfected neurons are manually selected on the interface and excluded from the analysis. Minimum distances from each pixel of the local maxima in ch1 to the contour in ch2 are calculated and shown in the composite image. The plot “Distance distribution” shows all the minimum distance identified in this presynapses ROI (unit of the y axis is nanometer). The plot “Accumulated distance distribution” shows the accumulated distance distribution from the initial to the current presynapses ROI. The plot “Histogram of total intensity” shows the intensity counts around individual local maxima pixels in ch1.”

      For the STED microscopy, a representation of the processed image (after deconvolution) and the localization of the peaks would be important to assess the measurement of distances. If Dyn1xA S851/857D is more diffuse, are there still peaks to measure for every synapse?

      We thank the reviewer for bringing up this important question. In Figure S4C, we have added the position of the local maxima of wild-type and mutant Dyn1xA shown in the main Figure 4E. As the reviewer pointed out, when a protein is more diffuse, it is difficult to find the peak intensity by STED. However, since these proteins are still found at a higher density within a very confined space of a presynapse and synapses are packed with organelles like synaptic vesicles and macromolecules, signals from even diffuse proteins can be detected as clusters, and local maxima can be detected in these images.

      To illustrate this point better, we added Reviewer’s Figure 2 below. In this experiment, we transfected neurons with a typical amount of plasmids (2.0 µg/well) or ~10x lower amount (0.25 µg/well). When the density of cytosolic proteins is high (Reviewer’s Figure 2A), the depletion laser has to be strong enough to induce sufficient stimulated emission and resolve protein localization. Insufficient power would produce low resolution images, leading to inappropriate detection of the local maxima (Reviewer’s Figure 1A). Thus, we set our excitation and depletion laser powers to resolve the protein localization to ~40-80 nm at presynapses. Furthermore, to avoid mislocalization of proteins due to the overexpression, we use 0.25-0.5 ug/well (in 12-well plate) of plasmid DNA for transfection, which is around 10 times lower than the amount used in the typical lipofectamine neuronal transfection protocol (Imoto, et al., Neuron 2022). We also change the medium around 20 hours after the transfection instead of the typical 48 hours (Imoto, et al., Neuron 2022). With these modifications and settings, we can obtain the location of the local maxima of the diffuse signals (Reviewer’s Figure 1B and Figure 4E and Figure S4). We modified the Method section to make these points clearer.

      P 17, “Briefly, plasmids were mixed well with 2 µl Lipofectamine in 100 µl Neurobasal media and incubated for 20 min. For Dyn1xA and Endophilin A expressions, 0.5 µg of constructs were used to reduce the overexpression artifacts23. The plasmid mixture was added to each well with 1 ml of fresh Neurobasal media supplemented with 2 mM GlutaMax and 2% B27. After 4 hours, the medium was replaced with the pre-warmed conditioned media. To prevent too much expression of proteins, neurons were transfected for less than 20 hours and fixed for imaging.”

      P 20, “Quality of the STED images are examined by comparing the confocal and STED images and measuring the size of signals at synapses and PSF (non-specific signals from antibodies).”

      Legends for Figure S4C,

      “(C) STED micrographs of the synapses shown in Figure 4F with an active zone marker Bassoon (magenta). GFP-tagged Dyn1xA, Dyn1xA S851D/857D or Dyn1xA R846A are visualized by antibody staining of GFP (green). Local maxima of Dyn1xA, Dyn1xA S851D/857D or Dyn1xA R846A signals and minimum distance to the active zone boundary are overlaid.”

      Figures 5 and 6: No specific comment. The data and its analysis are very nice and elegant. The comment on the lack of rescue of Dyn1xA on endosome maturation may be a bit overstated, because many "controls" (shRNA control Figure S5 or Dyn3 KO in Imoto et al. 2022) have a significant number of endosomes 10 s after stimulation.

      We thank the reviewer for noting the strength of our data and pointing out this issue on endosomal resolution. In particular, the reviewer is concerned about our interpretation of the ferritin positive endosomes present at 10 s in time-resolved electron microscopy experiments. Indeed, the number of ferritin positive endosomes in Dyn1 KO, Dyn1xA OEx neurons (0.1/profile) is similar to the control conditions: scramble shRNA control (0.1/profile, Figure S5) and Dyn3KO neurons (0.2/profile) in our previous study (Imoto et al. 2022). Although we do not consider Dyn3 KO as a control, given the presence of abnormal endosomal structures, we agree with the reviewer that scramble shRNA control in Figure S5 does indicate that some ferritin-positive endosomes even at 10 s after stimulation. We would like to note that this result is in stark contrast to our previous studies where we observed the number of ferritin positive endosomes returning to the basal level in both wild-type neurons and many scramble shRNA controls (Watanabe et al. 2014, 2018, Imoto et al 2022). Thus, the majority of the data we have indicate that the number of ferritin positive endosomes returns to basal level by 10 s, suggesting that endosomes are typically resolved into synaptic vesicles by this time. However, given that we do not know the nature of the inconsistency here and we cannot exclude the possibility of overexpression artifact of Dyn1xA as an alternative, we changed the following lines.

      P. 10, “Interestingly, the number of ferritin-positive endosomes did not return to the baseline (Figure 5E, F) as in previous studies3,35,36, suggesting that Dyn1xA may not fully rescue the knockout phenotypes or that overexpression of Dyn1xA causes abnormal endosomal morphology.”

      By the way, why did the authors use Dyn1 KO in this study, and not Dyn1,3 DKO as in Imoto et al. 2022?

      This is simply because Dyn3KO displayed an endosomal defect in our previous study (Imoto et al 2022), and we wanted to focus on endocytic phenotypes of Dyn1 KO and mutant rescues in this study.

      In the Discussion, the authors present the binding sites (for endophilin and amphiphysin SH3 domains) as independent. However, these proteins form dimers or even multimers as they cluster around the neck of a forming vesicle. Even though they provide evidence in vitro (Figure 3) that in these conditions of high concentration one dyn1xA-PRR binds one SH3 domain, in cells multiple binding sites on the PRR to these proteins may involve avidity effects, as discussed for example in Rosendale et al. 2019 doi 10.1038/s41467-019-12434-9. For example, the high affinity binding of Dyn1-PRR to amphiphysin cannot be explained only by the sequence 830-838.

      The reviewer suggests “In the Discussion, the authors present the binding sites (for endophilin and amphiphysin SH3 domains) as independent.” However, we do not claim these interactions are functionally independent, except in the context of in vitro experiments where they are sequence-independent.

      They also suggest “However, these proteins form dimers or even multimers as they cluster around the neck of a forming vesicle”. However we do not agree with this in the context of our Discussion, because the evidence of multimers and clustering is convincing but is entirely in vitro data.

      Thirdly they comment that “For example, the high affinity binding of Dyn1-PRR to amphiphysin cannot be explained only by the sequence 830-838.” We fully agree with the statement and felt we had addressed this in the manuscript. To explain, it’s important to point out our relatively new concept here and previously reported by us (Lin Luo et al 2016, PMID: 26893375) of the existence and importance of SDE and LDE for SH3 domains (Endophilin here, syndapin in our previous report). These elements act at a distance from the so-called core PxxP motifs and they provide much higher affinity and specificity than the core region alone. We had further mentioned this in the p11 discussion “Although this is a previously characterized binding site for Amphiphysin and is also present in Dyn1xB-PRR, the extended C-terminal tail of Dyn1xA contains short and long distance elements (SDE and LDE) essential for Endophilin binding, making it higher affinity for Endophilin.” Because the NMR identified F862 as a chemical shift for dynamin, we performed a pulldown with this mutant in the xA746-798 construct (which only contains the higher affinity site) and found that indeed “.F862A reduced Endophilin binding 29% (pOverall, the reviewer correctly points out that “multiple binding sites on the PRR to these proteins may involve avidity effects*” could play a role in vivo. We agree that avidity is an additional possibility, not examined in our study. Therefore, as suggested, we added the following sentence to the discussion on the SDE and LDE impacts.

      P. 11. “Our pull-down results showed that R846A abolished endophilin binding to xA806-864 (which contains only the second and higher affinity binding site and the associated SDE (A839) and LDE (F862)) and reduced about 40% of endophilin binding to the Dyn1xA-PRR (which contains both binding sites) without affecting its interaction with Amphiphysin, providing important partner specificity, although we cannot exclude the possibility that avidity effects may additionally come in play in vivo 42

      Reviewer #1 (Significance (Required)):

      This study provides a significant advance on the mechanisms of dynamin recruitment to endocytic zones in presynaptic terminals. The work adds a significant step by experienced labs (Robinson, Watanabe) who have provided important insight in the mechanisms by many publications in the last years.

      We thank the reviewer for the careful read of our manuscript and positive outlook of our work.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      1. This is a compelling study that reports a key discovery to understand the molecular mechanism of ultrafast endocytosis. The authors demonstrate that the Dynamin splice version 1xA (Dynamin 1xA) uniquely binds Endophilin A, in contrast to Dynamin splice version 1xB (Dynamin 1xB) that does not bind Endophilin A and it is not required for ultrafast endocytosis. In addition, the Endophilin A binding occurs in a dephosphorylation-regulated manner. The study is carefully carried out and it is based on high quality data obtained by means of advanced biochemical methodologies, state-of-art flash-freezing electron microscopy analysis, superresolution microscopy and dynamic imaging of exo-and endocytosis in neuronal cultures. The results convincingly support the conclusions.

      We thank the reviewer for supporting the conclusions of our study.

      1. Although additional experiments are not essential to support the claims of the paper there is room, however, for improvement within the pHluorin experiments. These experiments, that are clearly informative and consistent with the rest of experimental data, do not apply the useful approach to separate endo- from exocytosis. The use of bafilomycin or folimycin to block the vesicular proton pump allows the unmasking the endocytosis that is occurring during the stimulus, that should correspond to ultrafast endocytosis. It would be very elegant to demonstrate that such a component, as expected according to the electron microscopy data, requires the binding of Endophilin A to Dynamin 1xA. If the authors have the pHluorin experiments running, the suggested experiments are very much doable because the reagents and the methodology is already in place and the new data could be generated in around six weeks.

      We thank the reviewer for the suggestion. The reviewer is concerned that vGlut1 pHluorin experiment in Figure 6 may not correspond to ultrafast endocytosis. We agree that bafilomycin/folimycin treatment will reveal the amount of endocytosis that takes place while neurons are stimulated. However, we are not certain that endocytosis during this phase would fully correspond to ultrafast endocytosis because reacidification of endocytosed vesicles typically takes 3-4 s (Atluri and Ryan, 2006, PMID: 16495458; although see https://elifesciences.org/articles/36097) and thus, the nature of endocytosis cannot be fully determined by this assay. To claim that endocytosis measured by pHluorin assay during stimulation all correspond to ultrafast endocytosis, we would need to perform very careful work to track single pHluorin molecules at the ultrastructural level and corelate their internalization to pHluorin signals. Perhaps, a rapid acid quench technique used by the Haucke group would also be appropriate to estimate the amount of ultrafast endocytosis (Soykan et al. 2017 PMID: 28231467), but we are not set up to perform such experiments here. Also, our lead author, Yuuta Imoto, is leaving the lab to start up his own group, and it will take us months rather than weeks to get the requested experiments done. Since the point of this experiment was to test whether the interaction of Dyn1xA and EndoA is essential for protein retrieval regardless of the actual mechanisms and the reviewer acknowledges that this point is sufficiently supported by the experiments, we will set this experiment as the priority for the next paper.

      Instead of the bafilomycin or rapid acid quenching experiments, we have now added data from vglut1-pHluorin experiment with a single action potential. With a single action potential, all synaptic vesicle recycling is mediated by ultrafast endocytosis in these neurons (Watanabe et al, 2013 PMID: 24305055; Watanabe et al. 2014, PMID: 25296249). Our electron microscopy experiments in Figure 5 is also performed with a single action potential. As with 10 action potentials, 20 Hz experiments, re-acidification of vglut1-pHluorin is blocked when Dyn1 and EndophilinA1 interaction is disrupted (Figure 6 F-I). We added a description of this result as below.

      P 11. “Similar defects were observed when the experiments were repeated with a single action potential – synaptic vesicle recycling is mediated by ultrafast endocytosis with this stimulation paradigm25 (S851/857 recovery is 73.3% above the baseline; R846A, recovery is 30.0% above the baseline) (Figure S9 A-D). Together, these results suggest that the 20 amino acid extension of Dyn1xA is important for recycling of synaptic vesicle proteins mediated by specific phosphorylation and Endophilin binding sites within the extension.”

      The methods are carefully explained. Some of the experiments are only replicated in two cultures and the authors should justify the reasons to convince the audience that the approaches used have enough low variability for not increasing the n number. The pHluorin experiments, however, are performed only in a single culture; they should replicate these experiments in at least 3 different cultures (three different mice).

      The reviewer is correct. The variability is very low in our ultrastructural studies and STED imaging, and thus, in all our previous publications, two independent cultures are used. We do agree that in the ideal case, we would like to have three independent cultures, but given the nature of ultrastructural studies (control, mutants, and multiple time points), triplicating the data would add another year to our work. We are currently developing AI-based segmentation analysis, and once this pipeline is established, we will be able to increase N. However, please note that for these experiments, we examine around 200 synapses from each condition in electron microscopy studies (Table S2)– these numbers are far more than the gold standard in the field. Likewise, 50-100 synapses are examined for STED experiments (Table S2). To examine variability of our analysis results, we compared a significance between the dataset using cumulative curves and Kolmogorov–Smirnov test (Figure S11). As shown in the summarized data and p value in each condition, there are no significant difference between the datasets.

      For pHluorin analysis, the reviewer is correct. We repeated the experiments twice to increase the N after the initial submission. The data are consistent, and the conclusions are not changed by the additional experiments (Figure 6 and Figure S9). We also changed the Statistical analysis section in Methods as below.

      P. 19. “All electron microscopy data are pooled from multiple experiments after examined on a per-experiment basis (with all freezing on the same day); none of the pooled data show significant deviation from each replicate (Table S2).”

      p 19, “All fluorescence microscopy data were first examined on a per-experiment basis. For Figure 4, the data were pooled; none of the pooled data show significant deviation from each replicate (Figure S11 and Table S2). Sample sizes were 2 independent cultures, at least 50-100 synapses from 4 different neurons in each condition..”

      Legends for Figure S11

      Figure S11. Data variability in Figure 4.

      Cumulative curves are made from each dataset of (A) distance of Endophilin A1 puncta from the edge of Dyn1xA puncta, (B) distance of Endophilin A2 puncta from the edge of Dyn1xA puncta, distance distribution of Dyn1xA from active zone edge in (C) neurons expressing wild-type Dyn1xA-GFP, (D) Dyn1xA-S851/857-GFP and (E) Dyn1xA-R846-GFP. n > 4 coverslips from 2 independent cultures. Kolmogorov–Smirnov (KS) test, p values are indicated in each plot.

      Minor comments: 4. Prior studies referenced appropriately and the text and figures are clear and accurate.

      We thank the reviewer for the careful read of our manuscript.

      The authors should discuss about the mediators (enzymes) responsible for dephosphorylation of phosphor-box 2 that is key for the Dynamin 1xa-Endophilin A interaction.

      We thank the reviewer for the suggestion. We added a discussion on a potential mediator, Dyrk1, as below.

      P. 12. ”What are the kinases that regulate Dyn1? The phosphorylation of phosphobox-1 is mediated by Glycogen synthase kinase-3 beta (GSK3ß) and Cyclin-dependent kinase 5 (CDK5)17, while phosphobox-2 is likely phosphorylated by Trisomy 21-linked dual-specificity tyrosine phosphorylation-regulated kinase 1A (Mnb/Dyrk1)44,45 since Ser851 in phosphobox-2 is shown to be phosphorylated by Mnb/Dyrk1 in vitro32. Furthermore, overexpression of Mnb/Dyrk1 in cultured hippocampal neurons causes slowing down the retrieval of a synaptic vesicle protein vGlut146. Consistently, our data showed that phosphomimetic mutations in phosphobox-2 results disruption of Dyn1xA localization, perturbation of ultrafast endocytosis, and slower kinetics of vGlut1 retrieval. However, how these kinases interplay to regulate the interaction of Dyn1xA, Syndapin1 and Endophilin A1 for ultrafast endocytosis is unknown.”

      It would be very helpful to include a final cartoon depicting the key protein-protein interactions regulated by dephosphorylation (activity) and the sequence of molecular events that leads to ultrafast endocytosis

      As suggested, we made a model figure, (new Figure 7) showing how Dyn1xA and its interaction with EndoA and Syndapin1 increases the kinetics of endocytosis at synapses. Regarding the sequence of molecular events, we think that there are already dephosphorylated fraction of Dyn1xA molecules sitting on the endocytic zone at the resting state and they mediate ultrafast endocytosis. However, it is equally possible that activity-dependent dephosphorylation of Dyn1xA also may play a role (Jing et al. 2011, PMID: 21730063). However, we have no evidence about the sequence of activity dependent modulation of Dyn1xA and its binding partners during ultrafast endocytosis yet. This is much beyond what we have reported in this work and therefore, excluded from the model figure. We added the following to the end of the discussion:

      p13, “Nonetheless, these results suggest that Dyn1xA long C-terminal extension allows multivalent interaction with endocytic proteins and that the high affinity interaction with Endophilin A1 permits phospho-regulation of their interaction and defines its function at synapses (Figure S7)”.

      Figure legend Figure 7,

      “Figure 7. Schematics depicting how specific isoforms Dyn1xA and Endophilin A mediate ultrafast endocytosis.

      A splice variant of dynamin 1, Dyn1xA, but not other isoforms/variants can mediate ultrafast endocytosis. (A) Dyn1xA has 20 amino acid extension which introduces a new high affinity Endophilin A1 binding site. Three amino acids, R846 at the splice site boundary, S851 and S857, act as long-distance element which can enhance affinity of proline rich motifs (PRM) to SH3 motif from outside of the PRM core sequence PxxP. (B) At a resting state, Dyn1xA accumulates at endocytic zone with SH3 containing BAR protein Syndapin 123 and Endophilin A1/2. When phosphobox-1 (Syndapin1 binding) and phosphobox-2 (Endophilin A1/2 binding, around S851/S857) within Dyn1xA PRD are phosphorylated, these proteins are diffuse within the cytoplasm. A dephosphorylated fraction of Dyn1xA molecules can interact with these BAR domain proteins. Loss of interactions including Dyn1xA-R846A or -S851/857D mutations, disrupts endocytic zone pre-accumulations. Consequently, ultrafast endocytosis fails.”

      Reviewer #2 (Significance (Required)):

      This is a remarkable and important advance in the field of endocytosis. The study reports a key discovery to understand the molecular mechanism of ultrafast endocytosis. Scientist interested in synaptic function and the general audience of cell biologist interested in membrane trafficking will very much value this study. The mechanism reported will potentially be included in textbooks in the near future.

      My field of expertise includes molecular mechanisms of presynaptic function and membrane trafficking.

      I have not enough experience to evaluate the quality of the NMR experiments, however, I do not have any problem at all with, in my opinion, elegant results reported.

      We thank the reviewer for the positive outlook of our manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a compelling study that reports a key discovery to understand the molecular mechanism of ultrafast endocytosis. The authors demonstrate that the Dynamin splice version 1xA (Dynamin 1xA) uniquely binds Endophilin A, in contrast to Dynamin splice version 1xB (Dynamin 1xB) that does not bind Endophilin A and it is not required for ultrafast endocytosis. In addition, the Endophilin A binding occurs in a dephosphorylation-regulated manner. The study is carefully carried out and it is based on high quality data obtained by means of advanced biochemical methodologies, state-of-art flash-freezing electron microscopy analysis, superresolution microscopy and dynamic imaging of exo-and endocytosis in neuronal cultures. The results convincingly support the conclusions.

      Although additional experiments are not essential to support the claims of the paper there is room, however, for improvement within the pHluorin experiments. These experiments, that are clearly informative and consistent with the rest of experimental data, do not apply the useful approach to separate endo- from exocytosis. The use of bafilomycin or folimycin to block the vesicular proton pump allows the unmasking the endocytosis that is occurring during the stimulus, that should correspond to ultrafast endocytosis. It would be very elegant to demonstrate that such a component, as expected according to the electron microscopy data, requires the binding of Endophilin A to Dynamin 1xA. If the authors have the pHluorin experiments running, the suggested experiments are very much doable because the reagents and the methodology is already in place and the new data could be generated in around six weeks.

      The methods are carefully explained. Some of the experiments are only replicated in two cultures and the authors should justify the reasons to convince the audience that the approaches used have enough low variability for not increasing the n number. The pHluorin experiments, however, are performed only in a single culture; they should replicate these experiments in at least 3 different cultures (three different mice).

      Minor comments:

      Prior studies referenced appropriately and the the text and figures are clear and accurate.

      1. The authors should discuss about the mediators (enzymes) responsible for dephosphorylation of phosphor-box 2 that is key for the Dynamin 1xa-Endophilin A interaction.
      2. It would be very helpful to include a final cartoon depicting the key protein-protein interactions regulated by dephosphorylation (activity) and the sequence of molecular events that leads to ultrafast endocytosis

      Significance

      This is a remarkable and important advance in the field of endocytosis. The study reports a key discovery to understand the molecular mechanism of ultrafast endocytosis. Scientist interested in synaptic function and the general audience of cell biologist interested in membrane trafficking will very much value this study. The mechanism reported will potentially be included in textbooks in the near future.

      My field of expertise includes molecular mechanisms of presynaptic function and membrane trafficking.

      I have not enough experience to evaluate the quality of the NMR experiments, however, I do not have any problem at all with, in my opinion, elegant results reported.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Imoto et al. analyze the specific role of the Dynamin1 splice variant Dyn1xA in so-called ultrafast endocytosis, an important mechanism of synaptic vesicle recycling at synapses. In a previous publication (Imoto et al. Neuron 2022), some of the authors had shown that Dyn1xA, and not the other splice variant Dyn1xB, is essential for ultrafast endocytosis. Moreover, Dyn1xA forms clusters around the active zone for exocytosis and interacts with Syndapin 1 in a phosphorylation dependent manner. However, it was unclear which molecular interactions underlie the specific role of Dyn1xA. Here, the authors provide convincing evidence with pull down assays and CSP that Dyn1xA PRR interacts with EndophilinA1/2 with two binding sites. The first binding site lies in the part common to xA and xB, was previously characterized. The second site was previously uncharacterized, is specific for Dyn1xA, and is regulated by phosphorylation (phosphobox 2). The location of these splice variants and mutated forms at presynaptic sites correlate with the prediction made by the biochemical assays. Finally, the authors perform rescue experiments ('flash and freeze' and VGLUT1-pHluorin imaging experiments) to show that Dyn1xA-EndophilinA1/2 binding is important for ultrafast endocytosis. I find the results interesting, providing an important step in the understanding of the interplay between dynamin and the endocytic proteins interacting with it (endophilin, syndapin, amphiphysin) in the context of synaptic vesicle recycling. The manuscript is clearly written and for the most part the data supports the authors' conclusions (see specific comments below). However, there are some issues which need to be clarified before this manuscript is fully suitable for publication.*

      Introduction: the dynx1B Calcineurin binding motif is written PxIxIT consensus but actual sequence is PRITISDP. Is this a typo? Figure 1: the difference between the constructs used in panels C and D is not clear. In D, is it a truncation without residues 796 and 845? If so, it should be labelled clearly in the Western blots. In Panel E, Dyn1xA 746-798 should be labeled Dyn1x 746-798 because it is common to both splice variants. For amphiphysin binding the authors write that "No difference in binding to Amphiphysin 1 was observed among these peptides (Figure1D-F)." They should write that Dyn1x 746-798 does not bind Amphiphysin1 SH3 domain, confirming the specificity of binding to the 833-838 motif. Figure S2. The panels are way too small to see the shifts and the labelling. Please provide bigger panels Figure 2 panel B. There is a typo in the connecting line between the sequence and the CSP peaks. It is 846 instead of 864 (after 839). Figure 3 panel E. In the text, the authors write that "Western blotting of the bound proteins from the R838A pull-down experiment showed that R838A almost abolished both Endophilin and Amphiphysin binding in xA806-864 (Figure 3D), and reduced Endophilin binding to xA-PRR (Figure 3E)." I think they should write "only slightly reduced Endophilin binding..." it is more faithful to the result and consistent with the conclusion that Endophilin A1 has two binding sites on Dyn1xA PRR. It is unclear why the R846A mutant affects binding of Dyn1xA 806-864 but not Dyn1xA-PRR. Moreover, it affects binding to endophilin as well as amphiphysin, and therefore it is not specific. It is thus not correct to write that "R846 is the only residue found to specifically regulate the Dyn1 interaction with Endophilin as a part of an SDE". In the Discussion (page 11), the authors refer to the R846A mutation as specifically affecting Endophilin binding. This should be toned down, as it also affects Amphiphysin binding. For this important point, the data on quantification of Endophilin binding should be presented. Figure 3F-G: what do the star symbols represent in the graphs? I guess the abscissa represents retention time. Please write it clearly instead of a second ordinate for molecular mass, which does not make much sense if this reflects the estimate for the 3 conditions. Figure 4: The statement that "By contrast [to Dyn1xA], Endophilin A1 or A2 formed multiple clusters (1-5 clusters)" is not at all clear on the presented pictures. The authors should provide views of portions of axons with several varicosities, for the reader to appreciate the cases where there are more EndoA clusters than Dyn1 clusters. Moreover, overexpression of EndophilinA1/2-mCherry is not sufficient to assess its localization. Please consider either immunofluorescence of genome editing (e.g. Orange or TKIT techniques). The analysis of the confocal microscopy data is not explained. How is the number of clusters determined? How far apart are they? Confocal microscopy may not have the resolution to distinguish clusters within a synapse. For the STED microscopy, a representation of the processed image (after deconvolution) and the localization of the peaks would be important to assess the measurement of distances. If Dyn1xA S851/857D is more diffuse, are there still peaks to measure for every synapse? Figures 5 and 6: No specific comment. The data and its analysis is very nice and elegant. The comment on the lack of rescue of Dyn1xA on endosome maturation may be a bit overstated, because many "controls" (shRNA control Figure S5 or Dyn3 KO in Imoto et al. 2022) have a significant number of endosomes 10 s after stimulation. By the way, why did the authors use Dyn1 KO in this study, and not Dyn1.3 DKO as in Imoto et al. 2022? In the Discussion, the authors present the binding sites (for endophilin and amphiphysin SH3 domains) as independent. However, these proteins form dimers or even multimers as they cluster around the neck of a forming vesicle. Even though they provide evidence in vitro (Figure 3) that in these conditions of high concentration one dyn1xA-PRR binds one SH3 domain, in cells multiple binding sites on the PRR to these proteins may involve avidity effects, as discussed for example in Rosendale et al. 2019 doi 10.1038/s41467-019-12434-9. For example, the high affinity binding of Dyn1-PRR to amphiphysin cannot be explained only by the sequence 830-838.

      Significance

      This study provides a significant advance on the mechanisms of dynamin recruitment to endocytic zones in presynaptic terminals. The work adds a significant step by experienced labs (Robinson, Watanabe) who have provided important insight in the mechanisms by many publications in the last years.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Please find below a point-by-point reply to the reviewers, with our comments in plain text, and reviewer comments in italics. Direct quotations of MS revisions in the below point-by-point reply are in quotation marks.


      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **The manuscript "Circadian regulation of protein turnover and proteome renewal" investigates the role of protein degradation in the circadian control of proteostasis. The researchers suggest that the relatively static levels of protein levels in a cell are incongruent with the known oscillation in protein synthesis. They therefore hypothesize that there should be a compensatory mechanism to counteract rhythmic protein synthesis, rhythmic protein degradation. To investigate this, they employ bulk pulse chase labeling to study the process of degradation. They identify a synchronization between the creation and turnover of proteins in a cell, implying the clock helps to maintain homeostasis through a novel mechanism. They note that these phases align with energy availability, granting a plausible reasoning behind the biological implementation of this regulation. In summary, this is a sound manuscript that adds to the research field. The experiments in this manuscript are well thought out, organized, and explained. In general, the authors do not go further in their conclusions than I think is warranted given the data that they have, though I think that there are some key items that should be addressed before the publication of this manuscript. *

      Thank you for reading and appreciating our work

      Major notes: 1) In figure 1, a clearer idea of what the ** means would be appreciated. What was the standard of significance for this measure?

      Thank you, this was already reported in the methods section but is now reported in the figure legend also.

      * 2) In Figure 1b, it is important to note clearly in the text that the this is not a direct measure of protein degradation, but a subtractive proxy. Though I don't think that necessarily makes the authors conclusions incorrect, the same result could also be obtained if an extra 15% of the proteins were moved into the insoluble fraction. This is the same for Figure 1E and F. *

      Considering only the pulse shown in the left-hand graph of 1B, the reviewer is correct that this could arise by rhythmic partitioning of nascently synthesised proteins between digitonin-soluble and insoluble fractions. This could not readily explain the variation in the % of nascently synthesised digitonin-soluble protein that is degraded however (right hand graph), hence the need for pulse-chase rather than pulse alone. As such, we do not exclude circadian-regulated solubility of nascently synthesised protein or that there is a rhythm of protein synthesis in the soluble fraction, both are likely true. Rather Figure 1B indicates the relative proportion of nascently-synthesised protein in the soluble fraction that is degraded within 1h of synthesis is not constant over time. This is consistent with current understanding of the regulated increase in activity of protein quality control mechanisms (including proteasome-mediated degradation) that are required to maintain protein homeostasis upon an increase in bulk translation (Gandin and Topisirovic, Translation, 2014).

      In contrast, the lysates probed in Fig 1F were extracted in denaturing urea/thiourea buffer and so cannot be explained by variation in protein solubility.

      Considering 1E, to explain this result entirely through solubility changes would require that puromycinylated polypeptides to become more soluble, at discrete phases of the circadian cycle, but only when the proteasome is inhibited. Whilst we cannot formerly exclude this possibility, we are not aware of evidence to support it, whereas there is prior evidence supporting circadian regulation of protein synthesis and proteasome activity.

      To communicate all of this more clearly we have made the following revisions to the text:

      Page 6: ".The experiment was performed over a 24h time series followed by soluble protein extraction using digitonin, which preferentially permeabilises the plasma membrane over organelle membrane."

      Page 6: " Importantly, the proportion of degraded protein varied over time, being highest at around the same time as increased protein synthesis (Fig 1B), indicating time-of-day variation in digitonin-soluble protein turnover which cannot be solely attributed to previously reported circadian regulation of protein solubility (Stangherlin et al, 2021b). Rather, it suggests that global rates of protein degradation may be co-ordinated with protein synthesis rates, and may vary over the circadian cycle."

      Fig 1a legend: "...with digitonin buffer"

      Fig 1e legend: "...in digitonin buffer"

      Fig1f legend: "... and extracted with urea/thiourea buffer"

      * 3) In figure 1c, is the noted oscillation in protease activity due to the oscillation of these proteins? What are the predicted mechanisms behind this? I don't think that this is necessarily within the scope of this paper but should be addressed in the discussion. Also, the peak degradation rate from Figure 1B is 4 hours before the peak enzyme activities. How can this observation be reconciled? *

      Besides this study, our two previous proteomic investigations of the fibroblast circadian proteome detected no biologically significant or consistent rhythm in proteasome subunit abundance (Wong et al., EMBO J, 2021; Hoyle et al., Science Translational Medicine, 2017). Moreover, proteasomes are long-lived stable complexes whose activity is determined by a combination of substrate-level, allosteric and post-translational regulatory mechanisms that includes their reversible sequestration into storage granules (Albert et al., PNAS, 2020; Fu et al., PNAS, 2021; Yasuda et al., Nature, 2020). It is therefore very likely that the observed rhythm in trypsin- and chymotrypsin-like activity occurs post-translationally. Proteasome subunit composition is also known to change, which might be another reason for differences between the protease activities (Marshall and Vierstra, Front Mol Biosci, 2019; Zheng et al., J Neurochem, 2012).

      Due to the nature of the experiment, the degradation rate inferred from Figure 1B does not reflect proteasome activity, exclusively. Rather it reflects the combined sum of processes that remove nascently produced proteins from the cell's digitonin-soluble fraction, which includes proteasomal degradation, but also autophagy, protein secretion and sequestration into other compartments. Therefore, the peak degradation in Fig 1B would not necessarily be expected to coincide with the peak of proteasome activity in Fig 1C. Figure 1A/B is intended as an exemplar for the investigation's rationale and was the first to be performed chronologically.

      To communicate this succinctly, we have revised the relevant text as follows:

      Page 7: "Previous proteomics studies under similar conditions have revealed minimal circadian variation in proteasome subunit abundance (Wong et al, 2022), suggesting that proteasome activity rhythmicity, and therefore rhythms in UPS-mediated protein degradation, are regulated post-translationally (Marshall & Vierstra, 2019; Hansen et al, 2021)"

      * 4) For the pSILAC analysis, the incorporation scheme has a six-hour window between the comparison of the light and heavy peptides. This makes it somewhat difficult to assess whether you are looking a clock effect from T1 or T1+6. This does not negate the findings, but it does question when the synthesis is occurring and what is being compared, which I think should be more clearly discussed in the manuscript. This is discussed later in the manuscript but should be mentioned in this section. *

      Thank you for this suggestion. To communicate this more clearly, we have rearranged the labels at the top of schematic graphs in figures 2b and 3b in order to clearly distinguish the pulse-labelling window from the time of sample collection. The following text has been added to the methods section:

      Page 9: "To enable sufficient heavy labelling for detection, a 6h time window was employed, thus measuring synthesis and abundance within each quarter of the circadian cycle "

      * 5) There are no error bars on figure 2C. What the pSILAC just done in a singlet? If so, the rhythms estimation is likely a large overestimate and should be noted. *

      This first pSILAC experiment was performed in singlet with respect to external time for the RAIN analysis, but is duplicate for the two-way ANOVA that is also reported, by treating each cycle as a separate replicate. In fact, the 6.2% of proteins that were significantly rhythmically abundant by RAIN actually agree well with two previous experiments we performed using mouse fibroblasts under identical conditions: the first with 3h resolution over 3 cycles in singlet (7% rhythmic), the second with 4 biological independent replicates over one cycle (8% rhythmic) (Wong et al., EMBO J, 2021). The curve fits shown in 2C are the standard damped sine wave fits, with p-values from RAIN reported in the figure legend.­­

      Most importantly however, and as noted in the text, the absolute % of rhythmically abundant proteins is rather irrelevant and indeed the absolute numbers of 'rhythmic' proteins can vary wildly, dependent on the analysis method and stringency. The only important point to be gleaned from the estimates shown in Figure 2e is that by either statistical test, most rhythmically abundant proteins are not rhythmically synthesised, and vice versa; however, the % of proteins that are both rhythmically synthesised and rhythmically abundant is 6 to 11--fold higher than would be expected by chance (taking proteins rhythmic by RAIN and ANOVA, respectively; in both cases the overlap between the two sets is highly significant) . This serves as a positive control, i.e., a minority of proteins show correlated rhythms of synthesis and abundance that are consistent with the canonical activity of 'clock-controlled genes' which cannot be explained by overestimation of rhythmicity.

      Odds Ratio comparison synthesis vs total

      Synthesis rhythmic by RAIN - listA size=148, e.g. A8Y5H7, B2RUR8, E9Q4N7

      Total rhythmic by RAIN - listB size=149, e.g. A1A5B6, A2A6T1, A2AI08

      Intersection size=34, e.g. A8Y5H7, O08795, O54910

      Union size=263, e.g. A8Y5H7, B2RUR8, E9Q4N7

      Genome size=2528

      Contingency Table:

      notA inA

      notB 2265 114

      inB 115 34

      Overlapping p-value=5.4e-13

      Odds ratio=5.9

      Overlap tested using Fisher's exact test (alternative=greater)

      Jaccard Index=0.1

      Synthesis rhythmic by ANOVA - listA size=66, e.g. A8Y5H7, O35639, O55143

      Total rhythmic by ANOVA - listB size=83, e.g. A8Y5H7, B2RQC6, E9Q6J5

      Intersection size=16, e.g. A8Y5H7, P22561-2, Q3TB82

      Union size=133, e.g. A8Y5H7, O35639, O55143

      Genome size=2528

      Contingency Table:

      notA inA

      notB 2395 50

      inB 67 16

      Overlapping p-value=9.7e-11

      Odds ratio=11.4

      Overlap tested using Fisher's exact test (alternative=greater)

      Jaccard Index=0.1

      Nevertheless, we agree with the reviewer's general point and have revised the text as follows:

      Page 9: "... and may be susceptible to overestimation of rhythmicity."

      Page 9: "Consistent with similar previous studies, Page 9: "The proportion of such proteins was more than expected by chance (pMethods, Page 21: "...(n=1 per timepoint)"

      * 6) Why were the genes selected in 2C? these are not discussed anywhere else in the manuscript.*

      These are simply illustrative examples so that the reader can better understand what we mean, i.e., two proteins in different phases and one that did not change, all within a similar range of abundance. The selected proteins were not discussed because we do not expect the reader to attach any specific meaning to them. We have revised the figure to include in 2C examples of each rhythmicity category shown in 2E. To make this clear, we now state the following:

      Figure 2 legend: "No specific meaning is inferred from the protein identities”.

      • 7) The authors note that for Figure 2 "These observations are consistent with widespread rhythmic regulation of protein degradation." However, only 5-10% of the proteome is oscillating at any level and less with a discrepancy between synthesis and abundance, so "widespread" is an exaggeration and this statement should be limited to the degradation in the rhythmic proteome. *

      We take the reviewer's point, but the term rhythmic proteome is also inaccurate since half the proteins with rhythmic degradation did not show an abundance rhythm in both mass spec experiments. We therefore revised this sentence as follows:

      Page 10: "These observations are consistent with widespread temporal organisation of protein degradation within the circadian-regulated proteome."

      * 8) The authors note that their more developed strategy in figure 3 would allow for the detection of less abundant proteins. However, they do not discuss that they in fact found less proteins overall, or if they were able to detect proteins of lower abundance. This is of some concern in determining if this is indeed the better method that they predict. How can the authors reconcile this issue? How can they rationalize this explains their increase in oscillating elements? *

      Thank you for raising this point, we did not explain ourselves sufficiently clearly. As stated in the revised text, once we had analysed the first iteration of pSILAC (Fig 2), we realised that detection of heavy-labelled proteins was "inevitably limited and biased the proteome coverage towards abundant proteins with higher synthesis rates". In other words, in order to be considered in our analysis both unlabelled and heavy-labelled peptides needed to be detected in every sample at every time point. In fact, if we do not consider heavy-labelling, the overall coverage in the Fig 3 experiment (6577 proteins) was better than the Figure 2 experiment (6264 proteins), as expected, due to technical improvements in the methods used (by the time of the experiment in Fig. 3, we were able to perform the analysis using mass spectrometry techniques with better fractionation and detection, namely FAIMS and MS3). When the analysis criteria are applied however, this falls to 2302 and 2528 proteins, respectively. Because of the way that mass spectrometry works, many proteins needed to be excluded from analysis because the heavy label wasn't detected in one or more samples. In these cases, we cannot infer that no heavy-labelled protein was present in that sample or even that it was present at lower levels than other samples - it simply wasn't detected and therefore we cannot make any quantitative comparisons. Non-detection of any given heavy peptide may occur for several reasons, the most likely being that it co-elutes from the chromatography column at the same time as other much more abundant (light) peptides and simply escapes detection. This is an unavoidable limitation of the technique, we hope the reviewer can understand our need to restrict the analysis to those proteins whose nascent synthesis, and total abundance in the MMC fraction, can be confidently quantified.

      As the experiments in Fig 2 and Fig 3 were performed independently, with separate TMT sets and different instrumentation, we are also unable to compare absolute abundances of the proteins between the two.

      To communicate this more clearly we have amended Figures 2e and 3e to state the total coverage in the legends, as well as clearly stating the coverage of heavy-labelled proteins in the figure itself. We have also added the following explanation to the text:

      Page 11:

      “Despite enriching for only one cellular compartment, the overall coverage in this experiment was similar to the previous one (6577 and 6264 proteins, respectively), due to the altered and more targeted approach; with heavy peptides detected for 2302 proteins."

      *9) In the comparison of complex turnover rates, the authors need to provide a metric that backs their statement that "the majority of component subunits not only showed similar average heavy to total protein ratios but also a similar change in synthesis over the daily cycle" for figure 3F. *

      Our apologies for this oversight, this is now presented in new Fig S3D.

      * 10) In reference to the AHA incorporation, why is the hypothesis not that, like the puramycin, you would not see oscillation unless you add BTZ? Shouldn't the active degradation regulate the incorporation of AHA such that there is no visible rhythm unless you suppress degradation? *

      AHA is a methionine analogue that is sparsely incorporated into polypeptide chains with minimal effect on protein function/structure (Dietrich et al., PNAS, 2006). Unlike puromycin, therefore, AHA does not lead to chain termination or protein misfolding/degradation (Dermit et al., Mol Biosyst, 2017) and so pulsed application at different phases of the circadian cycle is sufficient to reveal protein synthesis rhythms. The novelty in Fig 3H is the combination of AHA labelling with native PAGE that allows us to validate rhythmic production of high molecular weight protein complexes. This would not be possible with puromycin because prematurely-terminated polypeptide chains are not able to assemble into native complexes unless chain termination happens to occur at the extreme C-terminus and the C-terminus does not partake in any intermolecular interactions within the assembled complex.

      * 11) The authors claim that there is enrichment of the actin cytoskeleton, but where this data can be found should be explained. The only thing that is shown is a few selected graphs of proteins in this pathway. *

      We previously reported circadian regulation of the actin cytoskeleton in Hoyle et al. (Sci Trans Med, 2017). The extremely high relative amplitude of Beta-actin (the structural component of microfilaments) in the MMC fraction is, in and of itself, entirely sufficient to demonstrate a circadian rhythm in the relative ratio of globular to filamentous actin that was originally identified by Ueli Schibler's lab (Gerber et al., Cell, 2013) and then shown to have a cell-autonomous basis in fibroblasts in Hoyle et al (2017). We have included further examples of an actin-binding protein (Corinin1b) and a motor protein (Myosin 6) to further illustrate this, but do not feel further discussion is warranted because it was comprehensively addressed in our previous work. The enrichment for actin was determined by GO analysis, which is now shown in the Fig 4A and referred to in the text.

      The important point in Fig 4C is the difference in phase with the examples shown in Fig 4B and summarised in Figure 4A, i.e., there are a small number of proteins whose presence in the MMC fraction is highest in advance of the majority of rhythmically abundant proteins, but this earlier group doesn't show any significant synthesis rhythm. Actin is one of the most abundant cellular proteins, and by mass it accounts for 67% of the circadian variation of rhythmically abundant proteins that peak in this fraction at the same phase. All these data and analyses are available for scrutiny in Supplementary Table 2.

      To communicate this more clearly we have expanded on this point as follows:

      Page 13: " These proteins were enriched by 9-fold for actin and associated regulators of the actin cytoskeleton (q* 12) The authors note an oscillation in the total levels of p-eif2, commenting that these do not arise from the rhythms in total eif2a but temperature and feeding rhythms. However, unless I misunderstood, this work was done in fibroblast cell culture, so in this case, where would these temperature and feeding rhythms come from? *

      We were insufficiently clear. Daily rhythms of p-eIF2 have been observed under physiological conditions in mouse, in vivo. We do not observe similar rhythms in cultured fibroblasts under constant conditions unless the cells are challenged by stress. By inference therefore, it seems likely that daily rhythms of p-eIF2 in vivo arise from the interaction between cell-autonomous mechanisms and daily systemic cues such as, insulin/IGF-1 signalling and body temperature that are in turn driven by daily rhythms in CNS control, daily feed/fast rhythms and daily rest/activity rhythms, respectively. We have amended the text as follows:

      Page 15: "...and so suggest that daily p-eIF2α rhythms in mouse tissues likely arise through the interaction between cell-autonomous mechanisms and daily cycles of systemic cues, e.g., insulin/IGF-1 signalling and body temperature rhythms driven by daily feed/fast and rest/activity cycles, respectively."

      * 13) In Figure 5d, the treatment impeding degradation is causing cell death while the inhibition of translation does not. However, wouldn't too much, or not enough, translation, without compensatory regulation from degradation cause a problem in the same way that degradation does? *

      It is well-established that acute treatment with high concentrations of proteasomal inhibitors rapidly leads to proteotoxic stress that will trigger apoptosis unless resolved (Dantuma and Lindsten, Cardiovasc Res, 2010). Treatment with CHX is certainly stressful to cells, but in a different way, and cells die through mechanisms generally regarded to be necrotic and certainly do not involve the canonical proteotoxic stress responses that are activated by MG132 and similar drugs. Our findings show that, by whatever mechanisms cells die with CHX treatment, it does not change over the circadian cycle whereas death via proteotoxic stress does, consistent with our prediction. We hope the reviewer agrees it is beyond the scope of our study to explain why CHX-mediated cell death does not show a circadian rhythm in mouse fibroblasts.

      *Reviewer #1 (Significance (Required)):

      *The information that stems from this work is relevant and of interest to circadian clock field as how the regulation of the output of the circadian clock is implemented is still a major question in the field. This manuscript suggests a novel and plausible method for how, at least in part, this regulation occurs. However, the manuscript uses methods that do not measure degradation directly, which is a minor limitation. In addition, the mechanisms by which this regulation is imparted are not addressed in any meaningful way, even in the discussion.

      We are sorry that we did not adequately discuss the extensive previous work that has already addressed regulatory mechanisms. We would like to stress that this manuscript concerns protein turnover and proteome renewal, of which degradation is obviously an important part but not the sole focus.

      To communicate this more clearly, we have amended the title to:

      "Circadian regulation of macromolecular complex turnover and proteome renewal"

      ... which we previously explicitly predicted in the discussion of previous papers (Feeney et al., Nature, 2016; O'Neill et al., Nat Comms, 2020; Wong et al., EMBO J, 2022) and our recent review (Stangherlin et al., Curr Opin Syst Biol, 2021).

      With respect to measurement of degradation - Physiologically, cellular rates of proteasomal degradation are so intimately coupled with protein synthesis that, over circadian timescales, the former cannot meaningfully be studied in isolation. It is possible that the reviewer is alluding to historical methods that measure change over time in the presence of translational or proteasomal inhibitors, but these have long been known to introduce artifacts - because translational inhibition rapidly leads to reduced proteasome activity, whereas proteasomal inhibition rapidly reduces protein synthesis rates through the integrated stress response. We would be interested to hear of any more direct method for measuring protein degradation proteome-wide than the pulsed SILAC method we developed, as we are not aware of any. Even proteasomal proximity labelling coupled with MG132 treatment, recently developed by the Ori lab, does not directly measure degradation (bioarxiv https://www.biorxiv.org/content/10.1101/2022.08.09.503299v1). By definition, degradation can only be measured through the disappearance of something that was previously present, usually by comparing its rate of production with the change in steady state concentration (if any), which we have done using multiple methods.

      With respect to regulation of degradation - We speculated on the mechanisms regulating rhythms in protein turnover in our several previous papers (Feeney et al., Nature, 2016; O'Neill et al., Nat Comms, 2020; Wong et al., EMBO J, 2021; Stangherlin et al, Nat Comms, 2021), whereas outside the circadian field these mechanisms have been addressed extensively. This was also discussed in detail in our recent review on the topic (see Stangherlin et al., COISB, 2021). In this review, we lay out the evidence for a model whereby most aspects of circadian cellular physiology might be explained by daily rhythms in the activity of mammalian target-of-rapamycin complexes (mTORC). This model makes multiple predictions and informs the central hypothesis which is tested in the current manuscript: that circadian rhythms in complex turnover and proteome renewal should be prevalent over abundance rhythms. An enormous body of work over the last two decades has already clearly established mTORC1 as the master regulator of bulk protein synthesis and degradation, and a substantial number of independent observations have demonstrated circadian regulation of mTORC1 activity in vivo and in cultured cells. The mechanisms that drive cell-autonomous mTORC1 signalling are only partially understood (e.g. Feeney et al., Nature, 2016; Wu et al., Cell Metab, 2019), and we continue to explore this experimentally but they certainly lie well beyond the scope of this investigation.

      Therefore, to address the reviewer's concern about inadequate discussion of mechanism, we have expanded on mTORC in the introduction and discussion, as follows:

      Page 3: "Daily rhythms of PERIOD and mTORC activity facilitate daily rhythms of gene expression and protein synthesis. In particular, mTORC1 is a master regulator of bulk 5'-cap-dependent protein synthesis, degradation and ribosome biogenesis (Valvezan & Manning, 2019) whose activity is circadian-regulated in tissues and in cultured cells (Ramanathan et al, 2018; Feeney et al, 2016a; Stangherlin et al, 2021b; Mauvoisin et al, 2014; Jouffe et al, 2013; Sinturel et al, 2017; Cao, 2018). It is plausible that daily rhythms of mTORC activity underlie many aspects of daily physiology (Crosby et al, 2019; Stangherlin et al, 2021a; Beale et al, 2023b)."

      Page 17: "The mechanistic underpinnings for cell-autonomous circadian regulation of the translation and degradation machineries remain to be fully explored, but are likely to be driven by daily rhythms in the activity of mTORC: a key regulator of protein synthesis and degradation as well as macromolecular crowding and sequestration (Stangherlin et al, 2021b, 2021a; Cao, 2018; Adegoke et al, 2019; Ben-Sahra & Manning, 2017; Delarue et al, 2018). In particular, global protein synthesis rates are greatest when mTORC1 activity is highest, in tissues and cultured cells, whereas pharmacological treatments that inhibit mTORC1 activity reduce daily variation in crowding and protein synthesis rates (Feeney et al, 2016a; Lipton et al, 2015; Stangherlin et al, 2021b). Given our focus on proteomic flux and translation-associated protein quality control, autophagy was not directly within the scope of this study but is also mTORC-regulated and subject to daily regulation (Ma et al, 2011; Ryzhikov et al, 2019). In vivo, daily regulation of mTORC activity arises primarily through growth factor signalling associated with daily feed/fast cycles (Crosby et al, 2019; Byles et al, 2021). The mechanisms facilitating cell-autonomous circadian mTORC activity rhythms are incompletely understood but may include Mg.ATP availability (Feeney et al, 2016a) and its direct regulation by PERIOD2 (Wu et al, 2019). This will be an important area for future work."

      *Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: This is a very interesting and well written paper that addresses key questions in the circadian organization of proteostasis. The paper investigates origins of cellular circadian rhythms, invoking a premise early that there is a poor correlation between rhythmic gene expression - regulated by the canonical TTFL - and rhythms of the proteome, which are rather meager. Specifically, they ask how a relatively stable proteome is possible if cells engage in rhythms of cellular protein synthesis? Their hypothesis is that protein degradation must rhythmically compensate for rhythms of synthesis and much of the manuscript is focused on defining the relationship between rhythmic global synthesis and rhythmic degradation. They employ a series of detailed proteomic investigations and biochemical assessments of protein synthesis coupled with various circadian reporters to assess proteosome function. The proteomic experiments reveal a limited number of proteins with oscillations in either synthesis or abundance or both and no discernible pathway organization however, a followup and more refined study that utilized fractionated samples and boosted heavy SILAC identified strikingly, that many proteins in relatively heavy fractions are rhythmic and that these fall into possible complexes including ribosome and chaperonins. Finally, they perform in vivo experiments testing whether the timing of proteotoxic stimuli regulates the degree of the integrated stress response measured as pEif2a. Overall, I think that this is a fascinating paper that addresses and important question but falls short on mechanistically unifying them and completely contextualizing the findings in light of the canonical modes of circadian timekeeping leaving us with an important, but mostly descriptive set of findings. In addition, there are a number of important questions about data interpretation, some issues with data quality that should be addressed outlined below. With revision and further explication, this study will be an excellent addition to the growing field of circadian organization of the cellular proteome. *

      Thank you for reading and appreciating our work

      *Major and minor Comments. Figure 1. Fig 1a. The difference in Pulse and Chase at ZT24 does not appear to reflect the quantified data in 1b. This should be reconciled to make the figure convincing. *

      When working with radioactive cell lysates it is not possible to equalise the level of protein loaded on each gel beforehand as would happen with a western blot, for example. For this reason, the radioactive signal was normalised to the protein level subsequently measured by coomassie staining, as is standard practise for this type of assay, with all 4 replicates being shown in supplementary Fig.1A. An overnight phosphor screen image is presented in the main Fig.1A for illustrative purposes, but we take the point that this might not be immediately obvious. In revised Fig 1A we therefore now also show the relevant coomassie as well as labelling to make clear that the radioactive signal was normalised to protein levels.

      * How was the timing of the chase collection determined? *

      For these proof-of-principle experiments, we empirically determined the minimum duration of pulse and chase necessary to detect a quantifiable signal.

      *Fig 1d-e. What is the evidence that puro labeling results in 'rapid' turnover. *

      Apologies, this has been established for some time. Some additional papers are now cited in this section of the text (Liu et al, PNAS, 2012; Lacsina et al., PLoS One, 2011; Szeto et al., Autophagy, 2006)

      *Fig 1e seems to be missing the data from the treated and untreated conditions? How are the lines produced (e.g. linear versus rhythmic? Are these drawn lines or actual regressions?). *

      Fig 1e depicts the result of the experiment schematically explained in 1d. The only conditions were +Puro or +Puro+BTZ. There was no completely untreated condition, as puromycin incorporation is the basis of the assay (Lacsina et al., PLoS One, 2012; Szeto et al., Autophagy, 2006) and puromycin does not occur naturally in cells. We realise the figure could potentially be confusing without the associated raw data (anti-puromycin blots) - these are shown in supplementary Fig. 2A.

      To explain the method more clearly, the following has been added to the results section where this experiment is described:

      " As determined by anti-puromycin western blots, over two days under constant conditions, puromycin incorporation in the presence of BTZ showed significant circadian variation. In contrast, cells that were treated with puromycin alone showed no such variation, and nor did total cellular protein levels (Fig 1E, Fig S2A).”

      The fit lines are produced by statistical comparison of fits, i.e., our hypothesis (damped cosine fit) vs null hypothesis (no or constant change over time, linear fit, y = mx+c), using sum-of-squares F test. The statistically preferred fit is plotted and p-value displayed on the graph, i.e., the regression line of the preferred fit and parameters are plotted. These details are reported in the figure legends.

      * Why was 30 minutes chosen as labeling time? It seems hard to understand here how protein degradation kinetics can be measured by puromycin labeling if the authors' claim that puromycin labeling potentially changes degradation rates as a function - primary or secondary - of the labeling itself. It seems they are measuring the potential to degrade proteins. *

      Puromycin labelling is a 20 year-old widely-used technique that can be employed in a range of applications. It was first used in a circadian context by Lipton et al (Cell, 2015) whose work we quickly followed (Feeney et al, Nature, 2016). Briefly, puromycin mimics tyrosyl-tRNA to block translation by labelling and releasing elongating polypeptide chains from translating ribosomes. When used at low concentrations (1 ug/mL in this case) puromycin is sparsely and sporadically incorporated into a small minority of elongating polypeptide chains. Those prematurely terminated chains have puromycin at the C-terminus, which can be detected by western blotting. We chose 30 minutes after optimisation experiments, as it was the shortest incubation time where a robust signal could be observed in these cells with this concentration of puromycin. The puromycinylated peptides are preferentially degraded by the ubiquitin-proteasome system because they are efficiently recognised as misfolded/aberrant proteins by chaperones within tens of minutes of being translated. Unless used at much higher concentrations, or over much longer timescales, there is no reason to believe that puromycin affects the degradation machinery itself, but the degradation of puromycinylated peptides depends on the proteasome. Therefore, puromycin+a proteasome inhibitor provides a reliable proxy for translation rate in the preceding 30 minutes, whereas puromycin alone tells us the steady state concentration under normal conditions, i.e., where proteasomes remain active. By subtracting the latter from the former we can infer the level of degradation of puromycinylated peptides that must have occurred in the previous 30 minutes. It is not a perfect technique, but its results agree with other findings in this manuscript: that protein turnover varies more than steady state protein abundance. With respect to the potential to degrade proteins, this is measured in Fig 1C.

      * How do they determine that they are measuring degradation of functionally relevant proteins as opposed to a host of premature truncations? *

      We do not. This is measured by stable isotope labelling in Figures 2-4. Figure 1 provides the rationale for what follows in subsequent figures, i.e., proof-principle experiments suggesting that turnover is not constant over the circadian cycle. No single experiment in Figure 1 is expected to convince the reader that of circadian turnover. Rather, several independent methods suggest that bulk protein synthesis and degradation (turnover) are not constant over time, and deviate from the null hypothesis with variation that appears to change over the 24h circadian cycle.

      * Fig 1e bottom - again is this a true regression line? *

      It is not a regression line, otherwise a p-value of fit would be shown. Fig1e bottom shows the bioluminescence measured at each timepoint from parallel control cultures (average of triplicates, error bars shown as dotted lines). Due to very high temporal resolution (every 30 min) and robustness of the cell line, it appears as a virtually perfect damped (co)sine wave. We apologise that this was not explained more clearly in the figure legend, now amended as follows:

      "Parallel PER2::LUC bioluminescence recording from replicate cell cultures (mean +/- SEM, every 30 min) is shown below, acting as phase marker."

      *Perhaps two time points should be examined here - similar to the pulse chase performed with 35S labeling? *

      We are sorry we were not fully clear with our method here. The puromycin (+/- BTZ) labelling was performed over two days every 4h (so 12 timepoints in total), which can be inferred from the data points in the top two graphs in Fig. 1E, and x-axis - but is now also clearly stated in the figure legend. The bottom right graph was a continuous bioluminescence recording, integrated every 30 min from the set of parallel culture dishes. The bioluminescence data serves as a circadian phase marker, so that we can infer at which biological times synthesis and inferred turnover was higher vs lower.

      We’ve adjusted the text to explain our method more clearly:

      “Acute (30 min) puromycin treatment of cells in culture, with or without proteasomal inhibition (by bortezomib, BTZ), allowed us to measure both total nascent polypeptide production (+BTZ) and the amount of nascent polypeptides remaining when the UPS remained active (-BTZ). This allowed inference of the level of UPS-mediated degradation of puromycylated peptides within each time window, as a proxy for nascent protein turnover (Fig. 1D).”

      * Fig 1f. It appears that Puro labeling results in a rhythm between ZT1 and ZT13 but no statistic is provided and appears that the 'ns' is the results of variance in the data as opposed to difference in means? - would this not contradict the cellular result? What accounts for the rhythm reversal in the presence/absence of BTZ. *

      To be clear, we measured the level of puromycin incorporation in mouse liver in vivo following a similar method employed by Lipton et al, Cell, 2015 (Figure 2). The prediction was that, exactly as in cells (Fig 1E), treatment with a proteasome inhibitor would lead to a much greater increase in puromycinylated peptides at ZT13 than ZT1, because this is when protein synthesis is known to be higher and thus (we predict) protein degradation should also be higher. The experiment was not designed or powered to detect a time effect, it was designed to detect an interaction between time-of-puromycin treatment and BTZ, with the specific prediction being that BTZ would have a greater effect during the active phase. This is what we observed.

      * While the authors have previously demonstrated an increase in rhythmicity of the proteome in Cry1/Cry2 double knockout cells, it would have been welcome here to test a global loss of circadian transcription in the degradation assay. One might expect that these rhythms would also be even higher. What I am really asking is: what is the mechanism for rhythmic degradation and is it dependent on the canonical clock? *

      To address the reviewer's curiosity, we used the proteasome-Glo assay (also used in Fig 1C) to assess whether there was an interaction between genotype (WT vs CKO) and time at opposite phases of the circadian cycle over 2 days. We found a significant interaction by two-way ANOVA, indicating that components of the 'canonical clock' regulate the temporal organisation of proteasomal activity (see revised Figure S1). Circadian regulation of mammalian cellular functions, such as protein turnover, is a complex and dynamic process, whereas gene deletion affects the steady state and may be epistatic to phenotype rather than revealing gene function. We are therefore reluctant to speculate what this result means in the present manuscript, which is focused entirely on testing the hypothesis that global protein turnover and complex biogenesis have cell-intrinsic circadian rhythms in non-stressed, wild type cells.

      To communicate this, the text has been revised as follows:

      "Moreover, we detected a significant interaction between genotype and biological time when comparing trypsin-lik proteasome activity between wild type and Cryptochrome1/2-deficient cells, that lack canonical circadian transcriptional feedback repression (Fig S1B-E). "

      * **Fig 2. How was the 'fixed window' timeframe determined? *

      A trial experiment was performed with labelling windows of various length, and 6h was determined to be the shortest window where enough heavy label incorporation was detected to be able to assess circadian changes. This was the case with our first methodology, which was subsequently improved (Figure 3), and therefore labelling window reduced to 1.5h.

      * *Fig 3h. While admittedly difficult, the native PAGE is not of great quality and kind of unconvincing. Also not really sure why the AHA labeling is used here an nowhere else in the paper.

      AHA is a methionine analogue that is sparsely incorporated into polypeptide chains with minimal effect on protein function/structure (Dietrich et al., PNAS, 2006). Unlike puromycin, therefore, AHA does not lead to chain termination or protein misfolding/degradation (Dermit et al., Mol Biosyst, 2017). In Figure 1, the aim was to validate previous reports of rhythmic protein synthesis assess whether there was any evidence for rhythmic turnover. To this end, we employed two independent methods (35S-labelling and puromycin-incorporation). We did not want to rely on AHA for measuring turnover: although it has been validated and used for this purpose in some studies (McShane et al., Cell, 2016), AHA is not fully equivalent to methionine, and cellular aminoacyl-tRNA synthetases have much higher affinity to methionine than they do to AHA (Ma and Yates, Expert Rev Proteomics, 2018). It is thus impossible to perform AHA labelling without methionine-free medium, and in turn methionine starvation and media changes are known to have an effect on cell signalling and cell metabolism, which would be particularly pronounced in circadian context (over days rather than over hours).

      By contrast, in Fig 3H, we use AHA with native PAGE to specifically validate one inference from the mass spectrometry analyses: circadian production of high molecular weight protein complexes. This would not be possible with puromycin because prematurely terminated polypeptide chains are not able to assemble into native complexes unless chain termination happens to occur at the extreme C-terminus and the C-terminus does not partake in any intermolecular interactions within the assembled complex.

      The raw data (full gels, all replicates) are presented in Figure S2e, which of course was used for quantification. We have now picked a different example for the main figure, which hopefully allows for clearer representation.

      The text in the results section describing the AHA experiment is now amended as follows:

      " To validate these observations by an orthogonal method, we pulse-labelled cells with methionine analogue L-azidohomoalanine (Dieterich et al, 2006). AHA is an exogenous substrate, that cells have lower affinity to than methionine, and it could potentially impact on stability of the labelled proteins (Ma & Yates, 2018) – therefore, we only used AHA to assess nascent complex synthesis, rather than turnover. We analysed the incorporation of the newly synthesised, AHA labelled proteins into highest molecular weight protein species detected under native-PAGE conditions (Fig 3H, S3F). We observed a high amplitude daily rhythm of AHA labelling, indicating the rhythmic translation and assembly of nascent protein complexes. Taken together, these results show that daily rhythms in synthesis and degradation may be particularly pertinent for subunits of macromolecular protein complexes"

      Fig 4. I was a little disappointed here that the authors did not directly assess macromolecular assembly of at least one of their "hits" and demonstrate functional relevance and most of the analysis is maintained at a very superficial, systemic level. STRING assemblies are not terribly helpful without clear k-means clustering or some other clearly visualizable metric for stratifying and organizing the putative PPI data - this figure (S3) could be markedly improved.

      We agree that validation is important. The ribosome is by far the most abundant macromolecular complex in the cell, and was one of the major complexes to show clear evidence for circadian regulation of turnover, but not abundance, by our pSILAC proteomics. To validate this result, we took advantage of two important observations: (1) that all fully assembled ribosomes incorporate ribosomal RNA (rRNA) which can readily be separated from other cellular RNA by density gradient centrifugation; (2) pulse-labelling with heavy uridine-15N2 allows nascent RNA to be distinguished from pre-existing RNA. Thus, combining stable isotope labelling with ribosome purification, we can distinguish nascently assembled ribosomes from total when the RNA is extracted, digested with RNAse, and the % heavy/total UMP quantified by mass spectrometry. These data are presented in new figure 5, and are consistent with findings in Figures 3/4 that circadian regulation of ribosome turnover is prevalent over abundance, and that the phase of highest ribosome turnover coincides with the phases of high translation and turnover overall. We hope by addressing the reviewer's question by an entirely orthogonal method, they can share more confidence in our conclusions.

      The statistical metric for STRING, specifically the p-value for enrichment in physical protein-protein interactions, is presented in the main Fig. 3G. It is now also reported in the legend for new Figure S4 itself.

      * Is it possible that some macromolecular complexes have rhythms because their constituent proteins have differential half-lives when in one complex compared with another in circadian time? This possibility was not discussed. *

      To our knowledge, there is no evidence that any major macromolecular complex in the cell has a functionally significant rhythm in abundance on a cell-autonomous basis. The reviewer’s suggestion is an intriguing possibility, but we can think of no way that it could be measured, even in principle. The simplest interpretation of our data from the independent techniques we employ (pSILAC with fractionation, native PAGE + AHA incorporation) is a rhythm in synthesis.

      *Fig. 5. Why is the first histogram in 3c not at unity? *

      This measures the average fold-induction in aggregation when cells are treated with MG132 for 4h at the indicated timepoints. Unity would indicate no induction at all, so the presented quantifications show that MG132 always elicited an increase in aggregation, with an effect size that varied with circadian phase.

      * Do ZT24 and ZT48 differ, similarly do ZT36 and ZT60?*

      No, neither difference is statistically significant (adjusted p-values of p=0.9 and p=0.07, respectively). This is now specified in the figure legend. Tendency to aggregate is also likely to change as a function of time in culture, which is why we think there is a slight increase overall in the second day of the experiment.

      * Fig S4f is not of good quality with missing eIF2a total and therefore no loading controls. *

      Thank you for prompting us to double-check this. We found that the levels of eIF2a were quite variable between the animals, and therefore we performed this experiment with 6 biological replicates. We have double-checked the quantification, and have now excluded 3 unreliable samples (the ones with undetectable levels of total eIF2a – ZT18 +BTZ replicate 1 & ZT18 -BTZ replicate 2, as well as ZT6 +BTZ replicate 4, where a smear does not allow for a reliable quantification of phospho-eIF2a) instead of 2 that were excluded originally. This still leaves at least 5 biological replicates in each group. In fact, the difference between BTZ and control in ZT6 is now deemed to be even more significant, going down to adjusted p=0.0007.

      *S4e? true regression lines? *

      The same method was used as in Figure 1. The fit lines are produced by statistical comparison of fits, i.e. our hypothesis (damped cosine fit) vs null hypothesis (no change over time, linear fit), using sum-of-squares F test. The statistically preferred fit is plotted and p-value displayed on the graph. These details are reported in the figure legends and methods section.

      While I thought these experiments were effective, they did not tie back well to the rest of the paper. What are the consequences of a temporally sensitive ISR? Which pathways does it effect in circadian time? Here, the main holes in this study are somewhat exposed; namely, a lack of mechanistic depth in explaining the very fascinating, albeit mostly descriptive, findings. The implicit assumption made here is that aggregation is 'bad' but could the opposite be just as true? Taking these considerations in account would further strengthen the discussion.

      The purpose of (former) Fig 5 was entirely to test the functional consequences and potential translational relevance of a daily rhythm in protein turnover. The mechanisms upstream and downstream of the ISR, and link with many diseases, are already quite well understood but we apologise that we did not draw more heavily on the prior literature to provide sufficient context for this experiment. Protein aggregation has long been associated with proteotoxic stress, and we do not assume it is good or bad, we simply use it as an additional validation of a temporally sensitive ISR. To correct this omission we have added the following to the results section before these experiments are introduced:

      "Disruption of proteostasis and sensitivity to proteotoxic stress are strongly linked with a wide range of diseases (Wolff et al, 2014; Harper & Bennett, 2016; Labbadia & Morimoto, 2015; Hipp et al, 2019). Evidently, global protein translation, degradation and complex assembly are crucial processes for cellular proteostasis in general, so cyclic variation in these processes would be expected to have (patho)physiological consequences....

      ...Informed by our observations, we predicted that circadian rhythms of global protein turnover would have functional consequences for maintenance of proteostasis. Specifically, we expected that cells would be differentially sensitive to perturbation of proteostasis induced by proteasomal inhibition using small molecules such as MG132 and BTZ, depending on time-of-day."

      Reviewer #2 (Significance (Required)):

      This is a fascinating paper that addresses key questions in the circadian organization of the proteome. The paper's main findings are that rhythms of protein synthesis and degradation are temporally coordinated to maintain overall stability of the proteome in mouse fibroblasts. Furthermore, the authors present evidence that this temporal organization may be important for assembly of macromolecular complexes. While very interesting, the main limitations are a lack of biochemical and mechanistic explanation and evidence that verifies these, mostly descriptive, findings.

      The fundamental biochemical mechanisms of protein synthesis, degradation, protein quality control and stress response have been studied for decades and are increasingly well understood, at least in cultured cancer cells. What is not understood is the extent to which all of these essential cellular systems are subject to physiological variation over the circadian cycle in quiescent cells. This is the fundamental knowledge gap our study attempts to fill by testing the discrete hypotheses that (1) circadian regulation of macromolecular complex turnover is more prevalent than abundance and that (2) proteome renewal is more prevalent than compositional variation. We suggest that establishing these essential principles of circadian cellular physiology is an essential prerequisite for performing the type perturbational experiments we presume the reviewer would prefer. We would like to reassure the reviewer that such studies have been and are being performed, but we are concerned that the inclusion of a very extensive additional body of work within this manuscript would detract from the clear communication of our major finding that complex turnover and proteome renewal has a cell-autonomous basis.

      *There are some relatively minor statistical and data quality issues that are probably addressable relatively quickly.

      **Upon revision the study would be a welcome addition to investigators interested in proteostasis, circadian biology, cell biology and proteomics.

      **I am a physician-scientist with expertise in circadian rhythms, cell biology, protein synthesis, and biochemistry.

      **Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      **Seinkmane et al investigate circadian regulation of protein synthesis and degradation in cultured cells and in mice. Their main new finding is that protein synthesis and degradation are in many cases rhythmic but coordinated such that the proteome is rhythmically renewed without an apparent rhythm in total protein abundance. Particularly the pool of large protein complexes is rhythmically renewed in this fashion.

      Using pulsed SILAC in combination with mass spectrometry, the authors are able to distinguish between total and newly synthesized protein levels in mouse lung fibroblasts. Analysis of these data shows that the synthesis of a large number of proteins is rhythmic although the total amount is constant, or that proteins are synthesized at a constant rate but the total amount is rhythmic, suggesting that degradation is rhythmic. By analyzing macromolecular complexes, defined as a high-speed pellet, they also present evidence that the rhythmic components of large complexes oscillate in the same phase and have a similar protein turnover rate. The authors conclude that complexes assemble rhythmically. **The authors also present evidence that the activity of the proteasome oscillates in a circadian manner. Based on this observation, they show (in fibroblasts and in mice) that the response to proteotoxic stress (monitored by eIF2alpha phosphorylation levels, protein aggregation, and apoptosis) is higher at circadian times of high proteasome activity.

      **I am an expert in the circadian field, and the hypothesis and concept behind the work presented here are potentially very interesting, and the experimental design is in principle suitable to answer these questions. However, after reading the paper several times, I cannot find the set of experiments that would convincingly support the authors' conclusions.

      **Major questions/points:

      *The major limitation of the manuscript is that the conclusions rely heavily on statistical analysis and massive processing of data from a bewilderingly large number of very different experiments. In looking at the figures, I have often wondered if the presence or absence of a rhythm is real or a product of the heavily processed data. The fact that a cosine wave fits through data points better than a straight line does not necessarily mean that a circadian rhythm is present.

      We agree that comparison of fits alone does not provide sufficiently reliable evidence. However, the fact that many independent methods (cosinor, RAIN, ANOVA) yield similar overall findings lends more confidence to our findings. We would also argue that the large number of different experiments is a positive aspect of the paper and lends weight to the general conclusions. We instead ask the reviewer to consider an alternative question - we and many other labs have found no evidence for any change in total cellular protein content, and yet there is extensive evidence from independent labs for a 'translational rush hour' whilst (excepting some low abundance transcription factors) very few cellular proteins change by more than 10% over the circadian cycle (see Stangherlin et al, COISB, 2022 for extended discussion of this). We hypothesised a parsimonious explanation for this clear contradiction, and designed experiments whose data were analysed by widely used methods that yielded results that were consistent with prediction. Perhaps the reviewer will at least concede that, if the presented findings do not refute the hypothesis, it should not be rejected until a superior one is proposed?

      * I think that in particular, the SILAC experiment(s) should be repeated and also performed with an arrhythmic control (such as CRY1/2 KO). *

      Whilst we agree that CRY1/2 KO cells show no circadian regulation of transcription and much more variable rhythms in PER2::LUC activity than wild type controls (Putker et al., EMBO J, 2021), in our hands circadian rhythms in proteome composition and protein phosphorylation in CRY1/2 KO are at least as prevalent as in wild type cells (see Wong et al., EMBO J, 2022). Indeed, when we performed a proteasome activity assay in CRY1/2 KO fibroblasts, we observed there was an apparent circadian variation, similar to WT but with a different phase. These data are now presented in revised Figure S1. Similarly, Lipton et al (Cell, 2015) showed circadian translational rhythms in cultured Bmal1 KO cells (see final figure), therefore it is not clear what would constitute an appropriate 'arrhythmic' control.

      In this study, for proteomics experiments, we used a combination of SILAC and TMT, as each technique alone would not be sufficient to answer our specific questions. These two techniques are very resource-intensive on their own, and even more so in combination. We therefore had to prioritise and for the second SILAC-TMT experiment decided to focus on cellular fractionation and questions pertaining macromolecular complexes, which were directly relevant to our hypothesis. While it would undoubtedly also be interesting to study how canonical clock genes, such as Cry1/2, impact turnover on a proteome-wide scale, the focus of our study is physiological regulation of proteome composition, rather than the function of Cryptochrome genes which we already explored in previous work (Putker et al., EMBO J, 2021; Wong et al., EMBO J, 2022).

      Comparability between the whole cell and MMC SILAC experiments is also limited due to the different experimental conditions (6h vs. 1.5h pulse, +booster).

      We do not make any direct comparisons, other than to report that broadly comparable numbers of proteins were detected. Implicitly this means there must be greater coverage of protein complexes in the second pSILAC experiment, which our data bears out. If we were not to report the first experiment, the reader would not understand why we refined the method used in the second. In reporting the results of the 6h pulse, we make the limitations of this experiment very clear i.e. biased towards highly abundant, highly turnover proteins, irrespective of cellular compartment. We should add that even in this experiment there was a clear trend towards rhythmic turnover of ribosomal proteins, but this did not quite achieve significance (p = 0.07) and so we did not want to make claims beyond the data.

      *The essential and new message of the paper is that (at least some) macromolecular complexes undergo circadian renewal (degradation and synthesis). Rather than just analysing an operationally defined pellet fraction by mass spectrometry, this could be shown in more detail and directly for one or two specific macromolecular complexes. Ribosomes, for example, seem particularly suitable, because there would also be the very simple approach of measuring the synthesis of ribosomal RNA by pulse labelling. To me, such an analysis would be perfectly sufficient as a proof of principle. I would then omit aspects such as rhythmic stress response, since many additional experiments are needed to demonstrate this convincingly. *

      Thank you for the excellent suggestion, we agree that validation is important. The ribosome is by far the most abundant macromolecular complex in the cell and was one of the major complexes to show clear evidence for circadian regulation of turnover, but not abundance, by our pSILAC proteomics. To validate this result, we took advantage of two important observations: (1) that all fully assembled ribosomes incorporate ribosomal RNA (rRNA) which can readily be separated from other cellular RNA by density gradient centrifugation; (2) pulse-labelling with heavy uridine-15N2 allows nascent RNA to be distinguished from pre-existing RNA. Thus, combining stable isotope labelling with ribosome purification, we can distinguish nascently assembled ribosomes from total ribosomes when the RNA is extracted, digested with RNAse, and the ratio of light to heavy UMP quantified by mass spectrometry. These data are presented in new figure 5, and are consistent with findings in Figures 3/4 that circadian regulation of ribosome turnover is prevalent over abundance, and that the phase of highest ribosome turnover coincides with the phases of high translation and turnover overall. We hope by addressing the reviewer's question by an entirely orthogonal method, he/she can share more confidence in our conclusions.

      The final figure is included because it tests predictions that were informed by the preceding experiments. It is not intended to be comprehensive exploration of how the integrated stress response changes with the circadian cycle, nor have we claimed this.

      * Specific points: The reader is strongly influenced by the cosine wave or straight lines in the graphs (e.g. 1c, e, 3h, 5b, etc) produced by the analysis of rhythmicity, which basically only gives a yes or no answer. But it is not really that simple. If the algorithm detects a rhythm what is its period? Is it the same as the period of the luciferase reporter? If the period lengths correlate, do the phases as well (e.g. see differences in phases 1c and e)? These questions are not addressed. *

      The temporal resolution of the time course data is much lower than the luciferase reporter and so the error of the fit is greater (usually 1-2h). For the cosine wave curve fit and the associated extra sum-of-squares F test, the period of the oscillation was fixed at either 24h or 25h, as determined from a parallel PER2::LUC control recording. This is now explicitly stated in the methods section

      In terms of phase, the general trend across all experiments is that bulk protein turnover, synthesis and degradation is higher during the 6-8h following the peak of PER2::LUC than at any other point in the circadian cycle. This is also consistent with our previous findings in mouse and human cells (Feeney et al, Nature, 2016; Stangherlin et al., Nat Comms, 2021) as well as findings from many different labs in vivo (e.g. Janich et al., Genome Res, 2016; Atger et al., 2015, PNAS; Sinturel et al., 2017, Cell). We are cautious about trying to be any more specific than this because each assay is measuring something different, and (as can be seen across the figures) there is also some modest variation in the phase of PER2::LUC between experiments, with respect the prior entraining temperature cycle (this will be reported in our forthcoming publication, Rzechorzek et al, in prep). To address the reviewer's point therefore, we have added the following to the discussion:

      "Across all experiments in this study, we find that protein synthesis, degradation and turnover is highest during the 6-8h that follow maximal production of the clock protein PER2. This is coincident with increased glycolytic flux and respiration (Putker et al, 2018), increased macromolecular crowding in the cytoplasm, decreased intracellular K+ concentration and increased mTORC activity (Feeney et al, 2016a; Stangherlin et al, 2021b; Wong et al, 2022)."

      * **The algorithm in Fig 1c predicts a rhythm for the chymotrypsin-like and the trypsin-like but not for the caspase-like activity. The peptide assay measures core proteasome activity independent of ubiquitylation and should therefore be dependent on proteasome concentration in the sample. How can then only two of the three proteasomal activities be rhythmic? Please elaborate and repeat with arrhythmic cells (e.g. CRY1/2 KO). The period length does not seem to correlate with the one of the reporter. Why is that? *

      The arrhythmic controls idea is partially addressed in the response above. We did perform a proteasome activity assay in CRY1/2 KO fibroblasts, and observed daily variation similar to WT, albeit with a different apparent phase. These data are now shown in Figure S1, and referred to in the main text as follows:

      "Moreover, we detected a significant interaction between genotype and biological time when comparing trypsin-like proteasome activity between wild type and Cryptochrome1/2-deficient cells, that lack canonical circadian transcriptional feedback repression (Fig S1B-E)".

      Besides this study, our two previous proteomic investigations of the fibroblast circadian proteome detected no biologically significant or consistent rhythm in proteasome subunit abundance (Wong et al., EMBO J, 2021; Hoyle et al., Science Translational Medicine, 2017). Moreover, proteasomes are long-lived stable complexes whose activity is determined by a combination of substrate-level, allosteric and post-translational regulatory mechanisms that includes their reversible sequestration into storage granules (Albert et al., PNAS, 2020; , Fu et al., PNAS, 2021; Yasuda et al., Nature, 2020). It is therefore very likely that the observed rhythm in trypsin- and chymotrypsin-like activity occurs post-translationally. Proteasome subunit composition is also known to change, which might be another reason for differences between the protease activities (Marshall and Vierstra, Front Mol Biosci, 2019; Zheng et al., J Neurochem, 2012).

      To communicate this succinctly, we have revised the relevant text as follows:

      Page 7: "Moreover, we detected a significant interaction between genotype and biological time when comparing trypsin-like proteasome activity between wild type and Cryptochrome1/2-deficient cells, that lack canonical circadian transcriptional feedback repression (Fig S1B, (Wong et al, 2022)). Previous proteomics studies under similar conditions have revealed minimal circadian variation in proteasome subunit abundance (Wong et al, 2022), suggesting that proteasome activity rhythmicity, and therefore rhythms in UPS-mediated protein degradation, are regulated post-translationally (Marshall & Vierstra, 2019; Hansen et al, 2021)."

      Regarding period length, we apologise for an oversight in Fig 1c: unlike all other experiments presented here, these fits were originally done with a flexible period length (between 20h and 36h). This has now been re-fitted in a similar manner to the other experiments (fixed period of 24h, same as the parallel PER2::LUC controls), and the updated data are presented. This has not influenced the results of the statistical tests (only changed the p-values slightly, but the significance levels remain the same).

      Fig. 1a,b suggest that there is a rhythm in global protein synthesis with a significant peak at 40h. Yet, Fig. 1e suggests otherwise. How can that be? Also, the degradation graph (lower panel 1c) has to be plotted with the ratios calculated from the data points and not the heavily processed fitted graphs. This can be very misleading.

      Fig1a,b was performed under quite different conditions to 1e. As described in the methods section, 35S-labelling experiments require a medium change during both pulse and chase (to replace normal Met with radioactive Met, and vice versa). To avoid growth factor/mTORC1-mediated stimulation of protein synthesis & turnover, these acute media changes must occur in the absence of serum; otherwise media changes would introduce artifacts. In contrast, puromycin labelling (Fig 1e) is performed without any media changes (as puromycin can be added directly to culture cell media), and therefore was performed in normal culture conditions of 10% serum. Thus, due to its well-established effect of growth factor/mTORC1 signalling on bulk translation rate, it is very likely that differences in the phase of translational rhythms between Fig1a,b and 1e are attributable to differing serum concentrations – this phenomenon of serum-dependency of phase is also described in Beale et al, 2023, bioRxiv https://doi.org/10.1101/2023.06.22.546020. The only important point, is that neither of these proof-of-principle experiments support the null hypothesis: that translation rate and turnover remains constant over the circadian cycle. Thus, the hypothesis being tested in Figure 1 is not rejected, and provides the rationale for the subsequent proteome-wide analyses.

      With respect to 1E, given the variance of measurement, the curve fits to Puro and Puro+BTZ already serve to test whether there is any significant ~24h component, a ratio of the respective data points would simply compound the error of measurement. The degradation plot is provided purely for illustrative purposes to help the reader i.e. if these fits were true, what would be expected? We have revised the figure to more clearly communicate that the degradation plot is presented purely as a visual aid, labelled “inferred”, and now show ratio plots in revised Figure 1.

      * **It also strikes me as odd that the amplitude of degradation increases (peak at 28h lower than at 30h) while the amplitude of the core clock oscillation dampens over time (peak at 54h higher than at 53h due to desynchronisation. Only two data values around 54h are responsible for the detected rhythm (2nd peak). Furthermore, phase and period do not agree with the rhythm of proteolytic activities shown in 1c. How can this be explained? *

      Due to the nature of the experiment, the degradation rate inferred from Figure 1B & 1E does not reflect proteasome activity exclusively. Rather it reflects the combined sum of processes that remove nascently produced proteins from the cell's digitonin-soluble fraction, which includes proteasomal degradation, but also autophagy, protein secretion and sequestration into other compartments. Therefore, the peak degradation in Fig 1B & E would not necessarily be expected to coincide with the peak of proteasome activity in Fig 1C. Again, these experiments in Figure 1 simply serve to test the hypothesis (change over circadian cycle) vs the null hypothesis (no change over the circadian cycle).

      To the question of amplitude increase, we speculate that this is due to metabolic changes in cultures over the course of three days – as serum and nutrients from the last medium change at T0 are depleted, cells need to increase degradation to promote turnover and recycling. As we suggest that the rhythms in turnover help cellular bioenergetic efficiency, it is quite plausible that amplitude increases as nutrient-concentrations fall. We are in process of further investigation into how exactly these rhythms vary with nutrient and serum status.i

      * Regarding the MS data shown in Figure 2, is it possible to show a positive / quality control? Best would be MS data of Luciferase (or PER2,3, RevErb/alpha, DBP) to show oscillation of protein levels with the same phase and period as the reporter. *

      Unfortunately, none of these low abundance transcription factors were detected in our MS runs. This is not surprising, given that their copy numbers are estimated at * In Fig. 2c examples of the 4 groups of proteins presented in 2e should be shown (both synthesis and total abundance arrhythmic, either one rhythmic or both rhythmic) and not just what appears to be random examples of rhythmic and arrhythmic proteins. *

      As also requested by another reviewer, we have revised the figure to include examples of each of the rhythmicity categories. No specific meaning is inferred from the chosen protein identities.

      Is it possible at all to distinguish between synthesis/turnover and assembly/disassembly of macromolecular complexes in the MMC SILAC experiment? If so, how?

      We followed the established protocol originally developed in our collaborator Kathryn Lilley's lab, where it has previously been shown that most proteins in the MMC fraction are in macromolecular assemblies (Geladaki et al, Nat Commun, 2019). Proteins that are rhythmically abundant in this fraction, but without an accompanying synthesis rhythm (e.g. Beta-actin, see Hoyle et al., Sci Trans Medicine, 2017) can be reliably assumed to arise solely from rhythmic assembly/disassembly i.e. they are captured in this fraction when assembled, but lost, and therefore not detected, in this fraction when disassembled. However, in the case of rhythmic synthesis and abundance, it is not possible with this technique to directly infer that rhythmic synthesis of a given protein is responsible for its rhythmic assembly in a complex, though they do correlate.

      Therefore, our new figure 5 (with thanks again for this suggestion) approaches this by an orthogonal method, relying on the important observations that a) ribosomes incorporate ribosomal RNA (rRNA) b) this can be readily separated from most other cellular RNA by density gradient centrifugation and c) pulse-labelling with heavy uridine-15N2 allows nascent RNA to be distinguished from pre-existing RNA. Using this technique, we validate a rhythm in production and assembly of mature ribosomes, with its peak consistent with the highest turnover time as measured in Figs 1 and 3, and MMC fraction proteomics (Supplemental table 3), at the descending phase of PER2::LUC.

      * **Looking at Fig. 4b,c, what is the fraction of rhythmic proteins from the MMC experiment that also oscillate in either synthesis, total abundance or both in the whole cell? Is there a general correlation at all? Please show. *

      There were no correlations greater than would be expected by chance (the sets of proteins rhythmic in either synthesis or degradation did not overlap significantly between whole-cell and MMC fractions, as determined by an odds ratio test).

      To communicate this we have added the following text:

      "It is also worth noting that although there were small sets of proteins that were rhythmic in both whole-cell (Figure 2) and MMC fractions (Figure 3), in both synthesis and total abundance, none of these four overlaps were higher than would have been expected by chance."

      * **Why is the phase of the oscillating proteins different in the two experiments (compare Figs. 2f,g and 4a) and does either of them match with the phase of the PER2::LUC reporter, which should be the peak synthesis phase of the clock? *

      This was a labelling error on our part, our apologies and thanks for drawing it to our attention. We had attempted to harmonise all these phase values so that they were mutually comparable between the two mass spec experiments, but omitted to update all the figures. They have now all been updated to be inter-consistent. From our experiments, the peak of PER2::LUC consistently precedes the timing of maximum bulk translation. This phase difference is, at least in part, attributable to the inactivation kinetics of firefly luciferase (see Feeney et al., J Biol Rhythms, 2016), i.e., under conditions of saturating luciferin substrate, PER2 protein abundance peaks several hours later than PER2::LUC activity when measured in longitudinal live cell assays.

      * Regarding the sensitivity to MG132 in Fig. 5b it doesn't make sense that, while eIF2alpha phosphorylation is arrhythmic in untreated cells and the levels of eIF2alpha phosphorylation are (apparently) not exhibiting a rhythmic change by administration of MG132 at different circadian timepoints, the ratio of P-eIF2alpha with and without MG132 suddenly is. Please show in Fig. S4b quantifications of the individual experiments with and without MG132. What is presented in 5b is after all the ratio of ratios of quantifications of Western blots, each of which individually does not display any appreciable rhythm. For me this is two much of processing of data. In my opinion, the MG132 4h acute treatment must show a detectable rhythm.*

      We apologise for being unclear in this panel and description. Our hypothesis concerned the fold-induction of the p-eIF2alpha:eIF2alpha ratio changing as a function of MG132 and time. Our reasoning being that the ratio may be more biologically-relevant as it is the relative change that cells sense and respond to, and not the absolute abundance of p-eIF2alpha. We applied a quantitative, two-channel fluorescent antibody technique to enable detection and quantification of p-eIF2alpha and eIF2alpha from each replicate at each time point from the same band of the same blot. We agree that no p-eIF2alpha rhythm is evident from a cursory inspection of any of the blots. This is due to the innate variance between dishes in extracted protein concentration, as well as the levels of basal eIF2alpha and its phosphorylation, and is the reason that we took great pains to be as quantitative as possible using the two-channel immuno-detection (LICOR). Due to the natural and stochastic variation in eIF2alpha levels and extraction between replicates and over time, it is difficult to get identical eIF2alpha loading to reveal the overlying rhythm in p-eIF2alpha, and furthermore, identical loading would give a misleading impression of the level of temporal variation of eIF2alpha levels. Quantification reveals temporal variation in the MG132 treated samples but not in the untreated controls (Supp Fig 5A) – suggesting that there may be circadian regulation of the cellular response to MG132 challenge, rather than a cell-autonomous p-eIF2alpha rhythm under basal conditions. We quantified fold-induction from MG132 vs untreated to present in Figure 6A. We have presented all the raw data in supplementary figure 5 for readers to validate through their own analysis.

      *Minor:

      In Fig. 1f please show dot blot with error bars as well as the individual experiments in the supplementals. Please check the graph legend (N>=3?) *

      Thank you for pointing out these omissions. The dot blot with error bars is now shown in Fig. 1F, and the full gels are now included as Fig. S2B. The main figure legend for 1f has also had the following added (explaining the N numbers):s

      "Four mice were used per condition, but in some cases one of the four injections were not successful i.e. no puromycin labelling was observed and so no quantification could be performed (full data in Fig. S2B)."

      * Please explain the mechanism of the "booster" used in the second SILAC experiment. *

      The following has been revised in the text:

      " Namely, we added a so-called booster channel: an additional fully heavy-labelled cell sample within a TMT mixture (Klann et al, 2020). When the mixture is analysed by MS, heavy peptides from the booster channel increase the overall signal of all identical heavy peptides at MS1 level; at MS2 and MS3 this results in improved detection of heavy proteins in the other TMT channels of interest, and is particularly advantageous for the proteins with lower turnover that would fall below the MS1 detection limit without the booster."

      *

      **p10 3rd paragraph: S2e not S3e *

      Thank you, this has been fixed.

      p12 last paragraph please add reference to Figs. 5f,g

      Thank you, this has been added.

      *Reviewer #3 (Significance (Required)): *

      xxxxx

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Seinkmane et al investigate circadian regulation of protein synthesis and degradation in cultured cells and in mice. Their main new finding is that protein synthesis and degradation are in many cases rhythmic but coordinated such that the proteome is rhythmically renewed without an apparent rhythm in total protein abundance. Particularly the pool of large protein complexes is rhythmically renewed in this fashion.

      Using pulsed SILAC in combination with mass spectrometry, the authors are able to distinguish between total and newly synthesized protein levels in mouse lung fibroblasts. Analysis of these data shows that the synthesis of a large number of proteins is rhythmic although the total amount is constant, or that proteins are synthesized at a constant rate but the total amount is rhythmic, suggesting that degradation is rhythmic. By analyzing macromolecular complexes, defined as a high-speed pellet, they also present evidence that the rhythmic components of large complexes oscillate in the same phase and have a similar protein turnover rate. The authors conclude that complexes assemble rhythmically. The authors also present evidence that the activity of the proteasome oscillates in a circadian manner. Based on this observation, they show (in fibroblasts and in mice) that the response to proteotoxic stress (monitored by eIF2alpha phosphorylation levels, protein aggregation, and apoptosis) is higher at circadian times of high proteasome activity.

      I am an expert in the circadian field, and the hypothesis and concept behind the work presented here are potentially very interesting, and the experimental design is in principle suitable to answer these questions. However, after reading the paper several times, I cannot find the set of experiments that would convincingly support the authors' conclusions.

      Major questions/points:

      The major limitation of the manuscript is that the conclusions rely heavily on statistical analysis and massive processing of data from a bewilderingly large number of very different experiments. The critical experiments lack replicates and obvious controls. In looking at the figures, I have often wondered if the presence or absence of a rhythm is real or a product of the heavily processed data. The fact that a cosine wave fits through data points better than a straight line does not necessarily mean that a circadian rhythm is present. I think that in particular, the SILAC experiment(s) should be repeated and also performed with an arrhythmic control (such as CRY1/2 KO). Comparability between the whole cell and MMC SILAC experiments is also limited due to the different experimental conditions (6h vs. 1.5h pulse, +booster). The essential and new message of the paper is that (at least some) macromolecular complexes undergo circadian renewal (degradation and synthesis). Rather than just analysing an operationally defined pellet fraction by mass spectrometry, this could be shown in more detail and directly for one or two specific macromolecular complexes. Ribosomes, for example, seem particularly suitable, because there would also be the very simple approach of measuring the synthesis of ribosomal RNA by pulse labelling. To me, such an analysis would be perfectly sufficient as a proof of principle. I would then omit aspects such as rhythmic stress response, since many additional experiments are needed to demonstrate this convincingly.

      Specific points:

      The reader is strongly influenced by the cosine wave or straight lines in the graphs (e.g. 1c, e, 3h, 5b, etc) produced by the analysis of rhythmicity, which basically only gives a yes or no answer. But it is not really that simple. If the algorithm detects a rhythm what is its period? Is it the same as the period of the luciferase reporter? If the period lengths correlate, do the phases as well (e.g. see differences in phases 1c and e)? These questions are not addressed.

      The algorithm in Fig 1c predicts a rhythm for the chymotrypsin-like and the trypsin-like but not for the caspase-like activity. The peptide assay measures core proteasome activity independent of ubiquitylation and should therefore be dependent on proteasome concentration in the sample. How can then only two of the three proteasomal activities be rhythmic? Please elaborate and repeat with arrhythmic cells (e.g. CRY1/2 KO). The period length does not seem to correlate with the one of the reporter. Why is that?

      Fig. 1a,b suggest that there is a rhythm in global protein synthesis with a significant peak at 40h. Yet, Fig. 1e suggests otherwise. How can that be? Also, the degradation graph (lower panel 1c) has to be plotted with the ratios calculated from the data points and not the heavily processed fitted graphs. This can be very misleading. It also strikes me as odd that the amplitude of degradation increases (peak at 28h lower than at 30h) while the amplitude of the core clock oscillation dampens over time (peak at 54h higher than at 53h due to desynchronisation. Only two data values around 54h are responsible for the detected rhythm (2nd peak). Furthermore, phase and period do not agree with the rhythm of proteolytic activities shown in 1c. How can this be explained?

      Regarding the MS data shown in Figure 2, is it possible to show a positive / quality control? Best would be MS data of Luciferase (or PER2,3, RevErb/alpha, DBP) to show oscillation of protein levels with the same phase and period as the reporter. In Fig. 2c examples of the 4 groups of proteins presented in 2e should be shown (both synthesis and total abundance arrhythmic, either one rhythmic or both rhythmic) and not just what appears to be random examples of rhythmic and arrhythmic proteins.

      Is it possible at all to distinguish between synthesis/turnover and assembly/disassembly of macromolecular complexes in the MMC SILAC experiment? If so, how? Looking at Fig. 4b,c, what is the fraction of rhythmic proteins from the MMC experiment that also oscillate in either synthesis, total abundance or both in the whole cell? Is there a general correlation at all? Please show.

      Why is the phase of the oscillating proteins different in the two experiments (compare Figs. 2f,g and 4a) and does either of them match with the phase of the PER2::LUC reporter, which should be the peak synthesis phase of the clock?

      Regarding the sensitivity to MG132 in Fig. 5b it doesn't make sense that, while eIF2alpha phosphorylation is arrhythmic in untreated cells and the levels of eIF2alpha phosphorylation are (apparently) not exhibiting a rhythmic change by administration of MG132 at different circadian timepoints, the ratio of P-eIF2alpha with and without MG132 suddenly is. Please show in Fig. S4b quantifications of the individual experiments with and without MG132. What is presented in 5b is after all the ratio of ratios of quantifications of Western blots, each of which individually does not display any appreciable rhythm. For me this is two much of processing of data. In my opinion, the MG132 4h acute treatment must show a detectable rhythm.

      Minor:

      In Fig. 1f please show dot blot with error bars as well as the individual experiments in the supplementals. Please check the graph legend (N>=3?)

      Please explain the mechanism of the "booster" used in the second SILAC experiment.

      p10 3rd paragraph: S2e not S3e

      p12 last paragraph please add reference to Figs. 5f,g

      Significance

      xxxxx

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This is a very interesting and well written paper that addresses key questions in the circadian organization of proteostasis. The paper investigates origins of cellular circadian rhythms, invoking a premise early that there is a poor correlation between rhythmic gene expression - regulated by the canonical TTFL - and rhythms of the proteome, which are rather meager. Specifically, they ask how a relatively stable proteome is possible if cells engage in rhythms of cellular protein synthesis? Their hypothesis is that protein degradation must rhythmically compensate for rhythms of synthesis and much of the manuscript is focused on defining the relationship between rhythmic global synthesis and rhythmic degradation. They employ a series of detailed proteomic investigations and biochemical assessments of protein synthesis coupled with various circadian reporters to assess proteosome function. The proteomic experiments reveal a limited number of proteins with oscillations in either synthesis or abundance or both and no discernible pathway organization however, a followup and more refined study that utilized fractionated samples and boosted heavy SILAC identified strikingly, that many proteins in relatively heavy fractions are rhythmic and that these fall into possible complexes including ribosome and chaperonins. Finally, they perform in vivo experiments testing whether the timing of proteotoxic stimuli regulates the degree of the integrated stress response measured as pEif2a.

      Overall, I think that this is a fascinating paper that addresses and important question but falls short on mechanistically unifying them and completely contextualizing the findings in light of the canonical modes of circadian timekeeping leaving us with an important, but mostly descriptive set of findings. In addition, there are a number of important questions about data interpretation, some issues with data quality that should be addressed outlined below. With revision and further explication, this study will be an excellent addition to the growing field of circadian organization of the cellular proteome.

      Major and minor Comments.

      Figure 1.

      Fig 1a. The difference in Pulse and Chase at ZT24 does not appear to reflect the quantified data in 1b. This should be reconciled to make the figure convincing. How was the timing of the chase collection determined? Fig 1d-e. What is the evidence that puro labeling results in 'rapid' turnover. Fig 1e seems to be missing the data from the treated and untreated conditions? How are the lines produced (e.g. linear versus rhythmic? Are these drawn lines or actual regressions?). Why was 30 minutes chosen as labeling time? It seems hard to understand here how protein degradation kinetics can be measured by puromycin labeling if the authors' claim that puromycin labeling potentially changes degradation rates as a function - primary or secondary - of the labeling itself. It seems they are measuring the potential to degrade proteins. How do they determine that they are measuring degradation of functionally relevant proteins as opposed to a host of premature truncations? Fig 1e bottom - again is this a true regression line? Perhaps two time points should be examined here - similar to the pulse chase performed with 35S labeling? Fig 1f. It appears that Puro labeling results in a rhythm between ZT1 and ZT13 but no statistic is provided and appears that the 'ns' is the results of variance in the data as opposed to difference in means? - would this not contradict the cellular result? What accounts for the rhythm reversal in the presence/absence of BTZ.

      While the authors have previously demonstrated an increase in rhythmicity of the proteome in Cry1/Cry2 double knockout cells, it would have been welcome here to test a global loss of circadian transcription in the degradation assay. One might expect that these rhythms would also be even higher. What I am really asking is: what is the mechanism for rhythmic degradation and is it dependent on the canonical clock?

      Fig 2. How was the 'fixed window' timeframe determined?

      Fig 3h. While admittedly difficult, the native PAGE is not of great quality and kind of unconvincing. Also not really sure why the AHA labeling is used here an nowhere else in the paper.

      Fig 4. I was a little disappointed here that the authors did not directly assess macromolecular assembly of at least one of their "hits" and demonstrate functional relevance and most of the analysis is maintained at a very superficial, systemic level. STRING assemblies are not terribly helpful without clear k-means clustering or some other clearly visualizable metric for stratifying and organizing the putative PPI data - this figure (S3) could be markedly improved.

      Is it possible that some macromolecular complexes have rhythms because their constituent proteins have differential half-lives when in one complex compared with another in circadian time? This possibility was not discussed.

      Fig. 5.

      Why is the first histogram in 3c not at unity? Do ZT24 and Zt48 differ, similarly do ZT36 and ZT60? Fig S4f is not of good quality with missing eIF2a total and therefore no loading controls. S4e? true regression lines?

      While I thought these experiments were effective, they did not tie back well to the rest of the paper. What are the consequences of a temporally sensitive ISR? Which pathways does it effect in circadian time? Here, the main holes in this study are somewhat exposed; namely, a lack of mechanistic depth in explaining the very fascinating, albeit mostly descriptive, findings. The implicit assumption made here is that aggregation is 'bad' but could the opposite be just as true? Taking these considerations in account would further strengthen the discussion.

      Significance

      This is a fascinating paper that addresses key questions in the circadian organization of the proteome. The paper's main findings are that rhythms of protein synthesis and degradation are temporally coordinated to maintain overall stability of the proteome in mouse fibroblasts. Furthermore, the authors present evidence that this temporal organization may be important for assembly of macromolecular complexes. While very interesting, the main limitations are a lack of biochemical and mechanistic explanation and evidence that verifies these, mostly descriptive, findings.

      There are some relatively minor statistical and data quality issues that are probably addressable relatively quickly.

      Upon revision the study would be a welcome addition to investigators interested in proteostasis, circadian biology, cell biology and proteomics.

      I am a physician-scientist with expertise in circadian rhythms, cell biology, protein synthesis, and biochemistry.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript "Circadian regulation of protein turnover and proteome renewal" investigates the role of protein degradation in the circadian control of proteostasis. The researchers suggest that the relatively static levels of protein levels in a cell are incongruent with the known oscillation in protein synthesis. They therefore hypothesize that there should be a compensatory mechanism to counteract rhythmic protein synthesis, rhythmic protein degradation. To investigate this, they employ bulk pulse chase labeling to study the process of degradation. They identify a synchronization between the creation and turnover of proteins in a cell, implying the clock helps to maintain homeostasis through a novel mechanism. They note that these phases align with energy availability, granting a plausible reasoning behind the biological implementation of this regulation. In summary, this is a sound manuscript that adds to the research field. The experiments in this manuscript are well thought out, organized, and explained. In general, the authors do not go further in their conclusions than I think is warranted given the data that they have, though I think that there are some key items that should be addressed before the publication of this manuscript.

      Major notes:

      1. In figure 1, a clearer idea of what the ** means would be appreciated. What was the standard of significance for this measure?
      2. In Figure 1b, it is important to note clearly in the text that the this is not a direct measure of protein degradation, but a subtractive proxy. Though I don't think that necessarily makes the authors conclusions incorrect, the same result could also be obtained if an extra 15% of the proteins were moved into the insoluble fraction. This is the same for Figure 1E and F.
      3. In figure 1c, is the noted oscillation in protease activity due to the oscillation of these proteins? What are the predicted mechanisms behind this? I don't think that this is necessarily within the scope of this paper but should be addressed in the discussion. Also, the peak degradation rate from Figure 1B is 4 hours before the peak enzyme activities. How can this observation be reconciled?
      4. For the pSILAC analysis, the incorporation scheme has a six-hour window between the comparison of the light and heavy peptides. This makes it somewhat difficult to assess whether you are looking a clock effect from T1 or T1+6. This does not negate the findings, but it does question when the synthesis is occurring and what is being compared, which I think should be more clearly discussed in the manuscript. This is discussed later in the manuscript but should be mentioned in this section.
      5. There are no error bars on figure 2C. What the pSILAC just done in a singlet? If so, the rhythms estimation is likely a large overestimate and should be noted.
      6. Why were the genes selected in 2C? these are not discussed anywhere else in the manuscript.
      7. The authors note that for Figure 2 "These observations are consistent with widespread rhythmic regulation of protein degradation." However, only 5-10% of the proteome is oscillating at any level and less with a discrepancy between synthesis and abundance, so "widespread" is an exaggeration and this statement should be limited to the degradation in the rhythmic proteome.
      8. The authors note that their more developed strategy in figure 3 would allow for the detection of less abundant proteins. However, they do not discuss that they in fact found less proteins overall, or if they were able to detect proteins of lower abundance. This is of some concern in determining if this is indeed the better method that they predict. How can the authors reconcile this issue? How can they rationalize this explains their increase in oscillating elements?
      9. In the comparison of complex turnover rates, the authors need to provide a metric that backs their statement that "the majority of component subunits not only showed similar average heavy to total protein ratios but also a similar change in synthesis over the daily cycle" for figure 3F.
      10. In reference to the AHA incorporation, why is the hypothesis not that, like the puramycin, you would not see oscillation unless you add BTZ? Shouldn't the active degradation regulate the incorporation of AHA such that there is no visible rhythm unless you suppress degradation?
      11. The authors claim that there is enrichment of the actin cytoskeleton, but where this data can be found should be explained. The only thing that is shown is a few selected graphs of proteins in this pathway.
      12. The authors note an oscillation in the total levels of p-eif2, commenting that these do not arise from the rhythms in total eif2a but temperature and feeding rhythms. However, unless I misunderstood, this work was done in fibroblast cell culture, so in this case, where would these temperature and feeding rhythms come from?
      13. In Figure 5d, the treatment impeding degradation is causing cell death while the inhibition of translation does not. However, wouldn't too much, or not enough, translation, without compensatory regulation from degradation cause a problem in the same way that degradation does?

      Significance

      The information that stems from this work is relevant and of interest to circadian clock field as how the regulation of the output of the circadian clock is implemented is still a major question in the field. This manuscript suggests a novel and plausible method for how, at least in part, this regulation occurs. However, the manuscript uses methods that do not measure degradation directly, which is a minor limitation. In addition, the mechanisms by which this regulation is imparted are not addressed in any meaningful way, even in the discussion.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Abe et al. generated multiple novel mouse model that follows and labels Col4a2 by fusing it with GFP. The creation of this mouse model allowed for the analysis of the basement membrane in real-time using live imaging of embryonic explant cultures. The combination of the mouse model with live imaging revealed new insights into how the basement membrane turnover occurs during early hair follicle development. After setting the stage the authors establish its utility with embryonic tissue, which revealed membrane buds that could be visualized with eGFP at different time intervals for 19h. The growth of the hair follicle bud during that time could be divided three distinct areas, the upper, lower stocks, and the tip. Where the tip and lower stalk increased in BM length, while the upper stalk decreased. BM length correlated with increased cellular replication. Using a double transgenic model system, they evaluated the differences between 'cell displacement', 'BM expansion', and 'cell autonomous movement'. Diving deeper with the Col4a2-mKiGR mouse model investigated Col4a2 turnover to reveal similar results mentioned above. Lastly MMP inhibitors were used to investigate how hair follicle growth is affected, which revealed inhibition of cellular replication and inhibition of Col4a2 turnover. These results inhibited hair follicle elongation which made the forming hair follicles thicker.

      Overall, this is an interesting paper for hair follicle biologists since it investigates the early moments of the budding follicle in real time using a ex vivo culture model. In addition, there is additional appeal because the authors use live imaging the hair follicle neogenesis as a basement membrane model, which is interesting and novel. The manuscript is clearly written, and the data support the overall conclusions. My comments below are to help in clarity which may be used to develop clearer figures and additional text.

      Major:

      In Figure 2 for the cell displacement and expansion part. I found the figure and text confusing, particularly in regard to how the data shows the bleached edge of the BM moving alongside the cells. Could the authors make this clearer in the figure and in the writing of the text?

      The authors created a Col4a2-mKikGR mouse line that is supposed to follow Col4a2 turnover. It is difficult to understand how the authors can claim that it was turnover with the explanation in the text of how the mouse model works. Could the authors write a better description of the mouse model in the text?

      Significance

      The strengths of the paper are the use of a novel mouse model that can track collagen (Col4a2) with a tagged GFP and the live imaging model system. This has led to a novel and important knowledge on the interplay between the BM and cells. The limitation of the manuscript is that the entire manuscript utilizes a single protocol (live imaging) to investigate BM dynamics. This might be due to the sophisticated nature of the model systems, but it is a limitation.

      To my knowledge the study is novel because of the mouse model to track Col4a2 dynamics with GFP. This model led to some interesting findings about hair follicle development in that different regions expand differently with the BM. This is good foundational knowledge using the latest state of the art techniques in live imaging of embryonic tissue culture systems.

      The audience that would be interested in this manuscript is broad, which would span cell and developmental biology but also specialists in membrane and protein biology that can finally see in real time the dynamics of Col4a2 building skin and hair.

      My expertise in in hair follicle development and repair and stem cell biology.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors successfully generated a knock-in mouse line expressing fluorescence-tagged endogenous collagen IV. Homozygous mice exhibited normal survival with no apparent developmental abnormalities. Through imaging of the back skin of COL4A2-eGFP mice in ex vivo culture, the authors observed a faster expansion rate of the basement membrane (BM) at the growing tips of developing hair follicles, while the upper stalk region showed a slower BM expansion rate. Real-time imaging also revealed that the BM and basal epithelial cells moved in the same direction. Fluorescence recovery after photobleaching (FRAP) and the photoconvertible protein mKikGR experiments demonstrated a faster turnover rate of COL4A2 at the tips of the hair follicles. The authors used MMP inhibitors to suppress the recovery of COL4A2-eGFP, BM expansion, and observed abnormalities in the developmental process of hair follicle morphology. The findings are novel and interesting, some aspects of how the experiments and quantifications were conducted should be clarified to allow the readers to fully appreciate the results.

      Major Comments:

      1. BM Expansion Statistics: Regarding the statistical analysis of BM expansion, is the length of the BM affected by the z-axis plane at the time of imaging? How is it ensured that the BM length measured at different time points corresponds to the same z-axis?
      2. Cell Movement Statistics (Figure 2): In the statistical analysis of cell movement in Figure 2, the lack of other references raises questions about ensuring that the cells measured at different time points are the same cells.
      3. The authors concluded that the inhibition of MMP delayed the recovery of COL4A2-eGFP after photobleaching, indicating the crucial role of MMP activity in the incorporation of COL4A2 into the BM. Does MMP inhibition lead to a reduction in the synthesis of the COL4A2 protein, thereby delaying the recovery of COL4A2 in the BM?

      Minor Comments:

      In Figure 1D, the representative image and the statistical graph are inconsistent. For example, in the upper stalk region, all basal epithelial cells have Ki67 signals according to the representative image.

      Significance

      The authors successfully generated knock-in mice expressing fluorescence-tagged endogenous collagen IV. With this novel mouse model, researchers can directly observe the dynamic changes in the extracellular matrix of different organs in mammals during development and disease, revealing the potential roles of the extracellular matrix.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Wuergezhen et al. generated two fluorescently tagged Col4a2 mouse models, including EGFP-Col4a2 and the photoconvertible mKikGR-Col4a2. They showed that tagged collagen IV get incorporated in the basement membranes of various tissues in the developing and adult mice, and fully support embryonic development, fertility, and physiological functions. The authors followed up investigating Col4a2 dynamics during embryonic development of the hair follicle. Using FRAP, they showed that the basement membrane expands with a faster rate near the tip region. This coincided with faster turnover rates of Col4a2 at the tip relative to the lower and upper stalk. In addition, the authors demonstrated that Col4a2 turnover depends on MMPs.

      Major comments:

      • Are the key conclusions convincing?

      The major conclusions of the manuscript are convincing. These include that tagging collagen IV does not compromise its function, differential expansion of the basement membrane, differential turnover of the Col4a2, the MMP dependence for normal basement membrane expansion and turnover. However, some claims detailed below need to be clarified or addressed to improve the manuscript. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      In Figure 2B-C, the authors conclude that basement membrane expands at different rates, depending on the region with higher expansion rates near the hair follicle tip. From their methods, they conducted repetitive photobleaching cycles every 2-3 hours to maintain the bleached status of ROIs and tracked changes in basement membrane length. However, it sounds challenging to repetitively photobleach the same ROI in a dynamically expanding tissue, which may compromise the accuracy of length measurements. It would be helpful if the authors could provide movies corresponding to these experiments for clarity. In addition, this repetitive bleaching should be highlighted in the figure and figure legend, as readers could get confused about why there is no recovery here when comparing to the FRAP experiment.

      In Lines 180-198 and Figure 2F-H, the authors interpreted cell movement as contributed by cell-autonomous vs basement membrane expansion, which is speculative. Another possibility is that cell movement was all autonomous, while basement membrane expansion was caused by mechanical stretching that was in turn caused by cell proliferation. This conclusion should be rephrased.

      Following MMPi treatment, the authors found that basement membrane expansion was halted (Figure 4C-D). Yet, they later showed that hair follicles widened under MMPi treatment (Figure 4E-J). Under these conditions, I would have expected such widening to be accompanied by basement membrane expansion at the lower or upper stalk, which is not the case, at least for the first 7 hours (Figure 4C-D). How do the authors interpret this? - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Considering the rapid expansion of BM near the tips, I wonder whether the authors have explored possible structural differences of the basement membrane in different regions? Is the basement membrane near the tips thinner and/or does it have microperforations as reported in other systems? Looking into this may further support their observations, not only in control conditions but also in MMPi treated follicles. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      Yes, for possible structural changes, 3D rendering of fluorescence imaging as shown in Figure 1J may be sufficient.

      Increasing sample number using existing mouse strains should also be feasible. It would take 2-3 weeks from setting up timed pregnant mice to imaging the embryonic skin explants. Adding image analysis and figure preparation, it should be doable within 1 month. - Are the data and the methods presented in such a way that they can be reproduced?

      In Figure 2D/ Figure 5C, it is unclear how ROIs corresponding to tip/lower/upper stalk are being drawn for quantification of Ki67+ cells.

      The images in 2B and 4C, 3A and 4E, are identical. Images should not be re-used in figures. This raised the concern that probably not sufficient samples were imaged to have different representative images. It should also be clarified where the data was re-used, for example, the control data in 2C and 4D, 3B and 4B. - Are the experiments adequately replicated and statistical analysis adequate?

      A few experiments have low sample numbers while the data was quite variable. These include Figure 2G-H (n = 3 cells), Figure 3C (unknown sample number), Figure 4B (n = 3 hair follicles), Figure 5B (n = 3 hair follicles). More sample numbers (~10 total) should be included to solidify the findings.

      For the mKikGR experiment in Figure 3C, quantification should be included. A ratiometric measurement of green/red fluorescence over time should be a good complementary way of demonstrating region-specific recovery.

      In Figure 5B, the authors claim that the percentage of dividing cells in control follicles being different from that of MMPi-treated follicles. How are they extracting these percentages? From the plots, control and MMPi-treated columns do not appear to be normalized as 100% to make such comparisons. Moreover, having two mean+/-SD in each column makes these data confusing to interpret. The authors should consider replotting their data either by combining their data into a unique population per conditions and reporting the percentages, or alternatively, they may consider splitting each of these columns into two (i.e., dividing vs. non-dividing cells), and comparing both conditions as ratios of dividing versus/non-dividing cells.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Already addressed above. - Are prior studies referenced appropriately?

      I believe so. <br /> - Are the text and figures clear and accurate?

      All plots measuring basement membrane length are labeled as 'increase in BM length', even when in some cases the BM length is reduced. The authors should consider relabeling these as 'BM length change' or something similar.

      The second and third paragraphs of the discussion are too long and should be condensed into a single paragraph. - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Use better color combination for visualization of multi-channel images. For instance, magenta/green is a better combination than red/green for the color blind. Color combinations are not consistent across figures in the manuscript.

      Include movies for all data derived from live imaging.

      Include statistical tests used for all plots, some are missing.

      The authors should consider fitting their FRAP data in each condition and report percentages corresponding to the mobile and immobile fractions.

      Examples of horizontal vs perpendicular cell division appear to be mislabeled in Video5.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.
      • The large molecular weight of extracellular proteins makes it challenging to generate genetically engineered versions of such proteins to investigate their function and dynamics in vivo. The present study has addressed such issues for Col4a2, a major component of the basement membrane. This study further provides insights towards the understanding of BM dynamics during embryonic organ development.
      • Place the work in the context of the existing literature (provide references, where appropriate).
      • Addressed above
      • State what audience might be interested in and influenced by the reported findings.
      • Cell and developmental biology, extracellular matrix biology, organ development and regeneration.
      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.
      • Tissue morphogenesis, extracellular matrix
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript has potential interest as a preclinical study for glioblastoma treatment. There is a substantial amount of data that is promising, but there are numerous issues that will require additional experimental effort before publication.

      Major concerns:

      1. The title is overly general and uninformative. The authors should include the drug name (mubritinib) and the specific tumour type (glioblastoma).
      2. The concept that mubritinib functions through metabolic effects is not surprising given recent publications (PMID: 37382244; PMID: 35429141; PMID: 33245718; PMID: 31287994) and that it impacts the blood-brain barrier (PMID: 36178590). However, there are limitations to the strength of this observation. Most of the experiments are associations between drug treatment and metabolic changes. The NDI1 partially rescue experiments in Figures 1h and 1i are nice but show that NDI1 expression itself increases OCR. This experiment is also performed over a very brief window of time. A better set of experiments would include measurement of cell number over a prolonged time course (Figure 2d has one time point) and to use a genetic targeting strategy against ETC complex 1.
      3. The authors observe that EGFR expressing lines are more sensitive to mubritinib. As the rescue experiments are only partially effective and ERBB family members may be targeted by mubritinib, it is critical to address the effects of mubritinib on EGFR activation and perform rescue studies, as the application of mubritinib in patients may be guided by the EGFR mutational state.
      4. The differences found with NSC responses is interesting but needs to be developed. Why are there differences in mubritinib responses? For example, do NSCs not require ETC complex 1 as much?
      5. I would suggest that the authors also compare sensitivity of the BTSCs (I would suggest a change in nomenclature as these are only from GB) and differentiated tumour cells to determine if the stem cells have greater dependence. Please use similar culture conditions.
      6. The differences in cell cycle are useful but the mechanism is lacking. The claim that self-renewal is drastically or markedly changed is overstated. The ELDAs are not striking. There is no evidence that stemness is a direct target.
      7. The in vivo effects on cell biology need greater analysis in mechanism. I am also not sure why the authors switch lines tested in different assays.
      8. The mechanism of interaction with radiation is not developed. What is happening here? Are there changes in DNA damage repair or simply growth? This is a nice observation that could be better developed.

      Minor concerns:

      1. Grammar needs attention.
      2. Please remove the overuse of "strikingly", "drastically", "importantly", etc. Most of these descriptions are overstated.
      3. The number of in vivo replicates needs to be addressed.
      4. All gene expression data should be deposited. All raw data (numeric) should be made available.
      5. Please replace all normalized data with raw data. The statistical testing was likely incorrectly performed, and this can give rise to false conclusions. I am particularly concerned about the normalization to cell numbers.

      Significance

      This manuscript has potential interest as a preclinical study for glioblastoma treatment. There is a substantial amount of data that is promising, but there are numerous issues that will require additional experimental effort before publication.

      Major concerns:

      1. The title is overly general and uninformative. The authors should include the drug name (mubritinib) and the specific tumour type (glioblastoma).
      2. The concept that mubritinib functions through metabolic effects is not surprising given recent publications (PMID: 37382244; PMID: 35429141; PMID: 33245718; PMID: 31287994) and that it impacts the blood-brain barrier (PMID: 36178590). However, there are limitations to the strength of this observation. Most of the experiments are associations between drug treatment and metabolic changes. The NDI1 partially rescue experiments in Figures 1h and 1i are nice but show that NDI1 expression itself increases OCR. This experiment is also performed over a very brief window of time. A better set of experiments would include measurement of cell number over a prolonged time course (Figure 2d has one time point) and to use a genetic targeting strategy against ETC complex 1.
      3. The authors observe that EGFR expressing lines are more sensitive to mubritinib. As the rescue experiments are only partially effective and ERBB family members may be targeted by mubritinib, it is critical to address the effects of mubritinib on EGFR activation and perform rescue studies, as the application of mubritinib in patients may be guided by the EGFR mutational state.
      4. The differences found with NSC responses is interesting but needs to be developed. Why are there differences in mubritinib responses? For example, do NSCs not require ETC complex 1 as much?
      5. I would suggest that the authors also compare sensitivity of the BTSCs (I would suggest a change in nomenclature as these are only from GB) and differentiated tumour cells to determine if the stem cells have greater dependence. Please use similar culture conditions.
      6. The differences in cell cycle are useful but the mechanism is lacking. The claim that self-renewal is drastically or markedly changed is overstated. The ELDAs are not striking. There is no evidence that stemness is a direct target.
      7. The in vivo effects on cell biology need greater analysis in mechanism. I am also not sure why the authors switch lines tested in different assays.
      8. The mechanism of interaction with radiation is not developed. What is happening here? Are there changes in DNA damage repair or simply growth? This is a nice observation that could be better developed.

      Minor concerns:

      1. Grammar needs attention.
      2. Please remove the overuse of "strikingly", "drastically", "importantly", etc. Most of these descriptions are overstated.
      3. The number of in vivo replicates needs to be addressed.
      4. All gene expression data should be deposited. All raw data (numeric) should be made available.
      5. Please replace all normalized data with raw data. The statistical testing was likely incorrectly performed, and this can give rise to false conclusions. I am particularly concerned about the normalization to cell numbers.
    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript Burban et al explore the effect of the mitochondrial oxidative phosphorylation (OXPHOS) inhibitor Mubritinib on patient derived glioblastoma stem cells and murine xenografts. The authors first show that i) Mubritinib is an inhibitor of OXPHOS in brain tumor stem cells (BTSC), ii) that it impairs cell growth and self-renewal of patient-derived BTSC with different genetic background and iii) has an effect on the expression of genes related to stemness. In addition, the authors convincingly show that Mubritinib is a brain penetrant drug, and by transplanting luciferase expressing BTSC into the brain of immunodeficient mice they show it delays GB tumorigenesis and the animal lifespan (either alone, or more efficiently if combined with IR treatment). Finally, by performing toxicological and behavioral studies in mice models, Burban et al demonstrate that Mubritinib has a well-tolerated and safe profile and does not induce damage to healthy cells.

      The manuscript is well written and organized and the data is clearly presented. The results are convincing, but a few additional experiments and controls would be beneficial to support the claims of the paper, most of which are easily addressable.

      Main comments:

      • Finding suitable control cells for BTSC experiments is a widely acknowledged challenge in the field. However, in line with other studies, it is recommended that the authors consider using a non-oncogenic NSC as control line to demonstrate that the effects reported in Figure 1 and Figure 2 are more pronounced in BTSC compared to NSC (as it was done in Suppl Fig3).
      • Figure 5 presents a significant finding indicating that Mubritinib enhances the sensitivity of GB tumors to IR. Considering that Temozolomide (TMZ) is the primary chemotherapy drug for GB patients, it would be crucial to investigate the potential outcomes of a combined treatment involving Mubritinib and TMZ. This will help determine if the combination exhibits promising results, comparable to what is demonstrated in Figures 5d, 5g, and 5i for Mubritinib and IR. Such experiment would reinforce the drug's potential for clinical trials in GB treatment.

      Minor comments:

      • The authors should specify early in the paper what is the number of samples of patient-derived BTSC they use and the fact that their genetic mutations are known (this information is summarized in the supplemental table 1, but only reported later in the manuscript). This information is important and should be clearly stated at the beginning of the manuscript.
      • In Figure 2a the inhibition at 20nM is significant but not very pronounced. Based on Figure 1 I would have expected to see a stronger effect at this concentration range. Can the authors comment/provide an explanation for this discrepancy?
      • The EdU incorporation experiment presented in Figure2h-I should be repeat with lower concentrations of the drug (in most of the assays the effect of Mubritinib is detectable at much lower concentrations).
      • Since the authors have done RNA-seq on the samples why don't they report the specific subtypes of their samples in the text and in Suppl Table 1 (Proneural, Neural, Classical or Mesenchymal) ? It is known that different molecular subtypes respond differently to treatments; therefore this information would be essential to understand if Mubritinib is effective on a wide range of GB subtypes.
      • In Figure2b and Supplemental Figure1b-c : instead of correlating the effect with genetic mutations, it would be more relevant if the authors could correlate the data with the molecular subtypes inferred by RNA-seq (see my comment above)
      • Regarding the RNA-seq experiment the authors should report what is the percentage (and numbers) of genes that change expression. Is there for example a preference for up- or down- regulation? It would be interesting to see a Gene Ontology (GO) analysis for the up-regulated genes versus a GO analysis of the down-regulated genes to confirm that the relevant categories show dysregulation as expected (e.g. enrichment for cell cycle and stemness genes in the down-regulated list, etc ).
      • In Figure2 m-o the difference between CTL and Mubritinib treated cells do not seem substantial, although it is shown as statistically relevant. Can the authors specify the percentage to be able to better assess the differences?
      • Add p-value for Figure3a-c
      • In the western blot in Supplemental Figure 3b Vinculin shows twice
      • Change" Given that Mubritinib is already completed a phase I clinical trial" into "...has completed..."

      Referees cross-commenting

      I agree with other reviewers that more data is needed to determine if mubritinib could be an effective treatment for various GB subtypes. The models used in this study do not encompass the full spectrum of GBM genetics. The authors should repreat the experiments using models that represent the major genetic/transcriptional subtypes of GBMs and clearly label and identify them in the study. Specifically, the authors should include models like 'classical/EGFR-amplified', 'mesenchymal', 'proneural/PDGFR amplified'. Alternatively, it is advisable to refrain from asserting that mubritinib is effective across genetic alterations in the manuscript.

      Significance

      Considering the limited effectiveness of existing treatments, it is crucial to explore alternative approaches to improve patient outcomes. This study demonstrates promising potential for the clinical translation of Mubritinib in GB treatment.

      A major limitation of this study is the narrow numbers of patient-derived samples used and absence of a proper control cell line. Unfortunately, as evidenced by the existing literature in the field, selecting a control cell line for glioma stem cells research is challenging due to the unknown cell-of-origin for this type of tumor. In addition, all the toxicity/safety tests were performed in mice models and it is difficult to predict how this would translate into human patients. However, the fact that a phase I clinical trial has already been completed for Mubritinib (in the context of a different type of tumor) is encouraging.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors report the use of mubritinib, a drug targeting complex I of the mitochondrial electron transport chain, to halt the proliferation of brain tumor stem cells (BTSCs) isolated as neurospheres from glioblastomas. They demonstrate that mubritinib crosses the blood-brain barrier, and show that this drug delays GBM tumorigenesis and extends lifespan in mouse models that were generated by transplantation of human BTSCs or mouse cell lines. They also provide evidence that this potentially harmful drug is well-tolerated by mice.

      The ability of mubritinib (initially conceived as a ERBB2 inhibitor) to block complex I of the electron transport chain was previously identified in acute myeloid leukemia, where this drug proved to selectively inhibit a subset of cases relying on oxidative phosphorylation (OXPHOS). However, the use of mubritinib is novel in GBM, a lethal malignancy for which a few therapeutic options are available, and no substantial progress has been reported since 2005. In addition, the authors show that accumulation of mubritinib in the brain tissue allows for the use of a reduced dose of mubritinib, thereby reducing the risk of the deleterious effects that blunt enthusiasms about targeting of mitochondrial respiration.

      However, this manuscript presents significant weaknesses that are detailed below. In particular, the models used appear insufficient to support the conclusion, stated in the abstract, that the drug can be effective in GBMs with different oncogenic mutations. Based on the provided evidence, the claim that 'mubritinib potently impairs stemness and growth of patient-derived BTSCs harboring different oncogenic mutations' should be removed, or supported by using BTSCs that adequately represent the major GBM subtypes identified by genetic and transcriptional analysis. Specifically, BTSCs harboring EGFR amplification (displayed by 40% of GBMs) would need to be added. Equally, it is inappropriate to conclude (Discussion, page 14) that the models used, displaying EGFR mutations, are representative of widespread EGFR alterations: indeed the EGFR alteration observed in 40-50% of patients is EGFR amplification, whose pathogenic effects can not be recapitulated by the EGFR mutation harbored in the models used. If models harboring EGFR amplification are unavailable to these authors, making unrealistic to repeat the experiments in at least two different EGFR-amplified BTSCs in the timeframe allowed for revision, not only the claim that mubritinib inhibits BTSCs harboring different oncogenic mutations should be removed, but lack of experiments in EGFR-amplified BTSCs should be discussed as a limitation of this study. In addition, as detailed below, in vitro experiments should be expanded to corroborate mechanistic aspects of the drug, safety should be better demonstrated and some aspects of in vivo experiments should be clarified. Overall, the methodology seems sufficiently detailed to allow reproduction. The experiments are adequately repeated and the statistical analysis is appropriate, but in one case detailed below.

      Major Point N.1

      Figure 2a and 2c. Although statistically significant, the effect of mubritinib at 20 nM is biologically of limited significance in a subset of BTSCs, where the drug reduces viability by less than 25% (Fig. 2a). Therefore, the correlation between oxygen consumption rate (OCR) and mubritinib sensitivity (% of live cells), shown in Fig. 2c, should be presented for all mubritinib doses, and particularly for the dose of 500 nM, which is utilized in subsequent experiments. Additionally, it would be useful to display the correlation not only between viability and basal OCR but also with maximal OCR. This analysis could identify varying levels of sensitivity to ECT inhibition, aligning with the expectation that different BTSCs may exhibit varying degrees of dependency on mitochondrial OXPHOS. This would suggest that different GBMs may require different dosages of the drug. In all the experiments presented in the manuscript, the authors use the lines exhibiting the highest mubritinib sensitivity (BTSC 53, BTSC73), which might not be representative of all GBMs. This selection bias need explicit clarification in the text.

      Major Point N.2.

      In Fig. 2b, the statistics is significantly biased as it is calculated based on technical replicates, rather than on a significant number of independent models featuring either wild-type or mutated EGFR. Presented in this manner, this analysis is unacceptable. Additionally, as noted previously, the models used in this manuscript do not represent the overall GBM genetics, particularly due to lack of EGFR amplified models, which correspond to 40% of cases, and cannot be recapitulated by EGFR mutations. In general, the number of models is too small to draw any conclusion regarding the relationship between genetics and mubritinib sensitivity (including conclusions concerning TP53, or the MGMT status, shown in supplementary figures 1b-c). If the authors intend to claim that GBMs are sensitive to mubritinib independently of the genetic status, they should repeat their experiments by using models representative of the major genetic/transcriptional subtypes of GBMs, by clearly characterizing and identifying them in the experiments: e.g. 'classical/EGFR-amplified'; 'mesenchymal'; 'proneural/PDGFR amplified'. Otherwise (more realistically), it is suggested to remove claims that mubritinib is effective independently of genetic alterations throughout the manuscript (including the abstract and the discussion).

      Major Point N.3

      Fig. 2k-l present a transcriptional analysis with questionable representativeness as it is performed on the single line BTSC147 from a recurrent GBM, which is unlikely to represent primary GBMs. The analysis appears overly descriptive and fails to add significant information beyond the observation that mubritinib induces a proliferative arrest, as assessed in biological experiments. Additionally, the claim that the Neftel 'Neural Progenitor Cell' signature is altered in a biologically significant manner after only 24 hours of mubritinib treatment seems questionable. As such, this analysis should be moved to supplementary information. A more intriguing alternative would be to compare groups of BTSCs that exhibit high or low sensitivity to mubritinib and attempt to identify gene sets that can correlate with and possibly contribute to explain differences in drug sensitivity.

      Major point N. 4

      Figure 3. LDA need to be measured at longer timepoints (14-21 days vs. 7 days shown).

      Major point N. 5

      Supplementary Figure 3c aims to demonstrate that neural stem cells are unaffected by mubritinib merely by showing stem markers in western blots, which is insufficient. To provide convincing evidence, an LDA should be performed using human Neural Progenitor Cells.

      Major Point N. 6

      Page 9. The mechanistic nexus between OXPHOS inhibition and radiosensitization described by the authors remains unclear. In particular, the link between enrichment in OXPHOS proteins observed in recurrent vs.primary GBMs on the one hand, and downregulation of homologous recombination related-pathways on the other hand is difficult to grasp. The authors should endeavor to more clearly explain how mubritinib can interfere with the adaptive response to ionizing radiation, thereby providing a rationale for experiments combining the two treatments. As noted in the discussion (page 14), 'targeting mitochondrial respiration is an emerging strategy to overcome radioresistance in the tumour hypoxic areas'. Thus, a plausible mechanism of mubritinib-induced radiosensitization may involve reducing oxygen consumption, thereby leaving more oxygen available for diffusion and improving radiation response (by increasing generation of reactive oxygen species). To provide convincing mechanistic evidence, the authors should include in vitro experiments assessing the ability of mubritinib to radiosensitize BTSC in both normoxic and hypoxic conditions. LDA or radiobiological clonogenic assays showing the effect of combination treatment on stem cell frequency are recommended.

      Major Point N. 7

      Fig. 5d. Concerning the scheme of in vivo treatment, it is unclear why irradiation is administered 5 days after the beginning of mubritinib treatment, considering that mubritinib reaches its peak brain concentration much earlier, as shown in Fig. 4d. Furthermore, if mubritinib alone is effective against the tumor, comparing tumors that have been treated with IR alone at the same time-point as those treated with IR + mubritinib seems inappropriate. This is because, in the latter scenario, IR is applied to tumors that are likely reduced in volume compared to those treated with vehicle prior to IR. This discrepancy could introduce bias in the evaluation of the combined treatment's efficacy.

      Major Point N. 8.

      Figure 6b. The methodology employed to measure the effect of mubritinib on human neural progenitor cells should be the same as that used for BTSC. Moreover, a positive control (a treatment inducing death, such as bosentan for hepatocytes) needs to be added.

      Minor points

      Minor point N.1

      Please use consistent units (nM or uM) for mubritinib.

      Minor point N.2

      ND1 expression in transduced cells should be shown by western blot (in Supplementary Figures).

      Significance

      This manuscript reports preclinical results on the use of mubritinib, a drug targeting the mithochondrial electron chain transport complex 1, to halt proliferation of glioblastoma (GBM) stem cells in vitro and treat experimental GBMs generated by stem cell transplantation. Given that the standard therapy for GBM still relies on a limited number of conventional options, evidence demonstrating the preclinical effectiveness of mubritinib could exert a significant translational impact. Mubritinib has not yet been proposed for GBM treatment, but there is convincing evidence that the drug may be effective in subsets of acute myeloid leukemia that rely on oxidative phosphorylation (Baccelli et al., Cancer Cell 36:84, 2019. PMID: 31287994). Data provided in this manuscript on potential effectiveness of mubritinib are overall convincing. However, in its current form, the manuscript present major limitations regarding the representativeness of models used, which do not support the claim that mubritinib could be universally useful in GBM, and regarding mechanistic aspects of mubritinib combination with radiotherapy. These and other aspects could be addressed through additional experiments.

      The audience interested in the reported findings includes preclinical and clinical neuro-oncologists. My field of expertise is biology and genetics of glioblastoma stem cells and generation of in vitro and in vivo GBM preclinical models.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary Maintenance of the histone H3 variant CENP-A at centromeres is necessary for proper kinetochore assembly and correct chromosome segregation. The Mis18 complex recruits the CENP-A chaperone HJURP to centromeres to facilitate CENP-A replenishment. Here the authors characterise the Mis18 complex using hybrid structural biology, and determine complex interface separation-of-function mutants.

      Major Comments The SAXS and EM data on the full-length Mis18 components must be included in the main Figures, either as an additional figure or by merging/rearranging the existing figures. The authors discuss these results in three whole paragraphs, which are a very important part of the paper.

      We thank the reviewer for this constructive suggestion. We have now included an additional figure (new Fig. 2, attached below), that highlights the fit of the integrative model against the SAXS and EM data.

      Could the authors also compare the theoretical SAXS scattering curves generated by their final model(s) with the experimental SAXS curves? This would provide some additional evidence for the overall shape of their complex model beyond the consistency with the Dmax/Rg.

      We acknowledge the importance of this suggestion. We have now compared the theoretical SAXS scattering curve of the Mis18a/b core complex (named Mis18a/b DN), which lacks the flexible elements (disordered regions and the helical region flexibility connected to the Yippee domains). The theoretically calculated SAXS scattering curve of the model matches nicely with the experimental data with c2 value of 1.36. This data is now included in new Fig. 2 (Fig. 2f) and is referenced on page 9 line 21.

      Minor Comments

      While the introduction is clearly written, an additional cartoon schematic, representing the system/question would be helpful to a non-specialist reader to interpret the context of the study.

      We have now included a cartoon in the revised Fig. 1 to support the introduction on centromere maintenance and the central role of the Mis18a/b/BP1 complex in this process. Please find the new Fig. 1 below.

      No doubt the authors had a reason for choosing their figure allocation, but I wonder if more material couldn't be brought from the supplementary into the main figures?

      As addressed in our response to one of the major comments, we have now moved key CLMS, SAXS and EM data from the supplemental figure into the main figure, new Fig. 2.

      Page 6 "Mis18-alpha possesses an additional alpha-helical domain" - please make it clear in addition to what (I assume it's in addition to Mis18-beta).

      Apologies for the lack of clarity. We have now rephrased this sentence to highlight that this difference is in comparison with Mis18b on page 6 line 15.

      Page 7 - Report the RMSD of the Pombe vs. Human Mis18-alpha yipee structures?

      The S. pombe Mis18 Yippee structure superposes on to the Human Mis18a Yippee domain with an RMSD of 0.92 angstroms with is now mentioned on page 7 line 9.

      Page 7 - "We generated high-confidence structural models...." is there a metric for the confidence as reported by RaptorX? Perhaps includinging the PAE plots in the supplementary for the AlphaFold generated models would be useful?

      We thank the reviewer for the valid suggestion. We have now included the PAE plot corresponding to the AlphaFold model in the supplementary Fig. S1d and reference on page 7 line 18. RaptorX ranks models based on estimated error. We have now included this information in the new figure legend for Supplementary Fig. S1.

      Figure 1 - Perhaps label figure 1b as being experimentally determined, with the R values (as for Figure 1d), and 1c being a predicted model.

      We have included Rfree and Rwork values for the Mis18a Yippee homo dimer structure and labelled Mis18a/b Yippee hetero-dimer as the predicted model in Fig. 1c and 1d.

      Page 8 "This observation is consistent with the theoretically calculated pI of the Mis18alpha helix" This is a circular argument, of course this region has a low pI due to the amino acid composition. Please remove this statement.

      We have now removed this statement as suggested.

      Page 8 "...reveals tight hydrophobic interactions" these are presumably shown in Figure 1d rather than in the referenced 1e.

      We apologise for the oversight. We have now referred to the correct figure (Fig. 1f in the revised Fig. 1).

      Page 8 - The authors should briefly somewhere discuss why there is a difference between their results and those in Pan et al 2009. As I understand it, the Pan et al paper was based in part on modelling with CLMS data as restraints.

      We thank the reviewer for this suggestion. According to Pan et al., 2009, the model shown by them was generated using CCBuilder, and their CLMS data could not differentiate the two models with the 2nd Mis18a C-terminal helix in either parallel or anti-parallel orientation. We now briefly discuss this on page 8 and line 22 as follows: "Although the Pan et al., 2019 model presented the 2nd Mis18a in a parallel orientation, they did not rule out the possibility of this assembling in an anti-parallel orientation within the Mis18a/b C-terminal helical assembly (Pan et al., 2019)."

      Figure 1 - The labelling of the residues for Mis18-alpha in Figure 1d is problematic, they are black on dark purple (might be my printer/screen/eyes) suggest amending.

      We have now rearranged the label positions to overcome this issue. For clarity, the labels that could not be moved appropriately are shown in white.

      Figure S3a - Do the authors have some data to show the mass of the cross-linked complex that was loaded onto grids is consistent with what is expected?

      Unfortunately, the amount of material that we recover after performing GraFix is not sufficient enough to determine the molecular weight of the crosslinked sample by techniques such as SEC-MALS. However, GraFix fractions were analysed by SDS PAGE, and fractions that ran around the expected molecular weight were selected for EM analysis. We have now included the corresponding SDS-PAGE showing the migration of the crosslinked sample analysed by EM (Supplementary Fig. S3a).

      Figure S3b - scale bar

      Revised Fig. 2d now includes the scale bar shown.

      Figure S3c - Could the authors show or explain the differences between these different 3D reconstructions?

      The models mainly differ in the relative orientations of the bulkier structural features that are referred to as 'ear' and 'mouth' pieces of a telephone handset. This has been mentioned in the text, but we note that the figure is not referenced right next to this statement. We have now amended this (Page 9 line 19), and to make it clear, we have also highlighted the difference using an arrowhead in Fig. 2e and S3b. The different orientations are also stated in the corresponding figure legends.

      Page 9 - The use of "AFM" for AlphaFoldMultimer" is a little confusing since AFM is the established acronym for Atomic Force Microscopy. Perhaps AF2M?

      We have now replaced AFM with AF2M on page 9 to avoid confusion.

      Figure S4a - Control missing for Mis18-alpha wild-type

      Apology for the confusion, this control is present in Fig. 4a. We have now stated this in the figure legend of S4a for clarity.

      Figure S4 d and e - The contrast between the bands and the background is very bad (at least in my copy).

      We have now adjusted the contrast of the blots in Fig. S4d and S4e response to this comment.

      Page 13 "Our structural analysis suggests that two Mis18BP1 fragments.....". How did you arrive at this conclusion? Is this based on the AlphaFold/RaptorX model? What additional evidence do you have that the positioning of the Mis18BP1 is correct? Does the CLMS data support this?

      We confirm that this statement is based on AlphaFold model. We have now explicitly highlighted this on page 14, line 5. As noted in the same paragraph (page 14, line 19), this model agrees with the contacts suggested by the cross-linking mass spectrometry data presented here.

      Figure 4a - Would the authors like to consider using a different colour for Mis18BP1? The contrast is not great, especially in the electrostatic surface inset.

      In response to this suggestion, the Mis18BP1 helix is now shown in grey in the inset of Fig. 5a.

      Reviewer #1 (Significance (Required)):

      General Assessment The paper is extremely clearly written. Likewise the figures are beautifully presented and the data extremely clean and fully supportive of the authors conclusions. Indeed it is seldom that one sees the depth of the structural approaches (X-ray, CLMS, EM, SAXS) in one paper which is a huge strength of the manuscript. In addition the translation of this data into very clean cell biological experiments, makes the paper truly outstanding.

      Advance The authors provide the first model of the Mis18 complex, with extensive evidence to back up this model. The authors provide additional evidence as to how the deposition/renewal of CENP-A might be mediated by the Mis18 complex. The advance comes from both the level of clarity, detail, and scope achieved in this paper.

      Audience This will likely be of great interest to anyone with an interest in chromosome biology, plus be of interest to structural biologists as an outstanding example of hybrid structural biology.

      Expertise I am a biochemist with a background in structural biology with some familiarity with centromere biology

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: The manuscript "structural basis for Mis18 complex assembly: implications for centromere maintenance" by Thamkachy and colleagues describes a study that uses structural analysis to test essential candidate residues in Mis18 complex components in CENP-A loading. For chromosomes to faithfully segregate during cell division, CENP-A levels must be maintained at the centromere. How CENP-A levels are maintained is therefore important to understand at the mechanistic level. The Mis18 complex has been found to be important, but how exactly the various Mis18 complex components interact and how they regulate new CENP-A loading remains not fully understood. This study set out to characterize the critical residues using X-ray crystallography, negative staining EM, SEC analysis, molecular modeling (Raptorx, AlphaFold2, and AlphaFold-multimer) to identify the residues of Mis18a and Mis18b that are critical for the formation of the Mis18a/b hetero-hexamer and which residues are important for Mis18a and Mis18BP1 interactions. A complex beta-sheet interface dictates the Mis18a and Mis18b interactions. Mutating the Mis18a residues that are important for the Mis18a/b interactions resulted in impaired pull-down of Mis18b and reduced centromeric levels of mutated Mis18a. The functional consequences of mutating residues that impair Mis18a/b interactions is that with reduced centomeric levels of Mis18a, also impaired new CENP-A loading. Interestingly, mutated Mis18b did not impact centromeric Mis18a levels and only modestly impaired new CENP-A loading. These data were interpreted that Mis18a is critical for new CENP-A loading, whereas Mis18b might be involved in finetuning how much new CENP-A is loaded. Overall, it is a very well described and well written study with exciting data.

      Major comments:

      • Overall, the structural data and the IF data support the importance of Mis18a residues 103-105 are critical for centromeric localization and new CENP-A loading, whereas Mis18b residues L199 and I203 are critical for centromeric localization, but only very modestly impair centromeric Mis18a localization and new CENP-A loading. In the discussion the authors argue that the N-terminal helical region of Mis18a mediate HJURP binding. This latter is postulated based on published work, but not tested in this work. This should be clarified as such.

      We thank the reviewer for this comment. Our very recent study aimed at understanding the licencing role of Plk1, independent of the work reported here, serendipitously has now validated this suggestion and demonstrates that a Plk1-mediated phosphorylation cascade activates the Mis18a/b complex via a conformational switch of the N-terminal helical region of Mis18a, which facilitates a robust HJURP-Mis18a/b interaction (Parashara et al. bioRxiv 2024). An independent study from the Musacchio lab (Conti et al. bioRxiv, 2024) also reports similar findings, mutually strengthening our independent conclusions. Overall, these studies highlight the importance of the critical structural insights into the Mis18 complex this study reports. We now explicitly discuss the validation of our original hypothesis by citing our recent work along with that of the Musacchio lab. The corresponding section of the last paragraph now reads as follows (page 17 line 10): "Previously published work identified amino acid sequence similarity between the N-terminal region of Mis18a and R1 and R2 repeats of the HJURP that mediates Mis18a/b interaction (Pan et al., 2019). Deletion of the Mis18a N-terminal region enhanced HJURP interaction with the Mis18 complex (Pan et al., 2019). Here, we show that the N-terminal helical region of Mis18a makes extensive contact with the C-terminal helices of Mis18a and Mis18b, which had previously been shown to mediate HJURP binding by Pan et al., 2019. Collectively these observations suggest that the N-terminal region of Mis18a might directly interfere with HJURP - Mis18 complex interaction. Two independent recent studies (Parashara et al., 2024, Conti et al., 2024) reveal that this is indeed the case and a Plk1-mediated phosphorylation cascade involving several phosphorylation and binding events of the Mis18 complex subunits relieve the intramolecular interactions between the Mis18a N-terminal helical region and the HJURP binding surface of the Mis18a/b C-terminal helical bundle. This facilitates robust HJURP-Mis18a/b interaction in vitroand efficient HJURP centromere recruitment and CENP-A loading in cells. Overall, these studies also highlight the importance of the critical structural insights into the Mis18 complex we report here."

      • Overall, the authors clearly describe their data and methodology and use adequate statistical analyses. The structural data of the Mis18a/b complex being a hetero-hexamer is convincing, but the validation in vivo is missing. As structural experiment are not performed under physiological conditions, it is important to establish the stoichiometry in vivo to further support the totality of the findings of the structural experiments and modeling. The data for the hierarchical assembly of Mis18a and Mis18b at the centromere and its importance in new CENP-A loading is convincing. An additional open question is whether "old" centromeric CENP-A or HJURP:new CENP-A complex is needed to recruit Mis18a to the centromere and whether the identified residues have a role in Mis18a centromeric localization. These data would provide a solid link between the Mis18 complex and how it is directly linked to new CENP-A loading.

      We agree that establishing the stoichiometry of Mis18 subunits of the Mis18 complex in vivo would be insightful. However, considering that the Mis18 complex assembles in a specific window of the cell cycle (late Mitosis and early G1), we think characterising the stoichiometry in cells is extremely difficult and technically challenging. However, consistent with our structural model, several lines of independent evidence (Pan et al., 2017 and Spiller et al., 2017) using different biophysical methods (Analytical Ultra Centrifugation (Pan et al., 2017), SEC-MALS (Spiller et al., 2017)) showed that recombinantly purified Mis18 complex (irrespective of the expression host, from both E. Coli or insect cells) is a hetero-octamer made of a hetero-hexameric Mis18a/b (4 Mis18a and 2 Mis18 b) complex bound to two copies of Mis18BP1. These observations suggested that hetero-hexamerisation of the Mis18a/b complex may be needed to bind and dimerise Mis18BP1 in cells. Previously published cellular studies support the in vivo requirement of the hetero-octameric Mis18 assembly as: (i) Perturbing the hetero-hexamerisation of the Mis18a/b complex (by introducing mutations at the Mis18a/b Yippee dimerisation interface, which while did not disrupt Mis18a/b complex formation, perturbed its hetero-hexamerisation and resulted in a hetero-trimeric Mis18a/b complex made of 2 Mis18aand 1 Mis18b) abolished Mis18BP1 binding in vitro and in cells, consequently abolished CENP-A deposition (Spiller et al., 2017) and (ii) artificial dimerisation of Mis18BP1, by expressing Mis18BP1 as a GST-tagged protein, enhanced the centromere localisation of Mis18BP1 highlighting the requirement of Mis18a/b hexameric assembly mediated dimerization of Mis18BP1 in cells (Pan et al., 2017). While these studies highlighted the importance of maintaining the right stoichiometry (hetero-octamer of 4 Mis18a, 2 Mis18b and 2 Mis18BP1), lack of structural information on how this essential biological assembly is established remained a major knowledge gap. Our work presented here fills this critical knowledge gap by showing that a segment of Mis18BP1 (aa 20-51) also binds at the Yippee dimerisation interface. To highlight this, we have included the following statements in the introduction on page 5 and 20 "Perturbing the Yippee domain-mediated hexameric assembly of Mis18a/b (that resulted in a Mis18a/b hetero-trimer, 2 Mis18a and 1 Mis18b) abolished its ability to bind Mis18BP1 in vitro and in cells (Spiller et al., 2017), emphasising the requirement of maintaining correct stoichiometry of Mis18a/b subunits. Consistent with this, artificial dimerisation of Mis18BP1, by expressing Mis18BP1 as a GST-tagged protein, enhanced the centromere localisation of Mis18BP1 (Pan et al., 2017)." and in the Results section on page 14 line 12: "Mis18BP120-51 contains two short b strands that interact at Mis18a/b Yippee interface extending the six-stranded-b sheets of both Mis18a and Mis18b Yippee domains. This provides the structural rationale for why Yippee domains-mediated Mis18a/b hetero-hexamerisation is crucial for Mis18BP1 binding (Spiller et al., 2017)."

      Regarding the question "whether 'old' centromeric CENP-A or HJURP:new CENP-A complex is needed to recruit Mis18a centromere localisation and whether identified residues have a role in Mis18a centromere localisation": According to the published literature, the Mis18 complex associates with centromeres through interaction with CCAN components CENP-C and CENP-I (Shono et al., 2015, Dambacher et al., 2012, Moree et al., 2011, Hoffmann et al., 2020). Considering CCAN assembles on CENP-A nucleosomes, and HJURP:new CENP-A centromere recruitment depends on the Mis18 complex, it will be reasonable to argue that the 'old' centromeric CENP-A contributes to the centromere localisation of the Mis18 complex. Amongst the components of the Mis18 complex, Mis18BP1 and Mis18bhave previously been suggested to interact with CENP-C. Within the Mis18 complex, we (Spiller et al., 2017) and others (Pan et al., 2017) have shown that Mis18a can directly interact with Mis18BP1, but it does so more efficiently when Mis18a hetero-oligomerises with Mis18b via their Yippee domains. Here, our structural analysis mapped the interaction interfaces and showed that Mis18a residues E103, D104 and T105 contribute to Mis18BP1 binding, as mutating these residues abolishes centromere localisation of Mis18a (Fig. 5c and 5d). To accentuate our findings, we have now included the following paragraph in the discussion section (page 17 line 26): "One of the key outstanding questions in the field is how does the Mis18 complex associate with the centromere. Previous studies identified CCAN subunits CENP-C and CENP-I as major players mediating the centromere localisation of the Mis18 complex mainly via Mis18BP1 (Shono et al., 2015, Dambacher et al., 2012, Moree et al., 2011), although Mis18b subunit has also been suggested to interact with CENP-C (Stellfox et al., 2016). Within the Mis18 complex, we and others have shown that the Mis18a/b Yippee hetero-dimers can directly interact with Mis18BP1. Here our structural analysis allowed us to map the interaction interface mediating Mis18a/b-Mis18BP1 binding. Perturbing this interface on Mis18a completely abolished Mis18a centromere localisation and reduced Mis18BP1 centromere levels. These observations show that Mis18a associates with the centromere mainly via Mis18BP1, and assembly of the Mis18 complex itself is crucial for its efficient centromere association, as previously suggested. Future work aimed at characterising the intermolecular contact points between the subunits of the Mis18 complex, centromeric chromatin and CCAN components and understanding if the Mis18 complex undergoes any conformational and/or compositional variations upon centromere association and/or during CENP-A deposition process, will be crucial to delineate the mechanisms underpinning the centromere maintenance."

      Minor comments:

      • The bar graphs shown ideally also show the individual data points for the authros to appreciate the spread of the data. These figures can be replicated in the Supplemental to avoid making the main figures look too busy.

      We thank the reviewers for this suggestion. Reviewer #3 made a similar comment and suggested we use Superplot, which allows visualisation of individual data points of independent experiments. We have now revised all bar graphs using Superplot to address both reviewers' suggestions.

      Reviewer #2 (Significance (Required)):

      • This study uses a broad range of structural techniques, including molecular modeling which were subsequently validated by in vitro pull-down assays, co-IP, and IF. This combination of these techniques is important because many structural techniques cannot be performed under physiological conditions. Validating the main findings of the structural results by IF and co-IP is therefore critical.
      • This work greatly advances our structural understanding how Mis18a, Mis18b, and Mis18BP1 form the Mis18 complex and how the critical residues in especially Mis18a help the Mis18 complex localize to the centromere and influence new CENP-A loading. This study also provides the first strong evidence in hierarchical assembly of the Mis18 complex.
      • How centromere identity is maintained is a critical question in chromosome biology and genome integrity. The Mis18 complex has been identified as an important complex in the process. Several structural and mutational studies (all adequately cited in this manuscript) have tried to address which residues guide the assembly and functional regions of the Mis18 complex. This work builds and expands our understanding how especially Mis18a holds a pivotal role in both Mis18 complex formation and its impact on maintaining centromeric CENP-A levels.
      • This work will be of interest to the chromosome field in general and anyone studying the mechanism of cell division.
      • Chromatin, centromere, CENP-A, cell division. This reviewer has limited expertise in structural biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Centromere identity is defined by CENP-A loading to specific sites on genomic DNA. CENP-A loading is known to rely on the Mis18 complex, and several regulators are known; yet how the Mis18 complex achieves this complex process has remained puzzle. By elucidating the structural basis of Mis18 complex assembly using integrative structural approaches the authors show that multiple homo and heterodimeric interfaces of Mis18alpha, beta and Mis18BP1 are involved in centromere maintenance. The authors show that Mis18alpha can associate with centromeres and deposit CENP-A independent of Mis18 β. Mis18α functions in CENP-A deposition at centromeres independent of Mis18β. Mis18β is required for maintaining a specific level of CENP-A occupancy at centromeres. Thus, using structure-guided and separation-of-function mutants the study reveals how Mis18 complex ensures centromere maintenance. Major comments: This is an excellent study on centromere inheritance, combining structural and cell biology techniques. The comments here primarily refer to Cell biology aspect of the work.

      Figures show that new CENP-A deposits in Mis18βL199D/I203D mutants, but the level was reduced moderately. Based on this observation, the authors make a strong conclusion that Mis18β licenses the optimal levels of CENP-A at centromeres. Mis18α may be essential for both CENP-A incorporation and depositing a specific amount of CENP-A, as Mis18α and CENP-A levels are both reduced in Mis18βL199D/I203D mutants which failed to form the triple helical assembly with Mis18α as shown in Figure 3B and 3C. The authors may want to qualify some of these claims as preliminary or speculative.

      We thank the reviewer for this suggestion. We agree that although the reduction in CENP-A levels upon replacing WT Mis18b with Mis18b L199D/I203D is more prominent than the reduction in centromere localised Mis18a, one cannot completely rule out the contribution of reduced Mis18a on CENP-A loading. This also raises an interesting possibility where Mis18b ensures the correct amount of CENP-A deposition by facilitating the optimal level of Mis18a at centromeres. We now explicitly discuss this in the discussion as follows (page 16 line 26): "Whilst proteins involved in CENP-A loading have been well established, the mechanism by which the correct levels of CENP-A are controlled is yet to be thoroughly explored and characterised. The data presented here suggest that Mis18b mainly contributes to the quantitative control of centromere maintenance - by ensuring the right amounts of CENP-A deposition at centromeres - and maybe one of several proteins that control CENP-A levels. We also note that the Mis18b mutant, which cannot interact with Mis18a, moderately reduced Mis18a levels at centromeres, and hence, it is possible that Mis18b ensures the correct level of CENP-A deposition by facilitating optimal Mis18a centromere recruitment. Future studies will focus on dissecting the mechanisms underlying the Mis18b-mediated control of CENP-A loading amounts along with any other mechanisms involved."

      This work and others show that phosphorylation of Mis18BP1 by CDK1 can interfere with complex function (Spiller et al., 2017, Pan et al., 2017). Does the structure provide any insight into PLK1-mediated phosphorylation surfaces for activation of the complex? If yes, a brief discussion would help to link CDK1 and PLK1 mediated opposing actions will strengthen the work.

      As described in our response to the first major comment of Reviewer 2, our very recent study aimed at understanding the licencing role of Plk1, independent of the work reported here, identified and evaluated the functional contribution of Plk1 phosphorylation on the subunits of the Mis18 complex (Parashara et al., bioRxiv 2024). Serendipitously, this recent work has now validated our hypothesis proposed based on the structural characterisation reported here and demonstrates that a Plk1-mediated phosphorylation cascade activates the Mis18a/b complex via a conformational switch of the N-terminal helical region of Mis18a which facilitates a robust HJURP-Mis18a/b interaction (Parashara et al. bioRxiv 2024). An independent study from the Musacchio lab (Conti et al., bioRxiv 2024) also reports similar findings, mutually strengthening our independent conclusions. Overall, these studies highlight the importance of the critical structural insights into the Mis18 complex this study reports. We now explicitly discuss the validation of our original hypothesis by citing our recent work along with that of the Musacchio lab. The corresponding section of the last paragraph now reads as follows (page 17 line 10): "Previously published work identified amino acid sequence similarity between the N-terminal region of Mis18a and R1 and R2 repeats of the HJURP that mediates Mis18a/binteraction (Pan et al., 2019). Deletion of the Mis18a N-terminal region enhanced HJURP interaction with the Mis18 complex (Pan et al., 2019). Here, we show that the N-terminal helical region of Mis18a makes extensive contact with the C-terminal helices of Mis18a and Mis18b, which had previously been shown to mediate HJURP binding by Pan et al., 2019. Collectively these observations suggest that the N-terminal region of Mis18a might directly interfere with HJURP - Mis18 complex interaction. Two independent recent studies (Parashara et al., 2024, Conti et al., 2024) reveal that this is indeed the case and a Plk1-mediated phosphorylation cascade involving several phosphorylation and binding events of the Mis18 complex subunits relieve the intramolecular interactions between the Mis18a N-terminal helical region and the HJURP binding surface of the Mis18a/b C-terminal helical bundle. This facilitates robust HJURP-Mis18a/b interaction in vitro and efficient HJURP centromere recruitment and CENP-A loading in cells. Overall, these studies also highlight the importance of the critical structural insights into the Mis18 complex we report here."

      I am happy with the way cell biology data and the methods are presented so that they can be reproduced. The experiments are adequately replicated and the statistical analysis adequate. It will help to include sample size of cells or centromeres used for building the graphs.

      We have now included this information in figure legends of Fig. 3a, 3c, 4b, 4c, 5b, 5c and 5d.

      This is a strong interdisciplinary study using a variety of in vitro and in vivo techniques. Can the authors discuss if they expect chromatin associated Mis18 complex to host a similar structure as the soluble one? In other words, are they able to comment on any key differences between chromatin and non-chromatin associated Mis18 complexes.

      We thank the reviewer for the suggestion. We agree that one cannot rule out the possibility of the Mis18 complex undergoing compositional and/or conformational variations during the processes of CENP-A loading at centromeres. We now explicitly discuss this possibility in the last paragraph of the discussion section (page 18 line 10): "Future work aimed at characterising the intermolecular contact points between the subunits of the Mis18 complex, centromeric chromatin and CCAN components and understanding if the Mis18 complex undergoes any conformational and/or compositional variations upon centromere association and/or during CENP-A deposition process, will be crucial to delineate the mechanisms underpinning the centromere maintenance."

      Minor comments: -

      In cell biology experiments, fluorescence intensities could be presented as a superplot for added value across cells and repeats (instead of bar graphs). More on superplot:https://doi.org/10.1083/jcb.202001064.

      We thank the reviewers for this kind suggestion. We have now included graphs made using 'superplot' as suggested.

      In general, ACA levels do not appear to change significantly between WT and mutant expressing cells although new CENP-A loading is significantly absent in the presence of a few mutants - please comment if ACA used here can recognise CENP-A. Would this mean that old CENP-A remains normally?

      We thank the reviewer for this comment. While new CENP-A incorporated at centromeres is selectively labelled using the SNAP-tag, the ACA antibody used in these experiments can recognise CENP-A, CENP-B and CENP-C, with CENP-B being the primary target (Kallenberg, Clinical Rheumatology,1990). We would also like to note that ACA has commonly been used to locate the centromere in CENP-A loading assays where new CENP-A levels are assessed via selective labelling (e.g. McKinley 2014).

      It is unclear whether any of the mutant acted in a dominant negative fashion in the presence of endogenous Mis18 proteins. It would have been useful to test this particularly in the context of mis18alpha mutants that seem to fully abolish new CENP-A recruitment.

      As Mis18 subunits oligomerise (homo and hetero), we thought expressing these mutants in the presence of endogenous proteins might interfere with endogenous protein in a heterogenous manner and might make the interpretation difficult. Hence, we did not test this. Instead, as described in the manuscript we have tested these mutants in siRNA rescue experiments (Fig. 3, 4 and 5).

      In figure 3a, GFP panel (input lane, 1) is shown to mark a band corresponding to GFP. Is this expected? Please comment.

      Yes, as a control, an empty vector was transfected to express just GFP along with Mis18a-mCherry. These were used to show that there was no unspecific interaction between the beads used for IP or Mis18a-mCherry and GFP tag, and that any interaction seen was due to Mis18b. A similar control was used in S4b, where mCherry was expressed along with Mis18b-GFP. We have now clarified this in the corresponding legends of Fig. 4a and S4b.

      Would be useful to have the scale for the cropped images presented as insets. Figure 4B should read YFP and not YPF.

      We apologise for this typographical error. We have now corrected this.

      The authors may want to explain whether the tag differences matter for their study (Case in point: His-SUMO-Mis18a191-233 WT and mutant His-MBP-Mis18b188-229 proteins).

      The MBP tag was chosen to perform amylose pull-down assays, whereas the SUMO tag was chosen to increase the protein size. This is crucial as the C-terminal fragments of Mis18a and Mis18b are less than 50 amino acids long and are not easy to visualise by the band intensity in the Coomassie-stained SDS PAGE gels.

      Reviewer #3 (Significance (Required)):

      This work elucidates the structural basis of Mis18 complex assembly and the intermolecular interfaces essential for Mis18 functions. This is a significant advance in the field as it helps researchers in the field better understand CENP-A deposition and mechanism underpinning the maintenance of centromere identity. This is a broad area of research benefitting those studying cell division, genome stability, centromere identity and epigenetics might all be interested in and influenced by these findings. Novelty and strength lies in combining structural and cell biology work. Strengths of the work are structural details of the Mis18 complex. Minor weakness is the link between Mis18 structure and Centromere inheritance is limited to one immunostaining assay (I have mentioned this as a minor comment because addressing this may not be within the scope of this manuscript and is likely to require a repeat of a vast majority of the work with additional reagents which may not directly add value to the current manuscript).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Centromere identity is defined by CENP-A loading to specific sites on genomic DNA. CENP-A loading is known to rely on the Mis18 complex, and several regulators are known; yet how the Mis18 complex achieves this complex process has remained puzzle. By elucidating the structural basis of Mis18 complex assembly using integrative structural approaches the authors show that multiple homo and heterodimeric interfaces of Mis18alpha, beta and Mis18BP1 are involved in centromere maintenance. The authors show that Mis18alpha can associate with centromeres and deposit CENP-A independent of Mis18 β. Mis18α functions in CENP-A deposition at centromeres independent of Mis18β. Mis18β is required for maintaining a specific level of CENP-A occupancy at centromeres. Thus, using structure-guided and separation-of-function mutants the study reveals how Mis18 complex ensures centromere maintenance.

      Major comments:

      This is an excellent study on centromere inheritance, combining structural and cell biology techniques. The comments here primarily refer to Cell biology aspect of the work.

      1. Figures show that new CENP-A deposits in Mis18βL199D/I203D mutants, but the level was reduced moderately. Based on this observation, the authors make a strong conclusion that Mis18β licenses the optimal levels of CENP-A at centromeres. Mis18α may be essential for both CENP-A incorporation and depositing a specific amount of CENP-A, as Mis18α and CENP-A levels are both reduced in Mis18βL199D/I203D mutants which failed to form the triple helical assembly with Mis18α as shown in Figure 3B and 3C. The authors may want to qualify some of these claims as preliminary or speculative.
      2. This work and others show that phosphorylation of Mis18BP1 by CDK1 can interfere with complex function (Spiller et al., 2017, Pan et al., 2017). Does the structure provide any insight into PLK1-mediated phosphorylation surfaces for activation of the complex? If yes, a brief discussion would help to link CDK1 and PLK1 mediated opposing actions will strengthen the work.
      3. I am happy with the way cell biology data and the methods are presented so that they can be reproduced. The experiments are adequately replicated and the statistical analysis adequate. It will help to include sample size of cells or centromeres used for building the graphs.
      4. This is a strong interdisciplinary study using a variety of in vitro and in vivo techniques. Can the authors discuss if they expect chromatin associated Mis18 complex to host a similar structure as the soluble one? In other words, are they able to comment on any key differences between chromatin and non-chromatin associated Mis18 complexes.

      Minor comments:

      In cell biology experiments, fluorescence intensities could be presented as a superplot for added value across cells and repeats (instead of bar graphs). More on superplot: https://doi.org/10.1083/jcb.202001064. In general, ACA levels do not appear to change significantly between WT and mutant expressing cells although new CENP-A loading is significantly absent in the presence of a few mutants - please comment if ACA used here can recognise CENP-A. Would this mean that old CENP-A remains normally?

      It is unclear whether any of the mutant acted in a dominant negative fashion in the presence of endogenous Mis18 proteins. It would have been useful to test this particularly in the context of mis18alpha mutants that seem to fully abolish new CENP-A recruitment.

      In figure 3a, GFP panel (input lane, 1) is shown to mark a band corresponding to GFP. Is this expected? Please comment. Would be useful to have the scale for the cropped images presented as insets.

      Figure 4B should read YFP and not YPF.

      The authors may want to explain whether the tag differences matter for their study (Case in point: His-SUMO-Mis18a191-233 WT and mutant His-MBP-Mis18b188-229 proteins).

      Significance

      This work elucidates the structural basis of Mis18 complex assembly and the intermolecular interfaces essential for Mis18 functions. This is a significant advance in the field as it helps researchers in the field better understand CENP-A deposition and mechanism underpinning the maintenance of centromere identity. This is a broad area of research benefitting those studying cell division, genome stability, centromere identity and epigenetics might all be interested in and influenced by these findings. Novelty and strength lies in combining structural and cell biology work.

      Strengths of the work are structural details of the Mis18 complex. Minor weakness is the link between Mis18 structure and Centromere inheritance is limited to one immunostaining assay (I have mentioned this as a minor comment because addressing this may not be within the scope of this manuscript and is likely to require a repeat of a vast majority of the work with additional reagents which may not directly add value to the current manuscript).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript "structural basis for Mis18 complex assembly: implications for centromere maintenance" by Thamkachy and colleagues describes a study that uses structural analysis to test essential candidate residues in Mis18 complex components in CENP-A loading. For chromosomes to faithfully segregate during cell division, CENP-A levels must be maintained at the centromere. How CENP-A levels are maintained is therefore important to understand at the mechanistic level. The Mis18 complex has been found to be important, but how exactly the various Mis18 complex components interact and how they regulate new CENP-A loading remains not fully understood. This study set out to characterize the critical residues using X-ray crystallography, negative staining EM, SEC analysis, molecular modeling (Raptorx, AlphaFold2, and AlphaFold-multimer) to identify the residues of Mis18a and Mis18b that are critical for the formation of the Mis18a/b hetero-hexamer and which residues are important for Mis18a and Mis18BP1 interactions. A complex beta-sheet interface dictates the Mis18a and Mis18b interactions. Mutating the Mis18a residues that are important for the Mis18a/b interactions resulted in impaired pull-down of Mis18b and reduced centromeric levels of mutated Mis18a. The functional consequences of mutating residues that impair Mis18a/b interactions is that with reduced centomeric levels of Mis18a, also impaired new CENP-A loading. Interestingly, mutated Mis18b did not impact centromeric Mis18a levels and only modestly impaired new CENP-A loading. These data were interpreted that Mis18a is critical for new CENP-A loading, whereas Mis18b might be involved in finetuning how much new CENP-A is loaded. Overall, it is a very well described and well written study with exciting data.

      Major comments:

      • Overall, the structural data and the IF data support the importance of Mis18a residues 103-105 are critical for centromeric localization and new CENP-A loading, whereas Mis18b residues L199 and I203 are critical for centromeric localization, but only very modestly impair centromeric Mis18a localization and new CENP-A loading. In the discussion the authors argue that the N-terminal helical region of Mis18a mediate HJURP binding. This latter is postulated based on published work, but not tested in this work. This should be clarified as such.
      • Overall, the authors clearly describe their data and methodology and use adequate statistical analyses. The structural data of the Mis18a/b complex being a hetero-hexamer is convincing, but the validation in vivo is missing. As structural experiment are not performed under physiological conditions, it is important to establish the stoichiometry in vivo to further support the totality of the findings of the structural experiments and modeling. The data for the hierarchical assembly of Mis18a and Mis18b at the centromere and its importance in new CENP-A loading is convincing. An additional open question is whether "old" centromeric CENP-A or HJURP:new CENP-A complex is needed to recruit Mis18a to the centromere and whether the identified residues have a role in Mis18a centromeric localization. These data would provide a solid link between the Mis18 complex and how it is directly linked to new CENP-A loading.

      Minor comments:

      • The bar graphs shown ideally also show the individual data points for the authors to appreciate the spread of the data. These figures can be replicated in the Supplemental to avoid making the main figures look too busy.

      Significance

      • This study uses a broad range of structural techniques, including molecular modeling which were subsequently validated by in vitro pull-down assays, co-IP, and IF. This combination of these techniques is important because many structural techniques cannot be performed under physiological conditions. Validating the main findings of the structural results by IF and co-IP is therefore critical.
      • This work greatly advances our structural understanding how Mis18a, Mis18b, and Mis18BP1 form the Mis18 complex and how the critical residues in especially Mis18a help the Mis18 complex localize to the centromere and influence new CENP-A loading. This study also provides the first strong evidence in hierarchical assembly of the Mis18 complex.
      • How centromere identity is maintained is a critical question in chromosome biology and genome integrity. The Mis18 complex has been identified as an important complex in the process. Several structural and mutational studies (all adequately cited in this manuscript) have tried to address which residues guide the assembly and functional regions of the Mis18 complex. This work builds and expands our understanding how especially Mis18a holds a pivotal role in both Mis18 complex formation and its impact on maintaining centromeric CENP-A levels.
      • This work will be of interest to the chromosome field in general and anyone studying the mechanism of cell division.
      • Chromatin, centromere, CENP-A, cell division. This reviewer has limited expertise in structural biology.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Maintenance of the histone H3 variant CENP-A at centromeres is necessary for proper kinetochore assembly and correct chromosome segregation. The Mis18 complex recruits the CENP-A chaperone HJURP to centromeres to facilitate CENP-A replenishment. Here the authors characterise the Mis18 complex using hybrid structural biology, and determine complex interface separation-of-function mutants.

      Major Comments

      The SAXS and EM data on the full-length Mis18 components must be included in the main Figures, either as an additional figure or by merging/rearranging the existing figures. The authors discuss these results in three whole paragraphs, which are a very important part of the paper.

      Could the authors also compare the theoretical SAXS scattering curves generated by their final model(s) with the experimental SAXS curves? This would provide some additional evidence for the overall shape of their complex model beyond the consistency with the Dmax/Rg.

      Minor Comments

      While the introduction is clearly written, an additional cartoon schematic, representing the system/question would be helpful to a non-specialist reader to interpret the context of the study.

      No doubt the authors had a reason for choosing their figure allocation, but I wonder if more material couldn't be brought from the supplementaries into the main figures?

      Page 6 "Mis18-alpha possesses an additional alpha-helical domain" - please make it clear in addition to what (I assume it's in addition to Mis18-beta).

      Page 7 - Report the RMSD of the Pombe vs. Human Mis18-alpha yipee structures?

      Page 7 - "We generated high-confidence structural models...." is there a metric for the confidence as reported by RaptorX? Perhaps includinging the PAE plots in the supplementary for the AlphaFold generated models would be useful?

      Figure 1 - Perhaps label figure 1b as being experimentally determined, with the R values (as for Figure 1d), and 1c being a predicted model.

      Page 8 "This observation is consistent with the theoretically calculated pI of the Mis18alpha helix" This is a circular argument, of course this region has a low pI due to the amino acid composition. Please remove this statement.

      Page 8 "...reveals tight hydrophobic interactions" these are presumably shown in Figure 1d rather than in the referenced 1e.

      Page 8 - The authors should briefly somewhere discuss why there is a difference between their results and those in Pan et al 2009. As I understand it, the Pan et al paper was based in part on modelling with CLMS data as restraints.

      Figure 1 - The labelling of the residues for Mis18-alpha in Figure 1d is problematic, they are black on dark purple (might be my printer/screen/eyes) suggest amending.

      Figure S3a - Do the authors have some data to show the mass of the cross-linked complex that was loaded onto grids is consistent with what is expected?

      Figure S3b - scale bar

      Figure S3c - Could the authors show or explain the differences between these different 3D reconstructions?

      Page 9 - The use of "AFM" for AlphaFoldMultimer" is a little confusing since AFM is the established acronym for Atomic Force Microscopy. Perhaps AF2M?

      Figure S4a - Control missing for Mis18-alpha wild-type

      Figure S4 d and e - The contrast between the bands and the background is very bad (at least in my copy).

      Page 13 "Our structural analysis suggests that two Mis18BP1 fragments.....". How did you arrive at this conclusion? Is this based on the AlphaFold/RaptorX model? What additional evidence do you have that the positioning of the Mis18BP1 is correct? Does the CLMS data support this?

      Figure 4a - Would the authors like to consider using a different colour for Mis18BP1? The contrast is not great, especially in the electrostatic surface inset.

      Significance

      General Assessment

      The paper is extremely clearly written. Likewise the figures are beautifully presented and the data extremely clean and fully supportive of the authors conclusions. Indeed it is seldom that one sees the depth of the structural approaches (X-ray, CLMS, EM, SAXS) in one paper which is a huge strength of the manuscript. In addition the translation of this data into very clean cell biological experiments, makes the paper truly outstanding.

      Advance

      The authors provide the first model of the Mis18 complex, with extensive evidence to back up this model. The authors provide additional evidence as to how the deposition/renewal of CENP-A might be mediated by the Mis18 complex. The advance comes from both the level of clarity, detail, and scope achieved in this paper.

      Audience

      This will likely be of great interest to anyone with an interest in chromosome biology, plus be of interest to structural biologists as an outstanding example of hybrid structural biology.

      Expertise

      I am a biochemist with a background in structural biology with some familiarity with centromere biology

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reply to the Reviewers

      We would like to thank the reviewers for their insightful comments, which will significantly guide us in enhancing our manuscript. We are capable of addressing most of the concerns raised by the reviewers. However, we encounter limitations in addressing Reviewer 2's comment regarding the delineation between cell autonomy and non-autonomy. As highlighted by the reviewer, ideally, we would dissect the mechanisms underlying cell turnover within the M/+ wing pouch, particularly in terms of cell autonomy and non-autonomy. Unfortunately, clonal analysis is not feasible due to the 'salt and pepper' distribution pattern of cell death and subsequent proliferation within the M/+ wing pouch. Despite these challenges, we commit to making efforts to address these issues to the best of our ability. Furthermore, we will aim to present our findings with greater precision and without speculation, an approach we believe will significantly enhance the quality of our manuscript.

      In this manuscript, we extend the insights gained from our previous study (Akai et al., PLOS Genetics, 2021) by uncovering a novel mechanism within the context of the M/+ mutant. We demonstrate that JNK-dependent exocytosis in dying cells is crucial for cell turnover in the M/+ wing pouch. This phenomenon of cell turnover, initiated by cell death and the following proliferation, is fundamental to a variety of processes in multicellular organisms, such as normal development, tissue homeostasis, wound healing, tumor development, and potentially tumor recurrence. While our analysis is specific to the M/+ mutant, the underlying mechanisms of cell-cell communication between dying and proliferating cells, which are still not fully understood, may have broader implications. Our findings offer significant contributions to the field of cell competition and suggest a framework for understanding the operating principles of multicellular communities through cell-cell communications. Although it is yet to be determined how these insights apply in various contexts, they possess the potential to guide future research.

      Referee #1

      It would be important to check how much JNK is sufficient to trigger the exocytosis upregulation. Is the accumulation of Cyt1 and CD63 vesicles in apoptotic cell common to any JNK dependent death or does it require the Minute background? Could the authors check whether clones expressing HepCA transiently in WT background also accumulate the same vesicles?

      Response:

      Following the reviewer's suggestion, we plan to examine whether the accumulation of Syt1- and CD63-positive vesicles in apoptotic cells is a general characteristic of JNK-dependent cell death or requires the Minute background. Specifically, we intend to check whether mitotic clones expressing HepCA in the wild-type eye-antennal disc to see if these vesicles also accumulate. Additionally, we will assess mitotic clones expressing Eiger in the wild-type eye-antennal disc, which activates JNK signaling. This is due to concerns that HepCA may induce cell death too strongly, potentially resulting in clones too small for effective analysis. Moreover, we also plan to express HepCA transiently only in the posterior region of the wing disc to further explore its effects.

      1. It would be relevant to have the status of JNK in Minute disc upon downregualtion of exocytosis. One could imagine some positive feedback between JNK activation, exocytosis, cell death and further JNK activation.

      Response:

      We thank the reviewer for the comment. As pointed out by the reviewer, there could be a positive feedback loop involving JNK activation, exocytosis, cell death, and further JNK activation. Our observations lend support to this hypothesis, as we specifically noted a significant reduction in the number of JNK-activating cells following targeted rab3 knockdown in these cells using puc-gal4 driver within the RpS3/+ wing disc (Fig R1, below). To further confirm the positive feedback loop between JNK activation and exocytosis, we plan to investigate the effects of downregulating unc-13 on JNK activation within the M/+ wing pouch, employing the JNK reporter TRE-DsRed for this purpose.

      Additionally, we need to elucidate the molecular mechanism linking cell death and exocytosis. We are currently exploring two potential relationships between JNK activation, caspase signaling, and exocytosis, as depicted in Fig R2 below. To assess these possibilities, we aim to investigate if inhibiting apoptosis-either by introducing the H99/+ mutation or by overexpressing DroncDN or mirRHG-can affect JNK-mediated exocytosis in the RpS3/+ wing pouch.

      So far, the evidence for the epistatic link between exocytosis, Wg and cell turnover is mostly based on colocalization and the similarity of the phenotype but I believe this may need some additional evidences. Ideally one would need to be able to enhance exocytosis and test whether Wg downregulation suppress the phenotype, but I am not sure that upregulation of core exocytosis genes will be sufficient to do this. Alternatively, if Wg is indeed downstream of the upregulation of exocytosis, the reduction of cell turnover upon Wg flattening (e.g. : ftz2-/+ background) should not be enhanced by the reduction of exocytosis. Moreover, could the authors test the status of Wg downstreams targets upon inhibition of exocytosis in the Minute background (for instance, do they see a supression of the nmo-LacZ upregulation that they previously characterised in Minute wing disc in 2021) ?

      Response:

      We thank the reviewer for the comment. In line with the reviewer's suggestion, exploring the enhancement of exocytosis would be valuable to elucidate the epistatic relationship between exocytosis, Wg signaling, and cell turnover. Therefore, we plan to attempt the overexpression of exocytosis-related genes to check if we can indeed enhance exocytosis. Additionally, in response to the reviewer's comment, we intend to investigate whether the reduction in cell turnover observed upon Wg attenuation (ftz2-/+background or Wg-/+ background) is not exacerbated by further reducing exocytosis. Furthermore, we intend to investigate the status of the Wg downstream target, specifically using nmo-lacZ, when exocytosis is inhibited in the RpS3/+ wing pouch.

      Since Cyt1 and CD63 seem to mostly accumulate in apoptotic cells, it would be interesting to check their status in Minute wing disc upon apoptosis inhibition (e.g. : with H99 or mirRHG).

      Response:

      We thank the reviewer for the comment. Following the reviewer's suggestion, we intend to investigate whether inhibiting apoptosis, either through the introduction of H99/+ or by overexpressing DroncDN or mirRHG, could suppress the increase in CD63- or Sty1-positive vesicles in the M/+ wing pouch.

      I would remain cautious about some of the statements, notably in the abstract, since some of them are mostly speculative and not really based on any experiments. For instance, the statement "This interaction between dying cells and their neighboring living cells is pivotal in determining cell fate, dictating which cells will undergo apoptosis and which cells will proliferate" is not backed up by any experiment (which would require to show that exocytosis and Wg from the dying cell specifically is required for the survival and proliferation of their neighbours, and/or showing that cell death occurs specifically in cells with local differences in Wg signaling). I would recommend to be more cautious here and us a clear conditional statement.

      • *Response:

      We thank the reviewer for their insightful comment. In response, we aim to present our findings with greater precision and to avoid speculative interpretations. At this stage, we have focused on revising the Abstract to include clear conditional statements, as suggested. Furthermore, we plan to comprehensively update the remaining sections of the manuscript to reflect the additional experiments requested by the reviewers. These updates will ensure a more cautious approach throughout the paper, aligning with the reviewer's recommendations.

      __(page 2, line 34-39 in the "Abstract") __

      "Our data also suggest a potential role for the Wg receptor Frizzled-2 (Fz2) in inducing cell-turnover within the M/+ wing pouch. Overall, our findings provide mechanistic insights into robust tissue growth through the orchestration of cell-turnover, which is primarily governed by JNK-mediated exocytosis in the context of Drosophila Minute/+ wing morphogenesis."

      Other minor point:

      The authors document Wg localisation in Minute wing disc upon expression of P35. It would be interesting to describe what is the status of Wg in Rps3+/- compared to WT without p35 (if I am correct, this was done in their previous article, and in that case it would be relevant to describe these former results in the main text).

      Response:

      We thank the reviewer for the comment. In the RpS3/+ wing pouch without p35, we were unable to detect any upregulation of Wg using the anti-Wg antibody (Fig R3, below). However, we observed an increase in GFP-Wg-positive puncta originating from a knock-in allele (McGough et al., Nature, 2020) within areas of massive cell death in the RpS3/+ wing pouch lacking p35, compared to the wild-type control (Fig 3B, compared to Fig 3A in the transferred manuscript). This increase is similar to the phenotype observed in the RpS3/+ wing pouch expressing p35, where Wg-positive puncta are significantly elevated (Fig R4B below, corresponding to Fig 3B in the original manuscript). Moreover, the increase in GFP-Wg-positive puncta in regions of massive cell death in the RpS3/+ wing pouch becomes more pronounced when utilizing a membrane-tethered anti-GFP nanobody (Vhh4-CD8) (McGough et al., Nature, 2020) (Fig 3G, compared to Fig 3F in the transferred manuscript). These observations indicate that Wg-positive puncta are indeed upregulated in the RpS3/+ wing pouch without p35 compared to the wild-type control.

      In the transferred manuscript, we have now made modifications to Fig. 3 by replacing images of the RpS3/+ wing pouch overexpressing p35, which were stained with the anti-Wg antibody, with new images. These new images depict the RpS3/+ wing pouch that harbors the GFP-Wg knock-in allele, which was stained with both anti-GFP and anti-cDCP1 antibodies.

      To clarify this point, we have now modified the sentence as follows:

      (page 7, line 204-210)

      Interestingly, we found that the RpS3/+ wing pouch expressing the cell death inhibitor p35 (which allows dying RpS3/+ cells to survive) exhibited elevated levels of Wg protein, compared to the wild-type control (Fig 3A and 3B). This increase in Wg protein was significantly diminished by overexpressing the JNK inhibitor Puc (Fig 3C), suggesting that Wg expression is upregulated via JNK signaling in the M/+ wing pouch, similar to apoptotic cells in which JNK signaling induces the production of secreted growth factor, including Wg (18, 53, 54). In addition, ____W____e found t____hat GFP-Wg-positive puncta, derived from a knock-in allele ____(34)____, ____were more abundant in the RpS3/+ wing pouch compared to the wild-type control (Fig 3A and 3B). This increase in GFP-Wg-positive puncta, ____especially in the area with massive cell death within the RpS3/+ wing pouch (Fig S3C-S3D'')_, was more evident when using a membrane-tethered anti-GFP nanobody (Vhh4-CD8), which immobilizes GFP-Wg on the cell surface _(34)____ (Fig 3F and 3G, quantified in Fig 3J).

      Referee #2

      Primary Concerns:

      A significant challenge arises concerning the delineation of cell autonomy/non-autonomy. This study focuses on two distinct cell types, namely dying cells and proliferating cells. However, the consistent use of nub-gal4, a wing pouch driver, and the heterozygous minute mutant throughout the paper impedes the ability to conclusively analyze the autonomy of events. The authors previously posited that caspase-induced cell death triggers non-autonomous proliferation, but recent studies also suggested caspase-induced autonomous proliferation in both flies and mammals (Yosefzon et al. Mol. Cell 2018, Shinoda et al., PNAS 2019). Therefore, a meticulous distinction between autonomous and non-autonomous events, particularly through experimentation involving clones, is imperative. This necessity is particularly evident in Fig 4.

      Response:

      • As pointed out by the reviewer, it is ideal to dissect the mechanisms underlying cell turnover within the M/+ wing pouch, especially concerning cell autonomy and non-autonomy. However, clonal analysis is not feasible due to the pattern of cell death and subsequent proliferation occurring in a 'salt and pepper' distribution within the M/+* wing pouch. In response to this challenge, we found that almost dying cells do not undergo proliferation, as assessed by staining with the M phase marker phospho-Histone H3 (Fig R5, below).

      We also plan to investigate whether JNK-activating cells similarly refrain from proliferating, which will be assessed by staining with the anti-phospho-Histone H3 antibody.

      Additionally, about Fig 4, considering the evidence presented in this study, which demonstrates that dying cells increase Wg secretion through JNK-dependent exocytosis (Fig 3G-3H'), and given that Fz2 is expressed in adjacent cells within the M/+ wing pouch (Fig 4B-C'' and Fig S4B-C''), it is plausible that these neighboring cells could receive Wg via Fz2, leading to the upregulation of Wg signaling in these cells in the M/+ wing pouch. This upregulation of Wg signaling is supported by Fig S3A in the transferred manuscript. However, we were unable to delineate the specific role of Fz2, particularly in terms of cell autonomy and non-autonomy. In response to this challenge, we aim to explore whether cell turnover can be inhibited by enhancing Wg signaling exclusively in JNK-activated cells through the overexpression the active form of ArmadilloS10 (Baena-Lopez LA et al., Sci Signal., 2009), utilizing the puc-gal4 driver. Should cell turnover be inhibited under these conditions, it would indicate that differences in Wg signaling activity between dying cells and their neighboring cells drive cell turnover. While the exact mechanism of Fz2 remains unclear, the inhibition of cell turnover under these conditions clearly demonstrates that differences in Wg signaling activity play a significant role in cell turnover. Additionally, in our response to Reviewer 2's comment No. 2, we outline our intention to investigate whether the increase in Wg signaling could be induced by JNK-dependent exocytosis. Accordingly, we plan to assess the effects of downregulating JNK signaling or exocytosis on the elevated Wg signaling activity observed in the RpS3/+ wing pouch.

      Furthermore, we intend to present our findings with greater precision, steering clear of speculative interpretations. At this stage, we have focused on revising the Abstract to include clear conditional statements, as detailed below. We plan to comprehensively update the remaining sections of the manuscript to reflect the additional experiments requested by the reviewers.

      (page 2, line 34-39 in the "Abstract")

      "Our data also suggest a potential role for the Wg receptor Frizzled-2 (Fz2) in inducing cell-turnover within the M/+ wing pouch. Overall, our findings provide mechanistic insights into robust tissue growth through the orchestration of cell-turnover, which is primarily governed by JNK-mediated exocytosis in the context of Drosophila Minute/+ wing morphogenesis."

      Closely tied to the issue of cell autonomy/non-autonomy is the question of how cells differentiate Wg from dying cells and the dorsal-ventral boundary.

      Response:

      Wg is expressed at the dorsal-ventral boundary in both wild-type and M/+ wing discs. However, we observed in our previous study that Wg signaling activity was significantly more elevated in the RpS3/+ pouch compared to the localized activation in wild-type controls at the same developmental stage, as assessed by the nmo-lacZ reporter (Fig 3C-D' in Akai et al., PLOS Genetics, 2021, as also shown in Fig R6A-B' below). Interestingly, the areas of massive cell death in the RpS3/+ wing pouch always corresponded to the areas of relatively lower Wg signaling activity (Fig S3A in the transferred manuscript). Moreover, our previous study revealed that decreasing or increasing Wg signaling activity, thus reducing the aberrant Wg signaling gradient, significantly inhibited cell death in the M/+ wing pouch (Fig 3I-K in Akai et al., PLOS Genetics, 2021, as also shown in Fig R6C-E' below). This suggest that the aberrant Wg signaling gradient is crucial for massive cell-turnover in the M/+ wing pouch. As also described above, considering the evidence presented in this study, which demonstrates that dying cells increase Wg secretion through JNK-dependent exocytosis (Fig 3G-3H'), and given that Fz2 is expressed in adjacent cells (Fig 4B-C'' and Fig S4B-C'') within the M/+ wing pouch, it is plausible that these neighboring cells could receive Wg via Fz2, leading to the upregulation of Wg signaling in these cells within the M/+ wing pouch. To further investigate whether the increase in Wg signaling could be induced by JNK-dependent exocytosis, we plan to examine the effects of downregulating JNK signaling or exocytosis on the elevated Wg signaling activity observed in the RpS3/+ wing pouch. Should Wg signaling activity decrease as a result of these genetic interventions, it would imply that the enhanced Wg signaling in the M/+ wing pouch depends on JNK-mediated exocytosis.

      In Fig 1, the authors interpret the upregulation of exocytosis-related genes as indicative of increased exocytosis. However, this interpretation lacks direct evidence and overlooks the possibility of opposing effects. For example, autophagosome accumulation means either activation or inhibition of autophagy. Or, in case of Dilp secretion, absence of vesicles indicates upregulation of secretion. To substantiate their claim, the authors must provide more conclusive evidence of increased exocytosis. Fig 2 suggests that inhibiting exocytosis-related genes suppresses caspase activation, favoring the proposition that exocytosis is upregulated. However, demonstrating a direct increase in exocytosis in minute cells would bolster their argument. Higher resolution imaging of the exosome marker, with overlayed images, would enhance clarity too.

      Response:

      We have demonstrated an increase in the number of vesicles positive for EGFP-CD63 (an exosome marker) and Syt1-EGFP (a vesicle marker) in the RpS3/+ wing pouch compared to the wild-type control (Fig 1A, 1B, 1G, and 1H in the transferredmanuscript). To more directly investigate whether exocytosis is indeed elevated in the M/+ wing pouch, we plan to assess if cells activating JNK signaling upregulate the production of extracellular vesicles. This will be done by specifically expressing the EGFP-CD63 probe in JNK-activating cells, utilizing the puc-gal4 driver for targeted expression. Additionally, following the reviewer's suggestion, we plan to prepare high resolution images of the exosome marker, with overlayed images in the revised manuscript.

      Fig 3 introduces ambiguity regarding the relationship between cell death and exocytosis. The authors assert that dying cells exhibit elevated Wg protein levels compared to the wild-type control but omit an important comparison to the minute disc. Moreover, while Figs 1-2 propose a signaling cascade involving minute>JNK>exocytosis>cell death, Fig 3 implies that cell death regulates exocytosis. The coherence of their model and logic requires clarification - specifically, elucidating the molecular coupling mechanism between cell death and exocytosis.

      Response:

      We observed an increase in GFP-Wg-positive puncta originating from a knock-in allele (McGough et al., Nature, 2020) in the area of massive cell death within the RpS3/+ wing pouch, compared to the wild-type control (Fig 3A and 3B in the transferredmanuscript). However, as pointed out by the reviewer, it is challenging to determine whether the GFP-Wg-positive puncta are emanating from dying cells, given that Wg is secreted from the cells that produce it. To address this issue, we employed a membrane-tethered anti-GFP nanobody (Vhh4-CD8 morphotrap) designed to capture GFP-Wg on the cell surface, thereby preventing the diffusion of GFP-Wg from its producing cells. By utilizing the Vhh4-CD8 morphotrap, we found that GFP-Wg levels are indeed elevated in areas of cell death compared to adjacent regions within the RpS3/+ wing pouch (Fig 3G in the transferred manuscript).

      To clarify this point, we have now modified the sentence as follows:__ __

      (page 7, line 204-210)

      In addition, ____W____e found t____hat GFP-Wg-positive puncta, derived from a knock-in allele ____(34)____, ____were more abundant in the RpS3/+ wing pouch compared to the wild-type control (Fig 3A and 3B). This increase in GFP-Wg-positive puncta, ____especially in the area with massive cell death within the RpS3/+ wing pouch (Fig S3C-S3D'')_, was more evident when using a membrane-tethered anti-GFP nanobody (Vhh4-CD8), which immobilizes GFP-Wg on the cell surface _(34)____ (Fig 3F and 3G, quantified in Fig 3J).

      Additionally, as pointed out by the reviewer, we need to elucidate the molecular mechanism linking cell death and exocytosis. We are currently exploring two potential relationships between JNK activation, caspase signaling, and exocytosis, as depicted in Fig R2 below. To assess these possibilities, we aim to investigate if inhibiting apoptosis-either by introducing the H99/+ mutation or by overexpressing DroncDN or mirRHG-can affect JNK-mediated exocytosis in the RpS3/+ wing pouch.

      Specific Points:

      In Fig S1D, contrary to the authors' claim, cadp2 is not upregulated in a JNK-dependent manner.

      Response:

      We apologize for any confusion caused. We found that Cadps expression in the RpS3/+ wing pouch was increased compared to both the wild-type control (2.29-fold increase, RpS3/+ compared to wild-type) and the RpS3/+ wing pouch expressing Puckered (Puc), driven by the nub-gal4 driver (3.33-fold increase, RpS3/+ compared to RpS3/+ + Puckered). This suggests that the increase in Cadps expression in the RpS3/+ wing pouch is dependent on JNK signaling. We have now revised Fig. S1D for clearer representation. We also have made modifications in the transferred manuscript as follows:

      (page 5, line 107-113)

      "Mining the list of genes differentially expressed in the RpS3/+ wing pouch cells dependent on JNK signaling (Fig S1C and S2 Table), we noticed that among the genes associated with the "secretion by cell" GO term (Fig S1B), the evolutionarily conserved exocytosis-related genes unc-13, SNAP25, and cadps (Calcium-dependent secretion activator) were upregulated in a JNK-dependent manner (2.29-fold increase, RpS3/+ compared to wild-type; 3.33-fold increase, RpS3/+ compared to RpS3/+ +_ Puckered)_ (Fig S1D)."

      When detecting multiple proteins in the same tissue, it is advisable for the authors to present overlayed images to enhance the clarity of their findings. Many pictures require higher magnification too.

      Response:

      Following the reviewer's suggestion, we have overlayed images in Fig 1A-D, Fig 1G-J, and Fig S1E in the transferredmanuscript. Additionally, following the reviewer's suggestion, we plan to prepare high resolution images in the revised manuscript.

      In Fig 1A, the observed upregulation of EGFP-CD63 and Syt1-EGFP may potentially result from an artifactual effect of apoptosis. To validate their findings are specific, the authors should include negative controls that do not exhibit upregulation in dying cells.

      Response:

      Following the reviewer's suggestion, we used the CD8-PARP-Venus probe as a negative control. We observed that CD8-PARP-Venus-positive puncta were minimally present (Fig R7, below), indicating that the upregulation of EGFP-CD63 and Syt1-EGFP in the RpS3/+ wing pouch is indeed occurring, rather than being an artifactual effect of apoptosis. We intend to incorporate this negative control into the revised manuscript.

      Referee #3

      Major comments:

      Figure 1A-H, S1E. The authors used EGFP-CD63 and Syt1-EGFP as markers of exocytosis. They see an increased number of puncta in apoptotic cells. Is this a specific effect in the dying cells in M mutant discs, or is it a general effect in apoptotic cell death? This should be examined in a condition where apoptosis is induced independently of M mutants such as nub-reaper or nub-hid.

      Response:

      We thank the reviewer for the comment. To ascertain whether the increase in EGFP-CD63-/Sty1-EGFP-positive puncta is specific to dying cells in M/+ mutants or a general characteristic of apoptotic cell death, we examined the presence of EGFP-CD63-positive puncta in the wing pouch, where Reper (Rpr) was expressed under the control of the nub-gal4 driver. Unfortunately, the expression of Rpr driven by nub-gal4 resulted in significant cell death, preventing us from drawing a definitive conclusion (Fig R8, below). Nonetheless, we did observe EGFP-CD63-positive puncta under these conditions, as indicated by the arrowheads in Fig R8 below. To further investigate, we plan to induce temporary expression of Rpr in the wing pouch using a combination of the temperature-sensitive Gal80 and the nub-gal4 driver.

      Is exocytosis actually upstream or downstream of cell death, or both? The authors are kind of vague about it. On one hand, they say the dying cells induce exocytosis and secrete Wg. On the other hand, unc13RNAi can suppress cell death (Figure 1D', J'). Please clarify.

      Response:

      As pointed out by the reviewer, we need to clarify whether exocytosis actually occurs upstream or downstream of cell death. We are currently exploring two potential relationships between JNK activation, caspase signaling, and exocytosis, as depicted in Fig R2 below. To assess these possibilities, we aim to investigate if inhibiting apoptosis-either by introducing the H99/+ mutation or by overexpressing DroncDN or mirRHG-can affect JNK-mediated exocytosis in the RpS3/+ wing pouch.

      Figure 1L-P. The observation of Ca++ flashes is very interesting. However, are they important for exocytosis, cell death and compensatory proliferation? Right now, this is just a stand-alone observation. Can mutants affecting Ca++ signaling block exocytosis, cell death and comp prol?

      Response:

      We thank the reviewer for the comment. we plan to investigate whether the downregulation of Ca++ signaling impacts exocytosis, cell death, and compensatory proliferation in the M/+ wing pouch. This will be achieved by downregulating genes essential for sensing intracellular calcium concentrations (Rizo J and Rosenmund C, Nat Struct Mol Biol., 2008; Sudhof TC, Annu Rev Neurosci., 2004) and genes encoding voltage-gated calcium channels (Kuromi H et al., Neuron, 2004).

      Figure 3B'. Can you visualize aberrant wg expression in RpS3/+ wing discs only in the presence of p35? The expression of p35 in apoptotic cells generates undead cells which by itself induce Wg expression in a JNK-dependent manner (Perez-Garijo et al 2004; Ryoo et al 2004). Also, in Figure 3B', Wg is upregulated in the ventral half of the pouch, whereas in other figures (4B for example) apoptosis is strongly induced in the dorsal half. Does that make sense?

      Response:

      We thank the reviewer for the comment. In the RpS3/+ wing pouch without p35, we were unable to detect any upregulation of Wg using the anti-Wg antibody (Fig R3, below). However, we observed an increase in GFP-Wg-positive puncta, derived from a knock-in allele (McGough et al., Nature, 2020), in areas of massive cell death within the RpS3/+ wing pouch without p35, relative to the wild-type control (Fig 3B, compared to Fig 3A in the transferred manuscript). This increase is similar to the phenotype observed in the RpS3/+ wing pouch expressing p35, where Wg-positive puncta are significantly elevated (Fig R4B below, corresponding to Fig 3B in the original manuscript). Moreover, the increase in GFP-Wg-positive puncta in regions of massive cell death in the RpS3/+ wing pouch becomes more pronounced when utilizing a membrane-tethered anti-GFP nanobody (Vhh4-CD8) (McGough et al., Nature, 2020) (Fig 3G, compared to Fig 3F). These observations indicate that Wg-positive puncta are indeed upregulated in the RpS3/+ wing pouch without p35 compared to the wild-type control.

      In the transferred manuscript, we have now made modifications to Fig. 3 by replacing images of the RpS3/+ wing pouch overexpressing p35, which were stained with the Wg antibody, with new images. These new images depict the RpS3/+ wing pouch that harbors the GFP-Wg knock-in allele, which was stained with both anti-GFP and anti-cDCP1 antibodies.

      To clarify this point, we have now modified the sentence as follows:

      (page 7, line 204-210)

      Interestingly, we found that the RpS3/+ wing pouch expressing the cell death inhibitor p35 (which allows dying RpS3/+ cells to survive) exhibited elevated levels of Wg protein, compared to the wild-type control (Fig 3A and 3B). This increase in Wg protein was significantly diminished by overexpressing the JNK inhibitor Puc (Fig 3C), suggesting that Wg expression is upregulated via JNK signaling in the M/+ wing pouch, similar to apoptotic cells in which JNK signaling induces the production of secreted growth factor, including Wg (18, 53, 54). In addition, ____W____e found t____hat GFP-Wg-positive puncta, derived from a knock-in allele ____(34)____, ____were more abundant in the RpS3/+ wing pouch compared to the wild-type control (Fig 3A and 3B). This increase in GFP-Wg-positive puncta, ____especially in the area with massive cell death within the RpS3/+ wing pouch (Fig S3C-S3D'')_, was more evident when using a membrane-tethered anti-GFP nanobody (Vhh4-CD8), which immobilizes GFP-Wg on the cell surface _(34)____ (Fig 3F and 3G, quantified in Fig 3J).

      Additionally, we apologize for any confusion caused by Fig 3B'. It is noteworthy that cell death sometimes occurs more prominently in the ventral half of the RpS3/+ wing pouch than in the dorsal half (Fig R9, below). Furthermore, the increase in Wg level can be more pronounced on the ventral side, or at times, it is equally strong on both the dorsal and ventral sides (Fig S3C-D in the transferred manuscript).

      Can wg RNAi in cells destined to die (puc-Gal4 UAS-wgRNAi) suppress apoptosis and comp prol?

      Response:

      Following the reviewer's suggestion, we intend to investigate whether expressing Wg-RNAi in JNK activated cells, using the puc-gal4 driver, could suppress apoptosis and compensatory proliferation.

      Figure 3D,E. Use cDcp1 as apoptotic marker instead of CD63-mCherry which is not an apoptotic marker.

      Response:

      Following the reviewer's suggestion, we have now utilized the cDCP1 antibody as an apoptotic marker in Fig S3B-D'' in the transferred manuscript, as described in the Reviewer 3's comment No.4.

      Figure 4A' and B'. I am not sure there is much of a difference in the expression of fz2-lacZ in wild-type and RpS3/+ discs. The quantification in 4F shows there is no difference in the dorsal half of the pouch where massive apoptosis occurs. Can you generate a condition in which only one compartment (anterior or posterior) is mutant for RpS3/+, while the other compartment is wt? That would allow direct side-by-side comparison. Comparisons between different discs is problematic.

      Response:

      We thank the reviewer for the comment. Following the reviewer's suggestion, we conducted a more detailed investigation into the differences in fz2-lacZ expression levels between wild-type and RpS3/+ wing discs. We found that the expression level of fz2-lacZwas indeed elevated in the RpS3/+ wing pouch. This elevation in expression is demonstrated by the results of the genetic rescue experiment in the posterior compartment of RpS3/+ wing discs, where overexpression of RpS3 using the engrailed-gal4 driver led to a reduction in fz2-lacZ expression compared to the anterior RpS3/+ control (Fig R10, below).

      After conducting statistical analysis, we plan to include Fig R10 in the revised manuscript.

      There is some inconsistency in the presence of apoptotic cells and presence of cells which secrete Wg. Apoptotic cells occur in large cell clusters (Fig. 2G, 3H', S3A', S4B), while markers of exocytosis and Wg are present in individual puncta (Fig 3D', E', S3C,E). That does not seem to fit with the authors' conclusion that dying cells secrete Wg by exocytosis. Are all dying cells secreting Wg through exocytosis? Please explain.

      Response:

      We thank the reviewer for the comment. As highlighted by the reviewer, it appears that not all dying cells secrete Wg through exocytosis. Beyond exosomes, various mechanisms for Wg/Wnt transport have been proposed, including those mediated by lipoprotein particles, the cell-surface proteoglycan Dally-like protein (Dlp), and filopodia-like cellular extensions known as cytonemes. Therefore, it's plausible that dying cells may also utilize these additional mechanisms for Wg transport alongside exosomes. We have incorporated this explanation into the transferred manuscript as follows:

      (page 9, line 272-286)

      Previous studies have identified exosomes as carriers of Wnt/Wg in the extracellular space of both mammalian and Drosophila cells, including wing disc cells (28, 49, 50, 51, 52, 64). Concurrently, alternative mechanisms for Wg transport, such as those involving lipoprotein particles or a lipocalin Swim in the Drosophila wing disc, have been reported (29, 65). Additionally, the cell-surface proteoglycan Dally-like-protein (Dlp) has been reported to enable long-range signaling of the palmitoylated Wg _(51)._ Intriguingly, Wnt/Wg transport has also been observed through filopodia-like cellular extensions known as cytonemes (66, 67, 68, 69). In our study of the M/+_ wing disc, a model characterized by massive cell-turnover, we observed partial colocalization of Wg-GFP puncta with exosomal markers such as CD63-mCherry and Hrs._ Our data also suggest that Wg secretion through exocytosis may not uniformly occur among all dying cells within this context. It is noteworthy that whereas dying cells frequently form large clusters, exosomal markers and Wg typically localize within individual puncta. This disparity suggests that while exosomes from dying cells significantly contribute to Wg transport within the ____M/+____ wing pouch, other pathways may also be operative.

      I was a bit confused by the conclusion of the authors about the results in Figure 4G-I. What do they mean by "difference in Fz2 expression"? Are they referring to the gradient that they described in their previous work? Or is it just the absolute level of Fz2 that determines apoptosis and proliferation? If the latter, is the gradient still present (at least in fz2/+), just at a lower level?

      Response:

      I apologize for any confusion caused by the way we presented our conclusions regarding the results depicted in Figure 4G-I in the original manuscript.

      In our previous study, we observed that Wg signaling activity was elevated much more broadly in the RpS3/+ pouch compared to the localized activation observed in the wild-type control at the same developmental stage, as assessed by the nmo-lacZ reporter(Fig 3D in Akai et al., PLOS Genetics, 2021, as also shown Fig. R6 below). Interestingly, the areas of massive cell death in the RpS3/+ wing pouch always corresponded to the areas of relatively lower Wg signaling activity (as also shown in Fig S3A in the transferred manuscript). Moreover, our previous study revealed that decreasing or increasing Wg signaling activity, thus reducing the aberrant Wg signaling gradient, significantly inhibited cell death in the M/+ wing pouch (Fig 3I-K in Akai et al., PLOS Genetics, 2021, as also shown in Fig R6C-E' below). This suggest that the aberrant Wg signaling gradient is crucial for massive cell-turnover in the M/+ wing pouch.

      Considering the evidence presented in this study, which demonstrates that dying cells increase Wg secretion through JNK-dependent exocytosis (Figure 3G-3H'), and given that Fz2 is expressed in adjacent cells within the M/+ wing pouch (Figure 4B-C'' and Fig S4B-C''), it is plausible that these neighboring cells could receive Wg via Fz2, leading to the upregulation of Wg signaling in these cells in the M/+ wing pouch.

      However, we were unable to delineate the specific role of Fz2, particularly in terms of cell autonomy and non-autonomy as highlighted by Reviewer 2, through clonal analysis due to the occurrence of cell death and subsequent proliferation in a salt-and-pepper pattern within the M/+ wing pouch. In response to this challenge, we aim to explore whether cell turnover can be inhibited by enhancing Wg signaling exclusively in JNK-activated cells through the overexpression the active form of ArmadilloS10(Baena-Lopez LA et al., Sci Signal., 2009), utilizing the puc-gal4 driver. Should cell turnover be inhibited under these conditions, it would indicate that differences in Wg signaling activity between dying cells and their neighboring cells drive cell turnover. While the exact mechanism of Fz2 remains unclear, the inhibition of cell turnover under these conditions clearly demonstrates that differences in Wg signaling activity play a significant role in cell turnover. Additionally, in our response to Reviewer 2's comment No.2, we outline our intention to investigate whether the increase in Wg signaling could be induced by JNK-dependent exocytosis. Accordingly, we plan to assess the effects of downregulating JNK signaling or exocytosis on the elevated Wg signaling activity observed in the RpS3/+ wing pouch.

      Furthermore, we intend to present our findings with greater precision, steering clear of speculative interpretations. At this stage, we have focused on revising the Abstract to include clear conditional statements, as detailed below. We plan to comprehensively update the remaining sections of the manuscript to reflect the additional experiments requested by the reviewers.

      (page 2, line 34-39 in the "Abstract")

      "Our data also suggest a potential role for the Wg receptor Frizzled-2 (Fz2) in inducing cell-turnover within the M/+ wing pouch. Overall, our findings provide mechanistic insights into robust tissue growth through the orchestration of cell-turnover, which is primarily governed by JNK-mediated exocytosis in the context of Drosophila Minute/+ wing morphogenesis."

      Regarding point 8, given that reduction of Fz2 can suppress apoptosis in RpS3/+ wing discs, how does it regulate apoptosis when it is down-regulated anyway in apoptotic cells (Fig 4B', S4B')?

      Response:

      As described in our response to point 9, our previous study suggested that we observed that Wg signaling activity was elevated much more broadly in the RpS3/+ pouch compared to the localized activation observed in the wild-type control at the same developmental stage, as assessed by the nmo-lacZ reporter (Fig 3D in Akai et al., PLOS Genetics, 2021, as also shown Fig. R6 above). This elevation leads to the formation of an ectopic cell population with high Wg signaling activity in the wing pouch, potentially causing non-autonomous cell death among cells exhibiting lower Wg signaling activity, a process akin to cell competition. Notably, the significant reduction of apoptosis in RpS3/+ wing discs following Fz2 downregulation suggests that Fz2 may play a crucial role in the massive cell turnover, through a mechanism similar to cell competition, even though the exact mechanism remains unclear. We plan to incorporate such discussions into the discussion section of the revised manuscript.

      Figure S1D. It seems that cadps is further up-regulated if JNK signaling is inhibited in RpS3/+ cells. Why did the authors select this gene?

      Response:

      We apologize for any confusion caused. We found that Cadps expression in the RpS3/+ wing pouch was increased compared to both the wild-type control (2.29-fold increase, RpS3/+ compared to wild-type) and the RpS3/+ wing pouch expressing Puc, driven by the nub-gal4 driver (3.33-fold increase, RpS3/+ compared to RpS3/+ +Puc). This suggests that the increase in Cadpsexpression in the RpS3/+ wing pouch is dependent on JNK signaling. We have now revised Fig S1D for clearer representation. We also have made modifications in the transferred manuscript as follows:

      (page 5, line 107-113)

      "Mining the list of genes differentially expressed in the RpS3/+ wing pouch cells dependent on JNK signaling (Fig S1C and S2 Table), we noticed that among the genes associated with the "secretion by cell" GO term (Fig S1B), the evolutionarily conserved exocytosis-related genes unc-13, SNAP25, and cadps (Calcium-dependent secretion activator) were upregulated in a JNK-dependent manner (2.29-fold increase, RpS3/+ compared to wild-type; 3.33-fold increase, RpS3/+ compared to RpS3/+ +_ Puckered)_ (Fig S1D)."

      Figure S1H'. I don't see that Hrs is upregulated in this panel.

      Response:

      We thank the reviewer for the comment. Our quantitative analysis showed an increase in Hrs-positive puncta in the RpS3/+ wing pouch compared to the wild-type control (Fig S1I). We have substituted the relevant Figure with new images that more clearly demonstrate the elevation of Hrs-positive puncta (Fig S1G-H' in the transferred manuscript).

      Minor points:

      Describe in more detail, what are unc13, SNAP25 and cadps.

      Response:

      We thank the reviewer for the comment. We have now included an explanation for unc-13, SNAP25 and cadps in the transferredmanuscript as follows:

      (page 5, line 117-126)

      "It has been shown that unc-13, SNAP25, and cadps collectively regulate the docking process of secretory vesicles to the plasma membrane during Ca2+-mediated exocytosis ____((21, 23, 24), Reviewed in (22)). ____Specifically, UNC-13, a conserved presynaptic protein with calcium-binding domains, interacts with syntaxin to prime vesicles for fusion, crucial for calcium-regulated exocytosis (____23____). SNAP-25, in conjunction with syntaxin-1 and synaptobrevin, forms the pivotal SNARE complex for neuronal exocytosis, assembling into a four-helix bundle that is essential for drawing vesicle and plasma membranes close together to enable membrane fusion (____24, 25____). Like UNC-13, CAPS possesses conserved C-terminal domains that are instrumental in the assembly of SNARE complexes, thus priming vesicles for Ca2+-induced exocytosis (____26)____.____"

      I wondered about the use of Hrs as exosome marker. To my knowledge, it is an endosomal marker. Same with Alix. Please clarify.

      Response:

      As pointed out by the reviewer, ESCRT-0 component Hrs is also localized to endosome. However, Hrs is also used as an exosome marker in the previous manuscript (McGough IJ et al., Nature, 2020), alongside Alix and Tsg101, which are widely recognized as exosome markers (Willms E et al., Sci Rep., 2016; Dear JW et al., Proteomics, 2013). Indeed, Hrs is a key factor for mediating ESCRT-dependent exosome secretion in mammals and flies (Tamai K et al., Biochem Biophys Res Commun., 2010; Gross JC et al., Nat Cell Biol., 2012; Colombo M et al., J Cell Sci., 2013; Vietri M et al., Nat Rev Mol Cell Biol., 2020). We have incorporated this explanation into the revised manuscript as follows:

      (page 6, line 139-141)

      "____Furthermore, we noted an increase in vesicles positive for the ESCRT protein Hrs, ____an additional exosome marker crucial for exosome secretion (____34, 35____), in the RpS3/+ wing pouch relative to the wild-type control (Fig S1G and S1H, quantified in Fig S1I)."

      In the context of references 16 and 17, Huh et al. (2004), Current Biology needs also to be cited.

      Response:

      We thank the reviewer for the comment. We have referenced the paper by Huh et al. (2004) published in Current Biology in the transferred manuscript as follows:

      (page 3, line 68-71)

      "Apoptotic cells, for example, can secrete mitogens su____ch as Wingless ____(Wg; a Wnt homolog), dpp (a BMP homolog), and Hh, which could promote the proliferation of nearby cells in the Drosophila epithelium (15, _16_, 17, 18, 19)."

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: In previous work, the authors showed that in the pouch of wing imaginal discs in Minute (M) mutants apoptosis and compensatory proliferation are dramatically increased in a Wingless (Wg) and JNK-dependent manner. Here, they show that JNK-induced exocytosis is mediating the secretion of Wg in apoptotic cells. Wg in turn stimulates the Fz2 receptor in neighboring surviving cells promoting their proliferation.

      Major comments:

      1. Figure 1A-H, S1E. The authors used EGFP-CD63 and Syt1-EGFP as markers of exocytosis. They see an increased number of puncta in apoptotic cells. Is this a specific effect in the dying cells in M mutant discs, or is it a general effect in apoptotic cell death? This should be examined in a condition where apoptosis is induced independently of M mutants such as nub-reaper or nub-hid.
      2. Is exocytosis actually upstream or downstream of cell death, or both? The authors are kind of vague about it. On one hand, they say the dying cells induce exocytosis and secrete Wg. On the other hand, unc13RNAi can suppress cell death (Figure 1D', J'). Please clarify.
      3. Figure 1L-P. The observation of Ca++ flashes is very interesting. However, are they important for exocytosis, cell death and compensatory proliferation? Right now, this is just a stand-alone observation. Can mutants affecting Ca++ signaling block exocytosis, cell death and comp prol?
      4. Figure 3B'. Can you visualize aberrant wg expression in RpS3/+ wing discs only in the presence of p35? The expression of p35 in apoptotic cells generates undead cells which by itself induce Wg expression in a JNK-dependent manner (Perez-Garijo et al 2004; Ryoo et al 2004). Also, in Figure 3B', Wg is upregulated in the ventral half of the pouch, whereas in other figures (4B for example) apoptosis is strongly induced in the dorsal half. Does that make sense?
      5. Can wg RNAi in cells destined to die (puc-Gal4 UAS-wgRNAi) suppress apoptosis and comp prol?
      6. Figure 3D,E. Use cDcp1 as apoptotic marker instead of CD63-mCherry which is not an apoptotic marker.
      7. Figure 4A' and B'. I am not sure there is much of a difference in the expression of fz2-lacZ in wild-type and RpS3/+ discs. The quantification in 4F shows there is no difference in the dorsal half of the pouch where massive apoptosis occurs. Can you generate a condition in which only one compartment (anterior or posterior) is mutant for RpS3/+, while the other compartment is wt? That would allow direct side-by-side comparison. Comparisons between different discs is problematic.
      8. There is some inconsistency in the presence of apoptotic cells and presence of cells which secrete Wg. Apoptotic cells occur in large cell clusters (Fig. 2G, 3H', S3A', S4B), while markers of exocytosis and Wg are present in individual puncta (Fig 3D', E', S3C,E). That does not seem to fit with the authors' conclusion that dying cells secrete Wg by exocytosis. Are all dying cells secreting Wg through exocytosis? Please explain.
      9. I was a bit confused by the conclusion of the authors about the results in Figure 4G-I. What do they mean by "difference in Fz2 expression"? Are they referring to the gradient that they described in their previous work? Or is it just the absolute level of Fz2 that determines apoptosis and proliferation? If the latter, is the gradient still present (at least in fz2/+), just at a lower level?
      10. Regarding point 8, given that reduction of Fz2 can suppress apoptosis in RpS3/+ wing discs, how does it regulate apoptosis when it is down-regulated anyway in apoptotic cells (Fig 4B', S4B')?
      11. Figure S1D. It seems that cadps is further up-regulated if JNK signaling is inhibited in RpS3/+ cells. Why did the authors select this gene?
      12. Figure S1H'. I don't see that Hrs is upregulated in this panel.

      Minor points:

      1. Describe in more detail, what are unc13, SNAP25 and cadps.
      2. I wondered about the use of Hrs as exosome marker. To my knowledge, it is an endosomal marker. Same with Alix. Please clarify.
      3. In the context of references 16 and 17, Huh et al. (2004), Current Biology needs also to be cited.

      Referees cross-commenting

      I think all three reviewers agree in their assessment of this manuscript. Some of the comments by the reviewers address the same concerns. I liked the comment by reviewer 2 about the autonomy/non-autonomy of the signaling events in this model. I was thinking that, too, but didn't express it in my review. So, thanks reviewer 2, for bringing this up.

      Significance

      General assessment. A major strength of the work is that the experiments are done in a very good manner. The authors used multiple assays to come to the same conclusions. Limitations and weaknesses are mentioned in "major comments".

      Advance. In my mind, one problem is novelty of this work. The authors showed before that Wg is secreted by dying cells in Minute mutants. It was also shown that Wg can be secreted by exocytosis in many studies from different authors. Here, the authors basically combine both observations and show that Wg is secreted by exosomes in Minute mutants.

      Audience. This work likely addresses a specialized audience involved in Minute-induced cell competition. I don't think it will be of interest beyond this specific field.

      Background of the reviewer. I have an interest in cell competition, apoptosis and neurodegeneration.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors initiate their study by illustrating that minute cells undergo an augmentation in exocytosis-related proteins and calcium flashes (Fig 1). Subsequently, they establish a correlation between JNK-mediated exocytosis and the regulation of caspase activation in minute cells (Fig 2). Further, the authors unveil a dependency of Wg secretion on exocytosis, elucidated in Fig 3. Finally, they discern the involvement of one of the Wg receptors, Fz2, in the cell death/proliferation phenotype of minute cells. Despite the formulation of an intriguing hypothesis, the presented data falls short of robustly supporting their assertions.

      Primary Concerns:

      1. A significant challenge arises concerning the delineation of cell autonomy/non-autonomy. This study focuses on two distinct cell types, namely dying cells and proliferating cells. However, the consistent use of nub-gal4, a wing pouch driver, and the heterozygous minute mutant throughout the paper impedes the ability to conclusively analyze the autonomy of events. The authors previously posited that caspase-induced cell death triggers non-autonomous proliferation, but recent studies also suggested caspase-induced autonomous proliferation in both flies and mammals (Yosefzon et al. Mol. Cell 2018, Shinoda et al., PNAS 2019). Therefore, a meticulous distinction between autonomous and non-autonomous events, particularly through experimentation involving clones, is imperative. This necessity is particularly evident in Fig 4.
      2. Closely tied to the issue of cell autonomy/non-autonomy is the question of how cells differentiate Wg from dying cells and the dorsal-ventral boundary.
      3. In Fig 1, the authors interpret the upregulation of exocytosis-related genes as indicative of increased exocytosis. However, this interpretation lacks direct evidence and overlooks the possibility of opposing effects. For example, autophagosome accumulation means either activation or inhibition of autophagy. Or, in case of Dilp secretion, absence of vesicles indicates upregulation of secretion. To substantiate their claim, the authors must provide more conclusive evidence of increased exocytosis. Fig 2 suggests that inhibiting exocytosis-related genes suppresses caspase activation, favoring the proposition that exocytosis is upregulated. However, demonstrating a direct increase in exocytosis in minute cells would bolster their argument. Higher resolution imaging of the exosome marker, with overlayed images, would enhance clarity too.
      4. Fig 3 introduces ambiguity regarding the relationship between cell death and exocytosis. The authors assert that dying cells exhibit elevated Wg protein levels compared to the wild-type control but omit an important comparison to the minute disc. Moreover, while Figs 1-2 propose a signaling cascade involving minute>JNK>exocytosis>cell death, Fig 3 implies that cell death regulates exocytosis. The coherence of their model and logic requires clarification - specifically, elucidating the molecular coupling mechanism between cell death and exocytosis.

      Specific Points:

      1. In Fig S1D, contrary to the authors' claim, cadp2 is not upregulated in a JNK-dependent manner.
      2. When detecting multiple proteins in the same tissue, it is advisable for the authors to present overlayed images to enhance the clarity of their findings. Many pictures require higher magnification too.
      3. In Fig 1A, the observed upregulation of EGFP-CD63 and Syt1-EGFP may potentially result from an artifactual effect of apoptosis. To validate their findings are specific, the authors should include negative controls that do not exhibit upregulation in dying cells.

      Referees cross-commenting

      I agree with comments by other reviewers.

      Significance

      The authors initiate their study by illustrating that minute cells undergo an augmentation in exocytosis-related proteins and calcium flashes (Fig 1). Subsequently, they establish a correlation between JNK-mediated exocytosis and the regulation of caspase activation in minute cells (Fig 2). Further, the authors unveil a dependency of Wg secretion on exocytosis, elucidated in Fig 3. Finally, they discern the involvement of one of the Wg receptors, Fz2, in the cell death/proliferation phenotype of minute cells. Despite the formulation of an intriguing hypothesis, the presented data falls short of robustly supporting their assertions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this study, Akai and colleagues study the cellular mechanism leading to high cell turnover in the Minute mutant background in Drosophila wing disc. Previously, the same group characterised an unexpected high rate of proliferation in the wing pouch of the Minute heterozygous larvae, which have been long known to have slower development time. This high rate of proliferation is driven by high apoptosis rate, JNK activation, the release of Dilp8 hormone, and the upregulation of Wg signaling (downstream of JNK) necessary for increased apoptosis and proliferation ( Akai et al., Plos Genetics 2021). Here, the authors address now the molecular link connecting JNK activation and the increased turnover. Using RNAseq and comparing WT, M+/- wing disc with M+/- wing disc upon JNK inhibition, they found a number of genes associated with exocytosis that were upregulated in Minute disc which was lost upon JNK inhibition. The authors first confirm the existence of higher number of exocytic vesicles in Minute wing disc as well as higher calcium activity that were all reduced upon JNK inhibition. Moreover, downregulation of several components of the exocytosis machinery abolished the high turnover rate of Minute wing disc (reducing both apoptosis and proliferation) while leading to the appearance of morphological defects in adult wing, while having no effect in a WT background. Interestingly, Wg accumulates in these exocytosis vesicles, and this accumulation relies on JNK and core exocytosis machinery. The role of Wg is confirmed by reducing the concentration (using heterozygous mutant or RNAi) of the Wg receptor Fz2, which is also specifically downregulated in the cluster of apoptotic cells. Altogether, this study suggests that upregulation of exocytosis by JNK activation is a central regulator of the higher cell turnover in Minute background.

      The article is well written and the data overall convincing, including a lot of genetic backgrounds confirming the impact of exocytosis, as well as all the necessary quantifications. Some additional epistatic experiments may help to clearly test to which extend exocytosis is the major downstream target of JNK, and additional control may also help to clarify the sufficiency of JNK for generating such phenotype. Finally, I believe the epistatic link between exocytosis and Wg would deserve more experiments to be definitly proven.

      Major comments:

      1. It would be important to check how much JNK is sufficient to trigger the exocytosis upregulation. Is the accumulation of Cyt1 and CD63 vesicles in apoptotic cell common to any JNK dependent death or does it require the Minute background ? Could the authors check whether clones expressing HepCA transiently in WT background also accumulate the same vesicles ?
      2. It would be relevant to have the status of JNK in Minute disc upon downregualtion of exocytosis. One could imagine some positive feedback between JNK activation, exocytosis, cell death and further JNK activation.
      3. So far, the evidence for the epistatic link between exocytosis, Wg and cell turnover is mostly based on colocalization and the similarity of the phenotype but I believe this may need some additional evidences. Ideally one would need to be able to enhance exocytosis and test whether Wg downregulation suppress the phenotype, but I am not sure that upregulation of core exocytosis genes will be sufficient to do this. Alternatively, if Wg is indeed downstream of the upregulation of exocytosis, the reduction of cell turnover upon Wg flattening (e.g. : ftz2-/+ background) should not be enhanced by the reduction of exocytosis. Moreover, could the authors test the status of Wg downstreams targets upon inhibition of exocytosis in the Minute background (for instance, do they see a supression of the nmo-LacZ upregulation that they previously characterised in Minute wing disc in 2021) ?
      4. Since Cyt1 and CD63 seem to mostly accumulate in apoptotic cells, it would be interesting to check their status in Minute wing disc upon apoptosis inhibition (e.g. : with H99 or mirRHG).
      5. I would remain cautious about some of the statements, notably in the abstract, since some of them are mostly speculative and not really based on any experiments. For instance, the statement "This interaction between dying cells and their neighboring living cells is pivotal in determining cell fate, dictating which cells will undergo apoptosis and which cells will proliferate" is not backed up by any experiment (which would require to show that exocytosis and Wg from the dying cell specifically is required for the survival and proliferation of their neighbours, and/or showing that cell death occurs specifically in cells with local differences in Wg signaling). I would recommend to be more cautious here and us a clear conditional statement.

      Other minor point:

      The authors document Wg localisation in Minute wing disc upon expression of P35. It would be interesting to describe what is the status of Wg in Rps3+/- compared to WT without p35 (if I am correct, this was done in their previous article, and in that case it would be relevant to describe these former results in the main text).

      Referees cross-commenting

      I overall agree with all the other comments. It seems that the overall assessment and criticisms (cell autonomy, better characterisation of the status of exocytsosi) are along the same line

      Significance

      This article is mostly a follow up of a former study published by the same authors in Plos Genetics in 2021. The high turnover (high proliferation and high apoptosis rate) in the minute background was the most surprising observations that was performed previously by the authors, and the role of JNK and Wg was already quite extensively explored in this previous article. The main novelty here is to provide a systematic analysis of expression profile of Minute disc upon JNK inhibition and identify the strong contribution of exocytosis for increased apoptosis and proliferation in Minute wing disc downstream of JNK. In that sense, I believe these are interesting results, but novelty remains a bit limited (at least relative to their previous study). Still, these results could be interesting for the community studying cell competition, ribosomopathies, and growth regulation specially in Drosophila (so mostly for a specialised readership).

      I have expertise in epithelial cell death, apoptosis, and cell competition, specially in Drosophila. I feel confident to evaluate every part of the article.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2024-02352

      Corresponding author(s): Elise, Belaidi

      1. General Statements

      We would like to thank the reviewers for their constructive suggestions and comments. We hope that the “point by point answer” and the revision plan proposed below will convince the reviewers and the editor.

      We thank the reviewer 1 for his/her comments. We would like to specify that the involvement of HIF-1 in IH-induced mitochondrial remodeling has indeed been initiated by RNA-seq analysis and confirmed in a cell-based model as well as in wild-type and HIF-1a+/- heterozygous mice subjected to intermittent hypoxia (IH). In vivo, we originally demonstrated that Metformin reversed IH-induced increase in myocardial infarct size through AMPKa2 and, we proposed that metformin could modify HIF-1 activity. Then, we validated our hypothesis in an in vitro model allowing to demonstrate that Metformin, by increasing HIF-1a phosphorylation decreases its activity. We acknowledge that we used several models and this is the reason why we detailed as much as possible the Materials and Methods section including all models, experimental sets designed and methods details. We hope that the point by point response that we made for the reviewer 1 will increase the clarity of our work and we hope that the new results provided will strengthen the evidences concerning the mechanisms by which metformin can inhibit and modulate the deleterious impact of HIF-1 on IH-induced an increase in myocardial infarct size.

      We thank the reviewer 2 for his/her conclusion highlighting that “our work opens new avenues for exploring the potential effects of metformin as a modulatory of HIF-1𝛂 activity in obstructive sleep apnea syndrome”. We hope that the clarifications and/or justifications brought will convince him/her.<br /> We thank the reviewer 3 for having underlined that “metformin induces HIF-1α phosphorylation, decreases its nuclear localization and subsequently HIF-1 transcriptional activity are very much interesting” and for having highlighting that “our study is convincing”. We hope that the justification and the corrections brought in the point by point answer will convince him/her.

      Alltogether, as underlined by the 3 reviewers, our study is very interesting for translational science in the fields of cardiovascular, respiratory and sleep medicine. We hope that the point by point answer and the revision plan proposed will allow the publication of our article in EMBO Molecular Medicine.

      2. Description of the planned revisions

      • *

      Please find below the revision that we plan to address to answer to the questions of the reviewer 1.


      Figure 1 - it would be helpful to list all of the DEGs (what genes are changed?). Including the expression of HIF-1α and PHD isoforms would be informative. If there is a robust HIF-1α signal, changes in the expression of HIF and PHD isoforms would be anticipated. Fig 1F - with regards to glycolysis and hypoxia pathway analysis, most of the DEGs are not canonical HIF-1α/hypoxia targets.

      Figure 1 aimed at better understanding and manipulate the well-recognized involvement of HIF-1 in response to our specific IH stimulus (Semenza, Physiology 2009; Belaidi, Pharmacol & Ther. 2016). __The results provided by the RNA-seq analysis shows that IH induces cardiac oxidative and metabolic stress which are inter-related with HIF-1 activation. __We did not claim that these genes are HIF-1 targets genes. The RNA seq analysis did not allow to reveal HIF-1a and PHD1-3 transcript as the most dysregulated genes of the panel. In case of publication, bulk data and DEGS will be provided in an online file. We agree with the reviewer that the list of the 40 up and down-regulated genes would be very informative and would increase the value of the paper. Thus, we plan to add the name of the 40 up and down-regulated genes on Figure 1B.

      Figure 5G-I, show cytoplasmic HIF1a as well as nuclear.

      Alternatively, why not use IHC for subcellular localization?

      We think that the comments of the reviewer 1 concern Fig.4G-I and not 5G-I. In this figure, we showed that IH increases nuclear HIF-1____a____ expression compared to N condition and that this IH-effect is abolished in mice treated with Metformin, suggesting that, upon IH, Metformin impacts HIF-1__a __nuclear content and subsequently, its activity. The nuclear localization of HIF-1a is the most relevant mean to indicate its activation. We agree with the reviewer that IHC also allows for the indication of the nuclear localization of HIF-1a. Indeed, we previously performed IHC on nuclear HIF-1a localization and demonstrated that IH increased HIF-1a nuclear localization by IHC that was corroborated by Western-blot (Moulin S, TACD, 2020). Western-blot and IHC are both semi-quantitative techniques with different process of analyses. In this study, we choose Western-blot because we have the material to perform this technique and because IHC is associated with an analysis process (size of a slice, areas to analyze, colorimetry…) that is more complex than the analysis process of Western-blot (densitometry solely).

      While the nuclear localization of HIF-1a is the most relevant mean to indicate its activation; it could be interesting to see that HIF-1a cytosolic content was neither modify by IH nor by Metformin. This would also corroborate the results of the RNA-seq that did not demonstrate any difference in DEGs of HIF-1a or of other members of the HIF family. This would also confirm that Metformin plays a major role on HIF-1 activaty regulation (and not transcription) in the context of IH.

      Thus, we plan to perform a Western-blot of HIF-1a on cytosolic extracts of hearts from mice exposed to N or IH and treated or not with Metformin. These extracts are already available and Western-blot would be performed and replicated in 3 weeks. We could also provide a Western-blot in order to show the purity of our extraction protocol (nucleus vs cytosol).

      Figure 5F, it would be important to show the levels of expression of HIF1a in these experiments. Are there positive and negative controls that the authors could use for HIF21a activity in this experiment?

      In our manuscript, we aimed at demonstrating that Metformin decreases HIF-1 activity in a context of strong HIF-1____a____expression and/or stabilizion those mimics what happens after chronic IH in mice (Belaidi E, Int J Cardiol 2016, Moulin S, Ther Adv Chronic Dis 2020) and in apneic patients (Moulin S, Can J Cardiol 2020). Thus, we used a transfection allowing to overexpress HIF-1a that is one of the best means to increase HIF-1 activity. In the Figure 1 below, HA-HIF-1α-WT Addgene AmpR and 5 HRE GFP AmpR plasmids co-transfection induced a decrease in H9c2 viability and an increase in GFP-positive cells that were not observed in H9c2 transfected with pcDNA 3.1 HA-C AmpR (negative control). __This validates our in vitro model as a good positive control to mimic IH consequences. __ However, we agree with the reviewer that we could add a supplemental figure or a panel demonstrating that our transfection induced an increase in HIF-1a expression. Thus, we will perform a Western-blot targeting HIF-1a on H9c2 transfected with the control plasmid (pcDNA 3.1 HA-C AmpR) or the plasmid allowing the overexpression of HIF-1a (HA-HIF-1α-WT AmpR). This work would be performed in 2 months.

      Moreover, we already improved the lisibility of the Figure 5F to clarify the experimental conditions (table inserted under the graphic); we also completed the Materials and Methods section to specify the plasmid used (modifications are in red in the manuscript).

      Figure to see on the downloaded file.



      Figure 1 : GFP fluorescence in H9c2 cells transfected with pcDNA 3.1 HA-C AmpR (control condition) or HA-HIF-1α-WT AmpR (positive control, overexpression of HIF-1a) and 5 HRE-GFP AmpR plamids and treated with CoCl2 (1mM, 2h); magnification x100.

      • *

      This paragraph concerns only the point 4 of the fifth question.

      In these experiments, as well as subsequent studies, it would be very informative to use a specific AMPK activator e.g. MK-8772, to compare with metformin. It is well known that metformin has a number of other targets in addition to AMPK.

      We agree with the reviewer that metformin has pleiotropic effect. Very interestingly, we demonstrated that the reduced-infarct size is not related to the metabolic systemic effect of metformin since it failed to improve the IH-induced insulin resistance while it improves the answer to insulin in normoxic mice (supplemental Figure S3B). This demonstrates that in our model, the cardioprotective effects of Metformin are independent of a potential systemic effect. Then, we demonstrated that metformin protects the heart against ischemia-reperfusion through AMPK____a____2 activation by using AMPK____a____2KO exposed to IH in which Metformin failed to decrease infarct size (Fig.4N). MK-8772 is not widely used in vivo models. Moreover a recent study indicates that chronic treatment with MK-8772 (14 days 1 month in mice and rats, respectively) induces cardiac hypertrophy characterized by an increase in heart weight (Myers R, Science 2017). In vivo experiments with MK-8772 would be not clinically relevant as the use of metformin that is already used in clinic. However, in order to improve the mechanistic investigation concerning the role of AMPKa2 activation on inhibiting HIF-1 activity, we propose to perform the in vitroexperiments performed in Figure 5 with a specific allosteric small-molecule activator of AMPKa2 such as 991.

      We plan to:

      -Expose H9c2 to CoCl2 and treat them with 991 in order to measure HIF-1a phosphorylation.

      -Transfect H9c2 with our plasmids HA-HIF-1α-WT AmpR and 5 HRE GFP AmpR and treat them or not with 991 in order to measure HIF-1 activity (GFP fluorescence). These experiments would be performed in 2 months.

      __Please find below the revision that we plan to address to answer to the second question of the reviewer 2. __

      • *

      2) The WB images were cut and pasted. Please add the original images

      We acknowledge the reviewer's comment and will address it by submitting a supplementary file containing the uncropped immunoblot images. Since this file already exists, our plan is to standardize it by providing, for each slide (immunoblot), all relevant information pertaining to our experiments, including groups, molecular weight markers, cutting, membrane stripping, and other pertinent details.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      • *

      __Please find below the answers or the revisions that have already been incorporated in response to the comments of the reviewer 1. Please note that we provided new results and new figures at the discretion of the reviewer, but we are ready to insert them as figures or supplemental figures in a new revised manuscript if the reviewers and the editor think that it would improve our message. __

      Was mitochondrial content in the hearts after IH experiment measured e.g. mtDNA measurements? IH results in mitochondrial dysfunction/reduced mitochondrial content. It would have been good to show mitochondrial dysfunction by doing basic functional experiments (e.g. TMRM/MitoROS imaging etc.) by isolating cardiomyocytes from the N and IH experiments.

      We thank the reviewer for these questions about the mitochondrial function and content. The impact of IH on mitochondrial function has already been demonstrated in heart (Moulin S, Antioxydants 2022, Wei Q, Am J Physiol 2012). __Indeed, we previously showed that mitochondria isolated from hearts of mice exposed to IH had a decrease in maximal respiration in complex I and II that was not observed in HIF-1_a_+/- ____mice (Moulin S, Antioxidants 2022), indicating that HIF-1 is responsible for IH-induced mitochondrial dysfunction. __

      Figure 4 shows that Metformin abolished IH-induced mitochondrial remodeling similar to what we observed in HIF-1a+/-(Figure 2). This means that treating with Metformin or partially deleting the gene encoding for HIF-a induce the same impact on IH. Then, we demonstrated that Metformin can control HIF-1 activation and we concluded that metformin could be cardioprotective through inhibiting HIF-1 activation and subsequent mitochondrial stress and remodeling. In this study, we focused on the effects of Metformin on HIF-1 and we did not aim at directly test the effect of metformin on mitochondrial function. Actually, metformin exhibits biphasic effects on bioenergetics of cardiac tissue depending on the modality of administration (i. e. single injection, time of administration during an ischemia-reperfusion procedure); the dose administered and the tissue studied (i. e. hiPSC-CMs, isolated mitochondria…) (Emelyanova N, Transl Res 2021). But, we collected some data that we would like to submit at the discretion of the reviewer. Using oximetry, we measured maximal respiration in complex 1 and 2 on isolated mitochondria from hearts of mice exposed to N, IH and treated or not with metformin during the exposure__. While we observed that IH decreases maximal respiration in complex 1 and 2, we did not find any effect of metformin on mitochondrial respiration alteration induced by IH (Figure 2A, B). Using spectrofluorometry, we measured the mitochondrial membrane potential using TMRM; __we did not find any modification of membrane potential in IH or Metformin-treated mice (Figure 2C). Because we previously did not observe any impact of IH on mtDNA/gDNA ratio (Figure 2D), we did not test metformin on this parameter.

      To conclude, we think that these results are not directly in the scope of our work but if the reviewer thinks that they deserve to be discussed, we could add them in a supplementary figure.

      Please, see the figure on the dowloaded file

      Figure 2 : Mice were exposed to 21 days of Normoxia (N) or Intermittent Hypoxia (IH) (1-min cycle of FiO2 5%-21%) and treated with vehicle (Vh, CmCNa 0.01%, 0,1ml.10g-1) or Metformin (Met, 300mg.kg-1.d-1). (A, B) Mitochondrial function __was measured by oximetry with sequential addition of substrate (state 2), ADP (200mM, state 3, maximal respiration) and oligomycin (12.5 mM, state 4) ; quantification of O2 consumption for NADH-linked mitochondrial respiration (complex I-glutamate-malate, GM, 20mM) (A), and for FADH2-linked mitochondrial respiration (complex II-succinate, S, 5mM, in presence of complex I inhibition by rotenone, 6.25mM) (B) (n=8). (C) Mitochondrial membrane potential measured by spectrofluorometry after Tetramethylrhodamine Methyl Ester (TMRM, 0.2mM) in presence of GM (basal condition), maximal respiration (ADP) and uncoupling condition Carbonyl cyanide 4-(trifluoromethoxy)phenylhydrazone, FCCP, 3mM); fluorescence intensity is expressed relative to fuorescence at baseline (before GM) (n=8). (D__) Mitochondrial content assessed by the expression of mitochondrial DNA (mtDNA, COX1) relative to genomic DNA (gDNA, ApoB) measured after PCR (n=6); *p

      Fig 2A-D - CoCl2 is not a good model to mimic hypoxia due its effect on disrupting iron homeostasis in cells, which can mean that some of the effects are due to changes in iron levels and not HIF stabilisation.

      The capacity of CoCl2 to chelate iron is the main property of CoCl2 that we used in order to stabilize HIF-1____a____. Actually, prolyl-4-hydroxylases need Fe2+ to hydroxylate HIF-1____a____ and induce its degradation. __Then, intermittent hypoxia (IH) is characterized by very rapid changes in PO2. This stimulus was designed to reproduce sleep apnea syndrome and its associated disorders (i. e. insulin-resistance, hypertension, increase in myocardial infarct size). This model was firstly developed and validated in rodents (Dematteis M, ILARJ 2008, Belaidi E, Eur Resp Rev 2022, Harki , Eur Resp J 2022). Compelling evidence indicate that the involvement of HIF-1 in IH-deleterious consequences is related to the repetitive phases of oxygenation and especially to IH-induced oxidative stress (Semenza Physiology 2009, Belaidi Pharmacol. & Ther 2016). In order to increase the level of mechanistic insights on HIF-1, we next attempted to optimize in vitro models. A device was developed by Minoves et al. (Minoves M, Am J Physiol, 2017) to expose endothelial and cancer cells to IH. __However, as illustrated below, this device does not mimic efficient rapid hypoxia-reoxygenation cycles able to induce cardiac cell death (Figure 3A). However, CoCl2 decreases H9C2 viability by 60% (Figure 3B) that is associated with a sustained stabilization of HIF-1____a (Figure 3C,D). Thus, we choose this in vitro model as it replicates cardiac cell death and HIF-1_a_overexpression or stabilization which we similarly observe in our in vivo model and in apneic patients (Moulin S, Can J Cardiol 2020).

      Please see the figure on the dowloaded file

      Figure 3 : (A-B) Cell viability measured by 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyl-2H- tetrazolium bromide (MTT) of H9c2 cells exposed to 6 hours (h) of repetitive cycles of Intermittent hypoxia (2 minutes (min) PO2 16% - 2 min PO2 2%) (n=3) (A) or treated with CoCl2 (1mM, 2h) (n=6) (B). (C-D), __quantification of __total ____HIF-1____𝛂 expression relative to tubulin (C) and representative image of Wetsern-blot (D) (n=2-3); *p****p*

      Fig. 2K, change in BNIP3 expression is modest, but change in Parkin is very dramatic. BNIP3 is a HIF-1α target but Parkin is not, so it is plausible that mitophagy could be occurring through a HIF-1α independent mechanism.

      Fig. 2K is a representative panel of 2-3 independent experiments. The quantification reported in Figures 2I and 2J demonstrated a significant decrease in BNIP3 and Parkin expressions in HIF-1a heterozygous mice exposed to Intermittent Hypoxia (IH) compared to HIF-1a+/+ mice exposed to IH. While we acknowledge that only BNIP3 is a direct target of HIF-1, the role of HIF-1 in IH-induced auto/mitophagy is demonstrated by our experiments performed in HIF-1____a____heterozygous mice. This shows an important role for HIF-1 without excluding any impact of HIF-1a independent mechanisms.

      What are we meant to be looking at in Fig 2L?

      Figure 2L aims at illustrating mitochondrial remodeling under IH. Stars indicate that mitochondria have abnormal fate in IH conditions and arrows point autophagosomal membrane and formation. This figure was magnified to be clearer (please see new Figure 2L).

      Figure 3B-C, the reduction in pT172 on AMPK is modest. It would be good to include pACC as a downstream target for AMPK.

      As recommended by the reviewer, we inserted in the manuscript the quantification of 79Ser-P-ACC/ACC western-blot as well as a representative image of the Western-blot (See new Figure 3). We also modified the legend of the figure. 79Ser-P-ACC is an important target of AMPK; however, in our experimental conditions, its phosphorylation is not associated to the decrease in AMPK phosphorylation. This could be explained by many points. First, Metformin was administered every day and hearts were harvested 24 hours later after the last administration. Most studies demonstrating a modification of AMPK and ACC phosphorylation are experiments performed in vitro or directly (less than 1 hour) after a single dose of Metformin administration. In the context of myocardial ischemia-reperfusion, Yin et al. showed an increase in P-AMPK/AMPK directly after Metformin treatment without showing any data on P-ACC/ACC (Yin M; Am J Physiol 2011); similar data were published in models of chronic cardiac diseases (Soraya H, Eur J Pharmacol, Gundewar S, Circ Res 2009). Second, in line with the previous explanation, the lack of effect of metformin on P-AMPK and/or P-ACC in rodent models could be explained by its rapid distribution (Sheleme T Clin Pharmacokinetics of Metformin 2021) and its short half-life that is around 3.7 hours in mice (Junien N, Arch Int Pharmacodyn Ther 1979).

      To conclude, since we performed all our analysis 24h after the last treatment and exposure to hypoxia, we argue that the slight but significative decrease in AMPK phosphorylation that we observed in our study highlight a robust impact of chronic IH. However, this would be elegant to confirm this result by measuring AMPK through its phosphorylation capacity (Cool B, Cell Metab 2006, Ducommun S, Am J Physiol 2014). We already sent hearts from mice exposed to Normoxia or Intermittent Hypoxia to Luc Bertrand’s lab (IREC, Belgium) where they used to perform this assay.

      Fig. E-G, show data for mice treated with vehicle.

      In Figure 4 I-J­­, we demonstrated that Metformin significantly decreases infract size in IH condition only and this validates our main hypothesis regarding the specific beneficial effect of this drug in the context of chronic IH. __In order to show that the cardioprotective effect of Metformin is relative to AMPKa2 activation, we first showed that 79Ser-PACC/ACC, one of the main downstream targets of AMPKa2 was increased (Fig. E-G). We did not find it necessary to does not exhibit cardioprotective effects. However, as shown in Figure 3 below, __Metformin also increases 79Ser-PACC/ACC in Normoxic mice validating the treatment. Thus, in normoxic conditions, AMPK____a____2 activation does not exert any cardioprotective effect. We acknowledge that this reinforces our result about the specificity of AMPK____a____2 activation by Metformin under chronic IH condition. We could add this Figure in supplemental results.

      Please, see the figure on the dowloaded file

      Figure 4 : AMPK activation in Normoxic mice treated with vehicle (Vh, CmCNa 0.01%, 0,1ml.10g-1) or __ __metformin (Met, 300mg.kg-1.d-1 : __ __172Thr-P-AMPK/AMPK (A) and 79Ser-P-ACC/ACC (B) ratio and representative image of Western-blot (C) (n=3-6); *p

      Fig. 3K - what cre is used for the a2 KO mice?

      As written in the Materials and methods section, AMPKa2KO mice are not inducible Knock-out mice. Constitutive AMPKα2 knockout mice were kindly generated by Benoit Viollet (Viollet B, JCI, 2003).

      Include normoxia data for the a2 KO mice studies.

      The question of the reviewer concerning the cardioprotective effects of metformin is interesting but is not aligned with the objectives of the study. Indeed, we did not treat normoxic mice with Met for several reasons. First, the objective of the study was to find a cardioprotective strategy against IH-induced an increase in infarct size. Second, Fig. 3I shows that Met significantly reduced infarct size upon IH only; this suggests that AMPK____a____2 activation is specifically involved in IH-induced increase in infarct size but not in reducing infarct size in normoxic mice. Moreover, the beneficial impact of metformin in standard models of myocardial ischemia-reperfusion is controversial and has been extensively discussed (Foretz et al. Cell Metab. 2014). Overall, using AMPKa2 mice was legitimated in the context of IH only. We validated the involvement of AMPKa2 in the cardioprotective effect of metformin especially in IH conditions.

      Figure 5G, what is the rationale for switching to CoCl2 in the mice to prove metformin reduced HIF-1α expression? Why not use reduced O2 tension in mice.

      We respectfully disagree with the reviewer since mice were exposed to N and IH and treated or not with Metformin to demonstrate that this drug abolished IH-induced increase in HIF-1a nuclear expression (Figure 4 H, I). The same model was used in Figure 3 to demonstrate the impact of Metformin on infarct size. Fig. 5G was conducted to demonstrate the potential link between AMPKa2 and HIF-1a phosphorylation in basal conditions of AMPKa2 content or in absence of AMPKa2 (AMPKa2-/- mice). The single presence of AMPK____a____2 demonstrates an increase in HIF-1____a____ phosphorylation if its stabilization is increased by CoCl2; this was not observed in AMPK____a____2-/- mice highlighting that AMPK____a____2 plays an important role in HIF-1____a phosphorylation.


      Please find below the answer to the first question asked by the reviewer 2.

      1) Why did authors choose the IH protocol illustrated in Fig. S1A

      The choice of the hypoxic stimulus was based on literature and mainly on our recognized expertise in preclinical studies aiming at better understanding obstructive sleep apnea syndrome (OSA); a chronic pathology associated with several comorbidities such as diabetes, hypertension… We are conscious that the hypoxic stimulus used in this study is very severe, with a nadir arterial oxygen saturation (SaO2) around 60%. However, this experimental design is required to induce detrimental cardiovascular effects __in the absence of any confounding factors (i.e., obesity) or genetic susceptibility for complications (i. e. genetic susceptibility to hypertension) (Dematteis M, ILARJ 2008). Especially in the context of myocardial infarction, exposing rodents to 14 to 21 days of IH at 5% and subjected them to a myocardial ischemia-reperfusion protocol allows us to reproduce the increase in infarct size in rats (Belaidi E, J. Am. Coll. Cardiol., 2009; Bourdier G, Am J Physiol, 2016) and in mice (Belaidi E, Int J Cardiol 2016; Moulin S, Can J Cardiol 2020) similar to what has been observed in apneic patients (Buchner S, EHJ 2014). __Moreover, we recently conducted a meta-analysis based on 23 preclinical studies aiming at investigating the impact of the IH pattern (duration, FiO2, repetition of cycles…) on infarct size and cardiomyocyte death (Belaidi E, Eur. Resp. J 2022). We showed that IH significantly increases infarct size when IH is applied several days (especially 14 to 21 days) and when FiO2 is around 5%; whereas IH decreases infarct size when it is applied a single day at a FiO2 at 10%. This meta-analysis provided the confirmation that we need to apply a chronic and severe stimulus to reproduce an increase in infarct size that is observed in apneic patients which are exposed every day, during several days to a decrease in SaO2. If the reviewers and the editor consider that this point should be discussed in the discussion section, we will be happy to include it.

      • *

      Please find below the answers or the revisions that have already been incorporated in response to the comments of the reviewer 3. They appear in red in the new manuscript except the modifications performed in the “references” section which appear in black.

      1 The authors used H9c2 rat cardiac cells in vitro experiments although they used mouse model in vivo experiments. Using mouse P19.CL6 cardiac cells instead of rat H9c2 cells may much clearer. Why the authors did not use P19.CL6 cells should be explained.

      We thank the reviewer for his/her suggestion. P19CL6 cell line has been isolated from pluripotent P19 embryonal carcinoma (EC) cells after long term culture under conditions for mesodermal differentiation (Habara-Ohkubo A, Cell Struc Funct 1996). Therefore, these cells are mostly used to ____study the differentiation of cardiac muscle. Indeed, they were recognized to avoid large variations in the differentiation rates which were extensively reported (Mueller I, J Biomed Biotechnol 2010). To our knowledges no ventricular non-beating mice cell line. In this study, we used H9c2 which are extensively used and recognized as a gold standard cellular model to study the biology of cardiomyocytes including mechanisms involved in cardiac ischemia-reperfusion injury (Paillard M, Circulation, 2013; Zhang G Circulation 2021__), cardiac hypertrophy__ (Zhang N, Cell Death Diff 2020; Hu H, Cardiovasc Res 2020), intra-organites calcium exchanges (Moulin S, Antioxydants, 2022, Paillard M, Circulation, 2013) as drug testing (Beshay NM, J Pharm and Tox Methods, 2007). Recently, H9c2 and P19.CL6 were exposed to intermittent hypoxia (70 cycles of FiO2 1% (5 min) - FiO2 21% (10%)) in order to “mimic OSA” and investigate the transcription level of a pool of genes. The authors show some similiraties and differences of mRNA expression between the two cell lines that, indeed, could be attributed to variations in the cell origin (Takasawa S, IJMS 2022). However, in this study, there are no experiment allowing to assess the state of cardiac cells (apoptosis, life, metabolism, remodeling) questioning the pathophysiologic transposability of the model. Moreover, the number of experiments conducted on H9C2 (pubmed references : 7000 vs 100 for P19.CL6) to understand the mechanism involved in acute and chronic cardiac pathologies makes our choice confident and relevant.

      2 The authors described, "The protocol was approved by the French minister (APAFIS#23725-2020012111137561.v2)." in Animals (page 16) without showing approval date, the authors should clearly show the approval date together with their approval numbers.

      We added the approvement date in the materials and methods section. It was approved on February 20, 2020.

      3 In Figure 2 L, scale bar(s) should be added because figures are magnified and/or reduced by printer.

      We agree with the reviewer that scale bars were not visible; we highlighted them.

      4 In Figure 3J and 3N, scale bar(s) should be added.

      Scale bars have been now added on figures 3J and 3N. All pictures were acquired at the maximal zoom of a camera placed at an equal distance from the slices. Then, analyses were performed, slice per slice, with Image J with the same zoom (x5 to get an image at 100%). In this context, scale was added based on a photo of slice taken close to a ruler.

      5 In introduction, "HIF-1" should be changed to "hypoxia-inducible factor 1 (HIF-1)".

      Thank you, we replaced HIF-1 by Hypoxia Inducible Factor-1 (HIF-1) in the introduction section.

      6 In Results, "Angpt1, Txnip, Nmrk2, Nuak1 or Pfkfb1" should be changed to "Angiopoietin 1 (Angpt1), Thioredoxin-interacting protein (Txnip), Nicotinamide riboside kinase 2 (Nmrk2), NUAK family SNF1-like kinase 1 (Nuak1) or 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 1 (Pfkb1)".

      We did the modification and let the abbreviation in italic since it concerns genes name.

      7 In Chronic intermittent hypoxia, "FiO2" should be changed to "FiO2".

      Thank you, we changed it in the materials and methods section.

      8 In Western-blot, "Bio-Rad, California, USA" should be changed to "Bio-Rad, Hercules, CA".

      Thank you, we did it.

      9 In Western-blot, what "tubulin" (α-tubulin or β-tubulin) should be clarified.

      We agree with the reviewer that this point should be specified; α-tubulin was stained, we added the “α”.

      As mentioned below, we have done all the modifications required by the reviewer in the references section.

      10 In Ref. 8, "Antioxidants (Basel) 11 (2022)" should be changed to "Antioxidants (Basel) 11, 2326 (2022)".

      Done

      11 In Ref.10, "Pharmacol Ther (2016)" should be changed to "Pharmacol Ther 168, 1-11 (2016)".

      Done

      12 In Ref.13, "Antioxidants (Basel) 11 (2022)" should be changed to "Antioxidants (Basel) 11, 1462 (2022).

      Done

      13 In Ref. 21, "Diabetes (2017)" should be changed to "Diabetes 66, 2942-2951 (2017)".

      Done

      14 In Ref. 28, "Adv Biol (Weinh), e2300292 (2023)" should be changed to "Adv Biol (Weinh), 8, 2300292 (2023)".

      Done

      15 In Ref. 37, "J Am Heart Assoc 6 (2017)" should be changed to "J Am Heart Assoc 6, e006680 (2017)".

      Done

      16 In Ref. 41, "Eur Respir Rev 32 (2023)" should be changed to "Eur Respir Rev 32, 230083 (2023)".

      Done

      17 In Ref. 46, "Int J Mol Sci 22 (2020)" should be changed to "Int J Mol Sci 22, 268 (2021)".

      Done

      18 In Ref. 47, "Int J Mol Sci 21 (2020)" should be changed to "Int J Mol Sci 21, 2428 (2020)". Done

      4. Description of analyses that authors prefer not to carry out

      __Please find below the answers to the comment of the reviewer 3 that we cannot provide and that is not in the scope of the study. __

      • *

      Figure 5, multiple phosphorylation sites have been identified on HIF1a. What is the nature of the Thr/SerP-HIF1a antibody? It would be far more preferable (essential?) to identify the site(s) within HIF1a that are phosphorylated by AMPK.

      The antibody was provided by Cell Signalling, ref. 9631. Phospho-(Ser/Thr) Phe Antibody detects phospho-serine or threonine in the context of tyrosine, tryptophan or phenylalanine.

      The identification of the Phosphorylation sites will require a long-time consuming phosphoproteomic analysis and subsequent functional validation in vivo and in vitro (directed mutagenesis, knock-in mice, …) which are out of the scope of our paper.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Although the findings that metformin inhibits intermittent hypoxia (IH)-induced mitophagy in myocardium and decreases hypoxia-inducible factor 1-α (HIF-1α) nuclear expression in mice subjected to IH. In vitro demonstrated that metformin induces HIF-1α phosphorylation, decreases its nuclear localization and subsequently HIF-1 transcriptional activity are very much interesting, numbers of points need clarifying and certain statements require further justification. These are given below.

      Point

      1. The authors used H9c2 rat cardiac cells in vitro experiments although they used mouse model in vivo experiments. Using mouse P19.CL6 cardiac cells instead of rat H9c2 cells may much clearer. Why the authors did not use P19.CL6 cells should be explained.
      2. The authors described, "The protocol was approved by the French minister (APAFIS#23725-2020012111137561.v2)." in Animals (page 16) without showing approval date, the authors should clearly show the approval date together with their approval numbers.
      3. In Figure 2 L, scale bar(s) should be added because figures are magnified and/or reduced by printer.
      4. In Figure 3J and 3N, scale bar(s) should be added.
      5. In introduction, "HIF-1" should be changed to "hypoxia-inducible factor 1 (HIF-1)".
      6. In Results, "Angpt1, Txnip, Nmrk2, Nuak1 or Pfkfb1" should be changed to "Angiopoietin 1 (Angpt1), Thioredoxin-interacting protein (Txnip), Nicotinamide riboside kinase 2 (Nmrk2), NUAK family SNF1-like kinase 1 (Nuak1) or 6-phosphofructo-2-kinase/fructose-2,6-biphosphatase 1 (Pfkb1)".
      7. In Chronic intermittent hypoxia, "FiO2" should be changed to "FiO<sub>2</sub>".
      8. In Western-blot, "Bio-Rad, California, USA" should be changed to "Bio-Rad, Hercules, CA".
      9. In Western-blot, what "tubulin" (α-tubulin or β-tubulin) should be clarified.
      10. In Ref. 8, "Antioxidants (Basel) 11 (2022)" should be changed to "Antioxidants (Basel) 11, 2326 (2022)".
      11. In Ref.10, "Pharmacol Ther (2016)" should be changed to "Pharmacol Ther 168, 1-11 (2016)".
      12. In Ref.13, "Antioxidants (Basel) 11 (2022)" should be changed to "Antioxidants (Basel) 11, 1462 (2022).
      13. In Ref. 21, "Diabetes (2017)" should be changed to "Diabetes 66, 2942-2951 (2017)".
      14. In Ref. 28, "Adv Biol (Weinh), e2300292 (2023)" should be changed to "Adv Biol (Weinh), 8, 2300292 (2023)".
      15. In Ref. 37, "J Am Heart Assoc 6 (2017)" should be changed to "J Am Heart Assoc 6, e006680 (2017)".
      16. In Ref. 41, "Eur Respir Rev 32 (2023)" should be changed to "Eur Respir Rev 32, 230083 (2023)".
      17. In Ref. 46, "Int J Mol Sci 22 (2020)" should be changed to "Int J Mol Sci 22, 268 (2021)".
      18. In Ref. 47, "Int J Mol Sci 21 (2020)" should be changed to "Int J Mol Sci 21, 2428 (2020)".

      Significance

      the study is convincing

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the submitted paper by Moulin et al., the authors investigated the effect of metformin treatment (an oral anti-diabetic drug and AMPK activator) on cardiac response to ischemia-reperfusion in mice exposed to intermittent hypoxia (IH), a major component of obstructive sleep apnea (OSA). The data demonstrates that IH alters the expression profile of a subset of genes involved in myocardial mitochondrial dysfunction, such as Angpt1, Txnip, Nmrk2, Nuak1 or Pfkfb1, hallmark genes of hypoxia and glycolysis. In H9c2 cells treated with CoCl2 with or without chloroquine, the authors claim that IH induces mitophagy through HIF-1 activation, associated with an increase in autophagic flux. This effect is abolished in HIF-1𝛂+/- mice. Metformin treatment in mice reverses IH-increased in infarct size through AMPK activation, decreases IH-induced mitophagy and HIF-1𝛂 nuclear localization.

      The paper is interesting but needs some clarifications:

      1. Why did authors choose the IH protocol illustrated in Fig. S1A)
      2. The WB images were cut and pasted. Please add the original images

      Significance

      This work opens new avenues for exploring the potential effects of metformin as a modulatory of HIF-1𝛂 activity in obstructive sleep apnea syndrome.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The manuscript by Moulin and colleagues begins by exploring changes in gene expression in mice subjected to intermittent hypoxia (IH). Based on these results, the authors postulate a link with HIF1a, and so use a cell-based model to examine this link. Next, the authors turn to AMPK showing that IH decreases Thr172 phosphorylation on AMPK. In view of this, the authors use metformin as a way to activate AMPK during IH and monitor necrosis in hearts following ischemia-reperfusion. Metformin treatment reduces necrosis in mice subjected to IH. Mechanistically, the authors link decreased HIF1a activity following AMPK activation in response to metformin and suggest that this is due to phosphorylation of HIF1a by AMPK. Overall, this reviewer found some of these connections to be quite weak and in some cases the evidence supporting the claims was equivocal. In some instances, the lack of appropriate controls or the use on non-specific reagents, undermined the findings. Adding to these issues, there was a lack of experimental detail both in the Figure legends and in the M&M section. Below I have listed some of the major concerns that if addressed satisfactorily would greatly strengthen the manuscript.

      Major Comments

      Major Points

      1. Figure 1 - it would be helpful to list all of the DEGs (what genes are changed?). Including the expression of HIF-1α and PHD isoforms would be informative. If there is a robust HIF-1α signal, changes in the expression of HIF and PHD isoforms would be anticipated. Fig 1F - with regards to glycolysis and hypoxia pathway analysis, most of the DEGs are not canonical HIF-1α/hypoxia targets.
      2. Was mitochondrial content in the hearts after IH experiment measured e.g. mtDNA measurements? IH results in mitochondrial dysfunction/reduced mitochondrial content. It would have been good to show mitochondrial dysfunction by doing basic functional experiments (e.g. TMRM/MitoROS imaging etc.) by isolating cardiomyocytes from the N and IH experiments.
      3. Fig 2A-D - CoCl2 is not a good model to mimic hypoxia due its effect on disrupting iron homeostasis in cells, which can mean that some of the effects are due to changes in iron levels and not HIF stabilisation. Fig. 2K, change in BNIP3 expression is modest, but change in Parkin is very dramatic. BNIP3 is a HIF-1α target but Parkin is not, so it is plausible that mitophagy could be occurring through a HIF-1α independent mechanism. What are we meant to be looking at in Fig 2L?
      4. Figure 3B-C, the reduction in pT172 on AMPK is modest. It would be good to include pACC as a downstream target for AMPK. Fig. E-G, show data for mice treated with vehicle. Fig. 3K - what cre is used for the a2 KO mice? Include normoxia data for the a2 KO mice studies. In these experiments, as well as subsequent studies, it would be very informative to use a specific AMPK activator e.g. MK-8772, to compare with metformin. It is well known that metformin has a number of other targets in addition to AMPK.
      5. Figure 5G-I, show cytoplasmic HIF1a as well as nuclear. Alternatively, why not use IHC for subcellular localisation?
      6. Figure 5, multiple phosphorylation sites have been identified on HIF1a. What is the nature of the Thr/SerP-HIF1a antibody? It would be far more preferable (essential?) to identify the site(s) within HIF1a that are phosphorylated by AMPK.
      7. Figure 5F, it would be important to show the levels of expression of HIF1a in these experiments. Are there positive and negative controls that the authors could use for HIF21a activity in this experiment?
      8. Figure 5G, what is the rationale for switching to CoCl2 in the mice to prove metformin reduced HIF-1α expression? Why not use reduced O2 tension in mice.

      Minor Comments

      Nothing at this stage of the process. The authors need to focus on the major points to improve the quality of the manuscript.

      Significance

      Overall, the lack of robust mechanistic studies makes it difficult to fully interpret the impact of the study.

      This feels like a preliminary exploration of a potentially important biological system, but the manuscript seems incomplete and requires more attention to details.

      Upon revision, I think the study would be of interest to basic and clinical researchers working in the filed of cardiac metabolism and hypoxia.

      My expertise is in metabolism and cell signalling.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Here, Ahel et al characterize complex composiiton, chromatin binding, and respressive function of ADNP2. First they show that ADNP2 forms a complex with CHD4 and HP1b, very similar to, although biochemically distinct from, the ChAHP complex formed by ADNP. This complex is prevalently bound at repeats, and in particular at LTR retrotranspoons. Recruitment of ADNP2 is mostly mediated by HP1b binding to H3K9 and a PxVxL mutant that cannot bind to HP1 does not localize properly to chromatin. While ADNP has its own specific targets in SINE elements, it also binds some ADNP2 targets in an HP1-dependent manner. Depletion of ADNP or ADNP2 results in upregulation of some shared and some distinct transposons, indicating distinct roles but also partial redundancy.

      This is a solid study that adds to our knowledge of these repressive complexes and their transposable element targets. I feel that the data could be analyzed a bit more in depth, especially in the comparisons of ChIP-seq vs. RNA-seq.

      Major points

      • Fig. 5: it would be interesting to analyze differences between repeats that are only under ADNP2 control vs. those that are sensitive to additional loss of ADNP. Presumably they both have K9me3 so why some are also repressed by ADNP?
      • Fig. S5B, S7A: degrons are known to destabilize some proteins even before induction of degradation. These western blots should include a WT line to compare abundance of the endogenous protein with the tagged version.
      • FIg. 5: it would be good to show a bit more overlap between RNA-seq and ChIP-seq. How many of the bound transposons are derepressed? Can co-occupancy by ADNP and ADNP2 explain which transposons will display synergistic effets from removal of both proteins?

      Minor points

      • Fig. 2B: Upset plot seems like a strange choice for visualization. I would show as stacked bar plot compared to genome distribution. I.e. are peaks enriched on repeats or is it just that a larger genomic space is occupied by repeats compared to TSSs?
      • Fig. 3A: if ADNP2 only binds to a subset of K9me+ retrotransposons, why arent' there regions of K9me+ ADNP2- in this plot?
      • Check figure calls for 3B and 3C, some of them seem off.
      • Fig. S7B: the text says several dozen genes but I only see 21 up and 15 down upon ADNP with ADNP2 present.

      Significance

      This is a solid study that adds to our knowledge of these repressive complexes and their transposable element targets. I feel that the data could be analyzed a bit more in depth, especially in the comparisons of ChIP-seq vs. RNA-seq.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this work, Ahel, Pandey and Schwaiger et al. examine the function of ChAHP complexes in transposon silencing in mouse embryonic stem cells. This works builds on a proteomics approach previously developed in the Buhler lab, which identified ADNP1-ChAHP as a repressor complex of SINE retrotransposons. In their previous work they also identified the zinc-finger protein ADNP2 as an interactor of the ChAHP component CHD4. Here, the authors test the hypothesis of whether ADNP2 is part of an alternative ChAHP, which they termed ChAHP2. They first establish, through proteomics/biochemistry experiments, that the ChAHP2 complex can form independently of ChAHP. By ChIP-sequencing they then show that the ChAHP2 member ADNP2 predominantly occupies retrotransposons of the LTR families (e.g., ERV elements). They further show that recognition of target transposable elements (TEs) is conferred by the ChAHP2 complex member HP1 and its affinity for H3K9me3. Using RNA-seq they then addressed the repressive activities of ChAHP and ChAHP2 on TEs through degradation of ADNP, ADNP2 or both. The experimental approach is very well thought out and experiments are done with extensive controls.

      I have the following comments/suggestions:

      • Regarding the ChIP-seq experiments, it is not entirely clear from the manuscript text and also not from the Material and Methods whether ChIP-seq was done with a tagged-version of ADNP2, which is what I assume, or an endogenous ADNP2 antibody. Could you please state this clearly?
      • The authors map short read sequencing data to individual TE insertions and conclude that ADNP2 binds retrotransposons both internally and at their terminal sequences. Particularly for the internal parts, how sure can the authors be that their approach, mapping to repeat consensus sequences, is not confounded by e.g., genetic differences between their ESC lines and the references they map to, or simply by the limitations of short read data?
      • Page 7, the first sentence starting with "Contrary to the expected behavior of a TF, ...". Could the authors please elaborate on what they mean with "the expected behavior of a TF"?
      • In Figure S4B, could the authors please clarify if HP1, CHD4 and Tubulin are high/low exposure?
      • SETDB1 is the H3K9me3 methyltransferase responsible for LTR/ERV silencing in mESCs and its disruption leads to a pronounced upregulation of TEs (Karimi et al., 2011). Could the authors comment on the effect of their degron-mediated SETDB1 depletion on de-repression of LTR/ERV elements? At least for the elements shown to lose H3K9me3 upon SETDB1 depletion (Figure S5D - strong depletion replicate)? Could you please also provide information about morphological/pluripotency characteristics of the 2HA-FKBPSETDB1 ESC lines before and after the 48h treatment with 500nM dTAG13?
      • Maybe I missed this, but are H3K9me3 levels affected in the ADNP and/or ADNP2 degron lines where transcriptional dysregulation of TEs is observed (Figure 5)? If not H3K9me3, is TE de-repression accompanied by reduction in repressive histone marks in ADNP2 degraded lines?
      • When looking at TE de-repression, the upregulation seems very modest (Figure 5) and Log2FoldChanges >0.9 (FDR<0.05) were considered as statistically significant. Is this the result of using mESCs that "approximate a constitutive KO situation through inducible ADNP2 degradation" and 14 days of treatment with 250nM dTAG13? As mention above, could you please also provide information about morphological/pluripotency characteristics?

      Significance

      The findings are interesting and expand on our understanding of how complexes of chromatin bound proteins control epigenetic silencing of individual retrotransposon classes/families in mouse embryonic stem cells. More specifically, this study adds a new player, ChAHP2, a ChAHP alternative that associates with ERV and LINE1 retrotransposons via HP1-mediated binding of H3K9me3 and is required for transcriptional repression of LTR class retrotransposons. Although, in my opinion, it remains somewhat uncertain how depletion of ADNP2 results in TE upregulation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The Buhler group previously identified the ChAHP complex and showed it to be made up of the transcription factor ADNP, chromatin remodeler CHD4, and HP1 proteins. They found that genetic removal of ADNP leads to increased expression of SINE B2 elements in mouse ES cells (mESCs), and an increase in chromatin accessibility and CTCF binding.

      In the current manuscript they describe a related protein complex they name ChAHP2. ChAHP2 has a vertebrate paralogue of ADNP, a transcription factor they call ADNP2 and the same chromatin remodeler CHD4, and HP1 proteins. By investigating chromatin occupancy they find that ChAHP2 chromatin binding specificity is distinct from ChAHP, and predominantly associates with ERV and LINE1 retrotransposons via HP1beta-mediated binding of H3K9 trimethylated histones. The ChAHP and ChAHP2 complexes control a wide variety of molecularly disparate retrotransposons, including SINEs, LINEs, and ERVs.

      The findings are of significance as the ChAHP2 complex is novel - it has not previously been identified or characterized and its description is therefore worthy of publication. Overall this study was well-performed and nicely-presented. In general, the text is clear, the flow of experiments seemed logical, easy to follow and supported by the presented evidence without the findings being overstated.

      The data is mainly descriptive and it was at times hard to identify what were the clear conclusions from their study. Having identified the new ChAHP2 complex they show that its chromatin binding characteristics are distinct from ChAHP, which predominantly binds SINE elements. In contrast, ChAHP2 binds different classes of retrotransposons and recruitment to H3K9me3 heterochromatin is dependent on HP1-Beta. This was all relatively clear. It became a little more confusing trying to unravel the functional consequences of the loss of ChAHP2 - either alone, or in combination with ChAHP.

      Although they see changes - both up- and down-regulation of a large number of genes, they say that: 'None of the changing gene categories showed strong and significant patterns either in terms of GO term enrichment (FigureS7C), distance between the promoter and the nearest ADNP/ADNP2 peaks'- does this mean that none of the upregulated genes following ADNP1/2 deletion show ADNP/ADNP2 peaks over the specific gene in question - i.e. do they think that the majority of effects here are indirect?

      When it came to the 'repeat' families of genes - the situation was also not entirely clear.

      They say: All these differentially expressed repeats belonged to the LTR class of retrotransposons, including families identified as ADNP2-bound in ChIP sequencing - it would be helpful to know which familes are bound and maybe show snapshots of ADNP2 binding. Are these differentially expressed repeats directly regulated by ADNP2? - they seem like a fairly heterogenous group of repeat elements? - do they have any shared features? They say that expression of two LINE1 subclasses were significantly upregulated - do these L1s (L1Mdas) have common features - are they old or young - it would be helpful if these different issues could be clarified.

      In the discussion they are upfront about their inability to identify any specific sequence motifs required for ADNP2 occupancy. Since the repression is not obviously H3K9Me3 dependent, then what is the role of H3K9me3 in repressing here? They surmise that repression is likely to be regulated through chromatin remodeling by CHD4 - would ATAC-seq help clarify this suggestion? - particularly as they suggest that having CHD4 as an integral component of the repressor complex is a unique feature of ChAHP1/2?

      In this context, do they not consider MORC2 to be an essential component of HUSH-dependent silencing?

      Other points

      The language in the introduction somewhat confusing and could be improved: They jumble ZNFs/TRIM28/HUSH/HP1 as if there is no specificity here - there clearly is - needs a more balanced and informative description.

      What is firstly referring to? Firstly, histone deacetylation and the removal of activating histone methylations disfavor transcription13,14. Secondly, the underlying DNA is extensively methylated, further repressing the locus

      They state: Like ERVs, LINE1 retrotransposons are autonomous- would help if they could clarify exactly what is meant here.

      Significance

      The findings are of significance as the ChAHP2 complex is novel - it has not previously been identified or characterized and its description is therefore worthy of publication. Overall this study was well-performed and nicely-presented. In general, the text is clear, the flow of experiments seemed logical, easy to follow and supported by the presented evidence without the findings being overstated.

      The findings will be of broad interest to chromatin biologists - particularly those interested in the regulation of retroelements and repeat regions

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the reviewers for carefully reviewing our manuscript submitted to Review Commons. Their constructive comments have led us to identify key additional experiments to perform (testing endothelial cells, investigating the role of CXCR4/Sdf1 axis in the asymmetric division). They have shed light on specific points of the manuscript that had to be amended (see below).

      First, we are pleased that the reviewers acknowledged the quality of the experiments performed (Reviewer #1: “In general, is a well-done work” ; Reviewer #2: “the evidence is convincing, experiments are rigorously performed with adequate replicates and reproducibility”), as well as interest and large scope our study (Reviewer #1: * “it gives a few more details describing that when HSPCs divide asymmetrically it seems there is an association between centrosome and lysosome distribution”, Reviewer #2: “These results collectively contribute to a deeper understanding of how the hematopoietic niche and interactions with neighboring cells influence HSPC behavior and their commitment to distinct cell fates during division”, “The findings have significance in understanding how the microenvironment influences HSPC behavior. This is broadly significant for stem cell biology”. Reviewer #3 “The manuscript is likely to have broader interest to not only HSPC researchers, but stem cell biologists and even engineers due to the technologies used”).*

      Our detailed answers to the reviewers’ concerns are listed below.


      Reviewer #1 (Evidence, reproducibility and clarity):

      In this manuscript, Candelas and colleagues investigated asymmetric cell division of human hematopoietic stem and progenitor cells (HSPCs) upon heterotypic interactions with osteoblasts. They developed a new in vitro system to test this and found that upon interaction, HSPCs polarized in interphase with centrosome and the Golgi apparatus and lysosomes positioned close to the site of contact. In addition, during mitosis, HSPCs were found to orient their spindle perpendicular to the plane of contact. This division gave rise to siblings with unequal amounts of lysosomes and CD34. In general, is a well-done work and it gives a few more details describing that when HSPCs divide asymmetrically it seems there is an association between centrosome and lysosome distribution.

      Authors: We thank the reviewer for this positive assessment of our work.

      Reviewer #1 (Significance (Required)):

      However, all of these features (described above) have already been independently described even in human HSPCs thus this work does not represent a major advancement in this field (e.g. on the potential molecular mechanism by which heterotypic interactions with osteoblasts promote such behaviours). Overall, this stage is descriptive in nature.

      Authors: We respectfully disagree with the reviewer on this point. The main finding of our work does not simply concern the ability of HSPCs to undergo asymmetric division. This ability has indeed been previously shown in mouse and human models, and we refer to the princeps works in our introduction (page 2 “In vitro HSPCs can undergo asymmetric divisions 11 12 13 14 15 16 17 ”). Our main finding is about the instrumental role of heterotypic interactions with stromal cells of the niches. This had not been shown previously, mostly due to the limitations of classical co-culture systems and of in vivo tracking of HSPCs. We have engineered artificial niches in microwells specifically to overcome these limitations. We are confident that this finding is of interest for of hematopoietic niches biology and more globally for the field of stem cell biology. In that sense, the corresponding biorXiv preprint has just been quoted by the seminal review by Hans Clevers “Hallmarks of stemness in mammalian tissues” (doi: 10.1016/j.stem.2023.12.006).

      One major caveat of this work is that CD34+ HSPCs represent a very heterogeneous population. Although the underlying features described in this work could be similar between different cell populations, the frequency of these events (e.g. % magnupodium; polarization index, % of asymmetric inheritance, etc) is likely to be different between stem and progenitor cells. This may also explain the wide spread of their data points. Experiments should also be conducted with different cell populations, in particular with HSCs (either with the CD34+CD38-CD45RA-CD90+ or CD34+CD38-CD45RA-CD90+CD49f+ HSC enriched fraction or the highly CD34+CD38-CD45RA-EPCR+ HSCs).

      Authors: We agree that we are dealing with an heterogenous population of stem and progenitor cells; we cannot exclude that more pronounced effects could be obtained by analyzing separately stem cells, lymphoid and myeloid progenitors. The use of CD34+ heterogenous population has been a strategical choice. We are working with human primary cells, harvested from cord blood, a rare and precious material, who’s access has been in addition reduced during the COVID period and since. Working with the global CD34+ population was giving us the ability to work with a higher number of cells and avoid material limitations to perform experiments. Importantly, in our initial publication (Bessey et al doi:10.1083/jcb.202005085), we both used CD34+, or CD34+/CD38- versus CD38+/CD34+ subpopulations (see figure 4): the effects we described using global CD34+ population were significant, validating a posteriori this choice. Similarly, the effects observed in the present work with CD34+ population have always been validated by appropriate statistic tests. So, we are confident that the use of the CD34+ population, despite its heterogeneity, did not alleviate the validity and pertinence of our analyzes and conclusions.

      Results shown in Figure 2 were obtained with a very limited number of cells; also data obtained for Figure 4; this should be substantiated;

      Authors: We acknowledge that the analyses performed on fixed mitotic cells are based on relatively small samples. We have decided to fix and analyze the cells at 40h of culture which corresponds to the peak of mitosis, instead of synchronizing HSPCs in mitosis with available drugs, in order to avoid potential perturbations of these drugs. The consequence was the scarcity of the mitotic HSPCs in each experiment. This limitation has also been encountered by other laboratories working on HSPCs : our samples sizes are in fact within the range of the samples analyzed in other works (see Hinge et al., 2020, doi:10.1016/j.stem.2020.01.016: around 20 cells; Florian, et al., 2018: doi: 10.1371/journal.pbio.2003389. from 8 to 40 cells). However, in order to improve this limitation, few additional experiments and analyses have been performed to increase the number of mitotic cells (Figure 3: from 27 to 38 cells).

      In addition, data from Figure 2 adds little and should be combined with Figures 3 and 4 to make a single stronger message.

      Authors : We agree on this comment: Figures 2 and 3 have now been merged

      It is/was unclear why the authors investigated the distribution of CD34 and CD33 and the major and important question remained to be answered: whether the symmetric cell divisions were/are differentiating or self-renewal in nature hence, the asymmetric cell divisions promoted by the osteoblasts may represent asymmetric self-renewal ones; this needs to be investigated further and potential molecular mechanisms for such promotion should be highlighted.

      Authors: CD34 and CD33 were chosen as our “gold standards” according to the princeps work of Loeffler et al (Loeffler et al., 2019, and 2022). Our selection is based on the need for markers that encompass this diverse population. We focused on CD34 and CD33 as they are among the most prevalent CD markers on HSPCs, particularly when working with the CD34+ population, which is broad and necessitates widely expressed CD markers. Importantly, previous works have reported that CD34 anti-correlates with lysosome asymmetric inheritance after the first division, whereas CD33 does not (see Fig3E, Loeffler et al., 2022). So using these markers we can compare two markers: one that follows lysosome inheritance after the first division, and one which doesn’t. We found that osteoblasts inducing asymmetric lysosome inheritance in HSPC do increase asymmetric production of CD34 but not CD33, in concordance with the literature. This has been now explained in more detail in the text.

      In addition, higher lysosome inheritance has been previously demonstrated to be associated with the most stem cell (Loeffler et al., 2019), implying a more stem-like profile for the proximal cell during division in our work. However, recent studies using single-cell sequencing approaches challenge the notion of discrete differentiation within this stem cell model (reviewed by Zhang et al., 2018; Laurenti and Gottgens, 2018; Cheng et al., 2020; Liggett and Sankaran, 2020). Our study contributes to this discussion by providing insights into the relationship between CD markers, lysosome inheritance, and the stem cell profile during asymmetric division, and presenting this heterogeneity observed in single-cell sequencing works.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The paper titled "Heterotypic Interaction Promotes Asymmetric Division of Human Hematopoietic Stem and Progenitor Cells" presents research on how interactions between Hematopoietic Stem and Progenitor Cells (HSPCs) and osteoblasts impact cell division symmetry. The use of polyacrylamide microwells to simulate the hematopoietic niche and study osteoblast interactions is a novel and valuable approach. The paper effectively explains the experimental design, making it easy for others to replicate and verify the results. The paper provides detailed insights into how interactions with osteoblasts influence HSPCs, particularly in terms of cell polarization, spindle orientation, and organelle distribution during cell division. The evidence is convincing, experiments are rigorously performed with adequate replicates and reproducibility.

      Authors: We thank the reviewer for this positive evaluation of our work.

      • The study acknowledges that not all HSPCs respond to heterotypic interaction, suggesting that individual variability in HSPC behavior plays a role. Future research could explore the factors that determine which HSPCs undergo asymmetric divisions. This raises questions about the heterogeneity in HSPC responses. The researcher starts culturing CD34+ which is a heterogeneous group with osteoblast. This heterogeneity by default will give different cells progeny which mimics the in vivo BM status. But, to minimize the invitro experiment variables examining a more pure HSC population such as isolated CD34+CD38-CD90+ cells would strengthen the findings?

      Authors: We agree that we are dealing with an heterogenous population of stem and progenitor cells; we cannot exclude that more pronounced effects could be obtained by analyzing separately stem cells, lymphoid and myeloid progenitors. The use of CD34+ heterogenous population has been a strategical choice. We are working with human primary cells, harvested from cord blood, a rare and precious material, who’s access has been in addition reduced during the COVID period and since. Working with the global CD34+ population gave us the ability to work with a higher number of cells and avoid material limitations to perform experiments. Importantly, in our initial publication by Bessy et al (doi:10.1083/jcb.202005085), we used both CD34+, or in some cases CD34+/CD38- versus CD38+/CD34+ subpopulations (see figure 4), to investigate HSPC mode of polarization. The effects observed in the case of total CD34+ were significant, validating a posteriori this choice. Similarly, the effects observed in the present work with CD34+ population have always been validated by appropriate statistic tests. Our study indicates that the mechanism we described is conserved throughout the entire HSPC population.

      In conclusion, we are confident that the use of the CD34+ population, despite its heterogeneity, did not alleviate the validity and pertinence of our analyzes and conclusions.

      What was the Osteoblast/HSCs ratio used in the experimental setup? Does this ratio affect the results?

      Authors: The loading of HSPC has been optimized to get one HPSC per microwell, as indicated in the result section (page 3 line 3). This has now been clarified in the material and methods section. However, we observe some wells with more than one HSPC on the stromal cell, but we did not notice any clear impact on cell division asymmetry and did not investigate further this parameter.

      • The paper primarily focuses on lysosome inheritance and CD34 expression, leaving room to explore other lineage-specific markers and their correlation with asymmetric division. Do osteoblast/ HSCPs interactions have also an effect on mitochondrial inheritance?

      Authors: We fully agree that other markers could have been analyzed to comfort our observations.

      A far as mitochondria are concerned, recent publications suggest that lysosomes can be considered as a “inversed” readout of mitochondria inheritance: (1) mitophagy has been shown to take place in quiescent hematopoietic stem cells (Ito et al., 2016), suggesting that lysosomes do participate in the regulation mitochondria levels in HSPCs. (2) Loeffler et al. (2019) has shown an opposite pattern between mitochondria and lysosomes, lysosomes co-inheriting with mitophagosomes. Considering that this mutual or opposite inheritances were already described, we did not investigate them further. However we agree that it could be the focus for future works.

      • While the study hints at the clinical implications of its findings, it does not show the practical applications in detail. How the knowledge gained from these interactions could be translated into clinical practices or therapies is not explicitly discussed. Bridging this gap is necessary for understanding the practical utility of these results.

      Authors: we are confident that microwell technology could find, in the long term, more medical applications. However, we did not wish to speculate on this point in the present manuscript. The words “clinical” and “therapy” show no occurrence in our text. The potential implications of our findings in the field of leukemia are now mentioned in the discussion section with the corresponding references (page 6 line 10) “Such spatial control of the division is a conserved feature of asymmetric divisions in other stem cell niches 6. It may account for in vivo observations of organization into clusters of blood cell differentiation 44, and may play an important role in competition mechanisms at play within the hematopoietic niches in physiological 45 and pathological 46 contexts.”

      __Reviewer #2 (Significance (Required)): __

      • Investigating the process of HSPC asymmetric division is highly important to understand HSPC fate decisions and functions. The findings have significance in understanding how the microenvironment influences HSPC behavior. This is broadly significant for stem cell biology.
      • The paper's observation of uneven lysosome distribution and its potential impact on differentiation markers, including CD34 and CD33, contribute to our understanding of HSPC division and lineage determination.
      • Upon interaction with osteoblasts, HSPCs polarize during interphase, with centrosomes, the Golgi apparatus, and lysosomes positioned close to the site of contact. This reorganization of intracellular architecture plays a significant role in asymmetric division.
      • These results collectively contribute to a deeper understanding of how the hematopoietic niche and interactions with neighboring cells influence HSPC behavior and their commitment to distinct cell fates during division.

      Authors: We thank the reviewer for this quite positive evaluation of our work.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary: In this study, the authors systematically work to identify the contribution of hematopoietic niche stromal cells in the control of HSPC asymmetric divisions. Using live cell imaging techniques, the authors show that heterotypic interaction with osteoblasts promotes asymmetric division of human HSPCs. The authors find that HSPCs polarize in interphase with centrosome, the Golgi apparatus and lysosomes positioned close to the site of contact. Moreover, during mitosis, HSPCs orient their spindle perpendicular to the plane of osteoblast contact and the subsequent division gives rise to siblings with unequal amounts of lysosomes and differentiation markers such as CD34. Taken together, the authors suggest that this asymmetric inheritance generates heterogeneity in progeny, which is likely to contribute to plasticity during the early steps of hematopoiesis.

      Major Comments:

      1. The study that uses polyacrylamide microwells as minimalist niches to evaluate the role of heterotypic interactions in the asymmetric division of HSPCs. Using this model system, the data clearly indicate that osteoblasts promote HSPC polarization when compared to fibronectin or skin fibroblasts. However, osteoblasts weren't compared to other HSPC stromal niche components. For example, is the same polarization observed with endothelial cells? This would help to determine whether this observation is specific to only osteoblasts in the niche or if there's a broader role for heterotypic interactions, which is the claim made in the discussion.

      Authors: We have now performed additional experiments using endothelial cells (see Figure 1 and 2, Page 3 and 4) and found that endothelial cells do also promote asymmetric division.

      This is primarily a descriptive study that doesn't attempt to address or even postulate the type of receptors potentially responsible for the polarization-driving heterotypic interactions. Perhaps inhibitors to candidate receptors could be incorporated or at least candidate receptors could be mentioned and described in the discussion as future directions.

      Authors: We have now performed additional experiments to address this point: We have analyzed the effect of the CXCR4 antagonist AMD3100 on HSPC division upon interaction with osteoblast and found that blocking these receptors reduce the asymmetry of daughter cells to the level of isolated HSPC (see Figure 3 E -G; Figure S3 H-J; page 4).

      __Minor Comments: __

      1. There is no reference to Figures 2C and D in the text. Authors : This oversight has been corrected.

      Page 3: HSPC cultured on fibronectin exhibited spindle oriented parallel to the well bottom, (Figure 2A-C and S3C and D)”.

      Page 4: In contrast, both equal and unequal LysoBriteTM segregations could be observed (Figure 2D and S3E).

      The different circle colors in Figure 4C and D are a bit difficult to distinguish.

      Authors: The figure has been modified accordingly

      It's difficult to identify where the sites of cell contact are in Figure 2. Perhaps contact sites could be indicated in the images.

      Authors: The sites of contact have now been systematically indicated using dashed lines, in Figure 2 and 3.

      It's not clear how spindle orientation is being identified in Figure 3.

      Authors: We are aware of the fact that spindle orientation is not easy to visualize in the still images presented in Figure 3A and B. These images are extracted from the movies that were used to determine the cell division plane visually. This plane, following the long-axis rule, is perpendicular to the spindle orientation, which could therefore be determined. Movies 6 and 7 are shown to support these observations and illustrate how amenable these movies are to support spindle orientation analysis.

      Some grammatical errors need to be addressed throughout the manuscript.

      Authors: We thank the reviewer for drawing our attention on this point: the modified text has been carefully examined, and hopefully improved.

      Reviewer #3 (Significance):

      Significance: Overall, this is straightforward study that presents data illustrating the polarization driving capacity of osteoblast interactions with HPSC. The findings are important to the field, as little is known about asymmetric vs symmetric division in HSPCs. However, in its current form, the manuscript lacks clarity with respect to whether specifically osteoblasts promote HSPC polarization, or any stromal niche player has this capacity. Moreover, there is a lack of mechanistic data and/or discussion about how osteoblasts may be driving the observed polarization.

      Authors: We have now added data showing the similar role of endothelial cells and the implication of CXCR4 receptors.

      The manuscript is likely to have broader interest to not only HSPC researchers, but stem cell biologists and even engineers due to the technologies used.

      Authors: We thank the reviewer for his/her enthusiasm for our work



    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Title: Heterotypic interaction promotes asymmetric division of human hematopoietic stem and progenitor cells

      Authors: Adrian Candelas, Benoit Vianay, Matthieu Gelin, Lionel Faivre, Jerome Larghero, Laurent Blanchoin, Manuel Théry, Stephane Brunet

      Summary: In this study, the authors systematically work to identify the contribution of hematopoietic niche stromal cells in the control of HSPC asymmetric divisions. Using live cell imaging techniques, the authors show that heterotypic interaction with osteoblasts promotes asymmetric division of human HSPCs. The authors find that HSPCs polarize in interphase with centrosome, the Golgi apparatus and lysosomes positioned close to the site of contact. Moreover, during mitosis, HSPCs orient their spindle perpendicular to the plane of osteoblast contact and the subsequent division gives rise to siblings with unequal amounts of lysosomes and differentiation markers such as CD34. Taken together, the authors suggest that this asymmetric inheritance generates heterogeneity in progeny, which is likely to contribute to plasticity during the early steps of hematopoiesis.

      Major Comments:

      1. The study that uses polyacrylamide microwells as minimalist niches to evaluate the role of heterotypic interactions in the asymmetric division of HSPCs. Using this model system, the data clearly indicate that osteoblasts promote HSPC polarization when compared to fibronectin or skin fibroblasts. However, osteoblasts weren't compared to other HSPC stromal niche components. For example, is the same polarization observed with endothelial cells? This would help to determine whether this observation is specific to only osteoblasts in the niche or if there's a broader role for heterotypic interactions, which is the claim made in the discussion.
      2. This is primarily a descriptive study that doesn't attempt to address or even postulate the type of receptors potentially responsible for the polarization-driving heterotypic interactions. Perhaps inhibitors to candidate receptors could be incorporated or at least candidate receptors could be mentioned and described in the discussion as future directions.

      Minor Comments:

      1. There is no reference to Figures 2C and D in the text.
      2. The different circle colors in Figure 4C and D are a bit difficult to distinguish.
      3. It's difficult to identify where the sites of cell contact are in Figure 2. Perhaps contact sites could be indicated in the images.
      4. It's not clear how spindle orientation is being identified in Figure 3.
      5. Some grammatical errors need to be addressed throughout the manuscript.

      Significance

      Overall, this is straightforward study that presents data illustrating the polarization driving capacity of osteoblast interactions with HPSC. The findings are important to the field, as little is known about asymmetric vs symmetric division in HSPCs. However, in its current form, the manuscript lacks clarity with respect to whether specifically osteoblasts promote HSPC polarization, or any stromal niche player has this capacity. Moreover, there is a lack of mechanistic data and/or discussion about how osteoblasts may be driving the observed polarization.

      The manuscript is likely to have broader interest to not only HSPC researchers, but stem cell biologists and even engineers due to the technologies used.

      I have experience with questions of HSPC regulation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper titled "Heterotypic Interaction Promotes Asymmetric Division of Human Hematopoietic Stem and Progenitor Cells" presents research on how interactions between Hematopoietic Stem and Progenitor Cells (HSPCs) and osteoblasts impact cell division symmetry. The use of polyacrylamide microwells to simulate the hematopoietic niche and study osteoblast interactions is a novel and valuable approach. The paper effectively explains the experimental design, making it easy for others to replicate and verify the results. The paper provides detailed insights into how interactions with osteoblasts influence HSPCs, particularly in terms of cell polarization, spindle orientation, and organelle distribution during cell division. The evidence is convincing, experiments are rigorously performed with adequate replicates and reproducibility.

      • The study acknowledges that not all HSPCs respond to heterotypic interaction, suggesting that individual variability in HSPC behavior plays a role. Future research could explore the factors that determine which HSPCs undergo asymmetric divisions. This raises questions about the heterogeneity in HSPC responses. The researcher starts culturing CD34+ which is a heterogeneous group with osteoblast. This heterogeneity by default will give different cells progeny which mimics the in vivo BM status. But, to minimize the invitro experiment variables examining a more pure HSC population such as isolated CD34+CD38-CD90+ cells would strengthen the findings? What was the Osteoblast/HSCs ratio used in the experimental setup ? Does this ratio affect the results?
      • The paper primarily focuses on lysosome inheritance and CD34 expression, leaving room to explore other lineage-specific markers and their correlation with asymmetric division. Do osteoblast/ HSCPs interactions have also an effect on mitochondrial inheritance ?
      • While the study hints at the clinical implications of its findings, it does not show the practical applications in detail. How the knowledge gained from these interactions could be translated into clinical practices or therapies is not explicitly discusssed. Bridging this gap is necessary for understanding the practical utility of these results.

      Significance

      • Investigating the process of HSPC asymmetric division is highly important to understand HSPC fate decisions and functions. The findings have significance in understanding how the microenvironment influences HSPC behavior. This is broadly significant for stem cell biology.
      • The paper's observation of uneven lysosome distribution and its potential impact on differentiation markers, including CD34 and CD33, contribute to our understanding of HSPC division and lineage determination.
      • Upon interaction with osteoblasts, HSPCs polarize during interphase, with centrosomes, the Golgi apparatus, and lysosomes positioned close to the site of contact. This reorganization of intracellular architecture plays a significant role in asymmetric division.
      • These results collectively contribute to a deeper understanding of how the hematopoietic niche and interactions with neighboring cells influence HSPC behavior and their commitment to distinct cell fates during division.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Candeleas and colleagues investigated asymmetric cell division of human hematopoietic stem and progenitor cells (HSPCs) upon heterotypic interactions with osteoblasts. They developed a new in vitro system to test this and found that upon interaction, HSPCs polarized in interphase with centrosome and the Golgi apparatus and lysosomes positioned close to the site of contact. In addition during mitosis, HSPCs were found to orient their spindle perpendicular to the plane of contact. This division gave rise to siblings with unequal amounts of lysosomes and CD34. In general, is a well-done work and it gives a few more details describing that when HSPCs divide asymmetrically it seems there is an association between centrosome and lysosome distribution.

      Significance

      However, all of these features (described above) have already been independently described even in human HSPCs thus this work does not represent a major advancement in this field (e.g. on the potential molecular mechanism by which heterotypic interactions with osteoblasts promote such behaviours). Overall, this stage is descriptive in nature.

      One major caveat of this work is that CD34+ HSPCs represent a very heterogeneous population. Although the underlying features described in this work could be similar between different cell populations, the frequency of these events (e.g. % magnupodium; polarization index, % of asymmetric inheritance, etc) is likely to be different between stem and progenitor cells. This may also explain the wide spread of their data points. Experiments should also be conducted with different cell populations, in particular with HSCs (either with the CD34+CD38-CD45RA-CD90+ or CD34+CD38-CD45RA-CD90+CD49f+ HSC enriched fraction or the highly CD34+CD38-CD45RA-EPCR+ HSCs).

      Results shown in Figure 2 were obtained with a very limited number of cells; also data obtained for Figure 4; this should be substantiated; in addition, data from Figure 2 adds little and should be combined with Figures 3 and 4 to make a single stronger message.

      It is/was unclear why the authors investigated the distribution of CD34 and CD33 and the major and important question remained to be answered: whether the symmetric cell divisions were/are differentiating or self-renewal in nature hence, the asymmetric cell divisions promoted by the osteoblasts may represent asymmetric self-renewal ones; this needs to be investigated further and potential molecular mechanisms for such promotion should be highlighted.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      • Albeit the link between CSA and NE integrity the work is in my eyes too preliminary. Although the data presented are well done and carefully evaluated they mostly (except Fig 1A) rely on direct comparisons of one patient cell line (CS-A or CS-B) to the same cells expression the wildtype protein. It remains thus open whether the effects seen on LEM2 expression, LEM2-LaimA/C interaction, stress fibre formation, cGAS/STING signaling pathway activation in the CS-A cells are representative for a number of different CS patient derived cells. This is especially important given the small changes observed. Please note that there are alos clear differences between the CSA-wt and CSB-wt cells. Would the HPA CSA KO cells show in addition to NE irregularities (not even quantified) the same phenotypes and can they be reverted by re-expression of the wildtype CSA protein? We would like to thank the reviewer for this comment. Indeed we have previously observed nuclear circularity defects in CSA KO HAP1 cells but we haven’t investigated the other phenotypes in this cell line. One of the main reasons behind this is the technical difficulties associated with performing immunofluorescence with HAP1 cells who are very small and tend to grow in aggregates.

      Proposed experimental plan: To address this reviewer’s comment, we will attempt again to use HAP1 cells (WT and CSA KO) and look at nuclear circularity (with quantification), stress fiber formation and cGas foci. If we don’t succeed, we will use an alternative isogenic cell model consisting of fibroblasts in which we will knock out CSA using CRISPR/Cas9. We will then repeat the same experiments as proposed above for the HAP1 cells.

      • The link between CSA and SUN1 is not well worked out. What is the effect of SUN2 downregulation and that of nespirins? It remains unclear whether the observed effects are indeed LINC mediated. Proposed experimental plan: To address this point, we will downregulate SUN2 and nesprins using siRNAs in two different cell models (as described above) and assess nuclear shape as well as cGas/STING pathway activation.

      Minor comments:

      * Fig 1B: Why is HA-Tagged CSA not shown on the CSA western? This would be helpful to compare to the endogenous levels at least in CSB cells. A western showing an housekeeping marker would allow better comparison. Judging from the proteins markers HA-tagged CSA seems much larger as endogenous CSA (first versus second row). Again, less cropped western blots would help.*

      We are sorry for the confusion and we have realised that the molecular weights on the western blots were incorrectly labeled on this figure. This will be modified, and the full, uncropped WB will be provided as a supplementary file.

      Fig 3A: Is CSA FLAG or HA-tagged? Or both? If both are expressed the question raises of why the CSA-LEM 2 interaction is only seen in an overexpression situation.

      • *

      CSA is HA tagged. To address this reviewer’s comment we will try performing immunoprecipitation on endogenous proteins.

      Fig 5: Inconsistency between figure and figure legend: 20 vs 25 nM Jasplakinolide. I assume Latrunculin A should read Cytochalasin D?

      Thank you for pointing this out, and yes this is indeed a mistake on our labeling. We will rectify this on the figure legend.

      Fig5B: Not clear why in "CSA-wT cells" Cytochalasin D and Jasplakinolide have the same effect on nuclear envelope shape yet only Jasplakinolide increases the number of blebs.

      Cyt D inhibits actin polymerization while jasplakinolide increases polymerization. Likely actin polymerization increases blebs through extra force being put on the nucleus through actin cables/ actin based motility. Both drugs decrease nuclear roundness as they disrupt the normal actin network leading to worsening of nuclear shape through different mechanisms.

      Page 10: Method for IF: 4% (v/v) paraformaldehyde and 2% /v/v) should likely read (w/v). Page 19: replace "withl" by "with".

      We will rectify both these points.

      Reviewer #2

      The paper is well-written and for the most part, the data support the conclusions of the authors. Some minor caveats could be addressed to improve the quality of the manuscript.

      We would like to thank the reviewer for their positive feedback on our manuscript.

      • The phenotype of decreased LEMD2 incorporation into the NE in CS-A cells is minor. Only ~20% and thus, it is not clear whether this is causal of any of the NE abnormalities. It should be better explained how these data add to the story.

      To address this point, we will overexpress LEMD2 in CS-A cells and assess whether the NE phenotype can be significantly rescued. This will add value to this part of story.

      • Inducing actin polymerization and depolarization impact nuclear morphological abnormalities and nuclear blebbing. Do these treatments impact nuclear fragility and cGAS accumulation at NE break sites?* This is a good point indeed. To address this question, we will include cGas foci staining and quantification upon treatment with these chemicals.

      • Depletion of SUN1 in CS-A cells increased nuclear circularity, decreased blebbing, and phosphorylation of TBK1. The impact of SUN1 depletion in cGAS foci formation at NE break sites and phosphorylation of STING is not shown. Such experiments will provide stronger evidence that CS-A activates the cGAS-STING pathway in a SUN1 (mechanical stress)-dependent manner.

      * We will address this question by analysing cGas foci and cGas-STING pathway activation upon SUN1 depletion by siRNA.

      Reviewer #3

      • *

      The data are generally clear, well performed and well interpreted with some exceptions:*

      1) I appreciate the use of isogenic cell lines (a big plus when dealing with patient-derived cell lines). However, these lines were established 30 years ago and the reported phenotypes might be due to genetic drifts. To exclude this, I suggest to complement the HAP-1 ERCC8 KO cell line with exogenously expressed CSA and assess if this rescues the phenotypes reported. Validation of the KO in these lines, either by western blotting or sequencing is needed.*

      This point has also been raised by the first reviewer, and will be addressed as described above (and pasted below):

      We would like to thank the reviewer for this comment. Indeed we have previously observed nuclear circularity defects in CSA KO HAP1 cells but we haven’t investigated the other phenotypes in this cell line. One of the main reasons behind this is the technical difficulties associated with performing immunofluorescence with HAP1 cells who are very small and tend to grow in aggregates.

      Proposed experimental plan: To address this reviewer’s comment, we will attempt again to use HAP1 cells (WT and CSA KO) and look at nuclear circularity (with quantification), stress fiber formation and cGas foci. If we don’t succeed, we will use an alternative isogenic cell model consisting of fibroblasts in which we will knock out CSA using CRISPR/Cas9. We will then repeat the same experiments as proposed above for the HAP1 cells.

      2) Related to the complementation of patient cell lines, the exogenous HA-CSA is not recognised by the anti-CSA in the CSA-null patient cell lines (Fig 1B, second blot). Shouldn't you be able to see this exogenous protein? HA-GFP-CSB in the complemented CSB-null patient cell line runs at the same weight as endogenous CSB (Fig 1B, fourth blot). This is also unexpected. I think you need better characterisation of your cell lines and need to demonstrate the level of exogenous transgenes that have been used to complement the cells and that they localise appropriately, presumably to the nucleus. You should also make sure to cite the paper where they were isolated and describe that they were immortalised (Troelstra et al., 1992) and the paper in which transgenes were stably overexpressed (Qiang et al., 2021).*

      *

      As mentioned above, we have realised that we made some mistakes with the labeling of the molecular weight on this western blot. This will be corrected and the full uncropped western blot will be provided as a supplementary figure.

      We will also cite the suggested papers accordingly.

      3) Immunolocalisation of INM proteins is notoriously tricky and the permeabilisation steps include only 0.2% Tx100, which can be insufficient to permeabilise the INM. I appreciate the Emerin and Lamin immunostaining seems to have worked, but in many cases successful immunostaining can be antibody-specific. Can you try harsher permeabilisation to expose LEM2 epitopes? I'm somewhat uncomfortable with the suggestion that there is a cytosolic (ER?) pool of endogenous LEM2 as this runs counter to the literature and feel that your antibody or fixation conditions are illuminating a non-specific protein. The WB in Fig 2E shows that there is virtually no LEM2 in the "soluble" fraction. I would be more cautious on this cytoplasmic/nuclear pool interpretation. Biochemical nuclear and cytoplasmic fractionation would help clarify the signal in a NE vs a non-NE pool.*

      *

      As suggested by the reviewer, we will try harsher permeabilisation conditions to test the LEMD2 antibody. As we suggest in the manuscript however, we think that the “cytoplasmic” LEMD2 pool we observed by IF in the absence of pre-extraction is indeed unspecific. This is why we have performed the rest of the experiments with a pre-extraction step, that we have shown to give a specific LEMD2 signal that disappear upon depleting LEMD2 by siRNA.

      4) Page 15: "Using a Proximity Ligation Assay (PLA), we showed a significant reduction in the number of PLA foci in CS-A cells compared to the WT(HA-CSA) cells, reflecting a reduced number of LEMD2-lamin A/C complexes (Figure 2G, 2H). This data suggests defects in the incorporation of LEMD2 into the NE and lamin protein complexes in CS-A cells". If you have less LEM2 in the NE, it is quite expected that you will have less "LEM2-laminA/C" complexes. To me the logic doesn't hold and this data does not suggest that there is an underlying defect in LEM2-lamin interaction. To ascertain whether there is such a defect one could perform an IP against LEM2 and quantify laminA/C, normalizing by the amount of LEM2 in the input.

      We feel we may not have been clear in how we interpreted this data. What we mean is that in each individual cell, the number of Lamin-LEMD2 complexes is decreased, probably indeed due to the fact that there is less LEMD2 altogether within the nucleus in the absence of CSA. We will clarify this in the text.

      5) "We overexpressed LEMD2-GFP and Flag-CSA constructs, followed by GFP pulldown in WT(HA-CSA) cells". Since the co-IP data are obtained in overexpression conditions (of both HA-CSA and Flag-CSA?), the authors should validate the interaction between LEM2 and CSA using an orthogonal approach. Perhaps anti-HA capture of the WT(HA-CSA) cells would allow you to immunoblot for endogenous LEM2?*

      *

      To address this point, we will try to immunoprecipitate HA-CSA and look at endogenous LEMD2.

      6) Related to the CSA-LEM2 binding in the above experiment, the procedure involves combining a native detergent-extracted cytoplasmic pool with a denatured (RIPA-extracted) nuclear pool for performing the GFP-trap. From which pool was the tagged CSA bound to LEM2 in?*

      *

      We are sorry about the confusion. We didn’t try to run the IP from the different pools but instead from the combined pools, to ensure we were looking in the whole cell extract. We would expect however that the interaction occurs in the nuclear pool as both CSA and LEMD2 are nuclear proteins.

      7) "The absence of CSA in CS-A patient cells does not affect the mobility of LEMD2 at the NE but instead decreases its interaction with A-type lamins". To me the fact that loss of CSA decreases LEM2-lamin interaction is not well supported (see point 3).

      • *

      See our response to point 4

      8) "Here, we showed by immunoprecipitation that LEMD2 also interacts with CSA. This suggests that the recruitment and stabilization of LEMD2 to the NE is mediated by an interaction with CSA, although the mechanism remains unclear". I think this is an overstatement: there are no data suggesting that CSA recruits or stabilises LEM2 at the NE.

      * *We will tone down this statement in the text

      9) As the authors suggest in the discussion, it would be worth checking whether LEM2 overexpression is able to rescue some of the NE defects reported, strengthening the hypothesis that LEM2 levels are at least in part responsible for the phenotypes reported.

      To address this point, we will perform LEMD2 overexpression in the CSA cells, and analyse the nuclear envelope defects and ruptures (shape and cGas foci quantification)

      10) To me it is not clear how the reported phenotypes are interrelated. The first part of the manuscript shows that CSA interacts with LEM2, and that loss-of-function CSA impacts on LEM2 levels and LEM2-lamin interaction, suggesting a direct role for CSA at the nuclear envelope. The second part of the manuscript shows that cells with defective CSA have more actin stress fibres and releasing the cytoskeleton-nuclear tethering is able per se to rescue the nuclear membrane and cGAS phenotypes. How do the authors reconciliate these two parts? Is CSA directly involved in both inner nuclear membrane homeostasis and actin cytoskeleton modulation or is this latter role upstream and the NE defects a mere consequence of increased cytoskeleton rigidity?

      At this point indeed we cannot draw definitive conclusions as to whether the two described phenotypes are inter-related. However, by addressing the other points raised by the reviewers, we hope this will help clarifying the mechanism.

      11) It is not clear how or why actin stress fibres are elevated in the CS-A cells. Can the authors provide any insight based on their RNAseq analysis? Demonstrating a link to ROCK, LIMK or Rho signalling would be interesting and verifying ppMLC2 levels would help explain why contractility is enhanced. Additionally, is the increase in contractility dependent upon any of the genes identified as up- or downregulated in RNAseq? Presently, the manuscript is missing a link between its two halves.*

      *

      We would like to reiterate that the RNASeq analysis we performed was done on previously published data from another group (as described in the text). To address the point raised by the reviewer, we will look more specifically into our analysis to look at ROCK, LIMK or Rho signalling to see if any of these pathways appear to be modulated by the absence of CSA.

      12) Related to point 1, the RNAseq comparison was performed on patient cells lacking CS-A and patient cells lacking CS-A and later over-expressing HA-CSA, and this comparison is used extensively for phenotype description in the manuscript. In isn't clear to me that this is the most insightful comparison to make; the rescue by overexpression is not as elegant as CRISPR reversion and the ko fibroblasts have presumably been surviving well in culture without CS-A before this protein was overexpressed. Can you validate the differential expression of any identified proteins in the acute HAP1 ko? Can you validate any of the differentially expressed proteins in comparison to normal fibroblasts (e.g., 13O6, as per Qiang et al., 2021)?

      As we will validate our experiments in an additional cell model (as described above), we will also indeed validate the level of expression of cytoskeletal proteins upon CSA KO/rescue.

      Minor comments

      * - Page 14: "To characterize the NE phenotypes further, we obtained CS patient-derived cell lines carrying loss-of-function mutations in CSA (CS-A cells) or CSB (CS-B cells), and their respective isogenic control cell lines (WT(HACSA) and WT(HA-GFP-CSB))." What type of loss-of-function? Is the mutant protein still produced? In Fig 6A there seem to be a band in the CS-A blot (second lane), but in Fig 1B, there isn't. I think this is important to know to interpret the phenotype related to LEM2 interaction.*

      We can clarify that in the text. Indeed, the loss of function mutation leads to the absence of CSA protein.

      - Figure 1B is poorly annotated. What do - and + stand for? In general, I find a bit confusing how the WB are presented throughout the manuscript, specifically how the antibodies are reported (e.g., HA-CSA instead of HA). Please mark up all western blots with antisera used. Please make sure all expected bands are within the crops - e.g., Fig 3B, the anti-LEM2 blot should be expanded vertically to show the LEM2-GFP relative to endogenous LEM2.

      We will correct these on the figures

      - From the methods, it appears that you obtained a Please provide clarity on which construct was used in which figure, and verify that an N-terminally tagged LEM2 still localises to the NE.

      We actually cloned LEMD2 into an empty pEGFP vector but still maintained LEM2-GFP. We will remove the C1plasmid from the methods to avoid confusion as we removed the MCS and GFP and just used the blank vector and inserted lem2-gfp as we obtained it.

      - Fig 1I: there is some text on top of the upper panels (DAPI, cGAS, Merge).

      • *

      * - "Through gene ontology analysis, we found that genes involved in endoplasmic reticulum (ER) stress were differentially expressed (Figure 4B)". I don't think that the way data are shown in Fig 4B is effective. Since GO has been performed, I would replace the table with a GO enrichment analysis graph. Ensure to report all the data in a supplementary .xls so that others can see and reuse it. Is there a mandated repository that accepts RNAseq data?*

      The RNAseq experiment and data was performed by another group and reported in a previous study, as referenced in the main text of the manuscript (Epanchintsev A, Costanzo F, Rauschendorf MA, Caputo M, Ye T, Donnio LM, et al. Cockayne’s Syndrome A and B Proteins Regulate Transcription Arrest after Genotoxic Stress by Promoting ATF3 Degradation. Mol Cell. 2017 Dec;68(6):1054-1066.e6.). Here, we only re-analysed their data using STRING pathway analysis, as detailed in the Material and Methods. However, as suggested by the reviewer, we will replace the table by a GO enrichment graph.

      - The volcano plot looks weird with many values at the maximum log10 (P-value) - is the data processed appropriately?

      As mentioned above, the RNA Seq analysis was performed and published in a different study. We think this is because the Y axis shows adjusted P values.

      - Figure 5B: the legend says "Latrunculin A". Please correct.

      We will correct this

      - For a Wellcome funded researcher, I'm surprised that the mandated OA statement and RRS is absent from the acknowledgements.

      We will of course comply with the open access policy of the Wellcome Trust. However, and based on the WT requirements detailed on their website, we believe the acknowledgement section complies with the funder’s policy: “All research publications must acknowledge Wellcome's support and list the grant reference number which funded the research reported.”

      Maybe mention the changes in nuclear shape is not a causative of nuclear blebbing. But maybe not say that they are completely mutually exclusive phenotype to each other.

      suggestion

      Maybe say that we will overexpress LEMD2 in CS-A cells and show that the NE phenotype can be significantly rescued. This will add value to this part of story. I remember when I did the FRAP experiment, CS-A cells with expression of LEMD2-GFP (that doesn’t form aggregates) looks better in term of shape.

      I think Anne, please check the plasmid map? According to the lab inventory (Plasmids Anne), it is LEMD2-GFP. So probably GFP is at C-terminus.

      I think there was a part in discussion was LEMD2-GFP was mistakenly written as GFP-LEMD… But I am sure I used LEMD2-GFP throughout the work

      We cloned it into an empty pEGFP vector but still maintained LEM2-GFP. Maybe remove the C1 in the methods to avoid confusion as we removed the MCS and GFP and just used the blank vector and inserted lem2-gfp as we obtained it.

      Same construct was used for GFP pulldown and for FRAP. And we can see in FRAp that they localise to the NE. SO it should localise to the NE. Maybe mention that we will do a LEMD2-GFP over expression experiment in CS-A cells and show that they do localise to the NE.

      I don’t remember fully if Denny did this and what came out. I thought he did and ER stress and cytoskeleton regulation came out as enriched terms?

      Denny will have to check this but I think this is because the Y axis shows adjusted P values?? I have the same in my data and Jack told me this is an artefact of the analysis if you adjust for multiple comparisons and is something more often seen in mass spec data

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, Yang and colleagues report that the absence of CSA (a protein with a well-characterised role in DNA damage repair and whose mutations cause a premature ageing syndrome) leads to defects in nuclear envelope (NE) integrity. Using a Cockayne syndrome patient-derived cell line they show that CSA deficiency correlates with the presence of an aberrant nuclear morphology, nuclear blebbing and increased cGAS-signalling. Moreover, they suggest that this CSA defective cell line presents decreased LEM2 at the NE and a consequent reduction in LEM2-laminA/C interaction. They were able to observe interaction betweenCSA and LEM2, andsuggested that CSA might interfere with LEM2 functionality at the nuclear envelope. Starting from the analysis of an already published RNAseq dataset, the authors go on investigating the cytoskeleton status in the CSA mutant cell line and identify an increase in actin, which they suggest leads to an increase in actin stress fibres. By promoting general actin depolymerisation and by disanchoring the cytoskeleton from the nuclear envelope, Yang et al are able to rescue the dysfunctional phenotypes reported above, suggesting that increased mechanical forces transmitted from the actin cytoskeleton to the nuclear envelope might cause defects in nuclear envelope integrity. Finally, the authors report that alleviating the nuclear envelope defects in CSA mutant cells does not decrease their sensitivity to a UV mimetic, hinting that CSA role in maintaining nuclear integrity is independent from its DNA damage repair function.

      Major comments

      The key conclusions of the manuscript are convincing and supported by the data. However, there are some conceptual concerns that limit my enthusiasm.

      Firstly, that the actin cytoskeleton can transmit forces to the NE via the LINC complex is established. That these forces that lead to NE deformation, and in some cases, rupture, is somewhat unsurprising. The interesting observation in this manuscript is that CSA mutant patient cells exhibit more stress fibres. This leads to more LINC-dependent transmission of forces to the NE, with consequential effects on NE morphology and rupture. However, the mechanism by which actomyosin contractility is elevated in CSA-mutant patient cells is unexplored.

      Secondly, it is unclear whether CSA controls the quality, or simply the number, of LEM2/Lamin interactions. As such, a mechanistic link between elevated stress fibres and CSA-dependent NE/lamina interactions is not provided and the paper sits as two presently unconnected observations.

      The data are generally clear, well performed and well interpreted with some exceptions:

      1. I appreciate the use of isogenic cell lines (a big plus when dealing with patient-derived cell lines). However, these lines were established 30 years ago and the reported phenotypes might be due to genetic drifts. To exclude this, I suggest to complement the HAP-1 ERCC8 KO cell line with exogenously expressed CSA and assess if this rescues the phenotypes reported. Validation of the KO in these lines, either by western blotting or sequencing is needed.
      2. Related to the complementation of patient cell lines, the exogenous HA-CSA is not recognised by the anti-CSA in the CSA-null patient cell lines (Fig 1B, second blot). Shouldn't you be able to see this exogenous protein? HA-GFP-CSB in the complemented CSB-null patient cell line runs at the same weight as endogenous CSB (Fig 1B, fourth blot). This is also unexpected. I think you need better characterisation of your cell lines and need to demonstrate the level of exogenous transgenes that have been used to complement the cells and that they localise appropriately, presumably to the nucleus. You should also make sure to cite the paper where they were isolated and describe that they were immortalised (Troelstra et al., 1992) and the paper in which transgenes were stably overexpressed (Qiang et al., 2021).
      3. Immunolocalisation of INM proteins is notoriously tricky and the permeabilisation steps include only 0.2% Tx100, which can be insufficient to permeabilise the INM. I appreciate the Emerin and Lamin immunostaining seems to have worked, but in many cases successful immunostaining can be antibody-specific. Can you try harsher permeabilisation to expose LEM2 epitopes? I'm somewhat uncomfortable with the suggestion that there is a cytosolic (ER?) pool of endogenous LEM2 as this runs counter to the literature and feel that your antibody or fixation conditions are illuminating a non-specific protein. The WB in Fig 2E shows that there is virtually no LEM2 in the "soluble" fraction. I would be more cautious on this cytoplasmic/nuclear pool interpretation. Biochemical nuclear and cytoplasmic fractionation would help clarify the signal in a NE vs a non-NE pool.
      4. Pag 15: "Using a Proximity Ligation Assay (PLA), we showed a significant reduction in the number of PLA foci in CS-A cells compared to the WT(HA-CSA) cells, reflecting a reduced number of LEMD2-lamin A/C complexes (Figure 2G, 2H). This data suggests defects in the incorporation of LEMD2 into the NE and lamin protein complexes in CS-A cells". If you have less LEM2 in the NE, it is quite expected that you will have less "LEM2-laminA/C" complexes. To me the logic doesn't hold and this data does not suggest that there is an underlying defect in LEM2-lamin interaction. To ascertain whether there is such a defect one could perform an IP against LEM2 and quantify laminA/C, normalizing by the amount of LEM2 in the input.
      5. "We overexpressed LEMD2-GFP and Flag-CSA constructs, followed by GFP pulldown in WT(HA-CSA) cells". Since the co-IP data are obtained in overexpression conditions (of both HA-CSA and Flag-CSA?), the authors should validate the interaction between LEM2 and CSA using an orthogonal approach. Perhaps anti-HA capture of the WT(HA-CSA) cells would allow you to immunoblot for endogenous LEM2?
      6. Related to the CSA-LEM2 binding in the above experiment, the procedure involves combining a native detergent-extracted cytoplasmic pool with a denatured (RIPA-extracted) nuclear pool for performing the GFP-trap. From which pool was the tagged CSA bound to LEM2 in?
      7. "The absence of CSA in CS-A patient cells does not affect the mobility of LEMD2 at the NE but instead decreases its interaction with A-type lamins". To me the fact that loss of CSA decreases LEM2-lamin interaction is not well supported (see point 3).
      8. "Here, we showed by immunoprecipitation that LEMD2 also interacts with CSA. This suggests that the recruitment and stabilization of LEMD2 to the NE is mediated by an interaction with CSA, although the mechanism remains unclear". I think this is an overstatement: there are no data suggesting that CSA recruits or stabilises LEM2 at the NE.
      9. As the authors suggest in the discussion, it would be worth checking whether LEM2 overexpression is able to rescue some of the NE defects reported, strengthening the hypothesis that LEM2 levels are at least in part responsible for the phenotypes reported.
      10. To me it is not clear how the reported phenotypes are interrelated. The first part of the manuscript shows that CSA interacts with LEM2, and that loss-of-function CSA impacts on LEM2 levels and LEM2-lamin interaction, suggesting a direct role for CSA at the nuclear envelope. The second part of the manuscript shows that cells with defective CSA have more actin stress fibres and releasing the cytoskeleton-nuclear tethering is able per se to rescue the nuclear membrane and cGAS phenotypes. How do the authors reconciliate these two parts? Is CSA directly involved in both inner nuclear membrane homeostasis and actin cytoskeleton modulation or is this latter role upstream and the NE defects a mere consequence of increased cytoskeleton rigidity?
      11. It is not clear how or why actin stress fibres are elevated in the CS-A cells. Can the authors provide any insight based on their RNAseq analysis? Demonstrating a link to ROCK, LIMK or Rho signalling would be interesting and verifying ppMLC2 levels would help explain why contractility is enhanced. Additionally, is the increase in contractility dependent upon any of the genes identified as up- or downregulated in RNAseq? Presently, the manuscript is missing a link between its two halves.
      12. Related to point 1, the RNAseq comparison was performed on patient cells lacking CS-A and patient cells lacking CS-A and later over-expressing HA-CSA, and this comparison is used extensively for phenotype description in the manuscript. In isn't clear to me that this is the most insightful comparison to make; the rescue by overexpression is not as elegant as CRISPR reversion and the ko fibroblasts have presumably been surviving well in culture without CS-A before this protein was overexpressed. Can you validate the differential expression of any identified proteins in the acute HAP1 ko? Can you validate any of the differentially expressed proteins in comparison to normal fibroblasts (e.g., 13O6, as per Qiang et al., 2021)?

      Minor comments

      • Pag 14: "To characterize the NE phenotypes further, we obtained CS patient-derived cell lines carrying loss-of-function mutations in CSA (CS-A cells) or CSB (CS-B cells), and their respective isogenic control cell lines (WT(HACSA) and WT(HA-GFP-CSB))." What type of loss-of-function? Is the mutant protein still produced? In Fig 6A there seem to be a band in the CS-A blot (second lane), but in Fig 1B, there isn't. I think this is important to know to interpret the phenotype related to LEM2 interaction.
      • Figure 1B is poorly annotated. What do - and + stand for? In general, I find a bit confusing how the WB are presented throughout the manuscript, specifically how the antibodies are reported (e.g., HA-CSA instead of HA). Please mark up all western blots with antisera used. Please make sure all expected bands are within the crops - e.g., Fig 3B, the anti-LEM2 blot should be expanded vertically to show the LEM2-GFP relative to endogenous LEM2.
      • From the methods, it appears that you obtained a LEM2-GFP, and cloned it into an expression vector (pEGFPC1) to make GFP-LEM2. Please provide clarity on which construct was used in which figure, and verify that an N-terminally tagged LEM2 still localises to the NE.
      • Fig 1I: there is some text on top of the upper panels (DAPI, cGAS, Merge).
      • "Through gene ontology analysis, we found that genes involved in endoplasmic reticulum (ER) stress were differentially expressed (Figure 4B)". I don't think that the way data are shown in Fig 4B is effective. Since GO has been performed, I would replace the table with a GO enrichment analysis graph. Ensure to report all the data in a supplementary .xls so that others can see and reuse it. Is there a mandated repository that accepts RNAseq data?
      • The volcano plot looks weird with many values at the maximum log10 (P-value) - is the data processed appropriately?
      • Figure 5B: the legend says "Latrunculin A". Please correct.
      • For a Wellcome funded researcher, I'm surprised that the mandated OA statement and RRS is absent from the acknowledgements.

      Referees cross-commenting

      I think the other reviews are fair and accurate

      Significance

      This work provides interesting insights on a possible new moonlighting role of the CSA protein. This could enhance the pathophysiological comprehension of some clinical manifestations in CS patients. The nuclear envelope defects are well described and convincing; however, there is no clear understanding of how (or even if) the reported cytoskeleton and nuclear envelope defects depend on CSA. Better characterising mechanistic roles of CSA in both nuclear envelope integrity and in stress fibre formation will boost the overall impact of the manuscript. At this stage, the manuscript could be of interest for a specialised audience (premature ageing syndromes) but with more mechanistical dissection it could become of broader interest (basic research, nuclear architecture/integrity readership).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript by Yang et al. shows that ERCC8 (Cockayne syndrome A protein) mutations/loss of function causes nuclear envelope (NE) abnormalities and fragility, in addition to the well described deficiencies in transcription-coupled nucleotide excision repair (TC-NER). Cells from CS-A patients show decreased LEMD2-lamin A/C complexes at the NE and increased actin stress fibers that cause mechanical stress in the NE, in addition to more blebbing and ruptures of NE. This is turn causes activation of the innate/immune cGAS/STING pathway. Importantly, disrupting the LINC complex rescued NE problems and activation of the cGAS/STING pathway. This effect of ERCC8 dysfunction on NE integrity and activation of the cGAS/STING pathway may be behind patients' phenotypes of neuroinflammation, but not UV sensitivity.

      The paper is well-written and for the most part, the data support the conclusions of the authors. Some minor caveats could be addressed to improve the quality of the manuscript.

      • The phenotype of decreased LEMD2 incorporation into the NE in CS-A cells is minor. Only ~20% and thus, it is not clear whether this is causal of any of the NE abnormalities. It should be better explained how these data add to the story.
      • Inducing actin polymerization and depolarization impact nuclear morphological abnormalities and nuclear blebbing. Do these treatments impact nuclear fragility and cGAS accumulation at NE break sites?
      • Depletion of SUN1 in CS-A cells increased nuclear circularity, decreased blebbing, and phosphorylation of TBK1. The impact of SUN1 depletion in cGAS foci formation at NE break sites and phosphorylation of STING is not shown. Such experiments will provide stronger evidence that CS-A activates the cGAS-STING pathway in a SUN1 (mechanical stress)-dependent manner.

      Referees cross-commenting

      I am more positive than the other reviewers. I agree with reviewer 1 that another patient line would be great, and that some of the westerns need improvement. However, I still find that the phenotypes found in this cell line, which are rescued by the wild-type protein, are interesting and worth reporting. I do not fully agree with technical recommendations from reviewer 3, or suggesting that that the authors address questions outside the focus of their study.

      Significance

      The significance of the study is that shows for the first time the nuclear envelop defects and fragility in cells from Cockayne Syndrome patients (mutations in ERCC8) are due to mechanical stress that is alleviated by depletion of the LINC complex. In addition, the study shows activation of the cGAS-STING pathway in these cells, although the evidence about this phenotype could be strengthen.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Cockayne syndrome (CS), an autosomal recessive premature ageing/progeria disease, is caused by mutations in the genes coding for CSA and CSB. These play a well-studied role in transcription-coupled nucleotide excision repair. In this manuscript, CSA but not CSB dysfunction is associated with nuclear envelope defects, a typical phenotype of cells from progeria diseases caused by mutations in NE proteins. HPA model cells lacking CSA show an irregular nuclear envelope not observed in cells lacking CSB. Patient cells with defective CSA, compared to isogenic cells expressing the wild-type CSA protein, show more severe nuclear envelope deformation and blebbing and activation of the cGAS/STING signaling pathway, indicating nuclear envelope damage, a slight reduction in the expression of LEM2, a protein of the inner nuclear membrane, and a reduced interaction of LEM2 with the lamina component lamin A/C. Patient cells with defective CSA show aberrant acThe proposed link between CSA and nuclear envelope integrity is interesting. CSA would in this case a factor involved in DNA repair as well as nuclear envelope integrity, two processes linked to progeria. However, the work relies on a single patient derived cell line which is compared to the same cell line expressing the wild type protein. To consolidate, more cells form different CSA patients need to be included. If worked out this idea will be interesting all cell biologists interested in DAN damage repair and nuclear envelope structure and function as well as researchers interested in the molecular mechanisms of progeria syndromes. My expertise: nuclear envelope structure and function, nuclear transport tin stress fiber and their nuclei show different effects toward actin polymerization inhibitors or stabilizers. Downregulation of SUN1, one of the nuclear envelope proteins linking to the cytoskeleton, improves nuclear envelope blebbing phenotypes in CSA patient cells

      Major comments:

      Albeit the link between CSA and NE integrity the work is in my eyes too preliminary. Although the data presented are well done and carefully evaluated they mostly (except Fig 1A) rely on direct comparisons of one patient cell line (CS-A or CS-B) to the same cells expression the wildtype protein. It remains thus open whether the effects seen on LEM2 expression, LEM2-LaimA/C interaction, stress fibre formation, cGAS/STING signaling pathway activation in the CS-A cells are representative for a number of different CS patient derived cells. This is especially important given the small changes observed. Please note that there are alos clear differences between the CSA-wt and CSB-wt cells. Would the HPA CSA KO cells show in addition to NE irregularities (not even quantified) the same phenotypes and can they be reverted by re-expression of the wildtype CSA protein? The link between CSA and SUN1 is not well worked out. What is the effect of SUN2 downregulation and that of nespirins? It remains unclear whether the observed effects are indeed LINC mediated.

      Minor comments:

      Fig 1B: Why is HA-Tagged CSA not shown on the CSA western? This would be helpful to compare to the endogenous levels at least in CSB cells. A western showing an housekeeping marker would allow better comparison. Judging from the proteins markers HA-tagged CSA seems much larger as endogenous CSA (first versus second row). Again, less cropped western blots would help.

      Fig 3A: Is CSA FLAG or HA-tagged? Or both? If both are expressed the question raises of why the CSA-LEM 2 interaction is only seen in an overexpression situation.

      Fig 5: Inconsistency between figure and figure legend: 20 vs 25 nM Jasplakinolide. I assume Latrunculin A should read Cytochalasin D?

      Fig5B: Not clear why in "CSA-wT cells" Cytochalasin D and Jasplakinolide have the same effect on nuclear envelope shape yet only Jasplakinolide increases the number of blebs.

      Page 10: Method for IF: 4% (v/v) paraformaldehyde and 2% /v/v) should likely read (w/v).

      Page 19: replace "withl" by "with".

      Significance

      The proposed link between CSA and nuclear envelope integrity is interesting. CSA would in this case a factor involved in DNA repair as well as nuclear envelope integrity, two processes linked to progeria. However, the work relies on a single patient derived cell line which is compared to the same cell line expressing the wild type protein. To consolidate, more cells form different CSA patients need to be included. If worked out this idea will be interesting all cell biologists interested in DAN damage repair and nuclear envelope structure and function as well as researchers interested in the molecular mechanisms of progeria syndromes.

      My expertise: nuclear envelope structure and function, nuclear transport

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors conducted a comprehensive investigation into the spatial and temporal expression patterns of Pcdh10 during brain development and employed two deficient mouse models to delve into its neurobehavioral functions, with a specific emphasis on ultrasonic vocalizations (USV). Interestingly, the authors found that heterozygous cKO pups showed an exaggerated effect on USV as compared to the ubiquitous heterozygous KO pups. These observations in general agree with previous studies of Pcdh10's complicated function in ASD-related neurodevelopment, as well as USV, indicating its important function in neurodevelopment. However, several key concerns warrant attention:

      Major comments:

      1. The authors analyzed published scRNAseq adult mouse dataset and found that Pcdh10-expressing cells differentially expressed genes involved in vocalization behaviors, including Foxp2, Cntnap2, Nrxn1 and Nrxn3, as compared to cells that did not express Pcdh10. Given that the authors also performed bulk RNA-seq experiments using the sorted cKO cells, it would be interesting and important to include these data in the analysis and validate whether the above genes are differentially expressed in the animal model in the current study. This may provide additional molecular mechanisms of the behavioral abnormality observed in the animal models.
      2. In Figure 1S, the synapse assembly is one of the most significant GO-terms. Does Pcdh10 deficiency affect neuromophology and synaptic density/assembly? Characterizing the neuroanatomy of the animal model will provide a neuronal explanation for the behavioral abnormalities observed in this study.
      3. Could the authors explain why cHE mice display a stronger phenotype than cKO at P6? Also, why cHE and cKO show stronger phenotypes than whole-body KO? Does this data indicate that loss of Pcdh10 in the inhibitory neurons only resulted in E-I in-balance? The authors should at least discuss the possibilities of this result they observed.

      Minor comments:

      1. It would be interesting and important to know the level of PCDH10 at the adult stage after P7 to learn whether this gene also plays an important role beyond early development.
      2. Besides USV, does Pchd10 deficiency in mice show other autistic phenotypes in adolescence and adulthood, such as social interaction and repetitive behavior?

      Significance

      The experiments are well designed and the detailed characterization of USV in the animal models would be informative for the audience who are interested in Pcdh10's function in neurodevelopment and Social Communication. However, because previous studies already showed the function of PCDH10 in multiple mouse models, the novelty of this study is limited. The observation that cKO/cHE mice show stronger phenotype than whole-body KO mice could be potentially interesting, it would be nice if the authors could provide some explanation of this result with additional experimental evidence.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In this paper the authors present a new cKO mouse model for PCDH10-related ASD. This model consists of the ablation of PCDH10 specifically in interneurons of the basolateral complex. Interestingly, the use of this mouse model together with a complete KO adds evidence towards an excitatory/inhibitory imbalance causing the ASD phenotype, rather than the complete ablation of a protein (in this case PCDH10). Firstly, the developmental dynamics of PCDH10 are measured with diverse techniques. The presence of PCDH10 at embryonic stages in different areas of the basal forebrain. Here it is made clear why the specific downregulation of PCDH10 in Gsh2- lineage interneurons. The cKO mouse model is validated with bulk RNA sequencing fluorescence imaging of the eGFP reporter in the telencephalon. The KO mouse model is validated with WB assay showing the gradual decrease of PCDH10 in Het and Ho mice. Secondly, the USV emitted by isolated pups throughout development (P3, P6, P9 and P12) are analysed with different parameters in both mouse models.

      Major Comments

      1. The role of anxiety in the phenotype described in this work should be supported by behavioural experiments in adulthood (Open field/light/dark/Plus Maze test)2-3M for mice to reach the appropriate age + 2 weeks for performing the experiments and extracting results.
      2. The WB in panel F1C&E should be done with non-pooled biological replicates to be informative 3 weeks
      3. The statistical test used for F2B and F3D-I needs to be specified.
      4. The reduced GABA input in the amygdala that is hypothesized to be causing the phenotype could be studied by iPSP analysis through LFP (OPTIONAL) 1M

      Minor comments

      1. The results section should be subdivided in sections corresponding to each figure
      2. Detail the pinhole opening in M&M used for the imaging of the images in panel of Figure 1 M-R
      3. The group size and the power calculation used to determine it should be detailed in M&M
      4. The WB membrane image in Panel 1F has saturated pixels, the image needs to be changed
      5. Instead of asterisks, writing the exact p-value is more informative in the graphs
      6. Figure 1B & D: detail what the Pcdh10 levels are normalised to. In the legend there's a typo "no-way ANOVA"
      7. Figure 1F: specify what it means P17.5 (norm)
      8. Figures 3-4: choose higher contrast colours for an easier readability and more accessibility.
      9. Figure 3B: the WB image needs to be at a higher resolution
      10. Figure 3C: the colour coding for dBFS in the spectrogram needs to be specified in the maximum and minimum number.
      11. Figure 3D-I & 6G: results would be more clear if shown as a ratio of the WT (would be also more evident the mouse model differences). Also the titles of the graphs are misleading, as the graphs are showing data from all the genotypes of the mouse models.
      12. Figure 2B: specify what it is normalised to
      13. Figure 4 E-H: it is not described what the dotted lines correspond to.
      14. In all the figures, the panels are excessively subdivided, the following panels should be grouped in one:
        • a. Figure 1: i. C & E are showing the same data

      ii. G-I are showing the same data

      iii. J-L are showing the same data

      iv. M-R are showing the same data - b. Figure 2: panel 2C-G are showing the same data - c. Figure 3: i. A-B are showing the same data

      ii. D-I are showing the same data - d. Figure 4: i. A-C are showing the same data

      ii. E-H are showing the same data - e. Figure 5: the frequency parameters (A-E) should be all 1 panel f. Figure 6: B-E are showing the same data

      Significance

      The detailed study of the socio-affective communication of these mouse models is accurate and quite informative. There is still a big body of work to do for classifying and using pup USV as biomarkers for mouse model phenotyping, and this thorough work is a step forward. This type of work will be of interest to neurodevelopmental neuroscientists interested in behaviour and mouse model phenotyping. However, the claim of an autistic-like phenotype would be much stronger with additional behavioural assessments in adulthood (as USV in mating behaviour, social novelty recognition and stereotypical behaviours). In addition, the hypothesis of an excitation/inhibition imbalance is interesting, but correlational in this work. Further experiments would need to be done to prove causality.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper provides evidence that Pcdh10 (associated with autism spectrum disorder) is involved in anxiety-like behavior and socio-affective communication in developing mouse pups. Using a specific Cre-driver line targeting Gsh2-lineage interneurons, the authors performed a series of behavioral analyses, including isolation-induced ultrasonic vocalization. Particularly, the authors provided detailed analyses to provide distinct clusters that might correspond to identified call types. This work is excellent and I do not have any further comments, except one minor comment: why did the authors use CD1 line to generate the mouse lines, instead of C57BL/6 substrains?

      Significance

      This is of paramount significance, such that Pcdh10 expressed in Gsh2-lineage interneurons (a subpopulation in the basolateral complex of the amygdala) is required for isolation-induced ultrasonic vocalization.

    1. Note: This response was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      General Statements

      We thank the reviewers for their constructive feedback on our study. We have updated the manuscript according to their recommendations.

      Two of the three reviewers raised concerns about the quality of some of our data, in particular of the DP population. Since most DP samples suffered from low coverage, introducing bias when being compared to the higher quality data, we moved all results involving DP to the Supplement. We mention this in the Discussion. The main text and its figures were updated to focus on the remaining three wild populations and two laboratory strains.

      We rewrote parts of the Abstract, Introduction and Discussion to clarify the comparison between the NLR families of Arabidopsis thaliana and Danio rerio. In regard to this, we also added an extra panel to figure 3.

      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      General comments:<br /> 1) Odd statement in abstract comparing plants to fishes - why not put in context of animals in general, or vertebrates in particular? Same in the introduction and discussion. Either a more broad comparative framework should be set - from plants, through invertebrates to vertebrates, or it should be shortened and perhaps discussed in the light of diversity present in e.g., fish or just vertebrates. A persistent focus on plants is peculiar and does not seem to provide adequate background/context._

      We have made an effort to clarify the reason for mentioning NLRs from A. thaliana in the Abstract, Introduction and Discussion. We added a plot to figure 3, which is analogous to figure 3A from Van de Weyer et al., 2019 where the pan-NLRome of A. thaliana is analysed.

      Furthermore, we added a paragraph discussing copy number variation of another immune gene family (MHC genes) in vertebrates.

      2) Methods seem overall well described, and the analyses seem to have been performed diligently-although I am not an expert in the field. One aspect that would be important to evaluate methodology is the repeatability of NLR detection - If we perform the procedure twice on DNA sample from a single individual, how repeatable collection do we get?

      It is possible that we might have failed to identify a few NLRs from some individuals. Unfortunately, sequencing could not be repeated due to the limited amount of raw tissue. The population DP in particular is problematic since we had low sequencing depths across all of its samples. We have now excluded it from the main results and moved it to the Supplement. Still, we are confident that the general feature of widespread copy number variation both within and between populations persists even if the experiments were repeated. One argument for this is that the non-linear functions describing pan-NLRome sizes are fairly consistent among wild populations. Also, sample sizes of about 20 individuals per (wild) population are large enough to compensate for poor data quality of one or two indviduals. Finally, we found a substantial fraction of NLRs to be present in all wild populations, including DP. It would be unlikely to detect so many shared genes if the amplified sequences were spurious or the NLR identification procedure faulty.

      We have expanded the discussion of the caveats of our approach.

      3) Discussion is quite speculative, and some claims seem exaggerated; in conclusions, eg: "This study advances our understanding of the evolutionary dynamics affecting very large gene families." - as the study is mostly descriptive documentation od CNV of gene (or, gene fragments) I am not convinces how it really advances "understanding of the evolutionary dynamics".

      We toned down Discussion and Conclusions.

      Minor points:

      At this stage the journal and so the audience is unknown, so perhaps this will not be an issue - but for a broader audience, a better explanation of what PRY/SPRY/B30.2 are would be useful._

      We added an explanation.

      English could be smoothed, certain sentences sound odd: eg. lines 87-89; or 90-93 - Studies have shown that viral and bacterial infections can induce the expression of specific fish NLRs (reviewed in (24)). Some have PYD or CARD domains and can even form inflammasomes similar to mammalian NLRs (25, 26). - could be read as if infections had PYD or CARD domains, not NLRs. Lines 239-240 - not sure what does "presence/absence variation" mean here.

      We made an effort to improve language style.

      Sequence of baits used should be provided in some supplement or repository.

      The sequences are now attached to the manuscript as a supplementary dataset.

      Reviewer #1 (Significance):

      The article tackles an overall interesting subject: an expansion of a relevant group of genes in one of the major model species of biomedical importance. The main strength of the study is putting in context variation found in the lab strains - via comparison to wild populations. A limited and "haphazard" genetic variation (especially of genes involved in immune processes) of laboratory models can have paramount implications for the interpretation of experimental studies._

      We expand on this in the Discussion._

      That being said, expansion of NLR family in teleosts in general, and in zebrafish in particular, has been previously described (eg, https://bmcecolevol.biomedcentral.com/articles/10.1186/1471-2148-8-42_), and this study mainly expands description of this phenomena.

      The expansion of NLRs has indeed been described before and in great detail. The manuscript contains multiple references on this topic, including the one recommended by the reviewer (e.g. Stein et al., 2007, Laing et al., 2008, Howe et al., 2016). The previously reported expansion of NLRs in the reference genome was a description, but not an interpretation "in the light of evolution". We demonstrate that gene birth, death, and presence/absence variation are ongoing processes and active on a population-genetic time scale.

      Given the methodology, little can be said about possible functions of the discovered diversity. In particular, a technical aspect imposing limitation of sequence length resulted in a collection of exons, but not full genes - precluding e.g., deepened analysis of domain architecture.

      We agree that our study does not reveal much about the function of the NLR-C genes. The focus on function in the discussion was disproportionate to our findings and we have reduced it to avoid speculation.

      Selection analysis is also quite limited in scope; there are several, more sophisticated tools to infer balancing or purifying selection, either pervasive or episodic (see eg. excellent tools of_ https://www.datamonkey.org__/_).

      Yes. However, with our limited amount of data, we deliberately do not want to speculate too much about selection. As mentioned in the text, many genes are monomorphic in our sample. To reliably infer the action of selection, and more so to distinguish it from signatures of demographic history, a much broader basis of sequence data is needed.

      Nonetheless, the study certainly highlights the possible importance of such an expanded group of genes in this species, with a potential to inspire further - more mechanistic/functional research in this area. The article will likely interest a few groups of rather specialized audience - e.g., those working in NLR genes in particular, or in immunity of zebrafish (or fish in general). Another, somewhat broader angle, able to attract a wider audience would be subjects of general evolution of multigenic families (birth-and-death models, trade-offs at different CNV etc.) and comparative analysis to other groups of animals - or genes - evolving in a similar way. My field of expertise is diversity and evolution of adaptive branch of the immune system in vertebrates, in particular non-model vertebrates.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary:

      The authors have examined the variation of the immune related gene family "Nucleotide-binding domain Leucine-rich Repeat containing" (NLR) in the zebrafish Danio rerio. These proteins, while highly divergent, are conserved across animal and plant species. In humans or rodents, the number of these genes is relatively small (20-30) but other species such as Arabidopsis appear to have >10,000 NLR genes.<br /> Schäfer et al. collected 67 wild-type zebrafish from 4 independent sites near the Bay of Bengal and added 8 fish each from the laboratory strains Tübingen (TU) and Cologne (CGN). Using DNA capture and sequencing technology, they identified 1,560 unique "FISNA-NACHT" domains representing different NLR genes and 574 NLR-associated B30.2 domains. A subset, 714 and 229 respectively, were identified in all of the fish populations representing a "core" set common NLR genes. A significant subset of the genes could not be aligned to the GRCz11 reference suggesting significant variation across subpopulations of zebrafish.

      Major comments:<br /> None. The work is straight-forward, carefully done and conservatively interpreted.

      Minor comments:<br /> GRCz11 is still a very fragmented assembly, particularly chr4 where heterochromatin repeats on the long arm were intractable to short read sequences. There is a more recent assembly generated by the Tree of Life initiative (GCA_944039275.1) from the SAT fish line that may allow a more robust alignment and placement of the NLR genes, perhaps even some phasing of unassembled fragments._

      We have examined some of the existing long-read based zebrafish assemblies and found that even the length of chromosome 4 that contains most of the NLRs can significantly differ between different strains and different genome assemblies. This is also in agreement with the recent findings of McConnell et al., 2023 on large structural differences on this chromosome between three strains derived from the AB genetic background. We now mention this finding and the alternative existing zebrafish assemblies in the Discussion.

      Reviewer #2 (Significance):

      General assessment:<br /> Obtaining more data on the inherent variation within species of NLR genes as well as collecting more cross-species data for evolutionary comparisons is valuable for our understanding of this interesting but poorly understood class of immune response genes as well as the dynamics of gene duplication/deletion in complex, repeated arrays.<br /> -The major strength of the study is the efforts put into capturing wild-type zebrafish from multiple different locations to maximize the diversity of the NLR sequences.<br /> -The primary limitation is targeted capture is always contingent on having enough homology in the probes to capture all the desired genes in roughly even proportions. Some of the more interesting NLR genes might have diverged too much to be properly captured but could provide critical information on evolutionary functional adaptation. Similarly, while based on cost, it is understandable why sequence capture was chosen, but phasing information across clusters could help explain (or discover) very diverse haplotype sequences in the introns of many of these gene arrays. As sequencing costs drop, this is an important aspect to the evolution of zebrafish chromosome 4 (as well as the rest of the genome). "Pangenome" differences in the zebrafish genome might be quite radically different.

      Yes, we are aware of this possibility and discussed it now in more detail._

      Advance:<br /> The study represents an important characterization of a poorly understood but important class of immune genes. A very similar (but more detailed) characterization of Arabidopsis NLR genes by Van de Weyer et al., Cell 2019, is considered an important advance to the plant biology community and has been highly cited since its publication. The collected data is potentially quite useful for future studies that want to understand the complex dynamics of highly repetitive, yet functional regions of the genome and how it can result in the creation and extinction of new genes.

      Audience:<br /> There are two main likely audiences, those interested in how NLR genes are involved immune protection and those interested in genomes and evolution. It probably doesn't reveal a fundamental biological principle that would be of broad, general interest to the research community.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:<br /> The authors performed exon capture sequencing of FISNACHT and B30.2 domains from individuals among various inbred and wild zebrafish populations. This study provides average numbers of domains per fish and thus a valuable estimate for the size of the pan-NLRome in zebrafish.

      Major comments:<br /> - Are the key conclusions convincing?_ Yes, and overall their findings are also consistent with what has been found in other model systems, most notably Arabidopsis (Weyer 2019, https://www.cell.com/cell/fulltext/S0092-8674(19)30837-2).<br /> - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?<br /> 'Complex patterns of inheritance' is unclear from significance statement.

      We have reformulated this sentence.

      For mechanisms driving copy number differences, haplotypes/segregation need not be presented as a model opposed to birth and death gene evolution/tandem duplication. These may both operate at different scales.

      Yes, we reformulated.

      • Are the data and the methods presented in such a way that they can be reproduced?<br /> Yes, overall, though the authors could provide sequences for their adapters, similar to Weyer 2019. It seems that raw data for this study is not yet available in NCBI: https://www.ncbi.nlm.nih.gov/bioproject/966920

      The data will be made available at the time of publication. The sequences for all baits used, including non-NLR-ones, have been added as a supplementary .fasta-formatted text file.

      Minor comments:<br /> - Specific experimental issues that are easily addressable.<br /> Their targeted exon capture approach cannot ensure that all NLR genes will be sequenced. This method also does not sequence across other exons from NLR genes, providing only a partial view of NLR gene structure and evolution. This may miss for example additional integrated domain architectures or evidence of physical clustering in the genome (Thatcher 2023, https://bsppjournals.onlinelibrary.wiley.com/doi/10.1111/mpp.13319). Nevertheless, focusing on capturing and enumerating FISNACHT domains as the most conserved and characteristic domain for NLRs appears to be a reasonable approach for estimating rough copy numbers.

      We agree with the reviewer and have now added explanatory text on this topic to the discussion.

      Unfortunately, mapping these domains back to only a single reference genome also means that the accuracy of their predictions for these domains, including which belong to genes or pseudogenes, is likely to be more error-prone (Wang 2019,https://www.pnas.org/doi/10.1073/pnas.1910229116), particularly for more divergent sequences. This may lead to additional limitations for assigning one-to-one relationships between FISNACHT domains, which they measure, and genes, which they seek to enumerate.

      Thank you for pointing out the above reference. This is a limitation that we realized early on in the study after which we decided to not rely on the reference genome at all, instead opting to assemble and cluster all data de novo. In the manuscript we only use the reference genome for checking which of our identified NLRs have a clear homologue to satisfy potential interest by the biomedical community. All the other analyses were conducted based on the de novo assemblies and orthologous clusters._

      In addition, many of their samples appear to be of poor DNA quality, meaning that

      modeling estimates of domain copy number for the population can appear erroneous, even perhaps off by as much as a factor of 10 for the DP strain. This may contribute to some curious 'artifact' in some of the figures, such as Figs S2B1, S3B, S4C2, particularly for the B30.2 sequences in DP. The authors mention 'low sequencing depths' and 'low coverage' as contributing factors for this 'artifact' but also invoke possible 'evolutionary factors' in the discussion. Additional discussion for the apparent experimentally-induced effects on underestimating B30.2 sequences (perhaps due to increased DNA fragmentation and smaller domain size?), rather than coincidence of low coverage and actual biological strain-specific loss of B30.2 domains, appears warranted.

      As mentioned in the text, although samples from the population CHT were older than the others and slightly degraded, they were rescued by applying the PreCR Repair Mix from New England Biolabs. We incorporated incubation with the reagent to the standard protocol that we used for all samples. The other samples had no quality problem.

      Furthermore, the reviewer raises concerns regarding data quality for the DP population samples. After careful consideration, we decided to exclude the DP data and its analysis from the main text and deferred it to the supplementary material with the necessary remarks of caution. The main text now focuses on the other three wild populations, which did not have those issues.

      Additionally, as per the reviewer’s suggestion, we discuss the additional caveats of bait-based targeted sequencing, especially for B30.2 domains._

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?<br /> Statement in the abstract about fewer (not 'less') NLRs in lab strains vs wild: consider revising to average # NLRs per individual._

      We changed the respective sentence in the abstract.

      Revise Fig 3C legend: 'Totally discovered NLR genes', e.g., to 'total # NLR genes'.

      The recommended change was made to the figure legend.

      Reviewer #3 (Significance):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.<br /> This study elucidates the pan-NLRome of zebrafish via application of a somewhat new technique.
      • Place the work in the context of the existing literature (provide references, where appropriate).<br /> The authors apply a targeted capture and long read technology previously used in plants (Weyer 2019) to fish, thereby elucidating the pan-NLRome of zebrafish, a vertebrate with a large # of NLR genes. Previous studies has shown NLR gene variation in zebrafish, but had not compared levels within and between strains, or estimated a total NLR # across populations._

      • State what audience might be interested in and influenced by the reported findings.<br /> Those interested in immune function, genetic diversity and genome evolution.

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.<br /> Immune genes and evolution. The specific statistical approach_

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The authors performed exon capture sequencing of FISNACHT and B30.2 domains from individuals among various inbred and wild zebrafish populations. This study provides average numbers of domains per fish and thus a valuable estimate for the size of the pan-NLRome in zebrafish.

      Major comments:

      • Are the key conclusions convincing?

      Yes, and overall their findings are also consistent with what has been found in other model systems, most notably Arabidopsis (Weyer 2019, https://www.cell.com/cell/fulltext/S0092-8674(19)30837-2).<br /> - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      'Complex patterns of inheritance' is unclear from significance statement.

      For mechanisms driving copy number differences, haplotypes/segregation need not be presented as a model opposed to birth and death gene evolution/tandem duplication. These may both operate at different scales.<br /> - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      Not necessary<br /> - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      NA<br /> - Are the data and the methods presented in such a way that they can be reproduced?

      Yes, overall, though the authors could provide sequences for their adapters, similar to Weyer 2019. It seems that raw data for this study is not yet available in NCBI: https://www.ncbi.nlm.nih.gov/bioproject/966920<br /> - Are the experiments adequately replicated and statistical analysis adequate?

      Yes

      Minor comments:

      • Specific experimental issues that are easily addressable.

      Their targeted exon capture approach cannot ensure that all NLR genes will be sequenced. This method also does not sequence across other exons from NLR genes, providing only a partial view of NLR gene structure and evolution. This may miss for example additional integrated domain architectures or evidence of physical clustering in the genome (Thatcher 2023, https://bsppjournals.onlinelibrary.wiley.com/doi/10.1111/mpp.13319). Nevertheless, focusing on capturing and enumerating FISNACHT domains as the most conserved and characteristic domain for NLRs appears to be a reasonable approach for estimating rough copy numbers.

      Unfortunately, mapping these domains back to only a single reference genome also means that the accuracy of their predictions for these domains, including which belong to genes or pseudogenes, is likely to be more error-prone (Wang 2019, https://www.pnas.org/doi/10.1073/pnas.1910229116), particularly for more divergent sequences. This may lead to additional limitations for assigning one-to-one relationships between FISNACHT domains, which they measure, and genes, which they seek to enumerate.

      In addition, many of their samples appear to be of poor DNA quality, meaning that modeling estimates of domain copy number for the population can appear erroneous, even perhaps off by as much as a factor of 10 for the DP strain. This may contribute to some curious 'artifact' in some of the figures, such as Figs S2B1, S3B, S4C2, particularly for the B30.2 sequences in DP. The authors mention 'low sequencing depths' and 'low coverage' as contributing factors for this 'artifact' but also invoke possible 'evolutionary factors' in the discussion. Additional discussion for the apparent experimentally-induced effects on underestimating B30.2 sequences (perhaps due to increased DNA fragmentation and smaller domain size?), rather than coincidence of low coverage and actual biological strain-specific loss of B30.2 domains, appears warranted.<br /> - Are prior studies referenced appropriately?

      yes<br /> - Are the text and figures clear and accurate?

      yes<br /> - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Statement in the abstract about fewer (not 'less') NLRs in lab strains vs wild: consider revising to average # NLRs per individual.

      Revise Fig 3C legend: 'Totally discovered NLR genes', e.g., to 'total # NLR genes'.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This study elucidates the pan-NLRome of zebrafish via application of a somewhat new technique.<br /> - Place the work in the context of the existing literature (provide references, where appropriate).

      The authors apply a targeted capture and long read technology previously used in plants (Weyer 2019) to fish, thereby elucidating the pan-NLRome of zebrafish, a vertebrate with a large # of NLR genes. Previous studies has shown NLR gene variation in zebrafish, but had not compared levels within and between strains, or estimated a total NLR # across populations.<br /> - State what audience might be interested in and influenced by the reported findings.

      Those interested in immune function, genetic diversity and genome evolution.<br /> - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Immune genes and evolution. The specific statistical approach

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors have examined the variation of the immune related gene family "Nucleotide-binding domain Leucine-rich Repeat containing" (NLR) in the zebrafish Danio rerio. These proteins, while highly divergent, are conserved across animal and plant species. In humans or rodents, the number of these genes is relatively small (20-30) but other species such as Arabidopsis appear to have >10,000 NLR genes.<br /> Schäfer et al. collected 67 wild-type zebrafish from 4 independent sites near the Bay of Bengal and added 8 fish each from the laboratory strains Tübingen (TU) and Cologne (CGN). Using DNA capture and sequencing technology, they identified 1,560 unique "FISNA-NACHT" domains representing different NLR genes and 574 NLR-associated B30.2 domains. A subset, 714 and 229 respectively, were identified in all of the fish populations representing a "core" set common NLR genes. A significant subset of the genes could not be aligned to the GRCz11 reference suggesting significant variation across subpopulations of zebrafish.

      Major comments:

      None. The work is straight-forward, carefully done and conservatively interpreted.

      Minor comments:

      GRCz11 is still a very fragmented assembly, particularly chr4 where heterochromatin repeats on the long arm were intractable to short read sequences. There is a more recent assembly generated by the Tree of Life initiative (GCA_944039275.1) from the SAT fish line that may allow a more robust alignment and placement of the NLR genes, perhaps even some phasing of unassembled fragments.

      Significance

      General assessment:

      Obtaining more data on the inherent variation within species of NLR genes as well as collecting more cross-species data for evolutionary comparisons is valuable for our understanding of this interesting but poorly understood class of immune response genes as well as the dynamics of gene duplication/deletion in complex, repeated arrays.

      • The major strength of the study is the efforts put into capturing wild-type zebrafish from multiple different locations to maximize the diversity of the NLR sequences.
      • The primary limitation is targeted capture is always contingent on having enough homology in the probes to capture all the desired genes in roughly even proportions. Some of the more interesting NLR genes might have diverged too much to be properly captured but could provide critical information on evolutionary functional adaptation. Similarly, while based on cost, it is understandable why sequence capture was chosen, but phasing information across clusters could help explain (or discover) very diverse haplotype sequences in the introns of many of these gene arrays. As sequencing costs drop, this is an important aspect to the evolution of zebrafish chromosome 4 (as well as the rest of the genome). "Pangenome" differences in the zebrafish genome might be quite radically different.

      Advance:

      The study represents an important characterization of a poorly understood but important class of immune genes. A very similar (but more detailed) characterization of Arabidopsis NLR genes by Van de Weyer et al., Cell 2019, is considered an important advance to the plant biology community and has been highly cited since its publication. The collected data is potentially quite useful for future studies that want to understand the complex dynamics of highly repetitive, yet functional regions of the genome and how it can result in the creation and extinction of new genes.

      Audience:

      There are two main likely audiences, those interested in how NLR genes are involved immune protection and those interested in genomes and evolution. It probably doesn't reveal a fundamental biological principle that would be of broad, general interest to the research community.