6,070 Matching Annotations
  1. Sep 2023
    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This paper from the Heisenberg lab takes a reductionist approach to understanding how BMP and Nodal signaling interact to coordinate morphogenesis. They mostly use blastoderm explants that they culture in vitro. These explants elongate over time, with Nodal signaling that induces mesendoderm driving the cell intercalations that explain the elongation. They show that increased BMP signaling inhibits this process, but reducing BMP signaling has no effect. They see that reducing Nodal signaling results in an upregulation of BMP activity as read out by phosphorylated Smad5 staining and increasing Nodal signaling has the opposite effect. They explain this mostly by the observation that Nodal induces the expression of the BMP antagonist, Chordin, and validate this idea by demonstrating that a reduction in Chordin expression reduces explant elongation. Returnign to the embryo, the authors show that manipulation of Nodal signaling levels influences the size of the BMP activity gradient as expected from the in vitro results. Finally, they show that reduction of Nodal signaling with SB505124 sensitises the embryos the effects of bmp2b overexpression, and that BMP overeactivation at 90% epiboly reduced C&E movements.

      Major comments

      In general I think the work is well done and the data justify the conclusions.

      I have several suggestions for additional experiments and discussion that I think would improve the paper.

      1. In Figure S1 they present data on elongation of explants treated with a Nodal inhibitor. It would be good to show some examples of images of the explants.
      2. In Figure 1G and 3A, the same wildtype images are shown. This is mentioned and I assume therefore that the results were all part of the same experiment. How many times were these experiments performed? It would be much better to use different biological replicates in the two figures.
      3. It is important for the authors to make clear how many biological replicates each of the experiments correspond to.
      4. In Figure 4E, it would be good to show the levels of P-Smad2 in the Oep and MZ lefty1, 2 explants.
      5. On page 11 the authors mention chordin-independent inhibition of BMP signaling. The most likely candidate would be noggin as it too is expressed dorsally and is at least in part activated by Nodal. This should be tested in their model.
      6. The authors focus on Chordin as downstream of Nodal signaling, and discuss the role of Nodal signaling in inducing chordin as being due to peak Nodal signaling. However, Chordin has been shown to also be downstream of Fgf signaling and Bozozok (PMIDs 23499658 and 16873584), which likely explains its dorsal expression domain. Furthermore, Rogers et al, (PMID 33174840) who the authors refer to, also show that to disrupt BMP signaling in embryos, inhibition of Nodal and Fgf is required. These issues need to be discussed in more detail. It is the combinatorial signaling that is thought to be responsible for the dorsal location of the chordin (and noggin) expression domains.

      Minor comments

      I think in general the manuscript is well written and the figures are clear. Previous data is generally well cited. My only comment is that there is a wealth of data from Xenopus and zebrafish that BMP antagonists are induced as a result of combinatorial Nodal signaling and other pathways (dorsal wnt and fgf) that inhibit BMP signaling. I think this could be better referenced.

      Significance

      The paper is well done and provides important information about the interactions between Nodal and BMP signaling to induce axis elongation. I think the work would be improved if the authors revise it along the lines suggested above. In terms of novelty, many of the component parts of the paper are known (Nodal signaling is important for elongation via cell intercalation and Nodal and BMP can antagonize on another by the induction of BMP antagonists by Nodal), but it is novel to put them together to investigate axis extension using explants. The paper will be of interest to those interested in how these signaling pathways operate in early vertebrate development and to those interested in morphogenesis.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The authors presented intriguing observations on the molecular mechanisms regulating morphogenic cell movement, with a particular focus on convergent-extension (CE) movement associated with cell type specification in the zebrafish blastoderm explant. In this manuscript, Schauer et al. identified the CE movement of the mesendoderm as triggering the elongation of the zebrafish embryonic explant. In this process, the Nodal signal represses the BMP signal, which negatively regulates the movement of the mesendoderm precursors, through the induction of its inhibitor chordin. This suggests that the Nodal signal is the key factor coordinating cell fate specification and morphogenesis in the zebrafish blastoderm explant. Finally, suppression of Nodal signalling increases sensitivity to BMP signalling in the CE movement of intact embryos. This suggests that promotion of mesendoderm cell intercalation via BMP suppression by Nodal may be involved in conferring robustness to morphogenic cell movement in vivo.

      Major comments

      1. While one of the main conclusions of this manuscript is that "Nodal signaling regulates CE movement of mesendodermal cells by promoting their intercalation through inhibition of BMP signaling". However, this was predicted by changes in individual cell morphology and cell dispersal, and the authors didn't directly examine the behavior of individual cells. It would be better to confirm intercalation during the process of explant elongation by cell tracking analysis.
      2. Although the authors discuss that Nodal signaling inhibits BMP signaling in the later gastrulation stage, this has not been experimentally tested. If possible, the time window in which Nodal signaling acts should be investigated by temporal inhibition of Nodal signaling using chemical inhibitors.
      3. Only the signal gradient of pSmad5 and axis elongation were examined in the intact embryo part of the study (Fig. 6 and Fig. S7). The information on the domain of pSmad2 and the expression of chordin would be helpful for the comparison of the blastoderm explant and the intact embryos.

      Minor concerns

      The first letter of a gene name should be in lowercase. ( ex. Fig.S3C; Smad5 MO)

      Significance

      The zebrafish blastoderm explant assay has the potential to elucidate the molecular mechanisms regulating the complex processes of morphogenesis during vertebrate gastrulation, as the authors demonstrate in this paper. In this manuscript, the authors addressed the molecular mechanism coordinating cell fate specification and morphogenic cell movement in the blastoderm explant. All of the experiments are well-designed, the interpretation of the results is convincing and the paper is well-written. Also, the conclusion is very clear and well supported by the presented data. These findings provide fundamental and important insights for studying morphogenic cell movements in early vertebrate embryos using zebrafish blastoderm explants. On the other hand, most of the molecular mechanisms reported in this manuscript are already predicted by previous studies using intact embryos. Therefore, the impact of this work may be limited to ex vivo research.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary: This manuscript describes molecular mechanisms by which ACBD3 is recruited to the Golgi complex. ACBD3 recruits PI4KIIIb which is required to generate PI4P, a phosphoinositide which is key for the recruitment of essential Golgi proteins and hence is key to Golgi identity. The authors have used a combination of mass spectrometry, high quality fluorescence imaging, transient CRISPR knockdowns, and biochemical approaches such as IPs to identify the key determinant for recruitment of ACBD3 to the Golgi complex. They map the interaction between ACBD3 and the Golgi as a unique region (UR) upstream of its GOLD domain, identifying, in particular, an MWT motif as key for this recruitment. Using mass spectrometry they identify several novel interactors of ACBD3 as well as some established binding partners. Knockdown of these interactors reveal a key role for the SNARE, SCFD1, where reduced levels lead to complete loss of ACBD3 localisation to the Golgi without apparent disruption of Golgi structure. They further validate this interaction and that of another SNARE (Sec22b), which is part of the same SNARE complex as SCFD1, mapping the interaction to the longin domain of Sec22b. Surprisingly however they demonstrate that the UR domain does not mediate the interaction between ACBD3 and these SNAREs suggesting an alternative mechanism of recruitment. Previously identified ACBD3 interactors, Golgi proteins giantin and golgin-45 were also identified in the mass spectrometry screen and the authors demonstrate that these two proteins can recruit ACBD2 to the Golgi and this is dependent on the MWT motif identified in the UR domain. By knocking down SCFD1, they show reduced recruitment of ACBD3 leading them to propose a model of sequential recruitment of ACBD3 by SCFD1 followed by interactions with the golgins.

      Major points: This study is a well-executed and rigorous study of the molecular requirements for the recruitment of ACBD3 to the Golgi. The experimental approaches are state-of-the-art and the data are clean and convincing. The only caveat, raised by the authors themselves, is their interpretation that there are two sequential steps for Golgi recruitment of ACBD3. While they show that loss of SCFD1 reduces the interaction of ACBD3 with giantin and golgin 45, their model depends on doing the reverse experiment, i.e. assessing the effects of knocking down either giantin or golgin-45. This is especially relevant given the demonstration that golgin-45 is sufficient to recruit ACBD3 to mitochondria. It may well be that recruitment involves a tripartite complex, which is not uncommon in vesicular transport mechanisms Giantin is not an essential protein do it should be feasible to perform this experiment. The authors are equipped in the quantitative fluorescence microscopy which would be required and which would help resolve whether sequential or redundant mechanisms are required for ACBD3 recruitment.

      We thank the reviewer for the positive comments and are glad that they consider our study "well-executed and rigorous". We totally agree with the reviewer that our conclusions regarding the sequential aspect of the recruitment of ACBD3 in the original submission could be better supported. We have worked to strengthen this in our resubmission. As the reviewer states, this limitation was already discussed in the original submission. To further support our model, we have performed the experiment suggested by the reviewer, in which we test the effects of knocking down both giantin and golgin45 (double knockdown) on the binding of ACBD3 to SCFD1.

      The results of this experiment further support our sequential model with little to no effect of loss of the Golgins on ACBD3. As we already knew, a large effect of SCFD1 KO on the binding of the Golgins to ACBD3 was also observed here. We should note that this was performed in a different cell line than before (HeLa cells rather than HEK cells), as the efficiency of multiple knockdowns was much lower in HEK cells, as determined by qPCR. Taken together, the new data in Figure 7 supports a sequential model for Golgi recruitment. We also agree that other, less likely models could explain our data and have included this openly in the discussion. In conclusion, we thank the reviewer for their comments and have revised the manuscript with a new experiment with the relevant repeats, which supports our model.

      Reviewer #1 (Significance):

      Significance PI4P is a phosphoinositide that is important for the recruitment of Golgi proteins. As with most PIs it is likely to act by coincidence detection in that Golgi associated proteins will recognise PI4P as well as other factors on Golgi membranes. This results in different local membrane environments which will be specific for particular functions. PI4KIII__b_ is key for PI4P production although the absolute levels of PI4P are likely to be determined by a balance of lipid kinases and phosphatases. However, since ACBD3 is key for the recruitment of PI4KIII__b, it is important to understand the molecular mechanisms by which it is recruited. The manuscript thus makes a significant contribution to understanding one of the underlying mechanisms for PI4KIII__b _recruitment although, as indicated above, stops short of establishing a clear model for the roles SCDF1 and Sec22b versus golgin 45 and giantin. For the future it will be of interest to determine why either a sequential or a redundant mechanism is required for the recruitment of ACBD3 as a scaffold protein.

      We thank the reviewer for this set of positive comments on the manuscript and for agreeing that this is a significant contribution. Our revised version further supports our sequential model of ACBD3 recruitment to the Golgi apparatus, and the comments here have helped us further to strengthen the quality and clarity of the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary This is a very interesting and potentially important paper for the field of membrane biology and membrane trafficking, in which the authors have studied the molecular mechanisms by which ACBD3 (and consequently PI4KIIIb) is recruited to the cis-Golgi membranes. The authors suggest that this recruitment is based on a two-step process, mediated by interactions to, on the one hand, SCFD-1 (SLY1) and, on the other hand, two redundant golgins (golgin-45 and giantin).

      We once again thank the reviewer for the positive comments and are glad that they consider our manuscript important.

      Comments:- Pg.1 : arfaptins, as far as I know, have not been shown to be involved in intra-golgi trafficking but rather in Golgi export (see e.g. ref. 12)

      We thank the reviewer for pointing this out. We have corrected the text accordingly.

      • Pg. 1: reigon --> region

      We thank the reviewer for noticing this typo. We have corrected the text accordingly.

      • Arf1 also recruits PI4KIIIb right?

      This is correct. The De Matteis lab has shown that PI4KIIIβ associates with the Golgi complex in an Arf1-dependent manner (Godi et al. 1999). We think this is excellent work. However, Arf1 is somewhat of a master regulator of the Golgi, affecting the recruitment and localisation of many different Golgi proteins. It has also previously been reported that Arf1 does not directly interact with PI4KIIIβ (Klima et al. 2016). Overall, the molecular relationship between Arf1 and the kinase remains unclear. We do not exclude, however, that there are factors other than ACBD3 important for recruiting and regulating PI4KIIIβ levels at the Golgi. We have changed the wording in the manuscript to reflect that there are multiple ways that PI4KIIIβ is recruited to the Golgi apparatus.

      Fig. S1: the information about the number of cells per experiment is missing. Also, please add the information about what exactly is represented in the box plots (is it the distribution of the mean value of R per experiment? or the total distribution on a cell-by-cell basis of a representative experiment?)

      For each experiment, a minimum of 100 cells per condition were imaged. The Pearson's correlation was then calculated, and the average was taken for each biological repeat. The plot in Fig. S1B represents 3 independent biological repeats. We have included this information in the revised manuscript.

      • The definition of Avg. Golgi int/avg. cell int. (a.u.) in Fig 1E,F is a bit difficult to understand to me. If I understand correctly, the total fl. int in the Golgi mask was computed and divided by the area of the Golgi mask (this is the av. Golgi intensity). A similar computation is done for the entire cell (including the Golgi), i.e., total fl. intensity in the cell mask is computed and divided by the area of the cell mask. Then the two av. intensities are divided (ratio = av. Golgi int / av. cell int.). This ratio, for a protein that is enriched in the Golgi area, should be larger than 1. For a protein that is equally distributed all over the cell, it should be 1, and for a protein that is excluded from the Golgi area, smaller than 1. Then to this value, the authors subtract the value of the ratio found for an inert construct (GFP of Halo alone), which I imagine should have an original ratio value of the order of 1, and hence, after this subtraction, norm. ratio values larger than 0 mean that they are more enriched at the Golgi area than GFP/HaloTag themselves. Is this correct? In principle, I don't see anything entirely wrong with this way of thought, but I just found it a bit difficult to understand, and in general one has to be careful when computing rations (quotients) and then subtract another ratio. Also, the units are not a.u., the value is dimensionless, what is "arbitrary" is the definition of 0 value and the based on this definition, also the actual value. I think it would probably be much clearer for the readers to compute somthing like the relative enrichment in the Golgi area as compared to the rest of the cell (excluding the Golgi area). That is, a value r'=(Int. Golgi mask / Area Golgi mask) / [(Int. Cell mask - Int. Golgi mask)/(Area cell mask - Area Golgi mask)]. This can be computed directly or defining a mask that is the cell mask - the Golgi mask. Also, some maths (unless I made a mistake) give that this r'= r (1-aG)/(1-r aG); where r is the ratio (before subtraction) defined by the authors, and aG=Area cell mask/Area Golgi mask. In any case, I'd suggest the authors to either adopt this other quantitation (without subtraction of the GFP/HAloTAG), which gives directly the fold-enrichment in the intensity density in the Golgi area with respect to the rest of the cell; or explain in more detail the maths of the value they are plotting now.

      We thank the reviewer for these well-reasoned and thoughtful suggestions for our imaging analysis. These are issues that we have also considered when quantifying this dataset. At the heart of it, the second method of calculation (Golgi/outside of Golgi), results in a non-linear distribution, as the pool of proteins re-distribute from inside the Golgi to the cytosol. This is why we have chosen to use the first method of Golgi/total, as it provides a linear distribution.

      The reviewer is also correct that the GFP (inert protein) ratio is 1 without adjustment. We have chosen to normalise to GFP/HaloTag (inert protein) as we think this is the clearest way of conveying our conclusions from these experiments. We have included the non-normalised graph here for the reviewer to see; however we thought that this conveys the key result less clearly. Overall, we agree this was poorly communicated in the manuscript and we have clarified it in the revised version.

      • Fig. 1C&F: Besides the MWT mutant, the FKE mutant also seems to have a somewhat compromised Golgi localization. Have the authors followed on that, or what is the reason that they have just focused on the MWT mutant?

      In contrast to the MWT mutant, the FKE mutant does not affect ACBD3 localisation significantly. In addition, when having a close look at the pdb structure of the GOLD domain of ACBD3 with 3A protein of Aichivirus A (5LZ3), the MWT patch, in particular residues M and T, make clear contact with protein 3A, which is not the case for FKE residues. Therefore we focused on the MWT residues, which we hypothesised to interact with a Golgi resident protein which competes with protein 3A to interact with ACBD3.

      • Very minor point, and without wanting to sound pedant at all, but I think (I might be wrong of course, so apologies if I am) that the plural of apparatus in latin is not apparati, but apparatus (fourth declination). So, I'd change the word in page 2 (or just rephrase the sentence: e.g. "resulting in Golgi fragmentation"). But of course, I'd leave this to the authors' discretion.

      We thank the reviewer for this precision, do not consider it pedantic, and have made the suggested change to the text.

      • Fig. 3A: have the authors tried or been able to perform IF of the endogenous SCFD1 protein?

      As suggested by the reviewer, we attempted to perform IF of endogenous SCFD1, as shown below. Despite trying several different antibodies, we were not satisfied that we were detecting real SCFD1 signal as there was no change in this staining upon SCFD1 CRISPR KO. Please see an example of this IF below (ProteinTech, 12569-1-AP). We have contacted the antibody manufacturers to inform them of this issue.

      • Similarly to what has been done for other panels, could you quantify Fig. 3C? Are PI4KIIIb protein levels affected upon the different KOs?

      As suggested by the reviewer, we are now showing in Figure S2D the percentage of cells with a partial or total loss of PI4KIIIβ at the Golgi in CRISPR-Cas9 KO cells of either PI4KIIIβ, ACBD3 or SCFD1. 3 independent biological repeats were performed and approximately 150 cells were quantified (~50 cells per condition). The results show that the PI4KIIIβ antibody used (BD Bioscience, 611816) is specific (93.22% of cells lose the antibody signal) and that ACBD3 and SCFD1 KO affects PI4KIIIβ recruitment to the Golgi in 88% and 73% of the cells, respectively._-

      The last paragraph of the "SCFD1 and ACBD3 interact upstream of PI4KIIIβ recruitment to the Golgi apparatus" section reads a bit odd placed there. I think it is more appropriate for the discussion or for the intro part on SCFD1.

      Many thanks to the reviewer for pointing this out. We simplified that paragraph to describe the relationship between SCFD1 and SEC22B.

      • I am confused on Fig. 5A/B. The labels in the blots show that 390-528 (without UR) does not bind sec22 or scfd1, but the 368-529 does? Or I guess, judging by the MW seen in the middle blots, that there's some error in the labelling?

      Many thanks to the reviewer for noticing this, which was clearly a labelling error. We corrected this accordingly in Figures 5A and B. We apologise for this oversight.

      also, the IP efficiency of the MWT mutant in the panel A blot is quite low, still sec22 seems to be very efficiently pulled down. Can the authors comment on that please? Would co-IPing against endogenous sec22 and scfd1 would work (so you don't need to rely on HaloTag+ligand?)

      We know that the MWT residues of ACBD3 are important for recruiting ACBD3 to the Golgi (Figure 1C and F). We also know that ACBD3 interacts with SEC22B and SCFD1 (Figure 3B and 4A) and that SCFD1 is important for ACBD3 Golgi recruitment. Therefore we initially speculated that ACBD3 interacts with SEC22B and SCFD1 through the MWT residues. However, as the reviewer points out, Figure 5 shows the opposite. Mutating MWT residues makes the interaction of ACBD3 with SEC22B and SCFD1 stronger. For this reason, we hypothesised that another player(s) also contributes to ACBD3 recruitment through interactions with the MWT residues. We have shown that the second recruitment factors are the 2 golgins, golgin-45 and giantin (Figure 6C). In short, whilst we agree that the IP efficiency is low, the binding is actually stronger, supporting our conclusions. No interaction of ACBD3 with endogenous SEC22B could be detected due to a lack of a sufficiently sensitive antibody (we tried Abcam ab181076 and ProteinTech 14776-1 AP).

      • I really like the experiment 6B. Have the authors tested whether SEC22 is also recruited to mitochondria in those conditions? But not SCFD1?

      We thank the reviewer for the positive comment. We have performed the suggested experiment and are now including this as an additional figure (Figure S3). Ectopic expression of golgin-45 targeted to the mitochondria is not sufficient to redistribute SCFD1-HaloTag or HaloTag-SEC22B to the mitochondria (Figure S3A and B, respectively). We, therefore, speculate that the fraction of ACBD3 that gets redirected in Figure 6B must be the small fraction of ACBD3 that is spontaneously in an open conformation and compatible for interaction with golgin-45.

      • The results shown in Fig 7 might show a partial depletion in the interactions, but to be fully trusted they would need to be quantified and a statistical test used to compare the values. I think this part is important to show very clearly, because even with low binding to golgins (remember, single knockouts do not prevent Golgi localization of ACBD3), one could expect that ACBD3 still localized to the Golgi but it does not in the absence of SCFD1 as shown in this paper. A prediction of the proposed model is that in cells depleted of the two Golgins, SCFD1 and ACBD3 should still bind to one another, right? Did the authors test this?

      We fully agree with the reviewer. As discussed in the replies to reviewer 1, we have repeated this experiment, including both sets of KO. This was not trivial, as a double transient KO is technically challenging and involves validation with qPCR and switching cell types (HEK cells to HeLa). The new data supports our current model and suggests some additional regulatory mechanisms at play.

      • The model presented here (fig 8) seems to suggest that only the conformational variation of ACBD3 that binds Golgins is able to recruit (bind) PI4KIIIb. Is this known, or is there any experimental evidence for that?

      HDX-MS experiments show that the ACBD and GOLD domains undergo conformational changes in the presence of 3A proteins (McPhail et al. 2017). Demonstrating this would require a complicated reconstitution experiment which is technically very challenging and would involve purifying various complex proteins, including SNAREs, SM proteins and golgins. This could perhaps be the subject of several future studies.

      • Have the authors thought about testing the FKE mutant in the experiemnts shown in Fig. 5?

      As mentioned above, since the FKE residues are not making any contact with the protein 3A and since the loss of ACBD3 recruitment to the Golgi is not statistically significant (Figure 1F), we haven't tested the FKE mutant for the binding to SEC22B and SCFD1. We do, however, agree with the reviewer that there might be something interesting happening here. We would like to experimentally interrogate this in future studies and develop more sensitive assays to test if there is a significant effect with the FKE mutant.

      In general, I think the title might be a bit misleading because of the use of PI4Kiiib. I understand what the authors mean, but because they have not thoroughly tested PI4Kiiib recruitment in their experiments, I think they should focuse rather on the mechanism of recruitment of ACBD3 the authors have found.

      We thank the reviewer for their advice regarding the manuscript title, and this is something that we have discussed internally. We chose that title as it highlights the key mechanistic impact of our findings and note that we did include a figure on the recruitment of PI4KIIIβ. However, we remain open to discussing this with advice from the journal editorial team.

      Reviewer #2 (Significance):

      I think, as said above, that this is potentially an important paper for the field of membrane trafficking and membrane biology. Most of the experiments are in general well performed and well controlled, and the paper is clearly written and follows a logical line.

      We once again thank the reviewer for their comments and overall thoughtful and considered review. We believe that the suggestions here have improved the manuscript.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Stalder and colleagues report experiments designed to identify interactors of the Golgi-localized protein ACBD3 (a.k.a. GCP60), and to delineate mechanisms that allow ACBD3 to localize at Golgi compartments. ACBD3 is a 528aa protein with diverse previously reported interactions and functions, both in normal physiology and as a host factor in viral assembly processes. Stalder et al. first map which domains of ACBD3 are required for Golgi localization in HeLa cells, concluding that residues 368-528 are sufficient for localization. This region includes a GOLD (GOLgi Dynamics) domain previously reported to interact with Golgin tethering proteins. Alanine scanning identifies the motif MWT just upstream of the GOLD motif as necessary for Golgi localization. Acute CRISPR knockout identifies two Golgins, Golgin45 and Giantin, as necessary for ACBD3 Golgi localization, and IP indicates that the MWT motif breaks this interaction. These data are a bit scattered around the paper but taken together are reasonably persuasive, particularly when viewed in context with published work. This reader would have found the manuscript easier to follow had the Golgin and MWT motif data been presented en bloc.

      We thank the reviewer for these comments and have considered presenting and rewriting the data as the reviewer suggested. On reflection, we have decided to present it in the original order. We feel that this allows us to highlight the two independent mechanisms individually, bringing them together at the end. In addition, as the experiments were performed in the order presented, it allows for more appropriate controls for each experiment rather than trying to combine them. We hope the reviewer accepts our preferred order.

      In a second set of experiments, IP-mass spec is used to identify ACBD3 interactors that might assist in the protein's localization. The MS data presented are filtered to exclude proteins not already identified as Golgi-localized. This is, I think, a mistake. Even if the authors choose to focus on known Golgi interactors as candidates for a localization function, the biological functions of ACBD3 are far from fully understood, and the full dataset would be of value to both cell biologists and virologists.

      We agree with the reviewer that there are many interesting mysteries surrounding ACBD3 and have therefore included an additional table (table S1) in the revised manuscript, showing the dataset of newly identified ACBD3 interactors before applying the Golgi localisation filter.

      Hits in the filtered dataset include the R-SNARE Sec22B, and the SNARE chaperone Sly1/SCFD1. Acute CRISPR inactivation of Sec22 decreases ACBD3 localization to the Golgi and SCFD1 inactivation more or less abolishes localization. Co-IP experiments are used to argue that ACBD3 interacts with the N-terminal regulatory Longin domain of SEC22B, as well as with SCFD1. The Sec22 data are more detailed and persuasive. No experiments with purified proteins are presented to establish that the detected interactions are direct rather than mediated through a bridging factor or factors. Importantly, SCFD1 is likely to have multiple different client SNARE complexes that operate at different stages of ER and Golgi traffic. Hence its inactivation is likely to be pleiotropic and consequently phenotypes arising must be interpreted with caution.

      We completely agree that studying membrane trafficking in an interconnected system is challenging. We also agree that direct binding experiments in reconstituted systems would be key to proving our model. Our data uses multiple different experimental approaches, including co-localisation, co-immunoprecipitation, CRISPR-KO, and biochemistry, to support our model. In the future, we agree full reconstitution would be necessary to examine this further, and we hope that either ourselves or others can do this in further studies.

      Lastly, the authors perform IP experiments which show that ACBD3-Golgin co-IP efficiency is lower in cells with acute inactivation of SCFD1. This epistatic relationship is used to argue for a sequential model of recruitment with SCFD1 and perhaps client SNARE proteins operating upstream of ACBD3-Golgin interaction. This argument is not persuasive because we do not know whether SCFD1 and its downstream activities increase the rate of ACBD3-Golgin complex asssembly, or alternatively stabilizes ACBD3-Golgin complexes, decreasing the rate of their dissociation.

      We agree with this weakness in our original submission, and it is a comment shared among all reviewers. Overall, we feel that we have chosen the model that best summarises our data. We, of course, accept that there are still components of this pathway that need clarification and are open for further study. This includes the issue raised here by the reviewer, as well as the intriguing observation that both golgins are transcriptionally upregulated upon SCFD1 KO in HeLa cells. In the revised manuscript, we have more clearly laid out the weaknesses of our model in the discussion and suggested future experiments to help clarify some of these issues. We have also modified the model to reflect some of these potential additional regulatory mechanisms.

      In general the methods are fairly clear but that there is room for improvement. The "high throughput" imaging pipeline is not clearly described.

      We agree with the reviewer, and apologise for not clearly explaining this. We feel that this unbiased approach of quantification is particularly rigorous and we have clarified this in the methods section of the updated manuscript.

      Each figure legend should specify the microscopy methods used, and for each result the number of biological replicates and cells analyzed should be specified.

      We agree with the reviewer and have included these details appropriately in the revised manuscript.

      The statistical methods (Student, Tukey, etc.) used for each experiment should be specified. Saying that statistics were calculated using Python 3.7 is useless without additional details. e.g. at least the libraries and codebase used should be indicated or deposited.

      We agree with the reviewer and have updated the manuscript accordingly. In short, all comparisons were made using either Student's t-test or Multiple Comparison of Means - Tukey HSD, FWER=0.05. These were conducted in Python 3.9 using pandas, matplotlib, seaborn and scipy. We used the MultiComparison function in scipy, and the comp.tukeyhsd for the post-hoc adjustment.

      Many figure labels (e.g. Fig. 2) use absurdly small fonts.

      We apologise for this. We believe that this is because we submitted it with in-line formatting. Our resubmission has full-page figures, and we feel the text is clearer now.

      The mass spec hits obtained should be provided both with and without exclusion of non-Golgi-localized proteins.

      We agree with the reviewer. Please see the new Table S1.

      Reviewer #3 (Significance):

      In general I think this is a useful and well controlled set of experiments producing useful insights. However, the interpretations need to be more carefully considered, and alternative interpretations must laid out as clearly as possible. Specifying the limitations of the study will make it more, not less, useful to the field. If the authors want to make the case more robustly that the interactions described are mediated through direct binding, or that the operation of SCFD1 and Golgins operate sequentially to recruit ACBD3, additional wet bench work will be required which will of course take time to complete.

      We once again thank the reviewer for the thoughtful and critical comments. These have helped to strengthen the manuscript. We have performed the additional bench work requested by the reviewer, which has further supported the paper and our model.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Stalder and colleagues report experiments designed to identify interactors of the Golgi-localized protein ACBD3 (a.k.a. GCP60), and to delineate mechanisms that allow ACBD3 to localize at Golgi compartments. ACBD3 is a 528aa protein with diverse previously reported interactions and functions, both in normal physiology and as a host factor in viral assembly processes. Stalder et al. first map which domains of ACBD3 are required for Golgi localization in HeLa cells, concluding that residues 368-528 are sufficient for localization. This region includes a GOLD (GOLgi Dynamics) domain previously reported to interact with Golgin tethering proteins. Alanine scanning identifies the motif MWT just upstream of the GOLD motif as necessary for Golgi localization. Acute CRISPR knockout identifies two Golgins, Golgin45 and Giantin, as necessary for ACBD3 Golgi localization, and IP indicates that the MWT motif breaks this interaction. These data are a bit scattered around the paper but taken together are reasonably persuasive, particularly when viewed in context with published work. This reader would have found the manuscript easier to follow had the Golgin and MWT motif data been presented en bloc.

      In a second set of experiments, IP-mass spec is used to identify ACBD3 interactors that might assist in the protein's localization. The MS data presented are filtered to exclude proteins not already identified as Golgi-localized. This is, I think, a mistake. Even if the authors choose to focus on known Golgi interactors as candidates for a localization function, the biological functions of ACBD3 are far from fully understood, and the full dataset would be of value to both cell biologists and virologists. Hits in the filtered dataset include the R-SNARE Sec22B, and the SNARE chaperone Sly1/SCFD1. Acute CRISPR inactivation of Sec22 decreases ACBD3 localization to the Golgi and SCFD1 inactivation more or less abolishes localization. Co-IP experiments are used to argue that ACBD3 interacts with the N-terminal regulatory Longin domain of SEC22B, as well as with SCFD1. The Sec22 data are more detailed and persuasive. No experiments with purified proteins are presented to establish that the detected interactions are direct rather than mediated through a bridging factor or factors. Importantly, SCFD1 is likely to have multiple different client SNARE complexes that operate at different stages of ER and Golgi traffic. Hence its inactivation is likely to be pleiotropic and consequently phenotypes arising must be interpreted with caution.

      Lastly, the authors perform IP experiments which show that ACBD3-Golgin co-IP efficiency is lower in cells with acute inactivation of SCFD1. This epistatic relationship is used to argue for a sequential model of recruitment with SCFD1 and perhaps client SNARE proteins operating upstream of ACBD3-Golgin interaction. This argument is not persuasive because we do not know whether SCFD1 and its downstream activities increase the rate of ACBD3-Golgin complex asssembly, or alternatively stabilizes ACBD3-Golgin complexes, decreasing the rate of their dissociation.

      In general the methods are fairly clear but that there is room for improvement. The "high throughput" imaging pipeline is not clearly described. Each figure legend should specify the microscopy methods used, and for each result the number of biological replicates and cells analyzed should be specified. The statistical methods (Student, Tukey, etc.) used for each experiment should be specified. Saying that statistics were calculated using Python 3.7 is useless without additional details. e.g. at least the libraries and codebase used should be indicated or deposited. Many figure labels (e.g. Fig. 2) use absurdly small fonts. The mass spec hits obtained should be provided both with and without exclusion of non-Golgi-localized proteins.

      Significance

      In general I think this is a useful and well controlled set of experiments producing useful insights. However, the interpretations need to be more carefully considered, and alternative interpretations must laid out as clearly as possible. Specifying the limitations of the study will make it more, not less, useful to the field. If the authors want to make the case more robustly that the interactions described are mediated through direct binding, or that the operation of SCFD1 and Golgins operate sequentially to recruit ACBD3, additional wet bench work will be required which will of course take time to complete.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This is a very interesting and potentially important paper for the field of membrane biology and membrane trafficking, in which the authors have studied the molecular mechanisms by which ACBD3 (and consequently PI4KIIIb) is recruited to the cis-Golgi membranes. The authors suggest that this recruitment is based on a two-step process, mediated by interactions to, on the one hand, SCFD-1 (SLY1) and, on the other hand, two redundant golgins (golgin-45 and giantin).

      Comments:

      • Pg.1 : arfaptins, as far as I know, have not been shown to be involved in intra-golgi trafficking but rather in Golgi export (see e.g. ref. 12)
      • Pg. 1: reigon --> region
      • Arf1 also recruits PI4KIIIb right?
      • Fig. S1: the information about the number of cells per experiment is missing. Also, please add the information about what exactly is represented in the box plots (is it the distribution of the mean value of R per experiment? or the total distribution on a cell-by-cell basis of a representative experiment?)
      • The definition of Avg. Golgi int/avg. cell int. (a.u.) in Fig 1E,F is a bit difficult to understand to me. If I understand correctly, the total fl. int in the Golgi mask was computed and divided by the area of the Golgi mask (this is the av. Golgi intensity). A similar computation is done for the entire cell (including the Golgi), i.e., total fl. intensity in the cell mask is computed and divided by the area of the cell mask. Then the two av. intensities are divided (ratio = av. Golgi int / av. cell int.). This ratio, for a protein that is enriched in the Golgi area, should be larger than 1. For a protein that is equally distributed all over the cell, it should be 1, and for a protein that is excluded from the Golgi area, smaller than 1. Then to this value, the authors subtract the value of the ratio found for an inert construct (GFP of Halo alone), which I imagine should have an original ratio value of the order of 1, and hence, after this subtraction, norm. ratio values larger than 0 mean that they are more enriched at the Golgi area than GFP/HaloTag themselves. Is this correct? In principle, I don't see anything entirely wrong with this way of thought, but I just found it a bit difficult to understand, and in general one has to be careful when computing rations (quotients) and then subtract another ratio. Also, the units are not a.u., the value is dimensionless, what is "arbitrary" is the definition of 0 value and the based on this definition, also the actual value. I think it would probably be much clearer for the readers to compute somthing like the relative enrichment in the Golgi area as compared to the rest of the cell (excluding the Golgi area). That is, a value r'=(Int. Golgi mask / Area Golgi mask) / [(Int. Cell mask - Int. Golgi mask)/(Area cell mask - Area Golgi mask)]. This can be computed directly or defining a mask that is the cell mask - the Golgi mask. Also, some maths (unless I made a mistake) give that this r'= r (1-aG)/(1-r aG); where r is the ratio (before subtraction) defined by the authors, and aG=Area cell mask/Area Golgi mask. In any case, I'd suggest the authors to either adopt this other quantitation (without subtraction of the GFP/HAloTAG), which gives directly the fold-enrichment in the intensity density in the Golgi area with respect to the rest of the cell; or explain in more detail the maths of the value they are plotting now.
      • Fig. 1C&F: Besides the MWT mutant, the FKE mutant also seems to have a somewhat compromised Golgi localization. Have the authors followed on that, or what is the reason that they have just focused on the MWT mutant?
      • Very minor point, and without wanting to sound pedant at all, but I think (I might be wrong of course, so apologies if I am) that the plural of apparatus in latin is not apparati, but apparatus (fourth declination). So, I'd change the word in page 2 (or just rephrase the sentence: e.g. "resulting in Golgi fragmentation"). But of course, I'd leave this to the authors' discretion.
      • Fig. 3A: have the authors tried or been able to perform IF of the endogenous SCFD1 protein?
      • Similarly to what has been done for other panels, could you quantify Fig. 3C? Are PI4KIIIb protein levels affected upon the different KOs?
      • The last paragraph of the "SCFD1 and ACBD3 interact upstream of PI4KIIIβ recruitment<br /> to the Golgi apparatus" section reads a bit odd placed there. I think it is more appropriate for the discussion or for the intro part on SCFD1.
      • I am confused on Fig. 5A/B. The labels in the blots show that 390-528 (without UR) does not bind sec22 or scfd1, but the 368-529 does? Or I guess, judging by the MW seen in the middle blots, that there's some error in the labelling? also, the IP efficiency of the MWT mutant in the panel A blot is quite low, still sec22 seems to be very efficiently pulled down. Can the authors comment on that please? Would co-IPing against endogenous sec22 and scfd1 would work (so you don't need to rely on HaloTag+ligand?)
      • I really like the experiment 6B. Have the authors tested whether SEC22 is also recruited to mitochondria in those conditions? But not SCFD1?
      • The results shown in Fig 7 might show a partial depletion in the interactions, but to be fully trusted they would need to be quantified and a statistical test used to compare the values. I think this part is important to show very clearly, because even with low binding to golgins (remember, single knockouts do not prevent Golgi localization of ACBD3), one could expect that ACBD3 still localized to the Golgi but it does not in the absence of SCFD1 as shown in this paper.
      • A prediction of the proposed model is that in cells depleted of the two Golgins, SCFD1 and ACBD3 should still bind to one another, right? Did the authors test this?
      • The model presented here (fig 8) seems to suggest that only the conformational variation of ACBD3 that binds Golgins is able to recruit (bind) PI4KIIIb. Is this known, or is there any experimental evidence for that?
      • Have the authors thought about testing the FKE mutant in the experiemnts shown in Fig. 5?
      • In general, I think the title might be a bit misleading because of the use of PI4Kiiib. I understand what the authors mean, but because they have not thoroughly tested PI4Kiiib recruitment in their experiments, I think they should focuse rather on the mechanism of recruitment of ACBD3 the authors have found.

      Significance

      I think, as said above, that this is potentially an important paper for the field of membrane trafficking and membrane biology. Most of the experiments are in general well performed and well controlled, and the paper is clearly written and follows a logical line.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      This manuscript describes molecular mechanisms by which ACBD3 is recruited to the Golgi complex. ACBD3 recruits PI4KIII which is required to generate PI4P, a phosphoinositide which is key for the recruitment of essential Golgi proteins and hence is key to Golgi identity. The authors have used a combination of mass spectrometry, high quality fluorescence imaging, transient CRISPR knockdowns, and biochemical approaches such as IPs to identify the key determinant for recruitment of ACBD3 to the Golgi complex. They map the interaction between ACBD3 and the Golgi as a unique region (UR) upstream of its GOLD domain, identifying, in particular, an MWT motif as key for this recruitment. Using mass spectrometry they identify several novel interactors of ACBD3 as well as some established binding partners. Knockdown of these interactors reveal a key role for the SNARE, SCFD1, where reduced levels lead to complete loss of ACBD3 localisation to the Golgi without apparent disruption of Golgi structure. They further validate this interaction and that of another SNARE (Sec22b), which is part of the same SNARE complex as SCFD1, mapping the interaction to the longin domain of Sec22b. Surprisingly however they demonstrate that the UR domain does not mediate the interaction between ACBD3 and these SNAREs suggesting an alternative mechanism of recruitment. Previously identified ACBD3 interactors, Golgi proteins giantin and golgin-45 were also identified in the mass spectrometry screen and the authors demonstrate that these two proteins can recruit ACBD2 to the Golgi and this is dependent on the MWT motif identified in the UR domain. By knocking down SCFD1, they show reduced recruitment of ACBD3 leading them to propose a model of sequential recruitment of ACBD3 by SCFD1 followed by interactions with the golgins.

      Major points:

      This study is a well-executed and rigorous study of the molecular requirements for the recruitment of ACBD3 to the Golgi. The experimental approaches are state-of-the-art and the data are clean and convincing. The only caveat, raised by the authors themselves, is their interpretation that there are two sequential steps for Golgi recruitment of ACBD3. While they show that loss of SCFD1 reduces the interaction of ACBD3 with giantin and golgin 45, their model depends on doing the reverse experiment, i.e. assessing the effects of knocking down either giantin or golgin-45. This is especially relevant given the demonstration that golgin-45 is sufficient to recruit ACBD3 to mitochondria. It may well be that recruitment involves a tripartite complex, which is not uncommon in vesicular transport mechanisms Giantin is not an essential protein do it should be feasible to perform this experiment. The authors are equipped in the quantitative fluorescence microscopy which would be required and which would help resolve whether sequential or redundant mechanisms are required for ACBD3 recruitment.

      Significance

      PI4P is a phosphoinositide that is important for the recruitment of Golgi proteins. As with most PIs it is likely to act by coincidence detection in that Golgi associated proteins will recognise PI4P as well as other factors on Golgi membranes. This results in different local membrane environments which will be specific for particular functions. PI4KIII is key for PI4P production although the absolute levels of PI4P are likely to be determined by a balance of lipid kinases and phosphatases. However, since ACBD3 is key for the recruitment of PI4KIII it is important to understand the molecular mechanisms by which it is recruited. The manuscript thus makes a significant contribution to understanding one of the underlying mechanisms for PI4KIII recruitment although, as indicated above, stops short of establishing a clear model for the roles SCDF1 and Sec22b versus golgin 45 and giantin. For the future it will be of interest to determine why either a sequential or a redundant mechanism is required for the recruitment of ACBD3 as a scaffold protein.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank all reviewers for taking the time to evaluate our manuscript fairly and critically. Many helpful suggestions and discussion points were raised. One important group of comments raised concerns whether our proposed timer and counter models were the appropriate conceptual framework to discuss nuclear multiplication in schizogony, whether they were mutually exclusive, and whether other alternatives should be considered. These comments were instrumental for us to uncover some inconsistencies in our previous modeling approach. In the new manuscript, we now define the counter and timer models much more rigorously in the context of Plasmodium cell division. Based on these refined models we now provide a new statistical analysis that goes beyond the previous analysis, significantly improving the statistical support for our conclusions. Details are given in the following individual replies.

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary

      Malaria parasites replicating in human red blood cells show a striking diversity in the number of progeny per replication cycle. Variation in progeny number can be seen between different species of malaria parasites, between parasite isolates, even between different cells from the same isolate. To date, we have little understanding of what factors influence progeny number, or how mechanistically it is controlled. In this study, the authors try to define how the mechanism that determines progeny number works. They propose two mechanisms, a 'counter' where progeny number is determined by the measurement of some kind of parasite parameter, and a 'timer' where parasite lifecycle length would be proportional to progeny number. Using a combination of long-term live-cell microscopy and mathematical modelling, the authors find consistent support for a 'counter' mechanism. Support for this mechanism was found using both Plasmodium falciparum, the most prominent human malaria parasite, and P. knowlesi, a zoonotic malaria parasite. Of the parameters measured in this study, the only thing that seemed to predict progeny number was parasite size around the onset of mitosis. The authors also found that during their replication inside red blood cells, malaria parasites drastically increase their nuclear to cytoplasmic ratio, a cellular parameter remains consistent in the vast majority of cell-types studied to date.

      Major Comments

      It is stated a few times in this study that P. knowlesi has an ~24 hour lifecycle, and while this is the case for in vivo P. knowlesi, it was established in the study when P. knowlesi A1-H1 was adapted to human RBCs (Moon et al., 2013) that this significantly extended the lifecycle to ~27 hours, which should be made clear in the text. As much of this study revolves around lifecycle length and timing, the authors should consider some of their findings with the context that in vitro adaption can significantly alter lifecycle length.

      The reviewer raises an important point that we didn’t discuss for P. knowlesi. We now mention this directly in the introduction chapter (line 67) and in the discussion (lines 470ff). We are aware that P. knowlesi takes about 27 hours in the lab, which was also communicated by the Moon lab. We now cite relevant studies again in this context. We further address the issue of modified cell cycle time in vitro in the discussion in the sense that absolute values must be taken with caution and the focus of this study is about the relative ratio and correlation between the different cell cycle metrics.

      • The dichotomous distinction between 'timer' and 'counter' as mutually exclusive mechanisms seems to be a drastic oversimplification. Considering the drastic variation we see in merozoite number across species, between isolates, and between cells, it seems much more likely that there are factors controlled by both time-sensed and counter-sensed mechanisms that both influence progeny number.

      The study of progeny regulation in malaria parasites is very much in the early stages. We can agree that our models are simplifications, as is the case with all models. Our choice of just the two models timer and counter was driven by the number of cellular parameters we measure, i.e., duration of division phase and progeny number. These data essentially allow us to test the two competing models we presented. As we quantify more and more cellular parameters, based on the quantitative live cell imaging protocols established here, we will be able to test more complex cell cycle models. With our current data, we believe more complex models are not warranted.

      However, this valuable criticism, in conjunction with related remarks by other reviewers, made us reevaluate the constraints of our model more precisely. We noticed that the criteria used in the previous version in the manuscript contained unnecessary additional assumptions. Briefly, the previous counter model also required that final merozoite number was tightly controlled, while the previous timer model required the growth rate to be tightly controlled. These side assumptions were not made explicit in the manuscript and could bias the support towards one or the other model.

      We now improved the modeling approach substantially by removing implicit side assumptions, and clearly defining timer and counter models in terms of their correlations. The refined formulation of the timer posits that between individual parasites the target duration and the nuclear multiplication rate vary in a statistically independent way; while in a counter, target number and nuclear multiplication rate are statistically independent. We now explain this extended analysis in more detail in the introduction (lines 86ff). We also now more clearly state the dichotomous nature of the model (line 488). A new results paragraph (lines 213ff) and an entirely new Fig. 2 (and Fig. S4) contains the model predictions and statistical comparison between the models.

      This more rigorous treatment showed that including the variance of the multiplication rate was critical to allow a clean discrimination between the models. Also, with the sole exception of P.knowlesi H2B, where no model was clearly favored (Fig. 2G-H,K), the timer model was found to be inconsistent with the data, while the counter was clearly favored. Our new goodness-of-fit analysis also showed that although the counter is strongly simplified, it produced adequate fits, demonstrating that potential model refinements would need to be justified by new, more extensive data.

      It is also important to consider that the degree of variation in merozoite number could rather be an expression of varying growth conditions and does not directly predict which of the proposed models are true. For instance, a counter where the target merozoite number varies strongly depending on growth conditions, would be consistent with all available data. It is an interesting question for future work whether a counter would indeed describe growth across different isolates.

      The biological reality of growth regulation is certainly complex, and the counter model will likely need to be refined in the future, which we acknowledge in a corresponding statement in the discussion (lines 491ff). Nevertheless, we find it encouraging that a simple model can explain the vast majority of our data very well.

      Additionally, the only parasite parameter measured in this study, size at time of first nuclear division, explained only a small proportion of the variance observed in merozoite number.

      It is indeed the case that amongst the measured parasite parameters i.e. schizont stage duration, nuclear volume, and cell size we only found the latter to correlate with the final progeny number. We did not aim to imply that all variation in progeny number is explained by cell size. It is likely that a putative counter relies on a set of factors, which are somehow linked to cell size. In addition, intrinsic stochasticity in nuclear growth is likely to contribute to final merozoite number variability, which is included in our models via a variable growth rate. Defining the actual limiting factor or combination of factors will be an exciting challenge for the future studies building on this one.

      • For modelling of a timer-based mechanism, the designation of t0 is subjective. The authors chose the time of first nuclear division as their t0. It is possible that a timer-based mechanism could not be supported based on this model the chosen t0 differs from when the "parasite's timer" starts. For example, t could also have been designated as the time from merozoite invasion (t0) to egress (tend). It would be unreasonable to suggest the authors repeat experiments with a longer time-frame to address this, but this possibility should be discussed as a limitation of the model. It may also be possible to develop a different model where t0 = merozoite invasion and tend = egress, and test this model against the data already collected in this study.

      This is a valid point. We indeed, considered the time point of invasion as the other relevant time point in the IDC for a possible timer. Due to necessary compromises in imaging protocols between acquisition length, temporal, and spatial resolution we have not been able yet to combine full-length IDC measurements with quantification of progeny number. Given the choice, however, between time point of invasion and the onset of nuclear division as starting point for a potential timer we would still favor the latter: An argument can be made that a timer that regulates offspring number would be more accurate when activated at the moment of the relevant cellular events rather than “running” for a very prolonged growth phase before any “decision” concerning parasite replication. We are still convinced that the entry into the schizont stage, which we analyze here, marks an important cell cycle transition point that has been highlighted in many different studies. As suggested, we now discuss the limitations of our selection of t0 in the text (lines 146ff).

      • The calculation of the multiplication rate is confusingly defined. In Figure 1 it is stated that it is "...based on t and n", which would imply that the multiplication rate is the number of merozoites formed per hour of schizogony, which would give an average value of ~2 for P. falciparum and ~1.5 for P. knowlesi. The averages rate values shown, however, are in the range of 0.15-3. The authors should clarify how these values were determined.

      Thank you for pointing out the need for more clarity. Since the nuclear multiplication, similar to e.g. cell population growth, follows an exponential law, the multiplication rate used (lambda) is in fact a logarithmic growth rate. Therefore, it occurs in the exponent (not as a coefficient) in the exponential growth function ( ), which explains the range. We now mention this more explicitly in the results (lines 163ff).

      • In Figure 2, the time from tend until egress is calculated, and this is interpreted as the time required for segmentation. In the Rudlaff et al., 2020 study cited in this paper, it is shown that segmentation starts before the final round of nuclear divisions are complete. Considering this, the time from tend until egress is not an appropriate proxy for segmentation time. The authors should consider rewording to something akin to "time from final nuclear division until egress" to more accurately reflect these data.

      Thank you for indicating our imprecise use of the nomenclature. Indeed, some essential segmentation-associated structures such as rhoptries and subpellicular microtubules are clearly forming before the last division. We were referring to “segmentation” as the time window where actual ingression of the plasma membrane occurs between nuclei with the concurrent formation of more prominent IMC-associated sub-pellicular microtubules between nuclei (as in Fig. 1A last panel). We can, however, agree that consistently using the term “merozoite formation” is more adequate here. We have now corrected the terminology according to the suggestions of the reviewer (lines 271ff).

      • There is a significant discrepancy between the data in Figure 5 and Supplementary Figure 8. In Supplementary Figure 8, the authors establish that culturing parasites in media diluted 0.5x has a marginal effect on parasite growth, with no discernible change in parasitaemia over 96 hours. By contrast, in Figure 5a the parasitaemia of parasites cultured in 0.5x diluted media is approximately 5-fold lower than those in 1x media. The authors should explain the significant discrepancy between these results.

      The reviewer correctly points out a difference in parasitaemia between two parasite culture experiments, shown in Figs 5a (now 6A) and S8 (now S11), respectively. There were several differences in the experimental setup used in the two experiments that could explain this discrepancy. In Fig. 5a the parasites were synchronized to early ring stages while in Fig. S8 we used asynchronous cultures (maybe with a slight majority of late stages). One could speculate that by the time the synchronized ring stage culture reached egress the effect of nutrient depletion, which started at t = 0 h is more pronounced. This effect could have been exacerbated by the more frequent media change of 24 h in Fig. 5a vs 48h in Fig. S8. Lastly, the starting parasitemia was differently set being higher at around 0.5% in the Fig. 5a while only 0.2% in Fig. S8. Possibly a lack of nutrient is “felt less” by the culture at lower parasitemias. Generally, in Fig. S8 we were more focused on highlighting the difference between 1x/0.5x and the more diluted conditions on the long-term culture and to show that continuous culture is actually possible in 0.5x medium. We have now expanded the legends to highlight those differences more clearly.

      • In Supplementary Figure 4, the mask on the cell at t0 shows two distinct objects, but it seems very unlikely that they are two distinct nuclei as they vary approximately 5-fold in diameter. The authors should provide more detail on how their masking was performed for their volumetric analysis. Specifically, whether size thresholds were also applied during object detection.

      Thank you for requesting clarification here. Fig S4 (now S7) shows only one z-slice (not a projection) of the entire image stack, to illustrate how the thresholding approach was performed on every single image slice. The two objects in the shown cell are indeed two nuclei, but because they are not in the same z-plane appear to be of different size. In particular, only a slice of the upper part of the nucleus on the lower right is visible in the shown slice. Throughout the study, volume determination was realized by adding up the individual slices, as is explained in detail in the Materials and Methods sections. We have now added a more explanation in the figure legend to clarify the procedure.

      Minor Comments

      • Line 45-48 mentions that merozoite number influences growth rate and virulence, but the corresponding reference (Mancio-Silva et al., 2013) only discusses the relationship between merozoite number and growth rate, not virulence.

      We thank the reviewer for requesting this distinction. Merozoite number and virulence have not been correlated in vivo so far. Certainly, because one can’t retrieve late-stage P. falciparum parasites from patients, but maybe partly because merozoite number has not gotten significant attention as a metric in the previous decades. Even if merozoite number is intuitively connected to growth rate which might causes higher parasitemia which is in turn linked to more severe disease outcome it is important to emphasize that those are certainly not equivalent. We have therefore removed the statement about virulence (line 48).

      • Line 59 states that a 48 hour lifecycle is a baseline from which in vitro cultured parasites deviate. Clinical isolates also show variation in lifecycle length and so it is more accurate to just say that 48 hours is an average, rather than a baseline.

      The word “baseline” has been changed to “average” (line 61).

      • Line 63 cites a study for the lifecycle length of P. knowlesi (Lee et al., 2022), but there seems to be no mention of lifecycle length in this reference

      This reference was meant to serve as an introductory review article to research in P. knowlesi. Actually, to the knowledge of the authors, there is no study presenting quantitative data showing that the in vitro cycle of P. knowlesi is actually around 27 h. Our lab experience is however coherent with a 27 h cycle, which was confirmed by personal communication by the Moon lab. We now also cite in the next sentence the inaugural P. knowlesi adaptation publication (Moon et al. 2013) showing some time course data indicating the duration of the IDC to be around ~27h (lines 67ff).

      • If I am interpreting Figure 3B correctly, this is essentially a paired analysis where the same erythrocytes are measured twice, once at t0 and once at tend. If this is the case, this data may be better represented with lines that connect the t0 and tend values.

      Yes, these are the same erythrocytes measured twice. We have modified Figure 3 (now Fig. 4) accordingly.

      • Figure 3A seems to imply that to calculate diameter of the erythrocytes, three measurements were made and averaged for each cell. I think this is a nice way to get a more accurate erythrocyte diameter, but if this is the case, it should be specified in the figure legend or methods.

      This is already described in the figure legend (line 305).

      • In Figure 4I it is shown that in P. falciparum merozoite number doesn't correlate with nucleus size, but for P. knowlesi in Supplementary Figure 7c, a significant anticorrelation is observed. The authors should state this in the text and discuss this discrepancy.

      Contrary to all other graphs, visual inspection of the distribution of data points in Fig. S10C shows that it contains two outlier data points at the bottom right. Those two specific points are also responsible for the significant anticorrelation. We did not filter or remove any quantification results but also didn’t have sufficient confidence in this data distribution (which is further based on the segmentation of the Histone2B not on an NLS mCherry signal) to make substantial claims about anticorrelation. Because we considered it informative we still decided to show it in the supplements. We now briefly mention the issues with the data set and its interpretation in the text (lines 350ff).

      • The authors show that merozoite number roughly correlates with cell size at t0 but it would be interesting to see whether cell size at tend also corresponds with cell size at t0. This might help answer whether the cell is larger because it has more merozoites, or whether it has more merozoites because it is larger.

      Plotting parasite cell volume at t0 against cell volume at tend (as well as between t-2 and tend) indeed shows a positive correlation (see below). While it is an interesting thought we concluded after some discussion that no convincing causal relationship between cell size and merozoite number can be inferred based on this analysis. Since we consider the possible statement that cells that are bigger in the beginning are also bigger in the end unavailing, we decided not to include the data.

      • I don't feel that "nearly identical" is an appropriate summary of erythrocyte indices in Supplementary Figure 9, considering there is a statistically significant increase in mean cell volume. I think it is unlikely that this change is consequential, and performing these haematology analyses is a nice quality control step, but this change should be stated in the text.

      In the modified text we now express the significant change in MCV in terms of percentage, which is around 1.2% (line 381).

      • In Supplementary Figure 8, parasitaemia only increases ~2-fold compared to >5-fold the previous two cycles. It seems likely that at the final timepoint on this graph the parasites are starting to crash, and therefore it may be best to end the graph with the 96 hour timepoint.

      The reviewer suggests that cultures at those parasitemias might not be in perfect health. Our Giemsa stains did not show signs of an unhealthy culture and kept growing. It was, however, important for us to show that cultures can be maintained in culture over a prolonged period of time in 0.5x medium, even when resulting in reduced growth, while this was not possible with lower dilutions. Therefore, we would like to keep the data point. We have added a cautionary comment in the legend.

      • The error bars in Figure 5C aren't easily visible, moving them in front of the datapoints may help their visibility.

      Error bars were moved in front of the data points.

      • In Figure 6D & E, the y-axis labels should be changed to whole integers as all the values in the graph are whole numbers.

      We have changed the y-axis labels accordingly.

      • My interpretation of Figure 6 C-E, is that these are the same cells measured at three time points (t-2, t0 and tend). If this is the case, 6C is missing the cell that has a merozoite number of 8, which is presumably why the y-axes are not equalised for the three graphs.

      It is correct that the same cells are displayed in all three plots, with the exceptions of three cells in 6C (for the timepoint t-2), which are missing for the following reasons: 1) it was not possible to determine the volume at this respective timepoint due to technical issues or 2) the cell was already just before t0 at the start of the movie so that t-2 had already passed. We now note this in the figure legend and have also equalized the y-axes (now Fig. 7C-E).

      Reviewer #1 (Significance):

      In the asexual blood-stage of their lifecycle, malaria parasites replicate through a process called schizogony. During schizogony an initially mononucleated parasite undergoes multiple asynchronous rounds of mitosis followed by nuclear division without cytokinesis, producing a variable number of daughter nuclei. Parasites then undergo a specialised cytokinesis, termed segmentation to where nuclei are packaged into merozoites that go on to invade new host cells. While nucleus, and therefore merozoite, number are known to be varied between cells, across isolates, and across species, little is known about the mechanisms regulating merozoite number. In this study, the authors use live-cell microscopy to understand how parasites determine their progeny number. They suggest that parasites regulate their progeny number using a 'counter' mechanism, which would respond to the size or concentration of a cellular parameter, as opposed to a 'timer' mechanism. Long-term live-cell microscopy experiments using malaria parasites are extremely technically challenging, and the authors should be commended for their efforts in this regard. While I agree that the data generated from these experiments are technically sound, I have some reservations expressed above about the interpretation of some of these results. I would strongly encourage the authors to consider rewording some of their interpretations taking into account some of the caveats listed above. I would also consider fitting/testing an additional mathematical model where the time-frame proposed for the 'timer' mechanism begins following merozoite invasion.

      We thank the reviewer for the appreciation of our work and hope we have sufficiently reworked the manuscript based on the comments listed above. Furthermore, we think the improved model statement and analysis improves the clarity of our conclusions. Indeed, we would like to test additional models including the full IDC once, as mentioned above, we are technically able to generate these data.

      This work is of specific interest to anybody who grows malaria parasites, as the dynamics of their growth is obviously important to understand. Further, this work is of interest more generally to cell biologists who study the regulation of progeny number or cell size. I have no experience with the application of mathematical modelling to understand biological systems, and so I cannot comment on the interest of this work to that field.

      Reviewer #2 (Evidence, reproducibility and clarity):

      This is a solid study that further characterises the dynamics of nuclear division in Plasmodium falciparum and P. knowlesi. Of two, among potentially several, models for how the number of daughter nuclei, and thus parasites - (called merozoites in this genus), are one that posits nuclei divide until a fixed timer ends, and one that posits that nuclei divide to reach a fixed number that is defined by a cellular counter. I find some practical difficulties in definitive measurement of either model, one issue with the former is that experimental definition of the start of the timer is problematic - we may define the starter's gun (eg by the first nuclear division) but it isn't necessary that the cell is using that same start time.

      We are pleased that the Reviewer found our study ‘solid’. Concerning the timer model, we agree that the selection of the starting point is a critical aspect of this study, as also Reviewer 1 pointed out. We selected this particular “t0” because the entry into the mitotic phase marks an important cell cycle transition. Several studies have suggested a “schizogony entry checkpoint” might be active just before (Matthews et al, 2018; Voß et al, 2023; van Biljon et al, 2018; McLean & Jacobs-Lorena, 2020). Once cells are committed to the schizont stage they are less responsive to stimuli. Alternatively, the timepoint of erythrocyte invasion could be a legitimate starting point. Due to necessary compromises in our imaging protocol between acquisition length, temporal, and spatial resolution we have not been able yet to combine full-length IDC measurements with quantification of progeny number, and therefore we leave exploration of an earlier timer start for future work. Within the confines of the model comparison in the current study, we think the selected t0 is already highly informative. We now explain the selection and limitations more explicitly in the text (line 144ff).

      Additionally, as the authors confirm here, being sure when that first nuclear division has occurred is particularly tricky with Plasmodium parasites, in part because the first few nuclei seem to clump together, preventing one from unambiguously calibrating the first division.

      The Reviewer is concerned about difficulties with precise reporting of the time point of first nuclear division. We suspect there was a misunderstanding here. In the text (line 137) we had written the following:

      “Although separating individual nuclei after the first two rounds of division was challenging due to their spatial proximity, the improvements in resolution and 3D image analysis allowed us to count the final number of nuclei routinely and reliably at the transition into the segmenter stage.”

      To clarify, when analyzing 3D image stacks produced by the LSM900 Airyscan the first nuclear division can consistently and unambiguously be detected. In anaphase the nuclei are pushed apart quite substantially before getting a bit closer together afterwards (see e.g. Fig. 1B and C). Hence the precision of the detection is only limited by the 30 min interval of the time lapse. Later, at the four nuclei stage, crowding makes distinction more difficult. In the final segmenter stage, the reorganization and condensation of nuclei makes reliable counting possible again. We have now reformulated the quoted sentence for more clarity (lines 137ff).

      Furthermore, getting decent replicate numbers is hard because of the difficulties of time lapse microscopy, and most Plasmodium studies (including this one) suffer from low enough numbers that it isn't always clear whether the numbers support one model over another.

      The reviewer points out the difficulty of obtaining enough replicates in Plasmodium time-lapse studies. We agree that depending on technology, sufficient replicates can be challenging. In the present study we obtained Ns between 25 and 35 for all conditions in P. falciparum and P. knowlesi from three independent replicas. To gain confidence in the conclusions from a limited, but not austere, data, it is essential to 1) reduce model complexity to a minimum and 2) perform stringent statistical analysis including accounting for small-sample variation. Motivated by this concern of the Reviewer and a similar point raised by Reviewer 1, we have revisited our modeling approach in the revised manuscript. This led us to a corrected, more rigorous definition of what precisely we mean by ‘counter’ and ‘timer’ models: The timer posits that between individual parasites the target duration and the nuclear multiplication rate and vary in a statistically independent way, while in a counter target number and nuclear multiplication rate are statistically independent. With no further adjustable parameters, the two models are thus both mutually exclusive and minimal. Although biological reality is likely to be more complex, we feel that these minimal models are adequate for the amount and resolution of our current, state-of-the art data. The general result remained the same: The counter model is strongly preferred in almost all our experiments data (new Fig. 2), with the sole exception of P. knowlesi H2B, where indeed more data may be needed to come to a clear conclusion. Furthermore, we have taken care to scrutinize these conclusions accounting for goodness-of-fit for the respective sample size N. This analysis showed, surprisingly, that the counter model was sufficient to account for the data: the real dataset was as similar to the counter prediction as synthetic, counter-generated data. We hope that this improved statistical analysis can help the reader judge the robustness of our conclusions.

      Nonetheless, several recent studies, particularly a study from the same institute (Klaus et al., 2022) employing timelapse imaging of nuclei, and timing the nuclear division of parasites, finds poor correlation between the duration of "schizogeny" (although perhaps using a different definition to the one used by the parasite) and the final number or merozoites. They therefore argue that there is poor evidence for a timer, and conclude by elimination that a counter must exist instead. A review by some of the authors of that study and some of this current study (Voß et al 2023), also concludes that the data from Klaus and colleagues "strongly support" a counter model. This current study also concludes that a counter model controls final nuclear/merozoite number in P. falciparum and P. knowlesi. This much at least is not particularly novel given the recent work on this topic, although the addition of the P. knowlesi data is interesting and consistent with the prior P. falciparum work.

      Our present work, indeed, does confirm the previous report of a counter over a timer, through a more targeted approach. While Klaus et al. used timing data of first nuclear cycle vs. the full duration, we now provide, thanks to an improvement microscopy setup and protocol, simultaneous measurements of timing and final progeny number, i.e. counting of merozoites/nuclei. While the preference for a counter model is not fundamentally novel, the additional information that the counter model holds in different strains, conditions and species is, in our opinion, not trivial and points to some degree of evolutionary conservation. We also demonstrate here that the counter model is not only preferred over the timer, it also fits the data adequately, so that it can be considered ‘correct’ at this level of complexity. Another, possibly more important, value of this study lies in the quantitative and time-resolved assessment of multiple important parasite metrics such a cell volume and nuclear volume together with merozoite number at the single cell level. Although descriptive, this has not been achieved in Plasmodium until now.

      As above, the authors concede that it is difficult to determine with strong confidence when the first nuclear division has occurred, so it may well be that there is substantial noisiness in the time that they define schizogeny to commence. If that were the case, this would contribute to the poor correlation observed between schizogeny duration and number of merozoites produced, so this could be an important confounding experimental factor. This deserves some more discussion by the authors.

      Concerning the confidence with which we identify the first nuclear division we could hopefully clarify in the section above that our precision is only limited by the time resolution of the acquired time-lapse. Therefore, the uncertainty about the start time is not particularly high, and moreover, can expected to affect timer and counter (via the growth rate) to a similar degree. We see no unfair advantage for the counter for this reason.

      Alternative methods to count absolute DNA content (rather than trying to count individual nuclei) might be useful ways of independently confirming this phenomenon. Alternative possibilities for what constitutes the "start" of a possible timer are also warranted - it could be for example, the first division of one of the other organelles.

      This is an interesting suggestion. Next generation fluorogenic DNA dyes have been used by us and the Ganter group (Simon et al. 2021, Klaus et al. 2022, Wenz et l. 2023) to assess DNA content of single cells over time. Our experience shows that there are some caveats to using these Hoechst based dyes, some of which we discussed in the aforementioned publications. While they allow some reasonable absolute quantification of DNA content for the very first S-Phase (and subsequent nuclear division), in later stages only relative quantification can be achieved. One underlying reason is the apparent increase of dye permeability, and therefore higher intensity, at late schizont stages. This issue is exacerbated by the asynchronous DNA replication of multiple nuclei. Further, nuclear division itself can be delayed or even inhibited when increasing the concentration of the dye, which suggest an impact on cell physiology (well documented for Hoechst based dyes in other organisms). When reaching the segmenter stage, the resulting variance in fluorescent intensity would make it challenging to assign a reliable number of nuclei required for analysis, a problem that does not occur when counting individual nuclei. Taken together, unfortunately, all these confounding factors make DNA content analysis in live single cells for the entire schizont stage unachievable at this point.

      These and previous authors in any case conclude that a counter model must exist through exclusion of a timer model. I am less convinced that the evidence discounting the timer is conclusive, and that a straight counter model is the only alternative. Indeed I am unconvinced by the suitability of this strictly dichotomous two-model system to categorise the division of unicellular eukaryotes, and these theories are not universally held to be sufficient to describe division.

      We thank the Reviewer for this insightful comment. As already detailed above, we have clarified and corrected our model definitions in the revised manuscript. Further, we want to make the important distinction between organisms, including unicellular ones that undergo binary fission and the ones like Plasmodium that use schizogony. Our model, although inspired by model organisms, is tailored to a multinucleated division mechanism, and clearly defined within those boundaries. The timer and counter models we consider are defined by their correlation structures. They are at two extremes of a continuum of models which could be characterized, for instance, by the ratio of correlations (growth rate - nuclear number) vs. (growth rate – duration) as an additional parameter. As the reviewer points out, excluding the timer model is not equivalent to proving the counter model, and indeed a partially correlated model, or a more complex model entirely, could yield a better fit. However, within the realm of models without additional parameters, and which are testable with the available data, only timer and counter remain, as different timer start points are not experimentally accessible. Importantly and somewhat surprisingly, the counter model also gave a fit that is as good as can be reasonably expected for the experimental sample size (new Fig. 2). So, we maintain that within the current experimental constraints, the counter model is the only viable option for almost all our tested conditions. The observation that in H2B-GFP expressing P. knowlesi parasites no clear distinction can be made between the models, indeed, suggest that the reality of multiplication rate regulation is more complex and may be limited by different constraints in different growth regimes. We now state these limitations and the room for further model adjustments with more data in the Discussion section.

      Nonetheless, if a counter exists, what is being counted that determines the final number? The authors consider that this might be a physical object or resource inside the parasite, or an extrinsic/extracellular resource. They investigate this by comparing the final cell number to a number of factors. First, the authors investigate the size of the RBC (by musing the diameter as an indicator)- little information is given about the source of the blood used, but it appears to be from a single donor of unknown age, who has approximately typical variance in RBC diameter (at least, after manipulation and storage). The authors observe little correlation between these variables.

      We share the curiosity of the reviewer about what might be “counted” by the parasite. This shall be the subject of future studies, and our present study provides the necessary basis for asking this question and defines a framework to investigate it. Concerning the size of the host cell, the blood used was from a different donor for each of the replicas, which we now specify in the figure legend (line 302). No significant difference between the RBC diameters between the donors was observed. A correlation between RBC diameter and progeny number was indeed not observed.

      Second the authors measure parasite size at the onset of schizogeny, and find that bigger parasites result in more daughter merozoites early in schizogeny (perhaps not surprising, given the earlier mentioned technical problems with measuring the first few steps of schizogeny), but that this different initial cell size doesn't result in a different final merozoite number, or as they describe it "not quite significant anymore". Previous p values were taken as cause for rejecting the timer hypothesis and the timer model. In this case the authors instead interpret the data as suggesting "that the setting of the counter might correlate with parasite cell size". This is inconsistent statistical and analytical handling, and highlights the earlier potential pitfall of rejecting timer-based models based on not gathering data that statistically show a correlation. This needs reworking to highlight that these data are inherently noisy, difficult to measure accurately, and aren't necessarily going strongly reveal a trend even where one biologically exists, and that this ought not be used as grounds for confident rejection of a model.

      The Reviewer raises concerns about the consistency of the statistical interpretation of our data. We care deeply about the well-foundedness of our conclusions and hope to eliminate these concerns in the following. First, we hope that the issue about the “technical problems” in measuring the first division has been solved in our response to previous comments. Next, to clarify an apparent misunderstanding: As stated in the text (lines 329ff) and shown in now Fig. 5D-E, cell size at onset of nuclear division or 2 hours prior does significantly correlate with final merozoite number. The lack of significant p-value (0.08) only pertains to the correlation of cell size at the end of the schizont stage (tend) with merozoite number (now Fig. 5F). We have removed the unfortunate wording “not quite significant anymore” in that context. Finally, regarding potential mechanisms, a potential counter must be set before the first nuclear division is completed because only that way it can be set independent of the speed of nuclear multiplication. This observation gives the statistically significant correlation of volume at the onset of division and progeny number its relevance. We have reformulated the marked sentence for more clarity (lines 331ff). Furthermore, we point out that our rejection of the timer is now based on a revisited statistical analysis (Fig. 2), which is no longer based on a simple correlation between final number and duration, as detailed above.

      Finally, the authors grow the parasites in dilute media, and find that they produce fewer daughter parasites. This is anecdotally unsurprising, as most Plasmodium laboratories are aware that sub-optimal growth conditions result in less healthy schizonts with fewer viable merozoites (and lower magnitudes of single-cycle expansion), but is nonetheless an important result that highlights explicitly how much this occurs in the specific conditions of dilute media. Given the lack of investigation of exactly which nutrient, carbon source, or combination thereof leads to the reduced merozoite number, it is unclear if or how much this is relevant to the scenario of a natural infection and realistic levels of that nutrient in a human or primate parasite environment.

      As rightfully pointed out by the reviewer suboptimal growth conditions affecting parasite growth and multiplication rate have been shown in many instances. The number of studies that actually quantify a reduction in merozoite number under different growth conditions is certainly much lower (Brancucci et al. 2017 (lipids), Mancio-Silva et al. 2017 (calorie-restriction in mice), Tinto-Font et al. 2022 (temperature) come to mind). What our study adds to this body of literature is to which extent duration of the schizont stage and cell volume are affected in relation to progeny number at the single cell level. Importantly, we wanted to test whether the counter model still holds under these more adverse conditions, which we found to be the case. Along the lines of the work on calorie restriction and the likely implication of isoleucine in the process investigated in the laboratory of Maria Mota, it will be exciting to identify a “limiting factor” in future studies. Indeed, any study done in complete RPMI culture medium can be questioned regarding its physiological relevance and we added a sentence addressing this aspect in the discussion (lines 514ff). Yet, our medium dilution experiments suggest that at least to some degree an extracellular resource is implicated, which makes sense from a biological function point-of-view.

      Minor issues

      The manuscript confuses the terms "less" and "fewer". Fewer should be used for countable nouns (fewer daughter cells, fewer nuclei, fewer merozoites), less for uncountable nouns (e.g. less speed, less volume).

      Thank you for pointing this out. The words have been replaced accordingly.

      I didn't understand lines 93-95; "This excluded a timer and thereby confirmed a counter as the mechanism regulating termination of nuclear multiplication (Klaus et al., 2022). A direct correlation between duration of schizont stage and merozoite number is, however, still missing." If I understand the first sentence concludes that there ought not be a direct correlation between schizont duration and merozoite number, but the second sentence, says that that correlation is "however" missing. Isn't this expected? Perhaps reword for clarity?

      Thank you for requesting clarification here. The exclusion of the timer by Klaus et al. 2022 was based on the correlation between duration of the first nuclear division cycle and the total duration of all nuclear replication phases. At no point did Klaus et al. count merozoites in live single cells, which was mainly due to lower spatial resolution of their images (M. Ganter, personal communication). Therefore, they could not directly assess the relation between progeny number and schizont stage duration, which we now report for the first time. The sentence was supposed to convey that this type of data was missing and was now reformulated for more clarity (line 114).

      Lines 104

      "We further uncover that throughout schizogony P. falciparum infringes on the otherwise ubiquitously constant N/C-ratio (Cantwell and Nurse, 2019)" This seems obvious to me, and not something uncovered by this study. In most of the numerous apicomplexans that divide by endoschizogeny, the cells achieve a near final size considerably before the final rounds of nuclear division so the N/C ratio must not remain constant - this is a direct corollary of many previous descriptions and not a novel finding of this study, and this claim here should be made more modest.

      We understand the point raised by the reviewer but still think that our claim is justified due to several aspects. There are examples of eukaryotic cells that undergo multinucleated stages during division were the N/C-ratio is constant (Dundon et al. 2016, Cantwell and Nurse, 2019), while we are not aware of any counter-example in the literature. Studies have also shown that e.g. certain mutant yeast that fail to undergo cytokinesis will increase their volume by factor of up to 16 alongside the still replicating and growing nucleus maintain the N/C-ratio (Neumann et al. 2007, Jorgensen et al. 2007). This demonstrates the tremendous plasticity that cells can reveal with respect to nucleus and cell size regulation. Until the contrary was shown, it was conceivable that nuclear compaction, which does occur (Fig. 5H), compensates for the increase in nuclear number while the cell volume is only increasing slightly. Importantly, we are not aware of any literature where nuclear volume has been quantified for blood stage Plasmodium. Cell volume quantifications remain limited to modelling and the study by Waldecker et al., which provides a few datapoints throughout the IDC. Whether this finding is expected or not, formally speaking, our claim is justified, but for more clarity we replace “uncover” with “demonstrate”. We also introduce the N/C-ratio as cellular parameter in P. falciparum pointing out another divergent aspect of its biology and might in the future understand the functional implication of this usually constant ratio, which is still unclear.

      Dundon SE, Chang SS, Kumar A, Occhipinti P, Shroff H, Roper M, Gladfelter AS. Clustered nuclei maintain autonomy and nucleocytoplasmic ratio control in a syncytium. Mol Biol Cell. 2016 Jul 1;27(13):2000-7.

      Neumann FR, and Nurse P. Nuclear size control in fission yeast. J. Cell Biol. 2007; 179: 593–600. pmid:17998401

      Jorgensen P, Edgington NP, Schneider BL, Rupeš I, Tyers M & Futcher B Molecular Biology of the Cell 18 (2007) The size of the nucleus increases as yeast cells grow.

      Helena Cantwell, Paul Nurse; A homeostatic mechanism rapidly corrects aberrant nucleocytoplasmic ratios maintaining nuclear size in fission yeast. J Cell Sci; 132 (22)

      I lack specialist statistical knowledge to comment on the statistical analyses performed on the correlation data, and in particular, whether the high p values for t-Tests for correlation are sufficient to support the argument that there is not a correlation, and whether these observations are sufficiently powered to robustly test that hypothesis.

      We are confident that our reworked model analysis, as explained above, now sufficiently supports our hypotheses.

      Reviewer #2 (Significance):

      The manuscript purports to find a counting mechanism that determines parasite merozoite numbers, and that this coutner is set by an externally provided and diffusible resource. Many nutrients are in excess in normal culture media, but not all. If that counted nutrient(s) were normally in excess in the bloodstream, it could hardly be said to be the factor that is counted and that therefore defines merozoite number. Conversely, if the amount of that nutrient were increased in normal media, would parasites make even more merozoites? Further, if the "counted" item is a freely diffusible compound in the media, it should be equally accessible to each parasite in a culture condition, and isn't a reasonable explanation for the variable merozoite numbers in the normal media conditions. To me, it is unsurprising that parasites that are healthy and well fed are able to produce more merozoites, but I don't see this as being the same as support for a counter model where the parasite senses and counts a set number of merozoites to produce in response to a specific external counter. I think the shoehorning of this phenomenon into a paradigm used to describe some other eukaryotes may not be appropriate, and that the rejection of one overly simplistic timer model should not automatically lead to us dichotomously accepting a simple counter method as the alternative. The authors need to do more to either identify a countable input whose gradual increase leads to a predictable and gradual increase in merozoite number, to show that they do use a counter, or provide substantially more caveats to their argument that the parasites are using a counter based on an externally provided resource to determine merozoite number.

      The reviewer comments on the feasibility of a counter mechanism based on an externally provided and diffusible resource. In fact this is a limited view of how a counter may arise and not the one we subscribe to. Rather, while a resource may be diffusible in the medium, it would need to be consumed during schizogony, and insufficiently replenished, in order to enable counting by dilution in the host cell. Furthermore, the reviewer has doubts that the fact that “healthy and well fed […] produce more merozoites” implies “support for a counter model”. We fully agree, and we argue in the manuscript that it is the correlations between schizogony durations and merozoite counts that support a counter model.

      As we have argued above, the two alternative models we consider are inspired by paradigm from other eukaryotes, but their definitions in the present context are simple enough for them to be considered natural minimal models of schizogony. As the simplest imaginable phenomenological models of multiplication control, we find it natural to compare them, and we hope our new introductory section introduces them appropriately now. Naturally, we hope to expand on this simple model in the future and identify more precisely the limiting resources and describe a more direct response.

      Audience - relatively specialised - likely interested audience would combine apicomplexan cell biologists, as well as theorists of cell division mechanism

      Advance - limited - confirms phenomenon also described by other researchers in their institute, and extends to another related organism.

      We would like to add that the present data are the first quantitative joint measurements of schizogony dynamics and outcome in P.falciparum and knowlesi. They allowed for the first time a direct correlation of duration and merozoite number, thereby accessing the question of growth control head on. Further they provide a quantitative reference of several key cellular parameters for anybody studying asexual blood stage parasites.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:

      Stürmer and colleagues used super-resolution time-lapse microscopy to probe the mechanism regulating the number of merozoites produced by a single cell in Plasmodium falciparum and P. knowlesi. The authors conclude the followings-

      1. P. knowlesi has similar duration of schizont stage to P. falciparum, although having a 24 h intraerythrocytic developmental cycle (IDC) to 48 h of P. falciparum.
      2. Nuclear multiplication dynamics suggests a counter mechanism of division- which is further suggested by a significant relation of merozoite numbers with schizont size at the onset of division.
      3. Nutritional deprivation caused increase in nuclear volume and decrease in merozoite number. For the most part, the experiments that are presented in this manuscript support the conclusion of the authors. The data are presented in a concise and clear manner. However, some clarification and a couple of experiment (listed below) would improve this manuscript.

      Major comments:

      1. The authors generated at least 3 transgenic lines for this study, But the did not present any genetic validation of the lines in the manuscript. For completeness, I recommend to provide genetic validation (either pcr genotyping or whole genome sequencing) of the lines that were generated and used in this study in the supplement.

      Our study exclusively used episomal expression of the respective fluorescent reporter (H2B-GFP, NLS-mCherry, and cytoplasmic GFP). As is customary in the field resistance to selection drugs and distinct fluorescent signals are assumed to sufficiently validate the presence of the plasmids. We now added the schematic maps of the plasmids in a new Fig. S1 to make our approach more visually clear.

      1. In the H2B-GFP lines, the authors episomally GFP-tagged histone 2B to label the nuclear chromatin for both P. falciparum and P. knowlesi. This provides a very useful parasite line which enables the live time-lapse microscopy. Using these parasite lines, the authors first show that despite having a 24 h IDC in P. knowlesi vs 48 h in P. falciparum, both these parasites have a similar duration of the schizont stage (8.s vs 9.4 h). My concern here is whether this GFP-tagging is influencing the growth dynamics as in slowing down the P. knowlesi parasites. However, if that was the case authors should have seen that for P. falciparum too. Also, for the P. falciparum parasites that episomally express cytosolic GFP and Nuclear mCherry have a higher number of merozoites compared to the H2B-GFP P. falciparum and the authors speculate this is probably because of not tagging Histone 2B. Given this, it is important to show that none of the H2B-GFP parasites show any significant fitness cost due to GFP tagging of histone. I recommend a simple experiment to compare the multiplication rate of H2B-GFP lines to the parental lines in identical growth conditions. This suggested experiment was described in PMID: 35164549 to determine fitness cost of knockout lines. This experiment is vital for validation of the H2B-GFP lines and subsequent interpretation of the data that were presented in this manuscript.

      We thank the reviewer for this excellent suggestion. To validate our lines further we now have carried out multiplication rate measurements similar to the one described in the designated publication for all the used lines alongside their parental strains (Fig. S2). We found no significant differences in between the wild type and the episomally expressing parasite lines (lines 131ff), which gives us confidence that episomal expression of tagged proteins do not significantly alter growth dynamics in these cases.

      1. The authors used the microtubule live cell dye SPY555-Tubulin in P. falciparum to validate the findings presented in 1D and 1E. They did not do that for P. knowlesi. If there is no unsurmountable technical difficulty, I suggest doing the same with P. knowlesi. This will also address the concern that I have pointed out in #1.

      Thank you for this suggestion. We have now generated the requested data with P. knowlesi, added it to what is now Supplemental Figure 3 and included it in our new analysis (Fig. 2I-J). The numerical values align well with the observations made when measuring schizont stage dynamics with the H2B-GFP expressing P. knowlesi line (line 158). A notable difference is that the Tubulin data strongly support the (refined) counter model, while the H2B data alone allow no distinction.

      1. The data in Figure 3 shows that merozoite number does not depend on host cell diameter. My question here is, were these data collected using different donor blood? Or were this measured from different biological replicate? These are not clear from the writing. I am not sure about whether blood from various donor would have on the data, however, different preparation of the cells across various biological replicate will have some effect on host cell diameter hence on data. State if these were collected from independent biological replicates and about the donor blood.

      The data results where indeed collected from three independent biological replicates using different donor blood batches. This is now stated in the figure legend. The batches displayed no difference in RBC diameter.

      1. It is interesting to see that nutrient-limited conditions increase average nuclear volume but less merozoite numbers. In this experiment, as I understand, complete media was diluted 0.5x, which basically diluted every component of the media by half. From this experiment I can see nutritional deprivation as a whole having an effect and supports the counter mechanism, it would be intriguing to see if there is any effect of a particular nutrient have any effect on progeny division. For example, parasites can be grown in amino acid deprived media (except isoleucine) which makes the parasites fully dependent on host cell amino acids. This sort of specific nutrient deprivation will probably allow the authors to probe for specific nutrients that plays role as counter mechanism factor.

      This is indeed a very exciting direction we would like to investigate in more detail in follow-up studies. Our aim for this study was to confirm that nutrient deprivation actually affects “counting” and to provide a workflow to investigate individual nutrients. In the meantime the Mota group, in a study we now cite in the discussion (lines 507ff), actually reported that isoleucine (and possibly methionine) levels are linked to progeny number. A follow-up on this topic using our strains and methodology is certainly worthwhile but requires more detailed analysis in the future.

      Minor comments:

      1. P. knowlesi is sometimes just written as knowlesi. Please, write P. Knowlesi.

      Has been corrected.

      1. Supplemental figure 1D, missing x-axis label.

      We added the x-axis label.

      1. In line 105, define N/C.

      Done.

      1. In line 205, I assume the authors mean episomally, not episomally.

      Thank you for pointing this out. We have replaced “ectopically” with “episomally” throughout the text.

      1. In line 275, Duration of Schizont stage was slightly....

      Has been corrected.

      1. All 'ml' or 'µl' should be 'mL' or 'µL'.

      Changes have been made.

      1. Define iRPMI.

      We added a definition (line 610).

      1. In line 475, replace 'as' with 'and'.

      Done.

      Reviewer #3 (Significance):

      The factors that regulate the number of progenies in malaria parasites remain unknown. While there are few previous studies attempting to answer the question, those studies were done on fixed stained cells. In this study, the authors used genetically modified fluorescent P. falciparum and P. knowlesi parasites that enable live microscopy. These parasites coupled with super-resolution time-lapse microscopy the authors attempt to investigate the mechanism(s) at play in regulating progeny division. This manuscript provides data to suggest that external resources might have some role in progeny division and supports the counter mechanism. More careful validation of the transgenic lines that were used to collect data presented needs to be more systematic and rigorous.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Stürmer and colleagues used super-resolution time-lapse microscopy to probe the mechanism regulating the number of merozoites produced by a single cell in Plasmodium falciparum and P. knowlesi. The authors conclude the followings:<br /> - a. P. knowlesi has similar duration of schizont stage to P. falciparum, although having a 24 h intraerythrocytic developmental cycle (IDC) to 48 h of P. falciparum.<br /> - b. Nuclear multiplication dynamics suggests a counter mechanism of division- which is further suggested by a significant relation of merozoite numbers with schizont size at the onset of division.<br /> - c. Nutritional deprivation caused increase in nuclear volume and decrease in merozoite number.<br /> For the most part, the experiments that are presented in this manuscript support the conclusion of the authors. The data are presented in a concise and clear manner. However, some clarification and a couple of experiment (listed below) would improve this manuscript.

      Major comments:

      1. The authors generated at least 3 transgenic lines for this study, But the did not present any genetic validation of the lines in the manuscript. For completeness, I recommend to provide genetic validation (either pcr genotyping or whole genome sequencing) of the lines that were generated and used in this study in the supplement.
      2. In the H2B-GFP lines, the authors ectopically GFP-tagged histone 2B to label the nuclear chromatin for both P. falciparum and P. knowlesi. This provides a very useful parasite line which enables the live time-lapse microscopy. Using these parasite lines, the authors first show that despite having a 24 h IDC in P. knowlesi vs 48 h in P. falciparum, both these parasites have a similar duration of the schizont stage (8.s vs 9.4 h). My concern here is whether this GFP-tagging is influencing the growth dynamics as in slowing down the P. knowlesi parasites. However, if that was the case authors should have seen that for P. falciparum too. Also, for the P. falciparum parasites that episomally express cytosolic GFP and Nuclear mCherry have a higher number of merozoites compared to the H2B-GFP P. falciparum and the authors speculate this is probably because of not tagging Histone 2B. Given this, it is important to show that none of the H2B-GFP parasites show any significant fitness cost due to GFP tagging of histone. I recommend a simple experiment to compare the multiplication rate of H2B-GFP lines to the parental lines in identical growth conditions. This suggested experiment was described in PMID: 35164549 to determine fitness cost of knockout lines. This experiment is vital for validation of the H2B-GFP lines and subsequent interpretation of the data that were presented in this manuscript.
      3. The authors used the microtubule live cell dye SPY555-Tubulin in P. falciparum to validate the findings presented in 1D and 1E. They did not do that for P. knowlesi. If there is no unsurmountable technical difficulty, I suggest doing the same with P. knowlesi. This will also address the concern that I have pointed out in #1.
      4. The data in Figure 3 shows that merozoite number does not depend on host cell diameter. My question here is, were these data collected using different donor blood? Or were this measured from different biological replicate? These are not clear from the writing. I am not sure about whether blood from various donor would have on the data, however, different preparation of the cells across various biological replicate will have some effect on host cell diameter hence on data. State if these were collected from independent biological replicates and about the donor blood.
      5. It is interesting to see that nutrient-limited conditions increase average nuclear volume but less merozoite numbers. In this experiment, as I understand, complete media was diluted 0.5x, which basically diluted every component of the media by half. From this experiment I can see nutritional deprivation as a whole having an effect and supports the counter mechanism, it would be intriguing to see if there is any effect of a particular nutrient have any effect on progeny division. For example, parasites can be grown in amino acid deprived media (except isoleucine) which makes the parasites fully dependent on host cell amino acids. This sort of specific nutrient deprivation will probably allow the authors to probe for specific nutrients that plays role as counter mechanism factor.

      Minor comments:

      1. P. knowlesi is sometimes just written as knowlesi. Please, write P. Knowlesi.
      2. Supplemental figure 1D, missing x-axis label.
      3. In line 105, define N/C.
      4. In line 205, I assume the authors mean episomally, not ectopically.
      5. In line 275, Duration of Schizont stage was slightly....
      6. All 'ml' or 'µl' should be 'mL' or 'µL'.
      7. Define iRPMI.
      8. In line 475, replace 'as' with 'and'.

      Significance

      The factors that regulate the number of progenies in malaria parasites remain unknown. While there are few previous studies attempting to answer the question, those studies were done on fixed stained cells. In this study, the authors used genetically modified fluorescent P. falciparum and P. knowlesi parasites that enable live microscopy. These parasites coupled with super-resolution time-lapse microscopy the authors attempt to investigate the mechanism(s) at play in regulating progeny division. This manuscript provides data to suggest that external resources might have some role in progeny division and supports the counter mechanism. More careful validation of the transgenic lines that were used to collect data presented needs to be more systematic and rigorous.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This is a solid study that further characterises the dynamics of nuclear division in Plasmodium falciparum and P. knowlesi. Of two, among potentially several, models for how the number of daughter nuclei, and thus parasites - (called merozoites in this genus), are one that posits nuclei divide until a fixed timer ends, and one that posits that nuclei divide to reach a fixed number that is defined by a cellular counter. I find some practical difficulties in definitive measurement of either model, one issue with the former is that experimental definition of the start of the timer is problematic - we may define the starter's gun (eg by the first nuclear division) but it isn't necessary that the cell is using that same start time. Additionally, as the authors confirm here, being sure when that first nuclear division has occurred is particularly tricky with Plasmodium parasites, in part because the first few nuclei seem to clump together, preventing one from unambiguously calibrating the first division. Furthermore, getting decent replicate numbers is hard because of the difficulties of time lapse microscopy, and most Plasmodium studies (including this one) suffer from low enough numbers that it isn't always clear whether the numbers support one model over another.

      Nonetheless, several recent studies, particularly a study from the same institute (Klaus et al., 2022) employing timelapse imaging of nuclei, and timing the nuclear division of parasites, finds poor correlation between the duration of "schizogeny" (although perhaps using a different definition to the one used by the parasite) and the final number or merozoites. They therefore argue that there is poor evidence for a timer, and conclude by elimination that a counter must exist instead. A review by some of the authors of that study and some of this current study (Voß et al 2023), also concludes that the data from Klaus and colleagues "strongly support" a counter model. This current study also concludes that a counter model controls final nuclear/merozoite number in P. falciparum and P. knowlesi. This much at least is not particularly novel given the recent work on this topic, although the addition of the P. knowlesi data is interesting and consistent with the prior P. falciparum work. As above, the authors concede that it is difficult to determine with strong confidence when the first nuclear division has occurred, so it may well be that there is substantial noisiness in the time that they define schizogeny to commence. If that were the case, this would contribute to the poor correlation observed between schizogeny duration and number of merozoites produced, so this could be an important confounding experimental factor. This deserves some more discussion by the authors. Alternative methods to count absolute DNA content (rather than trying to count individual nuclei) might be useful ways of independently confirming this phenomenon. Alternative possibilities for what constitutes the "start" of a possible timer are also warranted - it could be for example, the first division of one of the other organelles.

      These and previous authors in any case conclude that a counter model must exist through exclusion of a timer model. I am less convinced that the evidence discounting the timer is conclusive, and that a straight counter model is the only alternative. Indeed I am unconvinced by the suitability of this strictly dichotomous two-model system to categorise the division of unicellular eukaryotes, and these theories are not universally held to be sufficient to describe division. Nonetheless, if a counter exists, what is being counted that determines the final number? The authors consider that this might be a physical object or resource inside the parasite, or an extrinsic/extracellular resource. They investigate this by comparing the final cell number to a number of factors. First, the authors investigate the size of the RBC (by musing the diameter as an indicator)- little information is given about the source of the blood used, but it appears to be from a single donor of unknown age, who has approximately typical variance in RBC diameter (at least, after manipulation and storage). The authors observe little correlation between these variables. Second the authors measure parasite size at the onset of schizogeny, and find that bigger parasites result in more daughter merozoites early in schizogeny (perhaps not surprising, given the earlier mentioned technical problems with measuring the first few steps of schizogeny), but that this different initial cell size doesn't result in a different final merozoite number, or as they describe it "not quite significant anymore". Previous p values were taken as cause for rejecting the timer hypothesis and the timer model. In this case the authors instead interpret the data as suggesting "that the setting of the counter might correlate with parasite cell size". This is inconsistent statistical and analytical handling, and highlights the earlier potential pitfall of rejecting timer-based models based on not gathering data that statistically show a correlation. This needs reworking to highlight that these data are inherently noisy, difficult to measure accurately, and aren't necessarily going strongly reveal a trend even where one biologically exists, and that this ought not be used as grounds for confident rejection of a model.

      Finally, the authors grow the parasites in dilute media, and find that they produce fewer daughter parasites. This is anecdotally unsurprising, as most Plasmodium laboratories are aware that sub-optimal growth conditions result in less healthy schizonts with fewer viable merozoites (and lower magnitudes of single-cycle expansion), but is nonetheless an important result that highlights explicitly how much this occurs in the specific conditions of dilute media. Given the lack of investigation of exactly which nutrient, carbon source, or combination thereof leads to the reduced merozoite number, it is unclear if or how much this is relevant to the scenario of a natural infection and realistic levels of that nutrient in a human or primate parasite environment.

      Minor issues

      The manuscript confuses the terms "less" and "fewer". Fewer should be used for countable nouns (fewer daughter cells, fewer nuclei, fewer merozoites), less for uncountable nouns (e.g. less speed, less volume).

      I didn't understand lines 93-95;<br /> "This excluded a timer and thereby confirmed a counter as the mechanism regulating termination of nuclear multiplication (Klaus et al., 2022). A direct correlation between duration of schizont stage and merozoite number is, however, still missing."<br /> If I understand the first sentence concludes that there ought not be a direct correlation between schizont duration and merozoite number, but the second sentence, says that that correlation is "however" missing. Isn't this expected? Perhaps reword for clarity?

      Lines 104<br /> "We further uncover that throughout schizogony P. falciparum infringes on the otherwise 105 ubiquitously constant N/C-ratio (Cantwell and Nurse, 2019)" This seems obvious to me, and not something uncovered by this study. In most of the numerous apicomplexans that divide by endoschizogeny, the cells achieve a near final size considerably before the final rounds of nuclear division so the N/C ratio must not remain constant - this is a direct corollary of many previous descriptions and not a novel finding of this study, and this claim here should be made more modest.

      I lack specialist statistical knowledge to comment on the statistical analyses performed on the correlation data, and in particular, whether the high p values for t-Tests for correlation are sufficient to support the argument that there is not a correlation, and whether these observations are sufficiently powered to robustly test that hypothesis.

      Significance

      The manuscript purports to find a counting mechanism that determines parasite merozoite numbers, and that this coutner is set by an externally provided and diffusible resource. Many nutrients are in excess in normal culture media, but not all. If that counted nutrient(s) were normally in excess in the bloodstream, it could hardly be said to be the factor that is counted and that therefore defines merozoite number. Conversely, if the amount of that nutrient were increased in normal media, would parasites make even more merozoites? Further, if the "counted" item is a freely diffusible compound in the media, it should be equally accessible to each parasite in a culture condition, and isn't a reasonable explanation for the variable merozoite numbers in the normal media conditions. To me, it is unsurprising that parasites that are healthy and well fed are able to produce more merozoites, but I don't see this as being the same as support for a counter model where the parasite senses and counts a set number of merozoites to produce in response to a specific external counter. I think the shoehorning of this phenomenon into a paradigm used to describe some other eukaryotes may not be appropriate, and that the rejection of one overly simplistic timer model should not automatically lead to us dichotomously accepting a simple counter method as the alternative. The authors need to do more to either identify a countable input whose gradual increase leads to a predictable and gradual increase in merozoite number, to show that they do use a counter, or provide substantially more caveats to their argument that the parasites are using a counter based on an externally provided resource to determine merozoite number.

      Audience - relatively specialised - likely interested audience would combine apicomplexan cell biologists, as well as theorists of cell division mechanism

      Advance - limited - confirms phenomenon also described by other researchers in their institute, and extends to another related organism.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      Malaria parasites replicating in human red blood cells show a striking diversity in the number of progeny per replication cycle. Variation in progeny number can be seen between different species of malaria parasites, between parasite isolates, even between different cells from the same isolate. To date, we have little understanding of what factors influence progeny number, or how mechanistically it is controlled. In this study, the authors try to define how the mechanism that determines progeny number works. They propose two mechanisms, a 'counter' where progeny number is determined by the measurement of some kind of parasite parameter, and a 'timer' where parasite lifecycle length would be proportional to progeny number. Using a combination of long-term live-cell microscopy and mathematical modelling, the authors find consistent support for a 'counter' mechanism. Support for this mechanism was found using both Plasmodium falciparum, the most prominent human malaria parasite, and P. knowlesi, a zoonotic malaria parasite. Of the parameters measured in this study, the only thing that seemed to predict progeny number was parasite size around the onset of mitosis. The authors also found that during their replication inside red blood cells, malaria parasites drastically increase their nuclear to cytoplasmic ratio, a cellular parameter remains consistent in the vast majority of cell-types studied to date.

      Major Comments

      • It is stated a few times in this study that P. knowlesi has an ~24 hour lifecycle, and while this is the case for in vivo P. knowlesi, it was established in the study when P. knowlesi A1-H1 was adapted to human RBCs (Moon et al., 2013) that this significantly extended the lifecycle to ~27 hours, which should be made clear in the text. As much of this study revolves around lifecycle length and timing, the authors should consider some of their findings with the context that in vitro adaption can significantly alter lifecycle length.
      • The dichotomous distinction between 'timer' and 'counter' as mutually exclusive mechanisms seems to be a drastic oversimplification. Considering the drastic variation we see in merozoite number across species, between isolates, and between cells, it seems much more likely that there are factors controlled by both time-sensed and counter-sensed mechanisms that both influence progeny number. Additionally, the only parasite parameter measured in this study, size at time of first nuclear division, explained only a small proportion of the variance observed in merozoite number.
      • For modelling of a timer-based mechanism, the designation of t0 is subjective. The authors chose the time of first nuclear division as their t0. It is possible that a timer-based mechanism could not be supported based on this model the chosen t0 differs from when the "parasite's timer" starts. For example, t could also have been designated as the time from merozoite invasion (t0) to egress (tend). It would be unreasonable to suggest the authors repeat experiments with a longer time-frame to address this, but this possibility should be discussed as a limitation of the model. It may also be possible to develop a different model where t0 = merozoite invasion and tend = egress, and test this model against the data already collected in this study.
      • The calculation of the multiplication rate is confusingly defined. In Figure 1 it is stated that it is "...based on t and n", which would imply that the multiplication rate is the number of merozoites formed per hour of schizogony, which would give an average value of ~2 for P. falciparum and ~1.5 for P. knowlesi. The averages rate values shown, however, are in the range of 0.15-3. The authors should clarify how these values were determined.
      • In Figure 2, the time from tend until egress is calculated, and this is interpreted as the time required for segmentation. In the Rudlaff et al., 2020 study cited in this paper, it is shown that segmentation starts before the final round of nuclear divisions are complete. Considering this, the time from tend until egress is not an appropriate proxy for segmentation time. The authors should consider rewording to something akin to "time from final nuclear division until egress" to more accurately reflect these data.
      • There is a significant discrepancy between the data in Figure 5 and Supplementary Figure 8. In Supplementary Figure 8, the authors establish that culturing parasites in media diluted 0.5x has a marginal effect on parasite growth, with no discernible change in parasitaemia over 96 hours. By contrast, in Figure 5a the parasitaemia of parasites cultured in 0.5x diluted media is approximately 5-fold lower than those in 1x media. The authors should explain the significant discrepancy between these results.
      • In Supplementary Figure 4, the mask on the cell at t0 shows two distinct objects, but it seems very unlikely that they are two distinct nuclei as they vary approximately 5-fold in diameter. The authors should provide more detail on how their masking was performed for their volumetric analysis. Specifically, whether size thresholds were also applied during object detection.

      Minor Comments

      • Line 45-48 mentions that merozoite number influences growth rate and virulence, but the corresponding reference (Mancio-Silva et al., 2013) only discusses the relationship between merozoite number and growth rate, not virulence.
      • Line 59 states that a 48 hour lifecycle is a baseline from which in vitro cultured parasites deviate. Clinical isolates also show variation in lifecycle length and so it is more accurate to just say that 48 hours is an average, rather than a baseline.
      • Line 63 cites a study for the lifecycle length of P. knowlesi (Lee et al., 2022), but there seems to be no mention of lifecycle length in this reference
      • If I am interpreting Figure 3B correctly, this is essentially a paired analysis where the same erythrocytes are measured twice, once at t0 and once at tend. If this is the case, this data may be better represented with lines that connect the t0 and tend values.
      • Figure 3A seems to imply that to calculate diameter of the erythrocytes, three measurements were made and averaged for each cell. I think this is a nice way to get a more accurate erythrocyte diameter, but if this is the case, it should be specified in the figure legend or methods.
      • In Figure 4I it is shown that in P. falciparum merozoite number doesn't correlate with nucleus size, but for P. knowlesi in Supplementary Figure 7c, a significant anticorrelation is observed. The authors should state this in the text and discuss this discrepancy.
      • The authors show that merozoite number roughly correlates with cell size at t0 but it would be interesting to see whether cell size at tend also corresponds with cell size at t0. This might help answer whether the cell is larger because it has more merozoites, or whether it has more merozoites because it is larger.
      • I don't feel that "nearly identical" is an appropriate summary of erythrocyte indices in Supplementary Figure 9, considering there is a statistically significant increase in mean cell volume. I think it is unlikely that this change is consequential, and performing these haematology analyses is a nice quality control step, but this change should be stated in the text.
      • In Supplementary Figure 8, parasitaemia only increases ~2-fold compared to >5-fold the previous two cycles. It seems likely that at the final timepoint on this graph the parasites are starting to crash, and therefore it may be best to end the graph with the 96 hour timepoint.
      • The error bars in Figure 5C aren't easily visible, moving them in front of the datapoints may help their visibility.
      • In Figure 6D & E, the y-axis labels should be changed to whole integers as all the values in the graph are whole numbers.
      • My interpretation of Figure 6 C-E, is that these are the same cells measured at three time points (t-2, t0 and tend). If this is the case, 6C is missing the cell that has a merozoite number of 8, which is presumably why the y-axes are not equalised for the three graphs.

      Significance

      In the asexual blood-stage of their lifecycle, malaria parasites replicate through a process called schizogony. During schizogony an initially mononucleated parasite undergoes multiple asynchronous rounds of mitosis followed by nuclear division without cytokinesis, producing a variable number of daughter nuclei. Parasites then undergo a specialised cytokinesis, termed segmentation to where nuclei are packaged into merozoites that go on to invade new host cells. While nucleus, and therefore merozoite, number are known to be varied between cells, across isolates, and across species, little is known about the mechanisms regulating merozoite number. In this study, the authors use live-cell microscopy to understand how parasites determine their progeny number. They suggest that parasites regulate their progeny number using a 'counter' mechanism, which would respond to the size or concentration of a cellular parameter, as opposed to a 'timer' mechanism. Long-term live-cell microscopy experiments using malaria parasites are extremely technically challenging, and the authors should be commended for their efforts in this regard. While I agree that the data generated from these experiments are technically sound, I have some reservations expressed above about the interpretation of some of these results. I would strongly encourage the authors to consider rewording some of their interpretations taking into account some of the caveats listed above. I would also consider fitting/testing an additional mathematical model where the time-frame proposed for the 'timer' mechanism begins following merozoite invasion.

      This work is of specific interest to anybody who grows malaria parasites, as the dynamics of their growth is obviously important to understand. Further, this work is of interest more generally to cell biologists who study the regulation of progeny number or cell size. I have no experience with the application of mathematical modelling to understand biological systems, and so I cannot comment on the interest of this work to that field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary:<br /> In this original article from Mutscher et al., the authors developed a compartmentalized dissociated mouse myelinating SC-DRG coculture system to investigate the distinct roles of Schwann cells in axon protection and degeneration after injury. The innovation of this approach relies on (i) the use of mouse SCs and mouse DRGS neurons instead of rat cells; (ii) the use of microfluidic chambers, seeded by axons and SCs in different compartment; (iii) the possibility to perform a traumatic injury in vitro. While this novel approach offers new ways to study peripheral nerve regeneration and SC-axon interaction, and technical the study is robust, the paper is currently limited by the exploration of their model.

      Major points:<br /> Reviewer 1. It is unclear is this approach will ever lead to the identification of key mechanism or key candidates. This is a major miss in the current manuscript form. In short: the authors should demonstrate that their in vitro system can lead to significant leap in our understanding of peripheral nerve regeneration by identifying novel targets/pathways or mechanisms.

      Author response: We agree with the reviewer that cell culture approaches have limitations however we would disagree that it is not a viable approach given that a number of seminal studies in the field have already helped identified key cellular and molecular steps using rat SC-DRG cocultures or using mouse DRGs and rat SCs in combination with in vivo study. We have added the following to the introduction to highlight this point in more detail:

      Introduction.

      Dissociated myelinating SC-DRG cocultures from rats were first developed by the Bunge laboratory in the 1980’s to investigate PNS myelination in a more dynamic way (Bunge et al, 1989; Eldridge et al, 1987)__. These cultures have been used to make seminal discoveries in uncovering the cellular and molecular mechanisms of SC myelination alongside in vivo investigation. These include how the inner SC membrane (mesaxon) advances to myelinate axons, and the role of b__-neuregulin-1 (__b__NRG1) and polarity proteins in SC myelination (Bunge et al, 1989; Shen et al, 2014; Chan et al, 2006; Taveggia et al, 2005)__. Similarly, SC-DRG cocultures have been useful in demonstrating how SCs proliferate after axon injury, transfer metabolites, such as pyruvate, to delay axon degeneration, how placental growth factor (Plgf) regulates axon fragmentation by SCs and how SC JUN promotes axon outgrowth after injury (Arthur-Farraj et al, 2011; Babetto et al, 2020; Vaquié et al, 2019; Salzer & Bunge, 1980)__. The use of a coculture system to study axon-SC interactions during axon degeneration and regeneration offers some advantages over in vivo approaches as both neurons and SCs can be genetically manipulated separately and live imaged with ease.

      Discussion.

      Most importantly SCs and DRG neurons from various transgenic mice can be used to perform in vitro analysis to complement findings from in vivo transgenic mouse studies.

      Author response: Furthermore, as this is a methods paper, demonstrating novel molecular mechanisms is outside the scope of this article. However, we have already used this technique with a collaborator to study the role of cdk7 in myelination (see link to conference abstract below) and this manuscript is under preparation to be submitted soon. Additionally, we have ongoing projects within the lab using this technique to help characterise novel molecular targets in nerve injury. https://scholar.google.com/citations?view_op=view_citation&hl=en&user=uEtwAd8AAAAJ&sortby=pubdate&citation_for_view=uEtwAd8AAAAJ:_Qo2XoVZTnwC

      Author response: Despite this, we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature.

      Author response: We have realised through all of the reviewers’ comments that the title and the aims of the manuscript were confusing. We have made this clearer by removing the word novel in the title changing the title to the following:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have also made it much clearer what the purpose of our study is and where and how it fits in with the previous literature by adding the following paragraph to the introduction.

      Introduction.

      Indeed there has only ever been one laboratory detailing convincing myelin formation in dissociated mouse myelinating SC-DRG neuron cocultures, however this was never published as a step by step detailed protocol (Stevens et al, 1998; Stevens & Fields, 2000)__. In the last twenty years, there have been no published studies demonstrating myelination in fully dissociated mouse SC-mouse DRG cocultures. This has largely prevented the use of cells, particularly SCs, from transgenic mice in cocultures and thus restricted the ability to study SC-axon interactions in a system that can be readily manipulated and live imaged and results directly applied back to in vivo findings in the same species.

      Reviewer 1. The use of embryonic DRG neurons or SC isolated from P2 animals are arguably physiologically not the same cells that are affected by traumatic nerve injury, which happen most often than not in adult. This is a problem in the long-term reliance on this approach to study axotomy peripheral nerve regeneration.

      Author response: We agree with the reviewer that one should always be cautious with the use of embryonic/neonatal cells to directly refer them to adult cellular mechanisms. We have added discussion of this point to the discussion:

      Discussion.

      One limitation of our coculture model and indeed all coculture and cell culture models that are used to investigate cellular and molecular mechanisms in nerve injury is that the cells are obtained from embryonic or neonatal animals. This is an important caveat when applying results from cell culture to adult in vivo nerve injury. However, while we would argue that cell culture approaches should always be used in combination with in vivo study it is important to remember that nerve injury is not restricted to adults and brachial plexus injury secondary to birth trauma is unfortunately a significant clinical problem (Pondaag et al, 2007)__. Furthermore, neonatal SCs replicate many of key cellular and molecular mechanisms seen in adult SCs after injury, including JUN upregulation, myelinophagy, promotion of axon growth and expression of key repair program transcripts (Arthur-Farraj et al, 2012; Gomez-Sanchez et al, 2015; Arthur-Farraj et al, 2017; Parkinson et al, 2008)__. A future development would be to try to adapt this protocol to make a coculture model with adult mouse or even human cells.

      Author response: Additionally, we already know Schwann cells in P5 neonatal mice in vivo after nerve transection demyelinate in a similar way to Schwann cells in adult mice and that neonatal cells in vivo and in vitro require the transcription factor c-Jun to do so (Parkinson et al., 2008 JCB Fig.7).

      Moderate points:<br /> Reviewer 1"there are no established protocols in the field describing the use of mouse SCs with mouse DRG neurons in dissociated myelinating cocultures". The use of mouse cells is laudable, but it is not necessarily a technical innovation, or at least the current manuscript does not explain why their approach particularly suitable to mouse Schwann cells.

      Author response: We feel that a detailed working protocol for compartmentalised dissociated mouse myelinating cocultures showing convincing and extensive myelination has been missing from our field for a long time. We agree that it is an incremental technical advance, but it is an important one. We have modified the title as we explained above. We have explained this point more clearly in the introduction, results, and discussion with the following additions:

      Introduction.

      Indeed there has only ever been one laboratory detailing convincing myelin formation in dissociated mouse myelinating SC-DRG neuron cocultures, however this was never published as a step by step detailed protocol (Stevens et al, 1998; Stevens & Fields, 2000)__. In the last twenty years, there have been no published studies demonstrating myelination in fully dissociated mouse SC-mouse DRG cocultures. This has largely prevented the use of cells, particularly SCs, from transgenic mice in cocultures and thus restricted the ability to study SC-axon interactions in a system that can be readily manipulated and live imaged and results directly applied back to in vivo findings in the same species.

      The consensus within the field is that inducing myelination in dissociated mouse SCs is challenging. Certainly, induction of myelin differentiation with cyclic adenosine monophosphate (cAMP) analogues or elevating agents, such as forskolin, is more difficult in mouse SC monocultures compared to rat SC cultures. This is because mouse SCs require additional exogenous b__-neuregulin-1 (__b__NRG1), plating on poly-L-lysine (PLL) instead of poly-D-lysine (PDL), and low concentration horse serum as opposed to foetal calf serum (Stevens et al, 1998; Arthur-Farraj et al, 2011; Päiväläinen et al, 2008)__.

      Author response: We have now explained more clearly that without plating on Matrigel and the regular addition of Matrigel to the myelination medium that mouse cocultures do not myelinate with ascorbic acid or indeed addition of NRG1 nor forskolin. Please see NEW DATA in Supplemental figure 1. We have added the following paragraph to the results section.

      Results

      Importantly, we found that L-ascorbic acid was insufficient to induce substantial myelination in our cultures, unlike in rat SC-DRG cocultures, and in the one previously published dissociated mouse SC-DRG protocol (Stevens et al, 1998)__. In fact, plating cocultures on laminin, adding ascorbic acid (50 m__g ml−1), b__NRG1 (10 ng ml−1) and forskolin (10 m__M) induced very few myelin sheaths (Supp. Fig. 1). Only when cultures were plated on Matrigel__â and further Matrigel__â was added to the myelination medium for each medium change, were we able to visualise robust reproducible myelination in our cocultures (Supp. Fig. 1).

      Author response: We have also added further discussion on how our protocol differs from the Stevens 1998 and other protocols in the discussion.

      Discussion

      Our protocol differs somewhat from the one used by Stevens et al., 1998 to induce myelination in dissociated mouse SC-DRG cocultures__, as they used ascorbic acid and 10% horse serum and presumably plated their cultures on laminin, though they do not explicitly detail this (Stevens et al, 1998)__. In our preliminary experiments we were unable to visualise much myelination with use of laminin, ascorbic acid or indeed if b__NRG1 and high concentration forskolin was added to the medium for up to four weeks. However, if we plated cocultures on Matrigel® and continuously added it to the myelination medium then we saw comparable levels of myelination in our mouse cocultures to that of rat cocultures (Eldridge et al, 1987)__. This approach of using Matrigel® to enhance myelination has previously been successfully employed in cultures of human iPSC sensory neurons with rat SCs and in non-dissociated mouse DRG explant cultures (Clark et al, 2017; Päiväläinen et al, 2008)__. Importantly, we used growth factor depleted Matrigel® as standard Matrigel® preparations contain substantial amounts of Transforming growth factor b (TGF__b__) which is a known inhibitor of myelination (Einheber et al, 1995)__. Additionally, the majority of rat and mouse coculture protocols plate cells on glass whereas we found cultures were healthier and myelinated better when cultured on plastic Alcar® coverslips.

      Discussion

      Furthermore, as this is a dissociated and compartmentalised purely mouse cell culture system, one can utilise the vast array of transgenic and knockout lines available to study neuron-SC interactions in more detail, without concern of contaminating endogenous SCs and other non-neuronal cells that remains a drawback of current mouse dissociated or non-dissociated DRG explant models.

      Reviewer 1: The figures in the paper are largely descriptive. They are very little quantitative measurement. Thus, the readers will have a hard to determine, if they replicate the proposed approach, whether their efficient is on par with the current authors.

      Author response: We have now added quantification of myelin segments per mm2, percentage of SCs that myelinate and quantification of the interperiodic distance of the myelin formed. This is all included in a new version of TABLE 1. We have discussed this data in the results section as follows.

      Results

      Quantifying the number of PRX positive myelin segments we found that there were 325.33 ± 12.3 sheaths per mm2, comparable to what has been originally described in rat SC/DRG cocultures and two fold more extensive myelination than recently described compartmentalised rat cocultures models (n=3; Table 1; Eldridge et al, 1987; Vaquié et al, 2019)__. Furthermore, 25.47 ± 1% of Schwann cells were myelinating in our cultures (n=3; Table 1). To confirm that cocultured myelinated SCs formed compact myelin we performed electron microscopy (EM), which revealed compact myelin formation with multiple myelin wraps and formation of readily visible major dense (MDL) and intraperiod lines (IPL; Fig. 2D). Additionally, we measured the periodicity, i.e., the distance between two adjacent major dense lines, to make sure myelin was compacted. Interperiodic distance was 12.16 ± 0.28 nm, in line with previous reports (n=3; Table 1; Boutary et al, 2021; Fernando et al, 2016; García-Mateo et al, 2018; Giese et al, 1992; Perrot et al, 2007)__.

      Author response: Interestingly, after we performed this analysis, we realised we have double the level of myelination in our mouse cultures (325.33 ± 12.3 per mm2) than in the compartmentalised myelinating rat cocultures in Vaquie et al., 2019 (147 ± 27 internodes per mm2 (n = 3)).

      Author response: We have also quantified the JUN upregulation after injury in both myelinating and aligned cocultures. See Fig. 3B-E).

      Results

      Additionally, we noted a strong upregulation of JUN protein in SCs 12 hours after axotomy (Fig. 3B and C). We also saw significant JUN upregulation 12 hours after axotomy in cocultures with aligned SCs (Fig. 3D and E).

      Author response: The above quantification is in addition to the quantification of the rate of axon degeneration in the presence and absence of aligned and myelinating Schwann cells in Fig. 4B. We have also quantified the % of Schwann cells that contained axonal debris after injury – this data is now quoted in the text as we removed Fig. 4E.

      We thank the reviewer for asking for additional quantification as this has improved the manuscript.

      Minor points:<br /> Fig.4B and E should show individual data points.

      Author response: We have added the individual data points to Fig.4B. We have removed Fig.4E and instead quoted the data in the results section as follows:

      Results

      When we quantified this phenomenon, we found that 97.84 ± 1.462% (n=2) of SCs in our cocultures contained mCherry-labelled axonal fragments.

      Reviewer #1 (Significance):

      In addition, to the demonstration of feasibility of this in vitro approach, the main finding by the authors is that SCs have a role in neuronal protection and support is key for peripheral nerve regeneration. Thus while in vitro approach does not add key information that do not already exists in the field, it somewhat confirms that the effect is SC autonomous. Overall the approach is interesting and has potential, but the study currently lack a demonstration of its usefulness to the community.

      It would have been interesting to have the authors discuss the advantages of their approach in comparison to other innovative approaches to study SC-axon interactions that have been developed in the last decade (i.e., 3D environment, microfluidic approach, transwell systems). There is also a lack of citations about similar studies in the field.

      Author response: We direct the reviewer to the following paragraphs in the introduction and the discussion, which we have now elaborated on further post peer review. We discuss all relevant cocultures studies in mouse as well as all the relevant microfluidic studies and 3D coculture studies as well as the one human nerve organoid study. We found two additional studies, Numata-Uematasu 2023 using DRG mouse explant cultures and Park et al., 2021 using motorneuron-SC cocultures which we have now added to the discussion. We also briefly discuss transwell studies to assess migration as the reviewer requested.

      We also discuss in detail the two microfluidic coculture injury studies Babetto et al., 2020 and Vaqiue et al., 2019 extensively throughout the manuscript. We have added further discussion of the similarities and differences between theirs and our approach in the discussion.

      Introduction.

      Protocols exist where endogenous mouse SCs are used to myelinate dissociated or non-dissociated DRG explant cultures. (Shen et al, 2014; Harty et al, 2019; Sundaram et al, 2021; Stettner et al, 2013; Numata-Uematasu et al, 2023)__. Furthermore, another protocol seeded exogenous SCs onto non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__. Other laboratories seed cultured rat SCs onto dissociated mouse DRG axons (Taveggia & Bolino, 2018)__. Use of dissociated or non-dissociated DRG explants cultures preclude many experimental uses, such as using SCs from different transgenic animals and separate transfection of SCs and neurons with viruses for live imaging or genetic manipulation, and easy use of microfluidic chambers to allow injury studies and separate drug treatments to neurons or SCs. The reason for this is that antimitotics cannot be used in dissociated or non-dissociated DRG explant cultures as this depletes SCs, and the culture quickly becomes contaminated with other non-neuronal cell types, such as satellite cells and fibroblasts migrating out of the DRG. Furthermore, use of exogenous SCs in a non-dissociated DRG explant culture risks, after a period of antimitotic exposure, which was developed by Päiväläinen et al., 2008 still risks potential contamination from endogenous SCs and satellite glia migrating out of the DRG explant over time. This occurs because antimitotic treatment is unlikely to fully penetrate the whole DRG without prior dissociation. Additionally, a compartmentalised culture system cannot be readily used with non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__.

      Discussion.

      Furthermore, our protocol is complementary to the recently described 3D mouse myelinating SC-motor neuron coculture system using collagen hydrogels (Hyung et al, 2021; Park et al, 2021)__. It will be interesting in the future to up titrate the concentration of Matrigel®, which is similar to collagen hydrogels, in our cultures to see whether further increasing extracellular matrix viscosity and stiffness improves our myelination efficiency even further. While it is possible to study cell migration in microfluidic cell culture devices, transwell models offer significant advantages to study this cellular phenomenon (Negro et al, 2022)__. To date, there have been no published studies of successful myelination in human SC-neuron coculture systems. Despite this rat SCs have been shown to readily myelinate human-induced pluripotent stem cell (iPSC)-derived sensory neurons and an iPSC-derived peripheral nerve organoid system which does contain myelinating SCs has recently been described (Clark et al, 2017; Van Lent et al, 2022)__.

      These findings confirm the observations of both Babetto et al., 2020 and Vaquié et al., 2019 who used rat SCs in similar microfluidic culture systems (Vaquié et al, 2019; Babetto et al, 2020)__. We have shown that the axo-protective observation seen by Babetto et al., 2020 does not rely on myelination status, which was an outstanding question from that study (Babetto et al, 2020)__. Furthermore, in an advance from previous studies, we have visualised the axo-protective and axon clearance phenomena in the same culture and shown that they are temporally separated, with axon fragmentation and debris clearance by SCs occurring at much later timepoints after axotomy. Several different culture and experimental conditions preclude direct comparison of our study with both those of Vaquié et al., 2019 and Babetto et al., 2020. These include the use of rat SCs in both prior studies and that Babetto et al., 2020 mixed rat SC with mouse DRG axons; length of time in culture, time points quantified after injury and distance from injury and site of analysis. Babetto et al., 2020 performed axotomy on relatively short term cocultures (6 days in vitro) whereas Vaiquié et al., 2019 cultured for at least 4 weeks and in our case 6 weeks prior to axotomy. Vaiquié et al., 2019 removed nerve growth factor (NGF) prior to laser axotomy and quantified proximally (though also imaged distally) whereas both our study and Babetto et al., 2020 kept NGF in the medium, performed axotomy with a scalpel and quantified more distally and in our case extremely distal, where only individual neurites and no axon bundles were visible. Vaquié et al., 2019 had SCs on both sides of the barrier in the microfluidic chambers whereas we seeded SCs only in the axonal/bottom compartment. Additionally, our cocultures had both forskolin and b__NRG1 added to help induce myelination, whereas these factors are not required in rat myelinating cocultures. Finally, it is important to permeabilise myelinated cultures with acetone after fixation__, as we did, to visualise the entire axon through heavily myelinated segments otherwise axon integrity cannot be reliably assessed in a quantitative manner (Vaquié et al, 2019; Babetto et al, 2020)__.

      Because of the lack of key novel mechanisms, and lack of discussion on what this approach is superior to others in vitro approach limits the impact of the study and the excitement of the reader, even from the SC-axon community.

      Author response: We have developed the first compartmentalised fully dissociated mouse myelinating coculture system in over twenty years. Thanks to the reviewer’s suggestions, we have now shown that myelination is comparable to the original rat cocultures from the Bunge lab, which is the gold standard in the field, and superior to recently described compartmentalised rat coculture system by Vaquie et al., 2019. We have provided a detailed step by step protocol to allow other researchers to use our technique.

      Additionally, thanks to the reviewer, we have now described in detail exactly how our protocol differs from others and why we succeeded to get mouse SCs to myelinate so robustly in a fully dissociated coculture (see previous answers). This is an incremental but important advance given that studies currently use a coculture system using entirely cells from rat, or where rat Schwann cells are seeded on mouse axons, or dissociated or non-dissociated mouse explant cultures are used which abrogates using neurons and SCs from different transgenic mice.

      As this is a methods paper, we did not intend to describe novel molecular mechanisms though our method is already being used for such purposes by ourselves and a collaborator as outlined above in prior answers. We did not make this clear and we hope the extensive revision of the manuscript now addresses this point. Despite this, we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature.

      Finally, in addition to myelination, we have demonstrated that one can study all the key components of the SC and axonal response to injury in a quantifiable way in addition to demonstrating that these cocultures can be live imaged and used for drug studies. None of the prior mouse studies looked at injury responses of axons nor SCs. We believe this will be of use to the community.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In the manuscript entitled "Distinct axo-protective and axo-destructive roles for Schwann cells after injury in a novel compartmentalised mouse myelinating coculture system", Arthur-Farraj and colleagues detail a method of dissociated coculture of mouse-DRG neurons and Schwann cells in microfluidic chambers. In this system, neurons and Schwann cells harvested from the same or from different animals are grown in different compartments that are connected by microgrooves, thereby allowing for spatial and diffusive separation. Neurons are shown to extent their axons across the microgroove barrier to the glial compartment where Schwann cells align with the axons and myelination can be induced. Detailed analysis of myelination and axon injury/degradation are presented as use cases, including the capability to genetically and pharmacologically manipulate neurons and Schwann cells independently, which also enabled fluorescent life cell imaging. The authors then examine the effect of immature/premyelinating and myelinating Schwann cells on the rate of axon degeneration. Upon axotomy Schwann cells significantly delayed degeneration, with no difference between non-myelinating and myelinating Schwann cells. Finally, live imaging during axon degeneration with fluorescent proteins separately expressed in neurons and Schwann cells demonstrated that Schwann cells ingest axonal fragments.

      Major comment:<br /> In establishing a much needed in-vitro system for PNS myelination and injury research the paper represents a valuable contribution to the PNS community. However, I find the presentation of aspects concerning a protective/destructive role of Schwann cells somewhat inconclusive. That these roles exist has been known, as the authors discuss. Then what does this study contribute concerning the open question that was raised by the discrepancies between Babetto et al., 2020 and Vaquié et al., 2019, i.e. how Schwann cells contribute to axon survival/regeneration after injury? Essentially, the only significant conclusion in this regard is that myelinating and non-myelinating mouse Schwann cells do not differ in their capability to protect axons from degeneration. The manuscript, including the title, would benefit from focusing more on this aspect. In particular, the discussion of the factors that lead to the still remaining apparent discrepancies between Babetto et al., 2020 and Vaquié et al., 2019 and this study should be revised. The authors state that "The study by Vaiquié et al., 2019 quantified axon fragmentation proximally in the microgrooves at timepoints starting from 12 hours after axotomy." (Discussion). While this observation is accurate, Jacob and colleagues also show accelerated, obviously distal axon degeneration in the presence of Schwann cells (Figure 3C in Vaquié et al., 2019). It is therefore unlikely that the discrepancies stem from analysis of more proximal vs. more distal axons, or the timepoints of analyses. In my opinion, a further study (using the coculture system presented in this manuscript) that compares the role of Schwann cells from rats and mice, and that includes analysis of more proximal and distal axon degeneration as well analysis of axon regeneration is needed. In a rework of the manuscript, the authors may therefore elaborate more on the shortcomings of the present study, or alternatively soften the aims of the study in the first place.

      Author response: We thank the reviewer for their comments, and we agree that the title and the aims of the manuscript were confusing. We didn’t make it explicit that this was a methods paper, and that we didn’t intend to show entirely novel findings but instead thoroughly characterise our mouse system so that it is comparable to what has been done for rat cocultures. We have now made this clearer by removing the word novel in the title and changing the title to the following:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have decided to focus the manuscript more on the comparison of myelinating versus non-myelinating cocultures, given that we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature. We have added further characterisation of our aligned cocultures with p75NTR immuno and EM images (Fig.1D and E). We have added the following summary of how our study relates to findings of Babetto et al., 2020 and Vaquie et al., 2019 in the discussion, in line with the reviewer’s suggestions.

      Discussion.

      These findings confirm the observations of both Babetto et al., 2020 and Vaquié et al., 2019 who used rat SCs in similar microfluidic culture systems (Vaquié et al, 2019; Babetto et al, 2020)__. We have shown that the axo-protective observation seen by Babetto et al., 2020 does not rely on myelination status, which was an outstanding question from that study (Babetto et al, 2020)__. Furthermore, in an advance from previous studies, we have visualised the axo-protective and axon clearance phenomena in the same culture and shown that they are temporally separated, with axon fragmentation and debris clearance by SCs occurring at much later timepoints after axotomy.

      Author response: We have now added more in-depth discussion of the similarities and differences between Babetto et al., 2020 and Vaquie et al., 2019 and our approach in the discussion.

      Discussion.

      Several different culture and experimental conditions preclude direct comparison of our study with both those of Vaquié et al., 2019 and Babetto et al., 2020. These include the use of rat SCs in both prior studies and that Babetto et al., 2020 mixed rat SC with mouse DRG axons; length of time in culture, time points quantified after injury and distance from injury and site of analysis. Babetto et al., 2020 performed axotomy on relatively short term cocultures (6 days in vitro) whereas Vaiquié et al., 2019 cultured for at least 4 weeks and in our case 6 weeks prior to axotomy. Vaiquié et al., 2019 removed nerve growth factor (NGF) prior to laser axotomy and quantified proximally (though also imaged distally) whereas both our study and Babetto et al., 2020 kept NGF in the medium, performed axotomy with a scalpel and quantified more distally and in our case extremely distal, where only individual neurites and no axon bundles were visible. Vaquié et al., 2019 had SCs on both sides of the barrier in the microfluidic chambers whereas we seeded SCs only in the axonal/bottom compartment. Additionally, our cocultures had both forskolin and b__NRG1 added to help induce myelination, whereas these factors are not required in rat myelinating cocultures. Finally, it is important to permeabilise myelinated cultures with acetone after fixation__, as we did, to visualise the entire axon through heavily myelinated segments otherwise axon integrity cannot be reliably assessed in a quantitative manner (Vaquié et al, 2019; Babetto et al, 2020)__.

      Author response: We would like to add that we showed Claire Jacob, senior author of the Vaquie et al., 2019 study, our manuscript prior to peer review and she offered helpful comments, which we incorporated into the manuscript, which is why she is acknowledged. We have also discussed our findings with Elisabetta Babetto as well.

      While it would be a great future study to compare both axon degeneration rates in rat and mouse cocultures this was not the original intention of our study. We believe we have included enough detail of our experimental procedures, including the distance from the barrier we imaged axon degeneration, a crucial bit of information missing from the other studies, should others want to perform a comparative study between rats and mice.

      Methods

      Five images at a distance of between 1.2-1.4mm from the microgroove barrier (the most distal part of the culture that could be imaged) were quantified per culture, taken in comparable locations in each culture.

      Author response: I would add that one of our previous studies (Arthur-Farraj et al., 2012 Neuron, Fig5I) has already looked at axon outgrowth/regeneration in dissociated non-myelinating mouse SC-DRG co-cultures. We showed the presence of Schwann cells accelerates axon regeneration/outgrowth and this relies upon Schwann cell c-JUN.

      We have now added quantification of the extent of myelination in our cocultures and it is comparable to the original Bunge lab rat cocultures and more extensive than the Vaquie et al., 2019 coculture.

      Results

      Quantifying the number of PRX positive myelin segments we found that there were 325.33 ± 12.3 sheaths per mm2, comparable to what has been originally described in rat SC/DRG cocultures and two fold more extensive myelination than recently described compartmentalised rat cocultures models (n=3; Table 1; Eldridge et al, 1987; Vaquié et al, 2019)__. Furthermore, 25.47 ± 1% of Schwann cells were myelinating in our cultures (n=3; Table 1).

      Methods

      To quantify the number of myelin segments per area, we counted the number of myelin segments for five areas per culture for three cultures and normalised this per mm2. To quantify the percentage of Schwann cells in myelinating cocultures that are actively myelinating, we quantified the number of myelin segments and the number of DAPI-positive nuclei for five areas per culture for three cultures.

      Minor comments:<br /> - Figure 2D: From the electron micrograph there is no doubt that compact myelin is formed, however to me it seems the compaction is not complete. A rough estimation with the aid of the provided scale bar resulted in an interperiodic distance of about 17 nm, which contrasts with the remarkably well reproduced values reported in multiple reports using conventional specimen preparation (like in this study), of which I am citing just a few: about 13 nm in rat ex vivo nerve (Peterson and Pease, J. Ultrastruct Res 1972; Fledrich et al., Nat Commun 2018), 12 nm (Giese et al., Cell 1992), 12.2 nm (Perot et al., J Neurosci 2007), about 12 (Fernando et al., Nat Commun 2016), about 13 nm (García-Mateo et al., Glia 2017) or about 13 nm in mouse ex vivo (Boutary et al., Commun Biol 2021), which was also reproduced with about 13 nm in rat in vitro (Taveggia and Bolino, Methods Mol Biol 2018). The authors should acknowledge this deviation and might discuss possible reasons.

      Author response: We have now provided a more representative EM image of our myelination (Fig. 2D). Additionally, thanks to the reviewer’s comments we have now quantified the interperiodic distance and find it is comparable to the studies the reviewer suggested. We have added the data to the new Table 1 and added the references the reviewer advised. Please see the additions to the methods and the results section regarding this new data below.

      Results

      To confirm that cocultured myelinated SCs formed compact myelin we performed electron microscopy (EM), which revealed compact myelin formation with multiple myelin wraps and formation of readily visible major dense (MDL) and intraperiod lines (IPL; Fig. 2D). Additionally, we measured the periodicity, i.e., the distance between two adjacent major dense lines, to make sure myelin was compacted. Interperiodic distance was 12.16 ± 0.28 nm, in line with previous reports (n=3; Table 1; Boutary et al, 2021; Fernando et al, 2016; García-Mateo et al, 2018; Giese et al, 1992; Perrot et al, 2007)__.

      Methods

      To measure interperiodic distance, we measured at least 10 periods per myelinated fibre for at least three fibres per sample for three separate samples.

      • Figure 4A,B: The result of a slowed axon degeneration in coculture relies on the accurate assessment of continuity of the NFL staining. While the authors report that acetone permeabilization was necessary to afford complete penetration of the used antibodies in myelinating cultures, I cannot see why the authors have not used the same staining protocol for all cultures, as it is detailed in the method section. While I consider it unlikely that the staining conditions have led to an apparent delay of degeneration in coculture, experiments should generally be performed under identical conditions, unless there are good reasons not to do so. If this is not the case, it will be reassuring to see the same effect when identical staining conditions are employed. On the same note, do the compared cultures have the same age, i.e. have the neuron monocultures been in vitro for the same time as the cocultures?

      Author response: We thank the reviewer for picking up this inaccuracy in the manuscript. We can confirm that for the purposes of the axon degeneration experiment all cultures were stained using exactly the same staining protocol. Additionally, we were very careful to maintain all cultures for exactly the same time in culture – 6 weeks. Additionally, axon only cultures were maintained in myelination medium to make sure medium constituents were not responsible for the observed differences in degeneration rate. We have added the following elaboration to the methods section to clarify these points.

      Methods

      Axon only cultures related to Figure 1 were permeabilised in PBS + 0.5% Triton (Merck) + 5% HS + 5% donkey serum (DS, Merck - D9663) at RT for 1 hour. For the purposes of quantifying the rate of axon degeneration (Figure 4) both axon only cultures and cocultures with SCs were permeabilised in 50% Acetone for 2 minutes, 100% Acetone for 2 minutes, 50% Acetone for 2 minutes (all at RT), and then blocked in PBS + 0.5% Triton + 5% HS + 5% DS at RT for 1 hour.

      Methods

      All cultures (axon only, aligned SCs and myelinating SCs) were cultured for 6 weeks prior to axotomy experiments. To minimise the possibility that medium constituents were responsible for differences in axon degeneration rates, axonal compartments of axon only cultures were cultured in medium containing 10 ng ml-1 b__NRG1 and 10 m__M forskolin (axon only medium, extended methods section D6) once SCs were seeded on other cultures, and then switched into myelination medium (additional Matrigel__â and 50 m__g ml−1 L-Ascorbic Acid), 24 hours before axotomy. Bottom compartments of aligned SC cultures, 24 hours before axotomy, were switched into DRG/SC medium containing 10 ng ml-1 b__NRG-1, 10 m__M forskolin and 50 m__g ml−1 L-Ascorbic Acid, which is insufficient to induce myelination in mouse cultures. Bottom compartments of myelinating cocultures were medium changed into fresh myelination medium (Extended methods section D7) 24 hours prior to axotomy.

      • In several instances of the manuscript, the term "transfection" is used to refer to lentiviral gene transfer. I advise to use the more appropriate term "transduction" instead
      • I could not seem to find a meaningful reference to the microfluidic chambers that were used in the study. The protocol should contain details on the device and source of supply in order to enable potential readers to execute the protocol

      Author response: We thank the reviewer for this comment. We have replaced transfection with transduction throughout the manuscript. Please see the track changes manuscript for all instances.

      Reviewer #2 (Significance):

      The paper presents a convincing establishment of a dissociated coculture derived exclusively from mouse that leads to robust myelination. As the manuscript correctly states, Schwann cell culture and especially coculture with neurons has been experienced difficult in the field, and by providing a detailed protocol as well as demonstrating how the coculture system can be used to address important questions of PNS myelination and repair, the paper fills an important gap. However, the experiments directed to the role of Schwann cells in axon degeneration do not clarify much, which should be better addressed in the discussion and also by modifying the title accordingly.<br /> The paper will be of high value for basic researchers that are interested in performing studies addressing cellular and molecular mechanisms of myelination and repair in the PNS. Importantly, the paper can pave the way to usage of transgenic or knockout mouse models in coculture. Thereby it might spark interest also in those researchers that use transgenic and knockout mouse models and who have so far refrained from using coculture models.

      Field of expertise of the reviewer: Cellular and molecular mechanisms of myelination and growth signaling in the PNS; in-depth experience with DRG coculture models from rats and mice

      Author response: We thank the reviewer for their kind comments. We have now modified the title, aims and discussion of the manuscript in line with the reviewer’s suggestions.

      Reviewer #3 (Evidence, reproducibility and clarity):

      SUMMARY<br /> The authors present a detailed protocol for co-cultures of mouse DRGs with mouse SCs using microfluidics. In this model, cells of interest grow in different compartments while allowing for axons to grow in between, thereby making them accessible to injury induction. Using this experimental system, the authors show that myelination occurs, myelin gets compacted and acquires nodal organization. The authors then show that such a system allows for compartment-specific lentivirus transduction and live imaging. Next, they perform physical and chemical axonal injury and show that at early time point pos- injury the presence of SCs protects from axonal degeneration regardless of the myelination status, and helps with clearing of damaged axons at later time points.<br /> Major comments:

      The novelty of the study is questionable.<br /> While the model is well described and appears to be useful for the proposed applications (live imaging, transduction, injury model), the arguments provided regarding its novelty are not fully convincing. The main argument from the authors of this paper is that there are no established protocols describing the use of mouse SCs with mouse DRG neurons in dissociated myelinating cocultures. However, this appears to be inaccurate, as the model described in Stevens et al., 1998 (cited in the paper) uses mouse DRG neurons dissected at E13.5 with mouse SCs dissected at P3 to study myelination. Also, in Päiväläinen et al., 2008, mouse DRGs and SCs are cultured from transgenic mice at different developmental ages, thereby arguing that coculture models have been previously successfully implemented. The main difference appears to be rather the compartmentalization of SCs and DRGs which appears to be a mouse adaptation of the rat model described by Vaquie et al,2019. Based on the above, it seems imperative for the authors to tone down the novelty aspect and provide a more thorough discussion on how the current novel differs from protocols in published study, highlighting advantages and caveats for each.

      Author response: We agree with the reviewer that we did not make the case clear enough for how our coculture model adds to what is currently described in the literature. We have now changed the title and removed the word novel. New title:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have now added the following paragraph to the introduction:

      Introduction.

      Indeed there has only ever been one laboratory detailing convincing myelin formation in dissociated mouse myelinating SC-DRG neuron cocultures, however this was never published as a step by step detailed protocol (Stevens et al, 1998; Stevens & Fields, 2000)__. In the last twenty years, there have been no published studies demonstrating myelination in fully dissociated mouse SC-mouse DRG cocultures. This has largely prevented the use of cells, particularly SCs, from transgenic mice in cocultures and thus restricted the ability to study SC-axon interactions in a system that can be readily manipulated and live imaged and results directly applied back to in vivo findings in the same species.

      Author response: Regarding the study by Päiväläinen et al, 2008_,_ they did not fully dissociate their DRGs (see Fig.1 which demonstrates a DRG explant) and thus it is a non-dissociated DRG explant model. While they demonstrated convincing myelination due to the use of Matrigel which we acknowledge them for, their model is not perfectly suited for the use of neurons and SCs from different transgenic animals as the use of a DRG explant, even with temporary use of an antimitotic, risks contamination by endogenous SCs and satellite glia over time, especially as their model is not compartmentalised. We discuss the caveats of their protocol and those using dissociated mouse explant cocultures in a revised paragraph in the introduction.

      Introduction.

      Protocols exist where endogenous mouse SCs are used to myelinate dissociated or non-dissociated DRG explant cultures. (Shen et al, 2014; Harty et al, 2019; Sundaram et al, 2021; Stettner et al, 2013; Numata-Uematasu et al, 2023)__. Furthermore, another protocol seeded exogenous SCs onto non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__. Other laboratories seed cultured rat SCs onto dissociated mouse DRG axons (Taveggia & Bolino, 2018)__. Use of dissociated or non-dissociated DRG explants cultures precludes many experimental uses, such as using SCs from different transgenic animals and separate transfection of SCs and neurons with viruses for live imaging or genetic manipulation, and easy use of microfluidic chambers to allow injury studies and separate drug treatments to neurons or SCs. The reason for this is that antimitotics cannot be used in dissociated or non-dissociated DRG explant cultures as this depletes SCs, and the culture quickly becomes contaminated with other non-neuronal cell types, such as satellite cells and fibroblasts migrating out of the DRG. Furthermore, use of exogenous SCs in a non-dissociated DRG explant culture risks, after a period of antimitotic exposure, which was developed by Päiväläinen et al., 2008 still risks potential contamination from endogenous SCs and satellite glia migrating out of the DRG explant over time. This occurs because antimitotic treatment is unlikely to fully penetrate the whole DRG without prior dissociation. Additionally, a compartmentalised culture system cannot be readily used with non-dissociated DRG explant cultures (Päiväläinen et al, 2008)__.

      Author response: We have also added further discussion on how our protocol differs from the Stevens 1998 and other protocols in the discussion.

      Discussion

      Our protocol differs somewhat from the one used by Stevens et al., 1998 to induce myelination in dissociated mouse SC-DRG cocultures__, as they used ascorbic acid and 10% horse serum and presumably plated their cultures on laminin, though they do not explicitly detail this (Stevens et al, 1998)__. In our preliminary experiments we were unable to visualise much myelination with use of laminin, ascorbic acid or indeed if b__NRG1 and high concentration forskolin was added to the medium for up to four weeks. However, if we plated cocultures on Matrigel® and continuously added it to the myelination medium then we saw comparable levels of myelination in our mouse cocultures to that of rat cocultures (Eldridge et al, 1987)__. This approach of using Matrigel® to enhance myelination has previously been successfully employed in cultures of human iPSC sensory neurons with rat SCs and in in non-dissociated mouse DRG explant cultures (Clark et al, 2017; Päiväläinen et al, 2008)__. Importantly, we used growth factor depleted Matrigel® as standard Matrigel® preparations contain substantial amounts of Transforming growth factor b (TGF b__) which is a known inhibitor of myelination (Einheber et al, 1995)__. Additionally, the majority of rat and mouse coculture protocols plate cells on glass whereas we found cultures were healthier and myelinated better when cultured on plastic Alcar® coverslips.

      Author response: We have added further discussion of comparable models in the literature in the discussion.

      Discussion.

      Furthermore, our protocol is complementary to the recently described 3D mouse myelinating SC-motor neuron coculture system using collagen hydrogels (Hyung et al, 2021; Park et al, 2021)__. It will be interesting in the future to up titrate the concentration of Matrigel®, which is similar to collagen hydrogels, in our cultures to see whether further increasing extracellular matrix viscosity and stiffness improves our myelination efficiency even further. While it is possible to study cell migration in microfluidic cell culture devices, transwell models offer significant advantages to study this cellular phenomenon (Negro et al, 2022)__. To date, there have been no published studies of successful myelination in human SC-neuron coculture systems. Despite this rat SCs have been shown to readily myelinate human-induced pluripotent stem cell (iPSC)-derived sensory neurons and an iPSC-derived peripheral nerve organoid system which does contain myelinating SCs has recently been described (Clark et al, 2017; Van Lent et al, 2022)__.

      Next, the authors emphasize the conflicting results of two articles, Babetto et al., 2020 and Vaquie et al., 2019, as the basis to use their newly developed model in the same species and testing two ages corresponding to distinct myelination states. However, both studies reach the same conclusion as the current study, that SCs have a protective role, although at two different developmental time points. As such, it is likely that multiple mechanisms may account for the protective effect of SC on axonal damage, and therefore the different studies do not seem conflicting but rather complementary. Yet, it is interesting that this manuscript shows that the myelination status of SCs does not impact their ability to slow down degeneration and yet it confirms that different timing after injury elicits different behaviors in SCs, as suggested by the studies of Babetto et al., 2020 and Vaquie et al., 2019. In other words, a more accurate description of the results of these two studies is needed and a better explanation of what the authors consider to be conflicting and why (there could be more differences than species and myelination, for instance, such as the method used for axotomy - laser vs cut with scalpel which tear and pull membranes).

      Author response: We would like to humbly correct the reviewer that the studies by Babetto et al., 2020 and Vaquie et al., 2019 do not reach the same conclusion that Schwann cells have a protective role. Instead, they describe axon protection (Babetto et al., 2020) and axon fragmentation (Vaquie et al., 2019). Our studies now visualise both phenomena in the same culture system. We have now made this point more explicit as well as highlighted the one conceptual advance our methods paper makes on the current literature, which is that myelination status does not influence the SC axo-protection, as the reviewer suggested.

      Discussion.

      These findings confirm the observations of both Babetto et al., 2020 and Vaquié et al., 2019 who used rat SCs in similar microfluidic culture systems (Vaquié et al, 2019; Babetto et al, 2020)__. We have shown that the axo-protective observation seen by Babetto et al., 2020 does not rely on myelination status, which was an outstanding question from that study (Babetto et al, 2020)__. Furthermore, in an advance from previous studies, we have visualised the axo-protective and axon clearance phenomena in the same culture and shown that they are temporally separated, with axon fragmentation and debris clearance by SCs occurring at much later timepoints after axotomy.

      Author response: We have now added more in-depth discussion of the similarities and differences between Babetto et al., 2020 and Vaquie et al., 2019 and our approach in the discussion.

      Discussion.

      Several different culture and experimental conditions preclude direct comparison of our study with both those of Vaquié et al., 2019 and Babetto et al., 2020. These include the use of rat SCs in both prior studies and that Babetto et al., 2020 mixed rat SC with mouse DRG axons; length of time in culture, time points quantified after injury and distance from injury and site of analysis. Babetto et al., 2020 performed axotomy on relatively short term cocultures (6 days in vitro) whereas Vaiquié et al., 2019 cultured for at least 4 weeks and in our case 6 weeks prior to axotomy. Vaiquié et al., 2019 removed nerve growth factor (NGF) prior to laser axotomy and quantified proximally (though also imaged distally) whereas both our study and Babetto et al., 2020 kept NGF in the medium, performed axotomy with a scalpel and quantified more distally and in our case extremely distal, where only individual neurites and no axon bundles were visible. Vaquié et al., 2019 had SCs on both sides of the barrier in the microfluidic chambers whereas we seeded SCs only in the axonal/bottom compartment. Additionally, our cocultures had both forskolin and b__NRG1 added to help induce myelination, whereas these factors are not required in rat myelinating cocultures. Finally, it is important to permeabilise myelinated cultures with acetone after fixation__, as we did, to visualise the entire axon through heavily myelinated segments otherwise axon integrity cannot be reliably assessed in a quantitative manner (Vaquié et al, 2019; Babetto et al, 2020)__.

      Author response: We would like to add that we showed Claire Jacob, senior author of the Vaquie et al., 2019 study, our manuscript prior to peer review and she offered helpful comments, which we incorporated into the manuscript, which is why she is acknowledged. We have also discussed our findings with Elisabetta Babetto as well.

      Overall, the title does not appear to be the most appropriate because the content rather proposes a detailed protocol and gives examples of applications, rather than focusing on the protective versus destructive role of SCs on axons. It also appears to be misleading, as "axo-destructive" appears to suggest a negative role of Schwann cells on axons, whereas SC are rather helpful in clearing degenerative axons, a step which facilitates regeneration.

      Author response: We have now changed the title and the focus of the manuscript in line with the reviewer’s comments. New title:

      A method for mouse myelinating Schwann cell-DRG neuron compartmentalised cocultures: myelination status does not influence the axo-protective effect of Schwann cells after injury.

      Author response: We have removed the phrase axo-destructive throughout the manuscript and instead referred to axon fragmentation and axon debris clearance roles of SCs in line with the reviewer’s suggestion. Please see track changes manuscript for all instances where this was modified.

      The number of biological replicates for each experiment is not always indicated, and if the "n=" represent cultures prepared independently/passaged or wells/cell. It is essential to be rigorous and clearly indicate the number of technical replicates and biological samples throughout the manuscript and provide a thorough description of them. One example is Fig. 4 E were only 10 cells from a single culture appeared to have been imaged. Is this accurate? This aspect is essential to evaluate reproducibility, especially in view of the technical and biological variability.

      Author response: We have now added quantification of myelin segments per mm2, percentage of SCs that myelinate and quantification of the interperiodic distance of the myelin formed. This is all included in a new version of TABLE 1. We have discussed this data in the results section as follows.

      Quantifying the number of PRX positive myelin segments we found that there were 325.33 ± 12.3 sheaths per mm2, comparable to what has been originally described in rat SC/DRG cocultures and two fold more extensive myelination than recently described compartmentalised rat cocultures models (n=3; Table 1; Eldridge et al, 1987; Vaquié et al, 2019)__. Furthermore, 25.47 ± 1% of Schwann cells were myelinating in our cultures (n=3; Table 1). To confirm that cocultured myelinated SCs formed compact myelin we performed electron microscopy (EM), which revealed compact myelin formation with multiple myelin wraps and formation of readily visible major dense (MDL) and intraperiod lines (IPL; Fig. 2D). Additionally, we measured the periodicity, i.e., the distance between two adjacent major dense lines, to make sure myelin was compacted. Interperiodic distance was 12.16 ± 0.28 nm, in line with previous reports (n=3; Table 1; Boutary et al, 2021; Fernando et al, 2016; García-Mateo et al, 2018; Giese et al, 1992; Perrot et al, 2007)__.

      Author response: We have discussed the number of cultures used for each quantification in the methods section. See below.

      Quantification of Myelination in cocultures

      To quantify the number of myelin segments per area, we counted the number of myelin segments for five areas per culture for three cultures and normalised this per mm2. To quantify the percentage of Schwann cells in myelinating cocultures that are actively myelinating, we quantified the number of myelin segments and the number of DAPI-positive nuclei for five areas per culture for three independently prepared cultures. To measure interperiodic distance, we measured at least 10 periods per myelinated fibre for at least three fibres per sample for three separate samples.

      Quantification of Degeneration

      Five images at a distance of between 1.2-1.4mm from the microgroove barrier (the most distal part of the culture that could be imaged) were quantified per culture, taken in comparable locations in each culture. A line was drawn across each image, and each axon crossing this line was either scored as degenerated or intact. Images were blinded prior to quantification. A minimum of three independently prepared cultures were assessed per timepoint for each condition.

      Author response: We have removed Fig.4E and instead quoted the data in the results section as follows:

      When we quantified this phenomenon, we found that 97.84 ± 1.462% (n=2) of SCs in our cocultures contained mCherry-labelled axonal fragments.

      Author response: We apologise as the n number for this experiment was 2 (not 10), with cells in 10 areas quantified throughout all imaging timepoints from each independently prepared culture. We have included the following description in the methods section:

      To quantify number of SCs with fragments, each cell was defined as a region of interest and checked for the presence of mCherry positive fragments at all timepoints. Two separate independently prepared cultures and cells in 10 areas per culture were analysed.

      Author response: Additionally for Fig. 4B we have now included individual data points from independently prepared cultures.

      N numbers are included in all figure legends and always refers to independently prepare cultures/biological replicates.

      We have added to the relevant figure legends (Fig.3 and 4 and Table 1) the phrase:

      n number refers to independently prepared cultures from separate litters of mice.

      Minor comments:

      • Does myelination reach axons in the microgrooves (it seems to from 2C, but up to where)? Where is axotomy performed and are axons myelinated where the cut was performed?

      Author response: Myelination occasionally reaches the beginning of the microgrooves. We didn’t visualise myelination in the DRG cell body compartment. We have added the following detail to the methods section:

      Traumatic axotomies were carried out by carefully removing the microfluidic chamber (SND150 and RND150, Xona Microfluidics__Ò__) from the Aclar__â coverslip using sterile forceps and severing axons with a surgical blade under a light microscope. Axotomies were carried out at the level of the microgroove barrier. To confirm all axons were severed, a second higher cut was performed and axons between the cut sites removed using the surgical blade.

      Author response: Given this, we cannot exclude that the odd proximal myelin segment is cut, but the vast majority of axons are not myelinated at the site of cut (lower cut).

      • Since the model allows for comparison of aligned vs myelinating SCs, and that both aligned and myelinating SCs seem to slow down degeneration, and that c-JUN is upregulated after in vivo injury, have the authors measured if c-JUN levels increase similarly in both myelinating vs aligned SCs?

      Author response: We thank the reviewer for this suggestion. We have now quantified the JUN upregulation after injury in both myelinating and aligned cocultures as well as adding images of JUN upregulation in aligned cocultures. See Fig. 3B-E).

      Additionally, we noted a strong upregulation of JUN protein in SCs 12 hours after axotomy (Fig. 3B and C). We also saw significant JUN upregulation 12 hours after axotomy in cocultures with aligned SCs (Fig. 3D and E).

      Author response: We have decided to focus the manuscript more on the comparison of myelinating versus non-myelinating cocultures, given that we have shown that the axo-protective effect of SCs is independent of myelination status, which is an advance on what is known in the literature. In addition to changing the title, as we have mentioned previously, we have added further characterisation of our aligned cocultures with p75NTR immuno and EM images (Fig.1D and E).

      We have

      • On clarity:<br /> - In the step-by-step protocol, wording needs to be improved.

      Author response: We have substantially edited the step-by-step protocol. Please see track changes document for all specific changes in wording.

      • Temperatures for centrifugations are missing.

      Author response: We have added temperatures for all centrifugation steps. Please see track changes document

      • The MOI described for lentivirus is 2-10 in the protocol but 200 in the legend of Figure 3F.

      Author response: The MOI for DRGs was 2-10 and SCs was 200 in Figure 3F. This is described similarly in the extended methods section. DRGs are transduced much more easily than SCs.

      We have added the following sentence to the results section to emphasise this point:

      Importantly dissociated mouse SCs required a much higher multiplicity of infection (MOI) than dissociated mouse DRGs (see extended methods section).

      • Certain citations in the references list are incomplete (i.e. Babetto et al.; Catenaccio et al.,).

      Author response: We have updated the reference list.

      Reviewer #3 (Significance):

      The advance for the field proposed by this paper is mostly technical, as it details a new model to be used by the field, of mouse SCs-mouse DRGs in dissociating myelinating cultures. The tested applications allowed the authors to also confirm a protective role for SCs on axonal damage, which was independent from myelination status.

      Being a method paper, it is essential that the authors provide clear statements on the number of biological replicates, and technical repeats, as well as a very thorough and accurate description of the methodology.

      The model described has similarities with existing models in the field such as Stevens et al., 1998 and Vaquié et al., 2019. To place it in context in a more helpful way, the authors should emphasize on the novelty brought by their protocol compared to existing models. The authors compare their findings to results from Vaquié et al., 2019 and Babetto et al., 2020 that they describe as conflicting, when it seems they rather address different mechanisms of SCs in protection and repair, occurring at different time points.

      Audience might be interested in the detailed step by step protocol to use this in vitro model for the applications described, and investigate further why SCs myelination status does not influence their ability to protect from neurodegeneration early on or how to make use of this for neuroprotection studies.

      Author response: We have now rephrased the description of Vaquié et al., 2019 and Babetto et al., 2020 studies in line with the reviewer’s suggestions. We have now added further discussion of our model in the context of all other models in the field as we have outlined in detail in above responses.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      The authors present a detailed protocol for co-cultures of mouse DRGs with mouse SCs using microfluidics. In this model, cells of interest grow in different compartments while allowing for axons to grow in between, thereby making them accessible to injury induction. Using this experimental system, the authors show that myelination occurs, myelin gets compacted and acquires nodal organization. The authors then show that such a system allows for compartment-specific lentivirus transduction and live imaging. Next, they perform physical and chemical axonal injury and show that at early time point pos- injury the presence of SCs protects from axonal degeneration regardless of the myelination status, and helps with clearing of damaged axons at later time points.

      Major comments:

      The novelty of the study is questionable.<br /> While the model is well described and appears to be useful for the proposed applications (live imaging, transduction, injury model), the arguments provided regarding its novelty are not fully convincing. The main argument from the authors of this paper is that there are no established protocols describing the use of mouse SCs with mouse DRG neurons in dissociated myelinating cocultures. However, this appears to be inaccurate, as the model described in Stevens et al., 1998 (cited in the paper) uses mouse DRG neurons dissected at E13.5 with mouse SCs dissected at P3 to study myelination. Also, in Päiväläinen et al., 2008, mouse DRGs and SCs are cultured from transgenic mice at different developmental ages, thereby arguing that coculture models have been previously successfully implemented. The main difference appears to be rather the compartmentalization of SCs and DRGs which appears to be a mouse adaptation of the rat model described by Vaquie et al,2019. Based on the above, it seems imperative for the authors to tone down the novelty aspect and provide a more thorough discussion on how the current novel differs from protocols in published study, highlighting advantages and caveats for each.

      Next, the authors emphasize the conflicting results of two articles, Babetto et al., 2020 and Vaquie et al., 2019, as the basis to use their newly developed model in the same species and testing two ages corresponding to distinct myelination states. However, both studies reach the same conclusion as the current study, that SCs have a protective role, although at two different developmental time points. As such, it is likely that multiple mechanisms may account for the protective effect of SC on axonal damage, and therefore the different studies do not seem conflicting but rather complementary. Yet, it is interesting that this manuscript shows that the myelination status of SCs does not impact their ability to slow down degeneration and yet it confirms that different timing after injury elicits different behaviors in SCs, as suggested by the studies of Babetto et al., 2020 and Vaquie et al., 2019. In other words, a more accurate description of the results of these two studies is needed and a better explanation of what the authors consider to be conflicting and why (there could be more differences than species and myelination, for instance, such as the method used for axotomy - laser vs cut with scalpel which tear and pull membranes).

      Overall, the title does not appear to be the most appropriate because the content rather proposes a detailed protocol and gives examples of applications, rather than focusing on the protective versus destructive role of SCs on axons. It also appears to be misleading, as "axo-destructive" appears to suggest a negative role of Schwann cells on axons, whereas SC are rather helpful in clearing degenerative axons, a step which facilitates regeneration.

      The number of biological replicates for each experiment is not always indicated, and if the "n=" represent cultures prepared independently/passaged or wells/cell. It is essential to be rigorous and clearly indicate the number of technical replicates and biological samples throughout the manuscript and provide a thorough description of them. One example is Fig. 4 E were only 10 cells from a single culture appeared to have been imaged. Is this accurate? This aspect is essential to evaluate reproducibility, especially in view of the technical and biological variability.

      Minor comments:

      • Does myelination reach axons in the microgrooves (it seems to from 2C, but up to where)? Where is axotomy performed and are axons myelinated where the cut was performed?
      • Since the model allows for comparison of aligned vs myelinating SCs, and that both aligned and myelinating SCs seem to slow down degeneration, and that c-JUN is upregulated after in vivo injury, have the authors measured if c-JUN levels increase similarly in both myelinating vs aligned SCs?
      • On clarity:
      • In the step-by-step protocol, wording needs to be improved.
      • Temperatures for centrifugations are missing.
      • The MOI described for lentivirus is 2-10 in the protocol but 200 in the legend of Figure 3F.
      • Certain citations in the references list are incomplete (i.e. Babetto et al.; Catenaccio et al.,).

      Significance

      The advance for the field proposed by this paper is mostly technical, as it details a new model to be used by the field, of mouse SCs-mouse DRGs in dissociating myelinating cultures. The tested applications allowed the authors to also confirm a protective role for SCs on axonal damage, which was independent from myelination status.

      Being a method paper, it is essential that the authors provide clear statements on the number of biological replicates, and technical repeats, as well as a very thorough and accurate description of the methodology.

      The model described has similarities with existing models in the field such as Stevens et al., 1998 and Vaquié et al., 2019. To place it in context in a more helpful way, the authors should emphasize on the novelty brought by their protocol compared to existing models. The authors compare their findings to results from Vaquié et al., 2019 and Babetto et al., 2020 that they describe as conflicting, when it seems they rather address different mechanisms of SCs in protection and repair, occurring at different time points.

      Audience might be interested in the detailed step by step protocol to use this in vitro model for the applications described, and investigate further why SCs myelination status does not influence their ability to protect from neurodegeneration early on or how to make use of this for neuroprotection studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript entitled "Distinct axo-protective and axo-destructive roles for Schwann cells after injury in a novel compartmentalised mouse myelinating coculture system", Arthur-Farraj and colleagues detail a method of dissociated coculture of mouse-DRG neurons and Schwann cells in microfluidic chambers. In this system, neurons and Schwann cells harvested from the same or from different animals are grown in different compartments that are connected by microgrooves, thereby allowing for spatial and diffusive separation. Neurons are shown to extent their axons across the microgroove barrier to the glial compartment where Schwann cells align with the axons and myelination can be induced. Detailed analysis of myelination and axon injury/degradation are presented as use cases, including the capability to genetically and pharmacologically manipulate neurons and Schwann cells independently, which also enabled fluorescent life cell imaging. The authors then examine the effect of immature/premyelinating and myelinating Schwann cells on the rate of axon degeneration. Upon axotomy Schwann cells significantly delayed degeneration, with no difference between non-myelinating and myelinating Schwann cells. Finally, live imaging during axon degeneration with fluorescent proteins separately expressed in neurons and Schwann cells demonstrated that Schwann cells ingest axonal fragments.

      Major comment:

      In establishing a much needed in-vitro system for PNS myelination and injury research the paper represents a valuable contribution to the PNS community. However, I find the presentation of aspects concerning a protective/destructive role of Schwann cells somewhat inconclusive. That these roles exist has been known, as the authors discuss. Then what does this study contribute concerning the open question that was raised by the discrepancies between Babetto et al., 2020 and Vaquié et al., 2019, i.e. how Schwann cells contribute to axon survival/regeneration after injury? Essentially, the only significant conclusion in this regard is that myelinating and non-myelinating mouse Schwann cells do not differ in their capability to protect axons from degeneration. The manuscript, including the title, would benefit from focusing more on this aspect. In particular, the discussion of the factors that lead to the still remaining apparent discrepancies between Babetto et al., 2020 and Vaquié et al., 2019 and this study should be revised. The authors state that "The study by Vaiquié et al., 2019 quantified axon fragmentation proximally in the microgrooves at timepoints starting from 12 hours after axotomy." (Discussion). While this observation is accurate, Jacob and colleagues also show accelerated, obviously distal axon degeneration in the presence of Schwann cells (Figure 3C in Vaquié et al., 2019). It is therefore unlikely that the discrepancies stem from analysis of more proximal vs. more distal axons, or the timepoints of analyses. In my opinion, a further study (using the coculture system presented in this manuscript) that compares the role of Schwann cells from rats and mice, and that includes analysis of more proximal and distal axon degeneration as well analysis of axon regeneration is needed. In a rework of the manuscript, the authors may therefore elaborate more on the shortcomings of the present study, or alternatively soften the aims of the study in the first place.

      Minor comments:

      • Figure 2D: From the electron micrograph there is no doubt that compact myelin is formed, however to me it seems the compaction is not complete. A rough estimation with the aid of the provided scale bar resulted in an interperiodic distance of about 17 nm, which contrasts with the remarkably well reproduced values reported in multiple reports using conventional specimen preparation (like in this study), of which I am citing just a few: about 13 nm in rat ex vivo nerve (Peterson and Pease, J. Ultrastruct Res 1972; Fledrich et al., Nat Commun 2018), 12 nm (Giese et al., Cell 1992), 12.2 nm (Perot et al., J Neurosci 2007), about 12 (Fernando et al., Nat Commun 2016), about 13 nm (García-Mateo et al., Glia 2017) or about 13 nm in mouse ex vivo (Boutary et al., Commun Biol 2021), which was also reproduced with about 13 nm in rat in vitro (Taveggia and Bolino, Methods Mol Biol 2018). The authors should acknowledge this deviation and might discuss possible reasons.
      • Figure 4A,B: The result of a slowed axon degeneration in coculture relies on the accurate assessment of continuity of the NFL staining. While the authors report that acetone permeabilization was necessary to afford complete penetration of the used antibodies in myelinating cultures, I cannot see why the authors have not used the same staining protocol for all cultures, as it is detailed in the method section. While I consider it unlikely that the staining conditions have led to an apparent delay of degeneration in coculture, experiments should generally be performed under identical conditions, unless there are good reasons not to do so. If this is not the case, it will be reassuring to see the same effect when identical staining conditions are employed. On the same note, do the compared cultures have the same age, i.e. have the neuron monocultures been in vitro for the same time as the cocultures?
      • In several instances of the manuscript, the term "transfection" is used to refer to lentiviral gene transfer. I advise to use the more appropriate term "transduction" instead
      • I could not seem to find a meaningful reference to the microfluidic chambers that were used in the study. The protocol should contain details on the device and source of supply in order to enable potential readers to execute the protocol

      Significance

      The paper presents a convincing establishment of a dissociated coculture derived exclusively from mouse that leads to robust myelination. As the manuscript correctly states, Schwann cell culture and especially coculture with neurons has been experienced difficult in the field, and by providing a detailed protocol as well as demonstrating how the coculture system can be used to address important questions of PNS myelination and repair, the paper fills an important gap. However, the experiments directed to the role of Schwann cells in axon degeneration do not clarify much, which should be better addressed in the discussion and also by modifying the title accordingly.

      The paper will be of high value for basic researchers that are interested in performing studies addressing cellular and molecular mechanisms of myelination and repair in the PNS. Importantly, the paper can pave the way to usage of transgenic or knockout mouse models in coculture. Thereby it might spark interest also in those researchers that use transgenic and knockout mouse models and who have so far refrained from using coculture models.

      Field of expertise of the reviewer: Cellular and molecular mechanisms of myelination and growth signaling in the PNS; in-depth experience with DRG coculture models from rats and mice

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      In this original article from Mutscher et al., the authors developed a compartmentalized dissociated mouse myelinating SC-DRG coculture system to investigate the distinct roles of Schwann cells in axon protection and degeneration after injury. The innovation of this approach relies on (i) the use of mouse SCs and mouse DRGS neurons instead of rat cells; (ii) the use of microfluidic chambers, seeded by axons and SCs in different compartment; (iii) the possibility to perform a traumatic injury in vitro. While this novel approach offers new ways to study peripheral nerve regeneration and SC-axon interaction, and technical the study is robust, the paper is currently limited by the exploration of their model.

      Major points:

      It is unclear is this approach will ever lead to the identification of key mechanism or key candidates. This is a major miss in the current manuscript form. In short: the authors should demonstrate that their in vitro system can lead to significant leap in our understanding of peripheral nerve regeneration by identifying novel targets/pathways or mechanisms.

      The use of embryonic DRG neurons or SC isolated from P2 animals are arguably physiologically not the same cells that are affected by traumatic nerve injury, which happen most often than not in adult. This is a problem in the long-term reliance on this approach to study axotomy peripheral nerve regeneration.

      Moderate points:

      "there are no established protocols in the field describing the use of mouse SCs with mouse DRG neurons in dissociated myelinating cocultures". The use of mouse cells is laudable, but it is not necessarily a technical innovation, or at least the current manuscript does not explain why their approach particularly suitable to mouse Schwann cells.

      The figures in the paper are largely descriptive. They are very little quantitative measurement. Thus, the readers will have a hard to determine, if they replicate the proposed approach, whether their efficient is on par with the current authors.

      Minor points:

      Fig.4B and E should show individual data points.

      Significance

      In addition, to the demonstration of feasibility of this in vitro approach, the main finding by the authors is that SCs have a role in neuronal protection and support is key for peripheral nerve regeneration. Thus while in vitro approach does not add key information that do not already exists in the field, it somewhat confirms that the effect is SC autonomous. Overall the approach is interesting and has potential, but the study currently lack a demonstration of its usefulness to the community.

      It would have been interesting to have the authors discuss the advantages of their approach in comparison to other innovative approaches to study SC-axon interactions that have been developed in the last decade (i.e., 3D environment, microfluidic approach, transwell systems). There is also a lack of citations about similar studies in the field.

      Because of the lack of key novel mechanisms, and lack of discussion on what this approach is superior to others in vitro approach limits the impact of the study and the excitement of the reader, even from the SC-axon community.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      In this manuscript the author is presenting a deep-learning model used to predict the development stage of zebrafish embryo. A robust method that can accurately classify a zebrafish into different development stages is highly relevant for many researchers working with zebrafish and hence the importance in developing methods like this is high.

      The manuscript is overall ok. However, more data is needed to convince the reader that the method is robust enough to work with other samples in other labs. This would greatly improve the impact of the publication.

      We agree with the reviewer and have included in our revised manuscripts additional test data that was acquired at a different laboratory to the training data (Figures 5 - 7).

      Page 6.<br /> - How is the data acquired?

      Images used to do initial model training are the same as those used in a previous study - the details of image acquisition are contained in the relevant publication (doi: 10.12688/wellcomeopenres.18313.1). However, we have now added “Zebrafish Husbandry” and “Live Imaging” for newly-acquired images. We have added a table (Table 1) listing details of all image data used in the study.

      Page 8.<br /> "This indicates that whileKimmelNet can be used successfully with noisier test data than that on which it was trained,there is an upper limit to how noisy the data can be."<br /> - This is an obvious statement there will always be an upper limit on noise.

      We agree with the reviewer that this statement is not terribly informative. This section (“KimmelNet’s prediction accuracy is not significantly impacted by moderate levels of additive noise”) has been removed from the revised manuscript in favour of incorporating a section detailing testing of the model on newly-acquired images (“KimmelNet can generalise to previously unseen data”).

      Page 9.<br /> - Are only wildtype embryos used? How would this work on different mutants. To evaluate the robustness of the method this it would be valuable to test on some mutant line with known developmental difference from the wild type.

      We agree with the reviewer that testing on a mutant line would lend more weight to our findings. For example, the p53-/- zebrafish has a reported, published developmental delay, but we did not have access to that line. However, the developmental delay reported for the p53-/- mutant is virtually indistinguishable from that effected by a temperature change. We therefore focussed our efforts on developmental delay affected by altering incubation temperature only.

      Image data.<br /> - I would strongly suggest that the author should include a description of the data in the manuscript. A description of how the data is acquired, microscope, different batches, type of embryos used.

      The image data used in the first draft of the manuscript is the same as that used in a previous publication (Jones et al. 2022), which contains all the relevant details the reviewer seeks. However, we have now added the relevant information for the newly-acquired image data.

      "Random160translation in the y-direction was avoided due to the aspect ratio of the images (width>161height) - any artifacts introduced by translation in the x-direction would be removed by the162centre crop layer, but this would not be the case for translation in the y-direction."<br /> - Could this be solved by using some border method reflection, repetition or fixed value?

      The reviewer is correct that some form of image reflection or repetition could be utilised. However, given the nature of our images, if an embryo is located close to the image boundary, reflection/repetition can result in some odd artefacts, so we minimised the use of such fill methods (also used by the random zoom augmentation layer). We could instead use an arbitrary fixed value, as the reviewer suggested, but finding a value suitable for all images is difficult.

      Page 10.<br /> Addition of Noise to Image Data<br /> - This should be added in the training phase. This would probably improve the robustness of the network and also improve the results on the test data.

      We agree with the reviewer and have now added a random Gaussian noise layer for data augmentation purposes during model training (see Figure 1).

      • Supplementary 3 images with high noise. It is worrying that the network is not able to handle the noise in this figure. Looks like the features that is used to distinguish the developmental stage of the embryo is still clearly seen with this high noise level? Retrain the model with noise as an augmentation to improve this.

      As the reviewer suggested, addition of random noise is now incorporated into model training. The new version of the manuscript does not include the same supplemental figures, but instead includes additional testing on newly-acquired data instead.

      Reviewer #1 (Significance):

      The development of methods like this is highly relevant in the zebrafish community. Staging and evaluating the developmental stage for zebrafish is common and is of interest to the broad community. A lot of this work today is done manually, limiting the throughput, and adding human bias.

      The limit of this study is the dataset used for training and evaluation. Firstly, it is not clear about the structure of the data and how it is acquired, different types of fish or imaging setup etc. For a method to be useful to the community it needs to be robust enough to handle different types of fish (transgenic lines). The manuscript would be greatly improved by adding this to the training and evaluation.

      We have now added additional datasets for the purposes of evaluating the model.

      My expertise is image analysis and machine learning for quantification of biological samples, with focus on zebrafish screening.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary<br /> The paper "Automated staging of zebrafish embryos with KimmelNet" by Barry et al., presents a method to automatically stage developmental timepoints of zebrafish embryos based on convolutional neural networks (CNN). The authors show that a CNN trained on ~20k images can determine time post fertilization on test-image sets with an accuracy on the range of a few hours. This technique undoubtedly has the potential to become very useful for any zebrafish researchers interested in developmental timing as it eases analysis and removes potential subjective bias.

      Major comments<br /> In its current form the paper lacks sufficient graph annotations and method descriptions. This makes it hard in places to judge the validity of the claims. I do believe that the presented method can be useful and is likely valid but to be convincing, the authors need to spend more time expanding the methods, justifying their choices of analysis and clarifying figure annotations.

      We believe that we have addressed the reviewer’s concerns in this revised manuscript, as detailed in response to the specific points below.

      Specific points:<br /> 1) The annotation of the training data is not described and specifically it is unclear how valid the staging of the training data itself is. The authors state in the introduction "the hours post fertilization (hpf) [...] provides only and approximation of the actual developmental stage". It is therefore critical to know how this was accounted for in the annotation of the training data. Since the quality of the training data will ultimately limit the best-case quality of Kimmel Net. The authors need to go into some detail here even though the training data appears to be from another published dataset.

      The reviewer raises a valid point – two individual zebrafish embryos that are x hours post-fertilisation are not necessarily at the same developmental stage. However, we believe it is reasonable to assume that two populations of embryos x hours post-fertilisation are, on average, at the same developmental stage and it is this assumption that forms the basis for our approach. Given the inherent variability the reviewer refers to, we are not suggesting that our model would be particularly accurate for staging individual embryos. However, we are very confident (and we believe the data in the manuscript supports this) that given a population of embryos, our model will provide an accurate rate of development. We have added a paragraph (lines 131-141) to address this point.

      2) Why were "test predictions fit to a straight line through the origin". On the one hand this makes sense (since the slope would indicate the correspondence) but it should be clarified why an intercept was omitted in the fit. After all it is unclear if Kimmel net correctly identifies 0Hpf embryos.

      The reviewer makes a valid point – we do not know what predictions KimmelNet would produce for images of embryos closer to 0 hpf. However, we felt an equation of the form y=mx was a reasonable choice for two reasons. First of all, it matches the form of the Kimmel equation, which, despite its flaws, we are using as a benchmark of sorts – the absence of a y intercept makes comparisons with the Kimmel equation straightforward. Secondly, a “perfect” model would produce a straight line fit with y=x, so the lack of a y intercept seemed a reasonable constraint to impose. We have added some brief text (lines 103-105) to clarify this choice.

      3) The methods do not list how the mean of the absolute error was calculated from 3B/C. I think this should be the mean of the absolute error (not the mean of the error) but in that case the numbers listed in the text appear rather small given the histograms in 3 B/C. A clear statement in the methods would clarify this issue.

      We have now added a “Statistical Analysis” section under Materials & Methods to detail exactly what was used to calculate the values given for error analysis. We have calculated the mean of the error, not the mean of the absolute error, as we wish to illustrate that the mean is close to zero. We have used the standard deviation of the errors to illustrate that there is a significant spread in the error values, as depicted in Figure 3C and D.

      Minor comments<br /> 1) The Y-axis in Figure 2B is simply labeled "Loss" - what is the unit of this loss? HPF? Or HPF^2? This is important for judging the quality of the fit

      We thank the reviewer for drawing our attention to this omission. The loss is hpf2 (mean squared error) and we have updated the plot to reflect this.

      2) Figure 3 B. I would suggest changing the labels of the confidence intervals in the legend. "Inner and outer" is already clear from the figure itself, so labels that state that these are derived from n=100 vs. n=20 test image sized samples would be more useful to the reader

      We thank the reviewer for this suggestion – we have updated the figure legend accordingly.

      Referees cross-commenting

      I concur with comments issued by the other reviewers. I think it will be especially important to address the comments related to testing the method on mutants (Reviewer #1) and training the model in the presence of noise to increase its robustness (Reviewers #1 and #3) as well as addressing the overall annotation/generation of the training data (Reviewers #1 and #2).

      We believe we have now addressed all of these concerns. The model has been retrained with additional data augmentation incorporating random noise, tested on newly-acquired data and we have added tables summarising the details of all image data used in this study.

      I think these points will be critical to make the paper useful by increasing transparency and ensuring reproducibility in other labs with different imaging conditions, strains, mutants, etc.

      Reviewer #2 (Significance):

      Developmental delay is a common occurrence that can be caused by genetic and environmental background effects. It is therefore highly desirable to properly quantify this variable. The work presented here makes an important step in this direction, by allowing to quantify developmental timepoints independent of subjective staging. This speeds up analysis, increases reproducibility and enhances rigor. However, as my comments above indicate, the significance also depends on the ability of potential users to judge the quality of the work. Once those issues have been addressed, I think the work will be of broad interest to the developmental biology community, first and foremost labs utilizing the zebrafish model. However, as the authors state, the presented model architecture could be trained with the data from other species as well.

      Expertise: Zebrafish, quantitative analysis, behavior, neuroscience

      We thank the reviewer for their positive feedback.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:

      Properly staging embryos of zebrafish embryos is important, yet provides challenging since it can depend on many factors, such as temperature, water quality, fish population density, etc. Here, the authors provide a deep-learning-based model called KimmelNet that allows the prediction of the age of zebrafish embryos, using 2D brightfield images. The technique is robust to weak measurement noise and can also be used to identify developmental delays from a very small number of experimental data.

      The code is accessible to the reader, open-source and should be useable by experimentalists without huge effort.

      Major comments:

      I suggest retraining the model and application of the model to additional data for the following reasons:<br /> • Why did the authors not train for (high) measurement noise and heterogeneous background illumination? Would that not make the model more robust? In principle, creating training should not be considerably harder than before, since the manipulation of the images has been already shown in the manuscript and the authors just need to run it again on the HPC cluster. If there are no technical or administrative constraints (access to the cluster, computational effort, high costs, etc.), the authors should retrain their model.

      We thank the reviewer for this suggestion. As detailed in Figure 1, with a view to making the model more robust, we have now added several more layers of data augmentation, including the addition of random noise, and retrained our model.

      • For Fig. S2 and S3 it is not clear if there is such a strong deviation from the Kimmel equation due to measurement noise or due to the background illumination. The saliency maps appear as if they are mainly affected by the illumination, and maybe less by the noise. Would it be possible to apply the model to a case without artificial noise, but with heterogeneous background illumination to identify what has a bigger impact?

      We thank the reviewer for this suggestion. We have now replaced the “artificial” examples used in the previous version of the manuscript with newly-acquired data (Figure 5), which exhibits different characteristics to that used for training.

      Additionally, the authors need to clarify what exactly they are comparing in this manuscript and rework their interpretation of their findings:<br /> • When comparing the predictions between KimmelNet and the Kimmel equation, the authors use an equation of the form y=mx. Could the authors please elaborate on why they introduce the constraint of y(0)=0? It might be naturally given by the so-called Kimmel equation, but by looking at Fig 3a, it seems like an equation of the form y=mx+a would be more appropriate and it appears like KimmelNet introduces an offset of around a=2h for 25 Celsius. The authors need to discuss this.

      The main rationale for using an equation of the form y=mx is to be consistent with the Kimmel equation (see lines 103-105). The reviewer is correct that an equation of the form y=mx+c may well produce a better fit to the data, but omitting a y intercept makes comparison with the Kimmel Equation trivial.

      • In lines 5-8 the authors say that developmental stage progression does not only depend on temperature, but also on population density, water quality etc. and they explain that usually not only hpf, but also staging guides based on morphological criteria are used! If that is true, how good is their training data set that only uses hpf and not the other important guides? How did the authors test that these factors have no impact on their training data?

      We have now added a paragraph (lines 131-141) to address this point.

      Since this tool has the potential to have a big impact on zebrafish research, it would be nice to provide some examples of how exactly this could be achieved:<br /> • Could the authors discuss how exactly their tool is useful to experimentalists? Is it the idea that if an experimentalist wants to investigate an embryo of a particular stage, they apply KimmelNet to images of embryos to identify the stage of the embryo and only then undertake their planned experiment? Is that a realistic undertaking?

      As is evidenced by the errors presented in Figure 3C & D, testing KimmelNet on individual images of embryos may well result in a large error in the predicted hpf. As such, it is not appropriate to use the tool in such a manner. However, to modify the example provided by the reviewer, should an experimentalist have a population of embryos they wished to stage, then yes, KimmelNet would certainly be an appropriate tool for doing so.

      • Would it be possible to provide a tutorial (or even video tutorial if such skills are available in the group of authors) that provides real examples of how to apply the technique? This would make it easier for people without advanced Python/Deep-Learning skills to use the tool, hence improving the impact of KimmelNet.

      A lack of user-friendly interfaces for applying deep learning methods in biology is well-documented – basic knowledge of python and tools like jupyter notebooks are often necessary (https://doi.org/10.1038/s41592-023-01900-4). However, we have endeavoured to make the running of KimmelNet as easy as possible for new users. A jupyter notebook instance can be run on Binder with absolutely no set-up required. To run KimmelNet on their own data, biologists just need to download the Git repo and replace the test images with their own data. We have updated the landing page on the GitHub repo to provide more specific step-by-step instructions for each of these tasks. We will also endeavour to upload our model to the BioImage Model Zoo (https://bioimage.io/#/) to further increase accessibility.

      I am very critical towards equation 1. Please note that I don't think this has any impact on the quality of the technique provided in this manuscript and the significant flaws can already be found in Kimmel 1995 (which is not under review here). That is why I suggest rewriting of this manuscript to not support an over-interpretation of this equation.<br /> • I do not think that the Kimmel equation is an established term. At least a Google Scholar Search for "Kimmel equation" only gives one result: the preprint of this manuscript.<br /> • The equation has no mathematical meaning regarding its units (subtracting temperature and a unitless value). I also very rarely see equations with Degrees Celsius and not Kelvin.<br /> • Additionally, the equation provides a linear relationship between the development time and temperature h(T) and in Kimmel et al, it is shown that this is only true for 25-33 Celsius. Such a linearisation is not very surprising for a small temperature range, but I am not sure how true it is for higher temperature differences. Hence, I feel that it is very bold to give a specific name to such an equation, giving it an importance that it does not deserve.

      We appreciate the reviewer’s concerns and have removed explicit references to “The Kimmel Equation”, without substantively changing the content of the manuscript.

      Minor comments:

      • For the measurement noise cases it would be nice to have some example images of fish with the specific noise levels in Fig S1 and Fig S2.

      We have now removed the “synthetic” additive noise test data, previously depicted in Figures S1-3, in favour of newly-acquired images in Figures 5-7.

      • Could the authors hypothesize why they predict a slower dynamic for 25 Celsius than predicted by the Kimmel equation?

      Referring to Figure 2 in Kimmel et al (1995), it is apparent that the straight lines are by no means perfect fits to the datapoints. In Fig 2A in particular, some datapoints for the 25C data fall well below the line fit. While the published equation suggests a rate of development 80.5% of the rate at 28.5C, according to Fig 2A, an alternative line fit could give a developmental rate as low as 70-75%, which would be in agreement with our data.

      Reviewer #3 (Significance):

      Strengths of the study:

      An easy-to-use method to automatically stage zebrafish embryos and identify differences in the developmental stage is very important for the zebrafish community and the technique in this manuscript definitely novel. The tool is can be used without large effort and the authors suggest that it can also find applications beyond zebrafish embryos. Hence, it is not only interesting to the zebrafish community, but to a broader developmental biology audience.

      Weakness of the study:<br /> The main weakness of the manuscript is in the training data used for the deep-learning model and the apparent large impact of heterogeneous background illumination. If that is not solved, it is unclear if this technique will find an application in the zebrafish community.

      We believe this weakness has now been addressed by incorporating additional data augmentation measures in the training process and testing the model on newly-acquired data.

      Field of expertise of the reviewer: Image Analysis, Mathematical Modelling, Biological Physics. While I have limited experience in deep learning, I cannot evaluate the specific details of the network architecture. I also have no experience in zebrafish research.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Properly staging embryos of zebrafish embryos is important, yet provides challenging since it can depend on many factors, such as temperature, water quality, fish population density, etc. Here, the authors provide a deep-learning-based model called KimmelNet that allows the prediction of the age of zebrafish embryos, using 2D brightfield images. The technique is robust to weak measurement noise and can also be used to identify developmental delays from a very small number of experimental data.

      The code is accessible to the reader, open-source and should be useable by experimentalists without huge effort.

      Major comments:

      I suggest retraining the model and application of the model to additional data for the following reasons:<br /> - Why did the authors not train for (high) measurement noise and heterogeneous background illumination? Would that not make the model more robust? In principle, creating training should not be considerably harder than before, since the manipulation of the images has been already shown in the manuscript and the authors just need to run it again on the HPC cluster. If there are no technical or administrative constraints (access to the cluster, computational effort, high costs, etc.), the authors should retrain their model.<br /> - For Fig. S2 and S3 it is not clear if there is such a strong deviation from the Kimmel equation due to measurement noise or due to the background illumination. The saliency maps appear as if they are mainly affected by the illumination, and maybe less by the noise. Would it be possible to apply the model to a case without artificial noise, but with heterogeneous background illumination to identify what has a bigger impact?

      Additionally, the authors need to clarify what exactly they are comparing in this manuscript and rework their interpretation of their findings:<br /> - When comparing the predictions between KimmelNet and the Kimmel equation, the authors use an equation of the form y=mx. Could the authors please elaborate on why they introduce the constraint of y(0)=0? It might be naturally given by the so-called Kimmel equation, but by looking at Fig 3a, it seems like an equation of the form y=mx+a would be more appropriate and it appears like KimmelNet introduces an offset of around a=2h for 25 Celsius. The authors need to discuss this.<br /> - In lines 5-8 the authors say that developmental stage progression does not only depend on temperature, but also on population density, water quality etc. and they explain that usually not only hpf, but also staging guides based on morphological criteria are used! If that is true, how good is their training data set that only uses hpf and not the other important guides? How did the authors test that these factors have no impact on their training data?

      Since this tool has the potential to have a big impact on zebrafish research, it would be nice to provide some examples of how exactly this could be achieved:<br /> - Could the authors discuss how exactly their tool is useful to experimentalists? Is it the idea that if an experimentalist wants to investigate an embryo of a particular stage, they apply KimmelNet to images of embryos to identify the stage of the embryo and only then undertake their planned experiment? Is that a realistic undertaking?<br /> - Would it be possible to provide a tutorial (or even video tutorial if such skills are available in the group of authors) that provides real examples of how to apply the technique? This would make it easier for people without advanced Python/Deep-Learning skills to use the tool, hence improving the impact of KimmelNet.

      I am very critical towards equation 1. Please note that I don't think this has any impact on the quality of the technique provided in this manuscript and the significant flaws can already be found in Kimmel 1995 (which is not under review here). That is why I suggest rewriting of this manuscript to not support an over-interpretation of this equation.<br /> - I do not think that the Kimmel equation is an established term. At least a Google Scholar Search for "Kimmel equation" only gives one result: the preprint of this manuscript.<br /> - The equation has no mathematical meaning regarding its units (subtracting temperature and a unitless value). I also very rarely see equations with Degrees Celsius and not Kelvin.<br /> - Additionally, the equation provides a linear relationship between the development time and temperature h(T) and in Kimmel et al, it is shown that this is only true for 25-33 Celsius. Such a linearisation is not very surprising for a small temperature range, but I am not sure how true it is for higher temperature differences. Hence, I feel that it is very bold to give a specific name to such an equation, giving it an importance that it does not deserve.

      Minor comments:

      • For the measurement noise cases it would be nice to have some example images of fish with the specific noise levels in Fig S1 and Fig S2.
      • Could the authors hypothesize why they predict a slower dynamic for 25 Celsius than predicted by the Kimmel equation?

      Significance

      Strengths of the study:

      An easy-to-use method to automatically stage zebrafish embryos and identify differences in the developmental stage is very important for the zebrafish community and the technique in this manuscript definitely novel. The tool is can be used without large effort and the authors suggest that it can also find applications beyond zebrafish embryos. Hence, it is not only interesting to the zebrafish community, but to a broader developmental biology audience.

      Weakness of the study:

      The main weakness of the manuscript is in the training data used for the deep-learning model and the apparent large impact of heterogeneous background illumination. If that is not solved, it is unclear if this technique will find an application in the zebrafish community.

      Field of expertise of the reviewer:

      Image Analysis, Mathematical Modelling, Biological Physics. While I have limited experience in deep learning, I cannot evaluate the specific details of the network architecture. I also have no experience in zebrafish research.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      The paper "Automated staging of zebrafish embryos with KimmelNet" by Barry et al., presents a method to automatically stage developmental timepoints of zebrafish embryos based on convolutional neural networks (CNN). The authors show that a CNN trained on ~20k images can determine time post fertilization on test-image sets with an accuracy on the range of a few hours. This technique undoubtedly has the potential to become very useful for any zebrafish researchers interested in developmental timing as it eases analysis and removes potential subjective bias.

      Major comments

      In its current form the paper lacks sufficient graph annotations and method descriptions. This makes it hard in places to judge the validity of the claims. I do believe that the presented method can be useful and is likely valid but to be convincing, the authors need to spend more time expanding the methods, justifying their choices of analysis and clarifying figure annotations.

      Specific points:

      1. The annotation of the training data is not described and specifically it is unclear how valid the staging of the training data itself is. The authors state in the introduction "the hours post fertilization (hpf) [...] provides only and approximation of the actual developmental stage". It is therefore critical to know how this was accounted for in the annotation of the training data. Since the quality of the training data will ultimately limit the best-case quality of Kimmel Net. The authors need to go into some detail here even though the training data appears to be from another published dataset.
      2. Why were "test predictions fit to a straight line through the origin". On the one hand this makes sense (since the slope would indicate the correspondence) but it should be clarified why an intercept was omitted in the fit. After all it is unclear if Kimmel net correctly identifies 0Hpf embryos.
      3. The methods do not list how the mean of the absolute error was calculated from 3B/C. I think this should be the mean of the absolute error (not the mean of the error) but in that case the numbers listed in the text appear rather small given the histograms in 3 B/C. A clear statement in the methods would clarify this issue.

      Minor comments

      1. The Y-axis in Figure 2B is simply labeled "Loss" - what is the unit of this loss? HPF? Or HPF^2? This is important for judging the quality of the fit
      2. Figure 3 B. I would suggest changing the labels of the confidence intervals in the legend. "Inner and outer" is already clear from the figure itself, so labels that state that these are derived from n=100 vs. n=20 test image sized samples would be more useful to the reader

      Referees cross-commenting

      I concur with comments issued by the other reviewers. I think it will be especially important to address the comments related to testing the method on mutants (Reviewer #1) and training the model in the presence of noise to increase its robustness (Reviewers #1 and #3) as well as addressing the overall annotation/generation of the training data (Reviewers #1 and #2).

      I think these points will be critical to make the paper useful by increasing transparency and ensuring reproducibility in other labs with different imaging conditions, strains, mutants, etc.

      Significance

      Developmental delay is a common occurrence that can be caused by genetic and environmental background effects. It is therefore highly desirable to properly quantify this variable. The work presented here makes an important step in this direction, by allowing to quantify developmental timepoints independent of subjective staging. This speeds up analysis, increases reproducibility and enhances rigor. However, as my comments above indicate, the significance also depends on the ability of potential users to judge the quality of the work. Once those issues have been addressed, I think the work will be of broad interest to the developmental biology community, first and foremost labs utilizing the zebrafish model. However, as the authors state, the presented model architecture could be trained with the data from other species as well.

      Expertise: Zebrafish, quantitative analysis, behavior, neuroscience

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript the author is presenting a deep-learning model used to predict the development stage of zebrafish embryo. A robust method that can accurately classify a zebrafish into different development stages is highly relevant for many researchers working with zebrafish and hence the importance in developing methods like this is high.

      The manuscript is overall ok. However, more data is needed to convince the reader that the method is robust enough to work with other samples in other labs. This would greatly improve the impact of the publication.

      Page 6.<br /> - How is the data acquired?

      Page 8.<br /> "This indicates that whileKimmelNet can be used successfully with noisier test data than that on which it was trained,there is an upper limit to how noisy the data can be."<br /> - This is an obvious statement there will always be an upper limit on noise.

      Page 9.<br /> - Are only wildtype embryos used? How would this work on different mutants. To evaluate the robustness of the method this it would be valuable to test on some mutant line with known developmental difference from the wild type.

      Image data.<br /> - I would strongly suggest that the author should include a description of the data in the manuscript. A description of how the data is acquired, microscope, different batches, type of embryos used.

      "Random160translation in the y-direction was avoided due to the aspect ratio of the images (width>161height) - any artifacts introduced by translation in the x-direction would be removed by the162centre crop layer, but this would not be the case for translation in the y-direction."<br /> - Could this be solved by using some border method reflection, repetition or fixed value?

      Page 10.<br /> Addition of Noise to Image Data<br /> - This should be added in the training phase. This would probably improve the robustness of the network and also improve the results on the test data.

      • Supplementary 3 images with high noise. It is worrying that the network is not able to handle the noise in this figure. Looks like the features that is used to distinguish the developmental stage of the embryo is still clearly seen with this high noise level? Retrain the model with noise as an augmentation to improve this.

      Significance

      The development of methods like this is highly relevant in the zebrafish community. Staging and evaluating the developmental stage for zebrafish is common and is of interest to the broad community. A lot of this work today is done manually, limiting the throughput, and adding human bias.

      The limit of this study is the dataset used for training and evaluation. Firstly, it is not clear about the structure of the data and how it is acquired, different types of fish or imaging setup etc. For a method to be useful to the community it needs to be robust enough to handle different types of fish (transgenic lines). The manuscript would be greatly improved by adding this to the training and evaluation.

      My expertise is image analysis and machine learning for quantification of biological samples, with focus on zebrafish screening.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      The authors greatly appreciate Review Commons’ innovative approach to scientific review and publishing. We thank the reviewers for their kind words regarding the manuscript overall quality and for highlighting the quantitative approach and reproducibility of this work. We further thank the concerns raised and suggestions made that have contributed to improving the manuscript. Below is a point-by-point response to the reviewers, organized into sections that discriminate the alterations already made and plans for further experiments and revisions. We hope that they appropriately address the reviewers' concerns.

      2. Description of the planned revisions

      Reviewer #2: Figure 6 in particular, the number of analyzed embryos is small, given the fact that there is a lot of inter-individual heterogeneity in this process it could well be that the authors got, by chance, two embryos out of three having the same pattern of Hairy1 expression.

      R: The authors appreciate the concern raised by Reviewer#2. This experiment is very time consuming and difficult to execute, which is why the number of samples is limited. Overall, we analyzed 7 embryos and 5 recapitulated the pattern of gene expression. We were particularly interested in the occipital somites and, in this time window, 3 out of 4 showed the same expression pattern. Nevertheless, further experiments will be performed to increase the number of analyzed individuals. We are confident this will contribute to strengthening the conclusions of our work.

      Reviewer #2: I believe that an additional shorter time point (+15 or 30 min) with a different pattern of the oscillatory gene would also add to the characterization of the dynamics (same for Fig 5c). This is particularly true given that the domain of expression of Hairy 1 analyzed in Figure 6 is localized quite rostral which might be interpreted as a phase 1 or a phase 2 as well (as initially described in Palmeirim et al 1997).

      R: We thank Reviewer#2 for this suggestion. We have preliminary data (20-40 min) evidencing different patterns of expression and will perform more experiments to complement these results. Figures will be modified to include samples incubated for shorter time intervals, to evidence different expression patterns obtained in these conditions.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1 | Major points:

      1. In the RESULTS session, "Occipital somites are formed faster than cervical and trunk somites," the authors argue that the occipital somites form with greater temporal variability than the neck and trunk somites. Judging from Figures 3C and 3D, I feel it is the case. However, the authors should demonstrate it through statistical analysis.

      2. In the RESULTS session, "Anterior-posterior length of rostral somites," the authors concluded that "the large variability in measurements of somites 17-20 most probably results from the rotation of the embryo body in these developmental stages." Probably they mentioned data of the length of #17-#20 somites in Table1. They should demonstrate it through statistical analysis to show the large variability in the specific area. I understand that embryo rotation could be a reason for the variability. The authors should show evidence. Or they should discuss various possibilities from a broad perspective.

      R: The authors thank Reviewer#1 for suggesting a statistical approach to better characterize the data variability obtained. We performed a Brown-Forsythe test and found that, indeed, there is a statistically significant difference between the temporal variability (period) of somites 1-7 and 8-20 (P-value = 0.02319). Application of the Brown-Forsythe test also found a non-equal variability in the length of the somites #17-20 (P-value = 0.005403). A new Supplementary Figure 4 was added, displaying these results.

      The Brown-Forsythe test is a statistical method used to assess the equality of variances in a dataset across different groups, in this case, the period or the length of early and late somites. It is a robust alternative to the traditional Levene's test, particularly useful when the assumption of homogeneity of variances is not met, such as when the data distributions are skewed or contain outliers (which is the case with our data). It calculates the absolute deviations of individual observations from their respective group medians, which makes it less sensitive than the Levene's test to extreme values. By comparing these deviations between groups, the Brown-Forsythe test helps determine whether the variance differs significantly across the groups. We are confident that this result confers robustness to our claims, and hope that it appropriately addresses the reviewer's concerns.

      Regarding the reasons underlying the variability in the length of #17-#20 somites, we believe it is mainly due to technical constraints. In early developmental stages the chicken embryo is flat, and measurements are easily performed along the anterior-posterior (A-P) somite axis. When somites 17-20 are formed, the embryonic axis starts undergoing rotation, meaning that in some cases we may be measuring along a rotated somite axis. In our work somite length is determined as soon as the posterior intersomitic cleft is formed, so an alternative explanation could be that each somite is formed with a variable length, that is soon after consolidated, resulting in the characteristic consistent metameric organization of somites along the embryo body axis. This is highly unlikely because the length of somites 17-20 long after they are formed (Herrmann et al., 1951) is within the same value range we observed.

      The manuscript has been altered to include the above-mentioned information, as follows:

      • Methods section, under Statistical analysis (Line 153): “To assess the homogeneity of variances between early and late somites, we applied the Brown-Forsythe test on both the period and length measurements. This method involves computing the absolute deviations of individual observations from their respective group medians, rendering it less sensitive to extreme values (outliers). Through the comparison of these deviations across groups, the Brown-Forsythe test aids in determining the statistical significance of variance disparities.”
      • Results section, under Anterior-posterior length of rostral somites (Line 201): “A larger variability was obtained for measurements of somites 17-20 (Supplementary Figure 4A), although this most probably results from the rotation of the embryo body in these developmental stages, hindering precise length measurements of the somite A-P axis.”
      • Results section, under Occipital somites are formed faster than cervical and trunk somites (Line 216): “Remarkably, there is substantial variability in the time of formation of the early-most somites (Figure 3C; Supplementary Figures 3), which gradually stabilizes until somite 8 onwards, where both somite formation time and variability is equivalent to that observed for somites 15-20 (Figure 3C, D; Table 1; Supplementary Figure 4B).”
      1. The authors do not describe the expression patterns of hairy1 in the PSM in the manuscript, but they merely judged whether they are different or the same (recapitulate). The description of the expression pattern needs to be revised totally. The authors should describe the expression patterns of hairy1 in the PSM of each sample carefully and in detail. Fortunately, the previous report (Pourquie and Tam, Developmental Cell, 1, 619-620, 2001) categorized the expression patterns of the EC genes into three phases. The authors should at least categorize each sample according to the criterion by Pourque and Tam. If arrows of brackets indicate the area of expression, it is reader-friendly.

      R: A thorough characterization of segmentation clock gene expression (including hairy1) in the PSM of early somitogenesis chick embryos has been previously described (Rodrigues et al, 2006). For this reason, the authors focused mainly on a comparative analysis of the hairy1 expression patterns obtained in explants incubated for different periods of time. The authors acknowledge, however, the need for further description of the expression patterns obtained in early gastrulation stages, which haven’t been previously documented. Overall, the following alterations were made to the manuscript text:

      Line 231: “The regions with greater variability of hairy1 expression included the neural plate, anterior to the node, the epiblast posterior to the node encompassing the precursors of the paraxial mesoderm (Psychoyos & Stern, 1996) and the caudal-most epiblast. hairy2 expression was also very dynamic along the embryo A-P axis (n=20) (Figure 4B), evidencing chevron-like expression domains, that appear at different levels of the primitive streak, as previously described by Jouve and collaborators (Jouve et al., 2002).”

      Line 245: “As somitogenesis takes place, hairy1 and hairy2 expression patterns retain their dynamic properties in the PSM (Figure 5A, B), as previously described (Rodrigues et al., 2006).”

      The authors further thank Reviewer#1’s suggestion to include brackets indicating the areas of hairy1 expression. Figures 4, 5 and 6 have been altered accordingly, which, indeed, makes figure interpretation more reader-friendly. The gene expression phases presented in Palmeirim et al (1997) and then by Pourquié and Tam (2001) mean to summarize a dynamic expression, with continuous intermediate phases, making it difficult to clearly categorize each pattern obtained. Since our purpose was to evaluate if the entire expression pattern was recapitulated (irrespective of the specific phase), we believe that categorizing each sample in phases is not paramount for result interpretation.

      Reviewer #1 | Minor points.

      1. In the RESULTS session, "Anterior-posterior length of rostral somites," the authors described that the average length of somites ranges from 118 - 191 μ__m. But according to Table-1, the lowest length of somite is 115.92 μ__m (__∼__116 μ__m). So, the lower limit should be corrected here.

      R: The authors thank the reviewer for pointing this out. The text has been appropriately modified in Line 199 of the revised manuscript.

      1. In the DISCUSSION section, the authors mentioned that they presented a thorough characterization of the size and time of formation of the first ten somites in the chicken embryo. But based on the tables and figures, it will only be the first nine somites, not the ten.

      R: The authors agree with the reviewer’s comment. The text has been appropriately modified in Lines 128, 176 and 279 of the revised manuscript.

      1. In the GRAPHICAL ABSTRACT, if the color of the oscillation line is the same as the corresponding somites, it is intuitive.

      R: The authors thank the reviewer for this suggestion and have modified the graphical abstract accordingly. We employed multiple shades of the same color (corresponding to different positions along the body axis) to represent different embryos.

      1. It would be helpful if the manuscript contained both page and line numbers.

      R: Page and line numbers were added to the revised manuscript.

      Reviewer #2

      Minor comment: In 5 c similar patterns and newly formed somites should be pointed out by arrows on the figure to help the readers.

      R: We thank Reviewer#2 for this suggestion. Arrows and brackets have been added to the figure to highlight the newly formed somites and gene expression domains, respectively.

      Minor comment: To be more specific the term segmentation clock should be used instead of embryonic clock as I believe there are other embryonic clocks (cell cycle, circadian, etc..)

      R: The authors appreciate the suggestion of Reviewer#2 regarding the term used to identify the molecular oscillator in our work. The term “segmentation clock” or “somitogenesis clock” is commonly used to refer to oscillations in hairy1/2 gene expression because their discovery and subsequent study has mainly focused on the somitogenesis process. Oscillations of hairy1/2 expression (Hes1/7 in mouse), however, have also been described in cells and developmental stages that are not associated with somite formation, and herein we describe dynamic expression in epiblast regions containing precursors that don’t give rise to segmented structures. As discussed in our recent paper (Carraco et al., Front. Cell Dev. Biol, 2022), the broader term Embryo Clock may be used to refer to molecular oscillations in embryonic cells, controlled by negative feedback regulation, that play a role in temporally controlled morphogenetic processes and/or cell fate specification.

      In the beginning of our manuscript (Line 75), we clearly state that we are referring to the embryo clock operating during somitogenesis: “(…) somitogenesis embryo clock (EC), comprising genes with cell-autonomous oscillatory expression in the PSM driven by negative feedback loops (reviewed in Carraco et al., 2022)”, so we believe that the term used will be clearly perceived by the reader. For further clarification, however, the subtitle The Embryo Clock in early somitogenesis in the Discussion section has been modified to (Line 321): “The Embryo Segmentation Clock in early somitogenesis”

      Reviewer #3 | Minor comments:

      The authors did not consider the fact that the first formed somite is the second somite. After the formation of the second somites, the real first somite forms anterior to the second somite. Furthermore, the real first and the third somite seems to be formed simultaneously. It is worthy for the authors to re-examine the data, whether the real first somite and the third somite are formed at the same time. And to check whether the first somite was counted to the segmented region. And this point should be at least discussed.

      R: The authors thank Reviewer#3 for the opportunity to clarify this important issue in our manuscript. It was previously described that the first morphological somite formed is, in fact, the second somite, while the “real” first somite is formed later, anteriorly to this one (Hamburger and Hamilton, 1951). This rostral-most somite-like structure is not anteriorly delimited by a fissure and has thus been termed an “incomplete” or “rudimentary” somite (Hinsch & Hamilton, 1956). Since the methodology used in our work relies on measuring the length between the rostral-most and the posterior-most intersomitic clefts, the “rudimentary” somite is not included in our data, and we considered somite #1 as the first somite delimited both anteriorly and posteriorly by intersomitic clefts. This was stated in the Methods section, under Embryo measurements, and has now also been made explicit under Somite nomenclature (Line 120): “Only structures delimited both anteriorly and posteriorly by intersomitic clefts were counted as somites.”

      We, indeed, observe the formation of the “rudimentary” somite anteriorly to somite #1, when somites 3-4 and formed. This information was included in the Discussion section, under Spatio-temporal properties of the rostral somite segmentation (Line 311): “Note that our analysis did not consider the “rudimentary somite”, as defined by Hinsch and Hamilton (Hinsch & Hamilton, 1956) since it does not possess an anterior somitic cleft. We found that this structure becomes clearly visible, rostrally to somite 1, as somites 3-4 are formed.”

      4. Description of analyses that authors prefer not to carry out

      Reviewer #1 (Significance (Required)):

      General assessment: The results are not conceptually new or surprising. However, their careful quantitative analysis is informative and worthwhile to be published because it has yet to be done in the early somite formation. In this manuscript, the authors focused on the early stage of somitogenesis. It will be more informative if they complete this analysis for the whole somite area including the end of somitogenesis.

      Reviewer #3 | Referees cross commenting****

      I find all criticisms are justified. The most advance, as stated by other both reviewers, is the quantitative assay of the somite formation, since this is no yet done previously. As suggested by the first reviewer, it will be more informative if the authors complete this analysis for all regions. For all somites would be too much work, but they can select some representative somites of each region in addition to occipital region, such as 3 somites for one region, including the cervical, thoracic, lumbal, sacral and caudal region. Thus, the dynamic of the temporal somite formation of the whole embryo can be analysed using the same method. This will provide much more impact for this work.

      R: The authors thank the Reviewers #1 and #3 for the kind words and for highlighting the quantitative approach taken. Regarding completing the analysis for the whole somite area including the end of somitogenesis, the authors agree that this would be interesting for the community. The focus of this work, however, was a detailed understanding of early somite segmentation, where measurements of somites 14-20 were performed for validation purposes alone of the technical approach developed, since their time of formation has been previously well established. Characterization of somite formation dynamics along the entire embryonic axis, while informative, would entail significant technical challenges, which are beyond the focus of this work. Briefly, we performed live imaging using the EC culture system (Chapman et al, 2001). This appropriately reproduces in ovo development of early embryos but imposes significant constraints on embryo development in older developmental stages, including the ones corresponding to the formation of the last somites. A possible alternative to perform these measurements would be to apply the tissue explant culture system developed by Palmeirim et al., 1997 to different portions of the embryo body, and optimize it for real-time imaging. However, we believe that the time and effort required are beyond the scope of this work and would not significantly contribute to elucidating the main questions addressed in this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Somites form consecutively along the anterior to posterior (AP) axis. The time of the formation of a somite is controlled by the segmentation clock, oscillation of cyclic genes in the presomitic mesoderm. The length of an oscillation cycle differs between species and should also differ between the axial levels. In chicken embryos, one cycle for a trunk somite requires 90 minutes, while it is much slower (150 minutes) for a posterior-most somite. Is this quicker or slower for an anterior-most somite? Andrade' group addressed this question and measured the time of the formation of each occipital somites (somite 1-5). They found that the formation of an occipital somite requires only 75 minutes, while somites from somite 6 onwards takes as long as the trunk somites (about 90 minutes). The faster formation of occipital somites is correlated with the time of the cyclic expression of hairy1 and hairy2.

      Major comments:

      The conclusion is well supported by the data. The measurement of the length increments of the segmented region and then assay using algorithm are well established. Thus, the data are well reproducible.

      Minor comments:

      The authors did not consider the fact that the first formed somite is the second somite. After the formation of the second somites, the real first somite forms anterior to the second somite. Furthermore, the real first and the third somite seems to be formed simultaneously. It is worthy for the authors to re-examine the data, whether the real first somite and the third somite are formed at the same time. And to check whether the first somite was counted to the segmented region. And this point should be at least discussed.

      Referees cross commenting

      I find all criticisms are justified. The most advance, as stated by other both reviewers, is the quantitative assay of the somite formation, since this is no yet done previously. As suggested by the first reviewer, it will be more informative if the authors complete this analysis for all regions. For all somites would be too much work, but they can select some representative somites of each region in addition to occipital region, such as 3 somites for one region, including the cervical, thoracic, lumbal, sacral and caudal region. Thus, the dynamic of the temporal somite formation of the whole embryo can be analysed using the same method. This will provide much more impact for this work.

      Significance

      Significance: The measurement of the length increments of the segmented region and then assay using algorithm are the novelty and strengths of this study. So, the data are reproducible and objective.

      The results of this study extend our understanding about the dynamic process of the somitogenesis. Especially, the most interesting point is that based on this result, we can see that the segmentation clock runs faster in the head region, and then slow down gradually along the AP axis.

      Audience: specialized, basic research<br /> The developmental biologist will be interested in this topic.

      My expertise is the somite development, somite differentiation, mesoderm development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Maia-Fernandez et al used time-lapse imaging of chicken embryos to analyze the formation of the first formed somite, by doing in situ hybridization, they checked the dynamic nature of well-known segmentation clock genes during this process. They found that the segmentation clock period is faster for the formation of the first five somites (most anterior) and that this process is underlain by dynamic/cyclic expression of Hairy 1 and Hairy 2 as it has been described for more posterior somites.

      • I believe that there few issues that should be addressed to strengthen the conclusions of the manuscript:<br /> Figure 6 in particular, the number of analyzed embryos is small, given the fact that there is a lot of inter-individual heterogeneity in this process it could well be that the authors got, by chance, two embryos out of three having the same pattern of Hairy1 expression.
      • I believe that an additional shorter time point (+15 or 30 min) with a different pattern of the oscillatory gene would also add to the characterization of the dynamics (same for Fig 5c). This is particularly true given that the domain of expression of Hairy 1 analyzed in Figure 6 is localized quite rostral which might be interpreted as a phase 1 or a phase 2 as well (as initially described in Palmeirim et al 1997).

      Minor comments:

      In 5 c similar patterns and newly formed somites should be pointed out by arrows on the figure to help the readers.<br /> To be more specific the term segmentation clock should be used instead of embryonic clock as I believe there are other embryonic clocks (cell cycle, circadian, etc..)

      Significance

      In this study, the authors address the question of the formation of the first-formed somites using bird a model system; this is a conceptual advance in the sense that our knowledge of the dynamics of these critical morphological events is minimal. The technical advances (time-lapse, image analysis, dissection) made by the authors are quite remarkable and allow for filling the gap of knowledge the community has in this particular domain. The article is well written, data are well presented and it is interesting for a large community of developmental biologists.

      My expertise is in cell and tissue morphogenesis of amniote embryos

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Maia-Fernandes et al. investigated somite formation dynamics in the chick embryo's early stage in this manuscript. They found that the cranial most somites (1-5) form faster than the trunk. They also show that the oscillatory expression pattern of hairy1, regarded as the somitogenesis embryo clock (EC), is coupled to the somite segmentation in the occipital somites. The results are not conceptually new or surprising; they merely show what has been widely believed. However, their careful quantitative analysis is informative and worthwhile to be published because it has yet to be done in the early somite formation. To improve the manuscript, I have several concerns to be addressed.

      Major points.

      1. In the RESULTS session, "Occipital somites are formed faster than cervical and trunk somites," the authors argue that the occipital somites form with greater temporal variability than the neck and trunk somites. Judging from Figures 3C and 3D, I feel it is the case. However, the authors should demonstrate it through statistical analysis.
      2. The authors do not describe the expression patterns of hairy1 in the PSM in the manuscript, but they merely judged whether they are different or the same (recapitulate). The description of the expression pattern needs to be revised totally. The authors should describe the expression patterns of hairy1 in the PSM of each sample carefully and in detail. Fortunately, the previous report (Pourquie and Tam, Developmental Cell, 1, 619-620, 2001) categorized the expression patterns of the EC genes into three phases. The authors should at least categorize each sample according to the criterion by Pourque and Tam. If arrows of brackets indicate the area of expression, it is reader-friendly.
      3. In the RESULTS session, "Anterior-posterior length of rostral somites," the authors concluded that "the large variability in measurements of somites 17-20 most probably results from the rotation of the embryo body in these developmental stages." Probably they mentioned data of the length of #17-#20 somites in Table1. They should demonstrate it through statistical analysis to show the large variability in the specific area. I understand that embryo rotation could be a reason for the variability. The authors should show evidence. Or they should discuss various possibilities from a broad perspective.

      Minor points.

      1. In the RESULTS session, "Anterior-posterior length of rostral somites," the authors described that the average length of somites ranges from 118 - 191 μm. But according to Table-1, the lowest length of somite is 115.92 μm (∼116 μm). So, the lower limit should be corrected here.
      2. In the DISCUSSION section, the authors mentioned that they presented a thorough characterization of the size and time of formation of the first ten somites in the chicken embryo. But based on the tables and figures, it will only be the first nine somites, not the ten.
      3. In the GRAPHICAL ABSTRACT, if the color of the oscillation line is the same as the corresponding somites, it is intuitive.
      4. It would be helpful if the manuscript contained both page and line numbers.

      Significance

      General assessment: The results are not conceptually new or surprising. However, their careful quantitative analysis is informative and worthwhile to be published because it has yet to be done in the early somite formation. In this manuscript, the authors focused on the early stage of somitogenesis. It will be more informative if they complete this analysis for the whole somite area including the end of somitogenesis.

      Advance: Previously no one provided quantitative data for somite formation. In this viewpoint, this manuscript has an advantage.

      Audience: Their data could be helpful in the generation of mathematical models.

      My field: Developmental Biology

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      1. The authors claim PA-induced mTORC1 activation is dependent on acetyl-CoA derived from mitochondrial fatty acid oxidation. Using isotope tracing to determine the contribution of PA to acetyl-CoA, would improve.

      Response: We thank the reviewer for this valuable comment. As suggested, we measured the contribution of PA to the total cellular acetyl-CoA pool using metabolic flux assays. The results showed that the incorporation of 13C from [U-13C16]-palmitate into acetyl-CoA was exceeding 60% (Page 40, Figure 4D in the revised version), indicating that exceeding 60% of the acetyl-CoA pool was PA derived. Likewise, we also found that the incorporations of 13C from [U-13C16]-palmitate into 6:0-CoA, 8:0-CoA, 10:0-CoA and 12:0-CoA were all exceeding 50% (Page 40, Figure 4D in the revised version). Thus, these results suggested that a large portion of the PA was used for fatty acid oxidation upon entering the cell. Moreover, we found that fatty acid β oxidation blocked by perhexiline maleate inhibited PA-induced increase of acetyl-CoA, suggesting that the induction of acetyl-CoA content was largely dependent on the fatty acid oxidation of PA. Furthermore, we also demonstrated that inhibition of mitochondrial fatty acid β oxidation by pharmacological inhibitor or genetic knockdown abrogated PA-induced activation of mTORC1. However, using sodium acetate treatment to elevate cellular acetyl-CoA levels rescued impaired mTORC1 activity induced by the inhibition of fatty acid β oxidation under PA condition. Together, these results revealed that PA-induced mTORC1 activation is dependent on acetyl-CoA derived from mitochondrial fatty acid oxidation.

      1. They showed PA increases fatty acid oxidation related gene expression and acetyl-CoA level, while OA and LA could not. why only PA could increases fatty acid oxidation and acetyl-CoA level, considering both of these lipids could be oxidation in mitochondria? Is there any differences in mitochondria among treatment of PA, OA and LA? It is better to monitor fatty acid oxidation in real time using seahorse. And add discussions.

      Response: We thank the reviewer for this constructive question. As suggested, we performed the seahorse real-time cell metabolic analysis and the results showed that PA treatment enhanced mitochondrial OCR and elevated maximal oxygen consumption rates compared with OA or LA treatment in fish myocytes (Page 37, Figure 3B in the revised version). Likewise, we also found that PA-induced increase of fatty acid oxidation-related gene expressions was more robust than OA or LA in vivo and in vitro. Thus, these results indicated that the induction of mitochondrial fatty acid oxidation by pa treatment was stronger than OA or LA treatment.

      In this study, using LC–MS, we showed that PA treatment increased the contents of short/medium-chain acyl-CoA and acylcarnitine in comparison with OA or LA treatment. Thus, these results suggested that although all three fatty acids can be oxidation in mitochondria, PA may be preferred to enter the mitochondria for fatty acid β oxidation, compared with OA or LA. Previous studies have found that OA is more inclined to synthesize triglycerides to induce the formation of lipid droplets than PA (Chen et al., 2023; Plötz et al., 2016). Likewise, we also found that OA significantly increased the contents of 18:1-CoA in comparison with PA. Thus, we speculate that, after entering the cell, OA is more preferentially synthesized to triglyceride for storage than fatty acid oxidation. Moreover, LA is considered to be a precursor of arachidonic acid, and can be converted to a myriad of bioactive compounds called eicosanoids (Whelan & Fritsche, 2013). Similarly, we found that LA markedly elevated the contents of 18:2-CoA/18:3-CoA. Thus, we conjecture that LA preferentially synthesizes functional lipids compared to entering mitochondria for fatty acid oxidation. Together, differences in the levels of acetyl-CoA produced by these three fatty acids may be related to their metabolic pathway preferences.

      There may be two reasons for why PA prefers to enter mitochondrial for fatty acid oxidation. On one hand, due to differences in the structure of PA, OA and LA, the substrate affinity of CPT1B to these fatty acyl-CoAs may be different, that may contribute to the different rates of fatty acid to enter into mitochondria. On the other hand, in contrast to the β-oxidation of SFAs, the β-oxidation of UFAs requires the involvement of 2,4-dienoyl-CoA reductase (You et al., 1989), and thus the β-oxidation of SFAs may be more efficient.

      At present, the understanding of differences in fatty acid oxidation between SFAs and UFAs is insufficient, so more studies are needed in the future to further explore the underling mechanisms behind these differences. The reviewers have raised a very important direction for research, and so we will continue to address this issue in future.

      We have expanded this section of the Discussion (Page 14, line 388-412 in the revised version).

      1. The authors present lots of western blot images, suggest to provide quantification data of these blots.

      Response: We thank the reviewer for their careful assessment of our study. We apologize for not providing quantification data for western blot images in our initial manuscript. To support our conclusions, we have now added a densitomentric and statistical analysis of all western blots in the revised version (Page 33, Figure 1 in the revised version; Page 35, Figure 2 in the revised version; Page 37, Figure 3 in the revised version; Page 40, Figure 4 in the revised version; Page 43, Figure 5 in the revised version; Page 45, Figure 6 in the revised version; Page 47, Figure 7 in the revised version; Page 51, Figure S1 in the revised version; Page 53, Figure S2 in the revised version; Page 54, Figure S3 in the revised version; Page 56, Figure S4 in the revised version; Page 57, Figure S5 in the revised version).

      1. It is better to discuss the relationship between fatty acid oxidation and mTOR signaling.

      Response: The reviewer’s comments are very valuable. We apologize for not discussing enough for the relationship between fatty acid oxidation and mTOR signaling in our initial manuscript. We have now expanded this section of the Discussion (Page 13, line 366-387 in the revised version, see below).

      Growing lines of evidence suggested a strong link between mitochondrial fatty acid oxidation and mTORC1 signaling (Ricoult & Manning, 2013). As a central regulator of anabolism, mTORC1 is considered to inhibit fatty acid β oxidation pathway for energy storage or ketogenesis (Aguilar et al., 2007). Several studies revealed that restrained mTORC1 by rapamycin induced fatty acid β oxidation in rat hepatocytes through increasing expression of fatty acid β oxidation related enzymes (Brown et al., 2007; Peng et al., 2002). Likewise, mice with whole-body knockout of S6K1 showed enhanced fatty acid β oxidation and increased expression levels of CPT1 in isolated adipocytes (Um et al., 2004), and S6K1/S6K2 double-knockout mice also exhibited elevated fatty acid β oxidation of fatty acids in isolated myoblasts by activating AMPK (Aguilar et al., 2007). Furthermore, a recent study has established that FOXK1 can mediate the inhibition of fatty acid β oxidation by mTORC1 (Fujinuma et al., 2023). Thus, these collective data revealed that fatty acid β oxidation was restrained by mTORC1. However, conversely, the role of mitochondrial fatty acid oxidation in the regulation of mTORC1 is still controversy. A study in prostate cancer cells suggested that inhibited fatty acid β oxidation by etomoxir reduced mTORC1 activity (Schlaepfer et al., 2014), and another study found that deleting CPT1B specifically in skeletal muscle of mice suppressed mTORC1 by provoking AMPK activation (Vandanmagsar et al., 2016). Consistent with these studies, our results showed that acetyl-CoA derived from mitochondrial fatty acid β oxidation induced mTORC1 activation under PA treatment, indicating that acetyl-CoA may be a novel insight linking fatty acid β oxidation and mTORC1 signaling. Paradoxically, unlike other studies, a recent study found that mice with heart-specific CPT2-deficient exhibited induction of mTORC1 pathway. Thus, the effects of fatty acid β oxidation on mTORC1 pathway are complicated and may differ under variable physiological and pathological conditions. Further studies are needed to determine the sophisticated mechanisms underlying the regulation of fatty acid β oxidation on mTORC1 signaling.

      Reviewer #2:

      Major comments:

      Initial experiment: Among several fatty acid-rich diets, fish were fed a palmitic acid (PA) rich (PO) diet for 10 weeks, and the PO diet significantly raised fasting blood glucose levels compared to control diet (fish oil of equal lipid content). The PO diet also impaired the fish's glucose and insulin tolerance. The PO diet also led to decreased phosphorylation levels of AKT, which regulates glucose metabolism. Therefore, the researchers initially concluded that a palmitic acid-rich diet leads to systemic insulin resistance in fish.

      1. I have a couple of questions on this initial experiment on which all the subsequent studies are based. In Figure 1A, the body weight was identical in control and PO group. Don't you expect PO feeding lead to obesity in fish, as HFD induces obesity in mice?

      Response: We thank the reviewer for this constructive question. In our study, we found that dietary PO diet for 10 weeks failed to affect the body weight of fish, compared with CON diet. Similar to our results, a study in human also found that there was no significant differences in the body weight and body mass index (BMI) between saturated fat diet and monounsaturated fat diet (Vessby et al., 2001). Unlike high-fat diet, the lipid content level of PO diet was not elevated, but only the fatty acid composition was altered, with palmitic acid composition being significantly increased in comparison with CON diet (Page 60, Table S1). Thus, this may be the reason of why the PO diet did not induce weight gain.

      Although accumulating evidence showed that the onset of insulin resistance was often accompanied by weight gain and obesity (Kahn & Flier, 2000; Shoelson et al., 2007), some studies also found that insulin resistance occurred without obesity. A recent study found that mice with liver knockout of Lpcat3 exhibited improved insulin sensitivity without a change in the body weight (Tian et al., 2023). Moreover, another study in mice showed that dietary phenylalanine-rich diet induced insulin resistance, but had no effects on the body weight (Zhou et al., 2022). Likewise, our study also found that dietary PO diet provoked systemic insulin resistance, while did not affect the body weight in fish. Thus, these studies indicated that the development of insulin resistance may not always be entirely accompanied by obesity.

      1. Figure 1G and 1H show glucose and insulin tolerance after PO feeding for 10 weeks. The area under curve (AUC) should be compared to determine if GTT and ITT were statistically different. The ITT curve is particularly interesting as the control fish did not seem to respond to insulin, while the PO-fed fish responded more robustly. The only difference is the initial glucose level. Are the GTT and ITT done after fasting? How long is the fasting? The curves suggest that even though PO increased (fasting) blood glucose levels, it improved insulin sensitivity - therefore the premise that PO induces insulin resistance is not supported here. The lack of insulin induced response in the control group is worrisome. I suggest that the measures should be retaken, and AUC should be used to support if there are any differences in GTT and ITT.

      Response: We thank the reviewer for this valuable comment. We apologize for making this confusion in the initial manuscript and we thank the reviewer for providing this opportunity to correct our manuscript. As suggested, to further investigate whether dietary PO could cause impairment of insulin sensitivity, we have re-performed the GTT and ITT assays. Considering that fish have a poor capacity to utilize glucose, we extended the assay time to 8 h. To make the results more accurate, we also added the biological replicates. Moreover, before injection of glucose or insulin, fish were fasted for 24 h. Furthermore, we added area under curve (AUC) of GTT and ITT, and performed statistical analyses of the AUC data.

      Our results showed that dietary PO diet reduced glucose tolerance and insulin tolerance in fish (Page 33, Figure 1G and 1H in the revised version). Moreover, compared with CON diet, the AUC of GTT and ITT were significantly increased in PO diet (Page 33, Figure 1G and 1H in the revised version). Similarly, we found that dietary PO diet elevated fasting blood glucose levels and plasma insulin concentrations. Furthermore, we showed that dietary PO diet decreased the phosphorylation levels of AKT in the liver and skeletal muscle. In addition, we also demonstrated that PA treatment could induce cellular insulin resistance in fish myocytes and C2C12 myotubes. Thus, in our opinion, the above results could indicate that dietary PO induced insulin resistance in fish.

      1. Based on the assumption that PO induces IR (which needs to be confirmed based on the previous comments), the researchers attempted to understand how PA triggers IR through a series of experiments, predominantly western blot analysis. All the Western blots should be quantified. The model is that PA activates FAO in mitochondrial that elevates cytosolic acetyl-coA, which acetylates Rheh to activate mTORC1. mTORC1 on one hand alters IRS1 phosphorylation and on the other hand inhibits transcriptional activity of TFEB to reduce Irs1 mRNA level. Together reduces IRS1 leads to Insulin Resistance.

      Response: The reviewer’s comments were very important to verify the validity of our findings. We have now added a densitomentric and statistical analysis of all western blots in the revised version (Page 33, Figure 1 in the revised version; Page 35, Figure 2 in the revised version; Page 37, Figure 3 in the revised version; Page 40, Figure 4 in the revised version; Page 43, Figure 5 in the revised version; Page 45, Figure 6 in the revised version; Page 47, Figure 7 in the revised version; Page 51, Figure S1 in the revised version; Page 53, Figure S2 in the revised version; Page 54, Figure S3 in the revised version; Page 56, Figure S4 in the revised version; Page 57, Figure S5 in the revised version).

      1. Figure 1. PA reduces basal and insulin stimulated AKT phosphorylation in fish liver and muscle, as well as in culture fish and murine myocytes (Fig. 1I-M). The results appear to be solid but need to be quantified.

      Response: We thank the reviewer for this kind suggestion. We have now added a densitomentric and statistical analysis of all western blots in Figure 1 (Page 33, Figure 1 in the revised version).

      1. Figure 2 shows that PA provoked hyperactivation of mTORC1 (indicated by elevated phosphorylated S6K levels. This effect was abolished by Rapamycin treatment (an mTORC1 inhibitor) and also abolished by insulin stimulation (2F). Again, the western blots should be quantified.

      Response: We thank the reviewer for this excellent suggestion. We have now added a densitomentric and statistical analysis of all western blots in Figure 2 (Page 35, Figure 2 in the revised version).

      1. Figure 6: the researchers measured the effect of PA treatment on IRS1 phosphorylation in order to understand the mechanism of insulin resistance induced by mTORC1 activation under PA treatment. A PO diet intensified S636/S639 phosphorylation in fish muscle. In fish myocytes and C2C12 myotubes, PA treatment elevated S636/S639 phosphorylation but decreased the Y612 phosphorylation of IRS1 in a dose-dependent manner. Treatment of fish myocytes and C2C12 myotubes with an mTOR inhibitor blocked increased IRS1 S636/S639 phosphorylation levels under PA treatment. Also, PA specifically reduced mRNA levels of Irs1. This indicates that PA-induced, mTOR-dependent alteration of IRS1 phosphorylation and transcription may have contributed to insulin resistance. It is unclear how mTORC induces either increase or decrease in IRS1 phosphorylation depending on the residuals.

      Response: We appreciate the reviewers for this important question. In fact, previous studies have clearly explored how mTORC1 pathway affects S636/S639 phosphorylation of IRS1. On one hand, as a kinase complex, mTORC1 could directly induce S636/S639 phosphorylation of IRS1 in vitro (Ozes et al., 2001). On the other hand, mTORC1 could activate S6K to promote S636/S639 phosphorylation of IRS1 (Shah & Hunter, 2006; Um et al., 2004). In addition, considering that the serine/threonine phosphorylation status of IRS has been shown to affect its tyrosine phosphorylation and protein degradation (Copps & White, 2012), we speculate that the decrease of Y612 phosphorylation of IRS1 is dependent on the induction of IRS1 S636/S639 phosphorylation.

      In this study, we found that PA could induce S636/S639 phosphorylation of IRS1 in a mTORC1-dependent manner. Considering that previous studies have explored the mechanism by which mTORC1 induced IRS1 S636/S639 phosphorylation, we did not conduct further studies on this issue. Notably, we found that mTORC1 could also regulate the transcription of IRS1, so we subsequently investigated the mechanism by which mTORC1 inhibited IRS1 transcription.

      1. Figure 7 shows that PA inhibits nuclear translocation of TFEB to suppress IRS1 transcription. The EMSA in 7D is not convincing.

      Response: We thank the reviewer for this valuable comment and we apologize for providing unclear blots in the initial manuscript. To support our conclusions, we have now re-performed the EMSA assays. The results suggested that TFEB can directly bind to the IRS1 promoter at these two sites (Page 47, Figure 7D in the revised version).

      Minor comments:

      1. Some data appears to weaken the results and/or contradictory. For example, the paper initially showed reduced AKT phosphorylation to support PA induced IR, but shouldn't a lower level of pAKT reduces mTORC activation? But then the rest of the manuscript explores how PA activates mTOR. Part of the IR is manifested by impaired mTORC1 activation, yet the PA activates mTORC1. The authors should present the rationale and flow of the ideas in a better way.

      Response: We thank the reviewer for this excellent suggestion. We appreciate the points that in some insulin resistance conditions, as a downstream of the insulin pathway, mTORC1 activity is manifested to be inhibited. However, mTORC1 activity showed different under other insulin resistance conditions.

      In fact, multiple negative feedback signals exist in cells to maintain cellular homeostasis under diverse environmental challenges and stimulations (Kearney et al., 2021). However, aberrant of negative feedback can lead to impaired intracellular signaling pathway and induce a variety of diseases (Nguyen & Kholodenko, 2016). Similarly, numerous negative feedback mechanisms also exist in insulin signaling to prevent the development of cancers that may be induced by hyperactivation of insulin pathway. The negative feedback of insulin pathway is mainly mediated by mTORC1, which has been found to inhibit insulin signaling transduction by directly or indirectly affecting IRS1 phosphorylation (Copps & White, 2012; Shah & Hunter, 2006; Um et al., 2004). However, under some pathological or stress conditions, mTORC1 is over-activated, resulting in the amplification of the negative feedback of insulin pathway and the development of insulin resistance. A recent study found that imidazole propionate, a metabolite produced by the gut microbiota, provoked insulin resistance through inducing mTORC1 activation and phosphorylation of IRS1 (Koh et al., 2018). Other studies also showed that elevated abundance of branched-chain amino acids (BCAAs) or branched-chain α-keto acid (BCKA) could cause insulin resistance by boosting mTORC1 pathway (Zhou et al., 2019). Thus, mTORC1 activation induced-negative feedback inhibition of insulin pathway may be a critical factor in the development of insulin resistance.

      Consistently, our study found that PA could activate mTORC1 in an acetylation modification-dependent manner. Moreover, activation of mTORC1 inhibited the phosphorylation of AKT and caused insulin resistance by affecting the phosphorylation and transcription of IRS1.Indeed, AKT is considered to activate mTORC1 in multiple manners, and inhibition of AKT results in the reduction of mTORC1 activity. However, mTORC1 activity is not only affected by AKT, but is also regulated by a diverse set of upstream signals (Saxton & Sabatini, 2017). Thus, we considered that the activating effect of PA on mTORC1 activity is higher than the negative effect of mTORC1 activity produced by AKT inhibition. This also led to the fact that mTORC1 remained in an activated state despite the inhibition of AKT in PA condition.

      1. There are also many run-on sentences and grammar issues, making it very hard to read. The writing can be improved.

      Response: We thank the reviewer for this valuable comment and we apologize for these grammar mistakes in the initial manuscript. Following the reviewer’s suggestion, we have invited native speaker to guide the English writing and carefully corrected these run-on sentences and grammar issues. We thank the reviewer for this careful evaluation of our manuscript.

      Reviewer #3:

      Major issues affecting the conclusions:

      1. The conclusions are supported by the data. However, I suggest to perform a densitomentric and statistical analysis of western blots, especially when the authors report a representative blot, showing samples loaded in single.

      Response: We thank the reviewer for this excellent suggestion. We agree that it would be important to verify the validity of our findings and we apologize for not providing quantification data for western blot images in our initial manuscript. We have now added a densitomentric and statistical analysis of all western blots in the revised version (Page 33, Figure 1 in the revised version; Page 35, Figure 2 in the revised version; Page 37, Figure 3 in the revised version; Page 40, Figure 4 in the revised version; Page 43, Figure 5 in the revised version; Page 45, Figure 6 in the revised version; Page 47, Figure 7 in the revised version; Page 51, Figure S1 in the revised version; Page 53, Figure S2 in the revised version; Page 54, Figure S3 in the revised version; Page 56, Figure S4 in the revised version; Page 57, Figure S5 in the revised version).

      1. The methods are clear and reproducible. The authors should better explain how they have dissolved all the powders (i.e. fatty acids) to obtain the stock solutions next diluted (from what concentration?) in the media

      Response: We thank the reviewer for this valuable comment. We apologize for not explaining how to dissolved all the powders in the media. As suggested, we have now provided a detailed explanation of how to dissolved all the powders to obtain the stock solutions in the Methods (Page 19, line 537-586 in the revised version, see below).

      For PA, OA or LA in vitro treatment, fatty acid free BSA (Equitech-Bio, USA) was dissolved in FBS-free DMEM at room temperature according the ratio 1:100 (1 g fatty-acid free BSA: 100 ml FBS-free DMEM). 500 mg PA (Merck, Cat#P0500), OA (Merck, Cat#O1008) or LA (Merck, Cat#L1376) was dissolved in 10 ml ethanol to obtain PA, OA or LA stock solution respectively. Then PA, OA or LA stock solution was blow-drying with nitrogen gas and was dissolved in 0.1 M NaOH and warming at 75°C until clear to obtain 100 mM PA, OA or LA solution. Subsequently, 100 mM PA, OA or LA solution was added to 1% BSA solution according the ratio 1:100 (100 mM PA:1% BSA, v/v) at 50°C. Finally, the mixture was filtered using a 0.45 μM filter and stored at -20°C. For insulin in vitro treatment, insulin powder (Merck, USA) was dissolved in hydrochloric acid (pH=2) to obtain 1 mg/ml stock solution. For rapamycin or Torin1 in vitro treatment, rapamycin (Med Chem Express, #HY-10219, USA) or Torin1 (Med Chem Express, #HY-13003, USA) was dissolved in dimethyl sulfoxide (DMSO, Solarbio, China) to obtain 1 mM stock solution respectively. For MHY1485 in vitro treatment, MHY1485 (Med Chem Express, #HY-B0795, USA) was dissolved in DMSO (Solarbio, China) to obtain 10 mM stock solutions.

      For etomoxir or perhexiline maleate in vitro treatments, etomoxir (Med Chem Express, #HY-50202, USA) or perhexiline maleate (Med Chem Express, #HY-B1334A, USA) was dissolved in DMSO (Solarbio, China) to obtain 50 mM stock solution respectively. For BMS-303141 treatment, BMS-303141 (Med Chem Express, #HY-16107, USA) was dissolved in DMSO (Solarbio, China) to obtain 25 mM stock solutions. For sodium acetate treatment, sodium acetate (Merck, #S2889, USA) was dissolved in ultrapure water from a Milli-Q water system to obtain 5M stock solution. For C646, spermidine or MB-3 treatment, C646 (Med Chem Express, #HY-13823, USA), spermidine (Med Chem Express, #HY-B1776, USA) or MB-3 (Merck, #M2449, USA) was dissolved in DMSO (Solarbio, China) to obtain 50 mM stock solution respectively. For MG149 treatment, MG149 (Med Chem Express, #HY-15887, USA) was dissolved in DMSO (Solarbio, China) to obtain 150 mM stock solution. For TFEB activator 1 treatment, TFEB activator 1 (Med Chem Express, #HY-135825) was dissolved in DMSO (Solarbio, China) to obtain 10 mM stock solution.

      1. Anova analysis should be performed to analyze western blot densitometries.

      Response: The reviewer raises an important point and we appreciate this comment. As suggested, we have now added statistical analyses of all western blot densitometries in the revised version. The data are presented as the means ± SEM and were analyzed using independent t-tests for two groups and one-way ANOVA with Tukey’s test for multiple groups (Page 33, Figure 1 in the revised version; Page 35, Figure 2 in the revised version; Page 37, Figure 3 in the revised version; Page 40, Figure 4 in the revised version; Page 43, Figure 5 in the revised version; Page 45, Figure 6 in the revised version; Page 47, Figure 7 in the revised version; Page 51, Figure S1 in the revised version; Page 53, Figure S2 in the revised version; Page 54, Figure S3 in the revised version; Page 56, Figure S4 in the revised version; Page 57, Figure S5 in the revised version).

      Minor comments:

      Prior studies are referenced appropriately, text and figures are clear. I suggest to add in the abstract all the model systems used. HEK293 also should be inserted in the description of the results. Please add the reference to figure 8 in the text. Please, describe cell origin.

      Response: We thank the reviewer for this careful assessment of our study. We apologize for not making this clearer in our initial manuscript. We have now added all the model systems used in the Abstract (Page 2, line 24-28 in the revised version, see below).

      Here, using a croaker model, we report that dietary palmitic acid (PA), but not oleic acid or linoleic acid, leads to dysregulation of mTORC1 signaling which provokes systemic insulin resistance and glucose intolerance. Mechanistically, using croaker primary myocytes, mouse C2C12 myotubes and HEK293T cells, we show that PA-induced mTORC1 activation is dependent on mitochondrial fatty acid β oxidation.

      Moreover, we have now added the description of HEK293T cells in the Results (Page 10, line 261-265 in the revised version; Page 11-12, line 309-315 in the revised version, see below).

      To further investigate whether the regulation of mTORC1 by Tip60 is dependent on the acetylation of Rheb, the interaction between Tip60 and Rheb was analyzed via co-immunoprecipitation assays, and the results showed that Tip60 can interact with Rheb in HEK293T cells (Figure 5E). Moreover, overexpressed Tip60 reinforced the acetylation of Rheb and phosphorylation levels of S6K in HEK293T cells (Figure 5F).

      Dual luciferase experiments in HEK293T cells showed that TFEB had the strongest ability to elevate the luciferase activity of the IRS1 promoter among the crucial downstream transcription factors of mTORC1 (Figure 7A). Moreover, TFEB enhanced the promoter activity of IRS1 in a dose-dependent manner (Figure 7B) and mutations of the predicted TFEB binding site 4 and site 6 in the IRS1 promoter significantly reduced the promoter activity of IRS1 in HEK293T cells (Figure 7C). Furthermore, ChIP and EMSA experiments in HEK293T cells verified that TFEB can directly bind to the IRS1 promoter at site 4 and site 6 (Figures 7D and 7E).

      As suggested, we have added the reference to figure 8 in the Discussion (Page 17, line 483-487 in the revised version, see below).

      In summary, our work unveils an evolutionarily conserved mechanism by which mitochondrial fatty acid β oxidation flux of acetyl-CoA induces mTORC1 activation through enhancing Tip60-mediated Rheb acetylation under PA condition. Subsequently, hyperactivation of mTORC1 boosted serine phosphorylation of IRS1 and inhibited TFEB-mediated transcription of IRS1, leading to insulin resistance (Figure 8).

      As suggested, we have added the description of cell origin in the Methods (Page 19, line 526-527 in the revised version; Page 19, line 533-534 in the revised version, see below).

      Mouse C2C12 myoblast cells were obtained from the Cell Bank of the Chinese Academy of Sciences (Shanghai, China).

      HEK293T cells were obtained from the Cell Bank of the Chinese Academy of Sciences (Shanghai, China).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Zhao et al aimed to elucidate the mechanisms by which palmitic acids drives insulin resistance. The authors performed experiments in croaker, fish myocytes and mouse differentiated C2C12. They manage in vivo metabolic assays, enzymatic assays, immunoblot procedures, double luciferase assays, RNA analysis, pharmacological inhibitions and genetic knockdown. By using these model systems and procedures, the authors demonstrate that palmitic acid, but not oleic and linoleic acids, induces systemic and cellular insulin resistance through the hyper activation of mTORC1. They show that palmitic acid stimulates the mitochondrial fatty acid β oxidation, increasing the acetyl-CoA levels which enhances the acetylation of Rheb, a well known activator of mTORC1, by Tip60. Moreover, the authors show that mTORC1, beside reinforcing IRS1 phosphorylation, inhibits nuclear translocation of TFEB, thus preventing IRS1 transcription.

      Major issues affecting the conclusions:

      The conclusions are supported by the data. However, I suggest to perform a densitomentric and statistical analysis of western blots, especially when the authors report a representative blot, showing samples loaded in single.<br /> The methods are clear and reproducible. The authors should better explain how they have dissolved all the powders (i.e. fatty acids) to obtain the stock solutions next diluted (from what concentration?) in the media<br /> Anova analysis should be performed to analyze western blot densitometries.<br /> Minor comments:<br /> Prior studies are referenced appropriately, text and figures are clear. I suggest to add in the abstract all the model systems used. HEK293 also should be inserted in the description of the results. Please add the reference to figure 8 in the text. Please, describe cell origin.

      Referee Cross-commenting

      All reviewers have requested densitomentric (and statistical) analysis of western blot to prove the strength of the results. This is the major point to be addressed. Other points should also be only discussed.

      Significance

      The study extends the knowledge in the field of fatty acids-induced insulin resistance, that is a field studied by many researchers from many years, but with a lot of unclear mechanisms yet. Thus, the nature of the advance is conceptual and mechanistic. The only limitation is the lack of evidence in human samples/cells.

      Basic researchers and experts in translational medicine will be interested by this research.<br /> This is the point of view of a basic researcher, mainly interested in the molecular mechanisms underlining type 2 diabetes/obesity and cancer.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The goal of this study was to investigate how saturated fat induce insulin resistance through activating mTOR. More specifically, the researchers show that palmitate (saturated fat) activates mTORC1 to induce insulin via transcriptional and posttranslational suppression of IRS1. Specifically, the researchers show that PA stimulates FAO to raise cytosolic acetyl-coA levels, promoting Tip60-mediated acetylation of Rheb to activate mTORC1 activity. To study the relationship between mTORC1 activity, fatty acid stimulation and insulin resistance, the researchers decided to conduct their study in fish, as fish are known to be glucose intolerant by nature, making them an appropriate model organism for studying insulin resistance. Mouse C2C12 cell line and fish primary myocytes were also used to validate key results.

      Major comments:

      Initial experiment: Among several fatty acid-rich diets, fish were fed a palmitic acid (PA) rich (PO) diet for 10 weeks, and the PO diet significantly raised fasting blood glucose levels compared to control diet (fish oil of equal lipid content). The PO diet also impaired the fish's glucose and insulin tolerance. The PO diet also led to decreased phosphorylation levels of AKT, which regulates glucose metabolism. Therefore, the researchers initially concluded that a palmitic acid-rich diet leads to systemic insulin resistance in fish.

      I have a couple of questions on this initial experiment on which all the subsequent studies are based. In Figure 1A, the body weight was identical in control and PO group. Don't you expect PO feeding lead to obesity in fish, as HFD induces obesity in mice?<br /> Figure 1G and 1H show glucose and insulin tolerance after PO feeding for 10 weeks. The area under curve (AUC) should be compared to determine if GTT and ITT were statistically different. The ITT curve is particularly interesting as the control fish did not seem to respond to insulin, while the PO-fed fish responded more robustly. The only difference is the initial glucose level. Are the GTT and ITT done after fasting? How long is the fasting? The curves suggest that even though PO increased (fasting) blood glucose levels, it improved insulin sensitivity - therefore the premise that PO induces insulin resistance is not supported here. The lack of insulin induced response in the control group is worrisome. I suggest that the measures should be retaken, and AUC should be used to support if there are any differences in GTT and ITT.

      Based on the assumption that PO induces IR (which needs to be confirmed based on the previous comments), the researchers attempted to understand how PA triggers IR through a series of experiments, predominantly western blot analysis. All the Western blots should be quantified. The model is that PA activates FAO in mitochondrial that elevates cytosolic acetyl-coA, which acetylates Rheh to activate mTORC1. mTORC1 on one hand alters IRS1 phosphorylation and on the other hand inhibits transcriptional activity of TFEB to reduce Irs1 mRNA level. Together reduces IRS1 leads to Insulin Resistance.

      Figure 1. PA reduces basal and insulin stimulated AKT phosphorylation in fish liver and muscle, as well as in culture fish and murine myocytes (Fig. 1I-M). The results appear to be solid but need to be quantified.

      Figure 2 shows that PA provoked hyperactivation of mTORC1 (indicated by elevated phosphorylated S6K levels. This effect was abolished by Rapamycin treatment (an mTORC1 inhibitor) and also abolished by insulin stimulation (2F). Again, the western blots should be quantified.

      Figure 3: PA treatment increases mRNA expression levels of fatty acid beta oxidation genes in fish myocytes and C2C12 myotubes, and subsequent suppression of CPT1B and CPT2 (rate-limiting enzymes of FAO) inhibited mTORC1 activity and signaling in muscle, C2C12 myotubes and fish myocytes under PA treatment. This suggests PA-induced mTORC1 activation is dependent on mitochondrial FAO. Inhibition of CPT1 improved suppression of insulin stimulated phosphorylation of AKT under PA treatment in fish myocytes and C2C12 myotubes, indicating that mitochondrial FAO is heavily involved in PA-induced mTORC1 activation that contributes to insulin resistance.

      Figure 4: PO diet increases acetyl-CoA levels in muscle and PA treatment increases intracellular acetyl-CoA in a dose-dependent manner in fish myocytes. Inhibition of FAO by perhexiline maleate diminished induction of acetyl-CoA under PA treatment. In vivo dsRNA knockdown of ATP citrate lyase (ACLY, catalyze acetyl-CoA synthesis from mitochondrial citrate) decreased mTORC1 activity in muscle, and inhibition of ACLY in fish myocytes and C2C12 myotubes decreased induction of mTORC1 activity under PA treatment. This indicates that palmitic acid promotes mTORC1 activation through acetyl-CoA that is derived from mitochondrial FAO. PA treatment elevates acetylation of Rheb in a dose-dependent manner. Inhibition of FAO by perhexiline maleate attenuated PA-stimulated Rheb acetylation, while fish myocytes and C2C12 myotubes treated with sodium acetate (which can enhance acetyl-CoA) exhibited enhanced Rheb acetylation. The data indicate that acetyl-CoA produced by FAO activates mTORC1 signaling through increased Rheb acetylation. Phosphorylation of AKT were enhanced in muscle with dsACLY knockdown injection, and sodium acetate addition blocked recovery of insulin-stimulated glucose uptake and phosphorylation levels of AKT by perhexiline maleate under PA treatment. ACLY inhibition promoted insulin stimulated phosphorylation of AKT under PA treatment. So, in terms of acetyl-CoA's role in PA-induced insulin resistance, the data suggest that acetyl-CoA derived from FAO mediates PA-induced mTORC1 activation and insulin resistance.<br /> Figure 5: Acetyl-CoA can activate lysine acetyltransferases, and the researchers found that mRNA expression of tip60 was elevated in fish myocytes and C2C12 myotubes under PA treatment. Cultured fish myocytes and C2C12 myotubes treated with a Tip60 inhibitor prevented the induction of mTORC1 activity under PA treatment. Tip60 knockdown also blocked PA-induced mTORC1 activation in C2C12 myotubes. The researchers then determined that Tip60 regulation of mTORC1 is dependent on the acetylation of Rheb by studying the interaction between the two via a CoIP assay, which indeed indicated that Tip60 and Rheb interact. Additionally, Tip60 knockdown impaired PA-induced acetylation of Rheb, supporting the notion that Tip60 mediates the acetylation of Rheb under PA treatment. Inhibition of Tip60 attenuated PA-induced suppression of insulin-stimulated glucose uptake in C2C12 myotubes, and inhibition of Tip60 also restored insulin-stimulated phosphorylation of AKT under PA treatment. These data suggest that Tip60 mediates the regulation of Rheb acetylation under PA treatment and may be a novel therapeutic target for insulin resistance.

      Figure 6: the researchers measured the effect of PA treatment on IRS1 phosphorylation in order to understand the mechanism of insulin resistance induced by mTORC1 activation under PA treatment. A PO diet intensified S636/S639 phosphorylation in fish muscle. In fish myocytes and C2C12 myotubes, PA treatment elevated S636/S639 phosphorylation but decreased the Y612 phosphorylation of IRS1 in a dose-dependent manner. Treatment of fish myocytes and C2C12 myotubes with an mTOR inhibitor blocked increased IRS1 S636/S639 phosphorylation levels under PA treatment. Also, PA specifically reduced mRNA levels of Irs1. This indicates that PA-induced, mTOR-dependent alteration of IRS1 phosphorylation and transcription may have contributed to insulin resistance. It is unclear how mTORC induces either increase or decrease in IRS1 phosphorylation depending on the residuals.

      Figure 7 shows that PA inhibits nuclear translocation of TFEB to suppress IRS1 transcription. The EMSA in 7D is not convincing.

      Minor comments:

      Some data appears to weaken the results and/or contradictory. For example, the paper initially showed reduced AKT phosphorylation to support PA induced IR, but shouldn't a lower level of pAKT reduces mTORC activation? But then the rest of the manuscript explores how PA activates mTOR. Part of the IR is manifested by impaired mTORC1 activation, yet the PA activates mTORC1. The authors should present the rationale and flow of the ideas in a better way.

      There are also many run-on sentences and grammar issues, making it very hard to read. The writing can be improved.

      Referee cross-commenting

      Other than lacking quantification of western blots, my major concern is the ITT curve in Figure 1G, which does not support the conclusion that PO induces insulin resistance and therefore the rest of the study is based on a faulty premise. The curve shows that insulin reduced blood glucose much more robustly in the PO group than in the control group, suggesting PO increased insulin sensitivity. Area above curve should be calculated to quantify the difference.

      Significance

      Overall, this research was trying to show that PA-induced IR is dependent on hyper activation of mTORC1. More specifically, acetyl-CoA induces mTORC1 activation under a palmitic acid diet, and this is achieved through Tip60-mediated Rheb acetylation, which ultimately leads to insulin resistance through IRS1 suppression. The study is very mechanistic and important for understanding IR that is associated with diets high in saturated fatty acids and could potentially leads to therapeutic targets for combating insulin resistance and glucose intolerance.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this work, the authors showed PA induces hyperactivation of mTORC1 and insulin resistance. They found acetyl-CoA derived from mitochondrial fatty acid oxidation is required for PA-induced mTORC1 activation and insulin resistance by increasing TIP60-mediated Rheb acetylation. They also showed PA induced mTORC1 activation enhances IRS1 phosphorylation and inhibits transcription of IRS1 by impeding TFEB nuclear translocation. Overall, the authors did a lot of experiments to prove that PA causes mTORC1 activation and insulin resistance. The results are generally convincing, and the finding is novel and instructive.

      1. The authors claim PA-induced mTORC1 activation is dependent on acetyl-CoA derived from mitochondrial fatty acid oxidation. Using isotope tracing to determine the contribution of PA to acetyl-CoA, would improve.
      2. They showed PA increases fatty acid oxidation related gene expression and acetyl-CoA level, while OA and LA could not. why only PA could increases fatty acid oxidation and acetyl-CoA level, considering both of these lipids could be oxidation in mitochondria? Is there any differences in mitochondria among treatment of PA, OA and LA? It is better to monitor fatty acid oxidation in real time using seahorse. And add discussions.
      3. The authors present lots of western blot images, suggest to provide quantification data of these blots.
      4. It is better to discuss the relationship between fatty acid oxidation and mTOR signaling.

      Significance

      The finding is interesting and significant.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      The present manuscript presents a thorough description of the relative localization (in space and time) of a number of proteins of the early secretory pathway. To that aim, the authors used by their custom-made 3D live cell super-resolution microscope (SCLIM) and the yeast S. cerevisiae as a model system. The main claim of these data is that the early secretory pathway in S. cerevisiae is organized by maturation from a newly proposed yeast ERGIC compartment all the way to the trans-Golgi network (TGN).

      Major comments:

      I have two major comments regarding this manuscript:

      1. It is not clear to me how the presented data shows the existence of an ERGIC in the yeast S. cerevisiae. I understand, and appreciate from this text too, that a clear definition of ERGIC, even in a mammalian system, is unclear. For this reason, I would first suggest that the authors provide a clear definition of what ERGIC means to them. Next, the experiments herein presented are all based on a very careful, thorough and nicely organized spatio-temporal mapping of a large number of early secretory pathway proteins, including the ERGIC53 yeast "counterpart" Emp46 (it would help to add, even as a supplementary figure, an alignment/sequence comparison between the human ERGIC53 and the S. cerevisiae emp46). However, the data presented here does not clearly indicate to me that there is a bona fide ERGIC in yeast. Couldn't it just be that what the authors call ERGIC is a cis-Golgi cisterna? I understand that the BFA experiments show a different behavior for some proteins, which fits with what the authors previously names GECCO in plants, so why not calling this GECCO? Again, it will be important to provide definitions of these compartments for the audience. Next, my main concern here is that this is all based on SCLIM, which is a very nice technique, but the resolution is limited in both space and time (by the way, it would be nice to explicitly measure of quantify the spatial resolution in x-y-z). Hence, it is not possible to discern whether an "independent" ERGIC is formed as compared to cis-Golgi cisterna. Electron microscopy (possibly CLEM) could help somehow resolve that and massively increase the strength of the claims, but I do understand this might be difficult for this group and very time consuming, so it might be important to clearly state the limitation of the herein presented data. A possible alternative to test if protein that are seen segregated are within the same membrane (as claimed here) would be to do trapping experiments where a reagent induces dimerization between the two proteins (when tagged with specific tags, such as FKBP/FRB).
      2. I could not find any details (maybe I have missed them) about how many times experiments were replicated and the statistical significance of the findings herein reported. In most figures, examples of microscopy images/videos are shown, and selected lines profiles are presented. However, it is not clear how robust these experiments are. Some ideas:

      2.1.) The major source of quantification is the peak-to-peak time distance between two proteins. In Table S1 some stdev is presented, but not clear how it is find (it is the sted of all n number of puncta? or of the mean duration per cell? or of the mean duration per experiment? I would suggest that the authors provide the results shown in Table S1 plotted as a histogram or superplot (see e.g. https://rupress.org/jcb/article/219/6/e202001064/151717/SuperPlots-Communicating-reproducibility-and) and clearly explain how statistics is performed.

      2.2) Also, the time-lapse movies are acquired with a 5s gap between time points. How is this included in the incertainty of the peak-to-peak duration in Table S1?

      2.3) In pg. 7 the authors write "Although experimental variation was high, the two zones appeared to be spatially segregated". Can the authors provide quantitative and statistical support of this claim?

      2.4) It is not clear to me how the puncta for analysis are selected. For example, in Fig. 1C, the punctum shown already shows some initial co-localization (it could be e.g. that a peak value was prior or after the duration of the time lapse movie, thereby biassing the computation of the peak-to-peak duration). So, if one would consider those spots e.g., positive for Emp46 that do not contain Mnn9 signal, how often do you see conversion (that is, appearance of Mnn9 signal)? Along the same lines, in pg. 8 the authors write "... signal appeared first and then mnn9-mCherry came up". Details on how this quantification is done and statistical analysis would be needed, to my opinion, to support the claim.

      Minor comments:

      1. The color code for the 3 color microscopy images is nice, however, the use of green and red for the 2 color images is a bit unfortunate for some people (like myself) who suffer from color blindness. I'd suggest to use green and magenta instead.
      2. Pg. 8: have the authors tested Rer1 vs Emp46?
      3. Pg. 8: I was of the impression that GRASP65 (GORASP1) is considered to be a cis-Golgi protein (see e.g., Tie et al eLife 2018). Then, what the authors call "ERGIC" couldn't it simply be a cis-Golgi cisterna?
      4. pg. 13: "propose to define Grh1, Rer1, and Sed5 as yeast ERGIC/GECCO...". What about Emp46?
      5. The first part of the manuscript (up to mid page 13) is clearly focused on defining ERGIC in yeast, then the paper appears as a set of experiments aimed at adding more components in their spatio-temporal mapping. This is ok, but is should be clearly motivated and explained in the Title, abstract and intro.
      6. The visualization of colocalization according to the opacity (as said in the methods) is somehow confusing to me. Are the 3D images projections or 3D renderings (no axes are seen)? In e.g. Fig. 6G or 8L, regions where green and magenta (or green and red) are colocalized do not appear white (or yellow), which visually suggests to the inattentive reader that there is no colocalization, when there is.
      7. I have not understood what this sentence in pg. 18 means: Similar segregation patterns are also observed during the Golgi-TGN maturation process (Tojima et al., 2019). "We propose that the ERGIC, Golgi, and TGN can coexist as structurally and functionally distinct zones within a single, maturing cisterna." Are they referring to ERGIC, Golgi, and TGN steady state components (proteins) or the structures themselves?
      8. The introduction of new data (mammalian data) in the discussion is odd. It might be ok, but I would frame it within a results section and use it later in the discussion.
      9. Fig.9: the arrows should go from protein to protein (some seem to go from in between proteins, such as the bottom-most arrow with 87.8 s time duration. Also in panel B, bottom part, some proteins are missing (Erd2 ad Chs5).
      10. Fig. 1 and many other: in the line profiles the distance in the x axis has no units or labels. Please add this and the direction of the line profile (an arrowhead would suffice).

      Significance

      General assessment:

      The experiments herein presented are based on a very careful, thorough and nicely organized spatio-temporal mapping of a large number of early secretory pathway proteins, including the ERGIC53 yeast "counterpart" Emp46. However, the data presented here does not convincingly show that there is a bona fide ERGIC in yeast. A major limitation is that the experiments are all based on a state-of-the-art, but still with a limited resolution, fluorescence microscopy technique. Ultrastructural data (e.g., CLEM) would massively help support or revisit the claims presented in this manuscript regarding the existence of an ERGIC compartment in yeast. Also, adding the information about the number of biological replicates and proper statistical analyses on the presented results would be needed to further support the claims.

      Advance:

      This manuscript builds on the authors' custom build 4D super-resolution microscope (SCLIM) and on previous results (e.g., Tojima et al., J. Cell Sci. 2019). The main novelty is in the study of a number of new early secretory pathway proteins and in the proposal of the existence of a non-stable, maturing ERGIC compartment in S. cerevisiae.

      Audience:

      This paper might be attractive for a broad audience of cell biologists, especially those interested in membrane biology, cell compartmentalization, and intracellular trafficking and secretion.

      Describe your expertise:

      I am an expert in membrane trafficking.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Tojima and colleagues present a very exciting 4D SCLIM analysis of 20 key-proteins controlling or occupying different stages of Golgi-mediated protein trafficking, taking the reader on a trip from the ER export sites to the ERGIC/GECCO, cis-, medial- and trans-Golgi towards the TGN/recycling endosome. The choice of molecular markers to be patiently time-resolved in 3D allowed the authors to assemble a temporal roadmap for the molecular players studied. This is most impressive.

      Major comments

      There is an enormous amount of patient systematic analysis packed in this paper, following a punctate organelle as it emerges from the dark, evolves over time (following a combination of 2 markers) with fluorescence peaks at specific time points, after which the signal disappears again. I am certain other cell biologists will be impressed, as I was, viewing the individual images and graphs presented, culminating ultimately in figure 9 that could go straight into a textbook to form a starting point for anybody who wishes to study a particular protein of interest and chose the most appropriate markers to compare it with. The authors propose that the ERGIC/GECCO/Golgi-remnants compartment is an evolutionary conserved structure even though it has a different subcellular distribution/morphology in different classes of eukaryotes. The data presented here and in earlier work seem to support this notion. In particular, the authors demonstrate that the yeast GRASP 65 homologue Grh1 is the earliest to appear closely followed by Ypt1 and Emp46. The fact that RER1 and ERD2 come slightly later is in line with a proposed gate-keeper function, because if they were instead to recycle continuously they should appear first in line. I agree with the authors that the simple model of ER-derived COPII vesicles fusing with each other and thus creating an ERGIC/GECCO de novo is probably too simplistic. The idea of a more permanent structure, pulsating between cargo-loading and cargo-releasing events, possibly associated with creating zones/subdomains within a single cisterna seems very attractive given the data shown here. This work is descriptive, but it is of very high importance to anybody engaged with experimental approaches to study protein sorting from the Golgi-apparatus back to the ER, or on to the plasma membrane or the lytic compartments. The 5 functional stages proposed for Golgi-maturation is an attractive starting point for future research, and I very much like the notion that ERGIC and cis-Golgi cisternae may start as zones/subdomains within a single cisternae, possibly formed via phase separations involving both protein-protein and protein-lipid interactions.

      Minor comments

      The title strongly focusses on the ERGIC and therefore the earliest sorting steps in the ER-Golgi system, but this manuscripts offers so much more. I was fascinated to learn that Ypt1 appears twice during cisternal maturation in yeast. This may be a yeast-specific phenomenon but it is very interesting. The same can be said about the proposal that Gea1 and Gea2 have different roles in the Golgi and act in different cisternae, and the localisation of AP-3 at the trans-Golgi rather than the TGN. The functional distinction between trans-Golgi and TGN and the differences in their origin are important points and it will be a shame if readers don't realise that this manuscript offers further insight into later steps in Golgi-mediated transport. There may be a case to add something to the abstract and/or modify the title accordingly, but then I also feel that long titles are not ideal and the ERGIC/GECCO portion is the more important take-home message. This is a case for the editorial team and the authors to make the most of the findings.

      Given the importance of ERD2 in sorting soluble proteins to be returned back to the ER, the authors may consider using the biological active XFP-TM-ERD2 fusion instead of ERD2-GFP, but this may be kept for future work. In plants, ERD2-GFP is mainly at the Golgi when overexpressed, its erroneous leakage to the ER is only observed at low expression or when K/HDEL proteins are co-expressed. The XFP-TM-ERD2 construct may be better confined to the ERGIC-GECCO and may have a different temporal pattern.<br /> The authors may consider citing Stornaiuolo et al., Mol Biol Cell 2003 Mar;14(3):889-902 who compared the trafficking of KDEL and KKXX pathways and concluded that KDEL proteins are retrieved prior to KKXX proteins....as this fits nicely into the current findings showing that ERD2 and RER1 appear sooner than COPI markers.

      Significance

      The significance of the work is high because it basically allows to add facts to models. We use models to explain an elusive process because it escapes direct observation. Once we can observe directly, a model can become fact. In simple terms, the authors allow us to see things that we could only speculate about in the past. Therefore, the results present a very significant advance and will be highly relevant to the entire cell biology community. This paper is an important landmark and will help the field to formulate new experimental approaches and models to understand the origins of the Golgi apparatus, the core of the secretory pathway that defines being a eukaryote.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      It is generally believed that budding yeast does not have the ER-Golgi intermediate compartment (ERGIC). In this study, the authors attempt to prove the existence of the ERGIC. They measured the kinetics of a few Golgi markers involved in the early secretory pathway in live cell imaging and observed the recruitment of Grh1 precedes that of Mnn9, suggesting the presence of pre-early cisternae. The authors propose that the Grh1-positive cisternae, assembling at the ER exit site (ERES) and progressing to become the early Golgi cisternae, represent the equivalent of the mammalian ERGIC in yeast.

      Major comments:

      The concept of the ERGIC in mammalian cells was initially proposed based on the protein ERGIC-53 in the 1990s. However, recent nanoscopy imaging data from the Lippincott-Schwartz lab challenges the conventional view of ERGIC by revealing the "ERGIC" is a membrane domain of the ERES (Wegel et al., Cell, 2021; PMID: 33852913), suggesting it might not be appropriate to adopt this concept.

      The authors' observations could be interpreted differently. Since the ERGIC is not molecularly defined in their study, the authors cannot prove its existence in yeast unequivocally. Their data indicate the presence of Golgi cisternae, characterized by Grh1, that precede the earliest known cisternae. Although the authors refer to these Grh1-positive cisternae as the "ERGIC", they are essentially "pre-early" cisternae that progress to become the early Golgi cisternae. Nevertheless, their findings could extend the budding yeast Golgi cisternal progression unit further upstream to include the ERES as the starting point for Golgi cisternal maturation. To further explore this, it would be interesting to investigate the kinetics of COPII subunits in cisternal progression along with Grh1 or Mnn9 and to plot COPII components in the Figure 9 map.

      The second half of the manuscript appears to deviate from the main focus on identifying the ERGIC. This section primarily presents the Golgi localization of four Golgi proteins (Ypt1, Gea1, Gea2, and Alp6) deduced from kinetics. However, it lacks functional studies to substantiate the authors' claims on their cellular functions. As a result, this part of the study remains purely speculative and might not support the authors' claims. Given that Figure 9 provides a highly informative summary of all kinetics and localization data, I recommend the authors keep but significantly abridge this section.

      The manuscript also has a few major concerns.

      1. The analysis of only one fluorescent particle or Golgi cisternal punctate structure is insufficient for a Golgi marker, considering the substantial variation of Golgi cisternae. To improve statistical robustness, the authors should select multiple fluorescent particles from multiple cells, displaying plots with averaged intensities, error bars, and sample sizes (n).
      2. In Fig. 5A, the BFA-induced lumps positive for Grh1, Rer1, and Sed5 may potentially represent the ERES, as observed in mammalian cells (Ward et al., JCB, 2002; PMID: 11706049). To verify this, the authors should co-label these lumps with COPII subunits.
      3. The authors previously reported the "hug-and-kiss" model for cargo transport from the ERES to the early Golgi cisternae. As the current study is highly relevant to the "hug-and-kiss" model, it is disappointing that the authors did not provide further data and comment on it. The "hug-and-kiss" and "ERGIC" transport modes are two distinct ways for secretory cargo transport from the ERES to the early Golgi cisternae. The authors should verify the "hug-and-kiss" transport and report the relative frequency of the two transport modes.
      4. The current version has minimal background knowledge of ERGIC in mammalian and yeast cells. Therefore, the authors should provide a comprehensive introduction to ERGIC.

      Significance

      General assessment:

      The data presented in the manuscript is novel and appears to be convincing. However, one of the major concerns is the lack of statistical robustness, which requires to be addressed. Furthermore, the manuscript's data could be interpreted differently, as elaborated in the major comments.

      Advance:

      While the interpretation of the manuscript's data requires reconsideration, it can contribute to our understanding of secretory trafficking at the Golgi. The manuscript could fill a crucial gap in our knowledge in this field by addressing the major comments.

      Audience:

      Cell biologists in the membrane trafficking field, particularly those working on the Golgi, would find the manuscript interesting.

      My expertise:

      My research focuses on membrane trafficking at the ER, Golgi, and endosome.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Unfortunately, this paper adds only a little to our understanding of uptake in to the flagellar pocket of trypanosomes. It tends to add only detail to information that has been well characterised elsewhere and indeed, as the authors themselves point out, (lines 92-98) it is rather incremental.

      We were disappointed that the reviewer was so unsupportive of the work presented here. It seems possible that the reviewer is partly objecting that the title - which emphasised the main finding of the paper - does not fully capture the content of the paper. We have therefore modified the title to emphasise that the paper is principally a characterisation of TbSmee1 rather than an investigation of the flagellar pocket, with the insight into cargo entry being the most notable finding.

      Not only has Tbsmee1 been studied before but this data in bloodstream forms is not particularly novel since it gives much the same information as the canonical hook protein TbMORN. This work follows the pattern of conclusions made previously with the protein TbMORN. It focusses on the protein TbSmee where RNAi mutants are interpreted to show flagellar pocket enlargement and impaired access by surface bound cargo. Unfortunately, there is little mechanistic or functional conclusion to the study in terms of how TbSmee operates naturally in the cell.

      This is deliberately downplaying the value of the work. TbSmee1 has not previously been characterised in bloodstream form cells, and neither TbMORN1 nor the hook complex are as well-characterised as other cytoskeletal components such as the flagellum and basal body. To criticise the paper for not providing a molecular mechanism of TbSmee1's function is unreasonable given the volume of work provided and the fact that this is a first characterisation of the protein in this life cycle stage. Expectation of a complete molecular mechanism is setting a very high bar for a first characterisation.

      It is also possible that the reviewer has not grasped the main thrust of the argument - when TbMORN1 was characterised it was the first protein shown to have this cargo entry defect. We show here that not only does TbSmee1 share this defect, but that it is in fact a previously-unacknowledged feature of all phenotypes of this type, exemplified by clathrin. We have modified the text to make this finding more clearly emphasised (see for example lines 654-661 in the tracked-changes version of the manuscript).

      There are other possible explanations for the phenotype. That would need to be studied. This large flagellar pocket phenotype is seen with RNAi mutants of many different types of proteins in the trypanosome and so pleiotropic effects are highly likely. Also, there are a good number of alternative possibilities to account for reduced access to the pocket in these mutants and this data could be usefully added.

      This is another statement that seems intended primarily to disparage the paper rather than attempt to improve it. It would have been extremely helpful if the reviewer highlighted what these other possible explanations are instead of making vague allusions. The widespread prevalence of this kind of phenotype means that our insight into restricted cargo access to the flagellar pocket is of general relevance in the trypanosome field.

      Specific points<br /> 1. The transient location for the TbSmee at the FAZ tip - or in this case the groove region - was seen in procyclics (Perry, 2018) so this bloodstream indication merely confirms that concept.

      The reviewer is again downplaying the value of the work rather than providing constructive criticism. While FLAM3 has been shown to be at the tip of the new flagellum in bloodstream form cells (Sunter et al., 2015), at the time of the preprint being published Smee1 was actually the first protein (besides the DOT1 antigen) shown to localise to the groove region in bloodstream form cells. It is also worth noting that procyclic form cells and bloodstream form cells are fairly different in this regard - in procyclic cells, there is an entire flagellar connector structure that is not present in bloodstream form cells, and so demonstrating that Smee1 was present in the groove region was an important experiment. Since this preprint was published, Smithson et al. have identified 13 additional proteins localising to the groove (Smithson et al., 2022) - we have modified the text to include these points (see lines 542-545 of the tracked-changes manuscript).

      1. The C terminal region required for targeting is a reasonable deletion analysis of regions of the protein. But can this data (line 228) be said to "mediate targeting" - or is it just required. For instance, targeting might be OK but it might be needed for stable association, etc etc.

      We have changed the text to say "required" for targeting instead of "mediating" targeting (line 312 of tracked-changes manuscript).

      1. This protein has already been shown to be phosphorylated and the sites and cell cycle possibilities have been mapped by Urbaniak. So that section adds little. https://doi.org/10.1371/journal.ppat.1008129

      The reviewer is again disparaging the significance of the work rather than critiquing it. This is after all only a single panel of a figure and ~15 lines of text, and therefore a minor but still noteworthy element of the manuscript. This also misunderstands what the Urbaniak study does and does not show - while that work showed that Smee1 is phosphorylated, it remained possible that other post-translational modifications were occurring. This experiment shows that the "fuzzy" appearance (variable electrophoretic migration) of TbSmee1 in gels can be solely attributed to phosphorylation as opposed to other post-translational modification. We contacted Dr. Urbaniak to confirm this - his answer is below.

      "__I think your approach to look at the fuzzy banding is actually rather elegant; our data shows that phosphorylation occurs but we did not look for any other PTMs that could influence migration on a gel and probably wouldn't see them without a different enrichment and analysis method. We often see a fuzzy pattern with glycosylation due to the heterogeneity, and I suspect other modifications will also results in a smear. Given that the band collapses to a single band after phosphatase treatment and not with an inhibitor present it is fair to conclude that phosphorylation is responsible for the fuzzy band, not other undefined PTMs like glycosylation.__"

      1. Essentiality in BS forms and pocket enlargement. This is not surprising. A very large number of cytoskeletal proteins show this in RNAi knockdown. Flagella mutants (extensive publications from many groups (Hill, Bastin, Gull, etc) over last 15 years show this very well and so this protein is just one more example.

      This appears to be another comment aimed at downplaying the value of the manuscript rather than providing constructive feedback. The fact that we have demonstrated something previously unobserved in a common phenotype makes the data of general interest to the community, we feel.

      1. I didn't find that the explanations for flagella pocket enlargement are soundly based. The experiments focus on endocytosis and uptake and ignore other plausible reasons and some evidence in literature.

      Again, the reviewer's feedback would be considerably more constructive if they had taken the time to specifically cite the evidence in the literature that they are alluding to, and present some of the "other plausible reasons" they are aware of. We have consulted widely in the community and have not been able to find anybody who knew what work the reviewer is referring to here.

      Lines 84/85. Enlarged pockets may be indicative of endocytosis failure. Presumably the rationale is that endocytosis fails, but exocytosis still occurs and the pocket membrane enlarges. What evidence is there that exocytosis of membrane still occurs? This simple concept might indeed operate in a clathrin mutant but is surface membrane/content exocytosis is maintained in these cytoskeleton mutants? There is good evidence for glycoconjugates within the flagellar pocket. Are these depleted or present still?

      The reviewer is correct that we have not specifically assayed for exocytosis, but the fact that we are able to make the same observations in both the clathrin RNAi (where exocytosis has been assayed - Allen et al., 2003) and the Smee1 RNAi means that this is not a problematic omission. The effect of the enlarged flagellar pocket phenotype on the glycoconjugates in the flagellar pocket is an interesting question but far outside the current focus of the paper.

      1. There are also a number of other publications indicating that clathrin pits are still present on the enlarged pockets of various mutants when viewed by EM. The authors have looked at the flagellar pockets by EM but the EM methods described have extensive washings and centrifugations before fixation. This is a very poor approach and will mean that endo and exocytic traffic is disturbed (extensive references in literature in other systems? This is not a useful approach for exo/endocytosos studies where flux of traffic demands fast chemical or freezing fix in media.

      The reviewer has misunderstood the aim of the experiments described in Figure 5D, which was to observe the morphological changes caused by depletion of TbSmee1. As the reviewer is no doubt aware, high-pressure freezing of trypanosomes gives much better morphological preservation than chemical fixing in media, so the choice of method is not "very poor" but tailored to the experimental aims. We have modified the text to make this point more clearly (lines 355-358 of tracked-changes version). Once again, the referee offers no citation to back up their assertion that endo- and exocytic traffic is disturbed by wash steps, either in trypanosomes or elsewhere.

      1. The EMs and Light microscopy does show that the mutant pockets are substantially abnormal in their cytoskeletal arrangement. They have multiple flagella profiles, flagella structures have not connected with the membrane and are sometimes in the cytoplasm (see a glance of the paraflagellar rod in the cytoplasm in FigS5C and internalised FAZ attachment plaques in Fig 4 D bottom right cell). Given these extensive (and expected) cytoskeletal abnormalities it is highly likely that these pocket abnormalities are a result of motility, cell division/developmental issues and the differential uptake phenotypes merely consequential.

      This is another misinformed argument that is seeking to disparage the data. The reviewer has apparently overlooked the fact that the same phenotype is seen in clathrin RNAi, when flagellar pocket enlargement precedes any downstream effects on cell division cycle progression. We have gone to great lengths (Fig 6) to demonstrate that the enlargement of the flagellar pocket almost certainly precedes the onset of the growth defect in the TbSmee1 RNAi, and it is therefore likely to precede the cytoskeletal abnormalities that the reviewer has highlighted. An effect on cellular motility is possible and would be interesting to investigate in future work.

      1. The authors speak about early phenotypes , but these are often at 15-24 hours. That is probably a couple of cell cycles and so not early.

      To be informative, the analyses of RNAi phenotypes have to be done as soon as possible after the onset of the growth defect, and we have gone to great lengths (Figure 5) to define this point as being at 21 hours. This is already difficult as the number of phenotypic cells at the onset of the growth defect will not be high. We have clarified the text to emphasise that "early" refers to soon after the onset of the phenotype (lines 388-389 of tracked-changes version).

      In relation to the above question of comparison to the same morphology produced by flagella mutants it would be good to know if these hook mutants produce motility phenotypes and whether these are manifest before the uptake phenotypes. There is evidence (cited here) that forward motility of the trypanosome directs material on surface into the pocket. If these cells have motility defects (primary or via failed division) then surely that would provide an alternative simple explanation for uptake differences.

      The reviewer is overlooking the observation that the surface-bound endocytic cargoes (ConA, BSA) are still being sorted/directed as far as the entrance to the flagellar pocket - what is interesting is that the cargo is apparently unable to enter the flagellar pocket. As noted above, it would certainly be interesting to look at motility effects in follow-up work.

      1. There is a general point that if studies are to have real relevance to uptake in the trypanosome then they need to deal with uptake of natural ligands rather than artificial surrogates such as dextran. Such tracers were used historically, but in the last decade a series of receptors and ligands for fluid phase and particularly membrane mediated endocytosis have been discovered. With the investment of a little time these important ligand / receptors such as haptoglobin, transferrin, etc would be much more relevant.

      Dextran is still state-of-the-art as it is an inert fluid phase marker. We are not aware - and have asked widely - of any readily-available alternative to dextran as a fluid phase marker, especially seeing as we have demonstrated in this study that BSA does not behave as a fluid phase marker in the experimental conditions used. The reviewer is also being disingenuous in suggesting that there is a panel of validated physiological reporters for trypanosomes that are readily available commercially - this is not the case. Transferrin is probably the only example, but the transferrin receptor is confined to the flagellar pocket and therefore not relevant to the question of how surface-bound material enters the flagellar pocket in the first place. As suggested by Reviewer 3 and endorsed by Reviewer 2, we have looked at the uptake of anti-VSG antibodies (which are a physiological cargo) in additional experiments and obtained evidence that the same effects are seen (Figure 9).

      **Referees cross-commenting

      this session includes comments from Reviewer 1 and Reviewer 2.<br /> *

      Reviewer 2<br /> <br /> Dear Reviewers 1 and 3:<br /> I agree with many of the points with Reviewer 1 and our divergence is partly a matter of degree. While it is true that this manuscript is incremental in its contribution to our understanding of TbSmee1, it nonetheless adds to our understanding of the role of this protein in the bloodstream life stage and because of that I find value in the work. The fact that it mirrors what was seem in other protein knockdown studies (e.g. TbMORN) doesn't negate its contribution for me. Reviewer 1 makes an important point, however, when stating that this work does not add a mechanistic or functional conclusion as to how TbSmee1 operates and for me that is the biggest shortcoming of the work. Offering mechanistic insight is a high bar and while it would make for a much more exciting story it does not discount the value of the work as presented. What I do appreciate is the speculation about this observation that endocytosis is required for entrance of surface bound material into the pocket and although they are unable to show that this is not a side affect of other processes being disrupted it is and intriguing point. These observation have the potential of stimulating further investigations into crosstalk between the entrance to the pocket and endocytosis. I also agree that the use of ligands for known receptors like transferrin would be far more informative. While I assumed the transferrin receptor was in the pocket itself it would be interesting to see if the ESAG6/7 is also located outside the pocket and transiently binds cargo before being brought inside for endocytosis.<br /> I think that Reviewer 3 brings up a great point with the focus on VSG's. I think that examining VSG turnover in these mutants can add value to the analysis and inform our view of how affecting the hook complex alters VSG endocytosis.

      We appreciate Reviewer 2 taking the time to defend the value of the work, and we concur with Reviewer 2's assessment. Reviewer 2 is also correct that the transferrin receptor appears to be primarily or wholly confined to the flagellar pocket interior, making this likely less informative in this context. Concerning the uptake of anti-VSG antibodies highlighted by Reviewer 3 and endorsed by Reviewer 2, we have carried out these experiments and obtained similar results to those published in the first version of the preprint (Figure 9).

      Reviewer 1<br /> <br /> some fair comment and agreement. This is being sent to general cell biology journals.<br /> when one looks at this area in the round it is it is nearly 50 years (1975) since Langreth and Balber published their seminal work on protein uptake and digestion in bloodstream and culture forms of T. brucei. There has been 50 years intense study and the genome has been around for nearly 20 years as well. So, put simply - for both a general science audience and the wider parasite community - if this is a paper about one protein, TbSmee1,then it has surely has to say something functional about that protein. If it is a paper about uptake in trypanosomes (where mutants are one means of interrogation) then it surely has to say something about mechanisms of uptake of physiological relevant ligands. The days of dextran etc are past.

      Hence, my comment that this does neither and so is very incremental to what is known already. It is 2022 not 1975. Langreth and Baber published their seminal work in J Protozoology for very good reasons no doubt.

      It is striking that Reviewer 1 here extends their aggressive and uncivil approach to attack Reviewer 2's assessment, again substituting forceful wording for informed argument. Reviewer 1 again inexplicably and mistakenly criticises the use of dextran when no state-of-the-art alternative exists. They then go on to needlessly disparage the work done by Langreth & Balber when this work was produced in a totally different publishing landscape. They also appear to fundamentally misunderstand the Review Commons concept, which is to provide journal-independent preprint peer review; it is also worth noting that there are specialist journals such as PLoS Pathogens in the RevComm affiliates as well as general cell biology journals. Given that the mechanism of variant surface glycoprotein (VSG) switching has not yet been fully articulated despite the efforts of multiple labs and many projects over a decades-long time period, it seems extremely unreasonable to be making such demands of this paper.

      Reviewer 2<br /> Thank you for replying and I agree with the spirit of your critique. My only comment, which could result from my own naivete, is to say that despite the incredible work that has been done in dissecting endocytosis in T. brucei over these past 50 years, it appears that we still do not understand how many fundamental of aspects of this activity works in this parasite. Even basic questions regarding how cargo, e.g. transferrin, binding to surface receptors is sensed by the parasite remains unknown and the identity of the specific signaling components which transmit this information internally to initiate endocytosis have not been characterized. In many ways it seems that we don't even understand how the parasite partitions the end/exocytic pathways in the pocket and maintains membrane homeostasis. While we know that some kinases and traditional signaling components must be involved, a high resolution understanding of this process in T. brucei seems lacking. I only say all this to suggest that the field maybe isn't yet that advanced to reject work of this type as so many mechanistic unknowns still remain to be uncovered and maybe incremental advances and phenomenology still can add value to the field. However, I respect your opinion on the matter and my perspective could be due to a lack of a full appreciation of the literature on the subject.

      We completely agree with Reviewer 2's assessment here, which neatly summarises our rationale for the present work. Reviewer 2 is, if anything, being overly accommodating by suggesting that their perspective may be due to a lack of a full appreciation of the literature - on the contrary, Reviewer 2 appears to have a very sound grasp of the topic.

      Reviewer #1 (Significance):

      Unfortunately, I did not find tis to be very significant. It covers old ground in terms of the phenotype described. Many groups have shown the differences between procyclic and bloodstream phenotypes in this enlarged pocket phenomenon. The work is rather incremental from these and other author's work on these hook proteins.<br /> There are alternative explanations for understanding the effect of flagella pocket structure and uptake of ligands into the pocket and trypanosome cell. These would need to be studied before one could see a functional, mechanistic link established.<br /> Other parts of this are of nicely done but do not move on our understanding (eg targeting/phosphorylation) from what has been done previously.

      As noted repeatedly, it appears that Reviewer 1's priority is disparaging the value of the work here and downplaying its significance rather than providing constructive feedback. The reviewer repeatedly makes unrealistic demands (a mechanistic model, use of non-standard reagents), misunderstands the aim of experiments (use of high-pressure freezing), makes vague allusions to other work in the literature but without citing anything specific to support their case, and makes strong and assertive statements that are factually incorrect (design of RNAi experiments, use of dextran). We find this approach unhelpful, uncivil, and unprofessional. It is desperately disappointing that we should have to spend the majority of our response rebutting Reviewer 1's comments rather than implementing constructive criticisms that would strengthen the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary:<br /> In this manuscript the authors have advanced our understanding of the hook complex component TbSmee1 through a detailed analysis of this protein's role in the endocytosis of surface bound proteins via the flagellar pocket in bloodstream form Trypanosoma brucei. The TbSmee1 protein, previously identified using proximity labeling using TbMORN1 and TbPLK, and characterized in procyclic T. brucei, was confirmed to target to both the shank portion of the hook complex as well as the growing end of the new FAZ in replicating cells. The protein was also shown to likely be phosphorylated as had been suggested previously due to its association with the kinase TbPLK. A domain deletion analysis demonstrated that domains 2 and 3 are important for TbSmee1's proper localization to the hook complex. Loss of TbSmee1 using RNAi based knockdown resulted in a quick cessation of growth in the bloodstream form within 24 hours in contrast to what was seen previously in procyclic cells which had only a decreased growth rate. Loss of TbSmee1 also resulted in an enlargement of the flagellar pocket and in many ways mirrored the phenotype observed with knockdown of TbMORN1. Although prior work on TbSmee1 in procyclic T. brucei demonstrated that loss of this protein altered the morphology of TbMORN1, no such change was seen in bloodstream form cells and only an alteration in the morphology of TbLRRP1 was observed. In characterizing the effect of TbSmee1 depletion on endocytosis the authors showed that the fluid phase marker Dextran could enter into the flagellar pocket of TbSmee1 depleted parasites while the surface bound ConA and BSA remained outside of the flagellar pocket suggesting that TbSmee1 may play a role in allowing larger protein components into the pocket regions. Similar observations were also previously seen with TbMORN1 depletion. Importantly, a knockdown of clathrin recapitulated the TbSmee1 knockdown phenotype suggesting that endocytosis itself was required to allow material bound at the surface to enter into the flagellar pocket. In addition to adding to our understanding of hook complex components, this work raises some interesting questions regarding the role of the hook complex in facilitating endocytosis in this important human pathogen.

      Thank you for the positive assessment.

      Major Critiques:<br /> This is a superbly written manuscript with robust high-quality data that strongly support the major conclusions made by the authors. The flow the article is logical and easy to follow making it accessible to a wide array of readers.

      We are glad that the Reviewer appreciated the effort that went into writing the paper.

      Although I appreciate the brevity of the introduction and how the article gets straight to the point, additional background information on the components and function of the flagellar pocket collar protein could help contextualize the goals of the project. The way in which the flagellar collar structures are introduced to the reader is quite abrupt (beginning on line 75) and simply states the names of TbBILBO1, the centrin arm and hook complex as simple facts without much discussion about the background of these components/regions. A graphical representation of the centrin arm or hook complexes relative to other components like the pocket itself, FAZ or axoneme could make following the story much easier. An expansion of this background could also go a long way to convince readers of the importance of this region in the basic biology and virulence of T. brucei.

      Implemented. We have added more background details on the hook complex, flagellar pocket collar, and centrin arm and added a new schematic image to Figure 1 showing these structures as well as the FAZ (Figure 1A).

      On lines 84-86 the authors cite the way in which 'small' vs 'large' macromolecules enter into the pocket without defining what exactly is meant by these terms as they are relative in nature. Setting some boundaries of size could provide some context to the reader.

      Implemented. We have provided more detail on the approximate sizes in nm (lines 110-113 of tracked-changes manuscript).

      In the domain localization analysis beginning in Figure 4 there is a missed opportunity to also assess which portions of the TbSmee1 protein are important for overall function as well. By either an examination of dominant negative phenotypes resulting from overexpression of the truncated mutant or the expression of the truncated forms designed to be RNAi resistant in the TbSmee1 knockdown cell line, one could also assess which portions of this protein are essential for endocytic function in addition to targeting. Is there a reason this was not performed?

      This is a good point; we did actually investigate overexpression of the TbSmee1(161-766) construct which can target correctly but is missing the first folded domain, but did not observe any phenotypic effects. We have added this point to the results (lines 301-302 of tracked-changes version). We agree that it would be interesting to express the truncations in a TbSmee1 RNAi background in order to simultaneously assay for targeting and function, but this was (unfortunately, perhaps) not part of the original experimental design. To do so now would require generating a completely new panel of truncation constructs with recoded DNA (in order to make them RNAi-resistant) and then generating a new panel of cell lines. While this would be informative, we feel that it would be impractical at present.

      In the analysis of viability changes due to TbSmee1 depletion (lines 237) the authors state that at "72 h post-induction showed widespread lysis, ..." This phenotype seems inconsistent with other related endocytic defect mutants. There is no further mention of this lysis phenomenon here or in the discussion and considering how unique this seems it deserves either additional data to demonstrate or further discussion as to the basis of the phenotype. It seems, at least from this study of TbStarkey1 and prior studies which result in the enlarged flagellar pocket phenotype, that having an enlarged pocket is not the cause of lysis and doesn't even naturally lead to a growth defect.

      Widespread lysis is the usual outcome of bloodstream form cells with strong endocytic defects - we have observed this directly for the clathrin, TbMORN1, and TbSmee1 RNAi cell lines, and it has been documented in a number of other publications (see for example Natesan et al., 2010, Manna et al., 2017). We have clarified this point in the text (see for examples lines 359-341, 474-478 of tracked-changes manuscript).

      The authors do not comment on what is the source for the cessation in growth following TbSmee1 knockdown. Is it nutrient depravation like in other endocytic defect mutants?

      Implemented (see for example lines 359-361, 605-610 of the tracked-changes manuscript). The source of the growth defect is likely to be due to impaired cell division cycle progression due to the gross enlargement of the flagellar pocket and subsequent steric hindrance and imbalance of membrane homeostasis.

      In the end, one of the most interesting observations made by the authors is that loss of TbSmee1 inhibits endocytosis and this has the appearance of not allowing large molecule substrates like ConA and BSA to enter into the flagellar pocket. This appeared to have nothing to do with a gatekeeping type function of the hook complex/flagellar collar and instead, as shown through clathrin knockdown, was related to the ability of the parasite to endocytose. There are a lot of potential interpretations of this phenomenon with one being a simple perturbation of the normal membrane trafficking to and from the flagellar pocket being involved. An analysis of knockdown of exocytic components might reveal whether or not this inability to enter into the pocket is also seen when exocyst proteins are also depleted. It may be impossible to tease apart these two interrelated activities but it might eliminate one side of the equation if these proteins can still enter the flagellar pocket when exocytosis if perturbed although this reviewer understands that that dimension of T. brucei membrane trafficking is poorly understood relative to endocytosis.

      This is an interesting point, and the reviewer is also correct in highlighting that exocytosis is far less characterised than endocytosis in Trypanosoma brucei. The exocyst has been characterised in bloodstream form T. brucei (Boehm et al., 2017) and shown to also have a role in endocytosis, so teasing out the relative contributions of these pathways would undoubtedly be challenging. We would prefer not to go in this direction in this present study, but it is an obvious avenue for future work.

      An intriguing possibility that the authors allude to and which if answered would make this manuscript have a far broader appeal is to determine if loss of TbSmee1 alters the lipid kinase distribution and if this is the source of the negative impact on endocytosis. One important dimension of endocytosis in T. brucei which remains poorly understood is the role of signaling machinery in triggering endocytic events. It is possible that the hook complex serves as the gatekeeping or signaling platform that recruits signaling components (like lipid kinases) that identify and/or modify the membrane lipid phosphatidylinositols harboring cargo laden receptors thus marking them for endocytosis within the pocket. It still seems unclear when in the process of endocytosis is the decision made to pull things into the pocket but it seems that the assumption is that this occurs deep within the pocket. This data suggests that there is possibly another decision point prior to being allowed entrance into the pocket. It may be that this isn't a gatekeeping decision but rather a stop vs. go activity where once cargo laden membrane reaches the collar a choice is made to pull this material in or not there and not after material is already in the pocket.

      These are all really interesting ideas and would be fascinating topics for future work.

      This obvious enigma based on the observation that loss of hook complex components affect the spatially separated site of endocytosis support the idea that the actual endocytic signaling platforms are located at the hook complex and that this area may make the membrane modifications that mark membrane as being ready to be endocytosed via clathin coated vesicles at the bottom of the pocket. This would still allow for fluid phase small molecule entrance which does not require binding to surface proteins. The obvious problems of having both endo/exocytosis occurring in the same close proximity makes the dissection of this phenomenon difficult but it is worth potentially expounding on further in the discussion as this idea is very appealing and adds an important dimension to our understanding of endocytosis in this organism.

      Implemented (lines 722-727 of the tracked-changes manuscript). We have added some more detail to these points in the Discussion. We agree with the reviewer that there are some profoundly interesting questions concerning membrane identify and membrane protein uptake here.

      Minor Critiques:<br /> The authors commit significant time to the analysis of the phosphorylation of TbSmee1, but there is little stated about the role of TbPLK in this activity or the potential connection of TbSmee1 phosphorylation to the cell cycle. Would a knockdown of TbPLK using RNAi potentially demonstrate an altered migration of TbSmee1 due to a lack of phosphorylation? An analysis of radiolabeled TbSmee1 using p32 in vivo would likely support this claim as well. Has mass spectrometry identified potential phosphorylation sites to examine? Additionally, the loss of TbSmee1 has been shown to disrupt localization of TbPLK in procyclic cells and so why this was not also assessed in bloodstream form cells subjected to RNAi was not clear.

      Partly implemented. We have added some discussion of the possible role of TbSmee1 phosphorylation in the cell cycle to the Discussion (lines 562-565 of tracked-changes manuscript), and emphasised the identification of phosphorylation sites in previous phosphoproteomics work (citations of Nett et al., 2009, Urbaniak et al., 2013). Given that the strongest and earliest effect of TbSmee1 depletion was on endocytosis and cargo uptake, we chose to focus on this angle rather than exploring its contribution to the biogenesis of cytoskeleton-associated structures and its interaction with TbPLK. For that reason we would prefer not to carry out the experiments looking at the effects of TbSmee1 depletion on TbPLK or vice versa.

      In the results section (lines 104-108) a model of the protein structure as predicted for example by AlphaFold might be informative and complement the domain analysis work depending on the quality of the prediction.

      Implemented. The AlphaFold prediction is consistent with the predictions made by the other structural analyses, and we have noted this in the text (lines 145-148 and 551 of the tracked-changes version).

      There is an arrow in the Figure 1B Western blot but I can find no mention of what it is trying to highlight in the text.

      Corrected.

      For Figure 1D there is no loading control or control for the distribution of the soluble fraction to validate the separation of the two compartments.

      Implemented. We have carried out additional experiments to show the partitioning of a cytoplasmic protein (the endoplasmic reticulum chaperone BiP) into the detergent-soluble fraction. These results are now displayed in the updated Figure 1.

      The authors fail to comment on the lack of changes in hook complex components they see to that observed by Perry et. al. 2018. This difference merits some minor comment or speculation.

      Implemented. We have added this commentary to the Discussion (lines 592-600 of the tracked-changes version).

      Line 228: domain should be capitalized.

      Implemented.

      Line 230: FigS5C should have a space and period after Fig. and S5C.

      Implemented.

      Line 244: "on" should be inserted in the sentence "...TbSmee1 protein depletion ON either side of the onset..."

      Implemented.

      Line 400: the '...20/21 h post-induction...' is slightly confusing and may read better as 20-21 h.

      Implemented.

      Line 463: a space is needed between '...2009).The...'.

      Implemented.

      Reviewer #2 (Significance):

      This manuscript advances our current conception of endocytosis in T. brucei. Although this model kinetoplastid parasite has been extensively studied with respect to endocytosis there is still a great deal we do not yet understand regarding how this process is regulated at a mechanistic level. This work has begun to connect previously unappreciated aspects of endocytosis in T. brucei by highlighting a potentially novel connection between the flagellar collar/hook complex and the physically separated endocytic events within the flagellar pocket itself. It may be that what appears as regulated entrance into the pocket is in fact the source of signaling that triggers the endocytic events carried out by clathrin. This is an interesting notion that no doubt requires further investigation which lies outside of the scope of this report. While this work appeals primarily to those studying kinetoplastids parasites it has the potential to provide insight into basic protozoan biology as well. Due to my related interest in kinetoplastid endocytosis, I find this work to be of high quality, conceptually interesting and employs many of the cutting-edge techniques currently available in the study of T. brucei.

      We are very happy that the Reviewer formed a favourable impression of the work.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This manuscript begins to dissect the function of the hook complex protein SMEE1 in the mammalian infective form of T. brucei. The hook complex is a cytoskeletal structure associated with the flagellar pocket, the only site of endo/exocytosis in these cells. The authors demonstrate that SMEE1 is required for endocytosis in these cells and that this can occur with minimal change to the molecular make-up of the hook complex. The authors show that endocytosis is important for the access of large molecules e.g. ConA into the flagellar pocket.

      Major comments

      The key conclusion of this study are convincing and the data is generally well presented and clear. The interpretation of the figures matches well with the data presented - there are a few minor issues though that I have highlighted below in minor comments. The authors use a range of molecular cell biology approaches to define the role of SMEE1 and these are appropriate and are well controlled.

      Thank you.

      My major comment focuses on the use of different tracers to study endocytosis but the elephant in the room is what is happening to VSG as this is the surface protein that needs to rapidly removed from the cell surface and cleaned. Given the importance of removal of antibodies bound to the VSG - have the authors looked at this in the SMEE1 depleted cells? Do VSG-antibody complexes accumulate in this region? This is an important experiment as this would give key physiologically relevant data to this study. All the material should be readily available for this as there are a number of VSG antibodies.

      We agree with the Reviewer that the behaviour of these VSG-bound antibodies is a key test of the physiological relevance of the observations we have made using ConA and BSA, and have implemented this request - the results are in the new Figure 9. Although they sound simple, these assays turned out to be far from trivial and much more technically challenging than the other uptake assays, owing to the extremely fast kinetics (seconds) of anti-VSG uptake (Engstler et al., 2007) and the unexpectedly and incredibly high losses of bound antibodies during the assay. This might be due to shedding, as noted in the Discussion.

      Minor comments<br /> Perhaps I have been overthinking this but is surface-bound the right way to describe the cargo, as it clearly goes in both directions onto and off the surface and in fact the experiments in this manuscript are focussing on the removal of this material from the surface so is not surface-bound.

      We have clarified that "surface-bound" refers to material that binds to the surface glycoprotein coat of the trypanosomes and which is subsequently internalised, not material that is bound for (i.e being directed to) the cell surface (lines 77-78 of tracked-changes version). We hope this addresses the Reviewer's point?

      Have the authors investigated the structure of the protein using alphafold and if so how does that compare to the domain structure that was presented in this manuscript?

      Implemented (lines 145-148, 551 of tracked-changes version). We have checked the AlphaFold prediction of the three-dimensional structure of TbSmee1 and noted it in the Results; the prediction is consistent with the earlier bioinformatic analyses.

      The authors raised a number of antibodies to TbSMEE1 and TbSTARKEY1 but it was not clear in the figures which antibody was ultimately used for analysis by western and IF - could the authors clarify, as some looked to have a higher background than others. Line 150 states the same localisation was seen for all three antibodies and references S3C but I couldn't see that data presented.

      Implemented - the 304 antisera was used for most subsequent experiments and we have noted this in the M&M (lines 793-798 of tracked-changes version). Figure S3C shows that the Ty1-TbSmee1 recapitulates the localisation of the antibodies against the endogenous protein - we have clarified this point as well (lines 206-207 of tracked-changes version).

      Line 169 - can the authors provide more detail about the global correlation methodology as I was unable to follow the details in the methods? Is this a pixel per pixel correlation over the image or on a selected region over the area of potential signal overlap? In figure 2E it appears that BILBO1 signal correlates more closely with the SMEE1 signal than MORN and LRRP1 and from the images that would not seem to be the case. Have I interpreted this figure incorrectly?

      Implemented. The original analysis was a global correlation analysis that was determining whether the signals were correlated with each other regardless of spatial overlap, and we agree with the reviewer that these outputs were non-intuitive to interpret. In the revision, we have carried out a new analysis (and updated the accompanying text and M&M section), measuring the degree of spatial correlation between each pair of signals on a pixel-by-pixel basis over the area of each cell, with a total of 30 cells analysed in each pairing. We believe that this addresses the reviewer's point. See lines 223-243, 963-974 of the tracked-changes version).

      The authors have generated a number of different clones and performed experiments on these clones generally more than twice, which is clearly explained in the figure legends but in places the data is then put together and it is difficult to know which experiments/clones it comes from - for example 7C/7F what do those percentages represent? Is this the sum of all experiments? A representative experiment? How many cells per experiment were analysed?

      Implemented. We have double-checked all the figure legends and clarified this point where necessary. Quantifications were always made by compiling data from multiple independent experiments using multiple separate clones - see in particular lines 1323-1324, 1363-1365, 1380-1382 of the tracked-changes version.

      Line 200 - From the image it is not convincing that SMEE1 is slightly behind DOT1 - I agree it looks enveloped but would appear level with the distal end of the DOT1 signal.

      Implemented. We have adopted the Reviewer's wording for this text (line 271 of tracked-changes version).

      For the truncation experiments the authors should explain that these are performed with cells in which the endogenous SMEE1 will be expressed and this may influence the localisation of the truncations, especially as there is no information about whether SMEE1 forms complexes with itself or other proteins.

      Implemented (lines 296-298 of tracked-changes version).

      Figure 4D - should be 1 not T-

      We have relabelled this as "TbSmee1". The values in this column are the immunoblot signal intensities obtained for the endogenous TbSmee1 protein in the -Tet condition. We have also clarified this in the figure legend.

      Line 223 - given the low expression of constructs 2 and 9 I'm not sure it is possible to infer anything from the lack of localisation of these constructs as they appear unstable and would be unlikely to localise to a specific location.

      We have added this caveat to the text (lines 558-562 of tracked-changes version).

      Figure S7 - The images presented were not convincing that there was a reduction in the localisation of LRRP1 to the hook complex on depletion of either SMEE1 or MORN1. The difference looks particularly minor if present at all.

      Agreed, there was some debate in the group about these results. We have changed to text to fit the Reviewer's interpretation (lines 347-348 of the tracked-changes version).

      Line 264 - "implied that the lethal phenotype might be due to a loss of function" - this seems an odd thing to say as it doesn't provide any insight as of course the phenotype is due to a loss of function.

      We have clarified this point (lines 350-353 of the tracked-changes version). We would however disagree with the reviewer that RNAi phenotypes are exclusively due to a loss of individual protein's function(s) - when proteins are present in multiprotein complexes (as is often the case with cytoskeleton-associated proteins), then destabilisation of the complex due to loss of the entire protein can cause the observed phenotype, rather than the loss of the function performed by the individual protein within the complex (this may be a semantic point, however). A very good example of this is with the outer arm dynein complex component LC1 (Ralston et al., 2011) - RNAi against LC1 is lethal because the entire outer arm dynein complex is destabilised, whereas expression of non-functional mutants of LC1 produces viable cells with motility defects due to the specific loss of LC1 function.

      Line 412 - can the authors clarify what they mean by geometric problems?

      Implemented (lines 605-610 of tracked-changes version). We were referring to the fact that enlargement of the flagellar pocket will probably create difficulties for the progression of the cell division cycle.

      Throughout the manuscript can you use log scale for the growth curves.

      Implemented.

      Line 756 - add citation

      Whoops! Implemented (line 1058 of tracked-changes version).

      Line 465/66 - the authors states that the ability of the fluid phase cargo being still able to enter the pocket is evidence that the channel lumen is still open; however, I would think that despite the close apposition of the cell membrane to the flagellar membrane in the flagellar pocket neck region this would be unlikely to impede fluid/soluble material from entering the pocket, as presumably VSG protein can move through this region. This does not alter the ultimate conclusion the authors are drawing but without microscopy evidence for the state of the channel lumen it is difficult to be sure of its status.

      Fair point. We have modified this statement (line 701 in tracked-changes version).

      Reviewer #3 (Significance):

      The flagellar pocket is the key portal into and out of the trypanosome cell and as such has a vital role to play in host-parasite interactions. The flagellar pocket is supported by a number of cytoskeletal structures including the hook complex and the role of these structures in flagellar pocket function are poorly understood. The flagellar pocket is particularly important in the bloodstream form of the trypanosome parasite which infects the mammalian host as it is the route for the surface protein VSG to get onto and off the surface. The VSG is required for antigenic variation and the removal of VSG-antibody complexes helps 'clean' the surface of the parasite. SMEE1 is a component of the hook complex and the manuscript here dissects its role in the mammalian infective parasite and shows that it is vital for the endocytosis of material off the surface. Intriguingly, a block in endocytosis causes a blockage of material outside of the pocket, suggesting a multi-step process in the regulation of uptake of material from the parasite's surface.<br /> This manuscript will be of specific interest to those researchers investigating the long-term persistence of these parasites in the mammalian host. There are potentially some insights into the control of membrane domains for endocytosis that are of interest to more general cell biologists as well.

      We are very grateful to the reviewer for the supportive comments and the constructive evaluation. Many thanks!

      Expert in molecular cell biology of trypanosomes and Leishmania.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript begins to dissect the function of the hook complex protein SMEE1 in the mammalian infective form of T. brucei. The hook complex is a cytoskeletal structure associated with the flagellar pocket, the only site of endo/exocytosis in these cells. The authors demonstrate that SMEE1 is required for endocytosis in these cells and that this can occur with minimal change to the molecular make-up of the hook complex. The authors show that endocytosis is important for the access of large molecules e.g. ConA into the flagellar pocket.

      Major comments

      The key conclusion of this study are convincing and the data is generally well presented and clear. The interpretation of the figures matches well with the data presented - there are a few minor issues though that I have highlighted below in minor comments. The authors use a range of molecular cell biology approaches to define the role of SMEE1 and these are appropriate and are well controlled.

      My major comment focuses on the use of different tracers to study endocytosis but the elephant in the room is what is happening to VSG as this is the surface protein that needs to rapidly removed from the cell surface and cleaned. Given the importance of removal of antibodies bound to the VSG - have the authors looked at this in the SMEE1 depleted cells? Do VSG-antibody complexes accumulate in this region? This is an important experiment as this would give key physiologically relevant data to this study. All the material should be readily available for this as there are a number of VSG antibodies.

      Minor comments

      Perhaps I have been overthinking this but is surface-bound the right way to describe the cargo, as it clearly goes in both directions onto and off the surface and in fact the experiments in this manuscript are focussing on the removal of this material from the surface so is not surface-bound.

      Have the authors investigated the structure of the protein using alphafold and if so how does that compare to the domain structure that was presented in this manuscript?

      The authors raised a number of antibodies to TbSMEE1 and TbSTARKEY1 but it was not clear in the figures which antibody was ultimately used for analysis by western and IF - could the authors clarify, as some looked to have a higher background than others. Line 150 states the same localisation was seen for all three antibodies and references S3C but I couldn't see that data presented.

      Line 169 - can the authors provide more detail about the global correlation methodology as I was unable to follow the details in the methods? Is this a pixel per pixel correlation over the image or on a selected region over the area of potential signal overlap? In figure 2E it appears that BILBO1 signal correlates more closely with the SMEE1 signal than MORN and LRRP1 and from the images that would not seem to be the case. Have I interpreted this figure incorrectly?

      The authors have generated a number of different clones and performed experiments on these clones generally more than twice, which is clearly explained in the figure legends but in places the data is then put together and it is difficult to know which experiments/clones it comes from - for example 7C/7F what do those percentages represent? Is this the sum of all experiments? A representative experiment? How many cells per experiment were analysed?

      Line 200 - From the image it is not convincing that SMEE1 is slightly behind DOT1 - I agree it looks enveloped but would appear level with the distal end of the DOT1 signal.

      For the truncation experiments the authors should explain that these are performed with cells in which the endogenous SMEE1 will be expressed and this may influence the localisation of the truncations, especially as there is no information about whether SMEE1 forms complexes with itself or other proteins.

      Figure 4D - should be 1 not T-

      Line 223 - given the low expression of constructs 2 and 9 I'm not sure it is possible to infer anything from the lack of localisation of these constructs as they appear unstable and would be unlikely to localise to a specific location.

      Figure S7 - The images presented were not convincing that there was a reduction in the localisation of LRRP1 to the hook complex on depletion of either SMEE1 or MORN1. The difference looks particularly minor if present at all.

      Line 264 - "implied that the lethal phenotype might be due to a loss of function" - this seems an odd thing to say as it doesn't provide any insight as of course the phenotype is due to a loss of function.

      Line 412 - can the authors clarify what they mean by geometric problems?

      Throughout the manuscript can you use log scale for the growth curves.

      Line 756 - add citation

      Line 465/66 - the authors states that the ability of the fluid phase cargo being still able to enter the pocket is evidence that the channel lumen is still open; however, I would think that despite the close apposition of the cell membrane to the flagellar membrane in the flagellar pocket neck region this would be unlikely to impede fluid/soluble material from entering the pocket, as presumably VSG protein can move through this region. This does not alter the ultimate conclusion the authors are drawing but without microscopy evidence for the state of the channel lumen it is difficult to be sure of its status.

      Significance

      The flagellar pocket is the key portal into and out of the trypanosome cell and as such has a vital role to play in host-parasite interactions. The flagellar pocket is supported by a number of cytoskeletal structures including the hook complex and the role of these structures in flagellar pocket function are poorly understood. The flagellar pocket is particularly important in the bloodstream form of the trypanosome parasite which infects the mammalian host as it is the route for the surface protein VSG to get onto and off the surface. The VSG is required for antigenic variation and the removal of VSG-antibody complexes helps 'clean' the surface of the parasite. SMEE1 is a component of the hook complex and the manuscript here dissects its role in the mammalian infective parasite and shows that it is vital for the endocytosis of material off the surface. Intriguingly, a block in endocytosis causes a blockage of material outside of the pocket, suggesting a multi-step process in the regulation of uptake of material from the parasite's surface.<br /> This manuscript will be of specific interest to those researchers investigating the long-term persistence of these parasites in the mammalian host. There are potentially some insights into the control of membrane domains for endocytosis that are of interest to more general cell biologists as well.

      Expert in molecular cell biology of trypanosomes and Leishmania.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript the authors have advanced our understanding of the hook complex component TbSmee1 through a detailed analysis of this protein's role in the endocytosis of surface bound proteins via the flagellar pocket in bloodstream form Trypanosoma brucei. The TbSmee1 protein, previously identified using proximity labeling using TbMORN1 and TbPLK, and characterized in procyclic T. brucei, was confirmed to target to both the shank portion of the hook complex as well as the growing end of the new FAZ in replicating cells. The protein was also shown to likely be phosphorylated as had been suggested previously due to its association with the kinase TbPLK. A domain deletion analysis demonstrated that domains 2 and 3 are important for TbSmee1's proper localization to the hook complex. Loss of TbSmee1 using RNAi based knockdown resulted in a quick cessation of growth in the bloodstream form within 24 hours in contrast to what was seen previously in procyclic cells which had only a decreased growth rate. Loss of TbSmee1 also resulted in an enlargement of the flagellar pocket and in many ways mirrored the phenotype observed with knockdown of TbMORN1. Although prior work on TbSmee1 in procyclic T. brucei demonstrated that loss of this protein altered the morphology of TbMORN1, no such change was seen in bloodstream form cells and only an alteration in the morphology of TbLRRP1 was observed. In characterizing the effect of TbSmee1 depletion on endocytosis the authors showed that the fluid phase marker Dextran could enter into the flagellar pocket of TbSmee1 depleted parasites while the surface bound ConA and BSA remained outside of the flagellar pocket suggesting that TbSmee1 may play a role in allowing larger protein components into the pocket regions. Similar observations were also previously seen with TbMORN1 depletion. Importantly, a knockdown of clathrin recapitulated the TbSmee1 knockdown phenotype suggesting that endocytosis itself was required to allow material bound at the surface to enter into the flagellar pocket. In addition to adding to our understanding of hook complex components, this work raises some interesting questions regarding the role of the hook complex in facilitating endocytosis in this important human pathogen.

      Major Critiques:

      This is a superbly written manuscript with robust high-quality data that strongly support the major conclusions made by the authors. The flow the article is logical and easy to follow making it accessible to a wide array of readers. Although I appreciate the brevity of the introduction and how the article gets straight to the point, additional background information on the components and function of the flagellar pocket collar protein could help contextualize the goals of the project. The way in which the flagellar collar structures are introduced to the reader is quite abrupt (beginning on line 75) and simply states the names of TbBILBO1, the centrin arm and hook complex as simple facts without much discussion about the background of these components/regions. A graphical representation of the centrin arm or hook complexes relative to other components like the pocket itself, FAZ or axoneme could make following the story much easier. An expansion of this background could also go a long way to convince readers of the importance of this region in the basic biology and virulence of T. brucei.

      On lines 84-86 the authors cite the way in which 'small' vs 'large' macromolecules enter into the pocket without defining what exactly is meant by these terms as they are relative in nature. Setting some boundaries of size could provide some context to the reader.

      In the domain localization analysis beginning in Figure 4 there is a missed opportunity to also assess which portions of the TbSmee1 protein are important for overall function as well. By either an examination of dominant negative phenotypes resulting from overexpression of the truncated mutant or the expression of the truncated forms designed to be RNAi resistant in the TbSmee1 knockdown cell line, one could also assess which portions of this protein are essential for endocytic function in addition to targeting. Is there a reason this was not performed?

      In the analysis of viability changes due to TbSmee1 depletion (lines 237) the authors state that at "72 h post-induction showed widespread lysis, ..." This phenotype seems inconsistent with other related endocytic defect mutants. There is no further mention of this lysis phenomenon here or in the discussion and considering how unique this seems it deserves either additional data to demonstrate or further discussion as to the basis of the phenotype. It seems, at least from this study of TbStarkey1 and prior studies which result in the enlarged flagellar pocket phenotype, that having an enlarged pocket is not the cause of lysis and doesn't even naturally lead to a growth defect.

      The authors do not comment on what is the source for the cessation in growth following TbSmee1 knockdown. Is it nutrient depravation like in other endocytic defect mutants?

      In the end, one of the most interesting observations made by the authors is that loss of TbSmee1 inhibits endocytosis and this has the appearance of not allowing large molecule substrates like ConA and BSA to enter into the flagellar pocket. This appeared to have nothing to do with a gatekeeping type function of the hook complex/flagellar collar and instead, as shown through clathrin knockdown, was related to the ability of the parasite to endocytose. There are a lot of potential interpretations of this phenomenon with one being a simple perturbation of the normal membrane trafficking to and from the flagellar pocket being involved. An analysis of knockdown of exocytic components might reveal whether or not this inability to enter into the pocket is also seen when exocyst proteins are also depleted. It may be impossible to tease apart these two interrelated activities but it might eliminate one side of the equation if these proteins can still enter the flagellar pocket when exocytosis if perturbed although this reviewer understands that that dimension of T. brucei membrane trafficking is poorly understood relative to endocytosis.

      An intriguing possibility that the authors allude to and which if answered would make this manuscript have a far broader appeal is to determine if loss of TbSmee1 alters the lipid kinase distribution and if this is the source of the negative impact on endocytosis. One important dimension of endocytosis in T. brucei which remains poorly understood is the role of signaling machinery in triggering endocytic events. It is possible that the hook complex serves as the gatekeeping or signaling platform that recruits signaling components (like lipid kinases) that identify and/or modify the membrane lipid phosphatidylinositols harboring cargo laden receptors thus marking them for endocytosis within the pocket. It still seems unclear when in the process of endocytosis is the decision made to pull things into the pocket but it seems that the assumption is that this occurs deep within the pocket. This data suggests that there is possibly another decision point prior to being allowed entrance into the pocket. It may be that this isn't a gatekeeping decision but rather a stop vs. go activity where once cargo laden membrane reaches the collar a choice is made to pull this material in or not there and not after material is already in the pocket.

      This obvious enigma based on the observation that loss of hook complex components affect the spatially separated site of endocytosis support the idea that the actual endocytic signaling platforms are located at the hook complex and that this area may make the membrane modifications that mark membrane as being ready to be endocytosed via clathin coated vesicles at the bottom of the pocket. This would still allow for fluid phase small molecule entrance which does not require binding to surface proteins. The obvious problems of having both endo/exocytosis occurring in the same close proximity makes the dissection of this phenomenon difficult but it is worth potentially expounding on further in the discussion as this idea is very appealing and adds an important dimension to our understanding of endocytosis in this organism.

      Minor Critiques:

      The authors commit significant time to the analysis of the phosphorylation of TbSmee1, but there is little stated about the role of TbPLK in this activity or the potential connection of TbSmee1 phosphorylation to the cell cycle. Would a knockdown of TbPLK using RNAi potentially demonstrate an altered migration of TbSmee1 due to a lack of phosphorylation? An analysis of radiolabeled TbSmee1 using p32 in vivo would likely support this claim as well. Has mass spectrometry identified potential phosphorylation sites to examine? Additionally, the loss of TbSmee1 has been shown to disrupt localization of TbPLK in procyclic cells and so why this was not also assessed in bloodstream form cells subjected to RNAi was not clear.

      In the results section (lines 104-108) a model of the protein structure as predicted for example by AlphaFold might be informative and complement the domain analysis work depending on the quality of the prediction.

      There is an arrow in the Figure 1B Western blot but I can find no mention of what it is trying to highlight in the text.

      For Figure 1D there is no loading control or control for the distribution of the soluble fraction to validate the separation of the two compartments.

      The authors fail to comment on the lack of changes in hook complex components they see to that observed by Perry et. al. 2018. This difference merits some minor comment or speculation.

      Line 228: domain should be capitalized.

      Line 230: FigS5C should have a space and period after Fig. and S5C.

      Line 244: "on" should be inserted in the sentence "...TbSmee1 protein depletion ON either side of the onset..."

      Line 400: the '...20/21 h post-induction...' is slightly confusing and may read better as 20-21 h.

      Line 463: a space is needed between '...2009).The...'.

      Significance

      This manuscript advances our current conception of endocytosis in T. brucei. Although this model kinetoplastid parasite has been extensively studied with respect to endocytosis there is still a great deal we do not yet understand regarding how this process is regulated at a mechanistic level. This work has begun to connect previously unappreciated aspects of endocytosis in T. brucei by highlighting a potentially novel connection between the flagellar collar/hook complex and the physically separated endocytic events within the flagellar pocket itself. It may be that what appears as regulated entrance into the pocket is in fact the source of signaling that triggers the endocytic events carried out by clathrin. This is an interesting notion that no doubt requires further investigation which lies outside of the scope of this report. While this work appeals primarily to those studying kinetoplastids parasites it has the potential to provide insight into basic protozoan biology as well. Due to my related interest in kinetoplastid endocytosis, I find this work to be of high quality, conceptually interesting and employs many of the cutting-edge techniques currently available in the study of T. brucei.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Unfortunately, this paper adds only a little to our understanding of uptake in to the flagellar pocket of trypanosomes. It tends to add only detail to information that has been well characterised elsewhere and indeed, as the authors themselves point out, (lines 92-98) it is rather incremental. Not only has Tbsmee1 been studied before but this data in bloodstream forms is not particularly novel since it gives much the same information as the canonical hook protein TbMORN.

      This work follows the pattern of conclusions made previously with the protein TbMORN. It focusses on the protein TbSmee where RNAi mutants are interpreted to show flagellar pocket enlargement and impaired access by surface bound cargo. Unfortunately, there is little mechanistic or functional conclusion to the study in terms of how TbSmee operates naturally in the cell. There are other possible explanations for the phenotype. That would need to be studied. This large flagellar pocket phenotype is seen with RNAi mutants of many different types of proteins in the trypanosome and so pleiotropic effects are highly likely.

      Also, there are a good number of alternative possibilities to account for reduced access to the pocket in these mutants and this data could be usefully added.

      Specific points

      1. The transient location for the TbSmee at the FAZ tip - or in this case the groove region - was seen in procyclics (Perry, 2018) so this bloodstream indication merely confirms that concept.
      2. The C terminal region required for targeting is a reasonable deletion analysis of regions of the protein. But can this data (line 228) be said to "mediate targeting" - or is it just required. For instance, targeting might be OK but it might be needed for stable association, etc etc.
      3. This protein has already been shown to be phosphorylated and the sites and cell cycle possibilities have been mapped by Urbaniak. So that section adds little. https://doi.org/10.1371/journal.ppat.1008129
      4. Essentiality in BS forms and pocket enlargement. This is not surprising. A very large number of cytoskeletal proteins show this in RNAi knockdown. Flagella mutants (extensive publications from many groups (Hill, Bastin, Gull, etc) over last 15 years show this very well and so this protein is just one more example.
      5. I didn't find that the explanations for flagella pocket enlargement are soundly based. The experiments focus on endocytosis and uptake and ignore other plausible reasons and some evidence in literature.<br /> Lines 84/85. Enlarged pockets may be indicative of endocytosis failure. Presumably the rationale is that endocytosis fails, but exocytosis still occurs and the pocket membrane enlarges. What evidence is there that exocytosis of membrane still occurs? This simple concept might indeed operate in a clathrin mutant but is surface membrane/content exocytosis is maintained in these cytoskeleton mutants? There is good evidence for glycoconjugates within the flagellar pocket. Are these depleted or present still?
      6. There are also a number of other publications indicating that clathrin pits are still present on the enlarged pockets of various mutants when viewed by EM. The authors have looked at the flagellar pockets by EM but the EM methods described have extensive washings and centrifugations before fixation. This is a very poor approach and will mean that endo and exocytic traffic is disturbed (extensive references in literature in other systems? This is not a useful approach for exo/endocytosos studies where flux of traffic demands fast chemical or freezing fix in media.
      7. The EMs and Light microscopy does show that the mutant pockets are substantially abnormal in their cytoskeletal arrangement. They have multiple flagella profiles, flagella structures have not connected with the membrane and are sometimes in the cytoplasm (see a glance of the paraflagellar rod in the cytoplasm in FigS5C and internalised FAZ attachment plaques in Fig 4 D bottom right cell). Given these extensive (and expected) cytoskeletal abnormalities it is highly likely that these pocket abnormalities are a result of motility, cell division/developmental issues and the differential uptake phenotypes merely consequential.
      8. The authors speak about early phenotypes , but these are often at 15-24 hours. That is probably a couple of cell cycles and so not early. In relation to the above question of comparison to the same morphology produced by flagella mutants it would be good to know if these hook mutants produce motility phenotypes and whether these are manifest before the uptake phenotypes. There is evidence (cited here) that forward motility of the trypanosome directs material on surface into the pocket. If these cells have motility defects (primary or via failed division) then surely that would provide an alternative simple explanation for uptake differences.
      9. There is a general point that if studies are to have real relevance to uptake in the trypanosome then they need to deal with uptake of natural ligands rather than artificial surrogates such as dextran. Such tracers were used historically, but in the last decade a series of receptors and ligands for fluid phase and particularly membrane mediated endocytosis have been discovered. With the investment of a little time these important ligand / receptors such as haptoglobin, transferrin, etc would be much more relevant.

      Referees cross-commenting

      This session includes comments from Reviewer 1 and Reviewer 2.

      Reviewer 2

      Dear Reviewers 1 and 3:<br /> I agree with many of the points with Reviewer 1 and our divergence is partly a matter of degree. While it is true that this manuscript is incremental in its contribution to our understanding of TbSmee1, it nonetheless adds to our understanding of the role of this protein in the bloodstream life stage and because of that I find value in the work. The fact that it mirrors what was seem in other protein knockdown studies (e.g. TbMORN) doesn't negate its contribution for me. Reviewer 1 makes an important point, however, when stating that this work does not add a mechanistic or functional conclusion as to how TbSmee1 operates and for me that is the biggest shortcoming of the work. Offering mechanistic insight is a high bar and while it would make for a much more exciting story it does not discount the value of the work as presented. What I do appreciate is the speculation about this observation that endocytosis is required for entrance of surface bound material into the pocket and although they are unable to show that this is not a side affect of other processes being disrupted it is and intriguing point. These observation have the potential of stimulating further investigations into crosstalk between the entrance to the pocket and endocytosis. I also agree that the use of ligands for known receptors like transferrin would be far more informative. While I assumed the transferrin receptor was in the pocket itself it would be interesting to see if the ESAG6/7 is also located outside the pocket and transiently binds cargo before being brought inside for endocytosis.<br /> I think that Reviewer 3 brings up a great point with the focus on VSG's. I think that examining VSG turnover in these mutants can add value to the analysis and inform our view of how affecting the hook complex alters VSG endocytosis.

      Reviewer 1

      some fair comment and agreement. This is being sent to general cell biology journals.<br /> when one looks at this area in the round it is nearly 50 years (1975) since Langreth and Balber published their seminal work on protein uptake and digestion in bloodstream and culture forms of T. brucei. There has been 50 years intense study and the genome has been around for nearly 20 years as well. So, put simply - for both a general science audience and the wider parasite community - if this is a paper about one protein, TbSmee1,then it has surely has to say something functional about that protein. If it is a paper about uptake in trypanosomes (where mutants are one means of interrogation) then it surely has to say something about mechanisms of uptake of physiological relevant ligands. The days of dextran etc are past. Hence, my comment that this does neither and so is very incremental to what is known already. It is 2022 not 1975. Langreth and Baber published their seminal work in J Protozoology for very good reasons no doubt.

      Reviewer 2<br /> Thank you for replying and I agree with the spirit of your critique. My only comment, which could result from my own naivete, is to say that despite the incredible work that has been done in dissecting endocytosis in T. brucei over these past 50 years, it appears that we still do not understand how many fundamental of aspects of this activity works in this parasite. Even basic questions regarding how cargo, e.g. transferrin, binding to surface receptors is sensed by the parasite remains unknown and the identity of the specific signaling components which transmit this information internally to initiate endocytosis have not been characterized. In many ways it seems that we don't even understand how the parasite partitions the end/exocytic pathways in the pocket and maintains membrane homeostasis. While we know that some kinases and traditional signaling components must be involved, a high resolution understanding of this process in T. brucei seems lacking. I only say all this to suggest that the field maybe isn't yet that advanced to reject work of this type as so many mechanistic unknowns still remain to be uncovered and maybe incremental advances and phenomenology still can add value to the field. However, I respect your opinion on the matter and my perspective could be due to a lack of a full appreciation of the literature on the subject.

      Significance

      Unfortunately, I did not find tis to be very significant. It covers old ground in terms of the phenotype described. Many groups have shown the differences between pro cyclic and bloodstream phenotypes in this enlarged pocket phenomenon. The work is rather incremental from these and other author's work on these hook proteins.

      There are alternative explanations for understanding the effect of flagella pocket structure and uptake of ligands into the pocket and trypanosome cell. These would need to be studied before one could see a functional, mechanistic link established.

      Other parts of this are of nicely done but do not move on our understanding (eg targeting/phosphorylation) from what has been done previously.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Kagermeier et al. present a novel and interesting study that attempts to model a severe neurodevelopmental disorder, pontocerebellar hypoplasia type 2a, using neocortical and cerebellar organoids. Brain organoids are an appropriate and promising approach to elucidate disease mechanisms in neurodevelopmental diseases. The authors show a reduction in the size of the organoids which is more pronounced in the cerebellar compared to neocortical organoids. While this finding is interesting and reminiscent of the clinical PCH2a phenotype, i.e., cerebellar hypoplasia, the study is very preliminary and the conclusions of the manuscript are not supported by the data. Additional information and further experiments are necessary to support the claims made.

      Major concerns:

      1. hiPSC lines show considerable inter- and intra-individual variability and therefore the size differences observed between these control and patient-derived organoids may arise from differences in the hiPSC lines used. While the data sufficiently demonstrates the pluripotency of the multiple novel hiPSC lines, major concerns remain as to the appropriateness of the control hiPSC lines. The manuscript should include a table describing the age and sex matching as well as mode of reprogramming for all control and patient lines. Patient and control lines should be matched as closely as possible. Furthermore, figure legends should clearly indicate which clones and lines are shown in the various figure panels.

      We agree with the reviewer that hiPSC variability is an important concern in the field. In order to minimize such effects, all iPSCs lines used in this study were generated following the same protocol in the same lab. All cell lines are derived from male donors, thus, eliminating sex-based variability. Further, there is no report of sex-based variance in the clinical phenotype of PCH2a children and this finding is further corroborated by a currently on-going natural history study in our research team. While it would be ideal to also have age-matched controls, this is not possible for ethical reasons as skin biopsies from healthy children cannot easily be obtained to match the pediatric PCH2a cases. However, based on the literature, we believe that epigenetic age is erased upon reprogramming (Strassler et al 2018, Studer et al 2015). Following the reviewer’s recommendation, we provide a table that clearly indicates the origin of all six cell lines used (see Methods section) and information of respective lines was added to the figure legends as suggested by the reviewer.

      As the hiPSC lines used are not isogenic, it is important that the authors characterise these lines further. This should include a quantification of the rates proliferation and apoptosis in all used hiPSC lines, as these might impact the growth rate of the embryoid bodies / organoids.

      We thank the reviewer for raising this concern. To address the variability of hiPSC lines, we performed an extensive characterization of pluripotency, proliferation and cell cycle dynamics of all six hiPSC lines through immunocytochemistry against pluripotency marker OCT4, proliferation marker Ki-67 and EdU incorporation experiments. We further assessed the apoptosis rate of hiPSCs by staining against apoptotic marker cCas3. These experiments were carried out in three consecutive passages of all iPSC lines providing statistical power to the analyses. All experiments did not result in significant differences between PCH2a and control iPSC lines (see Figure 2).

      The authors state that the hiPSC lines have been characterised by SNP arrays to show that no genomic / chromosomal aberrations have been accrued due to reprogramming. The manuscript should include information as to when the SNP array was performed (i.e., immediately after reprogramming, after initial passaging, etc) and also include the results of the SNP array as additional information. What passage were the hiPSC when the presented experiments were carried out?

      In agreement with this comment, we provide data of SNP arrays that were performed to ensure the chromosomal integrity of all cell lines (see supplement). Further, we added details on passages of the cell lines in the respective figure legends as suggested by the reviewer. In brief, all cell lines were kept below passage 20 and were subjected to pluripotency testing before differentiations were started.

      Given that TSNE54 is broadly and strongly expressed in the developing nervous system, the very limited staining of the organoids for TSNE54 in Figure 2 is surprising. Can the authors provide an explanation for the fact that TSNE54 is only expressed in a small subset of cells? Which cell types are these? Moreover, high-magnification images should be shown to demonstrate subcellular staining pattern of TSNE54. Quantification of TSNE54 protein levels by immunoblotting would also be beneficial.

      Related to this observation, it is puzzling that the large size differences that the authors observe in their organoids would be driven by such a small number of TSNE54-expressing cells. How do the authors explain this discrepancy?

      We thank the reviewer for this comment. We have carefully assessed human cerebellar development transcriptomic datasets which demonstrate that TSEN54 is in fact not strongly but moderately expressed in the human developing nervous system. Additionally, TSEN54 expression is expressed in various different cell types (not limited to a subset of cell types) (Aldinger et al 2021, Sepp et al 2021). We agree with this reviewer and reviewer 3 that Western Blotting or other types of quantification would be informative as well as investigation of the subcellular localization of the protein. However, these questions go beyond the scope of the current manuscript, which aims to present a disease model. We have therefore decided to remove the characterization of TSEN54 expression in organoids from our revised manuscript.

      The generated organoids need to be better characterised with a broader range of markers using both qPCR and immunostaining. At the moment, their identity as "cortical" and "cerebellar" organoids remain unconvincing. This is particularly true for cerebellar organoids, which are challenging to generate and are not widely used. The authors should include additional markers (for example, see PMIDs 25640179, 29397531, 32117945) and immunostaining should clearly show expected staining patterns.

      In Figure 5, it appears that some markers (e.g., SATB2) are expressed differently between control and patient lines, yet this is not commented on by the authors who conclude that control and patient lines show differentiation into organoids.

      We thank the reviewer for this suggestion. We performed further immunostainings using the markers that were used in other cerebellar organoid papers (Muguruma et al 2015, Silva et al 2020, Watson et al 2018) as the reviewer suggested. In detail, we added immunohistochemistry experiments on Day 30 and Day 50 of differentiation for early Purkinje cell markers OLIG2 and SKOR2. We also included ATOH1 as a marker for rhombic lip-derived granule cells. For the neocortical organoids, we believe that the performed characterization is sufficient since the protocol we used is well-established and widely used as also indicated by the reviewer. We agree that the cellular composition of the organoids should be investigated in detail (for instance using single-cell transcriptomics). However, we believe this is out of the scope of this manuscript, which describes the establishment of a brain-region specific model platform.

      The authors attempt to look into a potential mechanism for the size differences observed between control and patient organoids. However, only cleaved caspase-3 is used as a marker for apoptosis and no differences were observed. The authors should include further markers for potential cell death. In addition, immunostaining for proliferation markers (i.e., KI67) should be performed to evaluate whether the difference in organoid size could stem from decreased proliferation rather than increased cell death.

      We agree with the reviewer and included a quantification of the proliferation marker Ki-67 within the SOX2 positive population of cerebellar and neocortical organoids as well as the quantification of SOX2 positive areas within the organoids (Figure 6). We observed significant differences in proliferation between PCH2a and control cerebellar organoids. Moreover, we also analyzed the morphology of organoids and quantified the thickness and number of rosettes and find significant differences between control and PCH2a cerebellar organoids corroborating the notion that proliferation is altered in cerebellar organoids. Neocortical organoids do not show any significant differences in proliferation and Sox2+ structures. Only the thickness of the Sox2+ areas is slightly decreased in neocortical PCH2a organoids compared to controls. In order to deepen our analysis of a possible increased apoptosis in PCH2a organoids, we also quantified cCas3 in Sox2+ structures (Figure 5) as also suggested by Reviewer 2. These analyses did not show any significant differences between PCH2a and control organoids. We therefore suggest that at the early stages of differentiation studied here, proliferative differences are the main reason for the size differences between PCH2a and control organoids.

      Reviewer #1 (Significance (Required)):

      The authors present an innovative approach to study neurodevelopmental disorders using brain organoids and should be of interest to researchers and clinicians working on neurodevelopmental diseases. However, the data presented are too limited to support any conclusions about the phenotype observed. Furthermore, questions remain about the used methodology and more work is needed to demonstrate the successful generation of both cortical and cerebellar organoids.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Please find enclosed my recommendation for the paper submitted by Kagermeier et al entitled' Human organoid model of PCH2a recapitulates brain region-specific pathology'. It describes the development of a human model for PCH2a and its characterization. My overall assessment of the paper is 'Major revision' which is explained below.

      Although the paper is very well written and clearly interesting in that it describes the generation and initial analyses of a human organoid model for PCH2a it should be revised such that it will proof the points it is trying to make. The authors are meticulous in their studies combining cellular characterization and a thorough initial screen of organoid (both cerebellar as well as cortical) integrity, yet hardly any mechanistic data is provided. Nevertheless, if the authors are able to add additional experiments and are able to address the points raised, the reviewer may be willing to consider a more positive outcome.

      Major concerns

      1) The overall quality of the figures is poor. There is a lot of overexposure such that often cellular or tissue structures are blended. It starts with Figure 1 G and H but can be observed throughout the manuscript. Deconvolution would greatly enhance their results.

      We are thankful for this comment and we have improved the quality of all microscopy images.

      2) Especially figure 4 and 5 could have been complemented with quantitative data. It furthermore seems more supplemental figure as these are just proof-of-principle stainings. No conclusions can be drawn from the panels except that all markers are there in the various conditions. And while they are showing a neural rosette in Fig 4A, just tiny ones can be observed in 4B. It is also not clear what the whole mount IHC ads in comparison to the IHC on sections. It is also strange that there is still a lot of SOX2 in the CALB/MAP2-positive area, but again with this magnification hard to appreciate.

      We agree with the reviewer that so far we presented qualitative proof-of-principle stainings that demonstrate cerebellar and neocortical differentiation, respectively. In order to address the comment of the reviewer, we improved the quality of the images and also provided higher magnification and enhanced resolution. Additionally, we now provide detailed quantifications of SOX2+ and Ki67+ neural progenitor cells and show that differences observed between PCH2a and control cerebellar organoids may explain the size differences observed between organoids (Figure 6). Our study provides the basis for more in-depth analysis of differences in differentiation and cell type composition between PCH2a and control organoids in the future, for example through single-cell RNAseq.

      3) If the authors would like to proof the point that cerebellar/cortical development is hampered, more functional assays could have been done. Nothing is analyses on the fraction of progenitor cells present (such as the percentage of Tbr2+ IPC in VZ/CP). Furthermore, if there is a suspicion that the number of cells is affected (which is also not shown), proliferation/cell cycle exit experiments using BrdU/EdU should have been performed. Early cell cycle exit still cannot be rules out and should have been tested by the combination of Ki67-/EdU+ percentage of a certain faction of progenitor cells (eg PAX6+ pool).

      We thank the reviewer for this valuable suggestion and agree that it would be interesting to carry out respective experiments. In this study, we show the establishment of a brain-region-specific organoid platform as a disease model for PCH2a and are only at the beginning of deciphering the underlying mechanism. In the revised manuscript, we quantified Ki-67+/Sox2+ cells in proliferative zones in the organoids. We believe that future studies including BrdU / EdU incorporation assays as well as scRNA-seq will answer the questions raised here and decipher the disease-causing mechanism on both cellular and molecular levels but are beyond the scope of this manuscript.

      4) Instead the author chose to only perform a cCas3 staining. From the panels in Figure 6 it is hard to appreciate which cells are actually cCas3+. Also the analyses were performed on the total pool of cell while it might have been more interesting to look for cell death of the various progenitor pools (eg the SOX2+ pool).

      We agree with the reviewer that a more in-depth analysis of apoptotic cell populations is interesting and performed cCas3/Sox2+ quantification for cerebellar and neocortical organoids. We did not observe significant differences of cCas3 expression within the SOX2+ cell population. (Figure 5)

      Minor concerns

      1) It would greatly enhance the review process if line numbers are added

      We have added line numbers to the manuscript.

      2) On general concepts (such as the generation of organoids in the context of disease) more references could have been added

      We have added more references and discussed the topic of brain organoids as disease models as suggested by this reviewer (Eichmüller & Knoblich 2022, Khakipoor et al 2020, Velasco et al 2020).

      Figures

      Fig. 1: In A, the square is clearly visible and not similar to B. An annotation of which is the control and which is the patient is missing in the figure. The arrows are hardly visibly, would make them slightly bigger and remove the black outer lining. Figure 1C can easily go to the Supplemental material. Fig 1 D is hard to appreciate the staining, a close-up with bright field microscope will help. E-I Most of the panels but especially G and H are overexposed. In J, it is hard to appreciate the TSEN54 staining. Maybe separate channels and a merge?

      We thank the reviewer for bringing these details to our attention. We have changed the arrows in the figure to enhance their visibility. Further we have adjusted the quality of the images overall. Lastly, we have made a comment in the figure legend clearly stating which scan came from which child. The described square was added to hide facial features of the imaged individuals hence they are not identical.

      Fig. 3: Usually go into the supplementals.

      Since organoid size is a major first readout when modeling a disorder that is characterized by a reduction of the volume of specific brain regions, we decided to keep this readout in the main text.

      Fig 4/5: Lack of quantitative data and poor quality of figures (overexposure).

      Fig 6: Many of the SOX2 panels are overexposed

      We thank the reviewer for the suggestions on the figures and addressed the concerns in the revised manuscript.

      CROSS-CONSULTATION COMMENTS

      I completely agree with reviewers #1 and #3. It is good to notice that we are overall on the same page.

      Reviewer #2 (Significance (Required)):

      The authors definitely made an excellent start to model PCH2a. Three controls and three patient lines are good to begin with but isogenic controls using one parental line and a patient line where the mutation is fixed would have been ideal. It is interesting that there seem to be a brain area specific pathology of the phenotype. Yet, more thorough analyses could have been performed such as proliferation and differentiation and cell cycle exit experiments. As for now the mostly descriptive data are only scratching the surface and little can be concluded on the molecular framework they are trying to solve.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this study Kagermeier et al. use human cerebellar and neocortical organoids to investigate the effects of the PCH2a-causing homozygous TSEN54c.919G>T variant on the neurodevelopment of different brain regions. They reveal a substantial growth defect in both neocortical and cerebellar regions with a more profound phenotype in the cerebellum. They continue to investigate major cell types of neurodevelopment in both regions and briefly potential mechanisms underlying the phenotypes. The study is well conceived and addresses the current gap of disease-modeling in cerebellar organoids; nevertheless, some major claims are not sufficiently substantiated in the current version. Below, I provide suggestions on how to improve the manuscript with some additional minor comments that might help with readability and accessibility of the work.

      Major comments:

      1. TSEN54 expression levels: The authors compare RNA and protein expression levels for TSEN54 to investigate the mutation's effect. For this the authors use qPCR on iPSCs and organoids of different age and immunostainings and conclude "we did not find differences in expression between cell and tissue types". There are some issues with this analysis as explained below:

      -The qPCR data (Fig. 2B) is first normalized to a housekeeping gene (GAPDH), however, then all organoid data are additionally normalized to the respective iPSC line. Thus, in case there is already a difference on iPSC level, this normalization might mask any difference in the organoids. It is unclear why this approach was chosen, and it seems more appropriate to show the data just normalized to GAPDH than additionally normalizing to the iPSCs, or at least to show first that iPSCs do not have differences in TSEN54 expression. Furthermore, even though apparently not statistically significant there seems to be a strong trend of lower TSEN54 levels in PCH2a in neocortical organoids, but even more so in cerebellar organoids. In my view this would fit very well with the study and should be further explored before concluding there is no statistical difference. Considering the high error bars of the cerebellar organoid samples, a higher N-number might be necessary to reach statistical significance in the difference in expression. Most importantly, it would be appropriate to show single data points where possible and to mark the different cell lines (as done in other figures), as otherwise it is not possible to judge whether there is a cell line bias in the data.

      -The evidence for protein expression of TSEN54 is immunofluorescence stainings for all conditions. As there is no quantification, the authors should not conclude differences, or the lack thereof, based on this qualitative data. Furthermore, in fact in the on example shown the PCH2a cerebellar condition (Fig 2D) seems to show lower expression levels compared with other conditions. This could be due to the selected image, as all other examples include large neural rosettes with strong staining in the center of the rosettes. Furthermore, it is unclear what cell line these stainings come from, even whether the PCH2a cerebellar and neocortical stainings come from the same cell line. Thus, the authors should select comparable examples for all conditions, and ideally provide staining examples (e.g., as supplementary data) for the other replicates to ensure expression in all replicates. If the authors want to comment on differences in protein expression, maybe a quantitative approach (e.g., quantitative western blot) would be more appropriate. Otherwise, the statements should be adjusted to not conclude whether TSEN54 protein levels differ or not.

      -Irrespective of the above comments the conclusion of the section "TSEN54 expression in cerebellar and neocortical organoids", that currently reads "we did not find differences in expression between cell and tissue types" should be changed, as the authors did not investigate whether there are cell type-specific differences of TSEN54 expression.

      We thank the reviewer for this comment. We agree that the provided data is not suitable for quantitative analysis of TSEN54 expression. Please also see our related response to the similar concern raised by reviewer 1. Thanks to these suggestions, we have decided to exclude the TSEN54 expression data from the current manuscript as a detailed analysis should be part of an extensive future study.

      Organoid growth analysis:

      The organoid growth analysis in Figure 3 and supplementary Figure 2 shows the main phenotype of the study that seems to be very strong. The authors use unpaired t-tests to compare within the different timepoints. Unfortunately, I think this approach might not be appropriate as even though the Welch correction does not rely on similar SDs in the compared groups (Control vs. PCH2a), it still assumes that all data points within each group share the same variance. However, this is not the case, as e.g., the control condition includes three groups (Control-1 to -3), that between groups might have different variance as such not all datapoints are independent from each other. Potentially ANOVA analyses controlling for cell line and timepoint might be more appropriate. Or additionally, the authors could consider using the linear regression analysis in Supplementary Figure 2 to further investigate the difference in organoid growth by e.g., comparing the slope of the regression lines. This might be more appropriately reflecting the growth deficit over time than simply comparing each timepoint individually. Expanding on this analysis the regression analysis requires some more information on the fit (intercept, slope, R-squared of the model), which would help clarifying the growth dynamics in the different systems and conditions.

      We thank the reviewer for the suggestions on statistical analysis and adjusted our approach accordingly. Briefly we performed 3-way-ANOVA analysis for the growth curves which revealed no significant differences between the different lines within the groups (Control or PCH2a) at different time points. Additionally, we added the linear regression model to the results (See Figure 3 and supplementary table 2, with the information on the curve fit).

      The growth ratio analysis (Figure 3D) is essential to the major claim of the paper that the organoids replicate the region-specific differences. As the authors performed all experiments with matching cell lines this could additionally strengthen the argument by generating the ratio of size differences for each cell line separately (instead of just for all PCH2a lines together). This would allow comparison of the same genetic background in both cerebellar and neocortical condition and further corroborate the region-specific severity of the phenotype. Potentially, this would also enable to test these differences statistically.

      We appreciate the suggestion to compare the differentiation protocols by line. Below we display the line-by-line analysis between the two differentiation protocols at D30 (A), D50 (B), and D90 (C). In order to visualize the differences in size between the two protocols more clearly, we have generated ratios of the average organoid sizes between neocortical and cerebellar organoids (D). The analysis corroborates our previous visualizations and statistics (3-way ANOVA) by showing that PCH lines produce neocortical and cerebellar organoids that differ in size more than those of control lines. The differences are most pronounced at D30 and D90. However, we believe that this analysis does not add additional value to our manuscript and have therefore decided not to include it in the revised version.

      Additionally, all growth analyses for the neocortical organoids (Figure 3C, Supplementary Figure 2B and C) seem to lack the PCH-1 cell line and only contain PCH-2 and PCH-3. This cell line should be added or commented on why it was excluded from the analyses.

      We agree with the reviewer. Unfortunately, we experienced contamination in that specific differentiation and therefore cannot provide the data. We have made a related comment in the manuscript. Since all differentiations were performed in parallel, adding this line at a later time point would add additional confounders and is therefore undesirable.

      Potential mechanism of the phenotype (apoptosis analysis):

      In Figure 6 the authors investigate the hypothesis that increased apoptosis contributes to the phenotypes. In the cleaved Caspase 3 staining there appear to be no differences. Unfortunately, the analysis apparently only includes one replicate (one organoid?) per cell line and condition. Considering the variability in the data shown this seems inappropriately low and should ideally contain ~3 replicates per cell line condition to judge technical and biological variability if the authors want to make the point that there is no "significant difference between PCH2a and control organoids at any time point in both cerebellar and neocortical organoids". Otherwise, this claim does not seem to be substantiated enough by the data.

      Finally, due to the absence of a phenotype related to apoptosis the authors conclude that the phenotypes may be due to "deficits in the proliferation of progenitor cells". Although this is mentioned in the introduction and the discussion, there is no evidence in the current study that supports this interesting idea. By adding relatively straight forward co-staining experiments for e.g., SOX2 (progenitors) and Ki67 (proliferating cells), the authors could provide further evidence for this hypothesis using existing organoid sections. This would support this speculative idea and could add a more mechanistic insight to the study, thereby making it more exciting.

      To address this concern, we have now added a table to the supplement that described in detail which organoids / batches / cell lines were used for which experiment (Supplementary table 3). In addition to our previous cCas3 quantifications, we performed the quantification of cCas3 within the population of SOX2-positive cells, which was suggested by Reviewer 2 (Figure 5).

      To assess the alternative hypothesis, that proliferation deficits account for the size differences observed between organoids, we also performed quantifications of SOX2-positive zones in the organoids at D30 and D50 of differentiation as well as quantifications of Ki-67 positive cells within the SOX2-positive population. For cerebellar organoids we found significant differences in these experiments (Figure 6). We believe that this data supports the hypothesis of aberrant proliferation in PCH2a cerebellar organoids explaining the size differences.

      Minor comments:

      • Cell line and quality control: The authors recruit three male patients with PCH2a and reprogram iPSCs. These cell lines are subjected to a well performed extensive quality control. However, it is unclear what cell lines the stainings (e.g., Fig. 1D to I) originate from. Furthermore, the supplementary qPCR analysis (Supplementary Figure 1) includes only the PCH-1 line, and additionally two cell lines that are not explained (F-CO and hESC-I3). It is unclear what the relevance of showing the qPCR of these cell lines is. To ensure proper QC for all used cell lines the authors should provide data for all cell lines (PCH-1 to -3 and control-1 to -3), or at least summarize (e.g., in a table) what QC metrics were applied to which cell line. Most importantly, this information is completely lacking for the control cell lines and the QC is just mentioned in the text. Unfortunately, it is unclear where the control cell lines originate from, and some basic information would be required to judge whether they are appropriate controls: are they iPSC or ESC, were they reprogrammed with a similar paradigm as the PCH2a cells, what is the gender of the control cell lines (all PCH2a cell lines are apparently male)?

      In line with a similar comment from reviewer 1, we have included a table that provides information on the origin of all six cell lines used in the revised manuscript (methods section). Further we provide SNP-Array data on all cell lines as supplementary material. We also performed detailed characterization of pluripotency, proliferation and cell cycle dynamics of all six hiPSC lines through immunocytochemistry against pluripotency marker OCT4, proliferation marker Ki-67 and EdU incorporation experiments (Figure 2). We further assessed the apoptosis rate of hiPSCs by staining against apoptotic marker cCas3. All experiments did not result in significant differences between PCH2a and control iPSC lines (see Figure 2). In line with the suggestion of this reviewer, we removed the qPCR analysis of iPSCs from the manuscript.

      • To make the study more approachable for a medical audience and to judge the variability in phenotype presentation among the recruited patients it would be appreciated if more information on the patients would be provided. The authors write: "We identified three individuals that display the genetic, clinical and brain imaging features previously described for PCH2a.". This information including age/date of birth, as well as other medically relevant information could be provided in the supplementary figure (e.g., is there a difference in disease burden among the different patients?). This would allow judging the recruited cohort better.

      We thank the reviewer for this insightful comment. We provided a table with detailed clinical information (supplementary table 1).

      • According to the method section the cerebellar and neocortical organoids were cultured in very different medium especially at later timepoints. While neocortical organoids were kept in a neural maintenance medium based on Neurobasal-A, cerebellar organoids were kept in a medium based on BrainPhys. These media contain very different levels of nutrients, especially of glucose (25mM vs 2.5mM, Bardy et al. 2015). This can have a strong phenotype on proliferation of progenitors and proliferative phenotypes (e.g., see Eichmüller et al. 2022). Especially as the authors claim that there is a difference in the PCH2a phenotypes between brain regions, it should be excluded that this is due to medium differences at later timepoints. When investigating the growth curves of Figure 3B and C it seems like the major difference in growth speed seems to be that neocortical organoids grow faster in early timepoints (We agree that media composition can greatly influence growth dynamics of cells in 2D and 3D. However, in this study we assess the differences between two groups: the PCH2a and control iPSC-derived organoids. The differences we describe are in relation to the respective control group and iPSCs were generated following the same protocol in the same lab. We believe that by following two protocols and comparing the three PCH2a to the three control lines within each protocol predominantly, we account for different media composition possibly changing growth dynamics.

      • Staining examples shown and presentation: In several figures the authors could improve the presentation of the staining examples with some changes:

      o Cell line information for images: as the authors only ever note the condition (PCH2a or Control) but not the cell line it is unclear if the stainings all come from one cell line or from multiple different cell lines. This prevents comparing the different differentiation conditions. Additionally, for major conclusions the authors should consider including supplemental stainings or further information on how reproducible the results shown are (how many cell lines and batches were used?).

      We thank the reviewer for these suggestions. We added information on cell lines and passages for all experiments shown in this study in the figure legends. Moreover, we also added a table providing information on n-numbers for all experiments (supplementary table 3).

      o Selection of examples: in several cases (Fig 2C/D, 4A, 6A/B) the selected images depict very different regions, e.g., one condition shows a large rosette, while in the other condition no rosette can be seen. It would be more appropriate to show matching examples where possible.

      We agree with the reviewer and have chosen matched regions of interest in the figure panels in the revised version of the manuscript. Please note that for cerebellar organoids we observed a significant difference in the timepoint of appearance of these rosette-like structures. Therefore, an exact matching of regions of interest was not possible due to biological differences between the samples, which we have also quantified (Figure 6).

      o Color code of stainings: Colors do not match throughout the manuscript in immunofluorescence images. E.g., Fig. 4 uses blue, green, red, magenta and Fig. 5 uses blue, green, magenta, cyan. It would be preferable to adhere to one color code. Considering significant fraction of the population is having red-green blindness, the latter color code seems more appropriate as it should ensure readability also for color-blind audiences.

      We are thankful for this comment. We changed the color code to make figures more widely accessible.

      • Small typos:

      o Figure 1 legend: last sentence "The" instead of "Th"

      o Supplementary Figure 1B: PCH-2 is named "PCH-22"

      o Supplementary Figure 2: As in the main figure for neocortical organoids the PCH-1 condition is missing (see comment on organoid growth curves). Additionally, the color/shape code of the plots in B does not always match the legend (e.g., size in left plot is different and color of PCH-3 in middle and left plot differs from legend and right plot).

      o It is unclear why the cortical organoids are referred to as "neocortical organoids" in the figures and the text. The methods and the reference in the methods as well as all major papers rather use the word "cortical".

      We addressed these suggestions and thank the reviewer for bringing these to our attention. Unfortunately, we could not include data on PCH-01 in neocortical differentiation due to a contamination in this batch. We made sure to run all the batches presented here in parallel so that all conditions are equivalent, preventing us from including a different batch at a later time point.

      We believe that in the context of our study, it is important to highlight cortical organoids as neocortical organoids, because we are also showing cerebellar organoids and there is also a cerebellar cortex.

      References:

      Bardy, C. et al. Neuronal medium that supports basic synaptic functions and activity of human neurons in vitro. Proc National Acad Sci 112, E3312 (2015).

      Eichmüller, O. L. et al. Amplification of human interneuron progenitors promotes brain tumors and neurological defects. Science 375, (2022).

      CROSS-CONSULTATION COMMENTS

      I agree with the comments of the other reviewers and as they are mostly matching, this reinforces the importance to improve certain aspects of the manuscript. As there are no deviating issues I do not comment specifically on any reviewer comments.

      Reviewer #3 (Significance (Required)):

      This work is using organoid technology to shed light on brain region-specific phenotypes in PCH2a. Brain organoids have drastically changed the way we study human neurological diseases (Eichmüller and Knoblich 2022), however, most brain organoid research has focused on cortical organoids. Cerebellar organoid protocols exist for some time (Muguruma et al. 2015, Silva et al. 2020, Nayler et al. 2021) but were not yet applied to uncover new disease biology. Especially considering the important role of human-specific cerebellar processes in specific developmental disorders (Haldipur et al. 2021) and cancer (Hendrikse et al. 2022, Smith et al. 2022), disease modeling in human cerebellar organoids holds great potential for understanding disease biology. The work by Kagermeier et al. demonstrates that human cerebellar organoids are recapitulating brain region-specific growth deficits and thus is an important step forward for disease modeling. Therefore, this work will be interesting to researchers working on brain development and disease modeling, especially in in-vitro systems. Nevertheless, the mechanistic insight of the study is limited, as is the insight into how human-specific processes might be involved in the pathogenesis of PCH2a. Therefore, it will be interesting how this disease model will be used in future to investigate the cell types and mechanisms involved in the PCH2a phenotype.

      Personal field of expertise: Brain organoids and disease modeling in organoids especially of neurodevelopmental diseases. Analysis of organoids with stainings, as well as sequencing techniques, and bioinformatics.

      References:

      Eichmüller, O. L. & Knoblich, J. A. Human cerebral organoids - a new tool for clinical neurology research. Nat Rev Neurol 1-20 (2022) doi:10.1038/s41582-022-00723-9.

      Haldipur, P. et al. Evidence of disrupted rhombic lip development in the pathogenesis of Dandy-Walker malformation. Acta Neuropathol 142, 761-776 (2021).

      Hendrikse, L. D. et al. Failure of human rhombic lip differentiation underlies medulloblastoma formation. Nature 609, 1021-1028 (2022).

      Muguruma, K., Nishiyama, A., Kawakami, H., Hashimoto, K. & Sasai, Y. Self-Organization of Polarized Cerebellar Tissue in 3D Culture of Human Pluripotent Stem Cells. Cell Reports 10, 537-550 (2015).

      Nayler, S., Agarwal, D., Curion, F., Bowden, R. & Becker, E. B. E. High-resolution transcriptional landscape of xeno-free human induced pluripotent stem cell-derived cerebellar organoids. Sci Rep-uk 11, 12959 (2021).

      Silva, T. P. et al. Scalable Generation of Mature Cerebellar Organoids from Human Pluripotent Stem Cells and Characterization by Immunostaining. J Vis Exp (2020) doi:10.3791/61143.

      Smith, K. S. et al. Unified rhombic lip origins of group 3 and group 4 medulloblastoma. Nature 609, 1012-1020 (2022).

      References by the authors

      Aldinger KA, Thomson Z, Phelps IG, Haldipur P, Deng M, et al. 2021. Spatial and cell type transcriptional landscape of human cerebellar development. Nat Neurosci 24: 1163-75

      Eichmüller OL, Knoblich JA. 2022. Human cerebral organoids — a new tool for clinical neurology research. Nature Reviews Neurology 18: 661-80

      Khakipoor S, Crouch EE, Mayer S. 2020. Human organoids to model the developing human neocortex in health and disease. Brain Res 1742: 146803

      Muguruma K, Nishiyama A, Kawakami H, Hashimoto K, Sasai Y. 2015. Self-organization of polarized cerebellar tissue in 3D culture of human pluripotent stem cells. Cell Rep 10: 537-50

      Sepp M, Leiss K, Sarropoulos I, Murat F, Okonechnikov K, et al. 2021.

      Silva TP, Fernandes TG, Nogueira DES, Rodrigues CAV, Bekman EP, et al. 2020. Scalable Generation of Mature Cerebellar Organoids from Human Pluripotent Stem Cells and Characterization by Immunostaining. J Vis Exp

      Strassler ET, Aalto-Setala K, Kiamehr M, Landmesser U, Krankel N. 2018. Age Is Relative-Impact of Donor Age on Induced Pluripotent Stem Cell-Derived Cell Functionality. Front Cardiovasc Med 5: 4

      Studer L, Vera E, Cornacchia D. 2015. Programming and Reprogramming Cellular Age in the Era of Induced Pluripotency. Cell Stem Cell 16: 591-600

      Velasco S, Paulsen B, Arlotta P. 2020. 3D Brain Organoids: Studying Brain Development and Disease Outside the Embryo. Annu Rev Neurosci 43: 375-89

      Watson LM, Wong MMK, Vowles J, Cowley SA, Becker EBE. 2018. A Simplified Method for Generating Purkinje Cells from Human-Induced Pluripotent Stem Cells. Cerebellum 17: 419-27

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary: In this study Kagermeier et al. use human cerebellar and neocortical organoids to investigate the effects of the PCH2a-causing homozygous TSEN54c.919G>T variant on the neurodevelopment of different brain regions. They reveal a substantial growth defect in both neocortical and cerebellar regions with a more profound phenotype in the cerebellum. They continue to investigate major cell types of neurodevelopment in both regions and briefly potential mechanisms underlying the phenotypes. The study is well conceived and addresses the current gap of disease-modeling in cerebellar organoids; nevertheless, some major claims are not sufficiently substantiated in the current version. Below, I provide suggestions on how to improve the manuscript with some additional minor comments that might help with readability and accessibility of the work.

      Major comments: 1. TSEN54 expression levels: The authors compare RNA and protein expression levels for TSEN54 to investigate the mutation's effect. For this the authors use qPCR on iPSCs and organoids of different age and immunostainings and conclude "we did not find differences in expression between cell and tissue types". There are some issues with this analysis as explained below: -The qPCR data (Fig. 2B) is first normalized to a housekeeping gene (GAPDH), however, then all organoid data are additionally normalized to the respective iPSC line. Thus, in case there is already a difference on iPSC level, this normalization might mask any difference in the organoids. It is unclear why this approach was chosen, and it seems more appropriate to show the data just normalized to GAPDH than additionally normalizing to the iPSCs, or at least to show first that iPSCs do not have differences in TSEN54 expression. Furthermore, even though apparently not statistically significant there seems to be a strong trend of lower TSEN54 levels in PCH2a in neocortical organoids, but even more so in cerebellar organoids. In my view this would fit very well with the study and should be further explored before concluding there is no statistical difference. Considering the high error bars of the cerebellar organoid samples, a higher N-number might be necessary to reach statistical significance in the difference in expression. Most importantly, it would be appropriate to show single data points where possible and to mark the different cell lines (as done in other figures), as otherwise it is not possible to judge whether there is a cell line bias in the data. -The evidence for protein expression of TSEN54 is immunofluorescence stainings for all conditions. As there is no quantification, the authors should not conclude differences, or the lack thereof, based on this qualitative data. Furthermore, in fact in the on example shown the PCH2a cerebellar condition (Fig 2D) seems to show lower expression levels compared with other conditions. This could be due to the selected image, as all other examples include large neural rosettes with strong staining in the center of the rosettes. Furthermore, it is unclear what cell line these stainings come from, even whether the PCH2a cerebellar and neocortical stainings come from the same cell line. Thus, the authors should select comparable examples for all conditions, and ideally provide staining examples (e.g., as supplementary data) for the other replicates to ensure expression in all replicates. If the authors want to comment on differences in protein expression, maybe a quantitative approach (e.g., quantitative western blot) would be more appropriate. Otherwise, the statements should be adjusted to not conclude whether TSEN54 protein levels differ or not. -Irrespective of the above comments the conclusion of the section "TSEN54 expression in cerebellar and neocortical organoids", that currently reads "we did not find differences in expression between cell and tissue types" should be changed, as the authors did not investigate whether there are cell type-specific differences of TSEN54 expression.

      1. Organoid growth analysis: The organoid growth analysis in Figure 3 and supplementary Figure 2 shows the main phenotype of the study that seems to be very strong. The authors use unpaired t-tests to compare within the different timepoints. Unfortunately, I think this approach might not be appropriate as even though the Welch correction does not rely on similar SDs in the compared groups (Control vs. PCH2a), it still assumes that all data points within each group share the same variance. However, this is not the case, as e.g., the control condition includes three groups (Control-1 to -3), that between groups might have different variance as such not all datapoints are independent from each other. Potentially ANOVA analyses controlling for cell line and timepoint might be more appropriate. Or additionally, the authors could consider using the linear regression analysis in Supplementary Figure 2 to further investigate the difference in organoid growth by e.g., comparing the slope of the regression lines. This might be more appropriately reflecting the growth deficit over time than simply comparing each timepoint individually. Expanding on this analysis the regression analysis requires some more information on the fit (intercept, slope, R-squared of the model), which would help clarifying the growth dynamics in the different systems and conditions. The growth ratio analysis (Figure 3D) is essential to the major claim of the paper that the organoids replicate the region-specific differences. As the authors performed all experiments with matching cell lines this could additionally strengthen the argument by generating the ratio of size differences for each cell line separately (instead of just for all PCH2a lines together). This would allow comparison of the same genetic background in both cerebellar and neocortical condition and further corroborate the region-specific severity of the phenotype. Potentially, this would also enable to test these differences statistically. Additionally, all growth analyses for the neocortical organoids (Figure 3C, Supplementary Figure 2B and C) seem to lack the PCH-1 cell line and only contain PCH-2 and PCH-3. This cell line should be added or commented on why it was excluded from the analyses.

      2. Potential mechanism of the phenotype (apoptosis analysis): In Figure 6 the authors investigate the hypothesis that increased apoptosis contributes to the phenotypes. In the cleaved Caspase 3 staining there appear to be no differences. Unfortunately, the analysis apparently only includes one replicate (one organoid?) per cell line and condition. Considering the variability in the data shown this seems inappropriately low and should ideally contain ~3 replicates per cell line condition to judge technical and biological variability if the authors want to make the point that there is no "significant difference between PCH2a and control organoids at any time point in both cerebellar and neocortical organoids". Otherwise, this claim does not seem to be substantiated enough by the data. Finally, due to the absence of a phenotype related to apoptosis the authors conclude that the phenotypes may be due to "deficits in the proliferation of progenitor cells". Although this is mentioned in the introduction and the discussion, there is no evidence in the current study that supports this interesting idea. By adding relatively straight forward co-staining experiments for e.g., SOX2 (progenitors) and Ki67 (proliferating cells), the authors could provide further evidence for this hypothesis using existing organoid sections. This would support this speculative idea and could add a more mechanistic insight to the study, thereby making it more exciting.

      Minor comments: - Cell line and quality control: The authors recruit three male patients with PCH2a and reprogram iPSCs. These cell lines are subjected to a well performed extensive quality control. However, it is unclear what cell lines the stainings (e.g., Fig. 1D to I) originate from. Furthermore, the supplementary qPCR analysis (Supplementary Figure 1) includes only the PCH-1 line, and additionally two cell lines that are not explained (F-CO and hESC-I3). It is unclear what the relevance of showing the qPCR of these cell lines is. To ensure proper QC for all used cell lines the authors should provide data for all cell lines (PCH-1 to -3 and control-1 to -3), or at least summarize (e.g., in a table) what QC metrics were applied to which cell line. Most importantly, this information is completely lacking for the control cell lines and the QC is just mentioned in the text. Unfortunately, it is unclear where the control cell lines originate from, and some basic information would be required to judge whether they are appropriate controls: are they iPSC or ESC, were they reprogrammed with a similar paradigm as the PCH2a cells, what is the gender of the control cell lines (all PCH2a cell lines are apparently male)?

      • To make the study more approachable for a medical audience and to judge the variability in phenotype presentation among the recruited patients it would be appreciated if more information on the patients would be provided. The authors write: "We identified three individuals that display the genetic, clinical and brain imaging features previously described for PCH2a.". This information including age/date of birth, as well as other medically relevant information could be provided in the supplementary figure (e.g., is there a difference in disease burden among the different patients?). This would allow judging the recruited cohort better.

      • According to the method section the cerebellar and neocortical organoids were cultured in very different medium especially at later timepoints. While neocortical organoids were kept in a neural maintenance medium based on Neurobasal-A, cerebellar organoids were kept in a medium based on BrainPhys. These media contain very different levels of nutrients, especially of glucose (25mM vs 2.5mM, Bardy et al. 2015). This can have a strong phenotype on proliferation of progenitors and proliferative phenotypes (e.g., see Eichmüller et al. 2022). Especially as the authors claim that there is a difference in the PCH2a phenotypes between brain regions, it should be excluded that this is due to medium differences at later timepoints. When investigating the growth curves of Figure 3B and C it seems like the major difference in growth speed seems to be that neocortical organoids grow faster in early timepoints (<d30), but similar at later timepoints, which would exclude effects of the media at late timepoints. Nevertheless, considering the strong effect media glucose concentration can have the authors should investigate whether there is an effect at growth speed at later timepoints by comparing control organoids. This could also strengthen the region-specific phenotype due to PCH2a.

      • Staining examples shown and presentation: In several figures the authors could improve the presentation of the staining examples with some changes: o Cell line information for images: as the authors only ever note the condition (PCH2a or Control) but not the cell line it is unclear if the stainings all come from one cell line or from multiple different cell lines. This prevents comparing the different differentiation conditions. Additionally, for major conclusions the authors should consider including supplemental stainings or further information on how reproducible the results shown are (how many cell lines and batches were used?). o Selection of examples: in several cases (Fig 2C/D, 4A, 6A/B) the selected images depict very different regions, e.g., one condition shows a large rosette, while in the other condition no rosette can be seen. It would be more appropriate to show matching examples where possible. o Color code of stainings: Colors do not match throughout the manuscript in immunofluorescence images. E.g., Fig. 4 uses blue, green, red, magenta and Fig. 5 uses blue, green, magenta, cyan. It would be preferable to adhere to one color code. Considering significant fraction of the population is having red-green blindness, the latter color code seems more appropriate as it should ensure readability also for color-blind audiences.

      • Small typos: o Figure 1 legend: last sentence "The" instead of "Th" o Supplementary Figure 1B: PCH-2 is named "PCH-22" o Supplementary Figure 2: As in the main figure for neocortical organoids the PCH-1 condition is missing (see comment on organoid growth curves). Additionally, the color/shape code of the plots in B does not always match the legend (e.g., size in left plot is different and color of PCH-3 in middle and left plot differs from legend and right plot). o It is unclear why the cortical organoids are referred to as "neocortical organoids" in the figures and the text. The methods and the reference in the methods as well as all major papers rather use the word "cortical".

      References: Bardy, C. et al. Neuronal medium that supports basic synaptic functions and activity of human neurons in vitro. Proc National Acad Sci 112, E3312 (2015). Eichmüller, O. L. et al. Amplification of human interneuron progenitors promotes brain tumors and neurological defects. Science 375, (2022).

      CROSS-CONSULTATION COMMENTS I agree with the comments of the other reviewers and as they are mostly matching, this reinforces the importance to improve certain aspects of the manuscript. As there are no deviating issues I do not comment specifically on any reviewer comments.

      Significance

      This work is using organoid technology to shed light on brain region-specific phenotypes in PCH2a. Brain organoids have drastically changed the way we study human neurological diseases (Eichmüller and Knoblich 2022), however, most brain organoid research has focused on cortical organoids. Cerebellar organoid protocols exist for some time (Muguruma et al. 2015, Silva et al. 2020, Nayler et al. 2021) but were not yet applied to uncover new disease biology. Especially considering the important role of human-specific cerebellar processes in specific developmental disorders (Haldipur et al. 2021) and cancer (Hendrikse et al. 2022, Smith et al. 2022), disease modeling in human cerebellar organoids holds great potential for understanding disease biology. The work by Kagermeier et al. demonstrates that human cerebellar organoids are recapitulating brain region-specific growth deficits and thus is an important step forward for disease modeling. Therefore, this work will be interesting to researchers working on brain development and disease modeling, especially in in-vitro systems. Nevertheless, the mechanistic insight of the study is limited, as is the insight into how human-specific processes might be involved in the pathogenesis of PCH2a. Therefore, it will be interesting how this disease model will be used in future to investigate the cell types and mechanisms involved in the PCH2a phenotype.

      Personal field of expertise: Brain organoids and disease modeling in organoids especially of neurodevelopmental diseases. Analysis of organoids with stainings, as well as sequencing techniques, and bioinformatics.

      References:

      Eichmüller, O. L. & Knoblich, J. A. Human cerebral organoids - a new tool for clinical neurology research. Nat Rev Neurol 1-20 (2022) doi:10.1038/s41582-022-00723-9.

      Haldipur, P. et al. Evidence of disrupted rhombic lip development in the pathogenesis of Dandy-Walker malformation. Acta Neuropathol 142, 761-776 (2021).

      Hendrikse, L. D. et al. Failure of human rhombic lip differentiation underlies medulloblastoma formation. Nature 609, 1021-1028 (2022).

      Muguruma, K., Nishiyama, A., Kawakami, H., Hashimoto, K. & Sasai, Y. Self-Organization of Polarized Cerebellar Tissue in 3D Culture of Human Pluripotent Stem Cells. Cell Reports 10, 537-550 (2015).

      Nayler, S., Agarwal, D., Curion, F., Bowden, R. & Becker, E. B. E. High-resolution transcriptional landscape of xeno-free human induced pluripotent stem cell-derived cerebellar organoids. Sci Rep-uk 11, 12959 (2021).

      Silva, T. P. et al. Scalable Generation of Mature Cerebellar Organoids from Human Pluripotent Stem Cells and Characterization by Immunostaining. J Vis Exp (2020) doi:10.3791/61143.

      Smith, K. S. et al. Unified rhombic lip origins of group 3 and group 4 medulloblastoma. Nature 609, 1012-1020 (2022).

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Please find enclosed my recommendation for the paper submitted by Kagermeier et al entitled' Human organoid model of PCH2a recapitulates brain region-specific pathology'. It describes the development of a human model for PCH2a and its characterization. My overall assessment of the paper is 'Major revision' which is explained below.

      Although the paper is very well written and clearly interesting in that it describes the generation and initial analyses of a human organoid model for PCH2a it should be revised such that it will proof the points it is trying to make. The authors are meticulous in their studies combining cellular characterization and a thorough initial screen of organoid (both cerebellar as well as cortical) integrity, yet hardly any mechanistic data is provided. Nevertheless, if the authors are able to add additional experiments and are able to address the points raised, the reviewer may be willing to consider a more positive outcome.

      Major concerns

      1. The overall quality of the figures is poor. There is a lot of overexposure such that often cellular or tissue structures are blended. It starts with Figure 1 G and H but can be observed throughout the manuscript. Deconvolution would greatly enhance their results.
      2. Especially figure 4 and 5 could have been complemented with quantitative data. It furthermore seems more supplemental figure as these are just proof-of-principle stainings. No conclusions can be drawn from the panels except that all markers are there in the various conditions. And while they are showing a neural rosette in Fig 4A, just tiny ones can be observed in 4B. It is also not clear what the whole mount IHC ads in comparison to the IHC on sections. It is also strange that there is still a lot of SOX2 in the CALB/MAP2-positive area, but again with this magnification hard to appreciate.
      3. If the authors would like to proof the point that cerebellar/cortical development is hampered, more functional assays could have been done. Nothing is analyses on the fraction of progenitor cells present (such as the percentage of Tbr2+ IPC in VZ/CP). Furthermore, if there is a suspicion that the number of cells is affected (which is also not shown), proliferation/cell cycle exit experiments using BrdU/EdU should have been performed. Early cell cycle exit still cannot be rules out and should have been tested by the combination of Ki67-/EdU+ percentage of a certain faction of progenitor cells (eg PAX6+ pool).
      4. Instead the author chose to only perform a cCas3 staining. From the panels in Figure 6 it is hard to appreciate which cells are actually cCas3+. Also the analyses were performed on the total pool of cell while it might have been more interesting to look for cell death of the various progenitor pools (eg the SOX2+ pool).

      Minor concerns

      1. It would greatly enhance the review process if line numbers are added
      2. On general concepts (such as the generation of organoids in the context of disease) more references could have been added

      Figures

      Fig. 1: In A, the square is clearly visible and not similar to B. An annotation of which is the control and which is the patient is missing in the figure. The arrows are hardly visibly, would make them slightly bigger and remove the black outer lining. Figure 1C can easily go to the Supplemental material. Fig 1 D is hard to appreciate the staining, a close-up with bright field microscope will help. E-I Most of the panels but especially G and H are overexposed. In J, it is hard to appreciate the TSEN54 staining. Maybe separate channels and a merge?

      Fig. 3: Usually go into the supplementals

      Fig 4/5: Lack of quantitative data and poor quality of figures (overexposure).

      Fig 6: Many of the SOX2 panels are overexposed

      Referees cross-commenting

      I completely agree with reviewers #1 and #3. It is good to notice that we are overall on the same page.

      Significance

      The authors definitely made an excellent start to model PCH2a. Three controls and three patient lines are good to begin with but isogenic controls using one parental line and a patient line where the mutation is fixed would have been ideal. It is interesting that there seem to be a brain area specific pathology of the phenotype. Yet, more thorough analyses could have been performed such as proliferation and differentiation and cell cycle exit experiments. As for now the mostly descriptive data are only scratching the surface and little can be concluded on the molecular framework they are trying to solve.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Kagermeier et al. present a novel and interesting study that attempts to model a severe neurodevelopmental disorder, pontocerebellar hypoplasia type 2a, using neocortical and cerebellar organoids. Brain organoids are an appropriate and promising approach to elucidate disease mechanisms in neurodevelopmental diseases. The authors show a reduction in the size of the organoids which is more pronounced in the cerebellar compared to neocortical organoids. While this finding is interesting and reminiscent of the clinical PCH2a phenotype, i.e., cerebellar hypoplasia, the study is very preliminary and the conclusions of the manuscript are not supported by the data. Additional information and further experiments are necessary to support the claims made.

      Major concerns:

      1. hiPSC lines show considerable inter- and intra-individual variability and therefore the size differences observed between these control and patient-derived organoids may arise from differences in the hiPSC lines used. While the data sufficiently demonstrates the pluripotency of the multiple novel hiPSC lines, major concerns remain as to the appropriateness of the control hiPSC lines. The manuscript should include a table describing the age and sex matching as well as mode of reprogramming for all control and patient lines. Patient and control lines should be matched as closely as possible. Furthermore, figure legends should clearly indicate which clones and lines are shown in the various figure panels.
      2. As the hiPSC lines used are not isogenic, it is important that the authors characterise these lines further. This should include a quantification of the rates proliferation and apoptosis in all used hiPSC lines, as these might impact the growth rate of the embryoid bodies / organoids.
      3. The authors state that the hiPSC lines have been characterised by SNP arrays to show that no genomic / chromosomal aberrations have been accrued due to reprogramming. The manuscript should include information as to when the SNP array was performed (i.e., immediately after reprogramming, after initial passaging, etc) and also include the results of the SNP array as additional information. What passage were the hiPSC when the presented experiments were carried out?
      4. Given that TSNE54 is broadly and strongly expressed in the developing nervous system, the very limited staining of the organoids for TSNE54 in Figure 2 is surprising. Can the authors provide an explanation for the fact that TSNE54 is only expressed in a small subset of cells? Which cell types are these? Moreover, high-magnification images should be shown to demonstrate subcellular staining pattern of TSNE54. Quantification of TSNE54 protein levels by immunoblotting would also be beneficial. Related to this observation, it is puzzling that the large size differences that the authors observe in their organoids would be driven by such a small number of TSNE54-expressing cells. How do the authors explain this discrepancy?
      5. The generated organoids need to be better characterised with a broader range of markers using both qPCR and immunostaining. At the moment, their identity as "cortical" and "cerebellar" organoids remain unconvincing. This is particularly true for cerebellar organoids, which are challenging to generate and are not widely used. The authors should include additional markers (for example, see PMIDs 25640179, 29397531, 32117945) and immunostaining should clearly show expected staining patterns. In Figure 5, it appears that some markers (e.g., SATB2) are expressed differently between control and patient lines, yet this is not commented on by the authors who conclude that control and patient lines show differentiation into organoids.
      6. The authors attempt to look into a potential mechanism for the size differences observed between control and patient organoids. However, only cleaved caspase-3 is used as a marker for apoptosis and no differences were observed. The authors should include further markers for potential cell death. In addition, immunostaining for proliferation markers (i.e., KI67) should be performed to evaluate whether the difference in organoid size could stem from decreased proliferation rather than increased cell death.

      Significance

      The authors present an innovative approach to study neurodevelopmental disorders using brain organoids and should be of interest to researchers and clinicians working on neurodevelopmental diseases. However, the data presented are too limited to support any conclusions about the phenotype observed. Furthermore, questions remain about the used methodology and more work is needed to demonstrate the successful generation of both cortical and cerebellar organoids.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all three Reviewers for their thorough assessment of our manuscript and their constructive comments and suggestions.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this study, the authors generate several variants of actin that are internally tagged with short peptide tags. They identify one particular position that is able to tolerate various tags of 5-10 amino acids and still shows largely unaltered behavior in cells. They study incorporation of their tagged actins into filaments, characterize the interactions of G-actin variants with different associated proteins and show that retrograde actin flow in lamellipodia and the wound healing response of epithelial cells is not affected by the tagged variants. They then apply the tagged actin to study subcellular distribution of different actin isoforms in mammalian and yeast cells.

      The identification of a specific site in the actin protein that tolerates variable peptide insertions is very exciting and of fundamental interest for all research fields that deal with cytoskeletal rearrangements and cellular morphogenesis. The result demonstrating the functionality of actin variants with peptides inserted between aa 229 and 230 are generally convincing and well done. In particular, the generation of CRISPR/Cas9 genome edited versions of beta- and gamma actin are impressive. I therefore generally support publication of this study. There are however several technical and conceptual issues that should be addressed to improve quality and scope of the study. I listed some specific comments below:

      We thank the Reviewer for their constructive comments and general support for publication of our study.

      Major points

      - The biggest issue I have is the last section on the application of tagged actins to study isoform functions. In principle the application is very clear as there are simply no alternative ways to study isoform distribution in live cells. However, the experimental data are simply not convincing. What the authors define as "cortex" in Fig. 5A seems to rather represent cytosolic background mixed with radial fibers. I am not convinced that even the antibody staining with a relatively clear differential distribution of beta and gamma really shows a genuine accumulation of one isoform on stress fibers. It seems to me that the beta-actin staining has as higher cytosolic background and is generally weaker (gamma nicely labels transverse arcs), which reduces signal/noise and therefore yields a relatively increased level in areas with less-bundled actin. My suggestion is to select more clearly defined actin structures and to use micro-patterned cells to normalize the otherwise obstructing variability in actin organization. Possible structures would be cortical arcs in bow-shaped cells, lamellipodial edges (HT1080 seem to make very nice and large lamellipodia) or cell-cell contacts (confluent monolayer, provided cells don´t grow on top of each other). Stress fibers are possible but need to be segmented very precisely and I did not see any details on this in the methods section. For Fig. 5D: I assume cells were used where only one isoform was tagged? This is technical weak and the double-normalization is probably blurring any difference that might be occurring. Why not use a double-tagging strategy with ALFA/FLAG or ALFA/AU5 tags to exploit the constructs introduced in the previous figures? Also, the unique selling point of the strategy is the possibility of actual live imaging of specific isoforms. Cells that have stably integrated double tags and then transiently express nanobodies for ALFA and either AU5 or FLAG (or other if those don't exist) would make this possible. Considering the work already done in this manuscript, such an approach should actually be possible - did the authors attempt this or is there are reason it is not discussed? If double tagged cells are not possible for some reason it should at the very least be possible to combine ALFA-detection with the specific antibody against the other isoform and get rid of the double normalization.

      We thank the Reviewer for the various suggestions regarding the comparison between the localization of the tagged and native isoforms. In our reply below, we will separately discuss the possibilities and our considerations for fixed samples and live cell imaging. We apologize for the lengthy response but for transparency reasons, we would like to give a thorough overview of our efforts for isoform-specific localization in cells, something for which we have limited space in the manuscript.

      Fixed samples:

      It was a significant experimental challenge to comparing the labeling of the β- and γ-actin specific antibodies with our internally tagged actin system (Fig. 5A-D). The reason for this is that the labeling of the samples with the β- and γ-actin specific antibodies requires treatment with methanol (Dugina et al., J Cell Sci, 2009), most likely to disturb the interaction of actin with actin-binding proteins that prevent the binding of the antibodies due to steric hindrance. Methanol treatment, however, precludes the co-labeling with phalloidin, likely due to changes in the tertiary/quaternary protein structure of F-actin. Initially, we have put a lot of effort in trying to simultaneously label phalloidin with the actin specific antibodies but even very brief methanol treatment (seconds), before or after phalloidin labeling, completely prevents/reverses the binding of phalloidin. Importantly, also the ALFA tag labeling was suboptimal after methanol treatment.

      The fact that we could not perform these double labelings led us to perform different ratio calculations for the β- and γ-actin antibody and the ALFA tag labeling. In the case of the antibody immunofluorescence labeling, we simply divided the signal of the β-actin and γ-actin since we could simultaneously label the isoforms in the same cell. In the case of the ALFA tag labeling, we used phalloidin for independent signal normalization and then performed a second normalization. Although this complicates the normalization procedure (ALFA tag signal of β- and γ-actin is first normalized to total F-actin and then a ratio is calculated) and understandably leads to some confusion, this was the only way forward to obtain the results presented in the manuscript.

      The Reviewer points out that “What the authors define as "cortex" in Fig. 5A seems to rather represent cytosolic background mixed with radial fibers.”. In our images, we observe very little cytosolic background from both antibody stainings. More importantly, for the quantitative analysis, the fluorescence intensity values were corrected for the background values observed in cytosolic areas so even if the signal is present, it should not affect our analysis. We do admit though that we could have been more careful with the term “cortex” since the observed signal could indeed be a mix of radial fibers and the actin cortex. The reviewer further states that “I am not convinced that even the antibody staining with a relatively clear differential distribution of beta and gamma really shows a genuine accumulation of one isoform on stress fibers.” Although the differences are small, we consistently observe a differential fluorescence intensity of β- and γ-actin in actin-based structures with a relatively stronger signal of γ-actin in stress fibers (Fig. 5C). Since we always normalize the fluorescent signal intensity per cell, this strongly indicates a genuine accumulation of one isoform over the other in specific actin-based structures. This observation is very consistent in our experiments and also aligns with many published studies where differences in the localization of β- and γ-actin are reported in various cell types (Pasquier et al., Vasc Cell, 2015; van den Dries et al., Nat Comms, 2019; Malek et al., Int J Mol Sci, 2020). As for the segmentation, we mentioned in the Methods section that we selected small regions (0.5x0.5mm) that exclusively contain stress fiber or “cortex” regions. The regions shown in Fig. 5B are therefore larger than the analyzed regions, something which we will better indicate in the revised manuscript.

      Planned revision: We will provide a more detailed explanation of our quantitative analysis in the Methods section such that it is more clear how our normalization procedure was performed. Furthermore, we will adapt Fig. 5A-B such that it better visualizes how we defined the regions for quantification. As per the Reviewer’s suggestion, we will also apply a different experimental method to show that the tagged isoforms properly localize to actin-based structures. For this, we will attempt to use micropatterned cells to induce clearly define actin-bases structures (the crossbows as suggested by the Reviewer) and also explore the possibilities of investigating the differential localization in double-tagged cells. We will also reconsider the use of the term “cortex” for the region that is pointed out in Fig. 5A-B.

      Live cell imaging:

      We agree with the Reviewer that it would be very valuable to attempt simultaneous live cell imaging of two isoforms. Yet, for this, we would need two tag/fluorophore systems that allow the visualization of internally tagged isoforms in living cells. As presented in our original manuscript, we have successfully inserted many different epitope tags (FLAG/AU1/AU5/ALFA) in the T229/A230 position to demonstrate the versatility of our tagging approach. Yet, despite significant efforts to identify a second tag/fluorophore system that would allow isoform-specific live cell imaging, we only succeeded in designing one strategy to perform live cell imaging, i.e. with the ALFA tag (Götzke, Nat Comms, 2019). Part of the reason for this is that so far, no high affinity nanobodies have been generated against the classical epitope tags (FLAG, AU5 etc.). This is an established challenge since classical epitope tags are typically linear/unstructured while nanobodies require folded secondary structures for epitope recognition such as alpha helices (the ALFA tag was specifically designed as such).

      Besides the successful ALFA tag approach we have tried the following additional approaches for live cell imaging: 1) __full-length GFP, 2) full-length GFP with linker, 3) GFP11 (to complement with GFP1-10 (Cabantous et al., Nat Biotech, 2005) 4) GFP11 with linker 5) FLAG Frankenbodies (Zhao et al., Nat Comms, 2019; Liu et al., Genes Cells, 2021) in FLAG IntAct cells and 6) __Tetracysteine/FlAsH labeling. Importantly, each of these additional internally tagged actins, except for those that contained full-length GFP, showed a high colocalization with the cytoskeleton, again demonstrating the versatility of the T229/A230 position to tag actin. Unfortunately, none of these approaches satisfactorily visualized the actin isoforms in living cells. We will therefore briefly summarize our findings here.

      (1-2, integration of full-length GFP and GFP with linker) Probably not surprisingly, but integrating the entire coding sequence of GFP or GFP flanked by linkers (each 5AA in length) within the T229/A230 position did not results in a proper localization of actin.

      (3-4, integration of GFP11 and GFP11 with linker) Next, we assessed the localization of the GFP11 tagged actin versions (GFP11: 16AA, GFP11+linker: 26AA). Because GFP11 is not visible without GFP1-10 complementation, we also tagged actin at the N-terminus simply for proof of concept where the internally tagged actins would end up. Interestingly, both GFP11-actin and GFP11+linker-actin properly integrated within the cytoskeleton as demonstrated by the FLAG staining. This again demonstrates the versatility of the T229/A230 position and strongly suggests that even the integration of 26AA within this position does only minimally affect the polymerization of actin into the cytoskeleton.

      (3-4) After confirmation of the proper integration of GP11-actin and GFP+linker-actin we continue to express the GFP1-10 in these cells. Unfortunately, this resulted in no or only very minimal localization of the actin to the cytoskeleton, demonstrating that GFP-complementation hampers the integration into the cytoskeleton.

      (5, use of FLAG Frankenbodies) We also expressed FLAG Frankenbodies into our FLAG IntAct cells in an attempt to visualize the isoforms in living cells. FLAG Frankenbodies are single chain antibodies fused to GFP and can be expressed in cells to visualize FLAG-tagged proteins (Liu et al., Genes Cells, 2021). Although a cytoskeletal labeling was indeed discernable in some cells, the FLAG Frankenbody signal overlapped much less with the total actin signal as compared to the FLAG immunofluorescence labeling, indicating that the incorporation of the FLAG-tagged actin was much less in the presence of the FLAG Frankenbody. Also, a significant fraction of the cells demonstrated a homogenous cytosolic signal.

      (6, Use of tetracysteine/FlAsH) Although the tetracysteine tag/FlAsH system is widely known to induce artefacts, we still aimed to evaluate if for live cell imaging of IntAct actins. Similar to GFP11, we first determined the integration of tetracysteine-actin into the cytoskeleton with the use of an additional N-terminal FLAG tag and demonstrate that it was properly integrated into the actin cytoskeleton. Unfortunately, after brief incubation with FlAsH-EDT2, we noted 1) a significant amount of background fluorescence, preventing proper actin visualization and 2) that the cell became static indicating toxicity of the FlAsH-EDT2 compound. Titrating down the amount of FlAsH-EDT2 did not alleviate these drawbacks and only resulted in less fluorescence.

      Overall, based on these experiments, we concluded that the T229/A230 position itself is very versatile, as demonstrated by the proper localization of the GFP11-actin variants and the TetraCys-actin. At the same time, none of these tag/fluorophore systems properly visualized actin in living cells. Although we are unsure what the reason is for this, it is easily imaginable that the on/off kinetics of the split GFP system and the FLAG Frankenbodies are suboptimal to allow for the rapid and continuous integration of actin monomers into the F-actin cytoskeleton. We therefore also concluded that currently, the ALFA tag/nanobody system is apparently unique in its ability to visualize epitope tagged actin in living cells (as shown in the manuscript). For simultaneous visualization of multiple isoforms, we rely on progress on the development of novel nanobody-based tags, something we hope the Reviewer will agree is outside the scope of the current work.

      *- The authors make a point of comparing the internally tagged actin to N-terminal tags that are mostly functional but have been shown to affect translational efficiency. I would strongly suggest to include N-terminally tagged actin as control for all assays in this study. Also for the physiological assays (retrograde flow, wound healing), a positive control is missing that shows some effect. Previous studies showed defects with transiently expressed actin with an N-terminal GFP. As retrograde flow measurements are very sensitive to the exact position of the kymographs and wound healing assays is a very crude and indirect readout, such a positive control is essential. *

      We acknowledge that N-terminally tagged actin has been used extensively for actin research (especially before the introduction of Lifeact). For our studies, however, we were specifically interested in whether the internally tagged actins show similar characteristics as compared to wildtype actin. We have not included N-terminally tagged actin in all of our experiments, since this would not affect our conclusions with respect to the functionality of our internally tagged actins. We expect that for future investigations to for example further establish the importance of actin N-terminal modifications in the differential regulation of actin isoforms, the comparison between internally and N-terminally tagged actins could be very instrumental. Yet, we consider this comparison outside the scope of the current manuscript. For now, the results in the manuscript provide evidence that our approach is unique with respect to the fact that it allows isoform-specific tagging without manipulating the N-terminus. As such, our internal tagging system complements the already existing repertoire of actin reporting methods (N-terminal fusion, Lifeact, F-Tractin, actin nanobodies) and allows researchers to study so far unknown properties of actin variants.

      *- Expression of tagged actins in yeast is a very nice idea but it would be far more informative to express the tagged forms as the only copy of actin. This can either be done by directly replacing endogenous actin gene in S. cerevisiae, or (if the tagged versions are not viable) - using the established plasmid shuffle system (express actin on counter-selectable plasmid, then knock out endogenous copy and introduce additional plasmid with tagged actin, then force original plasmid out). In the presence of endogenous S. cerevisiae actin the shown effects are very hard to interpret as nothing is known about relative protein levels (endogenous vs. introduced). Also, if constitutive expression of the ALFA nanobody is harmful for integration into cables, why not perform inducible expression of the nanobody and observe labeling after induction. For the live imaging a robust cable marker is needed, like Abp140-GFP. Finally, indicate the sequence differences between the used actin forms in yeast (supplementary figure with sequence alignment and clear indication of all variations) *

      We thank the reviewer for their positive comments and feedback regarding expression of IntAct variants in yeast. Currently, we have expressed IntAct as an extra copy in the presence of native Act1 of S. cerevisiae. All the IntAct variants have been expressed under a commonly used constitutive TEF1 promoter. We agree with the Reviewer that it would be valuable to attempt to express the tagged forms as the only copy of actin.

      Planned revisions:

      1) As per the Reviewer’s suggestion, we will attempt to make yeast strains with IntAct as the sole expressing actin copy by using the well-established 5-FOA-based plasmid shuffle system in yeast. We will use a ∆act1 strain containing wildtype act1 in a centromeric ura-plasmid described in Harrer et. al, 2007 (generously shared by Prof. Jessica and Prof. Amberg at Upstate Medical University of New York, USA) and express IntAct exogenously via additional plasmids. Shuffling of these strains on 5-FOA will cause the loss of ura-plasmid containing the wildtype act1 copy and will determine whether yeast cells will be able to survive with IntAct as the sole source of actin. If the cells do survive with IntAct as a sole copy, we will perform subsequent analysis for assessing actin cytoskeleton organization under these conditions.

      2) As the reviewer has mentioned, expression of NbALFA during live-cell imaging experiments hindered incorporation of IntAct into linear actin cables in yeast (Suppl. Fig. S13). As per the reviewer’s suggestion, we will now try to create an inducible-expression system for the NbALFA-mNG and observe its effects on incorporation into formin-made actin cables after induction. We have already created NbALFA-mNG constructs under galactose-inducible GALS and GAL1 promoters and are currently constructing yeast strains for these experiments.

      __3) __We will add an extra supplementary Figure to indicate the sequence differences of the various actin variants that we have expressed in yeast.

      - As the authors clearly show good integration of several tagged actins into filaments I would expand the structural characterization: perform alpha fold predictions of actin monomer structures including the various tags to show the expected orientation. It is striking that the only integration site that seems to work well is at the last position of a short helix, indicating that the orientation of the integrated peptide might be fixed in space and be optimal to minimize interference. Also, a docking of the tag onto the recently published cryoEM structures of the actin filament should be shown to indicate where it resides compared to tropomyosin or the major groove where most side binding proteins seem to bind.

      We already performed AlphaFold predictions of the tagged actin monomers, but we have decided to not include these predictions in the manuscript because of two reasons. First and foremost, while the prediction confidence of the non-tagged region is very high (pLDDT > 90), the prediction confidence of the tagged region is very low (pLDDT https://alphafold.ebi.ac.uk/faq), pLDDT values below 70 should be treated with caution and values below 50 should not be interpreted. Intriguingly, the low confidence aligns with the fact that for both tags, the prediction does not match with known features of the tag. The FLAG tag should be a linear/unstructured region in order to be recognized by the antibody and the ALFA tag should organize into an alpha helix (Götzke et al., Nat Comms, 2019). Yet, in the prediction, the FLAG tag partially continues as an alpha helix and the ALFA tag is only a small helix with part of the tag being unstructured. Second, more minor, reason for not including the predictions is that AlphaFold does not predict to what extend the tag is flexible, which means that even if the tagged region is predicted correctly, it is difficult to say whether the regions will interfere with binding of proteins.

      Despite the low prediction confidence, we used the published actin-tropomyosin cryoEM structure (von der Ecken et al., Nature, 2015) to replace WT actin with ALFA tag actin and the results are shown below. Again, although results should be interpreted with caution, the tag does not seem to obstruct monomer-monomer interactions within an F-actin filament and also the tropomyosin binding surface is relatively distant from the tag region, suggesting that these interactions are likely not disturbed by introducing the tag.

      - For any claims regarding usability of tagged variants for isoform research it would be very important to characterize the known posttranslational modifications of tagged actin variants - are the differences between beta and gamma maintained on this level as well?

      Planned revision: Following the Reviewer’s suggestion, we will perform a western blot analysis to compare posttranslational modification (arginylation) of tagged and wildtype actins.

      Technical issues

      - There is no scale for the color coding in Fig. 5A, B

      We deliberately did not add a numerical scale because the images are normalized which means that presenting the actual numbers might be misleading. The numbers could be interpreted as if they actually present the amount of β-actin relative to γ-actin which is not the case due to staining differences and the normalization procedure.

      - The y-scales for Fig. 5C and D need to be identical to allow direct comparison

      Planned revision: We will adapt the scale of Fig. 5D to make it identical to Fig. 5C. Following the other suggestions of the reviewer, we will also critically evaluate our normalization procedure and present those numbers in Fig. 5C-D if the values turn out to be different.

      - Pearson coefficient should not be normalized to a control value as its already a dimensionless parameter. Always report actual R-value - also remove R2 values for Pearson as this makes no sense in this context (not sure if it was a typo or intended).

      We normalized the Pearson coefficient values for visual representation of the results. The majority of the raw coefficient values (more than 80%) are between 0.20 and 0.75 (see raw values in the associated excel file). Theoretically, Pearson coefficient values are possible between 1 (or-1 for negative correlations) and 0. The much smaller window in our values as compared to the theoretical window (0.55 vs 1) led us to normalize the values such that they can be presented on a scale from “maximum expected colocalization” to “minimum expected colocalization”. In this way, the differences between the various tagged actins are much better appreciated in the Figure. As to reporting the R2, the Reviewer is correct. Reporting the R2 is an inadvertent mistake from our side and we will correct it.

      Planned revision: We will change the R2 in the text to PCC or Pearson Correlation Coefficient.

      *- All values on subcellular regions (like stress fiber or cortex) dependet critically on the way thesese regions were thresholded or identified. Provide all details on how this was done in the methods section and ensure that adequate background subtraction and normalization is applied. Optimally, an unbiased (AI or automated) approach based on simple image statistics is used for this to avoid personal bias. *

      Planned revision: As also indicated above, we will add new experiments to better compare the localization of the isoforms in tagged and parental cells. These new experiments will also be accompanied by a more detailed explanation of how the regions were selected and quantified.

      - In Fig. 2A only heterozygous FLAG-actin cells are used. Why not use a homozygous line (for both beta and gamma actin)? The nice band shift of the FLAG version would allow the precise quantification of the fraction of total actin covered by beta and gamma actin, which then could provide some additional info for the apparently weaker beta staining in Fig. 5 (if beta expression is simply weaker). This would be a very simple and useful advantage of the internal tags that could be widely applied.

      In Fig. 2A, we used the heterozygous FLAG-actin cells to directly compare the production of β-actin from the knock-in allele and the wildtype allele in the same cells. The fact that the two bands observed in this western blot analysis (upper and lower) are almost the same (with the FLAG band being a bit more intense) provides the strongest indication that the tag does not interfere with the expression of actin. In Suppl. Fig. 5D, we show that the expression of β-actin is also unaffected in the hemizygous FLAG actin cells, which exclusively express tagged actin.

      Planned revision: As per the Reviewer’s suggestion, we will also add a western blot analysis on the expression of both actin isoforms and total actin in hemizygous cells.

      *- Fig. 3: control with N-terminal tag is missing. Also, why is it not possible to assay filament binding factors like Myosin, Filamin or alpha actinin - instead of co-IP a simple co-sedimentation assay with cell extracts in F-buffer should pick up any major difference in decoration of filaments containing the ALFA tag. Using two speeds for centrifugation it might even be possible to observe effects on filament bundling. The best approach for this would of course be to purify tagged actins and perform in vitro assays but this is clearly beyond the scope of what the authors intended here. I personally think that a broad acceptance of the marker will only come once the biochemistry has been sufficiently characterized so this is a future direction I would strongly encourage. *

      We kindly refer to our response on Page 5/6 for why we have not included the N-terminal control.

      Planned revision: The co-sedimentation assay is an excellent suggestion by the reviewer. Following the Reviewer’s suggestion, we will perform F/G-actin fractionation and assess the presence of several F-actin associated proteins in the F-actin fraction.

      - Fig. 2A has no loading control

      We show this western blot to indicate that the WT actin and tagged actin are expressed at similar levels in the heterozygous knock-in cells. For this, no loading control is needed because we only compare the intensity of the upper band (tagged actin) with the lower band (WT actin).

      - The RPE-1 data are confusing as several constructs show very different localization (completely cytosolic) to HT1080 cells and there is no possible explanation given for this. Maybe simply remove this data set?

      We agree with the reviewer that the differences in the localization between some of the internally tagged actins between the HT1080 and RPE1 cells might be confusing, especially for the A230-A231 variant for example. Yet, the fact that also in these cells, the T229-A230 variant performs equally well as compared to N-terminally tagged actin is an important confirmation that this variant is properly integrated into actin-based structures, independent of cell type. This makes the support for choosing this variant to continue with our studies stronger. A possible explanation for the differences is that RPE1 cells in general tend to form more stress fibers as compared to the HT1080. Since the localization to stress fibers is different between the internally tagged actins, this may explain the differences observed in colocalization.

      __Planned revision: __We will add a short text, in the Results or the Discussion, on the differences between the colocalization values between HT1080 and RPE1 cells.

      *- The angel measurements for lamellipodial actin is not very meaningful: the angel is determined for the radial bundles, which do not correspond to the Arp2/3 angel of single filaments and is likely the results of different nucleation factors, I would suggest to remove this. If angel measurement are really intended, cryoEM needs to be performed. *

      We apologize for this misapprehension from our side which is also noted by the other two reviewers. In the treadmilling videos of the lamellipodia in HT1080 cells, which were obtained using Airyscan super-resolution microscopy, we clearly observe a consistent filament formation at a constant angle, something which we interpreted as the angle between the mother filament and the daughter filament. After consulting the literature, we indeed have to admit that this cannot be interpreted as such and we will remove these datasets.

      Planned revision: We will remove the datasets with the angle measurements (Suppl. Fig. 7A-B) from our manuscript.

      - Replace all SEM with SD values - use at least 3 biological replicates (4D SEM of n=2)

      Planned revision: We will carefully check our statistics and revise where appropriate.

      Minor points

      - Intro: after listing all the details already understood on actin isoforms it is not very convincing to simply state the molecular principles remain largely unclear (l 34) - maybe better "there is no way to study actin dynamics due to current limitations of specific antibodies to fixed samples. Interesting option would be actually to develop nanobodies that are isoform specific.

      We will rephrase the text in the introduction. Regarding the development isoform-specific nanobodies. Although this sounds like a promising way forward, this would likely not result in isoform-specific targeting in living cells. Similar to the antibodies, isoform-specific nanobodies would have to be generated against the N-terminus which, under native conditions, is likely not available due to the occupation with actin-binding protein. Also, since the N-terminus is not structured, it may be extremely challenging to generate nanobodies against these epitopes.

      *- L 71: "involved" in the kinetics is not a good term - maybe affects or regulates.... *

      We will rephrase the text.

      - L148: "suspect" instead of "expect" - this clonal variation is actually a big danger of the employed approach as possible defects in actin organization could be masked by compensatory changes - it would generally be good to show critical data for at least 3 independent clones to rule out dominant selection effects.

      We will rephrase. We agree that clonal variation could be a danger if actin levels are to be investigated. For future follow-up studies, we plan to make additional cell lines to avoid clone-specific conclusions.

      ***Referees cross-commenting** *

      *I completely agree with the comments by reviewer 2 on the various missing controls - adding several or all of those will make the results much more convincing. The key for the adaptation of any new actin probe will be the level of confidence researchers have on the doumented effects. Even some negative effects on actin behavior (I am sure there will be some) should not prevent usage of the strategy as long as there is robust and convincing documentation of those effects. I also agree that including some basic in vitro characterization will go a long way to convince people dierectly working on actin (there is a very high level of biochemical understanding in that field). *

      Planned revision: We will perform the essential controls as suggested by Reviewer 2. Furthermore, for future experiments, we do envisage the production and purification of internally tagged actins and investigate their binding properties in in vitro reconstitution assays. We have already started with optimizing these approaches through our ongoing collaboration (KD, SP).

      Reviewer #1 (Significance (Required)):

      *Significance: Very useful finding that can be applied to any question related to actin-dependent cellular processes (morphogenesis, cell division, cell polarization, cell migration etc.) *

      *Strength: main finding convincing, strong genome edited cell lines *

      *Limitations: application to study of isoforms very limited and data not convincing, statistics and image quantifications need improvement *

      *Advance: identify new location for integral tagging of actin, which was not really possible before. The main relevance is for fundamental cell biology but the approach can also be applied to the study of disease variants in actin. *

      Audience: general cell biology - very broad interest

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      Actin is highly sensitive to modifications, and tagging it with fluorescent proteins or even smaller motifs can affect its function. The most well-known example of this is that fission yeast where actin has been replaced with GFP-actin are inviable (Wu and Pollard, Science 2005) because the labeled actin cannot incorporate into the formin-dependent filaments that make up the cytokinetic ring. Subsequent experiments revealed that formins filter out GFP-actin monomers, as well as monomers that are labeled with smaller fluorescent motifs (Chen et al, J. Structural Biology 2012). Further, attempts to make mammalian cells lines where GFP-beta-actin was knocked into one allele resulted in extreme down-regulation of the GFP-labeled actin, indicating that there is some implicit toxicity with the labeled version. To my knowledge, all attempts at making homozygous GFP-actin knock-ins have been unsuccessful. Therefore, while GFP-actin or other labeled variants can be over-expressed in many different cell types with some success, there is always the question of how faithful the labeled actin represents bona fide actin localization and dynamics.

      To address this van Zwam et al. have developed a clever strategy of screening actin for internal motifs that can tolerate incorporation of a tag without affecting its function. They appear to have found a good candidate, named IntAct, and provide evidence that this tagging position allows the actin to be functional in both human and yeast cells. The work is very promising, and many of the assays performed satisfy the criteria of rigor and reproducibility. Importantly, the authors have created knock-in human cell lines where the tagged actin is expressed at normal levels, including a double allele knock-in that is viable and has normal proliferation and motility. Additionally, the authors show that labeled S. cerevisiae actin can incorporate into actin cables, which are formin dependent. IntAct constructs were shown to interact with several well-known actin binding proteins and localized well to many different actin structures. There was also interesting data obtained from tagging both beta and gamma actin in human cells. However, as an actin scientist eager for new probes to visualize actin in cells, there are still questions about the functionality of these probes. Addressing these issues, listed below, would alleviate the concerns I still have about IntActs after going through the manuscript. IntActs have the potential to have a large impact on cytoskeletal research if it can be rigorously documented that they are functionally as close to unlabeled actin as possible.

      We thank the Reviewer for their constructive comments and general positive evaluation of our study.

      *Reviewer #2 (Significance (Required)): *

      Concerns:

      1. There are no negative controls performed for either the fixed or live-cell imaging of IntAct. Since the fixed cell data is heavily reliant on the presence of flag-labeled puncta at actin filaments, it is important to show that the immunocytochemistry protocol doesn't produce anything that would mimic the localization of actin. For the live cell data, there has been no effort made to show that the binding of the nanobody to the ALFA tag on InAct is specific.

      Planned revision: __We will add the following controls to exclude that any of the labeling procedures produces anything that would mimic the localization of actin: 1) Immunofluorescence staining of the used tags (FLAG/ALFA) in cells that do not have tagged actins 2) Expression of ALFA-Nb-GFP and ALFA-Nb-mScarlet in cells that do not have tagged actins 3)__ Expression of free GFP in cells that have tagged actins. We will co-stain these cells with phalloidin to visualize F-actin and determine if any signal is specifically localized to the actin cytoskeleton.

      2. The homozygous ALFA-tagged IntAct cells have a 50% reduction in the amount of actin expression (Fig. 2D). What is the F:G ratio in these cells? The F:G measurement is only shown for the FLAG-tagged heterozygous IntAct cells, which have the worst co-localization with phalloidin (Fig. 2F) and were not used for subsequent figures. I appreciate that motility and proliferation were measured and shown to not be affected (Fig. 4D,E) , but in our lab reducing the amount of polymerized actin by 50% (which may be more in ALFA-tagged IntAct cells if the F:G changes) has catastrophic effects on other cytoskeletal and organelle systems. Since the homozygous ALFA IntAct cells are the main ones used in the manuscript, they should be the ones that are fully characterized.

      We would like to point out that the reduction is only 20-25 percent depending on the specific western blot analysis and the loading control. Still, the Reviewer is correct about the necessity of the F:G actin measurements of the ALFA-tagged IntAct cells and we therefore included those as Suppl. Fig. 9 in the original manuscript (text on page 9). The quantification of these assays clearly demonstrated that the F-G actin ratio in the ALFA-tagged IntAct cells is the same as in parental cells.

      3. It is not addressed if expressing the ALFA-Nb-GFP construct in ALFA-IntAct cells alter actin properties? This is essential information for live cell imaging experiments.

      Planned revision: We have already performed proliferation and migration experiments in cells that stably express the ALFA-Nb-GFP. These data indicated that proliferation and migration are not affected by the presence of the nanobody and these data will be included in the revised manuscript. To note, in the original manuscript, we already showed that treadmilling of actin at the lamellipodia is not affected by the presence of the ALFA-Nb-GFP.

      4. It is not addressed how much of the ALFA-IntAct gets labeled with ALFA-Nb-GFP and how uniform the labelling.

      We do not understand this specific request of the Reviewer. To our knowledge, it is not possible to assess how much of a probe (in this case the ALFA-Nb-GFP) binds the target (in this case the ALFA-IntAct actins) in living cells. This is not only the case for the ALFA-Nb-GFP but also for any other probe. As an example, when expressing Lifeact, we also do not know how much of the actin molecules within F-actin get labeled with Lifeact and how uniform the labeling is. From the results of the live-cell imaging we can only conclude that the binding is at least so effective that we can readily observe and discern all the actin-based structures that are also observed by Lifeact (see Suppl. Fig. 8 for Lifeact-GFP/ALFA-Nb-mScarlet cotransfection). Whether the regions that do not have F-actin only contain ALFA-Nb-GFP that is bound to actin monomers or also contains a significant fraction of free ALFA-Nb-GFP seems an issue that cannot be addressed.

      5. To assess lamellapodia architecture, "branched actin angle" is measured using AiryScan imaging of actin filaments. This type of microscopy does not offer the ability to image individual actin filaments; what is actually being measured is the orientation of actin bundles to each other. It should be impossible to image the orientation of actin filaments in Arp2/3 dendritic networks and it is surprising that the measurements average to 70 degrees. A suitable substitute for this would be to measure the size and amount of F-actin in phalloidin-stained lamellipodia using kymograph analysis.

      We apologize for this misapprehension from our side which is also noted by the other two reviewers. In the treadmilling videos of the lamellipodia in HT1080 cells, which were obtained using Airyscan super-resolution microscopy, we clearly observe a consistent filament formation at a constant angle, something which we interpreted as the angle between the mother filament and the daughter filament. After consulting the literature, we indeed have to admit that this cannot be interpreted as such and we will remove these datasets.

      Planned revision: We will remove the datasets with the angle measurements (Suppl. Fig. 7A-B) from our manuscript.

      6. Was it possible to make an IntAct gene substitution in yeast?

      Planned revision: We thank the reviewer for this interesting question and as also suggested by Reviewer 1, we are now constructing yeast strains with IntAct as the sole expressing actin copy by using the well-established plasmid shuffle system in yeast. The results of these experiments will determine the ability of IntAct to completely substitute actin function in yeast.

      Also, while this is not necessary for this manuscript, making a fission yeast strain where actin has been substituted with IntAct and demonstrating that IntAct gets incorporated into the cytoplasmic ring and into Cdc12p-polymerized filaments would alleviate MANY potential concerns people would have about these probes by directly assessing situations were other labeled actins have been documented to fail. Along the same lines, it would have been nice to see a comparison in some of the assays of ALFA-IntAct and GFP-actin or another labeled actin variant.

      We appreciate the reviewer for their constructive feedback and completely agree that it is important to document how IntAct behaves in scenarios where other labelled actins have failed. As a proof of principle, IntAct incorporates into both formin- and Arp2/3- made linear and branched actin filaments in yeast (Fig.5E, Suppl. Fig. 14) and this data shows that IntAct labelling strategy is the first to achieve good integration into both these structures as previous efforts with labelled actin such as GFP-Actin fail to incorporate into formin-made actin filaments (Doyle et al., PNAS, 1996). Thus, we believe that IntAct does perform better than other labelled actins in yeast, although, further optimizations are required to overcome limitations regarding incorporation into actin cables in the presence of the ALFA nanobody.

      Planned revision: We have already extended applicability of IntAct to another well-known fungal model system, the fission yeast Schizosaccharomyces pombe (S. pombe). We expressed IntAct variants of human β- and γ- actin, budding yeast actin (Sc-IntAct) and fission yeast actin (Sp-IntAct) from an exogenous plasmid under the native S. pombe actin promoter in an S. pombe strain that constitutively expresses the Nb-ALFA-mNG. Live-cell microscopy of S. pombe cells expressing these proteins revealed that all IntAct variants localize to actin patch-like structures located at the cell poles and cell division site (during cytokinesis). These structures show similar dynamics as reported for actin patches of S. pombe previously (Pelham et al., Nat Cell Biol, 2001). These preliminary results suggest that IntAct proteins show a similar localization pattern to only branched actin networks found in the actin patches of S. pombe like we had previously observed for the budding yeast, S. cerevisiae (Fig. S13 in manuscript). The underlying mechanism for this exclusion from linear actin cable network from both budding and fission yeast remain unknown and may represent an inherent specificity and sensitivity of yeast formins. Our current and future experiments will express IntAct variants in absence of the ALFA nanobody and determine the level of incorporation into actin cables, patches, and actomyosin ring.

      Planned revision: We have also already performed a quantitative analysis to ascertain the effect of Sc-IntAct expression of cortical actin patch dynamics which represent sites of endocytosis in yeast (Young et al., J Cell Biol, 2004; Winter et al., Curr Biol, 1997). We compared actin cortical patch lifetimes between wildtype cells and cells expressing Sc-Act1 or Sc-IntAct as an extra copy. We used Abp1-3xmcherry as a marker for actin patches and quantified the time window between the appearance and disappearance of a patch (actin patch lifetime) from time-lapse microscopy experiments. Our preliminary results indicate that actin patch lifetimes are unaffected by exogenous expression of both Sc-Act1 or Sc-IntAct suggesting that IntAct does not negatively influence or alter actin patch dynamics. These observations suggest its applicability as a direct visualization strategy for actin at the cortical patches in budding yeast alongside existing surrogate markers like Abp1, Arc15, etc (Goode et al., Genetics, 2015; Wirshing et al., J Cell Biol, 2023).

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      *Summary: *

      This paper tackles a new strategy to tag actin in cells, by identifying that incorporation of a tag of moderate size in subdomain 4 of actin minimally affects actin dynamics in cells, and does not perturb its interaction with known partners, as observed in pull-down assays.

      *Major comments: *

      The paper is interesting and experiments are convincing.

      *My main concerns are the following : *

      - Varland et al, is reporting a phosphorylation on Thr229 : I think the authors should mention and discuss this potential PTM that could be affected in IntAct.

      We thank the Reviewer for pointing this out. We are aware of this review that includes phosphorylation on Thr229 as a possible PTM. Yet, this PTM is only reported in one of the Tables of the Review and not further discussed in the text. It is also unclear how the authors determined that Thr229 is a possible phosphorylation site except for the notion that this residue is a threonine and exposed at the surface of the actin molecule. Together with the fact that there is no evidence from primary studies that Thr229 is phosphorylated, we therefore decided to not include it in our discussion.

      - The sequence in subdomain 4 (the alpha helix containing T229A230) is extremely conserved in animals, as well as in between the 6 human actin isoforms. This usually indicates a strong selection pressure on the residues. I think the authors should discuss how surprising it is that the T229A230 position can accomodate various tags while it is probably the place of interaction with other proteins and is playing an important role in the mechanical structural integrity of the actin itself.

      We thank the Reviewer for bringing up this important point. To a certain extent, the conservation argument is true for all of the residues/domains in actin. Any manipulation will change a conserved part of the actin molecule in one way or another and thereby potentially modify its function. This is also evident from the fact that for most of the internally tagged actins, we observed a very poor colocalization with the actin cytoskeleton (Fig. 1). While for the T229/A230, we have not observed any major effects yet, this certainly does not mean that no further changes or defects will be uncovered in future experiments. Nonetheless, since our approach is unique with respect to the fact that it allows isoform-specific tagging without manipulating the N-terminus, our internal tagging system complements the already existing repertoire of actin reporting methods (N-terminal fusion, Lifeact, F-Tractin, actin nanobodies) and allows researchers to study so far unknown properties of actin variants. We have already included in the discussion that, at this point, we can only speculate as to why this variant performs much better than the others (Page 16 of the manuscript) and that possible explanations are the location at the inner domain and the higher structural plasticity of this region as compared to the rest of the molecule, as found during an alanine mutagenesis screen (Rommelaere et al., Structure, 2003).

      - It is now well established that actin plays active and important roles in the nucleus : is ALFA-actin correctly translocated to the nucleus ?

      Planned revision: This is an interesting suggestion. We will perform nuclear-cytosol fractionation experiments and determine whether ALFA-actin is still correctly translocated to the nucleus.

      *- OPTIONAL: one may regret that there is no classical in vitro assays, such as pyrene assays to assess some kinetcis parameters on epitope-tagged actins. I guess this would make the paper a bit too large. Although, it will prove useful to better understand how much formin activity is affected (see below) *

      For further biochemical characterization and a detailed investigation of the precise assembly kinetics of the tagged actins, we (KD, SP) are already working together to set up in vitro reconstitution experiments. Yet, as also indicated by the Reviewer, we consider these experiments outside of the scope of the current work.

      *Minor comments: *

      Below are points that could be addressed by the authors to improve the manuscript readability and highlight some important points that are sometimes missing or are not properly discussed:

      -line 40 "...but the distinct N-terminal epitope is not available under native conditions preventing" is a bit too obscure. Can the authors say clearly what is meant by 'native conditions'?

      In our understanding, the term ‘native’ is generally used when referring to conditions in which proteins are in their natural state, without alterations due to heat or denaturants, and possibly also still interacting with their binding partners. We will rephrase to better indicate that in this specific case, we mean that the region that harbors the N-terminus is usually occupied by actin-binding proteins, preventing the binding of the antibody due to steric hindrance.

      - figure 1A : make a clearer correspondance between the number shown in panel A and the amino acid numbers displayed in panel C and G.

      Planned revision: This is a good point, we will add extra annotation in the graph to better link the panels with each other. We will also add additional annotation in Fig. 1D-F for the same purpose.

      - figure 1A : it could be informative to indicate subdomains in this panel.

      Planned revision: We will add the numbers for the subdomains in Fig. 1A.

      - figure 1C : normalized correlation cell : I am not sure I understand how the normalization of the Pearson coefficient is done. It is therefore not clear how can it >1 or >-1 ? This should be clearly explained in the method section of the paper.

      __Planned revision: __We will better explain the normalization procedure in the Methods section.

      - figure S4 : comes a bit too early when ALFA-actin has not been yet introduced in the main text. Please, reposition this part or provide data with the FLAG-tag version.

      Planned revision: This is a good point and completely overlooked by us. We will introduce this Figure later such that the ALFA tag is already introduced.

      - section starting line 121 : this section should be better motivated = Why are different tags being tested ? This comes later in the discussion, but the reader fails at following the reasoning/motivation here.

      Planned revision: We will add extra motivation for why we added multiple tags.

      - figure 2D, line 145 "We also evaluated actin protein expression in the homozygous ALFA-β-actin cells and this showed that the total amount of β-actin was slightly lower in the ALFA-β-actin cells compared to parental HT1080 cells (Fig. 2C-D)." 'Slightly' is not a very quantitative nor accurate term. please rephrase. Besides, a statistical test for the paired data would also be informative. Besides, data in figure S6B-D indeed show a correlated increase in the expression of Gamma-actin that compensate for the decrease in the Beta-actin level in ALFA-Beta-actin. Can the authors explain why they conclude otherwise?

      Planned revision: This indeed is an important point and we will change the phrasing of this section to provide a more quantitative and accurate description of the western blot quantifications.

      - figure S7B: I am not ure anyone has ever reported measurement of angle of branched actin filament using epifluorescence microscopy. I would remove this panel, or the authors should explain how this measurement can be done objectively.

      We apologize for this misapprehension from our side which is also noted by the other two reviewers. In the treadmilling videos of the lamellipodia in HT1080 cells, which were obtained using Airyscan super-resolution microscopy, we clearly observe a consistent filament formation at a constant angle, something which we interpreted as the angle between the mother filament and the daughter filament. After consulting the literature, we indeed have to admit that this cannot be interpreted as such and we will remove these datasets.

      Planned revision: We will remove the datasets with the angle measurements (Suppl. Fig. 7A-B) from our manuscript.

      *- Figure 2F : can the authors comment on the (significant ?) lower value for FLAG-tag actin ? *

      The lower value for FLAG-tag actin has likely to do with the properties of the antibody and suitability for immunofluorescence. For reason that we do not know, we usually detect more background for the FLAG tag antibody as compared to the other antibodies/ALFA tag nanobody. Since the Pearson correlation coefficient quickly decreases with suboptimal labeling, this is likely the reason that the values for FLAG-actin are lower as compared to the other tagged actins. Importantly, in our biochemistry experiments (F/G-actin), we detect no difference between FLAG-actin and ALFA-actin indicating that it is rather the immunofluorescence and sensitive Pearson correlation analysis than the integration of actin that causes this difference.

      - line 205 "The results from these experiments show that both DIAPH1 and FMNL2 associate with ALFA-β-actin (Fig. 3D),". It is not so obvious that these formins directly interact with monomeric actin via their FH2 domains in co-immunoprecipitation assays. It might very well be mediated by the interaction with profilin, that in turn bind to the FH1 domain of formins. For me, this assay does not make a correct proof that epitope-labelled actin do not interfere with formin activity.

      Planned revision: The point that the co-immunoprecipitation does not demonstrate direct interactions between formins and actin is well taken. We, however, do not claim that this assay proofs that formin activity, or formin-based integration of actin monomers, is similar with tagged actin as compared to wildtype actin. Nonetheless, we will critically re-evaluate the relevant passages and rephrase the text to avoid any confusion.

      - figure 5C&D : both graph should use the same scale for the y-axis for easier comparison.

      Planned revision: We will adapt the scale of Fig. 5D to make it identical to Fig. 5C. Following the other suggestions of the Reviewer (and of Reviewer #1), we will also critically evaluate our normalization procedure and present those numbers in the Figures if the values turn out to be different.

      - figure 5D: I think the way the ratio is performed is misleading. Why not look at the Beta/Gamma ratio using the isoform specific antibodies used in parental cells, and show the results for ALFA-Beta-actin and for ALFA-Gamma-actin separately ?

      We kindly refer to our answer to Reviewer #1 on Page 2 for a detailed explanation on the experimental challenge of comparing the localization of wildtype and tagged actin isoforms.

      Planned revision: We will critically evaluate our normalization procedure and present those numbers in the Figures if the values turn out to be different. Furthermore, we will add a different experimental method to show that the tagged isoforms properly localize to actin-based structures. For this, we will attempt to use micropatterned cells to induce clearly define actin-bases structures and also explore the possibilities of investigating the differential localization in double-tagged cells.

      *- The limitation observed for unbranched cables in yeast that nanobody-tagged ALFA-actin does not incorporate correctly should be discussed and stressed further in the discussion, as it might prove to be a strong limitation for live-cell imaging to reliably study any type of actin networks. *

      We acknowledge the reviewer’s concern regarding the inability of ALFA-tagged actin to incorporate into yeast actin cables when NbALFA is co-expressed and will discuss this point further in the revised manuscript. We have now observed the same limitation for fission yeast actin cables as well and combined, these observations may represent a tighter control and sensitivity of yeast formins towards any perturbations in actin size (since NbALFA binds to ALFA tag with picomolar affinity). To address this issue and as also suggested by Reviewer 1, we are now creating yeast strains with inducible control of NbALFA expression under GALS/GAL1 promoters and observe the labelling of actin structures after this approach. Additionally, expression of variants of NbALFA with high dissociation rates may also allow labelling of actin cables and would be certainly worth a try in the future. A structural comparison between mammalian and yeast formins may be required to shed some light on the molecular basis of this fundamental difference.

      However, since in the absence of the nanobody, this limitation is overcome (Fig. 5E, Suppl. Fig. 14), we believe that with additional modifications and fast developments in imaging technologies, this limitation can be overcome in the future. Thus, IntAct as a labeling strategy represents an advancement over existing labelled actins with the most important aspect being the identification of the T229/A230 residue pair to be permissive for integration of various tags even as large as GFP11 fragment including a linker (26AA) (Reviewer Fig. 2). Importantly, the T229/A230 site is conserved across many organisms (such as Chlamydomonas reinhardatii, Cryptococcus neoformans, etc) and may act as a framework to study the actin cytoskeleton especially in organisms where known surrogate markers like phalloidin and Lifeact may not work or work only sub optimally.

      *Reviewer #3 (Significance (Required)): *

      *General assessment: *

      *This paper provides a new tagging strategy to monitor actin activity in cells, by specifically inserting the tag along the amino acid sequence. *

      *Advance: *

      *This is a very useful tool, as most existing available probes bind to actin in regions that are common to many other actin binding proteins. The authors provide extensive experiments to validate that tagged-actin are functional and do not perturb the actin expression level, actin network architecture nor dynamics. *

      *Audience: *

      *This research paper will be of interest to a rather broad audience (many cell biologists) that are either sutyding actin dynamics or know that actin is involved in the cell functions they study. *

      *Expertise: *

      *My expertise is in vitro actin biochemistry. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This paper tackles a new strategy to tag actin in cells, by identifying that incorporation of a tag of moderate size in subdomain 4 of actin minimally affects actin dynamics in cells, and does not perturb its interaction with known partners, as observed in pull-down assays.

      Major comments:

      The paper is interesting and experiments are convincing.

      My main concerns are the following :

      • Varland et al, is reporting a phosphorylation on Thr229 : I think the authors should mention and discuss this potential PTM that could be affected in IntAct.
      • The sequence in subdomain 4 (the alpha helix containing T229A230) is extremely conserved in animals, as well as in between the 6 human actin isoforms. This usually indicates a strong selection pressure on the residues. I think the authors should discuss how surprising it is that the T229A230 position can accomodate various tags while it is probably the place of interaction with other proteins and is playing an important role in the mechanical structural integrity of the actin itself.
      • It is now well established that actin plays active and important roles in the nucleus : is ALFA-actin correctly translocated to the nucleus ?
      • OPTIONAL: one may regret that there is no classical in vitro assays, such as pyrene assays to assess some kinetcis parameters on epitope-tagged actins. I guess this would make the paper a bit too large. Although, it will prove useful to better understand how much formin activity is affected (see below)

      Minor comments:

      Below are points that could be addressed by the authors to improve the manuscript readability and highlight some important points that are sometimes missing or are not properly discussed :

      • line 40 "...but the distinct N-terminal epitope is not available under native conditions preventing" is a bit too obscure. Can the authors say clearly what is meant by 'native conditions' ?
      • figure 1A : make a clearer correspondance between the number shown in panel A and the amino acid numbers displayed in panel C and G.
      • figure 1A : it could be informative to indicate subdomains in this panel.
      • figure 1C : normalized correlation cell : I am not sure I understand how the normalization of the Pearson coefficient is done. It is therefore not clear how can it >1 or >-1 ? This should be clearly explained in the method section of the paper.
      • figure S4 : comes a bit too early when ALFA-actin has not been yet introduced in the main text. Please, reposition this part or provide data with the FLAG-tag version.
      • section starting line 121 : this section should be better motivated = Why are different tags being tested ? This comes later in the discussion, but the reader fails at following the reasoning/motivation here.
      • figure 2D, line 145 "We also evaluated actin protein expression in the homozygous ALFA-β-actin cells and this showed that the total amount of β-actin was slightly lower in the ALFA-β-actin cells compared to parental HT1080 cells (Fig. 2C-D)." 'Slightly' is not a very quantitative nor accurate term. please rephrase. Besides, a statistical test for the paired data would also be informative. Besides, data in figure S6B-D indeed show a correlated increase in the expression of Gamma-actin that compensate for the decrease in the Beta-actin level in ALFA-Beta-actin. Can the authors explain why they conclude otherwise ?
      • figure S7B: I am not ure anyone has ever reported measurement of angle of branched actin filament using epifluorescence microscopy. I would remove this panel, or the authors should explain how this measurement can be done objectively.
      • Figure 2F : can the authors comment on the (significant ?) lower value for FLAG-tag actin ?
      • line 205 "The results from these experimentsshow that both DIAPH1 and FMNL2 associate with ALFA-β-actin (Fig. 3D),". It is not so obvious that these formins directly interact with monomeric actin via their FH2 domains in co-immunoprecipitation assays. It might very well be mediated by the interaction with profilin, that in turn bind to the FH1 domain of formins. For me, this assay does not make a correct proof that epitope-labelled actin do not interfere with formin activity.
      • figure 5C&D : both graph should use the same scale for the y-axis for easier comparison.
      • figure 5D: I think the way the ratio is performed is misleading. Why not look at the Beta/Gamma ratio using the isoform specific antibodies used in parental cells, and show the results for ALFA-Beta-actin and for ALFA-Gamma-actin separately ?
      • The limitation observed for unbranched cables in yeast that nanobody-tagged ALFA-actin does not incorporate correctly should be discussed and stressed further in the discussion, as it might prove to be a strong limitation for live-cell imaging to reliably study any type of actin networks.

      Significance

      General assessment:

      This paper provides a new tagging strategy to monitor actin activity in cells, by specifically inserting the tag along the amino acid sequence.

      Advance:

      This is a very useful tool, as most existing available probes bind to actin in regions that are common to many other actin binding proteins. The authors provide extensive experiments to validate that tagged-actin are functional and do not perturb the actin expression level, actin network architecture nor dynamics.

      Audience:

      This research paper will be of interest to a rather broad audience (many cell biologists) that are either sutyding actin dynamics or know that actin is involved in the cell functions they study.

      Expertise:

      My expertise is in vitro actin biochemistry.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Actin is highly sensitive to modifications, and tagging it with fluorescent proteins or even smaller motifs can affect its function. The most well-known example of this is that fission yeast where actin has been replaced with GFP-actin are inviable (Wu and Pollard, Science 2005) because the labeled actin cannot incorporate into the formin-dependent filaments that make up the cytokinetic ring. Subsequent experiments revealed that formins filter out GFP-actin monomers, as well as monomers that are labeled with smaller fluorescent motifs (Chen et al, J. Structural Biology 2012). Further, attempts to make mammalian cells lines where GFP-beta-actin was knocked into one allele resulted in extreme down-regulation of the GFP-labeled actin, indicating that there is some implicit toxicity with the labeled version. To my knowledge, all attempts at making homozygous GFP-actin knock-ins have been unsuccessful. Therefore, while GFP-actin or other labeled variants can be over-expressed in many different cell types with some success, there is always the question of how faithful the labeled actin represents bona fide actin localization and dynamics.

      To address this van Zwam et al. have developed a clever strategy of screening actin for internal motifs that can tolerate incorporation of a tag without affecting its function. They appear to have found a good candidate, named IntAct, and provide evidence that this tagging position allows the actin to be functional in both human and yeast cells. The work is very promising, and many of the assays performed satisfy the criteria of rigor and reproducibility. Importantly, the authors have created knock-in human cell lines where the tagged actin is expressed at normal levels, including a double allele knock-in that is viable and has normal proliferation and motility. Additionally, the authors show that labeled S. cerevisiae actin can incorporate into actin cables, which are formin dependent. IntAct constructs were shown to interact with several well-known actin binding proteins and localized well to many different actin structures. There was also interesting data obtained from tagging both beta and gamma actin in human cells. However, as an actin scientist eager for new probes to visualize actin in cells, there are still questions about the functionality of these probes. Addressing these issues, listed below, would alleviate the concerns I still have about IntActs after going through the manuscript. IntActs have the potential to have a large impact on cytoskeletal research if it can be rigorously documented that they are functionally as close to unlabeled actin as possible.

      Significance

      Concerns:

      1. There are no negative controls performed for either the fixed or live-cell imaging of IntAct. Since the fixed cell data is heavily reliant on the presence of flag-labeled puncta at actin filaments, it is important to show that the immunocytochemistry protocol doesn't produce anything that would mimic the localization of actin. For the live cell data, there has been no effort made to show that the binding of the nanobody to the ALFA tag on InAct is specific.
      2. The homozygous ALFA-tagged IntAct cells have a 50% reduction in the amount of actin expression (Fig. 2D). What is the F:G ratio in these cells? The F:G measurement is only shown for the FLAG-tagged heterozygous IntAct cells, which have the worst co-localization with phalloidin (Fig. 2F) and were not used for subsequent figures. I appreciate that motility and proliferation were measured and shown to not be affected (Fig. 4D,E) , but in our lab reducing the amount of polymerized actin by 50% (which may be more in ALFA-tagged IntAct cells if the F:G changes) has catastrophic effects on other cytoskeletal and organelle systems. Since the homozygous ALFA IntAct cells are the main ones used in the manuscript, they should be the ones that are fully characterized.
      3. It is not addressed if expressing the ALFA-Nb-GFP construct in ALFA-IntAct cells alter actin properties? This is essential information for live cell imaging experiments.
      4. It is not addressed how much of the ALFA-IntAct gets labeled with ALFA-Nb-GFP and how uniform the labelling.
      5. To assess lamellapodia architecture, "branched actin angle" is measured using AiryScan imaging of actin filaments. This type of microscopy does not offer the ability to image individual actin filaments; what is actually being measured is the orientation of actin bundles to each other. It should be impossible to image the orientation of actin filaments in Arp2/3 dendritic networks and it is surprising that the measurements average to 70 degrees. A suitable substitute for this would be to measure the size and amount of F-actin in phalloidin-stained lamellipodia using kymograph analysis.
      6. Was it possible to make an IntAct gene substitution in yeast?

      Also, while this is not necessary for this manuscript, making a fission yeast strain where actin has been substituted with IntAct and demonstrating that IntAct gets incorporated into the cytoplasmic ring and into Cdc12p-polymerized filaments would alleviate MANY potential concerns people would have about these probes by directly assessing situations were other labeled actins have been documented to fail. Along the same lines, it would have been nice to see a comparison in some of the assays of ALFA-IntAct and GFP-actin or another labeled actin variant.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, the authors generate several variants of actin that are internally tagged with short peptide tags. They identify one particular position that is able to tolerate various tags of 5-10 amino acids and still shows largely unaltered behavior in cells. They study incorporation of their tagged actins into filaments, characterize the interactions of G-actin variants with different associated proteins and show that retrograde actin flow in lamellipodia and the wound healing response of epithelial cells is not affected by the tagged variants. They then apply the tagged actin to study subcellular distribution of different actin isoforms in mammalian and yeast cells.

      The identification of a specific site in the actin protein that tolerates variable peptide insertions is very exciting and of fundamental interest for all research fields that deal with cytoskeletal rearrangements and cellular morphogenesis. The result demonstrating the functionality of actin variants with peptides inserted between aa 229 and 230 are generally convincing and well done. In particular, the generation of CRISPR/Cas9 genome edited versions of beta- and gamma actin are impressive. I therefore generally support publication of this study. There are however several technical and conceptual issues that should be addressed to improve quality and scope of the study. I listed some specific comments below:

      Major points

      • The biggest issue I have is the last section on the application of tagged actins to study isoform functions. In principle the application is very clear as there are simply no alternative ways to study isoform distribution in live cells. However, the experimental data are simply not convincing. What the authors define as "cortex" in Fig. 5A seems to rather represent cytosolic background mixed with radial fibers. I am not convinced that even the antibody staining with a relatively clear differential distribution of beta and gamma really shows a genuine accumulation of one isoform on stress fibers. It seems to me that the beta-actin staining has as higher cytosolic background and is generally weaker (gamma nicely labels transverse arcs), which reduces signal/noise and therefore yields a relatively increased level in areas with less-bundled actin. My suggestion is to select more clearly defined actin structures and to use micro-patterned cells to normalize the otherwise obstructing variability in actin organization. Possible structures would be cortical arcs in bow-shaped cells, lamellipodial edges (HT1080 seem to make very nice and large lamellipodia) or cell-cell contacts (confluent monolayer, provided cells don´t grow on top of each other). Stress fibers are possible but need to be segmented very precisely and I did not see any details on this in the methods section. For Fig. 5D: I assume cells were used where only one isoform was tagged? This is technical weak and the double-normalization is probably blurring any difference that might be occurring. Why not use a double-tagging strategy with ALFA/FLAG or ALFA/AU5 tags to exploit the constructs introduced in the previous figures? Also, the unique selling point of the strategy is the possibility of actual live imaging of specific isoforms. Cells that have stably integrated double tags and then transiently express nanobodies for ALFA and either AU5 or FLAG (or other if those don't exist) would make this possible. Considering the work already done in this manuscript, such an approach should actually be possible - did the authors attempt this or is there are reason it is not discussed? If double tagged cells are not possible for some reason it should at the very least be possible to combine ALFA-detection with the specific antibody against the other isoform and get rid of the double normalization.
      • The authors make a point of comparing the internally tagged actin to N-terminal tags that are mostly functional but have been shown to affect translational efficiency. I would strongly suggest to include N-terminally tagged actin as control for all assays in this study. Also for the physiological assays (retrograde flow, wound healing), a positive control is missing that shows some effect. Previous studies showed defects with transiently expressed actin with an N-terminal GFP. As retrograde flow measurements are very sensitive to the exact position of the kymographs and wound healing assays is a very crude and indirect readout, such a positive control is essential.
      • Expression of tagged actins in yeast is a very nice idea but it would be far more informative to express the tagged forms as the only copy of actin. This can either be done by directly replacing endogenous actin gene in S. cerevisiae, or (if the tagged versions are not viable) - using the established plasmid shuffle system (express actin on counter-selectable plasmid, then knock out endogenous copy and introduce additional plasmid with tagged actin, then force original plasmid out). In the presence of endogenous S. cerevisiae actin the shown effects are very hard to interpret as nothing is known about relative protein levels (endogenous vs. introduced). Also, if constitutive expression of the ALFA nanobody is harmful for integration into cables, why not perform inducible expression of the nanobody and observe labeling after induction. For the live imaging a robust cable marker is needed, like Abp140-GFP. Finally, indicate the sequence differences between the used actin forms in yeast (supplementary figure with sequence alignment and clear indication of all variations)
      • As the authors clearly show good integration of several tagged actins into filaments I would expand the structural characterization: perform alpha fold predictions of actin monomer structures including the various tags to show the expected orientation. It is striking that the only integration site that seems to work well is at the last position of a short helix, indicating that the orientation of the integrated peptide might be fixed in space and be optimal to minimize interference. Also, a docking of the tag onto the recently published cryoEM structures of the actin filament should be shown to indicate where it resides compared to tropomyosin or the major groove where most side binding proteins seem to bind.
      • For any claims regarding usability of tagged variants for isoform research it would be very important to characterize the known posttranslational modifications of tagged actin variants - are the differences between beta and gamma maintained on this level as well?

      Technical issues

      • There is no scale for the color coding in Fig. 5A, B
      • The y-scales for Fig. 5C and D need to be identical to allow direct comparison
      • Pearson coefficient should not be normalized to a control value as its already a dimensionless parameter. Always report actual R-value - also remove R2 values for Pearson as this makes no sense in this context (not sure if it was a typo or intended).
      • All values on subcellular regions (like stress fiber or cortex) dependet critically on the way thesese regions were thresholded or identified. Provide all details on how this was done in the methods section and ensure that adequate background subtraction and normalization is applied. Optimally, an unbiased (AI or automated) approach based on simple image statistics is used for this to avoid personal bias.
      • In Fig. 2A only heterozygous FLAG-actin cells are used. Why not use a homozygous line (for both beta and gamma actin)? The nice band shift of the FLAG version would allow the precise quantification of the fraction of total actin covered by beta and gamma actin, which then could provide some additional info for the apparently weaker beta staining in Fig. 5 (if beta expression is simply weaker). This would be a very simple and useful advantage of the internal tags that could be widely applied.
      • Fig. 3: control with N-terminal tag is missing. Also, why is it not possible to assay filament binding factors like Myosin, Filamin or alpha actinin - instead of co-IP a simple co-sedimentation assay with cell extracts in F-buffer should pick up any major difference in decoration of filaments containing the ALFA tag. Using two speeds for centrifugation it might even be possible to observe effects on filament bundling. The best approach for this would of course be to purify tagged actins and perform in vitro assays but this is clearly beyond the scope of what the authors intended here. I personally think that a broad acceptance of the marker will only come once the biochemistry has been sufficiently characterized so this is a future direction I would strongly encourage.
      • Fig. 2A has no loading control -
      • The RPE-1 data are confusing as several constructs show very different localization (completely cytosolic) to HT1080 cells and there is no possible explanation given for this. Maybe simply remove this data set?
      • The angel measurements for lamellipodial actin is not very meaningful: the angel is determined for the radial bundles, which do not correspond to the Arp2/3 angel of single filaments and is likely the results of different nucleation factors, I would suggest to remove this. If angel measurement are really intended, cryoEM needs to be performed.
      • Replace all SEM with SD values - use at least 3 biological replicates (4D SEM of n=2)

      Minor points

      • Intro: after listing all the details already understood on actin isoforms it is not very convincing to simply state the molecular principles remain largely unclear (l 34) - maybe better "there is no way to study actin dynamics due to current limitations of specific antibodies to fixed samples. Interesting option would be actually to develop nanobodies that are isoform specific 
      • L 71: "involved" in the kinetics is not a good term - maybe affects or regulates....
      • L148: "suspect" instead of "expect" - this clonal variation is actually a big danger of the employed approach as possible defects in actin organization could be masked by compensatory changes - it would generally be good to show critical data for at least 3 independent clones to rule out dominant selection effects.

      Referees cross-commenting

      I completely agree with the comments by reviewer 2 on the various missing controls - adding several or all of those will make the results much more convincing. The key for the adaptation of any new actin probe will be the level of confidence researchers have on the doumented effects. Even some negative effects on actin behavior (I am sure there will be some) should not prevent usage of the strategy as long as there is robust and convincing documentation of those effects. I also agree that including some basic in vitro characterization will go a long way to convince people dierectly working on actin (there is a very high level of biochemical understanding in that field).

      Significance

      Significance: Very useful finding that can be applied to any question related to actin-dependent cellular processes (morphogenesis, cell division, cell polarization, cell migration etc.)

      Strength: main finding convincing, strong genome edited cell lines

      Limitations: application to study of isoforms very limited and data not convincing, statistics and image quantifications need improvement

      Advance: identify new location for integral tagging of actin, which was not really possible before. The main relevance is for fundamental cell biology but the approach can also be applied to the study of disease variants in actin.

      Audience: general cell biology - very broad interest

  2. Aug 2023
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their time, the positive reviews and the useful comments. We answer below and explain the changes made to the manuscript. The comments of the reviewers are in italics.

      Reviewer #1

      1. 'For GWAS, the strains that were fertile after 20 generations were considered non-Mrt.' One aspect of Fig 1D that could be clarified are the dots at generation 21. If these represent strains that were always fertile at generation 21, then perhaps give these a different color to indicate that sterility was never observed?

      Response: This is a good idea. We added colors in Figure 1, which makes it clearer.

      We also provide a different color for surviving replicates in all relevant figures.

      1. 'The mean Mrt values of strains ranged from sterile at 3 generations to fertile after 20 generations at 25°C, with a skewed distribution toward high values (Figure 1B).' Based on Table S2, part of the explanation for this skewed distribution in later generations is that some strains became sterile rapidly for some blocks, whereas the same strain did not become sterile in other blocks. For example, JU1200, JU360, PB303. I suggest providing a second color for Fig. 1D for strains that sometimes displayed sterility and sometimes did not.

      __Response: __We now colored the isolates that never became sterile, with the same color code as in panel B. Because we stopped the scoring at G20 and code fertility at G20 as '21', those with a mean below 21 show some sterility in at least one case.

      Because the number of generations at which we stopped the phenotyping (20) is arbitrary, the fact a line stayed fertile at 20 generations in one replicate is not very meaningful, especially considering that the number of replicates is not the same for all strains. The key point of the variance graph is to show that the strains with the most variance are those with high but

      For those that were sometimes fertile and sometimes sterile, I suggest creating a graph in Figure 1 that shows generations at sterility or lack of sterility, color coded by block. This will allow the significance of strains with high generation Mrt values to be better appreciated for readers who do not look at the supplementary table.

      __Response: __Yes, we added this graph in Figure S1. This is indeed useful.

      1. The GWAS section could benefit from a simple explanation of the premise of GWAS for non-specialist readers.

      __Response: __Yes, we added: "A genome-wide association study (GWAS) is a genetic mapping that uses the natural diversity of a panel of organisms of a given species to test for statistical independence between the allelic state of polymorphic markers and the phenotype of interest (Andersen and Rockman 2022). A statistical association between the marker and the phenotype indicates that a polymorphism tightly linked to the marker in the data (i.e. in linkage disequilibrium with it) causes the variation in phenotype. For statistical reasons, GWAS can only detect polymorphisms that are at intermediate frequencies in the panel, i.e. cases where both alleles occur at frequencies higher than 5%. We only used such polymorphisms in the GWAS (see Methods)."

      And further down:

      "To diminish the multiple testing burden, the initial analysis in Figure 1E used a restricted set of markers, after pruning those that were in high linkage to each other."

      1. One problem might be that the Mrt phenotype is widespread among wild strains. To the authors' credit, they consider results observed in different laboratories as valid, even when the results do not agree. If the Mrt phenotype is influenced by the environment, then some laboratory environments might result in 'false negative' Mrt results that could be ignored in favor of positive results from another lab that appear strong. Might focusing on strains with a set of strong positive results from one lab allow the authors to draw stronger GWAS conclusions?

      2. The authors' perform GWAS based on the variance of the Mrt phenotype data. Would the GWAS data be more illuminating if the authors only considered strains that become sterile fairly rapidly, within 10 generations. The authors might then have a second category that included strains that become sterile from generation 11-20. If the genetic basis for the Mrt phenotypes is the same, then GWAS of strains that become sterile in less than 10 generations might yield similar peaks as GWAS for strains that become sterile between generations 11-20.

      __Response: __These two comments are strongly related so we answer them together. Note that the GWAS is not mapping the variance values but the Mrt values themselves.

      We actually initially only used block 1 (a single replicate, all strains performed in parallel in our laboratory) and also detected the chromosome III association using a categorical variable (threshold at 11), but decided to show the results with all data to maximize power, taking into account the generation value and block effects.

      We investigated other ways to code the data (e.g. categorically) and removing the strains of the most variable middle category, as proposed by the reviewer. This changed the p values and the rank of the markers on chromosome III but not the overall result.

      In summary, we did a variety of tests, which pointed to chromosome III, a region that was validated using crosses (Figure 2).

      Note that in the revision, we updated the GWAS plot and fine mapping table as we noticed a few problems in our previous mapping. 1) We removed 3 isolates that were classified in Lee et al. 2021 as divergent. 2) We included strains that had been lost in the pipeline because their names did not match CeNDR isotypes. This increased the significance of the chromosome III peak.

      __Response: __There was no comment 6.

      1. 'We did not investigate whether a second locus present in JU775 on the right arm of Chr III might have a lesser effect.'

      __Response: __We are not sure what the reviewer meant. Considering the difficulties with the stronger effect locus, we did not try to study loci with a weaker effect.

      1. It might be interesting to test the memory of growth on beneficial bacteria on JU4134, which had a Mrt phenotype that was strongly suppressed by the beneficial bacteria.

      __Response: __We agree that testing other strains would be useful but given the duration of such experiments (30 generations and two weeks of preparation before), we respectfully decline to perform this experiment that does not seem strictly necessary.

      1. The Mrt phenotype of mutants in small RNA inheritance and histone modifying enzymes 'appears however distinct from that of the prg-1/piwi mutant (for which the cause of sterility is debated), especially the latter does not show temperature dependence and is suppressed by starvation.' While it is true that the cause of sterility is debated for the prg-1/piwi mutant, this mutant is defective for small RNA silencing and likely has parallels with some defects in histone modifying enzymes. Anecdotal reports suggest that starvation might affect the Mrt phenotype or longevity of histone modifying enzyme mutants. Moreover, the cause of sterility is not clear for small RNA inheritance and histone modifying enzyme mutants. It is fair to say that the distinction between temperature-sensitivity or lack of temperature sensitivity of small RNA mutants is not understood. Could the authors please comment here about whether any of the wild strains display sterility at 20°C.

      __Response: __The temperature-dependence of the wild isolates is progressive between 20-25°C. We previously showed that strains with a very strong Mrt phenotype, such as QX1211, can display sterility at 20°C (Figure 1B in Frézal et al. 2018). However, its Mrt phenotype is still temperature-dependent as the sterility occurs much earlier at 25°C.

      1. If intracellular bacteria are simply somatic, then how is it that they are transmitted to progeny. If they are released into the environment and then consumed by hatched larvae, this is soma-to-soma transmission.

      __Response: __These microsporidia (which are eukaryotes related to fungi) are indeed transmitted horizontally. To make this clear, we added: "colonizing its intestinal cells and being transmitted horizontally via defecation and ingestion of spores". The soma-to-germline interaction concerns the effect of microsporidia on germline maintenance.

      Minor: 1. 'We measured the mortal germline (Mrt) phenotype'. Mortal Germline (Mrt)

      __Response: __It is unclear as to whether phenotypes start with a capital letter when they are in full words. We did write phenotypes in previous works with a capital letter but have changed because C. elegans nomenclature rules (https://cgc.umn.edu/nomenclature) suggest that they should not: "Phenotypic characteristics can be described in words, e.g., dumpy animals or uncoordinated animals." For the mortal germline phenotype in particular, we find several ways to write it in articles (with 0, 1 or 2 capital letters, including the three reviewers). We are happy to change it if required.

      Reviewer #2

      Major comments: The authors claimed that the variants causing Mrt exist at intermediate frequency in the natural population but the evidence supporting this claim is rather limited.

      __Response: __Thank you for this comment as it helped us clarify the manuscript.

      To better explain the notion of intermediate frequency in the GWAS, we added an explanation of the principle of the GWAS (see above) and again in the Discussion: "The intermediate frequency of the candidate alleles derives from the GWAS approach, which cannot detect rare alleles, such as set-24, that are present in a single strain of the dataset."

      We also illustrated the frequency by adding a plot (Fig. 1F) showing the association of the most associated candidate SNP, with a visual depiction of the frequency. We further added in Results: "For SNPs with a high significance (p-4) in the fine mapping, the frequency of the Mrt associated allele was comprised between 21 and 41% in our GWAS strain set (Table S3); as an example, the Mrt allele of the associated SNP shown in Figure 1F (III:4677491) displayed a frequency of 29% in the restricted strain set. Over the global wild strain set with genotypes at CeNDR in 2020, these numbers are 17-58% and 39%, respectively. "

      To strengthen the claim, the authors should examine the distribution and frequency (perhaps coupled with phylogenetic analysis) of the Ch III haplotype in the wild isolates. The authors should also examine the GWAS peak for the signature of balancing selection (e.g., dN/dS ratio).

      __Response: __Thank you for this comment. The different associated SNPs in Table S3 differ in their allele frequency (Table S3), hence they belong to different haplotypes. We added a supplementary Figure S2 with an analysis of the haplotype structure. Those at a low frequency (around 20%) belong to the same haplotype (e.g. JU775 and MY10) but some associated alleles are present in more haplotypes (40-50%), such as JU1793. Even if we neglect recombination, the history of mutations in the region is complex and there is not a single associated haplotype. We now show the genotypes of these different haplotypes at all SNPs in Table S3. We also added Table S4 that shows the co-occurrence of relevant haplotypes in local populations.

      Concerning tests of balancing selection, without knowing the causal polymorphism and linked haplotype, this is far reaching. We only feel confident to say that the causal polymorphism(s) is present at a significant frequency. We added however the fact that irrespective of which polymorphisms are causal, both alleles were found to coexist locally.

      Results: relevant text was added at the end of the GWAS section.

      Discussion: "The co-occurrence of relevant chromosome III haplotypes on multiple continents and in local populations (Table S4) is suggestive of balancing selection; however, a linked locus other than that causing the Mrt phenotype may be involved."

      Does JU775 carry polymorphisms in genes that are known to be involved in Mrt? These genes may genetically interact with the Ch III variant, as suggested by the partial penetrant phenotypes of the introgressed lines. It would be helpful to have a table summarize the variation in these genes.

      __Response: __It is difficult to deduce much from a genomic variant analysis, so we refrain from showing tables of polymorphisms beyond that used for the fine GWAS mapping in Table S3. For example, a non-synonymous SNP may or may not alter protein activity and cis-regulatory elements are difficult to assess. Moreover, an obviously null allele may be compensated by another polymorphism in the background. The JU775 alleles and bam files are publically available from CeNDR (Erik Andersen's lab): https://caendr.org/data/data-release/c-elegans/latest

      It is curious to me that for experiments with HT115, the expression of the RNAi vectors was induced with IPTG. Is this step necessary? It is known that even the backbone of L4440 could trigger a non-specific RNAi response (PMID: 30838421). I wonder if activating exogenous RNAi response is required for Mrt rescue.

      __Response: __Indeed: this experiment was initially aimed at testing RNAi sensitivity of JU775, thus IPTG was added on the plate (Figure 7, panel B). We therefore repeated the memory experiment with OP50 and without IPTG, with a similar result (Figure 7, panel A).

      In figure 7, it appears that the worms transferred from MG1655/HT115 to OP50 showed an even stronger rescue (higher Mrt value) than the ones constantly on MG1655/HT115. This suggests to me that fluctuations in food composition may strongly affect epigenetic inheritance. Please clarify as this is very interesting, if true.

      __Response: __Note: This answers the comment above (IPTG is not required).

      We indeed noticed this strong rescue but do not wish to make a point as we did no attempt to reproduce this result in the exact same conditions. The experiment in panel B does not show this effect.

      Optional - Numerous studies have shown that SKN-1 regulates metabolism in response to food composition and availability (PMID: 23040073). Additionally, some recent studies have indicated a role of SKN-1 in epigenetic inheritance triggered by exogenous RNAi. In particular, SKN-1 promotes stress-induced epigenetic resetting (PMID: 33729152). I wonder if SKN-1 modulates Mrt based on bacterial diet.

      __Response: __We tested skn-1b/c hypomorphic and gain-of-function mutants in the N2 background on E. coli OP50 and did not see an effect of the skn-1 allele.

      Minor comments Line 47: typo "...they defined..."

      __Response: __We did mean "thus defined".

      Line 100-101: weird sentence structure. Please consider rephrasing.

      __Response: __We simplified to "a wild C. elegans strain can keep the memory of its culture on a suppressing bacterial strain."

      Line 138-139: I don't quite understand what "intermediate-frequency chromosome III alleles" means here. Some SNPs were found in Ch III 4-6Mb? Please expand.

      __Response: __We rephrased to: "because this isolate carries the chromosome III alleles associated in the GWAS analysis with the Mrt phenotype (Table S3)."

      Line 213 - it was unclear to me why the assay was performed at 23C instead of 25C. I later learned in the method section that microsporidia cannot be cultured at 25C. I think it will be helpful to add that information when microsporidia is introduced to improve clarity.

      __Response: __We added: " We used a temperature of 23°C because these microsporidia kill C. elegans too rapidly at 25°C."

      Reviewer #3.

      Minor points 1. Could the authors please define "experimental blocks"

      __Response: __We added the following sentence in Results: "Each Mrt assay started at a certain date constitutes an experimental block."

      1. Legend to supplementary snp table should be completed: define AF, impact, modifier, moderate, AA1, AA2...

      __Response: __This is added in the first sheet of the table. We also simplified the table and removed some of these columns.

      1. Please define "intermediate-frequency allele"

      __Response: __We added in Results: "GWAS can only detect polymorphisms that are at intermediate frequencies in the panel, i.e. cases where both alleles occur at frequencies higher than 5%." We also added below: " "For SNPs with a high significance (p-4) in the fine mapping, the frequency of the Mrt associated allele was comprised between 21 and 41% in our GWAS strain set (Table S3); as an example, the Mrt allele of the associated SNP shown in Figure 1F (III:4677491) displayed a frequency of 29% in the restricted strain set."

      1. Figure 7 legend: Authors should be more specific in describing the figure: After 10 (A panel), 13 or 20 generations (B panel) on the K-12 strain... What is E. coli OP50 start 'G10'? the 15° stock?

      __Response: __We changed to: " After 10 (A panel), 13 or 20 generations (B panel) on the K-12 strain" and added some details in:

      "A control from a 15°C culture maintained without starvation ("15°C stock") was bleached in parallel (labeled "E. coli OP50 start "G10" " in the graph of panel A)."

      Optional: Did the authors attempt to rescue the Mrt phenotype with individual metabolites (eg Vit B12...)? These are not straight forward experiments and most likely part of a future study.

      __Response: __We indeed tested several metabolites that are known to differ in C. elegans raised on E. coli OP50 versus K-12 strains for their effect on the Mrt phenotype. None was able to rescue the mortal germline phenotype. However, especially in these long multigenerational experiments, it is difficult to know whether the metabolites are stable. We monitored vitamin B12 activity by using an acdh-1::GFP reporter that is known to be repressed by vitamin B12 - so we are confident of this negative result, which we now show in Figure S4. As cell wall lipopolysaccharide (LPS) differ between E. coli K-12 and B strains, we also tested the E. coli LPS mutants, which had no eff

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      • The nematode C. elegans is at the forefront of research on transgenerational epigenetic inheritance. In this work the authors studied the effects of natural genetic variations on multigenerational inheritance, using the temperature-sensitive Mortal germline phenotype (Mrt) as a paradigm in C. elegans. In ts Mrt mutants, animals become progressively sterile at 25{degree sign}C (stressful temperature) over subsequent generations and, importantly, this phenotype is reversible. The present study originated from the authors' previous observation that multiple C. elegans wild isolates display a ts-Mrt phenotype when cultured in the lab, raising the question of whether this intrinsically deleterious phenotype may be suppressed in the wild, and how natural genetic variation affects this phenotype.

      • By comparing 132 wild isolates of C. elegans, the authors found a wide distribution in ts-Mrt phenotypes ranging from 3 to 20 generations to reach sterility at 25{degree sign}C. The variance among a restrictive set of 115 replicates was low for strong Mrt values and high at intermediate trait values. Given this distribution, the authors analyzed the data using generalized linear mixed models. This reviewer is unable to evaluate the appropriateness of these models. They then performed GWAS mapping combined with analysis of introgression lines and identified a QTL on chromosome III between 4.66 and 6 .49Mb that includes a number of potentially interesting candidates that were not further analyzed in this work.

      • Because the authors noticed that the Mrt phenotype commonly appears after bleaching the culture, a treatment that kills associated microbes, they then tested the impact of naturally associated microbes on the Mrt phenotype. They found that freshly isolated strains such as JU3224 could be propagated for more than 20 generations at 25{degree sign}C with their associated microbes, while after bleaching on OP50 (bacteria commonly used in lab culture) they developed a Mrt phenotype at 25{degree sign}. They then fed the isolates with naturally associated bacteria isolated in the lab-either their own or from other isolates. Reassociation of single bacterial clones, or a mix of these, fully or partially rescued the Mrt phenotype. Importantly, bacteria isolated from one strain was able to rescue the Mrt of another strain, suggesting common mechanisms of action in rescuing the Mrt phenotype. Surprisingly Microsporidia, usually detrimental to C. elegans, also rescued the Mrt phenotype. These results show that infection of somatic tissues can influence the germline.

      • ts Mrt mutations so far identified affect nuclear small RNA pathways, small RNA amplification and histone modifications in the germline. The authors further show that the Mrt phenotype of laboratory mutants in small RNA inheritance or chromatin factors such as the set-2 histone methyltrasferase is also suppressed by culture on bacteria other than E. coli OP50.

      • Finally, the authors tested whether animals have a memory of their past bacterial environment by shifting animals of the C. elegans JU775 strain that had been cultured for several generations at 25{degree sign}C on an E. coli K-12 strain (on which their Mrt phenotype was suppressed) to the laboratory E. coli OP50, which usually reveals the Mrt phenotype. Lines that were propagated for 10-20 generations at 25{degree sign}C on an E. coli K-12 strain (MG1655 or HT115) showed a rescued phenotype when transferred back on OP50, consistent with a multigenerational memory of the bacterial environment.

      • All experiments are well executed, clearly presented and of the highest standard.

      Significance

      C. elegans is an excellent model system to study transgenerational inheritance. However, most studies on epigenetic inheritance in this system are carried out under standard laboratory conditions, and the phenotypes followed often not very robust (stress resistance, longevity..) raising questions as to their interpretation. This work is an important contribution to the field because it reveals how a widely studied phenotype (the Mrt phenotype) relates to natural isolates. The results reported demonstrate a clear link between the environment and the multigenerational transmission of non-genetic information. They also raise interesting questions on the ability of a species to transiently provide environmental cues to a variable number of generations. Finally, these results offer hints that that the Mrt phenotype may result from inherited metabolic changes, as observed using other experimental paradigms in C. elegans, including starvation. This work will therefore be of interest to a wide audience interested in epigenetic inheritance and the environment, soma-germline communication, and host pathogen interactions.

      Minor points

      1. Could the authors please define "experimental blocks"

      2. Legend to supplementary snp table should be completed: define AF, impact, modifier, moderate, AA1, AA2...

      3. Please define "intermediate-frequency allele"

      4. Figure 7 legend: Authors should be more specific in describing the figure: After 10 (A panel), 13 or 20 generations (B panel) on the K-12 strain... What is E. coli OP50 start 'G10'? the 15{degree sign} stock?

      Optional:

      Did the authors attempt to rescue the Mrt phenotype with individual metabolites (eg Vit B12...)? These are not straight forward experiments and most likely part of a future study.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Frézal et al. reported novel interactions between microbes and C. elegans in the regulation of epigenetic inheritance. By screening for 132 isotypes, the authors found natural genetic variants that contribute to the mortal germline (Mrt) phenotype. The authors further found that naturally associated gut bacteria, microsporidia, and E. coli K12 could rescue Mrt phenotype in wild isolates as well as in epigenetic mutants. Finally, the authors showed that the epigenetic memory of bacterial environment could propagate transgenerationally. I find this paper highly intriguing as it provides valuable insights into the impact of the environment on epigenetic inheritance and its effects on evolution within ecologically relevant contexts.

      Major comments:

      • The authors claimed that the variants causing Mrt exist at intermediate frequency in the natural population but the evidence supporting this claim is rather limited. To strengthen the claim, the authors should examine the distribution and frequency (perhaps coupled with phylogenetic analysis) of the Ch III haplotype in the wild isolates. The authors should also examine the GWAS peak for the signature of balancing selection (e.g., dN/dS ratio).

      • Does JU775 carry polymorphisms in genes that are known to be involved in Mrt? These genes may genetically interact with the Ch III variant, as suggested by the partial penetrant phenotypes of the introgressed lines. It would be helpful to have a table summarize the variation in these genes. <br /> It is curious to me that for experiments with HT115, the expression of the RNAi vectors was induced with IPTG. Is this step necessary? It is known that even the backbone of L4440 could trigger a non-specific RNAi response (PMID: 30838421). I wonder if activating exogenous RNAi response is required for Mrt rescue.

      • In figure 7, it appears that the worms transferred from MG1655/HT115 to OP50 showed an even stronger rescue (higher Mrt value) than the ones constantly on MG1655/HT115. This suggests to me that fluctuations in food composition may strongly affect epigenetic inheritance. Please clarify as this is very interesting, if true.

      • Optional - Numerous studies have shown that SKN-1 regulates metabolism in response to food composition and availability (PMID: 23040073). Additionally, some recent studies have indicated a role of SKN-1 in epigenetic inheritance triggered by exogenous RNAi. In particular, SKN-1 promotes stress-induced epigenetic resetting (PMID: 33729152). I wonder if SKN-1 modulates Mrt based on bacterial diet.

      Minor comments:

      • Line 47: typo "...they defined..."

      • Line 100-101: weird sentence structure. Please consider rephrasing.

      • Line 138-139: I don't quite understand what "intermediate-frequency chromosome III alleles" means here. Some SNPs were found in Ch III 4-6Mb? Please expand.

      • Line 213 - it was unclear to me why the assay was performed at 23C instead of 25C. I later learned in the method section that microsporidia cannot be cultured at 25C. I think it will be helpful to add that information when microsporidia is introduced to improve clarity.

      Significance

      This study beautifully demonstrates how diet composition affects epigenetic inheritance. This study is rigorous (replicated by 3 different labs) and the data is solid. Using natural wild isolates and naturally associated microbes, the authors described how diet composition affects germline mortality and epigenetic inheritance. Interestingly, the authors showed that Mrt phenotype might be the result of standard lab cultivation conditions and it was masked when the worms were fed on naturally associated bacteria and microsporidia. Overall, the findings are very interesting and novel. While mechanistic insights are currently lacking, it is outside the scope of this paper. This paper provides an interesting paradigm to study how genetic and environmental variation influence epigenetic inheritance and evolution. I believe this paper will be of great interest to audiences across many fields of biology, including quantitative biology, evo-devo, ecology, and genetics and epigenetics.

      My field of expertise: C. elegans biology, epigenetic inheritance, genetics.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Frezal, Felix and colleagues study 132 wild isolates of the C. elegans species and demonstrate that the majority of these strains will become sterile within 20 generations if grown at 25oC. This is a very thorough analysis of a multigenerational trait that the authors show is commonly found in wild C. elegans strains. The authors use GWAS to identify a peak on chromosome III that is enriched in strains that become sterile at 25oC. Consistently genetic crosses place this segment of chromosome III from a Mrt wild strain into the N2 background resulted in a strong Mrt phenotype. The authors noticed that bleaching of wild C. elegans strains to remove associated bacteria promoted the Mrt phenotype. Remarkably, the authors show that growth of bleached wild strains on bacteria isolated from the wild strains prior to bleaching is sufficient to suppress the Mrt phenotype. These results were obtained with two wild isolates and with multiple species of wild bacteria, strongly supporting the authors' conclusions. The authors also show that independent types of intracellular bacteria that infect the intestine can partially suppress the Mrt phenotypes. The authors also show partial to strong rescue of temperature-sensitive epigenetic mutants set-2, set-24 and nrde-2 by wild bacteria. Remarkably, the authors demonstrate that growth of the introgressed JU775 strain on a N2 background can be grown on suppressor bacteria for 10 to 20 generations, then bleached and placed on OP50, then there is a multigenerational memory of the suppressor bacteria. This intriguing result is consistent with bacteria having an epigenetic effect on C. elegans Mrt phenotypes, which are themselves in some cases caused by epigenetic defects.

      Comments for the authors:

      1. 'For GWAS, the strains that were fertile after 20 generations were considered non-Mrt.'

      One aspect of Fig 1D that could be clarified are the dots at generation 21. If these represent strains that were always fertile at generation 21, then perhaps give these a different color to indicate that sterility was never observed?

      1. 'The mean Mrt values of strains ranged from sterile at 3 generations to fertile after 20 generations at 25oC, with a skewed distribution toward high values (Figure 1B).'

      Based on Table S2, part of the explanation for this skewed distribution in later generations is that some strains became sterile rapidly for some blocks, whereas the same strain did not become sterile in other blocks. For example, JU1200, JU360, PB303. I suggest providing a second color for Fig. 1D for strains that sometimes displayed sterility and sometimes did not.

      For those that were sometimes fertile and sometimes sterile, I suggest creating a graph in Figure 1 that shows generations at sterility or lack of sterility, color coded by block. This will allow the significance of strains with high generation Mrt values to be better appreciated for readers who do not look at the supplementary table.

      1. The GWAS section could benefit from a simple explanation of the premise of GWAS for non-specialist readers.

      2. One problem might be that the Mrt phenotype is widespread among wild strains. To the authors' credit, they consider results observed in different laboratories as valid, even when the results do not agree. If the Mrt phenotype is influenced by the environment, then some laboratory environments might result in 'false negative' Mrt results that could be ignored in favor of positive results from another lab that appear strong. Might focusing on strains with a set of strong positive results from one lab allow the authors to draw stronger GWAS conclusions?

      3. The authors' perform GWAS based on the variance of the Mrt phenotype data. Would the GWAS data be more illuminating if the authors only considered strains that become sterile fairly rapidly, within 10 generations. The authors might then have a second category that included strains that become sterile from generation 11-20. If the genetic basis for the Mrt phenotypes is the same, then GWAS of strains that become sterile in less than 10 generations might yield similar peaks as GWAS for strains that become sterile between generations 11-20.

      4. 'We did not investigate whether a second locus present in JU775 on the right arm of Chr III might have a lesser effect.'

      5. It might be interesting to test the memory of growth on beneficial bacteria on JU4134, which had a Mrt phenotype that was strongly suppressed by the beneficial bacteria.

      6. The Mrt phenotype of mutants in small RNA inheritance and histone modifying enzymes 'appears however distinct from that of the prg-1/piwi mutant (for which the cause of sterility is debated), especially the latter does not show temperature dependence and is suppressed by starvation.'

      While it is true that the cause of sterility is debated for the prg-1/piwi mutant, this mutant is defective for small RNA silencing and likely has parallels with some defects in histone modifying enzymes. Anecdotal reports suggest that starvation might affect the Mrt phenotype or longevity of histone modifying enzyme mutants. Moreover, the cause of sterility is not clear for small RNA inheritance and histone modifying enzyme mutants. It is fair to say that the distinction between temperature-sensitivity or lack of temperature sensitivity of small RNA mutants is not understood. Could the authors please comment here about whether any of the wild strains display sterility at 20oC.

      1. If intracellular bacteria are simply somatic, then how is it that they are transmitted to progeny. If they are released into the environment and then consumed by hatched larvae, this is soma-to-soma transmission.

      Minor comments:

      1. 'We measured the mortal germline (Mrt) phenotype'. Mortal Germline (Mrt)

      Significance

      All in all, this is an interesting and well-written manuscript that represents a considerable amount of work and demonstrates that a temperature-sensitive multigenerational sterility phenotype is widespread among wild C. elegans strains. This Mrt phenotype is modulated by the food they consume or by intracellular bacterial parasites that reside in somatic intestinal cells. This may mean that the intestine is a major modulator of the Mrt phenotype, which may be a consequence of lab culture conditions and may not occur for wild strains in the wild. Nevertheless, the phenotype or phenotypes are intriguing and likely relevant to natural variation.

      The limitations of this manuscript include a lack of understanding of the precise genes involved or if small RNAs or metabolites from bacteria are involved. But this manuscript represents an enormous effort and raises many interesting points that will be addressed in future efforts.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Major comments:

      1. A control group of mice fed chow diet is needed to distinguish the effects of the genotype from those caused by diet. What is the phenotype of regular chow-fed mice in terms of energy metabolism and thermogenesis?

      We are sincerely grateful to Reviewer 1 for raising an important question regarding the need for a control group of mice fed chow diet.

      To address this concern, we have conducted experiments on mice fed a regular chow diet and measured their phenotype in terms of energy metabolism and thermogenesis. In addition to be sure that the phenotype also is present in when we compared littermates we have included as control both to chow-fed CD4-Cre and littermates (MKK3/6f/f). Our findings reveal that MKK3/6CD4-KO mice fed a chow diet presented an increased brown adipose tissue (BAT) thermogenesis compared with CD4-Cre and littermates. This phenotype is similar to the observed in HFD-fed mice. Also, these results indicate that the same phenotype is observed when we compared with littermates including an extra control in the study.

      To further investigate the effect on energy metabolism, we utilized metabolic cages. The data from these experiments align with the increased thermogenesis observed in MKK3/6CD4-KO mice fed a chow diet, as they also demonstrated increased energy expenditure. We thank the reviewer for this suggestion as we believe that these new data strengthen our conclusion significantly.

      We have thoughtfully incorporated these essential findings into in Supplementary Figure 2C-D of the manuscript.

      1. While an increase in BAT temperature (as demonstrated here by infrared imaging) in line with increased thermogenesis, it will be critical to verify this hypothesis by indirect calorimetry. Energy expenditure, food intake, and activity measures should be added for regular and DIO mice. Please follow the guidelines for ANCOVA analysis and measurements explained in PMID: 22205519 and PMID: 21177944.

      We are grateful to Reviewer 1 for bringing up an essential point concerning the need to verify our hypothesis on increased BAT temperature and thermogenesis through indirect calorimetry. We acknowledge the importance of including energy expenditure, food intake, and activity measures for both regular and DIO mice to strengthen our study.

      To address this valuable suggestion, we have taken immediate action. We utilized metabolic cages in mice under chow diet. The data from these experiments align with the increased thermogenesis observed in MKK3/6CD4-KO mice fed a chow diet, as they also demonstrated increased energy expenditure, without differences in food intake or locomotor activity. We thank the reviewer for this suggestion as we believe that these new data strengthen our conclusion significantly. These new data are now in Supplementary Figure 2A-B.

      In addition, we have initiated a new experimental group of age-matched mice on HFD, which we will carefully feed for 8 weeks. Following this dietary period, we will subject the mice to metabolic cage analysis, allowing us to obtain accurate data on energy expenditure, food intake, and activity levels. These additional measurements will provide a comprehensive understanding of the metabolic changes induced by MKK3/6 deficiency in T cells under different dietary conditions.

      1. That the phenotype is still seen at isothermal housing is interesting but should be backed up by direct assessment of thermogenic capacity (see PMID: 21177944). In the end, it could also be increased heat loss, independently of heat production. If the browning is cause or consequence remains unclear, then.

      Thank you for raising this important point. Indeed, it is essential to corroborate the observed phenotype with direct assessments of thermogenic capacity to gain a comprehensive understanding of the underlying mechanisms. The study mentioned in PMID: 21177944 highlights the significance of evaluating thermogenesis directly to support the findings.

      According to your suggestion, we plan to house the animals at 30 ºC for four weeks and subsequently inject norepinephrine to evaluate thermogenesis capacity while measuring brown adipose tissue (BAT) activation. This approach should provide valuable insights into the thermogenic potential of the animals under isothermal conditions.

      However, we will not be able to conduct the experiment in metabolic cages at 30 ºC due to the constraint that our system does not allow 30 ºC temperature. For this reason, we will measure BAT temperature to analyze this experiment.

      1. Regarding the in vitro data, a thermogenic phenotype should be functionally verified by Seahorse analysis.

      We thank Reviewer 1 for raising an important point concerning the need for functional verification of the thermogenic phenotype observed in our in vitro data using Seahorse analysis.

      In response to this valuable suggestion, we performed Seahorse analysis in differentiated adipocytes treated with or without IL-35 for 48 hours. The results demonstrated a slight increase in basal metabolism and a heightened response to isoproterenol (ISO) stimulation of β3 adrenergic receptors in adipocytes after IL-35 treatment. These findings provide functional evidence supporting the thermogenic phenotype induced by IL-35 in adipocytes.

      We have thoughtfully included this essential data in Figure 2 of this revision plan, allowing reviewers and the scientific community to comprehensively evaluate and validate the functional implications of our findings.

      1. Mechanistically, there is epistasis type of experiment that IL-35 influences Ucp1 levels via ATF2 as the data remain associative in nature.

      Thank you for your valuable comment. We agree that to establish a mechanistic link between IL-35 and Ucp1 levels will improve the strength of the manuscript.

      To delve deeper into the mechanism through which IL-35 influences Ucp1 expression, we focused on the role of ATF2, a transcription factor known to be involved in regulating UCP1 levels (PMID: 11369767 and PMID: 15024092). In our investigation, we treated adipocytes with IL-35 both in the presence and absence of an inhibitor targeting the ATF2 pathway. The results were illuminating as we observed a significant reduction in the expression of Ucp1 when the ATF2 pathway was inhibited.

      These findings indicate that ATF2 is indeed a crucial mediator of the effects of IL-35 on Ucp1 levels. By inhibiting the ATF2 pathway, we demonstrate a direct functional link between IL-35 and the expression of Ucp1, providing mechanistic insights into the regulatory role of IL-35 in thermogenesis. We included new results in Figure 7F.

      1. What are other consequences of injecting IL-35? Is it good or bad? What is the therapeutic potential in DIO mice? Also, in these experiments (Fig. 7) indirect calorimetry as described would be supportive of the claims.

      Regarding the consequences of injecting IL-35, we have already performed experiments to analyze its effect. Our findings indicate that IL-35 increases thermogenesis in BAT (Figure 7), suggesting that it may play a role in promoting energy expenditure, which could be beneficial in combating diet-induced obesity (DIO) in mice. Importantly, we did not observe any negative effects of IL-35 in our experiments.

      Based on these promising results, we are expecting the therapeutic potential of IL-35 in DIO mice. By promoting thermogenesis in BAT, IL-35 may offer a novel approach to manage obesity and related metabolic disorders. However, we acknowledge that further comprehensive studies are needed to fully understand its therapeutic benefits and potential side effects.

      In our future works, we plan to evaluate a targeted delivery system for IL-35. We are currently generating IL-35 loaded metal-organic frameworks (MOFs) labeled with adipose tissue-specific peptides. This innovative strategy aims to enhance the delivery of IL-35 to adipose tissue, potentially maximizing its effects in the relevant areas. Our ongoing work with IL-35 loaded MOFs may offer a promising avenue for targeted delivery.

      Minor comments:

      1. The authors claim that their HFD-fed MKK3/6CD4-KO mice are protected against hyperglycemia, but only fasted/fed blood glucose tests are performed. Lower glucose levels could be explained due to a hyperinsulinemic state in response to growing insulin resistance in the presence of HFD. It would be sensible to perform both glucose and insulin tolerance tests to back up your statement.

      Thank you for your insightful comment. We agree that to support our claim of protection against hyperglycemia in HFD-fed MKK3/6CD4-KO mice, further tests are necessary beyond fasted/fed blood glucose measurements.

      In response to your suggestion, we conducted both glucose tolerance tests (GTT) and insulin tolerance tests (ITT) in HFD-fed MKK3/6CD4-KO mice. We did not observed differences in glucose tolerance and but ITT showed significantly enhanced insulin sensitivity compared to control mice. These findings provide evidence that the protection against hyperglycemia in HFD-fed MKK3/6CD4-KO mice is not solely due to a hyperinsulinemic state, but rather indicates genuine improvements in glucose handling and insulin response.

      We have thoughtfully included these crucial data in the revised version of the manuscript, both in the main text and Supplementary Figure 4. We extend our appreciation to the reviewer for this valuable suggestion, which has enhanced the scientific rigor and completeness of our study.

      1. Please provide the loading control for p38 and S6 blots (Figure 6G).

      Thank you for the comment. The loading control we used for P p38 and P S6 blots in Figure 6G is β-actin. Due to the limited amount of sample available, we can only use β-actin as the loading control. The sample amount obtained is very limited, and we can only provide enough lysate to run a couple of blots from the same sample. Running several western blots with the same sample is almost impossible given the constraint of the sample availability. We apologize for this limitation, but it is necessary to avoid using too many mice for ethical reasons, as the samples come from a large number of mice.

      1. Statistical test from Figure 7B should be a t-test, since it is only comparing 2 variables (PBS vs IL-35), and not a 2-way ANOVA as described in the legend.

      We sincerely thank the reviewer for the comment. It was indeed a mistake in the text. While we have performed a t-test, there was an error in the legend that we have now corrected. We apologize for any confusion this may have caused and appreciate the opportunity to rectify the oversight.

      1. Label correctly the panels in the figures -examples: Fig 3, panels C and D are interchanged; reference in the text to Fig S1G even though the figure only as panels A-F; Fig 7 legend referes to the statistical test of panel E when the figure only has A-D.

      We sincerely apologize for any mistakes in our manuscript that may have caused difficulties while reading the article and potentially led to misleading results. We are grateful to Reviewer #1 for bringing these errors to our attention. Thanks to their diligent review, we have been able to identify and rectify the issues in our manuscript. The necessary corrections have been made, ensuring the accuracy and reliability of our research. We greatly appreciate the reviewer's valuable feedback and contribution to improving the quality of our work.

      1. There are several typos along the text, please revise (example: page 4;line 4 -"tremorgenic")

      We apologize for the presence of any typos in the initial version of the article. We have thoroughly revised the manuscript to correct these errors. Thank you for bringing this to our attention and helping us improve the accuracy and clarity of our work.

      Reviewer #1 (Significance):

      The manuscript is well written, and the research conducted properly, even though a thorough analysis of energy metabolism in mice and cells is missing and the mechanistic claims are based on relatively thin data.

      The immune system and inflammation play important roles for obesity and insulin resistance, yet the roles they play in thermogenic adipocytes remains unclear. This work adds novel aspects to this relationship.

      Reviewer #2 (Evidence, reproducibility and clarity):

      This manuscript by Nikolic et al sought to investigate the role of p38 activation in adipose tissue Treg cells and obesity. They found that the expression of p38a, its upstream kinase MKK6, and downstream substrate ATF2 was upregulated specifically in adipose T cells associated with human obesity. They generated T cell-specific knockout MKK3/6 in mice and found these animals were protected from diet-induced obesity as a result of increased BAT thermogenesis. Mechanistically, loss of p38a activation promoted adipose tissue accumulation of Treg cells, leading to elevated IL-35 availability and UCP1 expression.

      Major comments:

      1. They attributed the obesity protection to energy expenditure; however, food intake and intestinal absorption were never tested. Immune cells particularly Treg cells are important modulates of nutrient uptake.

      We are sincerely grateful to Reviewer #2 for this crucial comment, highlighting the importance of assessing not only energy expenditure but also food intake and intestinal absorption in our study.

      In response to this valuable suggestion, we have initiated an HFD experiment to comprehensively examine food intake and intestinal absorption. For food intake analysis, we are employing metabolic cages, which will allow us to monitor and quantify the amount of food consumed by the mice accurately. Additionally, we plan to follow the methodology outlined in the study by Kraus et al. (PMID: 27110587) to measure lipid content in feces, enabling us to evaluate intestinal absorption.

      By conducting these additional experiments, we aim to gain a deeper understanding of the potential role of Treg cells, known immune modulators of nutrient uptake, in our observed obesity protection phenotype.

      1. At thermoneutrality, BAT is inactive even though UCP1 expression is still present (not activated). MKK3/6 deficiency in T cells still confer protection against obesity at thermoneutrality suggests it regulates other energy balance components in addition to BAT thermogenesis.

      Thanks for the comment. We believe that the effects of IL35 on thermogenesis are likely partly mediated by alternative mechanisms, as we did not observe an increase in UCP1 gene expression in BAT in vivo (Figure 3D of the manuscript), and the increase in thermogenesis is still present even at thermoneutrality where UCP1 is inactive (Figure 4E of the manuscript). This suggests that IL35 might regulate other alternative pathways that control BAT thermogenesis.

      While our current findings provide valuable insights, further experiments may be necessary to fully understand the underlying mechanisms. For instance, conducting experiments with transgenic mice expressing IL35 or using IL35 knockout (KO) mice could shed more light on the specific pathways through which IL35 exerts its effects on thermogenesis and energy balance.

      In conclusion, we hypothesize that IL35's effects on thermogenesis are mediated partly by alternative mechanisms beyond UCP1 activation, and its ability to enhance thermogenesis even at thermoneutrality highlights its potential as a regulator of energy balance. We plan to further investigate the specific mechanisms through which IL35 impacts thermogenesis and energy balance. To achieve this, we will consider conducting experiments with transgenic mice expressing IL35 or using IL35 knockout (KO) mice in follow up studies. This is now discussed in our manuscript.

      1. Loss of adipose Treg cells (such as Pparg KO, Foxp3-DTR) did not lead to obvious obesity phenotypes. Gain-of-function Treg cells (such as adoptive transfer, IL-2/IL-2 Ab) did not results in profound obesity protection as observed in MKK3/6 CD4-KO mice. It suggests that MKK3/6 KO in T cells causes other immune defects (besides Tregs).

      We agree with the referee's assessment that the lack of obvious obesity phenotypes in above mentioned animal models. The results we observed in our MKK3/6CD4-KO mice suggest that p38 signaling pathway in T cells may modulate their function, leading to an upregulation of IL35 expression, which could be a contributing factor to the significant obesity protection observed in MKK3/6CD4-KO mice. We believe that IL35's effects on energy balance and thermogenesis are critical components of the observed protection against obesity in this model.

      Regarding the studies with PPAR KO in Treg cells, it is important to note that they did not specifically focus on the effect of thermogenesis. While they observed a general tendency of increased fat deposition when treated with a PPAR agonist in the Treg deficient PPAR KO mice, these findings were not extensively studied in that particular paper. Thus, additional research is necessary to specifically evaluate thermogenesis in these mice and further understand the role of PPAR in Treg-mediated thermogenic processes.

      We also acknowledge the presence of contradictory results from loss-of-function experiments of Treg cells in mice. The observed metabolic changes may be context-dependent, and the impact of Treg cells on metabolism might vary under different physiological conditions. For instance, in lean conditions where adipose tissue inflammation is low, a decrease in VAT Treg cells might not lead to significant metabolic changes. However, under certain circumstances, such as obesity, VAT Treg cells may play a critical role in regulating metabolism. In this context increasing that population that is reduced during obesity could results in improve metabolic performance.

      In conclusion, our findings suggest that the lack of p38 activation in Treg cells may prevent the dramatic down-regulation and loss of function observed in Treg cells during obesity. This preservation of Treg function could be a significant factor driving the observed protection against obesity in MKK3/6CD4-KO mice.

      While further studies are required to elucidate the precise timing and spatial aspects of the specific functions of adipose-resident Treg cells, it is evident that these cells play a crucial role in maintaining immune and metabolic homeostasis. They achieve this, in part, by regulating adipose inflammation, insulin sensitivity, lipolysis, and thermogenesis. This is now discussed in our manuscript.

      1. The increase in IL-35 seemed to be very moderate, compared to the metabolic phenotypes. It raises the question if IL-35 is responsible for BAT activation and reduced weight gain. It is unclear what systemic and local levels of IL-35 were reached after recombinant IL-35 treatment (Fig. 7B). IL-35 antibody blockade experiment in KO mice is recommended.

      Physiological changes in cytokines can indeed have a significant impact on the metabolic profile due to their continuous and intricate interactions. Even minor alterations in the overall cytokine milieu can result in substantial changes in metabolism (doi.org/10.1073/pnas.1215840110). In fact, it is well-established that in humans, small changes in cytokine profiles between genders, in obesity, and during aging can play a critical role in the development of pathology. These cytokines often operate in a chronic manner, exerting long-term effects on various physiological processes (doi.org/10.1038/s41467-020-14396-9).

      In summary, the dynamic interplay of cytokines in metabolism can lead to significant metabolic changes even with subtle alterations in their levels. While the increase in IL-35 may appear moderate, our findings using recombinant IL35 indicate that IL-35 increases thermogenesis in BAT, suggesting that it may play a role in promoting energy expenditure, which could be beneficial in combating diet-induced obesity (DIO) in mice. Importantly, we did not observe any negative effects of IL-35 in our experiments.

      1. IL-35 induced p-ATF2 is acute and transient (Fig. 7D) and it was able to increase BAT temperature in just 4 h (Fig. 7B). However, Ucp1 transcription and translation generally take much longer time (e.g. 2d in Fig. 7C). IL-35 may increase energy expenditure through UCP1-independent mechanisms.

      Thanks for the comment. As previously mentioned, we believe that the effects of IL35 on thermogenesis are might be mediated by alternative mechanisms, as we did not observe an increase in UCP1 gene expression in BAT, and the increase in thermogenesis is still present even at thermoneutrality where UCP1 is inactive. This suggests that IL35 might regulate other alternative pathways that control BAT thermogenesis.

      While our current findings provide valuable insights, further experiments may be necessary to fully understand the underlying mechanisms. For instance, conducting experiments with transgenic mice expressing IL35 or using IL35 knockout (KO) mice could shed more light on the specific pathways through which IL35 exerts its effects on thermogenesis and energy balance. We plan to further investigate the specific mechanisms through which IL35 impacts thermogenesis and energy balance. To achieve this, we will consider conducting experiments with transgenic mice expressing IL35 or using IL35 knockout (KO) mice in follow up studies. This is now discussed in our manuscript.

      Minor comments:

      1. The gating of Treg cells should exclude CD25- cells. Single positive (CD25+ or Foxp3+) cells are progenitors of Tregs. In addition to number, phenotypic activation of Treg cells should also be determined.

      Thank you for the comment. We have reanalyzed our data by excluding CD25- cells and included now in the figure 5A of the manuscript and new supplementary figure 7 of revised manuscript. We also checked CD69+ and KLRG1+ Treg cells and observed no differences between genotypes. We also included figures in this revision plan (Figure 5 and 6).

      1. ATF is also important for adipogenesis, is the adipogenic differentiation of BAT SVF cells affected by MKK3/6 KO or IL-35 treatment?

      We appreciate the reviewer's observation regarding the importance of ATF in adipogenesis. To investigate this aspect further, we performed in vitro differentiation of adipocytes and treated them with IL-35 in the presence or absence of an inhibitor targeting the upstream activator of ATF.

      The results were compelling, as IL-35 treatment led to an increase in the expression of adipogenic markers, including Pparg, Adipoq, Leptin, and Perilipin. In contrast, inhibiting ATF activation resulted in a reduction of these adipogenic markers. These findings provide strong evidence that ATF plays a significant role in mediating the effects of IL-35 on adipogenesis.

      We have thoughtfully included these essential data in Figure 7G of the manuscript. We extend our gratitude to the reviewer for their keen observation, which has enhanced the scientific depth and completeness of our study.

      1. Metabolic cage experiments are desired to determine whole-body energy balance, including food intake, physical activity, and heat production.

      To address this valuable suggestion, we have taken immediate action. We utilized metabolic cages in mice under chow diet. The data from these experiments align with the increased thermogenesis observed in MKK3/6CD4-KO mice fed a chow diet, as they also demonstrated increased energy expenditure, without differences in food intake or locomotor activity. We thank the reviewer for this suggestion as we believe that these new data strengthen our conclusion significantly. The new data are included in Supplementary figure 2 A-B.

      In addition, we have initiated a new experimental group of age-matched mice on HFD, which we will carefully feed for 8 weeks. Following this dietary period, we will subject the mice to metabolic cage analysis, allowing us to obtain accurate data on energy expenditure, food intake, and activity levels. These additional measurements will provide a comprehensive understanding of the metabolic changes induced by MKK3/6 deficiency in T cells under different dietary conditions.

      1. Total UCP1 expression (both RNA and protein) in the whole BAT from an animal should determined (since BAT is smaller in KO mice).

      Thank you for this comment. Yes, we have measured UCP1 expression in the whole BAT from the animals. It is in the figure 3C and 3D and here. Although in vitro studies indicated that IL35 increase UCP1 in adipocytes we were not able to find an increase of this protein in BAT

      We believe that the effects of IL35 on thermogenesis are likely partly mediated by alternative mechanisms, as we did not observe an increase in UCP1 gene expression in BAT in vivo, and the increase in thermogenesis is still present even at thermoneutrality where UCP1 is inactive (Figure 4E of the manuscript). This suggests that IL35 might regulate other alternative pathways that control BAT thermogenesis.

      1. Fig. 6C, IL-35-expressing Treg cells should be quantified from adipose tissue.

      We appreciate the referee's suggestion to quantify IL-35-expressing Treg cells from adipose tissue in Fig. 6C. While we agree that this would be valuable information, we encountered technical challenges that made it impractical to measure IL-35 directly in Treg cells from the visceral adipose tissue (VAT).

      One of the main technical challenges we encountered is the low number of Treg cells present in the adipose tissue, making it difficult to obtain sufficient cell material for accurate quantification of IL-35. Treg cells are relatively rare compared to other immune cell populations in the adipose tissue, and their extraction and analysis can be technically demanding.

      Reviewer #2 (Significance):

      The manuscript is innovative in define the novel role of p38 activation in the T cell compartment and its metabolic regulation. The involvement of Treg cells in adipose tissue homeostasis has been well documented and Treg cell-derived IL-35 has been demonstrated in immune regulation. The authors provided a relatively thorough description of the altered metabolism in these Mkk3/6 CD4-KO mice; however, the reviewer has doubts if Treg cells and IL-35 are primary mechanisms of the observed protection from obesity. The manuscript would be much stronger if the model were Treg cell-specific KO and/or IL-35 deficiency in Treg cells reverses obesity resistance conferred by MKK3/6 deficiency. It also suspected that BAT thermogenesis is not the major reason, as BAT deficiency or UCP1 KO results in much milder phenotypes in mice, even at thermoneutrality.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Specific comments:

      1. It's important to use proper controls for mouse metabolic studies. The authors stated that CD4-Cre and MKK3/6 CD4-KO mice are all in the C57B/6L background. However, it would appear that these two lines were bred separately. The difference in the genetic background, despite minor, can lead to the observed phenotype, notably weight gain. Since the metabolic phenotypes seem to be driven by the weight difference, it is even more critical to include additional controls to validate the findings. For instance, crossing MKK3/6 f/f with one copy of CD4-Cre with MKK3/6 f/f to generate age-matched MKK3/6 CD4-KO and MKK3/6 f/f controls should be used to repeat major in vivo studies similar to those in Fig. 2-4.

      We thank the reviewer for the comment. Although, every control is important using conditional mice, there are several papers indicating that all the cre expression lines have for their own effects that could be important in metabolism and there are several articles that strongly recommended to use cre+ lines as a control. For that reason, we have used the cre expressing line as a control because we really think is the best one (Jonkers and Berns, 2002). In fact, Jackson laboratory recommend to use cre expressing line as a control to avoid side effects that cre overexpression could have in the tissue of interest (https://biokamikazi.files.wordpress.com/2014/07/cre-lox-imp-notes.pdf).

      However, as this reviewer suggested, we checked that similar results were obtained using littermates as controls and we have now included these data in the manuscript (Supplementary Figure 2D).

      1. The assessment of adipose tissue immune cell population in Fig. 5 was conducted after HFD-induced obesity. As mentioned above, the change in Treg and M2 cell percentage could be due to the body weight difference. The experiment should be repeated (with proper controls) in normal chow and after a few weeks of HFD when Treg numbers start to decline.

      Thank you for the comment. We currently performing short HFD experiment to check Treg and M2 cell population in adipose tissue using the littermates as controls.

      In addition, we checked those cell populations in adipose tissue infiltrates in mice fed chow diet and observed no differences in M2 macrophage population between mice, while the percentage of Treg cells was actually lower in MKK3/6CD4-KO mice ND-fed mice (Fig 12 of revision plan). This result suggests that higher accumulation of Treg cells in mice lacking p38 activation in T cells are specific of obese state and strengthen our hypothesis that DIO protection in MKK3/6CD4-KO mice is due to Treg cell population.

      1. Data related to the mechanistic link in Fig. 6/7 are not robust and require a large amount of additional work to substantiate the claim. First of all, the role of IL-35 in BAT thermogenesis remains unclear. It's somewhat surprising to see a single dose of IL-35 i.v. injection is sufficient to increase BAT temperature in Fig. 7B. Minimally, the authors need to demonstrate that IL-35 treatment (perhaps after a few daily doses) is able to increase browning/beiging of fat cells and improve cold tolerance when placing the mice at 4 degree of several hours (and up to 3 days). Serum FGF21 level should also be measured after/during IL-13 treatment. Secondly, ATF2 knockout or knockdown in brown preadipocytes should be employed to demonstrate that IL-35 induced UCP1 and FGF21 expression is ATF2 dependent. Another key experiment is to use IL-35 deficient Treg model to definitively demonstrate the requirement of Treg IL-35 to maintain thermogenesis. However, this can be done in a follow up study.

      We are grateful for all the insightful comment provided by Reviewer #3. We understand the concern, but we have the limitations in performing several sequential i.v. injections in our animal facility due to ethical permissions. In light of this constraint, we have devised an alternative approach to evaluate the role of IL-35 in adaptive thermogenesis.

      To address this, we conducted a cold tolerance test in both control mice and MKK3/6CD4-KO mice, which express higher levels of IL-35. Our findings revealed that MKK3/6CD4-KO mice exposed to cold conditions were able to preserve their body and brown adipose tissue (BAT) temperature, while the temperature of control CD4-Cre mice gradually dropped during the cold challenge.

      The data from this cold tolerance test support our hypothesis and demonstrate the role of IL-35 in promoting adaptive thermogenesis, leading to enhanced temperature maintenance in MKK3/6CD4-KO mice. These observations have been included in Figure 7B of the manuscript, and detailed results are available in Figure 11 of this revision plan.

      We appreciate the reviewer's valuable input, which has encouraged us to explore alternative experimental approaches to address the research question effectively.

      We agree with the reviewer #3 that using IL-35 deficient Treg model would be great approach to confirm our results, but we think that now with the additional experiments we have performed, we strength our findings that IL-35 has a novel role in controlling adipose tissue thermogenesis.

      Reviewer #3 (Significance):

      Dissipating energy as heat through brown or beige adipocyte-mediated thermogenesis is believed to be an effective way to combat obesity. The current study aims to characterize the p38 signaling pathway in T cells as a potential target to modulate browning or beiging of adipose tissues. This would be of interest to the basic biomedical research community, particularly in the area of immunometabolism. A major limitation is the concern of improper controls for the mouse models, which makes data interpretation difficult. In addition, the mechanistic studies lack in depth analyses to support the conclusion.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Nikolic et al. examine the metabolic outcome of T cell specific deletion of MKK3/6 (MKK3/6 CD4-KO), which are the main activators of p38. Previous studies have demonstrated that MKK3/6 CD4-KO leads to Treg expansion and that Tregs in adipose tissues are associated with improved metabolic homeostasis. In line with these observations, the authors show that MKK3/6 CD4-KO mice gain less weight and have more active brown fat thermogenesis on a HFD both at the room temperature and 30C housing conditions. They also find more Tregs and M2 macrophages in eWAT of MKK3/6 CD4-KO. All of the metabolic parameters are compared to CD4-cre mice as the wild type controls. Mechanistically, the authors suggest that reduced p38 activation by MKK3/6 CD4-KO leads to increased IL-35 production by Tregs, which induces beiging/browning of adipose tissues to promote metabolic health.

      The authors have spent most of the efforts conducting metabolic phenotyping of MKK3/6 CD4-KO mice. One potential issue is whether the non-littermate CD4-cre mice are the proper controls for the comparison. In addition, the mechanistic link of the IL-35-ATF2-UCP1/FGF21 axis has only been superficially addressed.

      Specific comments:

      1. It's important to use proper controls for mouse metabolic studies. The authors stated that CD4-Cre and MKK3/6 CD4-KO mice are all in the C57B/6L background. However, it would appear that these two lines were bred separately. The difference in the genetic background, despite minor, can lead to the observed phenotype, notably weight gain. Since the metabolic phenotypes seem to be driven by the weight difference, it is even more critical to include additional controls to validate the findings. For instance, crossing MKK3/6 f/f with one copy of CD4-Cre with MKK3/6 f/f to generate age-matched MKK3/6 CD4-KO and MKK3/6 f/f controls should be used to repeat major in vivo studies similar to those in Fig. 2-4.
      2. The assessment of adipose tissue immune cell population in Fig. 5 was conducted after HFD-induced obesity. As mentioned above, the change in Treg and M2 cell percentage could be due to the body weight difference. The experiment should be repeated (with proper controls) in normal chow and after a few weeks of HFD when Treg numbers start to decline.
      3. Data related to the mechanistic link in Fig. 6/7 are not robust and require a large amount of additional work to substantiate the claim. First of all, the role of IL-35 in BAT thermogenesis remains unclear. It's somewhat surprising to see a single dose of IL-35 i.v. injection is sufficient to increase BAT temperature in Fig. 7B. Minimally, the authors need to demonstrate that IL-35 treatment (perhaps after a few daily doses) is able to increase browning/beiging of fat cells and improve cold tolerance when placing the mice at 4 degree of several hours (and up to 3 days). Serum FGF21 level should also be measured after/during IL-13 treatment. Secondly, ATF2 knockout or knockdown in brown preadipocytes should be employed to demonstrate that IL-35 induced UCP1 and FGF21 expression is ATF2 dependent. Another key experiment is to use IL-35 deficient Treg model to definitively demonstrate the requirement of Treg IL-35 to maintain thermogenesis. However, this can be done in a follow up study.

      Significance

      Dissipating energy as heat through brown or beige adipocyte-mediated thermogenesis is believed to be an effective way to combat obesity. The current study aims to characterize the p38 signaling pathway in T cells as a potential target to modulate browning or beiging of adipose tissues. This would be of interest to the basic biomedical research community, particularly in the area of immunometabolism. A major limitation is the concern of improper controls for the mouse models, which makes data interpretation difficult. In addition, the mechanistic studies lack in depth analyses to support the conclusion.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript by Nikolic et al sought to investigate the role of p38 activation in adipose tissue Treg cells and obesity. They found that the expression of p38a, its upstream kinase MKK6, and downstream substrate ATF2 was upregulated specifically in adipose T cells associated with human obesity. They generated T cell-specific knockout MKK3/6 in mice and found these animals were protected from diet-induced obesity as a result of increased BAT thermogenesis. Mechanistically, loss of p38a activation promoted adipose tissue accumulation of Treg cells, leading to elevated IL-35 availability and UCP1 expression.

      Major comments:

      1. They attributed the obesity protection to energy expenditure; however, food intake and intestinal absorption were never tested. Immune cells particularly Treg cells are important modulates of nutrient uptake.
      2. At thermoneutrality, BAT is inactive even though UCP1 expression is still present (not activated). MKK3/6 deficiency in T cells still confer protection against obesity at thermoneutrality suggests it regulates other energy balance components in addition to BAT thermogenesis.
      3. Loss of adipose Treg cells (such as Pparg KO, Foxp3-DTR) did not lead to obvious obesity phenotypes. Gain-of-function Treg cells (such as adoptive transfer, IL-2/IL-2 Ab) did not results in profound obesity protection as observed in MKK3/6 CD4-KO mice. It suggests that MKK3/6 KO in T cells causes other immune defects (besides Tregs).
      4. The increase in IL-35 seemed to be very moderate, compared to the metabolic phenotypes. It raises the question if IL-35 is responsible for BAT activation and reduced weight gain. It is unclear what systemic and local levels of IL-35 were reached after recombinant IL-35 treatment (Fig. 7B). IL-35 antibody blockade experiment in KO mice is recommended.
      5. IL-35 induced p-ATF2 is acute and transient (Fig. 7D) and it was able to increase BAT temperature in just 4 h (Fig. 7B). However, Ucp1 transcription and translation generally take much longer time (e.g. 2d in Fig. 7C). IL-35 may increase energy expenditure through UCP1-independent mechanisms.

      Minor comments:

      1. The gating of Treg cells should exclude CD25- cells. Single positive (CD25+ or Foxp3+) cells are progenitors of Tregs. In addition to number, phenotypic activation of Treg cells should also be determined.
      2. ATF is also important for adipogenesis, is the adipogenic differentiation of BAT SVF cells affected by MKK3/6 KO or IL-35 treatment?
      3. Metabolic cage experiments are desired to determine whole-body energy balance, including food intake, physical activity, and heat production.
      4. Total UCP1 expression (both RNA and protein) in the whole BAT from an animal should determined (since BAT is smaller in KO mice).
      5. Fig. 6C, IL-35-expressing Treg cells should be quantified from adipose tissue.

      Referees cross-commenting

      I agree with Reviewer #1. In addition to energy metabolism and mechanistic action of IL-35, more rigor characterization of adipose Treg cells is needed.

      Significance

      The manuscript is innovative in define the novel role of p38 activation in the T cell compartment and its metabolic regulation. The involvement of Treg cells in adipose tissue homeostasis has been well documented and Treg cell-derived IL-35 has been demonstrated in immune regulation. The authors provided a relatively thorough description of the altered metabolism in these Mkk3/6 CD4-KO mice; however, the reviewer has doubts if Treg cells and IL-35 are primary mechanisms of the observed protection from obesity. The manuscript would be much stronger if the model were Treg cell-specific KO and/or IL-35 deficiency in Treg cells reverses obesity resistance conferred by MKK3/6 deficiency. It also suspected that BAT thermogenesis is not the major reason, as BAT deficiency or UCP1 KO results in much milder phenotypes in mice, even at thermoneutrality.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this study, Nikolic et al. show a novel role for p38 signaling in Treg cells, which impacts adipocytes through IL-35. This mechanism seems to be important for adipose tissue browning and metabolic health and could be potentially therapeutically exploited.

      Major comments:

      1. A control group of mice fed chow diet is needed to distinguish the effects of the genotype from those caused by diet. What is the phenotype of regular chow-fed mice in terms of energy metabolism and thermogenesis?
      2. While an increase in BAT temperature (as demonstrated here by infrared imaging) in line with increased thermogenesis, it will be critical to verify this hypothesis by indirect calorimetry. Energy expenditure, food intake, and activity measures should be added for regular and DIO mice. Please follow the guidelines for ANCOVA analysis and measurements explained in PMID: 22205519 and PMID: 21177944.
      3. That the phenotype is still seen at isothermal housing is interesting but should be backed up by direct assessment of thermogenic capacity (see PMID: 21177944). In the end, it could also be increased heat loss, independently of heat production. If the browning is cause or consequence remains unclear, then.
      4. Regarding the in vitro data, a thermogenic phenotype should be functionally verified by Seahorse analysis.
      5. Mechanistically, there is epistasis type of experiment that IL-35 influences Ucp1 levels via ATF2 as the data remain associative in nature.
      6. What are other consequences of injecting IL-35? Is it good or bad? What is the therapeutic potential in DIO mice? Also, in these experiments (Fig. 7) indirect calorimetry as described would be supportive of the claims.

      Minor comments:

      1. The authors claim that their HFD-fed MKK3/6CD4-KO mice are protected against hyperglycemia, but only fasted/fed blood glucose tests are performed. Lower glucose levels could be explained due to a hyperinsulinemic state in response to growing insulin resistance in the presence of HFD. It would be sensible to perform both glucose and insulin tolerance tests to back up your statement.
      2. Please provide the loading control for p38 and S6 blots (Figure 6G).
      3. Statistical test from Figure 7B should be a t-test, since it is only comparing 2 variables (PBS vs IL-35), and not a 2-way ANOVA as described in the legend.
      4. Label correctly the panels in the figures -examples: Fig 3, panels C and D are interchanged; reference in the text to Fig S1G even though the figure only as panels A-F; Fig 7 legend referes to the statistical test of panel E when the figure only has A-D.
      5. There are several typos along the text, please revise (example: page 4;line 4 -"tremorgenic")

      Referees cross-commenting

      I think we three reviewers are pretty much on the same page - mouse energy metabolism explored too little and the mechanistic insight a bit thin considering the relatively strong claims.

      Significance

      The manuscript is well written, and the research conducted properly, even though a thorough analysis of energy metabolism in mice and cells is missing and the mechanistic claims are based on relatively thin data.

      The immune system and inflammation play important roles for obesity and insulin resistance, yet the roles they play in thermogenic adipocytes remains unclear. This work adds novel aspects to this relationship.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments. We were delighted that the reviewers found our manuscript and results “solid”, “important”, “well-written”, “thoughtful”, “critical addition to the literature”, that the “design of (these) experiments is high in quality” and “conclusions are convincing and the experiments are well executed”.

      We were thrilled the reviewers appreciated that “this manuscript provides solutions to technical limitations to observe mRNA in vivo” by approaching such limitations “in a thoughtful study where many of the salient features of MS2 epitope tagging are systematically measured” and “it provides a greatly improved tool to track mRNA by live imaging” that “also alerts of experimental noise that can be found and can be specific for each gene/transcript

      We will address all the concerns raised by the reviewers. Most of the comments concern text edits. In addition, we will add the following to the Results section:

      1. Quantitation of observed phenotypes in Figures 1C-D and 2C-D;

      2. Quantitation of cytoplasmic transcripts in Figure 1G-L.

      Quantitation will be performed as previously done in Tocchini et al., 2021.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Review of: "An adapted MS2-MCP system to visualize endogenous cytoplasmic mRNA with live imaging in Caenorhabditis elegans"<br /> Authors: Cristina Tocchini and Susan Mango

      The MS2-MCP imaging platform is an essential imaging system that enables dynamic quantification of mRNA transcription, abundance, location, and turnover in living biological systems. In the last ten or so years, this approach has been used in extremely successful ways in Drosophila embryos to dissect both the regulatory logic underpinning early transcriptional organization and activation with unprecedented resolution and, furthermore, how active mRNA localization outside of the nucleus impacts pattern formation. The authors correctly point out that full implementation of this tool has been suspiciously lacking in the C. elegans community for some time (aside from a few noted implementations).

      In this manuscript, Tocchini and Mango directly approach this deficit in a thoughtful study where many of the salient features of MS2 epitope tagging are systematically measured. Specifically, the authors use CRISPR genomic engineering to tag two separate dosage-sensitive, developmental genes and study the expression and function of these genes within the context of the MS2/MCP-GFP system. The authors demonstrate that the location of the MS2 epitope insertion within the endogenous 3'UTR is an important design consideration for functional, downstream implementation of the imaging system. In both cases, insertion of the MS2 hairpins near the end of the open reading frame of either gene results in overt and specific developmental phenotypes that phenocopy previously characterized loss of function alleles of each gene. The design of these experiments is high in quality in that they measure both the levels of cytoplasmic abundance of the various epitope-tagged mRNAs as well as the protein expression levels for these transgenes (by monitoring the levels of GFP expression (each MS2-tagged gene encodes a functional GFP-tagged allele). In two clear transgene examples, they demonstrate that the loss of function phenotypes of the proximally-tagged (closest to the ORF) transgenes disrupt mRNA levels and expression and reduce the proper localization of these mRNAs. This may be why previous attempts at implementing this important imaging system have failed.

      The authors then characterize the cellular systems that cause the differential expression of MS2-tagged transgenes. The authors note that previous studies on simpler systems and in C. elegans have suggested that the nonsense-mediated mRNA decay (NMD) pathway limits the expression of mRNAs with exceptionally long 3'UTRs. Tocchini and Mango then use C. elegans NMD mutants to demonstrate that ablation of this natural RNA degradation system corrects the developmental and gene expression defects associated with the reduction of function MS2 insertion alleles. These experiments are complete and compelling as they are validated at all levels (GFP expression (via quantification of GFP expression) and mRNA expression, and mRNA localization levels (via in situ hybridization).

      The authors then make the case that the type and expression levels of the MCP-GFP fusion protein are also essential features that need to be optimized for an effective imaging system. The authors suggest that optimal visualization of endogenous genes requires the surprisingly low-level expression of the MCP-GFP fusion protein. The authors use a novel transgene that differs from the conventional system. Specifically, the Tocchini system employs a 2xMCP ORF fused to 2xmCherry ORFs fusion. This transgene lacks the NLS typically used to localize exported mRNAs in the cytoplasm and also encodes two MCPs that may or may not facilitate dimerization on the MS2 hairpins. They demonstrate that endogenous, epitope-tagged transgenes can be visualized in developing embryos and that tethering this 2xMCP fusion to the reporter transcript does not alter RNA expression levels. While the authors demonstrate that visualization is possible with this system, it is hard to tell if this fusion protein dramatically improves over other available systems without a direct comparison. For instance, measuring the signal-to-noise (S/N) ratio of localized 2xMCP-2xmCherry would be a good addition and support the author's claims. If it were an exceptional system, these calculations should exceed the well-characterized and quantified MCP-GFP system described in Lee et al. 2019 ((Lee et al., 2019). It is just too hard to know if this is a dramatic element that should now be included in future RNA localization experiments.

      Minor critiques:

      1. The authors should provide more details in the experimental description of the MS2-tagged alleles (or in the figure images). It needs to be clarified in the main text how many MS2 hairpins there are, though this can be found in the materials and methods. In addition, it would be nice to know if these were any of the variations of MS2 hairpins that have already been optimized in some other way to increase or decrease structure or RNA metabolism defects in other systems. Specifically, are these hairpins the newest versions, V6 or V7, described in manuscripts from the Singer laboratory (e.g., (Tutucci et al., 2018))? For aficionados of this imaging system, it would be important to qualify each of the potential new features that make the results in this manuscript so clear and important.
      2. For people that are colorblind (or have reduced ability to distinguish some colors from others (like me, a reviewer)), it would be nice to have the MS2 illustrations in Figures 1A and B not have that color within the black, normal UTR. It's picky, but I had to ask someone what color that was.

      References:

      Lee, C., Shin, H., and Kimble, J. (2019). Dynamics of Notch-Dependent Transcriptional Bursting in Its Native Context. Dev Cell 50, 426-435 e424.

      Tutucci, E., Vera, M., Biswas, J., Garcia, J., Parker, R., and Singer, R.H. (2018). An improved MS2 system for accurate reporting of the mRNA life cycle. Nat Methods 15, 81-89.

      Significance

      In summary, this is a well-written and critical addition to the literature that will hopefully increase the implementation of this system in C. elegans research. The systematic approach to getting a new experimental platform up and running certainly has a place in the canon. Aside from the missing elements regarding the putative improvements and/or direct comparisons between different MCP fusion proteins, the manuscript is solid, important, and nearly ready to go.

      It is an advance and will, as noted above, likely serve to help implement this system by other C. elegans reserachers.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The transparency of C. elegans invites us to push the limit of live imaging. In this context, observation of endogenous mRNA using the high affinity between MS2 RNA hairpins (from a bacteriophage) and a protein (MCP) that can be fluorescently labeled. In the time of CRISPR-Cas, the editing of endogenous genes is feasible and authors use it to insert 24 MS sequences (about 650 bp in total) at the UTRs of a couple of genes. Once they found a way to insert the MS2 sequences at the UTRs, although with phenotypic consequences that are solved in mutants defective in Non-mediated decay of transcript, they tune the expression of the MCP using a heat-shock promoter with a leaking expression at 25C and its location in the cytoplasm avoiding nuclear location signal (NLS) of the protein.

      Major comments:

      They present solutions for live imaging endogenous mRNA that would be useful for colleagues interested in this technique but also show experimental noises that would be specific to each gene/transcript/UTR. In the end, the best value of this technique is to observe "real" or physiological levels but to reach this point they need to use a mutant background (NMD mutants), which may alter the "real" scenario. They found a smart way to title the article using "An adapted..." but it would be more realistic/honest to mention in the title that this is happening only in NMD mutant backgrounds. I also have doubt ion the use of the acronym MS2-MCP in the title. What about something like "Visualization of endogenous cytoplasmic mRNA with live imaging in C. elegans embryos requires an inactive Non-mediated decay"?

      In any case, the conclusions are convincing and the experiments are well executed. I do not find the need for any essential experiments if they are clearer through the manuscript (from title and abstract to discussion) that this technique may need to be optimized and (and maybe validated with FISH) for each specific transcript, and developmental stage cell type where NMD and polyadenylation may work differently. Another source of experimental noise may come from the use of mcherry, which is known for forming aggregates in some cells/stages.(would this artefactual aggregation occur in figure 1L?)<br /> The only experiment that I missed, not essential but easy to perform, is a better description of the slight developmental delay of dlg-1 MS2 v2 animals. Size measures? Time until they lay embryos?

      In this sense, although is not the main purpose of the article, they could highlight the fact that this is an additional option to produce hippomorphic alleles of essential genes.

      Regarding methods, I miss information about the CRISPR-Cas efficiency of inserting the MS2 sequences at the UTRs. Sizes are "small" and can facilitate the insertion of dsDNA repair template, but it would be useful to know what efficiency would be expected. It would be good to mention somewhere how frequent are GG PAM sequences at UTRs sequences (probably less common than in other regions). In this sense, the use of minimal PAM Cas9 variants (Vicencio et al, Nat Comm 2022) would be necessary.

      Minor comments:

      In the abstract, line 33, remove epithelial? I do not think this is relevant in this sentence.<br /> In figure 4, panels B and C, add the two different embryonic stages on the left side. Then, it wouldn't be necessary to read the legend to understand the figure.

      Referees cross-commenting

      I find useful and reasonable the comments of my colleagues

      Significance

      I work in C. elegans on diverse topics, with an interest in RNA, and I have used FISH in C. elegans in the past. I find this study useful to expand the C. elegans toolbox in C. elegans. This manuscript provides solutions to technical limitations to observe mRNA in vivo in the cytoplasm, but also alerts of experimental noise that can be found and can be specific for each gene/transcript.

      It is focused on the C. elegans embryo, which is the system of interest for the authors. I miss a bit of discussion at least on the use of this methodology in other stages. One of the interesting aspects of observing mRNA in vivo is the capacity to manipulate the environment. Such capacity is very limited in embryos but feasible in larvae or adults with the use of microfluidics.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors of this manuscript have adapted the MS2-MCP system to visualize endogenous cytoplasmic mRNA with live imaging in Caenorhabditis elegans. They have identified some of the issues that might have prevented the MS2-MCP system's adaptation to C. elegans. Specifically, they have identified that the length of the 3'UTR, which is significantly increased upon the insertion of the MS2 sequences, impacts the mRNAs' stability. They have also shown that removing the nonsense-mediated decay pathway can prevent the destabilization of the MS2 transcripts. Moreover, they have also optimized the MCP expression to avoid nuclear retention of the MS2 transcripts and mislocalization of the mRNAs.

      Major comments:

      • The authors show that the insertion of 24xMS2 in two endogenous genes, spc-1 and dlg-1, causes some phenotypes such as slow growth, lack of coordination (Unc), small body size (Sma), and reduced brood sizes. However, only an image example is provided in Fig. 1 C, D, and quantifying all these phenotypes would be nice. Same in Fig. 2C, D.
      • Similarly, the reduction in mRNA spots from smFISH in Fig. 1 G-L is difficult to visualize by eyes, and proper smFISH quantification will help interpret the results.
      • The authors also claim a reduction in cytoplasmic RNAs and increased signal in nuclear RNAs in Fig. 1J, L. A proper quantification of nuclear and cytoplasmic smFISH will help interpret the results.
      • In Fig. 3D-F, the authors quantified the signal of nuclear smFISH. However, it is unclear to me in what samples or conditions the statistical test is performed. For example, do the three stars in Fig. 3D refer to the significant decrease or increase of NMD strain compared to the WT? What about the stars in Fig. 3E? The authors should indicate what samples they compare in the statistical test.

      Minor comments:

      • on line 195, the authors reference Fig. 3A. However, it should be Fig. 4A.
      • In Fig. 4B, C the authors can add close to the image of the embryos the developmental stages. This will help the reader identify the embryo's developmental stage in the figure's upper and lower parts.
      • The authors can expand a bit the discussion on how their method differs (advantages and disadvantages) from the MASS system by Hu et al., 2023.

      Significance

      This manuscript will help the C. elegans community to adapt and use the MS2-MCP system to visualize endogenous mRNAs by live imaging. Their finding could also be adapted to other animal model systems. At the moment, only one published report has described the usage of the MS2-MCP system in C. elegans (by Hu et al., 2023), which combined the MS2 and Suntag systems. In this way, Hu et al., 2023 could shorten the length of MS2 insertion. I am unsure if this is why they do not observe any impact on endogenous mRNA tagged with MS2. However, they only track one gene, and it is possible that different 3'UTR will react differently to the insertion of MS2 repeats. Another manuscript (Kinney et al., BioRxiv 2023) showed the usage of the MS2-MCP-GFP system to track miRNA transcription. In this case, the insertion of the MS2 repeats in the transgenic lin-4 miRNA precursor rescued lin-4 mutation. In this manuscript, Tocchini and Mango identified possible issues in inserting MS2 repeats in endogenous 3'UTR. They have overcome this potential issue by changing the position of the insert in the 3'UTR and by removing the nonsense-mediated decay pathway to prevent destabilization of the mRNA-MS2 transcripts. One possible limitation is that possible system users need to work in a mutant background for the nonsense-mediated decay pathway, which is not ideal. However, it provides a greatly improved tool to track mRNA by live imaging. Therefore, their improved methodology will certainly contribute to expanding the use of MS2-MCP system in C. elegans.

      I have expertise in C. elegans biology and transcription, but I do not have expertise in the imaging system, and therefore, I cannot fully judge the methodology they have used and the quality of the imaging system.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1:

      In this paper, authors report that radiation, acidic pH, hypoxia, and drugs that interfere with lipid synthesis, all of which affect lipid droplets (LD), also affect the production of small extracellular vesicles (sEVs). In addition, they also report that LD content and sEV secretion are also modulated in CR-CSCs. Authors conclude that sEV formation and secretion is directly linked to LDs, and that their studies may open the way to new clinical perspectives. However, some important issues need to be addressed before the paper can be considered for publication.

      My main concern is that the notion that LDs and sEVs are linked remains vague. Do cells contain more LDs and secrete more sEVs because these two pathways are selectively up-regulated via some mechanisms that controls both pathways in a concerted manner? Or do cells with more LDs and more sEVs also contain more of everything, perhaps as a result of metabolic activity?

      We appreciate the Reviewer's observations. Indeed, this comment represents the main pillar of the entire manuscript. We have attempted to uncover the molecular mechanism behind this novel and intriguing organelle connection. First of all, we have adapted the manuscript emphasizing that the LD – sEV connection might be direct or indirect. Our omic data suggested that some proteins belonging to the RAB family, mainly Rab18, Rab7a and Rab5c, could play a pivotal role in the LDs-sEVs axis. To strengthen those results, we have performed additional experiments by silencing the expression of the three candidate Rabs. Rab5c seems to be a good candidate to modulate the LD-sEV connection. We believe that Rab5c is not the only contributor to the LD-sEV connection but is part of a whole set of different elements that regulate this axis. However, it is quite challenging to rule out other molecular candidates as co-contributors to this phenomenon, especially when considering cellular metabolic pathways.

      We recognize that external stimuli, such as radiation, pH, and lipid-interfering drugs, may exert their effects on other cellular organelles, even though we have strived to analyze each individual phenomenon rigorously. We are confident that our work lays the foundation for further research in the field.

      A direct corollary of this issue is whether increased sEV secretion reflects more endosomes and lysosomes (e.g. LysoTracker-positive compartments) or whether sEV secretion is selectively up-regulated.

      Thanks to the Reviewer’ suggestion, we have analyzed both the lysosome and endosome contents in our experimental cell systems. These data are now included in the manuscript in Figure S8. We have observed that it is unlikely that lysosomes are directly involved in the LD – sEV connection. However, the expression of Rab7a, a regulator of the late endosomal pathway, correlated with the LD content of the cells and their sEV release. Therefore, the endosomal pathway might be a good candidate to contribute to this LD – sEV connection.

      At one point, authors argue that cells that secrete more sEVs also contain more MVBs, but this issue remains elusive. To what extent is the increase in LDs and sEVS correlated in particular with an increase in endosome-lysosomes, and ER-Golgi (LDs originate from the ER)?

      We thank the Reviewer for this comment. We agree that the analyses of sEVs secreted in the media might not reflect the MVB content in the cells. However, two experiments, one on Panc01 cells and another one on MCF7 cells, showed that the number of MVBs, assessed by confocal microscopy using CD63 staining (MCF7) or CD63 and Alix plasmids (PANC-01), was directly correlated with the number of released sEVs in the media (Figure Fig S3C and 4J).

      In addition, we included additional experiments assessing the lysosome content in HT29 LDHigh and LDLowcells. Hereby, we confirmed that HT29 LDHigh cells showed a higher LD content than HT29 LDLow cells. Inversely, by studying the lysotracker area per cell, we showed that HT29 LDLow population has a higher lysosomal content as compared to their counterpart, HT29 LDHigh cells (test = Wilcoxon rank sum test with continuity correction_ W = 85127, p-value = 7.255e-07 for LDs and W = 49321, p-value = 1.14e-11 for Lysotracker). However, we could not demonstrate a clear correlation between the number of LDs in the cell and the lysotracker signal.

      Finally, we have also studied the expression of GM130, a Golgi-shaping protein (Ref. 1) and Rab7, a late-endocytic protein (Fig S8C). While the expression of Rab7 (endosome) seemed to correlate with the LD and sEV contents, the expression of GM130 (Golgi) gave back no coherent results. Indeed, it was inversely correlated to the LD and sEV amount, in accordance with what was already reported elsewhere (Ref 2 and 3)

      • Nakamura N. Emerging new roles of GM130, a cis-Golgi matrix protein, in higher order cell functions. J Pharmacol Sci. (2010) 112:255–64. Doi: 10.1254/jphs.09R03CR
      • Lydia-Ann L.S. Harris, James R. Skinner, Trevor M. Shew, Nada A. Abumrad, Nathan E. Wolins. _Monoacylglycerol disrupts Golgi structure and perilipin 2 association with lipid droplets.___Doi.org/10.1101/2021.07.09.451829
      • Alvin Kamili, Nuruliza Roslan, Sarah Frost, Laurence C. Cantrill, Dongwei Wang, Austin Della-Franca, Robert K. Bright, Guy E. Groblewski, Beate K. Straub, Andrew J. Hoy, Yuyan Chen, Jennifer A. Byrne; TPD52 expression increases neutral lipid storage within cultured cells. J Cell Sci 1 September 2015; 128 (17): 3223–3238. Doi: 10.1242/jcs.167692

      Authors conclude that the data with lipid inhibitors strengthen the connection between LDs and sEVs (Fig 2 and S2). However, is this regulation selective, or does it merely reflect the general effect of these inhibitors on membrane-related processes? The same comment applies to the role of iron metabolism after knockdown of ferritin heavy chain (Fig 3 and S3), acidic pH and X-ray radiation (Fig 4 and S4).

      We thank the Reviewer for the interesting observation. As previously mentioned, we cannot rule out other potential contributors to the LDs-sEVs connection upon lipid inhibitor treatments and/or the others external stimuli applied to our cell systems.

      The data presented in this manuscript merely represent a novel and unexplored (at least so far) organelle connection, direct or indirect, with a broad clinical implication. As the membrane-related processes (such as Endosomes, Golgi apparatus, Exosome (sEV) pathway, Lysosomes and Autophagosome) are all interconnected, in our opinion, it might be quite challenging to make such a definitive statement.

      Such assertion would require extensive further investigation to relate each organelle to the LDs and/or sEVs. However, with our research, we hope to open the door to a new era of investigations regarding the sEV – LDs connection.

      OTHER COMMENTS

      1) Which cell line is used for sEV analysis (markers vs contaminants (Fig S1B)? In any case, the data should be shown for both cell types.

      Our method to isolate sEVs is a standardized method that was already published by our group and collaborators in 2020 (M. Bordas, et al., Optimized Protocol for Isolation of Small Extracellular Vesicles from Human and Murine Lymphoid Tissues. Int J Mol Sci (2020) https:/doi.org/10.3390/ijms21155586.). This protocol was validated on human and mouse tissues, much more complex samples than cell culture supernatant.

      Figure S1C was modified, as requested by the Reviewer, including new data for HT29, Panc01 and MCF7 cell lines to broaden the panel. Those results confirmed the good purity of sEV samples isolated from cell culture supernatant.

      2) The Tsg101 blot is not impressive (Fig S1B): the difference between cells and sEVs is not easy to see. It would be nice if blots were quantified.

      Indeed, the signal obtained for TSG101 for sEVs derived from Panc01 cell line is quite weak. It is important to remember that not all sEV markers are highly expressed in all cell lines and their derived sEVs. Some cell line-derived sEVs show a low or high expression of the diverse sEV markers. To answer the Reviewer #1’s comment, we quantified the expression of TSG101 in Panc01-derived sEVs. The quantification showed that TSG101 is 6.8 times more expressed on Panc01-dervied sEVs as compared to the cell line. However, since the expression is quite low, this quantification should be taken with some caution.

      In light of the Reviewer ‘comment, we have performed the Western Blot analysis on other cell lines (HT29 and MCF7), and we have replaced TSG101 marker with CD9 marker (Figure S1C).

      3) From Fig 1B it cannot be concluded that the size of sEVs ranges from 30 to 200nm: the micrograph only shows a few structures.

      We appreciate the Reviewer's comment and have attempted to provide more clarity. Firstly, we want to highlight that TEM micrographs of sEVs typically show the donut shape, a unique feature of sEVs imaged with TEM, as well as a size range. In Figure 1B micrograph, the sEV size is approximately 100 nm. The size distribution of LoVo and HT29-derived sEVs can be observed from the NTA size measurements in Figure S1B. Indeed, the peak size is 148 nm for LoVo-derived sEVs and 135 nm for HT29, which aligns with the sEV sizes presented in Figure 1B. We have also included multiple micrographs here under. As the number of Supplementary Figures is already large, we have decided to not include those micrographs in the manuscript. The average size of LoVo-derived sEVs, based on TEM micrograph analysis, was 94 ± 41.10 nm, while the average size of HT29-derived sEVs was 76.41 ± 44.22 nm. The size discrepancy between the two methods (NTA versus TEM) can be ascribed to the dehydration step required for TEM, which results in a reduction of the actual sEV size.

      4) HT29 cells contain far more LDs than LoVo cells (Fig 1A). Similarly, sEV proteins (CD63, CD81, CD9, Hsc-70) are more abundant in HT29 sEVs than in LoVo sEVs (Fig 1D). However, the sEV preparation from HT29 cells contains only approx. 50% more total protein than LoVo sEVs (Fig S1D-E). Are sEVs prepared from LoVo cells far more contaminated with cell debris etc.. than sEV fractions from HT29 cells?

      We are confident that our EV isolation method allows us to achieve high yield and excellent purity. It is possible that a lower number of sEVs in samples may lead to increased protein contamination during ultracentrifugation. However, size exclusion chromatography should minimize this protein contamination. It is important to note that the NTA method is significantly more sensitive and accurate than Qubit protein quantification. Consequently, protein concentration and particle concentration should not be directly compared.

      5) LD staining should be shown for the corresponding populations of cells with high/low CD63 (Fig 1E). Cells in culture can be somewhat heterogeneous, but the difference between low and high CD63 is quite extreme (Fig 1E). Is such high heterogeneity also observed with other proteins of the endocytic and biosynthetic pathways? Authors conclude that cells containing high CD63 levels also contain more MVBs (Fig 1E): are all late endosomal proteins (e.g. LAMP1, RAB7) upregulated in cells with high CD63?

      We thank the Reviewer for this comment, and we totally agree with the Reviewer that it would be better to have the LD and CD63 staining on the same images. Unfortunately, the staining for CD63 on LD540-sorted HT29 cells requires a permeabilization step that interferes with the cellular lipid part and could therefore negatively affect the LD imaging by confocal microscopy. To prove that the HT29 LDHigh and HT29 LDLowcontain high and low LD amount respectively, we sorted HT29 cells based on the LD content and, soon after, we observed them at the confocal microscopy. We thus added new images in Figure S1F, corresponding to the LD fluorescence detection. The readers will also appreciate the explanation regarding the inability of observing both LDs and CD63 staining on the same confocal images under the line 165 – 166:

      As the staining for CD63 required a permeabilization step, and therefore lipid digestion, it was not possible to assess both LDs and CD+MVBs on the same micrographs “.

      In addition, we have added confocal images representing HT29 cells sorted based on their LD content and stained with Hoechst and Lysotracker. A quantification of the Lysotracker fluorescence per cell and the correlation with the number of LDs can also be appreciated in Figure S8A-B.

      Finally, we performed Western Blot analysis to examine Rab7a expression under various conditions described in our manuscript (Figure S8C). In general, Rab7 expression corresponded with LD content, indicating that cells with high LD content exhibited higher Rab7 expression, while cells with low LD amount showed lower Rab7 expression, except for Triacsin-C. The Reviewer can now appreciate the quantification in the graphs provided below (not included in the manuscript).

      Regarding the heterogeneity of LDs, CD63+MVBs, or lysotracker among the cell population, we have indeed noticed heterogeneity observable in these three types of staining in HT29, particularly in the HT29 LDHighpopulation.

      6) Inhibitors of lipid synthesis reduce LD formation (Fig 2B), sEV production and CD63 / CD81/ CD9 secretion (Fig2C-D, Fig S2B). Are the cellular levels of these (and other endosomal) proteins also reduced after inhibitor treatment? Does the stimulation of LD formation with oleic acid also stimulate CD63 synthesis and sEV production?

      We thank the Reviewer for this very interesting comment. To answer this question, we have added a supplementary figure (Figure S2A, S2B) showing the cellular expression of CD63 upon LD inhibition or stimulation.

      During the planning of our experiments, we discussed about the possibility of using oleic acid to induce the formation of Lipid Droplets, which was ultimately not done. This is because the use of oleic acid would have more strongly stimulated the triglyceride pathway, as extensively discussed elsewhere (Mejhert N. et al., The lipid droplet knowledge portal: a resource for systematic analyses of lipid droplet biology, Developmental Cell, 2022). Since Lipid Droplets are made by cholesterol esters and triglycerides, we preferred to use other stimuli (hypoxia, radiation), all of them already discussed in literature, to induce both pathways simultaneously, resulting in the Lipid Droplet formation/induction.

      7) It seems that pH and irradiation increase sEV markers far more significantly (Fig 4 B-C and Fig S4A-E) than FTH1 depletion decreases sEV markers (Fig 3 D-E). In fact, authors mention that they cannot exclude a contamination of sEVs with small apoptotic / autophagic vesicles after irradiation (Fig 4). To facilitate comparison, it would be nice to also show the number of secreted particles per cell (like after FTH1 depletion Fig 3D), as well as the distribution of possible contaminants (e.g. Fig S1). Also, authors state that the increase in the number CD63+ MVBs after irradiation is shown, but this is not the case.

      We apologize to the Reviewer because, in fact, one figure was missing (Figure 4). We have rectified this by increasing the quality of Figure 4 and have added representative images for each acquisition of the number of MVBs, either positive for CD63 or Alix, in transfected Panc01 cells X-ray irradiated (8 Gy) or not (0Gy). In addition, a similar experiment was performed in MCF7 cells transduced with shRNA or shFTH1. CD63+ MVBs were assessed in both cell line and the number of CD63+ puncta (MVBs) were quantified by ImageJ. The results, although not significative, illustrated a trend for MCF7 shFTH1 to contain less CD63+ MVBs than MCF7 shRNA. Furthermore, the quantification of sEVs released in the conditioned media was performed in three independent experiments and demonstrated that significantly less particles (sEVs) were released by MCF7 shFTH1 than MCF7 shRNA.

      8) Are the proteomic data (Fig 6) with LDlow and LDhigh cells obtained after cell sorting, as in Fig 1E? Did authors compare the proteome of LoVo and HT29 sEVs? How do the protein profiles (in particular proteins involved in lipid metabolism) obtained under different conditions compare with each other, in particular after irradiation (Fig4N) and knockdown of ferritin heavy chain (Fig 3, Fig S3)? It would also be interesting to compare these data with the data obtained in CR-CSCs culture under hypoxia (Fig S5). Are common proteins involved in sEV production and LD biosynthesis identified in the analysis of these biological processes? Is there a common set of proteins/genes revealed by this analysis, which may potentially control sEV production and LD biosynthesis?

      We thank the reviewer for this interesting comment.

      Proteomic analyses have been performed on the following conditions:

      • Panc01 (0 Gy – 6 Gy – 8 Gy) for sEV samples
      • MCF7 (shFTH1 and MCF7 shRNA)
      • MCF7 (0 Gy and 6 Gy)
      • MCF7 (Normoxia and Hypoxia)
      • H460 (0 Gy and 6 Gy)
      • H460 (Normoxia and Hypoxia)

      RNA sequencing was performed on the following conditions:

      • CR-CSCs (#4, #8, #21)

      Based on all those data, we have analyzed the sEV pathway and how this pathway was modulated in the conditions with high LD content and low LD content. We therefore came up with several proteins, presented in Figure S7. Based on this analysis, we have decided to further investigate the role of RAB18, RAB5c and RAB7a in the connection between LDs and sEVs. Those additional results can be found in Figure 6 and Figure S7A (originally Figure 6). We have found that RAB5c, but not RAB7a or RAB18, seems to be a good candidate to intervene in the LD – sEV connection.

      Minor comments

      1) Some parts of the text are still a bit rough, and should be read and corrected carefully. For example: i) isn't it obvious that a common source of lipids builds up the membrane of sEVS, much like any other membrane (line 90, p.2); ii) what does this sentence mean: "LD have been considered as mere fat storage organelles for a long time, although important evidence could be traced back to the early 1960's". Important evidence for what? iii) why is the acronym AdExo used? iv) (line 138) the text should probably be "sEVs released during 72h were studied" and not "released sEVs were studied ... 72 h after seeding".

      We apologize to the Reviewer if some parts of the paper were a bit rough. We have re-read the entire manuscript and corrected all the parts that needed revision work.

      2) The captions are far too small in most figures and diagrams (for example X and Y axis in Fig 1C-D, text in Fig 1E; Fig S1; Fig 3C proteins in the heatmap).

      We agree with the Reviewer. All images and their captions were properly revised.

      3) The color code for LoVO and HT29 cells is reversed in Fig S1D-E

      The mistake was corrected.

      4) In Fig 1D, I cannot see CD81 in the LoVo blot.

      In the image below, it is possible to see the LoVo blot.

      5) Wording is not adequate in following sentences: "62.7% of proteins related to the exosomal pathway are downregulated in MCF7 shFTH1 cells" (line 233) and a few lines below: ".. the expression of almost all exosomal markers was downregulated in MCF7 shFTH1 cells" (line 239). Does 62.7% represent all proteins?

      We apologize to the reviewer for the mistake. We rephrased this sentence.

      6) In Fig 3E authors compare sEV markers secreted by cells treated with shFTH1 or control shRNA. The Anx5 and CD63 blots are not very convincing (quantification would be helpful).

      We apologize to the Reviewer for this issue. These Western Blot analyses were performed only once, therefore a quantification in the manuscript would not be relevant. However, we report here the results of the quantification. The expression of Annexin V was 1.58 times higher in MCF7 shRNA than MCF7 shFTH1, while the expression of CD63 was 1.34 time higher in MCF shRNA as compared to MCF7 shFTH1.

      7) The micrographs in Fig 4L are too small: gold particles cannot be seen, even in the high magnification views.

      We thank the Reviewer for her/his comment. We have moved the micrograph and the quantification histogram to the Figure S6. Now, it is possible to discriminate easily gold nanoparticles.

      8) The micrographs showing ALIX and CD63 (Fig 4J) in irradiated and unirradiated Panc01 cells should be shown for comparison.

      We followed the Reviewer’ suggestion as it is possible to note in the Figure below.

      Reviewer #2:

      This manuscript describes a relationship between lipid droplet presence in cells and small EV secretion. First, correlations are done between number of lipid droplets and numbers of EVs secreted. Then chemical inhibitors of lipid droplet biosynthesis pathways were shown to reduce small EV secretion. Then various processes known to target lipid droplets, including iron metabolism, irradiation, hypoxia, low pH are used to show concordant effects on lipid droplets and small EV secretion. Proteomic analysis of EVs and cells subjected to some of the treatments are also performed. Overall, it is an interesting line of investigation and the data overall seem solid. Several flaws exist, which can probably be fixed. These include the use of different cell lines for different experiments. It makes it a bit difficult to connect everything together. It could be fixed by adding some extra cell lines to some experiments - for example taking the MCF7 and Panc-01 cells for which proteomics was performed and redoing some of the correlative and causative experiments from Figs 1 and 2.

      We appreciate the Reviewer's insightful observation. Following her/his suggestion, we have conducted additional experiments on MCF7, H460 and PANC-01 cell lines to enhance data consistency and facilitate a smoother transition between different sections of the paper.

      It also would be good to have some more direct evidence of the connection between lipid droplets and EV secretion - one could argue that this was already done in Fig 2 with the chemical inhibitors, I wonder if there is a genetic way to do it too?

      We totally agree with the Reviewer. Indeed, starting from our proteomic data we highlighted some genes belonging to the RAB family as potential candidates to interfere with the LD – sEV connection. The Reviewer can now appreciate in Figure 6 and Figure S7, the results from the additional experiments we carried out on RAB5c, RAB7a and RAB18 silencing in HT29 cells. The former Figure 6 has been moved in the Supplementary part (Figure S7).

      Some tightening up of the writing (especially the Discussion) and the resolution of the figures would also improve the manuscript.

      We apologize to the Reviewer for this issue. We have now re-prepared all Figures by increasing their resolution, as well as reviewing the entire manuscript with the aim of making the reading smoother and simpler.

      Overall, it is a nice piece of work but there are many minor things to be fixed.<br /> <br /> Specific Comments:

      The sentence in the Introduction: "The non-endosomal pathway generates sEVs devoid of<br /> CD63, CD81 and CD9 or sEVs enriched in ECM and serum-derived factors (7)." is not well-supported and should be removed. The idea that you can classify membrane of origin based on markers has not been proven, but rather assumed.

      We agree with the Reviewer. We have rephrased the sentence.

      We thank the Reviewer for this comment. In response to this, we have generated correlation graphs for several of our experiments:

      • HT29 (CTL – Triacsin-C - PF-06424439) in Figure 2E
      • PANC-01 (CTL – 2 – 4 – 6 – 8 Gy) in Figure 4K
      • CR-CSCs (#4, #8, #21) in Figure 5E

      The Method used for EV purification should be stated in the Results rather than referring to a reference and a Supplemental Figure (S1A) that is too low of a resolution to see.

      Our method to isolate sEVs is a standardized methods that was already published by our group and collaborators in 2020 (M. Bordas, et al., Optimized Protocol for Isolation of Small Extracellular Vesicles from Human and Murine Lymphoid Tissues. Int J Mol Sci (2020) https:/doi.org/10.3390/ijms21155586.). This protocol was validated on human and mouse tissues, much more complex samples than cell culture supernatant.

      In regard to the Reviewer’s comment, we have added a better description of the protocol in the Results part, referring to the Material and Method. For this reason, we decided to keep the sEV protocol in the SI section. We apologize for the low quality of the Figure S1. In agreement with the Reviewer suggestion, we have modified the image by increasing its quality.

      Fig 1B would be better to have an image in which the EVs are not aggregated.

      We thank the Reviewer for this comment and have modified the Figure accordingly.

      Fig 3 is interesting but jumps cell lines. For better continuity, some of the experiments from Figs 1 and 2 should be repeated in the MCF7 cells to connect with the proteomics.

      In agreement with the Reviewer’ comment, we decided to perform additional experiment on MCF7, using Triacsin-C. The Reviewer can now appreciate the results in Figure 2F, Figure 2G and Figure S2E.

      Fig 3C is too low resolution to read, please export at higher resolution.

      We are sorry for the low-quality Figure. We have modified the image accordingly.

      Please provide all the raw proteomics data as a supplementary spreadsheet.

      We have provided all the raw data regarding our proteomic analyses.

      Fig 4 panels are low resolution

      We apologize for the low-resolution Figure. We have modified the figure by increasing the quality.

      Fig 4 again adds new cell lines with H460 and Panc-01

      We thank the reviewer for this comment. In this regard, we have performed additional experiment:

      • Western Blot: comparison cellular and exosomal markers (Figure S1C)
      • MCF7 (CTL - Triacsin) (Figure 2F, Figure 2G and Figure S2E)
      • Western Blot: analysis of RAB7a, GM130

      The images corresponding to 4J should be shown in a Supp Figure somewhere

      We thank the reviewer for pointing out this oversight. We have added the confocal images corresponding to the Figure 4J below the quantification.

      The statements: "In addition, the exosomal nature of Panc01-derived vesicles was demonstrated by an analysis of CD63+ or Alix+ multivesicular bodies (MVBs) in unirradiated (0 Gy) or irradiated (8 Gy) pancreatic cancer cells (Fig 4J). Moreover, we confirmed a clear correlation between cellular LD content and sEV biogenesis, as represented in Fig 4K." are overly conclusive. For 4J, one can make a statement about the MVBs but not the EVs as that's not what was measured there. Likewise for 4K, what was measured was how many EVs were released not how many were formed. While the data are suggestive of alteration of exosome biogenesis, they are not conclusive.

      We agree with the reviewer and have performed the necessary changes in the manuscript. The reviewer can see the changes under the lines 282 – 284:

      “In addition, the analysis of CD63+ or Alix+ multivesicular bodies (MVBs) in unirradiated (0 Gy) or irradiated (8 Gy) pancreatic cancer cells revealed an increased number of MVBs after irradiation (Figure 4J).”

      Western blot is always capitalized by convention - Western not western.

      We have corrected it accordingly.

      Fig 5A is too small and low resolution - suggest eliminating and just put info in methods.

      We are sorry for the low-resolution image. We have followed the Reviewer suggestion. The graphical method has been now moved to the Supplementary Figure S6.

      Fig 5G, many of the genes shown are frequently EV cargoes but most not involved in exosome biogenesis - not sure where the label of Exosome pathway came from but it is not very compelling. Only ANXA2, Arf6, and Rab5C seem related and they are barely elevated.

      We completely agree with the Reviewer's comment. As a result, we have revised the heatmap title to "Exosomal Cargoes and Pathways" instead of "Exosomal Pathway".

      Most main figures and all supplementary figures are extremely low res - please fix.

      We are very sorry for the low-quality figures. We have revised all Figures (main text and SI) by increasing their quality.

      Fig 6 is first mentioned in the Discussion - it should be described in the Results before that (or alternatively removed).

      We agree with the Reviewer. Our initial idea was to mention perspectives of analyses that could be carried ulteriorly. Nevertheless, we have performed additional experiments in order to get insight on the mechanism involved in the LD – sEV connection. Indeed, based on our proteomic data, we have analyzed the sEV pathway and how this pathway was modulated in the conditions with high LD content and low LD content. We therefore came up with several proteins, presented in Figure S7A (originally Figure 6). Based on this analysis, we have decided to further investigate the role of RAB18, RAB5c and RAB7a in the connection between LDs and sEVs. Those additional results can be found in Figure 6 and Figure S7 in the Results section. We have found that RAB5c, but not RAB7a or RAB18, seems to be a good candidate to intervene in the LD – sEV connection.

      Table S1, also first mentioned in the Discussion, is missing. Either describe in the Results section or remove the callout to it.

      Our apologies for that. The Table S1 has been now mentioned in the Results section and has been properly uploaded.

      The discussion is too dense with too many trains of thought, often many different directions in the same paragraph. It needs to be streamlined, with a central thought for each paragraph and good transitions between the paragraphs.

      We apologize to the Reviewer if the Discussion part was a bit confusing. We rewrote the paragraph, streamlining it and making the transitions between its paragraphs smoother.

      Reviewer #2 (Significance (Required)):

      <br /> Strengths of this manuscript are the interesting connection between lipid droplets and exosomes and the number of experiments to address it.

      <br /> Limitations: use of different cell lines for different figures, overall descriptive nature with regard to direct demonstration of connection to lipid droplets -- it's kind of done in Fig 2, but could be possibly bolstered.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes a relationship between lipid droplet presence in cells and small EV secretion. First, correlations are done between number of lipid droplets and numbers of EVs secreted. Then chemical inhibitors of lipid droplet biosynthesis pathways were shown to reduce small EV secretion. Then various processes known to target lipid droplets, including iron metabolism, irradiation, hypoxia, low pH are used to show concordant effects on lipid droplets and small EV secretion. Proteomic analysis of EVs and cells subjected to some of the treatments are also performed. Overall, it is an interesting line of investigation and the data overall seem solid. Several flaws exist, which can probably be fixed. These include the use of different cell lines for different experiments. It makes it a bit difficult to connect everything together. It could be fixed by adding some extra cell lines to some experiments - for example taking the MCF7 and Panc-01 cells for which proteomics was performed and redoing some of the correlative and causative experiments from Figs 1 and 2. It also would be good to have some more direct evidence of the connection between lipid droplets and EV secretion - one could argue that this was already done in Fig 2 with the chemical inhibitors, I wonder if there is a genetic way to do it too? Some tightening up of the writing (especially the Discussion) and the resolution of the figures would also improve the manuscript. Overall, it is a nice piece of work but there are many minor things to be fixed.

      Specific Comments:

      The sentence in the Introduction: "The non-endosomal pathway generates sEVs devoid of<br /> CD63, CD81 and CD9 or sEVs enriched in ECM and serum-derived factors (7)." is not well-supported and should be removed. The idea that you can classify membrane of origin based on markers has not been proven, but rather assumed.

      Fig 1A and B - to better support the idea of a correlation between LD formation and EV release, more than two cell lines should be used and a linear correlation plot with R2 value shown. Likewise, it would be very interesting to see whether there is really a correlation between LD content and CD63-endosome positivity in a similar manner, given the results in Fig 1E. Also, it would be good to see LD and CD63 in the same cells for Fig 1E from the sorted populations.

      The Method used for EV purification should be stated in the Results rather than referring to a reference and a Supplemental Figure (S1A) that is too low of a resolution to see.

      Fig 1B would be better to have an image in which the EVs are not aggregated.

      Fig 3 is interesting but jumps cell lines. For better continuity, some of the experiments from Figs 1 and 2 should be repeated in the MCF7 cells to connect with the proteomics.

      Fig 3C is too low resolution to read, please export at higher resolution.

      Please provide all the raw proteomics data as a supplementary spreadsheet

      Fig 4 panels are low resolution

      Fig 4 again adds new cell lines with H460 and Panc-01

      The images corresponding to 4J should be shown in a Supp Figure somewhere

      The statements: "In addition, the exosomal nature of Panc01-derived vesicles was demonstrated by an analysis of CD63+ or Alix+ multivesicular bodies (MVBs) in unirradiated (0 Gy) or irradiated (8 Gy) pancreatic cancer cells (Fig 4J). Moreover, we confirmed a clear correlation between cellular LD content and sEV biogenesis, as represented in Fig 4K." are overly conclusive. For 4J, one can make a statement about the MVBs but not the EVs as that's not what was measured there. Likewise for 4K, what was measured was how many EVs were released not how many were formed. While the data are suggestive of alteration of exosome biogenesis, they are not conclusive.

      Western blot is always capitalized by convention - Western not western.

      Fig 5A is too small and low resolution - suggest eliminating and just put info in methods.

      Fig 5G, many of the genes shown are frequently EV cargoes but most not involved in exosome biogenesis - not sure where the label of Exosome pathway came from but it is not very compelling. Only ANXA2, Arf6, and Rab5C seem related and they are barely elevated.

      Most main figures and all supplementary figures are extremely low res - please fix.

      Fig 6 is first mentioned in the Discussion - it should be described in the Results before that (or alternatively removed).

      Table S1, also first mentioned in the Discussion, is missing. Either describe in the Results section or remove the callout to it.

      The discussion is too dense with too many trains of thought, often many different directions in the same paragraph. It needs to be streamlined, with a central thought for each paragraph and good transitions between the paragraphs.

      Significance

      Strengths of this manuscript are the interesting connection between lipid droplets and exosomes and the number of experiments to address it.

      Limitations: use of different cell lines for different figures, overall descriptive nature with regard to direct demonstration of connection to lipid droplets -- it's kind of done in Fig 2, but could be possibly bolstered.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper, authors report that radiation, acidic pH, hypoxia, and drugs that interfere with lipid synthesis, all of which affect lipid droplets (LD), also affect the production of small extracellular vesicles (sEVs). In addition, they also report that LD content and sEV secretion are also modulated in CR-CSCs. Authors conclude that sEV formation and secretion is directly linked to LDs, and that their studies may open the way to new clinical perspectives. However, some important issues need to be addressed before the paper can be considered for publication.

      My main concern is that the notion that LDs and sEVs are linked remains vague. Do cells contain more LDs and secrete more sEVs because these two pathways are selectively up-regulated via some mechanism that controls both pathways in a concerted manner? Or do cells with more LDs and more sEVs also contain more of everything, perhaps as a result of metabolic activity? A direct corollary of this issue is whether increased sEV secretion reflects more endosomes and lysosomes (e.g. LysoTracker-positive compartments) or whether sEV secretion is selectively up-regulated. At one point, authors argue that cells that secrete more sEVs also contain more MVBs, but this issue remains elusive. To what extent is the increase in LDs and sEVS correlated in particular with an increase in endosome-lysosomes, and ER-Golgi (LDs originate from the ER)? Authors conclude that the data with lipid inhibitors strengthen the connection between LDs and sEVs (Fig 2 and S2). However, is this regulation selective, or does it merely reflect the general effect of these inhibitors on membrane-related processes? The same comment applies to the role of iron metabolism after knockdown of ferritin heavy chain (Fig 3 and S3), acidic pH and X-ray radiation (Fig 4 and S4)

      Other comments

      1. Which cell line is used for sEV analysis (markers vs contaminants (Fig S1B)? In any case, the data should be shown for both cell types.
      2. The Tsg101 blot is not impressive (Fig S1B): the difference between cells and sEVs is not easy to see. It would be nice if blots were quantified.
      3. From Fig 1B it cannot be concluded that the size of sEVs ranges from 30 to 200nm: the micrograph only shows a few structures.
      4. HT29 cells contain far more LDs than LoVo cells (Fig 1A). Similarly, sEV proteins (CD63, CD81, CD9, Hsc-70) are more abundant in HT29 sEVS than in LoVo sEVs (Fig 1D). However, the sEV preparation from HT29 cells contains only approx. 50% more total protein than LoVo sEVs (Fig S1D-E). Are sEVs prepared from LoVo cells far more contaminated with cell debris etc.. than sEV fractions from HT29 cells?
      5. LD staining should be shown for the corresponding populations of cells with high/low CD63 (Fig 1E). Cells in culture can be somewhat heterogeneous, but the difference between low and high CD63 is quite extreme (Fig 1E). Is such high heterogeneity also observed with other proteins of the endocytic and biosynthetic pathways? Authors conclude that cells containing high CD63 levels also contain more MVBs (Fig 1E): are all late endosomal proteins (e.g. LAMP1, RAB7) upregulated in cells with high CD63?
      6. Inhibitors of lipid synthesis reduce LD formation (Fig 2B), sEV production and CD63 / CD81/ CD9 secretion (Fig2C-D, Fig S2B). Are the cellular levels of these (and other endosomal) proteins also reduced after inhibitor treatment? Does the stimulation of LD formation with oleic acid also stimulate CD63 synthesis and sEV production?
      7. It seems that pH and irradiation increase sEV markers far more significantly (Fig 4 B-C and Fig S4A-E) than FTH1 depletion decreases sEV markers (Fig 3 D-E). In fact, authors mention that they cannot exclude a contamination of sEVs with small apoptotic / autophagic vesicles after irradiation (Fig 4). To facilitate comparison, it would be nice to also show the number of secreted particles per cell (like after FTH1 depletion Fig 3D), as well as the distribution of possible contaminants (e.g. Fig S1). Also, authors state that the increase in the number CD63+ MVBs after irradiation is shown, but this is not the case.
      8. Are the proteomic data (Fig 6) with LDlow and LDhigh cells obtained after cell sorting, as in Fig 1E? Did authors compare the proteome of LoVo and HT29 sEVs? How do the protein profiles (in particular proteins involved in lipid metabolism) obtained under different conditions compare with each other, in particular after irradiation (Fig4N) and knockdown of ferritin heavy chain (Fig 3, Fig S3)? It would also be interesting to compare these data with the data obtained in CR-CSCs culture under hypoxia (Fig S5). Are common proteins involved in sEV production and LD biosynthesis identified in the analysis of these biological processes? Is there a common set of proteins/genes revealed by this analysis, which may potentially control sEV production and LD biosynthesis?

      Minor comments

      1. Some parts of the text are still a bit rough, and should be read and corrected carefully. For example: i) isn't it obvious that a common source of lipids builds up the membrane of sEVS, much like any other membrane (line 90, p.2); ii) what does this sentence mean: "LD have been considered as mere fat storage organelles for a long time, although important evidence could be traced back to the early 1960's". Important evidence for what? iii) why is the acronym AdExo used? iv) (line 138) the text should probably be "sEVs released during 72h were studied" and not "released sEVs were studied ... 72 h after seeding".
      2. The captions are far too small in most figures and diagrams (for example x and Y axis in Fig 1C-D, text in Fig 1E; Fig S1; Fig 3C proteins in the heatmap).
      3. The color code for LoVO and HT29 cells is reversed in Fig S1D-E
      4. In Fig 1D, I cannot see CD81 in the LoVo blot.
      5. Wording is not adequate in following sentences: "62.7% of proteins related to the exosomal pathway are downregulated in MCF7 shFTH1 cells" (line 233) and a few lines below: ".. the expression of almost all exosomal markers was downregulated in MCF7 shFTH1 cells" (line 239). Does 62.7% represent all proteins?
      6. In Fig 3E authors compare sEV markers secreted by cells treated with shFTH1 or control shRNA. The Anx5 and CD63 blots are not very convincing (quantification would be helpful).
      7. The micrographs in Fig 4L are too small: gold particles cannot be seen, even in the high magnification views.
      8. The micrographs showing ALIX and CD63 (Fig 4J) in irradiated and unirradiated Panc01 cells should be shown for comparison.

      Significance

      The topic of the paper is clearly interesting, since the mechanisms that regulate sEV formation and secretion are not fully understood and since the notion that their fate is linked to LDs is potentially exciting.

      My expertise: subcellular organization, endocytosis, membrane traffic, organelle biogenesis

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments. Here we provide a point-by-point response to their reviews. All additional experiments that are present in the revised manuscript, or that we plan to include in the final manuscript, are numbered.

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The concept introduced by this paper is exciting and novel. However, the current paucity of presented data can lead to incorrect interpretations of the findings and speculations that might not hold true after a more rigorous assessment of the observed phenomenon.

      The premise of this study builds upon an interaction between the PAXT complex and nuclear YTH domain containing proteins. However, figures 1B and C should be improved. The interacting band for the ZFC3H1 presented in panel B does not seem to match the size of the construct used in panel C. Is the Flag version of ZFC3H1 expressing a smaller isoform for this protein? __

      The reviewer is correct in that endogenous ZFC3H1 (which migrates at 250kD with a minor band at 150kD, see Figure 1B in the initial manuscript) appears to differ from the FLAG-tagged construct as expressed from a plasmid transfected in HEK293 cells (which migrates as two bands at 180kD and 200kD, see Figure 1C in the initial manuscript). For the endogenous protein, the predicted molecular weight is 226kD and the 250kD band disappears when cells are transduced with lentivirus containing shRNAs against ZFC3H1 (see Figure 4A in the initial manuscript), indicating that it is the correct protein. Both the 250kD endogenous protein (*) and the 200kD overexpressed protein (**) in transfected HEK293 and U2OS cells are detected in immunoblots using anti-ZFC3H1 antibodies (see Figure 1 in this document) indicating that the over-expressed protein is indeed ZFC3H1.

      [ Figure 1]

      _Figure 1. Molecular Weight Size Comparison of Endogenous ZFC3H1 and FLAG-ZFC3H1 (1-1233). _Lysates from HEK293 and U2OS that were either untransfected or transfected with FLAG-ZFC3H1 (1-1233) plasmid. We labelled the bands corresponding to the endogenous ZFC3H1 “*” and FLAG-ZFC3H1 (1-1233) “**”.

      We have sequenced the plasmid, and discovered that it contains an additional sequence inserted within the middle of the ZFC3H1 cDNA with a premature stop codon. As such, the version of the protein that is expressed from the plasmid only contains amino acids 1-1233 of the endogenous protein and is missing amino acids 1234-1989. The deleted region only contains TPR repeats, and is not known to interact with any of the well characterized interactors of ZFC3H1 (Wang, Nuc Acid Res 2021, Figure 3). We have renamed this construct FLAG-ZFC3H1 (1-1233). Given these new considerations, our results are consistent with the idea that the N-terminal portion of ZFC3H1 interacts with U1-70K, YTHDC1 and YTHDC2. We will change the text to reflect this.

      We are currently in the process of deleting the small insertion to obtain a plasmid that encodes a full length version of human ZFC3H1. For the final manuscript:

      Experiment #1) We will repeat the co-immunoprecipitation experiment with the full length FLAG-ZFC3H1 to determine whether it interacts with YTHDC1 and YTHDC2. This will take a few weeks.

      __Also, the YTHDC1-2 interaction in panel C is not as convincing considering the negative controls lane show some degree of binding. __

      Although the reviewer is correct that there is substantial background binding in the YTHDC1 immunoblot, we disagree with their characterization of the results with the YTHDC2 immunoblot (see Figure 1B-C in the initial manuscript). In the new manuscript we have included:

      Experiment #2) A new co-immunoprecipitate of the FLAG-tagged ZFC3H1 (1-1233) from HEK293 cells under more stringent conditions where the background level of YTHDC1 binding to beads is negligible. We have already completed this experiment (see Figure 1D in the revised manuscript).

      __Additionally, can the authors test if their RNaseA treatment worked? __

      In the new manuscript we have included:

      Experiment #3) A new co-immunoprecipitate of FLAG-YTHDC1 immunoblotted for eIF4AIII from HEK293 cell lysates. We find that without RNAse, there is some eIF4AIII in the precipitates but that the levels diminish substantially after RNAse A/T1 treatment. We have already completed this experiment (see Figure 1B in the preliminary revised manuscript).

      __Why do you need 18 hours to observe the nuclear export of your modifiable construct when inhibiting METTL3 in figure 3? Is it possible that your observation is secondary to phenotypes these cells develop as a result of blocking METTL3? __

      We treated cells for this period of time so that during the expression of the reporter, all of the newly synthesized mRNA is expressed in the absence of m6A methyl transferase activity. For shorter treatment times, it is unclear whether the bulk of the reporter mRNA, which would be synthesized before the treatment, would lose any pre-existing m6A marks, making a negative result hard to interpret. Previously we found that although 50% of intronic polyadenylated (IPA) transcripts from our reporters are rapidly degraded, about 50% are stable and are nuclear retained over extended periods of time (see Lee at al., PLOS ONE 2015; https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0122743 Figure 3B-G). We believe that the bulk of the reporter mRNA that we are visualizing is stable and accumulates in the nucleus. Given that METTL3-depletion inhibits nuclear retention and that versions of the IPA reporter that lack m6A modification motifs are exported, we think that the most likely interpretation of the 18 hour STM2457 treatment experiments is that the lack of methyltransferase activity had a direct effect, rather than an indirect effect, on nuclear retention. We would be open to performing more experiments if the editors insist, however we ordered STM2457 four weeks ago and it has yet to arrive from Sigma-Millipore. Performing this experiment may substantially delay our ability to resubmit the manuscript in a timely manner.

      __Is ALKBH5 nuclear and/or cytoplasmic in the cell system used? __

      According to The Human Protein Atlas, ALKBH5 is predominantly nuclear in U2OS cells, with some present in the cytoplasm (https://www.proteinatlas.org/ENSG00000091542-ALKBH5/subcellular#human).

      In the revised manuscript we have included:

      Experiment #4) Data from subcellular fractionation demonstrating that ALKBH5 is present in both the nucleus and cytoplasm that we have already performed (see Figure 4J in the preliminary revised manuscript).

      __Reviewer #1 (Significance (Required)):

      The study is highly significant __

      ------

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: In the manuscript by Lee et al. entitled "N-6-methyladenosine (m6A) Promotes the Nuclear Retention of mRNAs with Intact 5'Splice Site Motifs", the authors provide evidence that m6A modifications within specific regions of transcripts can confer nuclear retention. These results are important because they add to our understanding of how m6A modifications can contribute to post-transcriptional regulation. Although the authors do not quite come out and say this, data seem to be accumulating to suggest that the location of the m6A modifications within a given transcript can dictate the functional consequences of those modifications.__

      We thank the reviewer for pointing this out. We have included a few sentences in the new preliminary revised manuscript pointing out that the location of the m6A modification in IPA transcripts, with respect to intact 5’SS and poly(A) signals, may play a role in promoting nuclear retention.

      __The current work builds on previous findings from these authors identifying factors critical for retention of intronic polyadenylated (IPA) transcripts. The present study identified m6A modification as one of the signals for the retention of such transcripts. The authors use reporters for their analysis and also examine validated endogenous IPA transcripts. The data presented supports the conclusions albeit they show a surprising finding for one of the m6A erasers, ALKBH5. However, there is some controversy over the mechanism by which ALKBH5 functions and whether the m6A mark is truly reversible, so these results may continue to add to this point of view.

      Major Comments: One experiment that might add to the argument would be overexpression of Mettl3 as compared to catalytically inactive Mettl3. The prediction would be that the reporter transcript with intact DRACH sequences would be even more retained in the nucleus in a manner that depends on Mettl3 catalytic activity. For some of the data presented, the reporter is already wholly nuclear so no difference could be detected, but in the U2OS cells shown in Figure 2B, it appears that an increase in nuclear localization might be evident. Such an experiment would add an orthogonal approach to demonstrate that the methylation by Mettl3 is required for retention. If such an experiment would work with the endogenous IPA transcripts shown in Figure 4, but these transcripts may already be too nuclear to detect any increase in nuclear retention.

      __

      We have performed two experiments that try to address this but they gave negative results:

      Experiment #5) We have over-expressed wildtype and a methyl transferase mutant FLAG-METTL3 and assessed the nuclear export/retention of ftz-Δi-5’SS mRNA. There was no effect (see Figure 2 in this document).

      [Figure 2]

      __Figure 2. Over-expression of METTL3 does not increase the nuclear retention of ftz-Δi-5’SS. __U2OS cells were co-transfected with ftz-Δi-5’SS reporter and either FLAG-METTL3 or FLAG-METTL3-D395A, which lacks methyl-transferase activity (Wang, Mol Cell 2016). Cells were fixed, stained for ftz mRNA by fluorescent in situ hybridization and METTL3 using anti-FLAG antibodies. The nuclear and cytoplasmic distribution of ftz mRNA was quantified as described in the manuscript. Note that this is the average of one independent experiment (each bar consisting of the average of at least 50 cells). We plan to repeat this two more times, but we anticipate that these will show the same result.

      We could include this negative data as a supplemental figure. We believe that there are two possible reasons for this experimental result. First, as the reviewer points out, the reporter transcripts are already too nuclear to detect any significant change. Second, METTL3 is part of a larger complex that includes several proteins including METTL14, WTAP and potentially other proteins (for example see Covelo-Molares, Nuc Acid Res 2021). We may need to co-express all of these proteins to see an effect.

      Experiment #6) We have also expressed versions of ftz-Δi and ftz-Δi-5’SS mRNA with optimized m6A modification (i.e. DRACH) motifs (AGACT) to enhance methylation (“e-m6A-ftz”). We only observed a slight increase in nuclear retention but it is not significant (see Figure 2A,C in the revised manuscript).

      Again, this result could be explained by the fact that the reporter is too nuclear to detect any significant increase in retention. We had originally performed this in parallel with the no-m6A-ftz-Δi-5’SS reporters but did not report this negative data in the original manuscript.

      __Some rather minor changes to the presentation of the data could enhance the impact of this study.

      Specific Comments:

      The primary question in this manuscript is comparing reporters with m6A site (intact DRACH sequences) to those without. For this reason, organizing the data to the +/- DRACH sites are adjacent to one another might make the most sense. This point is evident in Figure 1C where perhaps simply changing the order of the bars presented to put the ones directly compared adjacent would be preferable. Then the p-value would compare sets of data directly adjacent to one another. __

      We thank the reviewer for this suggestion and we have made these changes to the figures in the preliminary revised manuscript.

      __While the authors show representative fields/cells for most assays, they do an excellent job of providing quantitation as well. One exception is Figure 3D, which shows a single cell image for the most key panel (the 5'SS-containing reporter upon Mettl3 depletion). If there is not a field with more cells, the authors could create a montage. __

      In the revised manuscript, we have replaced this image with one containing multiple cells expressing the reporter.

      __Minor Comments:

      Figure presentation:

      The text in a number of the figures is VERY small (Figures 1B,1C, and 4A) for example. __

      We have fixed this in the new manuscript.

      __Figure 3A includes the label "shRNA:" at the top, but these cells are treated with Mettl3 inhibitor and there does not appear to be any shRNA employed, so this seems like a labeling error. __

      We have fixed this in the new manuscript.

      __In Figure 3C, the immunoblot of Mettl3, there are three bands that all disappear completely upon knockdown of Mettl3- are these all Mettl3? This should at least be mentioned in the legend and perhaps indicated in the figure. The authors do mention in the text employing shRNAs to target multiple Mettl3 isoforms, so likely this is the case. __

      We have clarified these issues in the new manuscript.

      __Minor points (some really minor to just polish the presentation for clarity):

      The word "since" should only be used if there is a time element- otherwise the word "as" is preferable.

      For example on p. 4, the sentence: "Since inhibition of mRNA export typically enhances the nuclear retention of RNAs with intact 5'SS motifs (Lee et al. 2020),.." would more precisely read "As inhibition of mRNA export typically enhances the nuclear retention of RNAs with intact 5'SS motifs (Lee et al. 2020),..". __

      We thank the reviewer for pointing this out. We have fixed this issue in the revised manuscript.

      __Reviewer #2 (Significance (Required)):

      Summary: In the manuscript by Lee et al. entitled "N-6-methyladenosine (m6A) Promotes the Nuclear Retention of mRNAs with Intact 5'Splice Site Motifs", the authors provide evidence that m6A modifications within specific regions of transcripts can confer nuclear retention. These results are important because they add to our understanding of how m6A modifications can contribute to post-transcriptional regulation. Although the authors do not quite come out and say this, data seem to be accumulating to suggest that the location of the m6A modifications within a given transcript can dictate the functional consequences of those modifications.

      This study would be of significant interest to those that study gene expression in any context as well as cell biologists as the data add to our understanding of export of mRNA from the nucleus. This work also adds to our understanding of the biological consequences of m6A modification, which is an area of significant interest. In my opinion, the authors could make a broader conclusion that we do, which is that the location of the modification significantly dictates function- an extension of previous findings mostly focused on processed mRNA transcripts. __

      -------

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Quality control of mRNA is vital for all types of cells. In eukaryotic cells, nuclear export of misprocessed mRNAs containing the 5' splice site is prevented. In this manuscript, Lee and colleagues demonstrate that the nuclear retention of intronic polyadenylated transcripts is dependent on m6A modification. Based on the results shown in yeast, they perform immunoprecipitation experiments and demonstrate the interaction between ZFC3H1, a component of the PAXT complex, and YTHDC1 and YTHDC 2, nuclear YTH RNA-binding proteins that recognize m6A-modified transcripts. The study also shows the interaction of U1-70K with YTHDC1 and with ZFC3H1. Depletion of YTHDC1/2 prevents the nuclear retention of IPA transcripts. Additionally, CLIP-seq analysis is performed, demonstrating that m6A modification is enriched around the 5' splice site motif and the 3' polyadenylation site in IPAs. From these observations, they conclude that m6A modification contributes to the quality control of mRNA by promoting nuclear retention of misprocessed transcripts.

      Major Points 1. The interaction between ZFC3H1 and YTHDC1 is clearly shown by immunoprecipitation of FLAG-tagged YTHDC1 in Figure 1B. However, the co-purification of YTHDC1 with FLAG-tagged ZFC3H1 in Figure 1C is rather ambiguous. Additionally, the immunoprecipitated samples do not appear to show signals corresponding to FLAG-tagged ZFC3H1, making it unclear if the immunoprecipitation is working. It is essential to provide a better quality result to clarify these observations. __

      Please see our responses to reviewer #1. We have repeated the co-immunoprecipitation of FLAG-ZFC3H1 (1-1233) with YTHDC1 under more stringent conditions and have reduced the background binding (see Figure 1B and D in the new manuscript). We have also determined why the FLAG-ZFC3H1 is smaller than expected as the construct contains a premature stop codon. As explained above, we are in the midst of generating a full-length FLAG-ZFC3H1 and we plan to repeat the co-immunoprecipitation with this new construct.

      2. While the authors demonstrate that the m6A modification is dispensable for the targeting of IPA reporter transcripts to the nuclear speckles, it would be valuable to investigate whether m6A is required for their exit from the nuclear speckles. Do reporter transcripts with m6A motifs remain in the nuclear speckles at later time points?

      We have now analyzed the colocalization of nuclear speckles (SC35) with ftz-Δi-5’SS, which contains both a 5’SS and DRACH motifs, and no-m6A-ftz-Δi-5’SS, which contains a 5’SS but lacks DRACH motifs, at steady state – i.e. after 18-24 hours of transfection (as opposed to at early time points as shown in Figure 2D-E of the initial manuscript). Unexpectedly, we see that both mRNAs continue to colocalize with nuclear speckles, although the no-m6A-ftz-Δi-5’SS mRNA is well exported from the nucleus and its signal in nuclear speckles is faint (see Figure 2F-H in the new manuscript).

      Previously, we observed that ftz-Δi-5’SS required the 5’SS motif to remain in nuclear speckles at these later time points (Lee PLOS ONE 2015 and Lee RNA 2022). Upon closer inspection, ftz-Δi-5’SS mRNA also accumulates in additional nuclear foci that are not SC35-positive. Our new results may indicate that m6A marks promote the transfer of mRNAs from nuclear speckles to other foci, but more data is required to make a firm statement. Given this, we plan to conduct further experiments which may take a month to complete:

      Experiment #7) We are now assessing whether these additional ftz-Δi-5’SS foci correspond to either YTHDC-positive foci which were previously shown to partially overlap nuclear speckles and sequester m6A-rich mRNAs (Cheng Cancer Cell 2022), or “pA+ RNA foci” which accumulate MTR4/ZFC3H1-targetted RNAs when the nuclear exosome is inhibited (Silla Cell Reports 2018). These foci are enriched in ZFC3H1. We plan on co-staining ftz-Δi, ftz-Δi-5’SS, no-m6A-ftz-Δi and no-m6A-ftz-Δi-5’SS with SC35, YTHDC1 and ZFC3H1 to determine whether m6A may help to transfer mRNAs from nuclear speckles to YTHDC1 or ZFC3H1-enriched foci.

      __3. Figures 5B and 5C suggest that ZFC3H1 is required for the degradation of IPA transcripts. However, the range of the vertical axis is inappropriate and it is difficult to assess the extent of the increase in expression levels. Please adjust the vertical axis range for improved clarity. __

      We thank the reviewer for the feedback we have added additional graphs with an expanded vertical axis to demonstrate that ZFC3H1 is required for the degradation IPA transcripts.

      Minor Points 1. page 4, line 2 "RNAse" should be corrected to "RNase".

      We thank the reviewer for catching this error. We have fixed this.

      __ 2. page 7, line 5: Is the statement "prevents the nuclear export and decay of non-functional and misprocessed RNAs" correct? m6A modification promotes the decay of such RNAs. __

      We thank the reviewer for pointing this out. We have altered the text to clarify that m6A promotes decay.

      __3. Figure 2E: ftz-∆i should be ftz-∆i-5'SS. __

      We thank the reviewer for catching this error. We have fixed this.

      __4. Figure 5A: It would be helpful to indicate the number of IPA transcripts analyzed. __

      We have included this information.

      __Reviewer #3 (Significance (Required)):

      Overall, the work is sound and generally well-controlled. This study advances our understanding of the quality control of misprocessed transcripts in higher eukaryotes. This reviewer suggests a few points for clarification or improvement. __

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Quality control of mRNA is vital for all types of cells. In eukaryotic cells, nuclear export of misprocessed mRNAs containing the 5' splice site is prevented. In this manuscript, Lee and colleagues demonstrate that the nuclear retention of intronic polyadenylated transcripts is dependent on m6A modification. Based on the results shown in yeast, they perform immunoprecipitation experiments and demonstrate the interaction between ZFC3H1, a component of the PAXT complex, and YTHDC1 and YTHDC 2, nuclear YTH RNA-binding proteins that recognize m6A-modified transcripts. The study also shows the interaction of U1-70K with YTHDC1 and with ZFC3H1. Depletion of YTHDC1/2 prevents the nuclear retention of IPA transcripts. Additionally, CLIP-seq analysis is performed, demonstrating that m6A modification is enriched around the 5' splice site motif and the 3' polyadenylation site in IPAs. From these observations, they conclude that m6A modification contributes to the quality control of mRNA by promoting nuclear retention of misprocessed transcripts.

      Major Points

      1. The interaction between ZFC3H1 and YTHDC1 is clearly shown by immunoprecipitation of FLAG-tagged YTHDC1 in Figure 1B. However, the co-purification of YTHDC1 with FLAG-tagged ZFC3H1 in Figure 1C is rather ambiguous. Additionally, the immunoprecipitated samples do not appear to show signals corresponding to FLAG-tagged ZFC3H1, making it unclear if the immunoprecipitation is working. It is essential to provide a better quality result to clarify these observations.
      2. While the authors demonstrate that the m6A modification is dispensable for the targeting of IPA reporter transcripts to the nuclear speckles, it would be valuable to investigate whether m6A is required for their exit from the nuclear speckles. Do reporter transcripts with m6A motifs remain in the nuclear speckles at later time points?
      3. Figures 5B and 5C suggest that ZFC3H1 is required for the degradation of IPA transcripts. However, the range of the vertical axis is inappropriate and it is difficult to assess the extent of the increase in expression levels. Please adjust the vertical axis range for improved clarity.

      Minor Points

      1. page 4, line 2 "RNAse" should be corrected to "RNase".
      2. page 7, line 5: Is the statement "prevents the nuclear export and decay of non-functional and misprocessed RNAs" correct? m6A modification promotes the decay of such RNAs.
      3. Figure 2E: ftz-∆i should be ftz-∆i-5'SS.
      4. Figure 5A: It would be helpful to indicate the number of IPA transcripts analyzed.

      Significance

      Overall, the work is sound and generally well-controlled. This study advances our understanding of the quality control of misprocessed transcripts in higher eukaryotes. This reviewer suggests a few points for clarification or improvement.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In the manuscript by Lee et al. entitled "N-6-methyladenosine (m6A) Promotes the Nuclear Retention of mRNAs with Intact 5'Splice Site Motifs", the authors provide evidence that m6A modifications within specific regions of transcripts can confer nuclear retention. These results are important because they add to our understanding of how m6A modifications can contribute to post-transcriptional regulation. Although the authors do not quite come out and say this, data seem to be accumulating to suggest that the location of the m6A modifications within a given transcript can dictate the functional consequences of those modifications.

      The current work builds on previous findings from these authors identifying factors critical for retention of intronic polyadenylated (IPA) transcripts. The present study identified m6A modification as one of the signals for the retention of such transcripts. The authors use reporters for their analysis and also examine validated endogenous IPA transcripts. The data presented supports the conclusions albeit they show a surprising finding for one of the m6A erasers, ALKBH5. However, there is some controversy over the mechanism by which ALKBH5 functions and whether the m6A mark is truly reversible, so these results may continue to add to this point of view.

      Major Comments: One experiment that might add to the argument would be overexpression of Mettl3 as compared to catalytically inactive Mettl3. The prediction would be that the reporter transcript with intact DRACH sequences would be even more retained in the nucleus in a manner that depends on Mettl3 catalytic activity. For some of the data presented, the reporter is already wholly nuclear so no difference could be detected, but in the U2OS cells shown in Figure 2B, it appears that an increase in nuclear localization might be evident. Such an experiment would add an orthogonal approach to demonstrate that the methylation by Mettl3 is required for retention. If such an experiment would work with the endogenous IPA transcripts shown in Figure 4, but these transcripts may already be too nuclear to detect any increase in nuclear retention.

      Some rather minor changes to the presentation of the data could enhance the impact of this study.

      Specific Comments:

      The primary question in this manuscript is comparing reporters with m6A site (intact DRACH sequences) to those without. For this reason, organizing the data to the +/- DRACH sites are adjacent to one another might make the most sense. This point is evident in Figure 1C where perhaps simply changing the order of the bars presented to put the ones directly compared adjacent would be preferable. Then the p-value would compare sets of data directly adjacent to one another.

      While the authors show representative fields/cells for most assays, they do an excellent job of providing quantitation as well. One exception is Figure 3D, which shows a single cell image for the most key panel (the 5'SS-containing reporter upon Mettl3 depletion). If there is not a field with more cells, the authors could create a montage.

      Minor Comments:

      Figure presentation:

      The text in a number of the figures is VERY small (Figures 1B,1C, and 4A) for example.

      Figure 3A includes the label "shRNA:" at the top, but these cells are treated with Mettl3 inhibitor and there does not appear to be any shRNA employed, so this seems like a labeling error.

      In Figure 3C, the immunoblot of Mettl3, there are three bands that all disappear completely upon knockdown of Mettl3- are these all Mettl3? This should at least be mentioned in the legend and perhaps indicated in the figure. The authors do mention in the text employing shRNAs to target multiple Mettl3 isoforms, so likely this is the case.

      Minor points (some really minor to just polish the presentation for clarity):

      The word "since" should only be used if there is a time element- otherwise the word "as" is preferable.

      For example on p. 4, the sentence: "Since inhibition of mRNA export typically enhances the nuclear retention of RNAs with intact 5'SS motifs (Lee et al. 2020),.." would more precisely read "As inhibition of mRNA export typically enhances the nuclear retention of RNAs with intact 5'SS motifs (Lee et al. 2020),..".

      Significance

      Summary: In the manuscript by Lee et al. entitled "N-6-methyladenosine (m6A) Promotes the Nuclear Retention of mRNAs with Intact 5'Splice Site Motifs", the authors provide evidence that m6A modifications within specific regions of transcripts can confer nuclear retention. These results are important because they add to our understanding of how m6A modifications can contribute to post-transcriptional regulation. Although the authors do not quite come out and say this, data seem to be accumulating to suggest that the location of the m6A modifications within a given transcript can dictate the functional consequences of those modifications.

      This study would be of significant interest to those that study gene expression in any context as well as cell biologists as the data add to our understanding of export of mRNA from the nucleus. This work also adds to our understanding of the biological consequences of m6A modification, which is an area of significant interest. In my opinion, the authors could make a broader conclusion that we do, which is that the location of the modification significantly dictates function- an extension of previous findings mostly focused on processed mRNA transcripts.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The concept introduced by this paper is exciting and novel. However, the current paucity of presented data can lead to incorrect interpretations of the findings and speculations that might not hold true after a more rigorous assessment of the observed phenomenon.

      The premise of this study builds upon an interaction between the PAXT complex and nuclear YTH domain containing proteins. However, figures 1B and C should be improved. The interacting band for the ZFC3H1 presented in panel B does not seem to match the size of the construct used in panel C. Is the Flag version of ZFC3H1 expressing a smaller isoform for this protein? Also, the YTHDC1-2 interaction in panel C is not as convincing considering the negative controls lane show some degree of binding. Additionally, can the authors test if their RNaseA treatment worked?

      Why do you need 18 hours to observe the nuclear export of your modifiable construct when inhibiting METTL3 in figure 3? Is it possible that your observation is secondary to phenotypes these cells develop as a result of blocking METTL3?

      Is ALKBH5 nuclear and/or cytoplasmic in the cell system used?

      Significance

      The study is highly significant

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We would like to thank the editorial staff and the reviewers for their handling of our manuscript. We were very pleased with the timely communications from Review Commons, and we are grateful to have been assigned this insightful and constructive group of reviewers.

      The reviewers were well-suited to evaluate our work based on their stated areas of expertise (cancer biology, image analysis, machine learning, cell-based screening, etc.). As such, we received thoughtful and constructive feedback, which we have already incorporated into our attached revision. We are confident that these reviews have improved our manuscript.

      Our goal with this manuscript is to present a proof-of-concept study where high-content imaging and morphological profiling are used to characterize drug resistance in clonal cell lines. The main criticism from reviewers was that our original manuscript may have overstated our method’s ability to discriminate the signal of bortezomib resistance and that any extension beyond cultured cells (to patient samples for example) would require significant follow-up studies. The reviewers suggested that such work would be beyond the scope of our study, and recommended toning down our language to better reflect the limitations of this proof-of-concept work. We have embraced this suggestion, extensively revising our text, and we now believe our language and tone more accurately reflects our results. The reviewers also suggested follow-up computational analyses to more robustly characterize the bortezomib resistance signature. We have performed these analyses and added their description to our revised manuscript. We feel that these analyses have improved understanding of the signature, and will help a reader to gain a deeper understanding of our results and methodology.

      The reviewers also suggested several minor changes; many of which we embraced fully, but others that we chose not to incorporate. We felt that a lack of clarity in our text contributed to these reviewer suggestions. In these cases, we improved clarity in the text and responded to each comment point-by-point in the “prefer not to carry out” section. Further, we address all reviewer comments in the following document point-by-point, grouped by common themes across reviewers (e.g., tone, clarity, analyses, etc.).

      Lastly, a common theme among reviewer comments was their appreciation for our strong methodology and data transparency (examples pasted below). We are extremely gratified by this observation as we feel this is a particular strength of our manuscript. In addition, we were pleased to see reviewers engaged by our work, acknowledging the interest this manuscript is likely to generate among a broad range of scientific disciplines.

      Examples of reviewer appreciation of our strong methodology and data transparency:

      Reviewer 1: “However, this does not imply that the same approach can not achieve the goal, perhaps by using other cell painting markers for bortezomib-sensitivity, or with the same markers to assess sensitivity of different drugs. The cell painting + analysis approaches are not new and the clinical impact is questionable, but the technical aspects (data, analysis) are exceptional and the concept may hold as I described above.”

      Reviewer 2: “The paper is well written, and the text is clear, as is the presentation of data and transparency of methods being utilized. The methods were applied appropriately and followed established standards in the field. The paper's premise is timely and interesting, addressing a pressing issue in cancer therapy: making informed treatment decisions fast, based on markers found in tumors early in tumor development, and using image-based screening for characterizing drug resistance before treatment could be an option. A fascinating bit of the manuscript is the description of the feature selection from the screen is done systematically, considering the technical and biological variability and technical artifacts and modeling covariates using linear models seems a very appropriate way of doing so and could serve as another proof of concept that this is indeed the most robust way of modeling and removing signal of technical covariates from the data.”

      Reviewer 3: “The strengths of this study are the machine learning best practice and detailed methodology. The experiments could be reproduced and statistical analysis is more than adequate. The analysis takes into account batch effects, well position, differences in cell numbers, and other sources of technical variation that complicate high-content image analysis. It is a good exemplar of how unsupervised morphological profiling can be applied to imaging data. The major limitation is the generalizability of this particular method for patient samples. This could be addressed in the Discussion.”

      1. Description of the planned revisions

      We have incorporated all planned revisions.

      1. Description of the revisions that have already been incorporated in the transferred manuscript

      Text revisions already carried out

      1. [Text revision] We have materially toned down our claims in the manuscript in two distinct areas: A) model performance and B) potential clinical application. A) Model performance. We specifically balanced our discussion of the discriminative signal of the Bortezomib Signature. While the signature adequately separated never-before-seen wildtype and resistant clones with metrics well above randomly permuted baselines (accuracy near 80%, average precision about 70%, area under the ROC curve (AUROC) about 84%), there were many limitations that we should have more explicitly highlighted. For example, many individual profiles were incorrectly classified, some clones were predicted entirely incorrectly, and many profiles did not receive Bortezomib Signature scores above the randomly permuted baseline. We have more clearly discussed these limitations and used more balanced language (see key examples of text-based changes below). Additionally, we modified a figure (now Figure 3) to include boxplots of clones that explicitly show the Bortezomib Signature scores of each well profile and permit examination of the strength of the signature for each clone (previously found in Figure 2-Supplement 9). Lastly, we add a new supplementary figure (now Figure 5-Supplement 1) that describes a feature space analysis of misclassified samples. Please note that this figure rearrangement and new analysis helped to balance our claims, but were also performed in response to other tangential reviewer comments. B) Clinical application. In the abstract, introduction, and discussion, we further emphasized that this work is a proof of concept, and that more advances must be made prior to clinical application.

      We made these changes in direct response to the following reviewer comments:

      Reviewer 1 - Major Comment 1 (relevant excerpts)

      While I am convinced that the signature captures morphological phenotypes associated with drug resistance, at the cumulative scale, the discriminative signal of a single cell type seems weak… With Fig. 4, the data fully supports the argument that the bortezomib-signature encodes bortezomib-resistance, but the signal is weak. Thus statements such as "We found the Bortezomib Signature could predict whether a cell line was bortezomib-resistant or bortezomib-sensitive" (line #172) and the specificity statements in the abstract" (line #28) are not supported by the data in my opinion. I would recommend the authors to tune down these and other related statements throughout the manuscript.

      Reviewer cross-commenting - Reviewer 1

      My main critic is regarding "over selling" a weak discriminative signal. Specifically, I am not convinced that the major claims regarding predicting sensitivity and specificity at the single cell types scales are supported by the data. Since reviewer #2 and #3 did not raise this concern I think it is worth discussion here.

      Once these statements are tuned down - I think no significant additional work is needed to make the point that they can measure a discriminative signal. If they want to make these claims, perhaps they'd like to collect more data to gain statistical power (but I am not optimistic this will work at the single cell level).

      Personally, I was happy with the authors' choice of cell lines not included in the training dataset. I am not convinced that additional cell lines + validations are necessary for making the point of a proof of principle.

      Reviewer cross-commenting - Reviewer 2

      I agree that, perhaps, my major criticism of the paper was the manuscript's 'overselling' of claims that were only weakly supported by the data. Yes, if the authors tune down their claims and clearly state that this is an interesting starting point and proof of concept study, it might be ok to publish with only minor revisions. If the claims should be more generalized, then this study needs more data supporting the conclusions and the method's predictive power.

      Reviewer 2 - Major Comment 8

      Lastly, I find some misfits between the question, the model used, and the conclusions drawn. The authors start by exploring the problem of bortezomib resistance in cancer treatment, which they say is a devastating issue for patients with, e.g., multiple myeloma. Yet, the authors use HCT116 as their model cell line, a microsatellite instable, colorectal cell line with several intrinsic mutations that make it a difficult model to address physiologically relevant medical problems after all. The authors then go on to suppose that their method might be suitable to diagnose resistance in patient samples, but I am not convinced this conclusion can be speculated based on data from HCT cells. I suggest the authors test their approach on at least two other cell lines (maybe from different tissues) and benchmark their results against a dataset of digital pathology where such predictions are made from stained and analyzed tissue slices. This way, after a thorough benchmark against related third-party data sets, the method would significantly gain relevance, the paper would appeal to a broader audience, and the advance gains more merit.

      Reviewer 3 - Major Comment 5

      It is not clear from the Discussion whether this type of analysis is more broadly applicable to cell lines derived from patients, rather than selected from a parental cell line, or if this approach would be more efficient than genotyping or next-gen sequencing. How many replicates and ground truth cell lines would be necessary for predictive confidence?

      We edited the last two sentences of the abstract to tone down specificity claims (“provide evidence”) and clarify that we are establishing a “proof-of-concept framework”.

      • This signature predicted bortezomib resistance better than resistance to other drugs targeting the ubiquitin-proteasome system. Our results establish a proof-of-concept framework for the unbiased analysis of drug resistance using high-content microscopy of cancer cells, in the absence of drug treatment.

      We revised the last paragraph of the introduction to contrast bortezomib predictions with ixazomib/CB-5083 predictions, and to remove claims about “using microscopy to guide therapy”.

      • This morphological signature correctly predicted the bortezomib resistance of seven out of ten clones not included in the signature training dataset. Overall, our results establish a proof-of-concept framework for identifying unbiased signatures of drug resistance using high-content microscopy. The ability to identify drug-resistant cells based on morphological features provides a valuable orthogonal method for characterizing resistance in the absence of drug treatment.

      To tone down claims in the figures, we added boxplots to Figure 3 (previous Figure 2) showing specific distribution of signature scores per well profile and updated Figure 4 legend (previous Figure 3).

      • Figure 4. Bortezomib Signature has limited ability to characterize clones resistant to other ubiquitin-proteasome system inhibitors.

      We modify the following text in the discussion to tone down claims of specificity and clinical utility:

      • This Bortezomib Signature correctly predicted the bortezomib resistance of seven out of ten clones not included in the training dataset and was more specific to bortezomib-resistance given its limited ability to identify clones that were resistant to other UPS-targeting drugs.

      Though it is unclear whether this method can be extended to patient samples, where identifying intrinsic drug resistance in cells prior to treatment has the potential to improve targeted cancer therapy, our results are an encouraging proof of concept. We expect that further refinement may develop Cell Painting as a tool for identifying drug-resistant cells, perhaps even guiding strategies to overcome intrinsic resistance.

      1. [Text revision] We defined LD50 in text (originally line #97), changed description of resistant clone selection to remove main text references to LD90 (originally line #87), and stated drug concentrations used for selection in Methods. We also defined LD90 in the Methods and described its role in determining the drug concentrations to use for clone selection. This change was in response to the following comments:

      Reviewer 1 - Minor Comment 2

      What is LD90 (line #87)? LD50 (line #97)?

      Reviewer 2 - Minor Comment 5

      What was the LD 90 per drug on HCT cells? Rather than LD90 foldchanges, absolute concentrations should be used in the results and discussion to allow the reader to vet the conclusions.

      • To determine the appropriate drug concentrations to use in order to isolate drug-resistant clones, we performed proliferation assays on HCT116 parental cells with our drugs of interest: bortezomib (proteasome inhibitor), ixazomib (proteasome inhibitor), or CB-5083 (p97 inhibitor) (Fig. 1-Supplement 1 A-D).
      • We characterized the bortezomib-resistant clones and found that the median lethal doses (LD50s) were ~2.8- to ~9-fold that of HCT116 parental cells (Fig. 1-Supplement 2 B).
      • Briefly, HCT116 cells were plated in 150 mm dishes and grown in the presence of the desired drug at a concentration that resulted in the death of the majority of cells (selection concentrations: bortezomib, 12 nM; ixazomib, 150 nM; CB-5083, 600 and 700 nM).
      • Using the data from our proliferation assays, we calculated the median lethal dose (LD50) for each of our drugs of interest by fitting data of normalized growth vs. log[drug concentration] to a sigmoidal dose-response curve using GraphPad Prism (v.9.2.0) (Fig. 1-Supplement 1 D).

      • [Text revision] We thank the reviewer for allowing us an opportunity to improve clarity on the clones we used. We now describe the total number of clones generated and removed unnecessary references to specific clones for ease of reading (originally lines #96-98) (We maintain all references to specific clones in the figures, legends, supplement, and methods)

      Reviewer 1 - Minor Comment 3

      It was not clear to me in the text which and how many cell lines were evaluated and the reader is forced to go to the SI. For example, "(BZ01-10 and BZ clones A and E)" (line #96-97) and "wild-type clones (WT01-05, 10, and 12-15)" (line #98) appeared when presenting the results without a clear explanation and made it harder for me to follow. Summary of the data (for example, based on Figure 2-Supplement 8) can be briefly mentioned in the text to make it more clear for the reader.

      We added the following to the second paragraph of the results:

      • Together these methods provided a total of twelve bortezomib-resistant, five ixazomib-resistant, five CB-5083-resistant, and twelve bortezomib-sensitive clones as well as HCT116 parental cells for our experiments.

      [Text revision] We removed duplicate text (originally lines #115-125).

      Reviewer 1 - Minor Comment 5

      1. Lines #104-111 were duplicated in lines #114-122.

      Reviewer 3 - Minor Comment 4

      Ten lines of text are duplicated on page 5.

      Reviewer 2 - Minor Comment 4

      on page 5, paragraph 4, there is a sizeable copy-and-paste error of text being identically replicated.

      1. [Text revision] We provided more intuition of the Bortezomib Signature in the results section (originally lines #150-151).

      Reviewer 1 - Minor Comment 6

      The "Bortezomib Signature" is a critical measurement but is only briefly mentioned in lines 150-151 ("..based on the direction-sensitive ranking method for phenotype analysis, singscore (Foroutan et al., 2018)"). Please provide more information/intuition.

      • We used these 45 features to compute a rank-based resistance score or “Bortezomib Signature” for each well profile based on the direction-sensitive method called singscore (Foroutan et al. 2018). Singscore ranks these 45 resistance-related features on a per sample basis and calculates a normalized score between -1 and 1, with higher values expected for bortezomib-resistant clones and lower values expected for bortezomib-sensitive clones.

      • [Text revision] We clarified that DNA sequencing had been performed solely on clones A and E in a previous study (originally lines #88-90). Furthermore, one of the strengths of our approach is that it can identify resistant clones in an unbiased fashion prior to molecular characterization. It is beyond scope to perform these sequencing studies in the present paper.

      Reviewer 2 - Minor Comment 3

      The authors talk about validating the mutation - PSMB5 by RNA-seq. However, the data for the genotyping/sequencing/characterization of these newly generated BZ-resistant lines are missing.<br />

      In the results, we clarify DNA sequencing that was previously performed on clones A and E

      • We also isolated bortezomib-sensitive (wild-type; WT) clones by dilution of the HCT116 parental cell line and acquired two bortezomib-resistant clones (BZ clones A and E) both with mutations in PSMB5 identified by RNA sequencing performed in previous work (Fig. 1-Supplement 1 E) (Wacker et al. 2012).

      In the last paragraph of the discussion, we highlight the strength of our unbiased approach

      • Together, our work has demonstrated the potential for morphological profiling with Cell Painting to be used as an unbiased method to characterize resistance in the absence of drug treatment. Our results indicate that different mechanisms of bortezomib resistance may generate distinct morphological profiles; with larger and broader training datasets, it may be possible to identify signatures for distinct mechanisms of bortezomib resistance as well as signatures of resistance to other drugs. Though it is unclear whether this method can be extended to patient samples, where identifying intrinsic drug resistance in cells prior to treatment has the potential to improve targeted cancer therapy, our results are an encouraging proof of concept. We expect that further refinement may develop Cell Painting as a tool for identifying drug-resistant cells, perhaps even guiding strategies to overcome intrinsic resistance.

      • [Text revision] We thank the reviewers for their suggestions. We agree that the description of the experimental design was somewhat unclear and have provided greater detail and clarity, particularly regarding the generation of clones. We used the HCT116 parental cell line to generate drug-resistant clones by identifying single surviving cells after drug treatment and allowing these cells to expand prior to isolating colonies for experimentation. We did not perform experiments to confirm whether these “clones” were isogenic and can not exclude cell migration during expansion or genetic drift as convoluting factors. However, we have provided greater detail in the descriptions of our method for clone isolation in order to address this concern.

      Reviewer 1 - Minor Comment 1

      More information in Fig. 1's legend would be helpful to follow the experimental design. I found it hard to follow in its current form and had to go back to carefully reading the main text to fully understand.

      Reviewer 2 - Minor Comment 6

      The description of the resistant clonal populations is confusing. As I understand, no single-cell clones were isolated during the selection procedure. Thus, the training lines are not yet isogenic clones but oligoclonal sub-populations of the parental cell line. The authors could provide more details here and discuss the different characteristics of their sub-populations, e.g., their growth kinetics or molecular alterations.

      We bolstered the description in the results.

      • We first isolated and characterized drug-resistant cells (Fig. 1 A). To isolate drug-resistant clones, we used an approach we have described previously (Wacker et al. 2012; Kasap, Elemento, and Kapoor 2014) and the HCT116 cell line. These cancer cells express multidrug resistance pumps at low levels and are mismatch repair deficient, providing a genetically heterogeneous polyclonal population of cells (Umar et al. 1994; Papadopoulos et al. 1994; Teraishi et al. 2005) allowing for isolation of drug-resistant clones in 2-3 weeks. We hypothesize that a rapid selection of resistance could favor the isolation of clones with intrinsic resistance. To determine the appropriate drug concentrations to use in order to isolate drug-resistant clones, we performed proliferation assays on HCT116 parental cells with our drugs of interest: bortezomib, ixazomib, or CB-5083 (Fig. 1-Supplement 1 A-D). We also isolated bortezomib-sensitive (wild-type; WT) clones by dilution of the HCT116 parental cell line and acquired two published bortezomib-resistant clones (BZ clones A and E) both with mutations in PSMB5 identified by RNA sequencing performed in previous work (Fig. 1-Supplement 1 E) (Wacker et al. 2012). We characterized the bortezomib-resistant clones and found that the median lethal doses (LD50s) for bortezomib were ~2.8- to ~9-fold that of HCT116 parental cells (Fig. 1-Supplement 2 B). In contrast, bortezomib-sensitive clones had LD50s for bortezomib that ranged from ~0.7- to ~1.2-fold that of HCT116 parental cells (Fig. 1-Supplement 2 A). Together these methods provided a total of twelve bortezomib-resistant, five ixazomib-resistant, five CB-5083-resistant, and twelve bortezomib-sensitive clones as well as HCT116 parental cells for our experiments.

      We also updated the legend for Figure 1A.

      • Figure 1. Experimental design for using Cell Painting to examine morphological profiles of drug-resistant cells. (A) Graphic of the experimental workflow: we isolated drug-resistant clones by treating parental HCT116 cells with a high dose of the desired drug and then expanded them for experiments. We isolated drug-sensitive clones by diluting HCT116 cells and then expanded them for experiments. We then performed proliferation assays on select clones to screen for multidrug resistance. Next, we performed Cell Painting on both drug-resistant and -sensitive clones, using multiplexed high-throughput fluorescence microscopy of fixed cells followed by feature extraction and morphological profiling to search for features that contribute to a signature of drug resistance.

      • [Text revision] We clarified that the Bortezomib Signature did not correspond to well position (originally lines #155-157).

      Reviewer 1 - Minor Comment 9

      Line #155-156: "We found that the pattern of Bortezomib Signatures corresponded to the cell identity plate layout", the word "not" is missing before "corresponded".

      We found that the pattern of Bortezomib Signatures did not correspond to well position relative to the plate (Fig. 2-Supplement 7 B), indicating that the well position for each clone was not strongly contributing to its Bortezomib Signature.

      1. [Text revision] We explicitly described the result that some misclassified clones (WT10, WT15, and BZ06) did not have unexpected bortezomib sensitivity as determined by proliferation assays. We also moved the supplementary figure to an updated Figure 3 to better highlight this result (described below in “Figure revisions already carried out”). Lastly, we add a new figure (Figure 5-Supplement 1) to more explicitly analyze the misclassified lines (described below in “New analyses already carried out”).

      Reviewer 3 - Minor Comment 3

      The bortezomib sensitivity of the WT lines used in the last experiments was determined and did not seem to be greater than parental. This could be mentioned in the text; the figure raises the question and the answer is provided, but it's in the supplemental material.

      While the Bortezomib Signature correctly characterized the bortezomib sensitivity of most clones, it consistently misclassified others (WT10, WT15, and BZ06) (Fig 5-Supplement 1 A). Proliferation assays conducted in earlier experiments showed that WT10 and WT15 were sensitive to bortezomib while BZ06 was resistant (Fig. 1-Supplement 2 A and B). By comparing these incorrect predictions with high-confidence correct predictions, we observed differences that varied by clone type, suggesting unique morphology may be driving each of these misclassifications (Fig. 5-Supplement 1 B and C). These results are consistent with the Bortezomib Signature being generalizable to clones not included in the training dataset and suggest that morphological profiling has the potential to identify bortezomib-resistant clones based on the morphological features of cells in the absence of drug treatment.

      1. [Text revision] We clarified that the metrics (accuracy and average precision) were based on median Bortezomib Signature scores of all replicate well-level profiles per clone. We can compare samples based on rank, and difference from 95% confidence interval of permuted data. There is no current way for our method to assign a likelihood. Also note that we have updated the discussion to discuss alternative metrics (see Reviewer 1 - Minor Comment 7) These are very important distinctions, and we are grateful to the reviewer for bringing them up.

      Reviewer 3 - Major Comment 3

      The study classifies cells as binary sensitive or resistant, but would results be improved by scoring based on likelihood of being resistant/sensitive?

      Reviewer 3 - Minor Comment 2

      It is not clear whether the accuracy was based on a percentage of replicates per cell line that were classified correctly or whether that was referring to classification of the cell line overall as sensitive/resistant.

      • We next examined whether the Bortezomib Signature was able to predict the bortezomib resistance of a clone based on morphological profiling data (Fig. 3 A-E and Fig. 3-Supplement 2 A and B). We called the clone bortezomib-resistant if the median Bortezomib Signature of all replicate well profiles was greater than zero and bortezomib-sensitive if the median Bortezomib Signature less than zero. In the training dataset, the Bortezomib Signature correctly predicted the bortezomib resistance of all ten clones, with median Bortezomib Signatures for eight out of ten clones beyond the 95% confidence interval for the randomly permuted data (Fig. 3 A). The accuracy of the Bortezomib Signature was 88% while the average precision was 81% for the training dataset (Fig. 3-Supplement 2 A and B) (see Methods). The signature performed similarly well in the validation dataset (Fig. 3 B), with an accuracy of 92% and an average precision of 89% (Fig. 3-Supplement 2 A and B). In the test dataset the Bortezomib Signature correctly predicted the bortezomib resistance of all clones, though only HCT116 parental cells had a median Bortezomib Signature outside the 95% confidence interval for the randomly permuted data (Fig. 3 C). The test dataset had an accuracy of 80% and an average precision of 68% (Fig. 3-Supplement 2 A and B). Similarly, in the holdout dataset the Bortezomib Signature had an accuracy of 78% and an average precision of 69% (Fig.3 -Supplement 2 A and B), and correctly predicted the bortezomib resistance of twelve out of thirteen clones, with WT01 misclassified as bortezomib-resistant (Fig. 3 D). In the holdout dataset, four of the twelve correctly characterized clones had median Bortezomib Signatures outside the 95% confidence interval for the randomly permuted data.

      We also mirrored language when discussing the ixazomib and CB-5083 results.

      • However, only two of the four correctly identified ixazomib-resistant clones and one of the three CB-5083-resistant clones had median Bortezomib Signatures outside the 95% confidence interval of the randomly permuted data. The area under the ROC (AUROC) curve for ixazomib-resistant and CB-5083-resistant clones (0.63 and 0.60, respectively) was lower than those calculated for the training, validation, test, and holdout datasets. In addition, many of the Bortezomib Signatures for well profiles of ixazomib- and CB-5083-resistant clones, particularly those for CB-5083-resistant clones, landed within the 95% confidence interval of the randomly permuted data. These results suggest that the Bortezomib Signature is not a general signature of UPS-targeting drug resistance and instead has some specificity for bortezomib.

      • [Text revision] We added an explicit note that our image analysis pipelines are also publicly available. Our reporting of our data processing pipelines are documented fully and well above standards in our field. Linking the publicly-available resources with these methods maximizes reproducibility.

      Reviewer 1 - Minor Comment 10

      Additional details on the processing steps in the analysis pipeline in the Methods will be highly appreciated.

      We include all image analysis pipelines at https://github.com/broadinstitute/profiling-resistance-mechanisms (G. Way et al. 2023).

      1. [Text revision] We have compared our approach to the on-disease/off-disease scores as introduced in (Heiser et al. 2020). We agree with the reviewer that a discussion of these two methods would help clarify our phenotypic signature concept. The on/off score is about the degree to which a perturbation pushes disease towards a healthy state. In this case we have 3 sets of data: healthy samples (used for training), disease samples (used for training), and the sample we want to score, which should be of the form "disease + perturbation". With our approach, based on singscore, we also have 3 sets of data: sensitive samples (used for training), resistance samples (used for training), and the sample we want to score. Here, our sample we want to score could be anything, not necessarily of the form "resistance + perturbation". Furthermore, singscore does not have the concept of orthogonality to resistance/sensitivity. This would become relevant if we were exploring perturbations or conditions that would induce a resistant cell line to become sensitive, but we are not doing that here. There are other statistical differences (projection vs. rank based etc.) but the key difference is the applicability of the method to the specific problem at hand.

      Reviewer 1 - Minor Comment 7

      How is the Bortezomib Signature related to the "on-disease"/"off-disease" scores described in https://www.biorxiv.org/content/10.1101/2020.04.21.054387v1.full? Are there other alternatives used for similar binary phenotypic signatures? What is the justification for using these measurements? I would love to see this generalized concept explicitly discussed in the Discussion.

      We added the following to the discussion.

      • The Bortezomib Signature is conceptually similar to the on-disease/off-disease score (Heiser et al. 2020). Both require three phenotypic measurements: a target phenotype representing ideal, a disease phenotype, and a new phenotype to classify. However, our approach is technically different (non-parametric compared to linear projection) and our goals are different (phenotypic classification compared to perturbation alignment). Other methods also enable phenotype labeling, but they focus on single-sample annotation without regard to a target phenotype (Wawer et al. 2014; Rohban et al. 2017; Simm et al. 2018; Nyffeler et al. 2020).

      Figure revisions already carried out

      1. [Figure revision] We moved all boxplots from the original Fig. 2-Supplement 9 to the main text (also splitting Fig. 2 into Fig. 2 and 3). From the original Figure 2, we moved the accuracy and average precision bar graphs to the supplement. We also note that this change increases transparency of the discriminative signal of our signature.

      Reviewer 1 - Minor Comment 8

      I would highly recommend showing the Bortezomib Signatures from Figure 2-Supplement 9. in Fig. 2. This was the main measurement used throughout the manuscript and in my opinion, it is very important to consistently visualize the data along the manuscript, for clarity and easier reader interpretation.

      1. [Figure revision] We adjusted the position of the legend in the accuracy and average precision bar graphs (originally Fig. 2 C and D, now Fig. 3-Supplement 2) for clarity. We also note that keeping the bar chart here is standard best practice (compared to a dot plot).

      Reviewer 1 - Minor Comment 4

      I found the visualization in Fig. 2C-D not intuitive (it is properly explained in the legend). I suggest replacing the accuracy colorbar with a color marker to make it more distinct from the random permutation (|--*--|) The location of the text "mean +- SD of 100 random permutation" made me first think that it is linked to the holdout.

      1. [Figure revision] We changed the point distribution in the boxplots (from expanded to standard) to minimize overlap with the boxplot lines. We also updated the legend text to indicate that individual points in boxplots represent the Bortezomib Signature for well profiles. Note, we paste a representative example of this change above (new Figure 3).

      Reviewer 3 - Minor Comment 1

      I found the box plots somewhat difficult to interpret (especially where the WT lines had a lot of overlap with the red shaded area). Do the points in these charts correspond to replicate wells?

      We also update the figure legend.

      • Plots show values for individual well profiles (points), range (error bars), 25th and 75th percentiles (box boundaries), and median.

      • [Figure revision] [Response to Reviewer 2 - Major Comment 7] We thank the reviewer for allowing us an opportunity to clarify the mechanism. We feel that it is beyond scope of this manuscript to disentangle the molecular alterations that cause bortezomib resistance based on our Cell Painting insights. This wet lab experimental process is arduous and cost prohibitive, and we argue that one of the benefits of taking a morphology approach to resistance status is that we can detect resistant cells (and therefore cells that won’t die when presented with a treatment) without knowing the molecular mechanism.

      Nevertheless, the reviewer has encouraged us to enhance the ability for a reader to view and interpret the signature to perhaps more easily facilitate future work. Previously, we presented our signature in text form in Figure 2-Supplement 4 and in heatmap form in Figure 2-Supplement 5. Here, we add a new figure (Figure 2-Supplement 6; pasted below) which will improve interpretability.

      Reviewer 2 - Major Comment 7:

      Next to feature importance, the authors do not discuss (or I missed) what biology the features represent. Such the reader is left wondering what the actual mechanism of bortezomib resistance could be and if cell painting could shed light on the molecular alterations that cause the treatment resistance. While reviewing, I thus wondered which audience the authors targeted with their manuscript. A more focused analysis of their data that highlights aspects of the study either for the machine learning community, the cell biology community, or the precision oncology community would greatly benefit the manuscript's impact. In its current form, the study's findings seem diluted and spread across a wide range of research questions.<br />

      • Figure 2-Supplement 6. Bortezomib Signature visualized by CellProfiler features. Visualization of CellProfiler features contributing to the Bortezomib Signature. Features with high values (mean signature estimates) in resistant cells are purple while features with low values in resistant cells are green. The mean signature estimates were based on Tukey's Honestly Significant Difference test score and the number in each box represents the number of features used to calculate the mean signature estimate.

      Additionally, we add the following to the results section:

      • We then examined the grouping of features across compartments and channels and found radial distribution features were higher in resistant cells (Fig 2-Supplement 6).

      The code change to generate the signature visualization summary is available at: https://github.com/broadinstitute/profiling-resistance-mechanisms/pull/131

      New analyses already carried out

      1. [New analysis] [Response to Reviewer 2 - Major Comment 5] We agree that a systematic analysis of feature selection methods will provide additional insights not already in the manuscript. Therefore, we have performed two new computational experiments to compare our linear modeling feature selection approach against other standard approaches. We demonstrate that our linear modeling approach is effective at isolating the core differences between resistant and sensitive classes.

      Specifically, we performed two analyses: A) UMAP and B) k-means cluster analysis. We analyzed profiles defined by four different feature selection approaches: 1) Using all traditional CellProfiler features; 2) Using the traditional CellProfiler feature selection approach (removing low variance features, high correlating features, etc.); 3) Using 45 random features (same size as Bortezomib Signature); and 4) Using only the bortezomib signature features. We performed Fisher’s exact tests to derive odds ratios of cluster membership by resistance status and calculated Silhouette widths to quantify relative proximity of clusters.

      This analysis generates a new supplementary figure (see below), and demonstrates that the linear-modeling-based feature selection isolated the features driving the differences between the clone types (resistance vs. wildtype) while the standard approaches do not as effectively separate.

      Reviewer 2 - Major Comment 5:

      A fascinating bit of the manuscript is the description of the feature selection from the screen is done systematically, considering the technical and biological variability and technical artifacts and modeling covariates using linear models seems a very appropriate way of doing so and could serve as another proof of concept that this is indeed the most robust way of modeling and removing signal of technical covariates from the data. Yet, I wondered why the authors do not discuss other means of feature selection or dimensionality reduction; further, they need to show how the features cluster the cell lines or why impact (information content) different features deliver. For an audience interested in the technical aspects of cell painting analysis and machine learning based on the data, that would, IMHO, be the most exciting questions.

      • Figure 3-Supplement 3. Benchmarking linear-modeling feature selection to separate clones by bortezomib resistance. Uniform Manifold Approximation and Projection (UMAP) analysis of the qualitative separability of (A) resistance status and (B) Bortezomib Signature scores across four different feature spaces. (C) k-means clustering from k=2 to k=14 of average odds ratio, maximum odds ratio (Fisher’s exact test), and Silhouette width using Bortezomib Signature features.

      Additionally, we add the following to the results section:

      • We then compared our linear-modeling approach to feature selection against other feature spaces and found that the Bortezomib Signature clusters same-type clones (bortezomib-resistant vs. bortezomib-sensitive) with higher enrichment compared to the full feature space, standard feature selection (see Methods), or a random selection of 45 features (Fig 3-Supplement 3).

      And methods section, describing this analysis:

      • We were also interested in comparing the ability of different feature spaces to cluster clones of the same type (resistant vs. sensitive). This analysis would determine if the Bortezomib Signature features, which we derived using linear modeling to isolate biological from technical variables, had a greater ability to cluster. We compared the Bortezomib Signature against three other feature spaces: 1) the full feature space, 2) standard feature selection (see Image data processing methods), and 3) 45 randomly selected features. We performed two analyses using these four feature spaces including Uniform Manifold Approximation and Projection (UMAP) (McInnes et al. 2018) and k-means clustering. For UMAP, we used default umap-learn parameters to identify two UMAP coordinates per feature space. We then visualized the clusters by their resistance status and Bortezomib Signature score. The UMAP analysis represents a qualitative analysis. Next, we applied k-means clustering with 25 initializations across a range of 2-14 clusters (k). Prior to clustering and for each feature space, we applied principal component analysis (PCA) and transformed each feature space into 30 principal components. This step was necessary to compare k-means clustering metrics, which are sensitive to the feature space dimensionality. We applied a Fisher’s exact test to each cluster using a two-by-two contingency matrix that specified cluster membership for each clone classification (resistant vs. sensitive). We visualized the mean odds ratio and max cluster odds ratio for each feature space across k. A high odds ratio tells us that the feature space effectively clusters clones of the same resistance status. Lastly, we calculated Silhouette width (the average proximity between samples in one cluster to the second nearest cluster) for each feature space across k.

      The code change to derive the UMAP coordinates, perform clustering, and generate the figure is available at https://github.com/broadinstitute/profiling-resistance-mechanisms/pull/132

      1. [New analysis] [Response to Reviewer 3 - Major Comment 1] We thank the reviewer for this suggestion, which allowed us to explore the misclassified samples in more depth. We added a new supplementary figure in which we summarized all bortezomib clones (wildtype and resistant) in their accuracy based on the bortezomib signature (panel A). We did not include training set samples in this analysis. Using samples that were consistently incorrectly classified with high confidence (three samples: WT15, BZ06, WT10) we performed two separate two-sample Kolmogorov–Smirnov (KS) tests. Specifically, we compared high incorrect wildtype to high correct wildtype and high incorrect resistant to high correct resistant. Our results indicate that most bortezomib signatures were significantly different between correct and incorrect assignments (panel B), and that the signature features varied between resistant and wildtype misclassification tests (panel C).

      Reviewer 3 - Major Comment 1:

      While the claims are largely substantiated, there are a few points where further consideration would improve the manuscript. Several cell lines were mis-classified with what appears to be a high degree of certainty. Can the authors tell what was driving those predictions? Was there something in the morphological signature that weighed more heavily in those cases?

      • Figure 5-Supplement 1. Examining the accuracy of clone classification and misclassification of clones. (A) Proportion of high-confidence correct, low-confidence correct, low-confidence incorrect, and high-confidence incorrect predictions of well profiles across clones in the test, holdout, and validation sets. High-confidence predictions (high) had a Bortezomib Signatures greater (resistant clones) or less than (sensitive) the 95% confidence interval of randomly permuted data while low-confidence predictions (low) had Bortezomib Signatures within the 95% confidence interval of randomly permuted data. (B) Visualization of Kolmogorov-Smirnov (KS) test statistic means of feature groups across channels and cellular compartments. (C) Plot of the KS test statistic means for feature groups in bortezomib-resistant vs. -sensitive cells. Each feature group is color coded by the imaging channel.

      Additionally, we add the following to the results section:

      • While the Bortezomib Signature correctly characterized the bortezomib sensitivity of most clones, it consistently misclassified others (WT10, WT15, and BZ06) (Fig 5-Supplement 1 A). Proliferation assays conducted in earlier experiments showed that WT10 and WT15 were sensitive to bortezomib while BZ06 was resistant (Fig. 1-Supplement 2 A and B). By comparing these incorrect predictions with high-confidence correct predictions, we observed differences that varied by clone type, suggesting unique morphology may be driving each of these misclassifications (Fig. 5-Supplement 1 B and C). These results are consistent with the Bortezomib Signature being generalizable to clones not included in the training dataset and suggest that morphological profiling has the potential to identify bortezomib-resistant clones based on the morphological features of cells in the absence of drug treatment.

      And methods section, describing this analysis:

      Some profiles were consistently predicted incorrectly with high confidence but in the opposite direction (see Figure 5-Supplement 1). For a well-level profile to be categorized as high-confidence (in either the correct or incorrect directions), it needed to score beyond the 95% confidence interval of the randomly permuted data range. For example, a high-confidence incorrect resistant profile would have a Bortezomib Signature below 95% confidence interval of the randomly permuted data. To evaluate the features driving the differences in these samples, we applied two-sample Kolmogorov–Smirnov (KS) tests per Bortezomib Signature feature. We applied these tests to two separate groups: 1) misclassified bortezomib-sensitive vs. high-confidence accurate bortezomib-sensitive and 2) misclassified bortezomib-resistant vs. high-confidence accurate bortezomib-resistant.

      The code change to generate the UMAP coordinates and figure is available at https://github.com/broadinstitute/profiling-resistance-mechanisms/pull/130

      Description of analyses that authors prefer not to carry out

      1. [Response to Reviewer 2 - Minor Comments 1 and 2]: These are interesting suggestions! Still, we prefer not to speculate on the biological mechanism of the Bortezomib signature. Connecting morphological features identified as contributing to the Bortezomib Signature by Cell Painting to specific biological pathways would demand considerable cell-based assays to validate. In addition, our analyses suggest that the features contributing to the Bortezomib Signature are spread across a range of cellular compartments and channels, making it difficult to pin down specific mechanisms or pathways as likely contributors to bortezomib resistance. However, we are adding a figure to increase interpretability of the signature, which will aid in developing future hypotheses. Note that the signature was not possible to detect by eye (Fig. 2 A).

      Reviewer 2 - Minor Comment 1:

      There could be some speculation on the mechanism of Bortezomib resistance concerning the literature with the existing image data. For example, Bortezomib resistance is connected to serine synthesis and how a particular feature could contribute to the known mechanism.<br />

      Reviewer 2 - Minor Comment 2:

      Along the same lines, the authors could show that larger cells lead to resistance with microscopic images.

      2. [Response to Reviewer 2 - Major Comment 8]: We appreciate the reviewer’s concern that our work using HCT116 clonal cells lines may not directly reflect results from patient samples. Our choice was based on previously published work demonstrating the efficiency with which HCT116 cells generate resistant clones due to diminished DNA mismatch repair and decreased expression of drug efflux pumps. Since our work is a proof of concept rather than a comprehensive demonstration of translating morphological profiling into clinical practice, we believe that experiments using multiple patient cell lines from different tissues as well as digital pathology records to be beyond the scope of this work. We instead chose to tone down the language of our manuscript to more clearly acknowledge the limitations of our work and clarify this as a proof of concept.

      Reviewer 2 - Major Comment 8 (relevant excerpt):

      I suggest the authors test their approach on at least two other cell lines (maybe from different tissues) and benchmark their results against a dataset of digital pathology where such predictions are made from stained and analyzed tissue slices. This way, after a thorough benchmark against related third-party data sets, the method would significantly gain relevance, the paper would appeal to a broader audience, and the advance gains more merit.<br />

      3. [Response to Reviewer 3 - Major Comment 2]: The bortezomib sensitivity of ixazomib- and CB-5083-resistant clones was not determined, and hence can not be ruled out as a possible explanation for their high Bortezomib Signature scores. However, we prefer not to conduct additional proliferation assays for the misclassified clones (IX02, WT06, CB14, CB16) in the presence of bortezomib to determine whether coincidental bortezomib resistance might explain the signature performance. Our rationale is that three other misclassified clones (WT10, WT15, and BZ06) had the expected bortezomib sensitivity in proliferation assays (Fig. 1-Supplement 2), meaning that additional proliferation assays may not reveal any insights regarding the signature performance.

      Reviewer 3 - Major Comment 2:

      Was the bortezomib sensitivity of the IX (or CB) resistant cell lines determined? If there were differences, this could explain some of the variation in the morphological signatures. This could be easily done in one or two growth experiments.

      4. [Response to Reviewer 2 - Major Comment 7]: Thank you for pointing this out. Our goal is to keep the study multi-disciplinary. We are adding a figure to increase interpretability of the signature, and adding text-based clarifications.

      Reviewer 2 - Major Comment 7 (relevant excerpt):

      While reviewing, I thus wondered which audience the authors targeted with their manuscript. A more focused analysis of their data that highlights aspects of the study either for the machine learning community, the cell biology community, or the precision oncology community would greatly benefit the manuscript's impact. In its current form, the study's findings seem diluted and spread across a wide range of research questions.<br />

      5. [Response to Reviewer 2 and 3 - Major Comments 6 and 4]: We prefer not to expand the scope of the model to predict other drug signatures. This would require a substantial amount of work to generate the appropriate drug-resistant clones, collect the imaging data, and analyze it, and we think it important to convey the purpose of our paper is proof of concept. We do not feel that the time invested in performing this analysis would result in adequate returns beyond what we already demonstrate.

      Reviewer 2 - Major Comment 6.

      Interestingly, the Bortezomib signature is specific to the drug and not a broad range of proteasomal inhibitors. However, seeing the common features between all the proteasomal inhibitors would be interesting.

      Reviewer 3 - Major Comment 4

      There was some predictive ability of the Bortezomib Signature for ixazomib resistance. Were there some features that were correlated with IX-resistance, i.e. UPS pathway, versus specific to bortezomib? Do the features suggest anything about resistance mechanisms or is the feature set too abstruse to interpret?

      References

      Foroutan, Momeneh, Dharmesh D. Bhuva, Ruqian Lyu, Kristy Horan, Joseph Cursons, and Melissa J. Davis. 2018. “Single Sample Scoring of Molecular Phenotypes.” BMC Bioinformatics 19 (1): 404.

      Heiser, Katie, Peter F. McLean, Chadwick T. Davis, Ben Fogelson, Hannah B. Gordon, Pamela Jacobson, Brett Hurst, et al. 2020. “Identification of Potential Treatments for COVID-19 through Artificial Intelligence-Enabled Phenomic Analysis of Human Cells Infected with SARS-CoV-2.” bioRxiv. https://doi.org/10.1101/2020.04.21.054387.

      McInnes, Leland, John Healy, Nathaniel Saul, and Lukas Großberger. 2018. “UMAP: Uniform Manifold Approximation and Projection.” Journal of Open Source Software 3 (29): 861.

      Nyffeler, Johanna, Clinton Willis, Ryan Lougee, Ann Richard, Katie Paul-Friedman, and Joshua A. Harrill. 2020. “Bioactivity Screening of Environmental Chemicals Using Imaging-Based High-Throughput Phenotypic Profiling.” Toxicology and Applied Pharmacology 389 (January): 114876.

      Rohban, Mohammad Hossein, Shantanu Singh, Xiaoyun Wu, Julia B. Berthet, Mark-Anthony Bray, Yashaswi Shrestha, Xaralabos Varelas, Jesse S. Boehm, and Anne E. Carpenter. 2017. “Systematic Morphological Profiling of Human Gene and Allele Function via Cell Painting.” eLife 6 (March). https://doi.org/10.7554/eLife.24060.

      Simm, Jaak, Günter Klambauer, Adam Arany, Marvin Steijaert, Jörg Kurt Wegner, Emmanuel Gustin, Vladimir Chupakhin, et al. 2018. “Repurposing High-Throughput Image Assays Enables Biological Activity Prediction for Drug Discovery.” Cell Chemical Biology 25 (5): 611–18.e3.

      Wacker, Sarah A., Benjamin R. Houghtaling, Olivier Elemento, and Tarun M. Kapoor. 2012. “Using Transcriptome Sequencing to Identify Mechanisms of Drug Action and Resistance.” Nature Chemical Biology 8 (3): 235–37.

      Wawer, Mathias J., Kejie Li, Sigrun M. Gustafsdottir, Vebjorn Ljosa, Nicole E. Bodycombe, Melissa A. Marton, Katherine L. Sokolnicki, et al. 2014. “Toward Performance-Diverse Small-Molecule Libraries for Cell-Based Phenotypic Screening Using Multiplexed High-Dimensional Profiling.” Proceedings of the National Academy of Sciences of the United States of America 111 (30): 10911–16.

      Way, Gregory, Yu Han, David Stirling, and Shantanu Singh. 2023. Broadinstitute/profiling-Resistance-Mechanisms: Analysis for Preprint. Zenodo. https://doi.org/10.5281/ZENODO.7803787.

      Way, Gregory P., Maria Kost-Alimova, Tsukasa Shibue, William F. Harrington, Stanley Gill, Federica Piccioni, Tim Becker, et al. 2021. “Predicting Cell Health Phenotypes Using Image-Based Morphology Profiling.” Molecular Biology of the Cell 32 (9): 995–1005.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      This study aimed to determine a morphological signature based on CellPainting that could predict resistance of multiple myeloma cells to bortezomib. Bortezombi-resistant clones were derived from parental wild type cells and tested for multidrug resistance. Best practice machine learning methods were applied to identify features that were not correlated with technical variation or nonspecific heterogeneity between cell lines. A Bortezomib Signature comprised of 45 selected features complied using a robust ranked method was described. This signature score was then used to classify cell lines, including wild-type and bortezomib-resistant clones not included in the training set and clonal lines selected for resistance to other drugs. The Bortezomib Signature performed better than chance for predicting resistance to this drug in validation and holdout datasets, and for an independent dataset of clones not used in the initial training. Some predictive power was observed for a drug targeting the same pathway (UPS), but with lower accuracy.

      Major comments

      While the claims are largely substantiated, there are a few points where further consideration would improve the manuscript.<br /> Several cell lines were mis-classified with what appears to be a high degree of certainty. Can the authors tell what was driving those predictions? Was there something in the morphological signature that weighed more heavily in those cases?<br /> Was the bortezomib sensitivity of the IX (or CB) resistant cell lines determined? If there were differences, this could explain some of the variation in the morphological signatures. This could be easily done in one or two growth experiments.<br /> The study classifies cells as binary sensitive or resistant, but would results be improved by scoring based on likelihood of being resistant/sensitive?<br /> There was some predictive ability of the Bortezomib Signature for ixazomib resistance. Were there some features that were correlated with IX-resistance, i.e. UPS pathway, versus specific to bortezomib? Do the features suggest anything about resistance mechanisms or is the feature set too abstruse to interpret?<br /> It is not clear from the Discussion whether this type of analysis is more broadly applicable to cell lines derived from patients, rather than selected from a parental cell line, or if this approach would be more efficient than genotyping or next-gen sequencing. How many replicates and ground truth cell lines would be necessary for predictive confidence?

      Minor comments

      I found the box plots somewhat difficult to interpret (especially where the WT lines had a lot of overlap with the red shaded area). Do the points in these charts correspond to replicate wells?<br /> It is not clear whether the accuracy was based on a percentage of replicates per cell line that were classified correctly or whether that was referring to classification of the cell line overall as sensitive/resistant.<br /> The bortezomib sensitivity of the WT lines used in the last experiments was determined and did not seem to be greater than parental. This could be mentioned in the text; the figure raises the question and the answer is provided, but it's in the supplemental material.<br /> Ten lines of text are duplicated on page 5.

      Significance

      The strengths of this study are the machine learning best practice and detailed methodology. The experiments could be reproduced and statistical analysis is more than adequate. The analysis takes into account batch effects, well position, differences in cell numbers, and other sources of technical variation that complicate high-content image analysis. It is a good exemplar of how unsupervised morphological profiling can be applied to imaging data. The major limitation is the generalizability of this particular method for patient samples. This could be addressed in the Discussion.

      The advances presented here are largely technical, as machine learning best practices are implemented to address a specific type of drug resistance.

      This study would be especially valuable to computationally-oriented biologists and has some translational appeal, which could be broadened by including some more information in the Discussion regarding the logistics of this approach in the clinic and whether the signatures identified have interpretability as well as predictive power.

      Areas of expertise: image analysis, systems biology, cancer biology

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In the study reviewed herewith, Kelley et al. present an interesting approach to evaluate whether image-based profiling via the cell painting assay can enable a model to predict if a cell line is resistant to the anti-cancer drug bortezomib (a specific inhibition of the proteasome). Therefore, they employed the human colorectal cancer-derived derived cell line HCT116 and used a previously published selection protocol to derive bortezomib resistant clonal subpopulation. These sub-populations were then subjected to cell painting, and it was tested (by dividing the data set into training, test, validation, and holdout) if a machine learning model could be trained on selected image-derived features that would predict if the cell line is indeed resistant to bortezomib treatment. This worked in 7 out of ten cases and two commercially available resistant HCT116 clones. Theoretically, this implies that image-based cellular profiling of tissue materials could work as a fast and reliable method to gauge the treatment resistance of cancer patients before treatment and open new avenues for personalized medicine.

      Major comments:

      • the paper is well written, and the text is clear, as is the presentation of data and transparency of methods being utilized.
      • the methods were applied appropriately and followed established standards in the field.
      • The paper's premise is timely and interesting, addressing a pressing issue in cancer therapy: making informed treatment decisions fast, based on markers found in tumors early in tumor development, and using image-based screening for characterizing drug resistance before treatment could be an option.
      • A fascinating bit of the manuscript is the description of the feature selection from the screen is done systematically, considering the technical and biological variability and technical artifacts and modeling covariates using linear models seems a very appropriate way of doing so and could serve as another proof of concept that this is indeed the most robust way of modeling and removing signal of technical covariates from the data.
      • Yet, I wondered why the authors do not discuss other means of feature selection or dimensionality reduction; further, they need to show how the features cluster the cell lines or why impact (information content) different features deliver. For an audience interested in the technical aspects of cell painting analysis and machine learning based on the data, that would, IMHO, be the most exciting questions.
      • Interestingly, the Bortezomib signature is specific to the drug and not a broad range of proteasomal inhibitors. However, seeing the common features between all the proteasomal inhibitors would be interesting.
      • Next to feature importance, the authors do not discuss (or I missed) what biology the features represent. Such the reader is left wondering what the actual mechanism of bortezomib resistance could be and if cell painting could shed light on the molecular alterations that cause the treatment resistance. While reviewing, I thus wondered which audience the authors targeted with their manuscript. A more focused analysis of their data that highlights aspects of the study either for the machine learning community, the cell biology community, or the precision oncology community would greatly benefit the manuscript's impact. In its current form, the study's findings seem diluted and spread across a wide range of research questions.
      • Lastly, I find some misfits between the question, the model used, and the conclusions drawn. The authors start by exploring the problem of bortezomib resistance in cancer treatment, which they say is a devastating issue for patients with, e.g., multiple myeloma. Yet, the authors use HCT116 as their model cell line, a microsatellite instable, colorectal cell line with several intrinsic mutations that make it a difficult model to address physiologically relevant medical problems after all. The authors then go on to suppose that their method might be suitable to diagnose resistance in patient samples, but I am not convinced this conclusion can be speculated based on data from HCT cells. I suggest the authors test their approach on at least two other cell lines (maybe from different tissues) and benchmark their results against a dataset of digital pathology where such predictions are made from stained and analyzed tissue slices. This way, after a thorough benchmark against related third-party data sets, the method would significantly gain relevance, the paper would appeal to a broader audience, and the advance gains more merit.

      Minor comments

      • There could be some speculation on the mechanism of Bortezomib resistance concerning the literature with the existing image data. For example, Bortezomib resistance is connected to serine synthesis and how a particular feature could contribute to the known mechanism.
      • Along the same lines, the authors could show that larger cells lead to resistance with microscopic images.
      • The authors talk about validating the mutation - PSMB5 by RNA-seq. However, the data for the genotyping/sequencing/characterization of these newly generated BZ-resistant lines are missing.
      • on page 5, paragraph 4, there is a sizeable copy-and-paste error of text being identically replicated.
      • What was the LD 90 per drug on HCT cells? Rather than LD90 foldchanges, absolute concentrations should be used in the results and discussion to allow the reader to vet the conclusions.
      • The description of the resistant clonal populations is confusing. As I understand, no single-cell clones were isolated during the selection procedure. Thus, the training lines are not yet isogenic clones but oligoclonal sub-populations of the parental cell line. The authors could provide more details here and discuss the different characteristics of their sub-populations, e.g., their growth kinetics or molecular alterations.

      Referees cross-commenting

      I agree that, perhaps, my major criticism of the paper was the manuscript's 'overselling' of claims that were only weakly supported by the data. Yes, if the authors tune down their claims and clearly state that this is an interesting starting point and proof of concept study, it might be ok to publish with only minor revisions. If the claims should be more generalized, then this study needs more data supporting the conclusions and the method's predictive power.

      Significance

      In its current form, the manuscript describes an incremental technical advance that addresses an interesting premise for precision oncology but explores it on a single cell line. The manuscript is timely in that it adds to a growing body of research around the utilization of image-based profiling in various areas of biomedical research using cell painting to create reference datasets for training advanced machine learning models for diagnosis. This research might be interesting and relevant for cell biology, precision medicine, oncology, image-based profiling, and machine-learning communities.

      My expertise includes image-based profiling, cell-based screening, assay development, functional genomics, cancer research, drug discovery, machine learning, and data science. There is no aspect of the study I cannot review.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors use Cell Painting, a high-content image-based phenotypic assay, to distinguish between clonal cancer cell lines that are resistant versus sensitive to a proteasome inhibitor anti-myeloma drug called bortezomib. The authors characterized a high-dimensional cell morphology signature for bortezomib-resistance, evaluated it on an independent subset of cell lines, and evaluated specificity in respect to other drugs targeting the ubiquitin-proteasome system. The authors thus propose image-based morphology characterization as an alternative method for characterizing drug resistance.

      Strengths: solid methodology - cell lines validation of drug resistance, extensive data collection, thorough validation of the analysis pipeline, avoiding potential confounders, biases and proper data partitioning to test and hold-out (what the authors refer to as "machine learning best practices").

      Weakness: weak discriminative signal. Some aspects of the writing could be improved to make the manuscript easier to follow (see Minor comments).

      Major comments:

      While I am convinced that the signature captures morphological phenotypes associated with drug resistance, at the cumulative scale, the discriminative signal of a single cell type seems weak. Specifically, it is not clear whether the signature can effectively capture the drug resistance of a single cell line. In Figure 2-Supplement 9, considering the test (C) and the holdout (D), only 1/9 BZ clones' median signatures were beyond the 95% confidence interval, with 4/6 and 2/6 WT cell types with median signatures beyond the positive and negative 95% confidence interval correspondingly. When defining bortezomib-sensitivity according to the median signatures' sign (>0 or <0) of a cell line, Figure 2-Supplement 9 shows that in the test+holdout there are 9/9 correct bortezomib-resistance (BZ) and 6/7 correct bortezomib-sensitive (WT) predictions. However, similar discrimination levels also appeared in the other drugs (ixazomib, CB-5083), making the statements about specificity less grounded. When the authors evaluate the AUROC they report ~0.6 (line #194) for the non-specific (ixazomib, CB-5083) drugs versus ~0.75 for bortezomib-resistance (line #202). With Fig. 4, the data fully supports the argument that the bortezomib-signature encodes bortezomib-resistance, but the signal is weak. Thus statements such as "We found the Bortezomib Signature could predict whether a cell line was bortezomib-resistant or bortezomib-sensitive" (line #172) and the specificity statements in the abstract" (line #28) are not supported by the data in my opinion. I would recommend the authors to tune down these and other related statements throughout the manuscript. An alternative would be to increase the number of wells and see whether this weak signal can indeed be statistically amplified with many replicates to make a robust and specific characterization of a cell line's bortezomib-sensitivity (but I assume this is a lot of work and probably out of scope of this manuscript). I think it is also important to discuss in more detail the interpretation of these results (including Figure 2-Supplement 9), in this context, in the Discussion.

      Minor comments:

      Suggested clarifications (some might be less relevant if the manuscript is designed for experts in the more clinical domain who are familiar with these terms / style):

      1. More information in Fig. 1's legend would be helpful to follow the experimental design. I found it hard to follow in its current form and had to go back to carefully reading the main text to fully understand.
      2. What is LD90 (line #87)? LD50 (line #97)?
      3. It was not clear to me in the text which and how many cell lines were evaluated and the reader is forced to go to the SI. For example, "(BZ01-10 and BZ clones A and E)" (line #96-97) and "wild-type clones (WT01-05, 10, and 12-15)" (line #98) appeared when presenting the results without a clear explanation and made it harder for me to follow. Summary of the data (for example, based on Figure 2-Supplement 8) can be briefly mentioned in the text to make it more clear for the reader.
      4. I found the visualization in Fig. 2C-D not intuitive (it is properly explained in the legend). I suggest replacing the accuracy colorbar with a color marker to make it more distinct from the random permutation (|--*--|) The location of the text "mean +- SD of 100 random permutation" made me first think that it is linked to the holdout.
      5. Lines #104-111 were duplicated in lines #114-122.
      6. The "Bortezomib Signature" is a critical measurement but is only briefly mentioned in lines 150-151 ("..based on the direction-sensitive ranking method for phenotype analysis, singscore (Foroutan et al., 2018)"). Please provide more information/intuition.
      7. How is the Bortezomib Signature related to the "on-disease"/"off-disease" scores described in https://www.biorxiv.org/content/10.1101/2020.04.21.054387v1.full? Are there other alternatives used for similar binary phenotypic signatures? What is the justification for using these measurements? I would love to see this generalized concept explicitly discussed in the Discussion.
      8. I would highly recommend showing the Bortezomib Signatures from Figure 2-Supplement 9. in Fig. 2. This was the main measurement used throughout the manuscript and in my opinion, it is very important to consistently visualize the data along the manuscript, for clarity and easier reader interpretation.
      9. Line #155-156: "We found that the pattern of Bortezomib Signatures corresponded to the cell identity plate layout", the word "not" is missing before "corresponded".
      10. Additional details on the processing steps in the analysis pipeline in the Methods will be highly appreciated.

      Referees cross-commenting

      My main critic is regarding "over selling" a weak discriminative signal. Specifically, I am not convinced that the major claims regarding predicting sensitivity and specificity at the single cell types scales are supported by the data. Since reviewer #2 and #3 did not raise this concern I think it is worth discussion here.

      Once these statements are tuned down - I think no significant additional work is needed to make the point that they can measure a discriminative signal. If they want to make these claims, perhaps they'd like to collect more data to gain statistical power (but I am not optimistic this will work at the single cell level).

      Personally, I was happy with the authors' choice of cell lines not included in the training dataset. I am not convinced that additional cell lines + validations are necessary for making the point of a proof of principle.

      Significance

      Cell Painting was applied to many applications, but as far as I am aware this is the first attempt for an image-based phenotypic characterization of drug resistance. While the authors established that this approach can measure, to some extent, bortezomib-sensitivity, at the current state of the results, I am not convinced that cell painting can be practically used to assess bortezomib-sensitivity of a single cell line. However, this does not imply that the same approach can not achieve the goal, perhaps by using other cell painting markers for bortezomib-sensitivity, or with the same markers to assess sensitivity of different drugs. The cell painting + analysis approaches are not new and the clinical impact is questionable, but the technical aspects (data, analysis) are exceptional and the concept may hold as I described above.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary: Forer and Otsuka provide first-rate evidence for tethers fixed in place between separating anaphase chromosomes using electron tomography. The authors traced the anaphase movement of a number of living cells before fixation for examination using electron tomography. The manuscript is clearly written and provides an excellent introduction and discussion of the known literature. The reader will have an excellent background to see the importance of this work.

      Major comments:<br /> - Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      No further experiments are needed. The data are very supportive, and extremely clear.<br /> - Are the data and the methods presented in such a way that they can be reproduced? Yes.<br /> - Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments:<br /> - Are prior studies referenced appropriately? Yes.<br /> - Are the text and figures clear and accurate? Absulotely.<br /> - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The authors are to congratulated on their major contribution to this study on tethers between separated daughter chromosomes. It is a tpur deforce to go from the living cells to fixing and identifying the same separated chromosomes using electron tomography to see the ultrastructure of the fibers seen fir.

      Referees cross-commenting<br /> Thank you reviewer #2. The manuscript should be published. It is an excellent contribution.

      We thank the reviewer for the appreciation of the clarity and quality of our work.

      Reviewer #1 (Significance):

      Provide contextual information to readers (editors and researchers) about the novelty of the study, its value for the field and the communities that might be interested.

      This manuscript is the first to use electron tomography to identify the tethers between separated anaphase chromosomes. Forer and the laetMichael Berns and their co-authors have published a number of papers using phase microscopy and lasers to report on the physical nature and elastic properties of these fibres in the past. Forer and Otsuka have presented first-rate evidence for the reality of these structures using electron tomography. This manuscript should highlighted in the published journal.<br /> The chemical identity of these fibers as the authors state is unclear.

      The following aspects are important:

      • Audience: describe the type of audience ("specialized", "broad", "basic research", "translational/clinical", etc...) that will be interested or influenced by this research; how will this research be used by others; will it be of interest beyond the specific field?

      This exciting contribution will be read by anyone interested in mitosis. It will be of interest to all Cell Biologists because of the careful manner in which the living cells were studied before they were fixed for examination using electron tomography. The readers will be dreaming how they can use this process on their Cell Biology problems._

      • Please define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I am a cell Biologist who has made contributions, both in light microscopy and in transmission microscopy on diving cells, both in tissue culture and in situ in aviav and zebrafish embryos.

      We thank the reviewer for appreciating the significance of our work.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In this paper, the authors use light microscopy and electron tomography to study anaphase chromosomes in crane fly spermatocytes. They find that there are two "tether" structures that connect telomeres of sister chromatids. One tether is thicker (denser) and extends between sister chromatids during early but not late anaphase, whereas a second, less-dense tether maintains contact with both sister chromatids in all examined stages of anaphase. The paper makes arguments as to what the tethers could or could not be. Specifically, they are too numerous to be ultrafine DNA bridges seen in various normal or abnormal segregation events and they also do not affect anaphase chromosome motion the same way ultrafine DNA bridges do.

      Major comments:<br /> The major claim that there are tethers that connect sister chromatids in anaphase is supported by the data. Moreover, the data resolves two types of tethers on the basis of their density. While it is unclear what the composition of the tethers are, the paper makes a convincing case that they cannot be the DNA ultrafine bridges seen in other studies. The discussion has sufficient caveats that most readers will see that more work is needed to identify the composition of the two tethers. In my opinion, no further experiments are needed to support the modest claims of this paper. Therefore, I only have minor comments that may hopefully improve the paper's clarity.

      We thank the reviewer for the positive evaluation of our work.

      Minor comments:<br /> It was argued that the tethers reported here were also seen in other species and cellular contexts, where the imaging work was done with projection EM imaging. Presumably, what is new here is the usage of electron tomography. It would help readers if the authors explained why the electron tomography done here was essential to arrive at key conclusions.

      Thank you for the useful comment. We have added the explanation of why electron tomography was critical to visualise small tether structures to the last paragraph of the Discussion on page 7.

      p.3 mitochondria appeared to be fixed properly ... (e.g., Figs. 1C, 2B) - I don't see any mitochondria in any figures. Perhaps this observation should be noted as "not shown"?

      We thank the reviewer for pointing this out. We have added an electron micrograph of mitochondria to the Supplementary Figure 1.

      p.3 The images shown in Figs. 1, 2, 4 - The figures should be called out in the order; in this case, Fig 3 has not been called out yet.

      We have corrected the order of the figures.

      p.4 we did not find any other connecting structures - Because the sample was processed by traditional EM methods, it's safer to add a caveat that other connecting structures could be missed if they were disrupted by sample prep or if they did not pick up stain as well as the two structures presented in this paper.

      We have clarified that our sample was chemically fixed in the first paragraph of the Discussion on page 4. Because the details of how our samples were prepared are described in the Method section, we did not add further details to this paragraph.

      p.7 we expect such structures to be commonly seen in other cell types as well if they are examined carefully - Instead of saying that examinations should be done "carefully", it would be more helpful to specify how other cell types should be examined. This work shows that the bridges can be found if the cells are either sectioned parallel to the spindle axis or if a sufficiently large volume is sampled.

      We have now clarified that 3D electron microscopy techniques such as electron tomography are critical to visualise small tether structures in the last paragraph of the Discussion on page 7.

      Please use consistent spelling/hyphenation of ultrafine/ultra-fine and word choice (strands vs. bridges).

      Referees cross-commenting<br /> I agree with my co-reviewers's comments and have no further suggestions._

      Reviewer #2 (Significance):

      This may be the first use of electron tomography to study the structural details of tethers that connect chromosomes in anaphase cells. The data is of sufficient quality to reveal differences in density. Namely, one class of tether appears to be an extension of the chromosome while the other class is composed of thin filaments. This study is novel in that it characterizes a mitosis-associated complex that is poorly studied compared to the microtubule-based spindle apparatus and the kinetochore. Hopefully, the tethers will draw more attention and further characterization by methods like super-resolution microscopy and cryo-electron microscopy. My expertise is in chromatin, mitotic machines, and cryo-electron tomography.

      We thank the reviewer for appreciating the novelty and the impact of our work.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Summary:

      Tethers between telomeres of chromosomes in anaphase were inferred from earlier studies of laser microbeam cutting experiments. The current paper presents images from electron tomography of crane fly spermatocytes that substantiates the earlier inference. The authors deduce that the darker filaments and the lighter filaments that they visualize may be the structural tethers at telomeres.

      Major comments:

      The experiments are carefully done, and the conclusions are appropriately worded to qualify any caveats. This short communication is well-presented, and I have only a few comments._

      We thank the reviewer for appreciating the clarity and quality of our work.

      The authors should expand their list of references on bridges to include those listed by Warecki et al (Curr Biol 33:1-17, 2023; refs 15-26, etc).

      We do not think it is necessary to expand the list of references for ultra-fine DNA bridges. In the article we submitted, we discussed the Warecki at al article in the penultimate paragraph of the Discussion; we concluded that the bridges that Warecki at al described are different from ours in having so few per cell that they couldn’t be tethers, and further that there was no evidence that those bridges were elastic. For those reasons, we do not find discussion of those proteins relevant to tethers, any more than would listing all the proteins associated with ultra-fine DNA bridges be relevant to the elastic tethers.

      In the Discussion, we discussed data suggesting that a known elastic protein titin was present; that is as far as we wanted to go on speculation of what the elastic component of tethers might be.

      The authors present arguments that the tethers are not the DNA bridges observed by others. However, they should try to address this experimentally by treatment of their preparations with DNase to see if the thick and/or thin filaments disappear.

      While we agree that it would be important to identify the components of the tethers, we are concerned that those experiments are beyond the scope of this manuscript. Nevertheless, we appreciate the constructive suggestion for the future research direction.

      Moreover, they should discuss in more detail the possible functions of (DNA) bridges, including the recent model from Bill Sullivan's lab (Warecki et al, Curr Biol, 2023) that they help to retain fragments of broken chromosomes. In addition, the authors should summarize the various proteins that may be associated with the bridges (as enumerated in the Warecki et al 2023 paper).

      As we describe above, we concluded that the bridges Warecki at al described are different from the tethers that we report in our manuscript. Therefore, we do not think it is necessary to expand the discussion on the proteins and functions associated with ultra-fine DNA.

      The authors could add a sentence to the Results or Discussion of whether the thicker tethers might become stretched as anaphase progresses to become the thinner tethers (Fig. 4G).

      We thank the reviewer for this suggestion. We actually mentioned this possibility in the third paragraph of our Discussion on page 7.

      The authors may want to add a few sentences to the Discussion about the "chromosomal bouquet" stage of leptotene of meiosis prophase I where the telomeres of chromosomes seem pulled together and associate with the nuclear envelope --- they could speculate if this might also be due to the tethers that they describe in spermatocytes.

      This is a very interesting possibility. While we would refrain from adding this speculation to our manuscript as it is beyond the scope of the main points, it is certainly an interesting avenue of future research.

      Minor comments:

      A few additional comments are as follows:

      p. 2 last sentence of first paragraph -modify the wording about "no structural evidence that identifies physical connections between separating telomeres", since there is some information from genetic and cell biology light microscopy experiments. Perhaps simply change "structural" to "ultrastructural".

      We have changed the wording as the reviewer recommended

      p. 6, 5th line of second paragraph - change "ribosome DNA" to "ribosomal DNA"

      We have corrected it.

      Figure 1D - add the chromosome to the right of the schematic model (as suggested by Fig. 1B).

      We are sorry for the confusion. In Figure 1D, the left half of the tethers are 3D modelled and shown. We have clarified this point by modifying the legend of Figure 1D

      p. 17 (Methods), line 10 of first paragraph - state if this is light or heavy Halocarbon oil (give details).

      It is a mixture of heavy and light Halocarbon oil. We have clarified it on page 17.

      p. 17 (Methods), line 12 of first paragraph- state the concentration for fibrinogen and for thrombin.

      As we wrote in the original manuscript, the procedures are described in detail in our previous publication (Forer A. & Pickett-Heaps J. (2005) Fibrin clots keep non-adhering living cells in place on glass for perfusion or fixation. Cell Biology International 29: 721–730). Nonetheless, to clarify this point, we have modified the text on page 17.

      p. 17 (Methods), line 4 of second paragraph - is there any data to show that the filaments (tethers) occur if there is no cold shock?

      Yes, we do see similar filamentous structures in the sample without cold shock. For your information, we show one of the electron micrographs below. In our manuscript, we show the data from the samples prepared with cold shock, because it better visualizes the filamentous structures. We now show these electron micrographs in the Supplementary Figure 2.

      Referees cross-commenting<br /> I concur with Reviewers #1 and #2 that this is a fine paper that should be published. My detailed comments submitted with my review are simply meant as revisions to further strengthen this paper.

      We thank the reviewer for supporting the publication of our manuscript.

      Reviewer #3 (Significance):

      Strengths: This is an important conceptual advance and the carefully done ultrastructural imaging provides the foundation for future studies that could delve into the molecular composition and functional significance of the tethers at telomeres of anaphase chromosomes seen here by 3D electron microscopy.

      Limitations: the molecular composition and functional roles are not yet known for the tethers seen here by 3D electron microscopy, but to do so would involve an entire new program of experimentation.

      Advances: there have only been two earlier ultrastructural papers on tethers at telomeres, and the tethers were peripheral to the main focus of those papers. Thus, the current paper extends our ultrastructural information about tethers.

      Audience: this work is of importance for scientists who study the mechanics of chromosome movement on spindles, including regulation to combat aneuploidy. This work will also be important for a broader audience to inform them about transmission of the hereditary information to daughter cells._

      We thank the reviewer for appreciating the significance and the impact of our work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Tethers between telomeres of chromosomes in anaphase were inferred from earlier studies of laser microbeam cutting experiments. The current paper presents images from electron tomography of crane fly spermatocytes that substantiates the earlier inference. The authors deduce that the darker filaments and the lighter filaments that they visualize may be the structural tethers at telomeres.

      Major comments:

      The experiments are carefully done, and the conclusions are appropriately worded to qualify any caveats. This short communication is well-presented, and I have only a few comments. The authors should expand their list of references on bridges to include those listed by Warecki et al (Curr Biol 33:1-17, 2023; refs 15-26, etc). The authors present arguments that the tethers are not the DNA bridges observed by others. However, they should try to address this experimentally by treatment of their preparations with DNase to see if the thick and/or thin filaments disappear. Moreover, they should discuss in more detail the possible functions of (DNA) bridges, including the recent model from Bill Sullivan's lab (Warecki et al, Curr Biol, 2023) that they help to retain fragments of broken chromosomes. In addition, the authors should summarize the various proteins that may be associated with the bridges (as enumerated in the Warecki et al 2023 paper).

      The authors could add a sentence to the Results or Discussion of whether the thicker tethers might become stretched as anaphase progresses to become the thinner tethers (Fig. 4G).

      The authors may want to add a few sentences to the Discussion about the "chromosomal bouquet" stage of leptotene of meiosis prophase I where the telomeres of chromosomes seem pulled together and associate with the nuclear envelope --- they could speculate if this might also be due to the tethers that they describe in spermatocytes.

      Minor comments:

      A few additional comments are as follows:

      p. 2 last sentence of first paragraph -modify the wording about "no structural evidence that identifies physical connections between separating telomeres", since there is some information from genetic and cell biology light microscopy experiments. Perhaps simply change "structural" to "ultrastructural".

      p. 6, 5th line of second paragraph - change "ribosome DNA" to "ribosomal DNA"

      Figure 1D - add the chromosome to the right of the schematic model (as suggested by Fig. 1B).

      p. 17 (Methods), line 10 of first paragraph - state if this is light or heavy Halocarbon oil (give details).

      p. 17 (Methods), line 12 of first paragraph- state the concentration for fibrinogen and for thrombin.

      p. 17 (Methods), line 4 of second paragraph - is there any data to show that the filaments (tethers) occur if there is no cold shock?

      Referees cross-commenting

      I concur with Reviewers #1 and #2 that this is a fine paper that should be published. My detailed comments submitted with my review are simply meant as revisions to further strengthen this paper.

      Significance

      Strengths: This is an important conceptual advance and the carefully done ultrastructural imaging provides the foundation for future studies that could delve into the molecular composition and functional significance of the tethers at telomeres of anaphase chromosomes seen here by 3D electron microscopy.

      Limitations: the molecular composition and functional roles are not yet known for the tethers seen here by 3D electron microscopy, but to do so would involve an entire new program of experimentation.

      Advances: there have only been two earlier ultrastructural papers on tethers at telomeres, and the tethers were peripheral to the main focus of those papers. Thus, the current paper extends our ultrastructural information about tethers.

      Audience: this work is of importance for scientists who study the mechanics of chromosome movement on spindles, including regulation to combat aneuploidy. This work will also be important for a broader audience to inform them about transmission of the hereditary information to daughter cells.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, the authors use light microscopy and electron tomography to study anaphase chromosomes in crane fly spermatocytes. They find that there are two "tether" structures that connect telomeres of sister chromatids. One tether is thicker (denser) and extends between sister chromatids during early but not late anaphase, whereas a second, less-dense tether maintains contact with both sister chromatids in all examined stages of anaphase. The paper makes arguments as to what the tethers could or could not be. Specifically, they are too numerous to be ultrafine DNA bridges seen in various normal or abnormal segregation events and they also do not affect anaphase chromosome motion the same way ultrafine DNA bridges do.

      Major comments:

      The major claim that there are tethers that connect sister chromatids in anaphase is supported by the data. Moreover, the data resolves two types of tethers on the basis of their density. While it is unclear what the composition of the tethers are, the paper makes a convincing case that they cannot be the DNA ultrafine bridges seen in other studies. The discussion has sufficient caveats that most readers will see that more work is needed to identify the composition of the two tethers. In my opinion, no further experiments are needed to support the modest claims of this paper. Therefore, I only have minor comments that may hopefully improve the paper's clarity.

      Minor comments:

      It was argued that the tethers reported here were also seen in other species and cellular contexts, where the imaging work was done with projection EM imaging. Presumably, what is new here is the usage of electron tomography. It would help readers if the authors explained why the electron tomography done here was essential to arrive at key conclusions.

      p.3 mitochondria appeared to be fixed properly ... (e.g., Figs. 1C, 2B) - I don't see any mitochondria in any figures. Perhaps this observation should be noted as "not shown"?

      p.3 The images shown in Figs. 1, 2, 4 - The figures should be called out in the order; in this case, Fig 3 has not been called out yet.

      p.4 we did not find any other connecting structures - Because the sample was processed by traditional EM methods, it's safer to add a caveat that other connecting structures could be missed if they were disrupted by sample prep or if they did not pick up stain as well as the two structures presented in this paper.

      p.7 we expect such structures to be commonly seen in other cell types as well if they are examined carefully - Instead of saying that examinations should be done "carefully", it would be more helpful to specify how other cell types should be examined. This work shows that the bridges can be found if the cells are either sectioned parallel to the spindle axis or if a sufficiently large volume is sampled.

      Please use consistent spelling/hyphenation of ultrafine/ultra-fine and word choice (strands vs. bridges).

      Referees cross-commenting

      I agree with my co-reviewers's comments and have no further suggestions.

      Significance

      This may be the first use of electron tomography to study the structural details of tethers that connect chromosomes in anaphase cells. The data is of sufficient quality to reveal differences in density. Namely, one class of tether appears to be an extension of the chromosome while the other class is composed of thin filaments. This study is novel in that it characterizes a mitosis-associated complex that is poorly studied compared to the microtubule-based spindle apparatus and the kinetochore. Hopefully, the tethers will draw more attention and further characterization by methods like super-resolution microscopy and cryo-electron microscopy. My expertise is in chromatin, mitotic machines, and cryo-electron tomography.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: Forer and Otsuka provide first-rate evidence for tethers fixed in place between separating anaphase chromosomes using electron tomography. The authors traced the anaphase movement of a number of living cells before fixation for examination using electron tomography. The manuscript is clearly written and provides an excellent introduction and discussion of the known literature. The reader will have an excellent background to see the importance of this work.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      No further experiments are needed. The data are very supportive, and extremely clear.<br /> - Are the data and the methods presented in such a way that they can be reproduced? Yes.<br /> - Are the experiments adequately replicated and statistical analysis adequate? Yes.

      Minor comments:

      • Are prior studies referenced appropriately? Yes.
      • Are the text and figures clear and accurate? Absulotely.
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The authors are to congratulated on their major contribution to this study on tethers between separated daughter chromosomes. It is a tpur deforce to go from the living cells to fixing and identifying the same separated chromosomes using electron tomography to see the ultrastructure of the fibers seen fir.

      Referees cross-commenting<br /> Thank you reviewer #2. The manuscript should be published. It is an excellent contribution.

      Significance

      Provide contextual information to readers (editors and researchers) about the novelty of the study, its value for the field and the communities that might be interested.

      This manuscript is the first to use electron tomography toidentify the tethers between separated anaphase chromosomes. Forer and the laetMichael Berns and their co-authors have published a number of papers using phase microscopy and lasers to report on the physical nature and elastic properties of these fibres in the past. Forer and Otsuka have presented first-rate evidence for the reality of these structures using electron tomography. This manuscript should highlighted in the published journal.<br /> The chemical identity of these fibers as the authors state is unclear.

      The following aspects are important:

      • Audience: describe the type of audience ("specialized", "broad", "basic research", "translational/clinical", etc...) that will be interested or influenced by this research; how will this research be used by others; will it be of interest beyond the specific field?

      This exciting contribution will be read by anyone interested in mitosis. It will be of interest to all Cell Biologists because of the careful manner in which the living cells were studied before they were fixed for examination using electron tomography. The readers will be dreaming how they can use this process on their Cell Biology problems.<br /> - Please define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I am a cell Biologist who has made contributions, both in light microscopy and in transmission microscopy on diving cells, both in tissue culture and in situ in aviav and zebrafish embryos.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The manuscript by Rigger and Brenner details the role of vimentin network, in advancing OA pathogenesis by exacerbating premature senescence. The data is well presented and the study of interest, in that there is little known about vimentin in cartilage biology.<br /> The authors used OA derived cartilage explants and chondrocytes cultures, were graded for severity and compared accordingly. Figure 1 shows that markers of senescence are increased with structural damage, which is well established and consistant with the literature. Using a DOX model the authors induce premature senescence and exhibit a disrupted vimentin network. However, upon KD of CDKN2A, a marker of senescence, but did not observe complete reversal of CSV presentation.<br /> Next the authors show in figure 4 and 5, that the reduction or dismemberment of vimentin structures are linked to senescence and may act as contributing factors.<br /> Figures 6 and 7 then go on to show that upon advanced passage chondrocytes lose their vimentin network, and tend to senesce and mineralize.

      Reviewer #1 (Significance):

      Strength:<br /> This is a very novel study showing a link between vimentin and senescence in chondrocytes. The data are in line with other data. The work is clearly written structured and well displayed.

      Author´s response:<br /> We thank reviewer #1 for their interest in our work and their overall positive report.

      Suggestions for improvement:

      While the study is very thorough ought in describing the markers of senescence and vimentin network, it lacks insight regarding mechanism which isn't completely deciphered. Are there links to key transcription factors?

      Author´s response:<br /> The transcriptional regulation of vimentin in human cells is very complex. The VIM promoter region comprises multiple elements, such as a NF-kB- binding site, a PEA3-binding site and two AP1-binding sites (Zhang et al., 2003). Moreover, it was recently demonstrated that redox signaling is involved in vimentin expression at the wound margin after tissue injury in zebra fish (LeBert et al., 2018). However, it has also been reported that IL-1ß stimulation results in reduced gene expression of vimentin via p38-signalling in cartilage degeneration and OA progression (see manuscript REF. 36,37).

      In our study, we observed that enhanced CSV levels are associated with a decreased vimentin gene expression, indicating a lower stability of the mRNA or decreased transcription of VIM in senescent chondrocytes (maybe due to enhanced p38-signalling as mentioned above). Since the transcriptome in senescent cells is radically changed, this question cannot be answered easily.

      In future studies, we will rather try to clarify the underlying mechanism of vimentin externalization. There are still many questions to be answered: is the CSV anchored in the cell membrane (which anchor protein?) and is there still a connection to the intracellular vimentin network? Which proteins are involved in the externalization process: maybe comparable to phosphatidylserine exposure, mediated by flippases, scramblases, and lipid transfer proteins or rather by vesicles?

      Literature mentioned above (not included in manuscript):

      LeBert et al., 2018: Damage-induced reactive oxygen species regulate vimentin and dynamic collagen-based projections to mediate wound repair. DOI: 10.7554/eLife.30703

      Zhang et al., 2003: ZBP-89 represses vimentin gene transcription by interacting with the transcriptional activator, Sp1. DOI: 10.1093/nar/gkg380

      It is also unclear if disruption of the network is more detrimental than KD in promoting senescence.

      Author´s response:<br /> KD of Vimentin led to a gradually decrease of intracellular Vimentin content and consequent stress. The cells were analyzed 7 days after induction of the KD and exhibited a stable senescent phenotype, comparable to Doxorubicin-treated chondrocytes (treated with very low concentrations over several days to produce only mild but ongoing stress). These models might reflect the pathophysiologic situation: We think that cellular stress due to mechanical impact and subsequent oxidative stress/ low-grade inflammation might lead to a gradual disruption or re-organization of the vimentin network, which is accompanied by decreased vimentin gene expression.

      In case of the disruption of the vimentin network by Simvastatin, the stress response was very intense and rapid (24 h), and was only conducted as a proof-of-principle experiment. Despite the upregulation of some senescence-associated markers, we don`t think that permanent Simvastatin treatment would be suitable to obtain a stable senescent phenotype, but rather expect the cells to die due to excessive stress.

      It would have been good to include models OA murine models to understand these processes better, and make a stronger physiological connection with OA of the joint.

      Author´s response:<br /> The CSV antibody is only suitable for human cells and cannot be used for immunohistochemistry. Therefore, all previous reports of CSV are based on human (isolated) cells. At the current time point, it would not be possible to stain CSV in joints of mice after induction of PTOA due to the methodological limitations. We actually tested the CSV-antibody in isolated lapine chondrocytes and found a high percentage of CSV-positive cells, even at low passages. Although stress increased the amount of CSV-positive lapine cells, we did not consider the results as reliable due to the high percentage in un-stressed cells, which might result from unspecific antibody binding.

      Overall, we think that the usage of clinical OA samples is convincing and reflect the pathophysiologic situation in the human OA joint.

      Reviewer #2 (Evidence, reproducibility and clarity):

      The manuscript provides solid evidence for an association between cell surface vimentin (CSV) and chondrocyte senescence. Human cartilage and cultured chondrocytes are used with a wide range of approaches to provoke senescence: natural osteoarthritis, traumatic loading ex vivo, doxorubicin to cells in monolayer, vimentin siRNA, and simvastatin. In contrast, relatively little was done to try and interrupt or reverse the role of CSV in senescence, with CDKN2A siRNA representing one attempted intervention. The manuscript is well written and the data are presented in a logical and clear manner, with a high likelihood of being reproduced in subsequent studies.

      Author´s response:<br /> We thank reviewer #2 for their interest in our work and their mainly positive report.<br /> Regarding their comment on our attempts to reverse CSV on senescent chondrocytes, we would like to add the following: Reversal of cellular senescence is a very ambitious challenge. But in fact, we are currently preparing a manuscript in which we characterize an appropriate senolytic strategy to “rejuvenate” human chondrocytes and plan to use this approach to reduce the amount of senescent and thus CSV-positive cells in future experiments.

      _Major comments:

      In the doxorubicin experiments, the senescent cells show a spread morphology as expected. Given the importance of vimentin in cell spreading (as the authors own data show), the possibility that spread morphology itself (and not senescence) leads to CSV should probably be examined. This could perhaps be achieved by plating with different concentrations of fibronectin or other matrix proteins that produce a spread morphology to a degree that matches the doxo. If the cells remain spread for ~10 days but don't become senescent and don't have CSV, this would provide further support for a direct relationship.

      Author´s response:<br /> We agree that cell spreading is associated with various cellular processes (for example by the YAP signaling pathway). Moreover, we would like to thank the reviewer for the proposed experiment.

      Seeding of cartilage cells on fibronectin coated plates is a commonly used procedure to isolate chondrogenic stem progenitor cells, due to their higher affinity to fibronectin. The cells are usually cultured for several days on the coated plates and do not exhibit a flattened, senescent-like phenotype (as we observe for Doxorubicin-treated cells), but an elongated, fibroblast-/ stem cell-like shape. Our results (Figure 6E) demonstrate that CSPC have no increased CSV levels, despite their elongated (not flat) morphology.

      There are some findings supporting the assumption that CSV leads to enhanced cell adhesion, but not that adhesion or cell spreading promotes CSV: we included experiments with HeLa (low CSV levels) and SaOS-2 (high CSV levels), which demonstrated that high CSV levels are associated with increased plastic adhesion (Figure S5). In line with this, we demonstrated that higher CSV levels on chondrocytes were associated with enhanced fibronectin and vitronectin binding, which might explain increased plastic adhesion. Moreover, Simvastatin stimulation and subsequent cellular stress by Vimentin disruption resulted in enhanced CSV but did not lead to cell spreading (Actin not affected, cells rather elongated, not flattened).

      Minor comments:

      The CSV antibody and staining method appeared to have generated some signal from debris, which makes it challenging to assess the localization of true staining. Presumably the true staining would be present only on the cell surface. While the widefiled view is appreciated, perhaps insets with a higher magnification would clarify.

      Author´s response:<br /> In Figure 2h and Figure 2i, we provide insets of the IF-staining and an exemplary image made by scanning electron microscopy (SEM). CSV is not localized on debris – Figure 2h, actually represents the cell surface. The magnified, Doxo-treated cell is highly senescent and thus flattened. The uneven (rather spotted) staining pattern of CSV and the unusual shape of the cell might suggest that this is debris, not the cell membrane.

      For figure 1k, it is a bit surprising that CDKN2A would peak so early after injury and then drop off. Most studies in other systems show a gradual increase in CDKN2A levels with persistent stress as opposed to a rapid increase in response to acute stress. Could the drop-off be due to preferential death of these cells? The CSV % in 1m was taken from 7d after trauma (plus 7 days in monolayer it appears). Further discussion on the timing of traditional senescence markers as compared to the emergence of CSV would be useful.

      Author´s response:

      We would like to thank the reviewer for this comment. That CDKN1A was induced by mechanical trauma without significant decrease at the later time points was in line with the P53 expression, which we detected via immunohistochemistry (IHC; positive staining of chondrocyte nuclei in cartilage). P53 and P21 are regarded as interconnected senescence markers. Interestingly, P53 is not regulated on gene expression level upon cartilage trauma or Doxorubicine stimulation – but there is a significant increase in P53 nuclear translocation.

      Although such a discrepancy between gene expression and protein activity has not been reported in case of P16 or P21, we plan to investigate the dynamics of these cell cycle regulators and its connection to CSV after cartilage trauma in more detail in future studies.

      We included the following statement in the discussion part:

      “In the current study, we observed that CSV on chondrocytes was reduced by siRNA-mediated silencing of CDKN2A and increased after Doxo treatment or cartilage trauma. While we confirmed that mRNA levels of both CDKN1A and CDKN2A were significantly enhanced upon injury but exhibited different expression levels over time, we determined CSV-positive cells only at one time point after ex vivo cartilage trauma. Future studies might also consider earlier and later time points after cartilage injury to identify a potential time-dependent peak or decline in CSV-positive chondrocytes. In this way a potential association between CSV and the expression levels of CDKN1A and CDKN2A, which are thought to play differential roles in initiating and maintenance of senescence, respectively [50], might be clarified.”

      [50] Stein G, Drullinger L, Soulard A, and Dulić V. Differential Roles for Cyclin-Dependent Kinase Inhibitors p21 and p16 in the Mechanisms of Senescence and Differentiation in Human Fibroblasts. Mol Cell Biol. 1999;19(3): 2109–2117. https://doi.org/10.1128/mcb.19.3.2109.

      There is no CSV staining shown for figures 4 and 5. While the quantification of CSV was done by flow cytometry, it would nice confirmation to see the increase in CSV on the surface of cells with either siRNA for vimentin or the simvastatin.

      Author´s response:

      CSV-IF of simvastatin-treated chondrocytes is provided in Figure 5 (b). We did not perform exemplary staining of CSV after VIM-KD, because the quantification was performed via flow cytometry.

      Reviewer #2 (Significance):

      The strengths of the study include a rigorous design and the establishment of a potential new cell surface marker of chondrocyte senescence. The main limitation is that the conclusions are largely descriptive in nature.

      If CSV is confirmed as a robust marker of senescence, this would be of value to the field. While this marker has been explored previously in other systems, there is value in this manuscript given the wide range of contexts investigated for a cell type in which senescence likely has an important role.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This study presents a sound piece of science in the puzzle about extracellular vimentin in the differentiation/dedifferentiation of human chondrocytes and senescence and osteoarthritis. Eventhough, no mechanism is elucidated, the results clearly point towards a correlation of the amount of extra cellular vimentin and the level of chondrocyte senescence, and therefore signs of osteoarthritic changes in the cultivated chondrocytes. The methods applied are state-of-the art and provide the means to generate meaningful results in this experimental setting. The paper is concise and clearly written, there are only minor remarks.

      Author´s response:

      We thank reviewer #3 for their interest in our work and their overall positive report.

      Minor comments:

      1. The main clue of the paper is extra cellular vinemtin around chondrites in culture, please provide better pictures (1g) to support this. Why is the extra cellular staining seen so broad and not concentrated on the cells surface? The picture chosen imply a huge amount of vimentin to be externilized in disease states. It also indicates that in diseased chondrocytes no intact or semi-intact vimentin network is found intracellular. Please comment.

      Author´s response:

      In Figure 1g, CSV is located on the cell membrane. The pattern of the staining was surprising to us, as well. CSV was not equally distributed on the membrane, but rather represented an inconsistent pattern. Sometimes the staining was located at the filopodia of the cells, sometimes the whole cell was covered by spots. We also observed this on cancer cells, which was in line with other studies using this antibody. It remains unclear whether the distribution of the CSV has any effect. But we assume that the high abundance in filopodia might be connected with cell adhesion and mobility, which was positively associated with CSV.

      Yes, chondrocytes isolated from highly degenerated tissue exhibited higher CSV levels as compared to cells derived from macroscopically intact regions. Although we did not investigate the vimentin network of these cells, our observations in Doxo-treated cells imply, indeed, that intracellular vimentin might be altered in diseased chondrocytes. According to this, Blain et al (Ref. 13) reported that there is a disassembly of the intracellular vimentin network in OA chondrocytes, which can disturb the chondrocyte phenotype and contributes to the development of OA (see discussion).

      1. In the doxo experiment no extracellular vimentin is found? Please explain.

      Author´s response:

      Doxo-treated cells are highly positive for CSV (= extracellular vimentin on membrane). However, the intracellular vimentin is strongly decreased and some cells seem to be negative. We have not clarified the underlying mechanism by now, but it seems that senescence/ disease progression negatively affects the transcription of vimentin and, at the same time, promotes the externalization of the existing intracellular vimentin. Altogether, this might result in a decline in intracellular vimentin.

      1. The SEM picture is showing what. IGH? The red dots are colloidal gold particles? In any case the quantity of stain gathered EM level would not correlate to the huge amount seen in LM staining. Please comment.

      Author´s response:

      For the SEM analysis, a gold particle-coated secondary antibody was used. The positive signal usually appears in white and was subsequently colored via a software. In IF and ICC staining, we had a signal amplification due to the biotin-streptavidin system and the magnification makes, of course, a huge difference.

      1. Why the ICC in Fig. 3c? The siRNA is not detected in the KD? A reduction of Vimentin could be shown via WB.

      Author´s response:

      In Figure 3c, the KD of P16 was confirmed on protein level. In addition to the gene expression analysis, we chose the ICC (IF) to confirm that there is a decline in active (nuclear) CDKN2A. In case of P53, we made the experience that gene expression and the amount of cytoplasmic/ nuclear protein might not be consistent.

      In Figure 4, we confirmed the successful KD of vimentin on mRNA and protein level (flow cytometry plus IF). Of course, WB would also be possible, but we decided to use the methods in which the antibody was well established and we wanted to visualize the disturbance of the intracellular vimentin network upon KD.

      1. Fig. 4c, why are there no remnants of the vimentin networks seen in the chondrocytes? A Knock-down, not a KO is shown.

      Author´s response:

      In fact, most of the intracellular vimentin seems to be gone. However, there are some remnants (condensed fibers/ bundles) of the former vimentin network. We applied the VIM-KD over seven days. Usually, a KD experiment is only conducted for 2-3 days. But since we were not sure how stable the vimentin protein would be, we chose seven days. This long-lasting KD might have resulted in a strong decline of the protein. Moreover, the CSV levels on these cells were very high, indicating that existing vimentin was externalized and additionally decreased the amount of intracellular vimentin.

      1. Please comment of the concentration of simvastatin, why not nmolar?

      Author´s response:

      The concentration of Simvastatin was chosen in accordance with Trogden et al. (Ref. 26), who first described the effects of simvastatin on the vimentin network. A lower concentration might have had the advantage, that the effects were less severe, allowing a longer observation time than 24h. However, as a proof-of-principle model to demonstrate the connection between vimentin network collapse ant CSV expression, the concentration worked quite well.

      1. CSV+ is misleading in Fig. 6g, it's not an over expression.

      Author´s response:

      We would like to thank the reviewer for this comment and removed the “+” to make it less misleading.

      1. The concept of EMT is debatable, at least in kidney fibrosis, and chondrocytes are not epithelial cells. Please add a more critical discussion point.

      Author´s response: The authors agree with the reviewer’s argument that chondrocytes are no epithelial cells ant that the term EMT doesn’t seem to be appropriate. However, this is one leading hypothesis proposed by the working group of Prof. Mayán, who described CX43 and other EMT-markers on/ in senescent chondrocytes (see reference 31; more recently: Cell Death Dis. 2022;13(8):681. doi: 10.1038/s41419-022-05089-w).

      We added the following passage in the discussion part to indicate that this hypothesis is a controversial concept:

      “Nevertheless, the hypothesis that chondrocytes might undergo an EMT-like process remains controversially discussed, because chondrocytes are mesenchymal and not epithelial cells. In a recent review, Gems and Kern propose to consider senescent chondrocytes as activated and hyperfunctional remodeling cells occurring during OA progression [49]. Accordingly, chondrosenescence might represent an unsuccessful attempt of tissue repair. They further suppose that the senescent or activated chondrocytes are associated with a hypertrophic, bone-forming phenotype, following the process of bone development rather than hyaline cartilage formation. In line with this, we observed that CSV was associated with enhanced osteogenic capacities and a decline in chondrogenic properties.”

      [49] Gems and Kern, 2022): Geroscience. 2022;44(5):2461-2469. doi: 10.1007/s11357-022-00652-x.

      Reviewer #3 (Significance):

      The manuscript provides novel insight in the role of intermediary filaments, i.e. vimentin, on chondrocyte senescence and osteoarthritic changes in vitro. It's strength is a thorough elucidation of the connection with a wealth of experimental data, a weakness is the missing elucidation, or first experiments in the direction, of the cell biological mechanism.<br /> It is well suited for a broad audience, because it deals with fundamental cell biological phenomena, definitely it's important for the OA /chondrocyte biology community.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This study presents a sound piece of science in the puzzle about extracellular vimentin in the differentiation/dedifferentiation of human chondrocytes and senescence and osteoarthritis. Eventhough, no mechanism is elucidated, the results clearly point towards a correlation of the amount of extra cellular vimentin and the level of chondrocyte senescence, and therefore signs of osteoarthritic changes in the cultivated chondrocytes. The methods applied are state-of-the art and provide the means to generate meaningful results in this experimental setting. The paper is concise and clearly written, there are only minor remarks.

      Minor comments:

      1. The main clue of the paper is extra cellular vine tin around chondrites in culture, please provide better pictures (1g) to support this.<br /> Why is the extra cellular staining seen so broad and not concentrated on the cells surface? The picture chosen imply a huge amount of vimentin to be externilized in disease states. It also indicates that in diseased chondrocytes no intact or semi-intact vimentin network is found intracellular. Please comment.
      2. In the doxo experiment no extracellular vimentin is found? Please explain.
      3. The SEM picture is showing what. IGH? The red dots are colloidal gold particles? In any case the quantity of stain gathered EM level would not correlate to the huge amount seen in LM staining. Please comment.
      4. Why the ICC in Fig. 3c? The siRNA is not detected in the KD? A redyuction of Vimeo tin could be shown via WB.
      5. Fig. 4c, why are there so remenants of the vimentin networks seen in the chondrocytes? A Knock-down, not a KO is shown.
      6. Please comment of the concentration of simvastatin, why not nmolar?
      7. CSV+ is misleading in Fig. 6g, it's not an over expression.
      8. The concept of EMT is debatable, at least in kidney fibrosis, and chondrocytes are not epithelial cells. Please add a more critical discussion point.

      Significance

      The manuscript provides novel insight in the role of intermediary filaments, i.e. vimentin, on chondrocyte senescence and osteoarthritic changes in vitro. It's strength is a thorough elucidation of the connection with a wealth of experimental data, a weakness is the missing elucidation, or first experiments in the direction, of the cell biological mechanism.

      It is well suited for a broad audience, because it deals with fundamental cell biological phenomena, definitely it's important for the OA /chondrocyte biology community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript provides solid evidence for an association between cell surface vimentin (CSV) and chondrocyte senescence. Human cartilage and cultured chondrocytes are used with a wide range of approaches to provoke senescence: natural osteoarthritis, traumatic loading ex vivo, doxorubicin to cells in monolayer, vimentin siRNA, and simvastatin. In contrast, relatively little was done to try and interrupt or reverse the role of CSV in senescence, with CDKN2A siRNA representing one attempted intervention. The manuscript is well written and the data are presented in a logical and clear manner, with a high likelihood of being reproduced in subsequent studies.

      Major comments:

      In the doxorubicin experiments, the senescent cells show a spread morphology as expected. Given the importance of vimentin in cell spreading (as the authors own data show), the possibility that spread morphology itself (and not senescence) leads to CSV should probably be examined. This could perhaps be achieved by plating with different concentrations of fibronectin or other matrix proteins that produce a spread morphology to a degree that matches the doxo. If the cells remain spread for ~10 days but don't become senescent and don't have CSV, this would provide further support for a direct relationship.

      Minor comments:

      The CSV antibody and staining method appeared to have generated some signal from debris, which makes it challenging to assess the localization of true staining. Presumably the true staining would be present only on the cell surface. While the widefiled view is appreciated, perhaps insets with a higher magnification would clarify.

      For figure 1k, it is a bit surprising that CDKN2A would peak so early after injury and then drop off. Most studies in other systems show a gradual increase in CDKN2A levels with persistent stress as opposed to a rapid increase in response to acute stress. Could the drop-off be due to preferential death of these cells? The CSV % in 1m was taken from 7d after trauma (plus 7 days in monolayer it appears). Further discussion on the timing of traditional senescence markers as compared to the emergence of CSV would be useful.

      There is no CSV staining shown for figures 4 and 5. While the quantification of CSV was done by flow cytometry, it would nice confirmation to see the increase in CSV on the surface of cells with either siRNA for vimentin or the simvastatin.

      Significance

      The strengths of the study include a rigorous design and the establishment of a potential new cell surface marker of chondrocyte senescence. The main limitation is that the conclusions are largely descriptive in nature.

      If CSV is confirmed as a robust marker of senescence, this would be of value to the field. While this marker has been explored previously in other systems, there is value in this manuscript given the wide range of contexts investigated for a cell type in which senescence likely has an important role.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Rigger and Brenner details the role of vimentin network, in advancing OA pathogenesis by exacerbating premature senescence. The data is well presented and the study of interest, in that there is little known about vimentin in cartilage biology.

      The authors used OA derived cartilage explants and chondrocytes cultures, were graded for severity and compared accordingly. Figure 1 shows that markers of senescence are increased with structural damage, which is well established and consistant with the literature. Using a DOX model the authors induce premature senescence and exhibit a disrupted vimentin network. However, upon KD of CDKN2A, a marker of senescence, but did not observe complete reversal of CSV presentation.

      Next the authors show in figure 4 and 5, that the reduction or dismemberment of vimentin structures are linked to senescence and may act as contributing factors.

      Figures 6 and 7 then go on to show that upon advanced passage chondrocytes lose their vimentin network, and tend to senesce and mineralize.

      Significance

      Strength:

      This us a very novel study showing a kink between vimentin and senescence e in chondrocytes. The data are in line with other data. The work is clearly written structured and well displayed.

      Suggestions for improvement:

      While the study is very thorough ought in describing the markers of senescence and vimentin network, it lacks insight regarding mechanism which isn't completely deciphered. Are there links to key transcription factors? It is also unclear if disruption of the network is more detrimental than KD in promoting senescence. It would have been good to include models OA murine models to understand these processes better, and make a stronger physiological connection with OA of the joint.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      The authors do not wish to provide a response at this time as only a revision plan is provided at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Using an elegant set of experiments, the authors probe the function of PfPK2 in asexual blood stage development. Using a rapamycin inducible dimerizable Cre recombinase-based strategy for the conditional knockdown of PK2, coupled with immunofluorescence assay, live cell imaging and phosphoproteomics the authors demonstrate the role of PK2 in invasion of red blood cells and signalling pathways in the parasite. Results show that PK2 is critical for the formation of tight junctions between merozoites and host red blood cells and that PfPK2 regulates cGMP levels and calcium release.

      Major comments:

      1. Is PK2 expressed in other life cycle stages? The authors should discuss what is known about the role of PK2 in other stages of parasite development. Would PK2 inhibitors be expected to have activity against liver or sexual stages of the parasite?
      2. Have any inhibitors of PK2 been reported to date? If so, were these tested against the PK2 cKD parasite line? Do they show similar phenotypic effects to knockdown of PK2? Do these studies give any insight on how vulnerable PK2 is as a target? To what extent does PK2 need to be inhibited/knocked down to lead to parasite death/growth inhibition?
      3. What were the expression yields and purity for the recombinantly expressed wild-type PK2 and mutant/ΔRD and ΔCD deletions? Has the ATP Km for PK2 been reported/determined? How did the deletions affect protein expression and stability? Could these deletions be influencing protein folding/amount of active protein in the preparation rather than regulating catalytic activity? How were protein concentrations normalised for the assay?
      4. Does rapamycin have any effect on wild-type parasites? What controls were included to ensure that observed effects were a result of rapamycin-induced PK2 knockdown, rather than rapamycin acting on other pathways/interfering directly with parasite biology. This should be discussed for readers less familiar with this system. Presumably the cHA-PfPK2-loxP line was used as the control to account for this for at least some of the experiments.

      Minor comments:

      1. "Malaria contributes to almost 400,000 deaths globally (Hay et al, 2004)."It should be made clear that this is the number of deaths reported annually. The references and statistic should be updated based on the most recent WHO Malaria report.
      2. "Several of these kinases are regulated by second messengers like calcium, PIPs, cAMP, cGMP Plasmodium kinases like CDPKs, PKA, PKG etc and are crucial for diverse parasitic functions." This sentence could be reworded for clarity.
      3. "However, the precise function of this protein kinase in parasite life cycle has remained unknown." Add word "the" should be added before "parasite".
      4. "Present studies demonstrate that PfPK2 regulates the invasion of host erythrocyte by P. falciparum." Should be erythrocytes (plural).
      5. "but the sequence similarity is mainly between kinase domains of human CamKIand PfPK2 (~80%)". Change "mainly" to "highest".
      6. "PfPK2 has a long N-terminal extension compared to CamKIand a CaM binding domain was predicted within a postulated regulatory domain present downstream of the kinase domain, which is the case with CamKI (Kato et al, 2008)." Please check this sentence and reword for clarity.
      7. References to "www.plasmodb.org" should be replaced with specific references to the source of the information listed in the database.
      8. Figure 2b. Please explain the relevance of adding/excluding EDTA and the effect of this on PK2 activity.
      9. "Correct integration of the targeting plasmid at the desired locus was confirmed by PCR as amplicons of expected size were obtained parasites from a clone that lacked wild type PfPK2 (Fig. 2B)." Remove the word "parasites".
      10. "Successful expression of GFP-PfPK2 was indicated by Western blotting (Fig. 2D) as well as by IFA (Supp. Fig. S1A), which wasundetectable after rapamycin treatment suggesting efficient depletion of this kinase from the parasite (Fig. 2D and Supp. Fig. S1B)." Write IFA in full the first time it is used in the main text. "was undetectable" should be two words.
      11. "The release of another microneme protein EBA-175, which interacts with glycophorin A on the 11 surface of the erythrocytes (Sim et al, 1994), was also reduced (Fig. 5A, Supp. Fig. S3B). but its localization to the microneme was unaltered (Supp. Fig. S4)." Remove full stop mid-sentence.
      12. "Therefore, we focussed on a possible role of PfPK2 in the release of AMA1, which is involved in late inavsion events like Tight Junction formation and PfPK2 plays a role at this stage of invasion (Fig. 4)." Correct "inavsion" to "invasion".
      13. Figure 5b. What is the difference between upper and lower panels. Please label or include details in figure legend. Figure legends should also include details on the DAPI, DIC and merged images for readers less familiar with microscopy images.
      14. "AMA1 plays a role in the formation of Tight-Junction (TJ) is via its association with rhoptry neck proteins (RONs) that are released by the rhoptries "Just-in-Time" on to erythrocyte surface (Riglar et al, 2011)." Remove "is".
      15. "Interestingly, phosphorylation site S1214 which is regulated by PfPK2 (Fig. 6B, 7B) resides in this insert This phosphositeis conserved only in GC" Add full stop after "insert". Replace "phoshositeis" with "phoshosite is"
      16. Order of Materials and Methods should be consistent with order of the Results section. For example, "Expression of recombinant proteins" and "Kinase assays" sections should come first.
      17. Please ensure all abbreviations, panels and labels in Figures are clearly explained in the relevant figure legends.

      Significance

      This study provides new insight into the role of PK2 in Plasmodium falciparum asexual blood stage development. While this study gives provides information on parasite biology, it does not provide significant advances in terms of understanding the value of PK2 as a drug target. However, this study does lay the groundwork for future studies using PK2 inhibitors as tool compounds for phenotypic validation.

      Audience: Specialised audience interested in parasite biology/signalling pathways.

      Reviewer's expertise: Target-based malaria drug discovery, Plasmodium kinase inhibitors, Target deconvolution for phenotypic hits in Plasmodium, kinase assay development.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2.

      The authors present a biochemical and functional characterisation of the Protein Kinase 2 in Plasmodium falciparum (PfPK2). They state that PfPK2 is an active protein kinase, as shown by phosphorylation of a histone protein (Figure 1). The next 6 figures of the paper describe the functional characterisation of PfPK2. It is a protein essential for the propagation of asexual malaria parasites in the clinical blood stage (Figure 2). The protein is expressed in the schizont stage and deletion of the PfPK2 gene by the conditional DiCre-LoxP system prevents parasite invasion, while not affecting egress (Figure 3 and 4), reduces the protein levels of two invasion molecules (Figure 5), modifies phosphorylation status on a variety of schizont stage proteins that play a role in invasion (Figure 6). PfPK2 deletion also leads to the decrease in the molecule cGMP which plays a role in parasite invasion (Figure 7).

      Major comments:

      • Are the key conclusions convincing?

      The conclusions made in Figure 2, 3, 4 and 7 are adequate but I have added questions in my detailed analysis on these figures that should assist in increasing the confidence on the conclusions.<br /> In my view, conclusions made in Figure 1, 5 and 6 are not convincing as they are missing key quality control data including loading control. Detailed comments on each figure is attached.<br /> - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?<br /> Conclusions drawn from Figures 1,5,6 should be supplemented with the experiments suggested, otherwise the claims should be marked as speculative.<br /> - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.<br /> The experiments suggested in the detailed figure comments are necessary for supporting the claims of the paper.<br /> - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.<br /> To improve the phosphoproteomics data, I would strongly suggest the creation or acquisition of a parasite line in which a related kinase has also been conditionally regulated and analysis of the phosphoproteome. I would suggest PfPKA made in the Wilde et al study. These experiments are required in order to qualify the statements made in the discussion section such as<br /> "A novel pathway was deciphered in which it is an upstream regulator of cGMP-calcium signalling axis, which is critical for invasion"<br /> - Are the data and the methods presented in such a way that they can be reproduced?

      How many times were the experiments mentioned in the following figures repeated?<br /> Figure 1C: PfPK2 phosphorylation assay<br /> Figure 1D: Kinase assay<br /> Figure 2B,C, D<br /> How many cells were counted in Figure 2F?<br /> How many biological replicates were used in Figure 6B?<br /> - Are the experiments adequately replicated and statistical analysis adequate?<br /> For figures not stated above, experiments were adequately replicated and analysis is adequate.

      Minor comments:

      • Specific experimental issues that are easily addressable.
      • Are prior studies referenced appropriately?<br /> Yes.
      • Are the text and figures clear and accurate?<br /> Axes on Figure 5A require editing as the values are unable to be seen clearly. Other figures are clear and well presented. Specific figure related comments are attached to this document.
      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      This data, in its current state, requires supplementation with necessary controls in order to be convincing and also additional experiments as suggested in the detailed figure related information.<br /> - Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.<br /> The data included advances the field in characterising the role of PfPK2 and may influence the thinking of researchers studying Apicomplexan invasion biology. The PfPK2 phosphoproteomics and invasion imaging help knowledge gain in this area.<br /> - Place the work in the context of the existing literature (provide references, where appropriate).<br /> This work advances the data identified by Kato et al (2008) in the characterisation of PfPK2. Kato et al identified the activity of PfPK2, it's dependence on calmodulin and calcium for activity and the localisation of PfPK2 at the merozoite. Kato et al attempted to delete PK2 in the rodent malaria parasite P. berghei, but were unsuccessful, suggesting that the gene is essential for malaria parasites.<br /> Rawat et al recapitulate the kinase activity data and advance Kato et al's findings in creating a conditionally deletable PfPK2 parasite strain. Using this strain, they confirm the essentiality of the gene and characterise it's effect on invasion related processes in the parasite.<br /> - State what audience might be interested in and influenced by the reported findings.<br /> Researchers in Apicomplexan parasite biology may find this data interesting.<br /> - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.<br /> My expertise is in malaria parasite biology including parasite invasion, genetic manipulation and protein export.

      Specific Figure related comments:

      Figure 1:

      C: Where is the data on the purification of PfPK2? Looking at Image 1C, I don't know what has been added to the reaction to phosphorylate Histone IIa.<br /> I would like to see Coomassie or western blots of the His-PfPK2 purification and the presence of Histone IIa so that I can be convinced that only PfPK2 and not a contaminant E-coli protein is responsible for the histone phosphorylation.

      Without knowing this data, I cannot be convinced that PfPK2 is phosphorylating Histone IIa

      Da: Again, without a Coomassie of western blot to show that K140M/WT PfPK2 as well as Histone IIa have been included in the reaction, I can't know that it is the single point mutation that has led to the lack of autophosphorylation of PfPK2 and the lack of Histone IIa phorphorylation.<br /> Please include this data.

      Db:<br /> Once again, there is no Coomassie or western to show what proteins are included in each treatment. Please include there.<br /> Why is there a band of the size of 32P-HIIa in the WT lane lacking HIIa? Is there a breakdown product of PfPK2 that gets autophosphorylated?

      Figure 2:

      A, B: The PCR results are conclusive in showing integration of the plasmid.<br /> C: The PCR and western blots of excision are conclusive and show efficient excision.<br /> D: Ideally, I would like to see the growth of the 1G5DC parasite strain on DMSO and rapamycin as well alongside the PK2 parasite line. 200nM of rapamycin is double the concentration used by Collins et al 2013 when creating the 1G5DC strain.

      Supplementary Figure 2A: Figure is missing a loading control here. Rest of the figure is conclusive and the complementation has worked nicely. I expect that the reason the complemented parasite isn't behaving exactly like the DiCre strain is due to the fact the complementation is done via an episomal plasmid which may segregate differentially in daughter merozoites.<br /> Supp Figure 2C: The Y-axis is hidden behind the legend for the same axis.

      Figure 3

      A + B: In my view the data from Figure 3A and 3B are effectively the one figure and therefore does not require an A/B division. The data shows no change in schizont maturation while invasion rate of rapa treatment is ~50% of DMSO control.<br /> C: Data shows that parasite invasion is impaired when PK2 is not present and that the cytochalasin D treatment was successful.

      Figure 4:

      The data shows reduction in the invasion rate upon Rapa treatment and an increased rate of prolonged echinocytosis. The video stills demonstrate the defects nicely.

      The data suggest that upon Rapa treatment, 40% of all invasion events are successful (Figure 4B). This result does not seem to match with Figure 2E, where rapa treatment leads to a reduction in parasitemia much less that 40% of control.<br /> Can the authors comment on this discrepancy?

      Figure 5 and Supp Figure S3

      A loading control is required for culture supernatant and lysate in all conditions (Supp Figure S3A,B,C). Otherwise we cannot interpret the loss of AMA1/EBA175 shedding as well as the non-effect on RhopH3. Without this data, the interpretations made in Figure 5A cannot be assessed.<br /> B: The figure needs to be better labelled to represent the E64 treatment as well.<br /> The representative image for PK2-LoxP + Rapa + E64 does not show any level of AMA1 on the surface, but the graph to the right shows 30% of parasites with surface AMA1 levels. Can the authors attach a series of images of this condition, showing the variation of AMA1 on the parasite surface?<br /> C: This figure shows nicely that AMA1 levels are reduced during invasion in rapa treated parasites.

      Figure 6:

      A. Do the authors identify PfPK2 peptides also being depleted upon PK2 knockdown?<br /> If so, please update Figure 6B. If not, then it makes the autophosphorylation data from Figure 1D contradictory.

      The data from Figure 6B and 6C identifies many invasion related proteins that are hyper/hypo phosphorylated upon rapamycin addition. It would have been interesting to include another invasion related kinase as a control line in all conditions (For example PfPKAc) that would allow finely dissecting the involvement of PfPK2 vs PfPKA.

      Figure 7A:

      1. What happens to the total protein levels of PDE and GCa upon PfPK2 deletion? Does the level of cGMP reflect the increased stability or loss of these two antagonistic proteins?<br /> I.E Does PK2 depletion lead to lower GCa activity (=less cGMP production) or increased stability and action of PDE (= more degradation of cGMP)?

      Significance

      This work advances the data identified by Kato et al (2008) in the characterisation of PfPK2. Researchers in Apicomplexan parasite biology may find this data interesting.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary: The manuscript by Rawat et al presents study to define the mechanism of action of an essential Plasmodium protein kinase PfPK2 using a conditional knockout line. The data show that PfPK2 has a role in the invasion of RBC as well as in the intraerythrocytic maturation of the parasite. In addition, quantitative phosphoproteomics was done to identify potential cellular targets and pathway analysis. The manuscript reports several important findings. I have following comments and suggestions:

      Major:

      1. Page 3. The data of malaria mortality is from 2004! Current WHO data needs to be included.
      2. Page 4. As per recent analysis (Adderley and Doerig,2022), number of P. falciparum kinases is higher than 85.
      3. Page 5. Homology analysis references are quite old. Is there any update from recent comparative analysis or alpha fold-based structural similarity.
      4. Page 7, line 4. It should be "Conditional Knockout". Needs to be corrected elsewhere as well.
      5. Page 8, line 18. Based on the methods section, parasites were treated at the ring-stage- needs to stated here as well.
      6. Page 12, line 16. It would be helpful to define "significant" in terms of phosphorylation changes. p<0.5?
      7. Page 12, line 23. It is mentioned that PfPK2-depleted samples have an increase in PPM2 phosphorylation. Is there a reference for these specific phosphosites?
      8. It would be bit of a stretch to say that the differentially phosphorylated proteins are substrate/targets (as in Table 1). PfPK2 depletion occurred in ring stage, and the sample was collected at ~44 HPI, leaving ample time for compensatory changes.
      9. Table 1. Criteria used for selecting proteins for the table needs to be defined in the legend. Example: p<0.5 FC>|1.2|. It would be also useful to see the fold change difference in a column for each protein.
      10. Fig 6B. Looks like there are 8 phosphorites with a greater negative fold change than those annotated. Are these all "unknown proteins"? They may not fall into the invasion or signaling groups in Table 1, but they still need to be included.
      11. Fig 6C. In the STRING analysis, PfPK2 is included as a yellow diamond but not referenced in the legend. Also, it is not clear if PfPK2 was differentially phosphorylated or was placed in the STRING network manually. "CPPUF is a nice abbreviation for "conserved Plasmodium protein, unknown function" but it would be useful to include the PlasmoDb number.
      12. Fig 6C. What is the significance of the line thickness or the circle size? Needs to be specified in the legend.
      13. It is not clear how many replicates of small collection was done for phosphoproteomic analysis.

      Minor:

      1. Page 5, line 3. It would useful to include PlasmoDb ID of PfPK2.
      2. Page 5, line 17: should read "....it has long N and a C-terminal...."
      3. Page 7, line 21: should be " was undetectable"...
      4. Page 9, line 17. It would be better to rephrase as.. "To better understand defects in the invasion following PfPK2 depletion live cell imaging was performed".
      5. Page 11, line 5. should be " late invasion".
      6. Page 30. Expression of recombinant protein and kinases assay should be moved to the beginning of the Methods as it is mentioned first in the results.
      7. Page 31, line 10. Concentration of imidazole needs to be mentioned.

      Significance

      The manuscript defines the role of PFPK2 in parasite invasion as well as in asexual maturation. Although unambiguous PfPK2 targets have not been defined, but the phosphoroteomic analysis alludes to the role of PfPK2 in cellular processes, particularly in invasion.

      The manuscript will be of general interest to readership of the journals listed.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank all Reviewers for their detailed and helpful comments and suggestions for this manuscript. The overall goal of this study is to interrogate which transcriptional and metabolic pathways lose oscillation when MYC is amplified or activated. We have now added additional replicates to our RNA-sequencing and nutrient transporter expression analyses, and have demonstrated that MYC disrupts oscillation of metabolic and biosynthetic gene expression, nutrient transporter oscillation, and metabolite pathways. On the suggestion of Reviewer #3, we have also strengthened this work by directly contrasting the transcriptional oscillations we observe in cancer cells with well-established primary cell models of transcriptional oscillation, MEFs and macrophages. We have carefully responded to each comment and have noted which Figures or lines in the manuscript address each comment. We hope that our revised manuscript is now suitable for publication.

      Reviewer #1:

      This is an interesting paper from the Altman, Weljie and Dang labs that furthers their previous publications looking at the effect of "oncogenic" Myc levels have on circadian gene expression. They provide compelling data that in neuroblastoma and osteosarcoma models of Myc amplification circadian gene expression and metabolite fluctuation are lost. The data is convincing and comprehensive and should be of broad general interest. There are a few major issues that need to be addressed.

      (Significance (Required)):

      The Myc oncogene is dysregulated in many cancer types, so there is considerable interest in the mechanisms that underpin its function as a transforming oncogene. This group of authors has previously described that Myc can disrupt circadian gene expression, which is linked to several types of cancer. This paper extends the authors previously findings by performing careful timed RNA-seq analysis and metabolomic analysis. The work is well done, and the findings justify the conclusions. This paper should be of interest to those who study Myc, circadian gene expression and cancer. Two key limitations are noted: 1) the cells that are analyzed are grown entirely in vitro in serum and nutrient replete media, 2) there is no direct evidence that the blockage of circadian gene expression by Myc is important for Myc-dependent transformation, although it seems likely. These two limitations do not detract from the significance of the manuscript.

      My expertise is in cancer-centric gene regulatory mechanisms

      We thank Reviewer #1 for finding the data compelling and comprehensive, and for suggesting key experiments and revisions. We also thank the Reviewer for noting that this will be of interest to those who study MYC and cancer (noted in the Significance section below). We have addressed all suggestions below.

      1) Is there a control (done here or in prior literature) showing that simply adding tam to cells doesn't change circadian gene expression?

      We previously performed this experiment in our 2015 Cell Metabolism paper (Altman and Hsieh, et al). In particular, by using either cells that did not express MYC-ER, or a control mutant MYC-ER that lacks transcriptional activity, we found the 4OHT did not blunt clock function in U2OS cells. This is now discussed in Lines 120-121 of the revised manuscript.

      2) All of the experiments rely on Myc:ER fusions. The authors should mine other datasets to determine of systems that rely on conditional expression of wt Myc drive a similar loss of circadian gene expression.

      This is an important point: the work in this manuscript focuses on cancers with endogenous levels of MYC where MYC-ER fusions drive changes in gene expression. In our previous works, we tested systems such as the Reviewer suggested, where endogenous MYC was conditionally overexpressed (Altman and Hsieh et al. Cell Metabolism 2015, Altman et al Nature Communications 2017). In these systems (liver cancer and Burkitt’s Lymphoma), elevated MYC also leads to dysfunction of circadian gene expression and oscillation. We reference this work on lines 123-126. In addition, two papers from the laboratory of Michael Brunner (Shostak et al Nature Communications 2016, Shostak et al Nature Communications 2017) used a tetracycline-inducible model of wild-type MYC in U2OS instead of MYC-ER. Their results largely mirrored our own, where BMAL1 was suppressed and molecular circadian oscillation was disrupted. These two papers are cited extensively throughout manuscript.

      3) The authors argue Myc:ER fusions mimic Myc amplification which is common in cancer and they discuss several previous papers that at least correlate the presence of amplified Myc with loss of circadian gene expression. It should be possible to test whether reduction of Myc by knockdown or using a Myc inhibitor restores circadian gene expression in a cell line known to be Myc amplified. This need not be an exhaustive RNA-seq experiment, but looking at a handful of circadian genes using a qPCR approach would be informative.

      Thank you for the suggestion. Our previous work in Burkitt’s Lymphoma and liver cancer cell lines, as well as analysis of data from primary liver cancer, drew on MYC Tet-OFF systems, where amplified MYC is suppressed by addition of tetracycline or doxycycline. In these experiments, the ‘control’ is amplified MYC, and the ‘experimental’ condition is tet- or dox-treated cells where MYC has been suppressed. In all cases, suppressing MYC led to predictable changes in circadian gene expression (suppression of REV-ERBα, upregulation of BMAL1, etc).

      To support these findings, upon the suggestion of the Reviewer, we identified a recent study where the PC3 prostate cancer cell line, known to harbor highly elevated endogenous MYC levels, was treated with the new generation MYC inhibitor MYCi361. We downloaded their RNA-sequencing data and performed differential expression analysis, and showed that several circadian genes were significantly altered upon treatment with the inhibitor, including BMAL1, PER2, and the REV-ERB genes. This is now included as Supplemental Figure S1C, and the text for this is on lines 126-136.

      4) The data pretty clearly shows that metabolites lose their periodicity MycON cells. Can these be linked back to loss of circadian expression of specific genes in those metabolic pathways? If so, are genes direct Myc transcriptional targets in other studies?

      Thank you for bringing this up. We now have computational evidence from multiple replicate circadian time-series experiments that MYC disrupts oscillation of the LAT1 amino acid transporter across multiple cell lines (Figure 6), and that MYC upregulates LAT1 and 4F2hc protein and mRNA (Supplemental Figure 6 and not shown). Indeed, LAT1 is known to be a direct MYC target, which is now mentioned on lines 493-499 of the discussion.

      In U2OS, where we performed our metabolomics studies, while LAT1 oscillates at the protein level, we saw less evidence of metabolic program oscillation at the transcriptional level, which we acknowledge on lines 274-277. This may be due to the fact that not all metabolic and protein oscillations arise solely from transcriptional oscillation, which we mention on lines 48-49 of the introduction, and revisit in lines 430-432 of the discussion. Nonetheless, our findings that MYC disrupts oscillation of nutrient transporters and metabolites fits in with the overall theme of this manuscript that MYC disrupts circadian control of metabolism.

      5) The finding Myc activation "releases" metabolic and biosynthetic pathways from circadian control implies that this must have something to do with Myc-dependent transformation. A priori, it is not obvious why this should be the case. Do metabolic precursors and biosynthetic molecules become periodically limiting when their levels oscillate in MycOFF cells? In MycOn cells do the non-oscillating metabolites, provide a growth advantage? This is a difficult question to address and one that is certainly beyond the scope of this manuscript. The authors should address this issue in their discussion.

      Thank you for the suggestion to discuss this idea in more detail. While we propose the hypothesis that circadian metabolic oscillations are limiting for tumor cells, testing this directly is indeed outside the scope of this current study. We address this issue on lines 506-518 of the discussion, where we contrast our hypothesis with the idea that alternate metabolic oscillations (those tied to cell cycle or faster-than-circadian) may arise in the absence of circadian control.

      Minor points<br /> 1) phrase "for the first time" is used multiple times in the discussion. Gets a bit redundant ( and loses impact). Consider revising.

      Thank you for this suggestion. We have revised our Discussion section accordingly.

      2) In figure 4, the periodicity in expression of the proteins in figure 4 is fairly clear, but it might be beneficial to bracket (or denote in some other way) the circadian fluctuation in expression.

      In response to other Reviewer comments, we performed multiple replicates of the nutrient transporter protein expression time-series, quantified protein, and calculated circadian oscillations. This is now presented in the new Figure 6.

      Referees cross-commenting

      I had not considered the important points raised by reviewer 3. The authors definitely need to address the concern over replicates and whether the gene expression of truly rhythmic. If not, this seems like a fatal flaw in the MS.

      We have carefully addressed both of these concerns by adding replicates to our RNA-sequencing and protein expression assays, and contrasting our gene expression oscillation findings with those in established primary cell models (Supplemental Figure 2A). Please see the Response to Reviewer 3 for more detail.

      Reviewer #2:

      (Significance (Required)):<br /> Using the circadian synchronized cancer cell lines, DeRollo and colleagues characterized the MYC oncoprotein role in metabolic role through the circadian clock disruption. Authors found that forced activation of MYC disrupts up to 85% of genes oscillation particularly nutrient transporter glycosylation and amino acid metabolism. This work addresses important questions in the circadian clock and cancer field through the oncogene activation, and the manuscript is well-written. However, there are a few concerns that should be addressed to improve the manuscript quality.

      We thank Reviewer #2 for their detailed and astute suggestions on demonstrating the degree of MYC overexpression and the synchronization / entrainment of our cells, as well as the suggestion to add and quantify multiple replicates. We also appreciate the Reviewer’s comments in the Significance section that the manuscript addresses important questions in the field and is well-written. We have individually addressed each comment below and made several revisions and additions in response to them.

      Major comments:<br /> 1) The 3 cell lines used in this paper, what is the expression levels of MYC protein under -OFF and -ON conditions? It is important to demonstrate this information through the western blot data. Since the 4-hydroxy tamoxifen was used to activate MYC, what is the vehicle/control for MYC OFF cells? Otherwise, it will be difficult to assess with everything observed on this manuscript under MYC-ON could be due to 4-hydroxy tamoxifen treatment.

      Thank you for this important consideration. We have clarified in the manuscript that the MYC-ER system is constitutively expressed, and when cells are treated with 4OHT, MYC-ER is activated and translocates to the nucleus, while MYC-OFF control cells are treated with ethanol as a vehicle (Lines 100-105). In response to the suggestion to quantify the degree of overexpression, we have also added new experiments to quantify the degree of MYC-ER overexpression, in Supplemental Figure 1A and 1B. Finally, we previously showed in our 2015 Cell Metabolism paper (Altman and Hsieh, et al) that 4OHT does not affect the molecular clock on its own. In particular, by using either cells that did not express MYC-ER, or a control mutant MYC-ER that lacks transcriptional activity, we found the 4OHT did not blunt clock function in U2OS cells. This is discussed in Lines 120-121 of the revised manuscript.

      2) In Fig. 1A, it is crucial to demonstrate that the circadian synchronization protocol is working by performing statistical analysis with at least 3 biological replicates. This should be performed by either cosinor analysis and/or JTK cycle analysis of all the canonical clock genes including BMAL1 (ARNTL), CLOCK, CRY1, CRY2, PER1, PER2, DBP and NR1D1. This reviewer would like to see both transcripts (qPCR) and protein levels (western blot data) of those clock genes expression pattern. Without these results, rest of the data will be hard to conclude the connection with the circadian/molecular clock.

      Thank you for bringing of the need for quantitation of circadian transcripts. In the new Figure 1, we have quantified and performed ECHO analysis (which is a parametric method of oscillation analysis used through the manuscript) on several circadian transcripts. We chose to specifically show the same n=2 input RNA that we used for RNA-seq for each cell lines. Additionally, our updated RNA-sequencing and analysis of oscillating genes in MYC-OFF shows that CRY2, PER2, PER3 oscillate in all three cell lines (Figure 2B). These findings agree with extensive literature by us and others that the molecular circadian clock is functional after dexamethasone entrainment in U2OS, SHEP, and SKNAS: Baggs et al Plos Biology 2009, Zhang et al Cell 2009, Hughes et al Plos Genetics 2009, Altman and Hsieh et al. Cell Metabolism 2015, Altman et al Nature Communications 2017, Shostak et al Nature Communications 2016, Shostak et al Nature Communications 2017.

      3) In Fig. 5A-C: It is important to repeat this western blot experiment at least 3 times and have the quantitation to demonstrate the circadian rhythmicity significance by probing to majority of the canonical clock proteins as discussed above.

      Thank you for this suggestion. The western blot experiments have now been repeated and have n=3-4 replicates, have been quantified, and oscillation assessed. We also quantified and plotted the oscillation of BMAL1 and REV-ERBα as comparisons. This is in the new Figure 6.

      Minor Comments:

      1) For all of the western blot images, authors need to show the molecular weight of the corresponding protein bands detected on the blots.

      All raw western blot images, including molecular weights, will either be published as Supplemental Material or on FigShare (with a persistent doi), depending on the preference of the Journal. A private link is available for Reviewers with all the relevant background data, including westerns with molecular weights.

      2) As a proof of concept, it would be interesting if knockout/knockout the MYC gene in these cell lines and look for the expression pattern of canonical clock gene expression levels to see whether it will help enhancing circadian rhythmicity. If authors cannot perform this experiment, it is important to address under discussion.

      Thank you for this idea. Reviewer 1 had a similar idea / comment. We addressed it through previous studies, and a new analysis of MYC-high PC3 prostate cancer cells treated with the MYC inhibitor MYCi361 (Supplemental Figure 1C). See response to Review 1 Major Point 3 for more details.

      3) It is not discussed, how many biological replicates were used for RNA-seq analysis?

      In the revised manuscript, each RNA-sequencing experiment is performed from n=2 biological replicates and n=13-14 time points per replicate.

      Referees cross-commenting

      I completely agree with reviewer #3. In fact, I have raised the similar points in my major comments #2 and # 3.

      As discussed below, we have responded to Reviewer comments with more replicates and new analyses of oscillating genes and proteins.

      Reviewer #3:

      Review of "MYC Disrupts Transcriptional and Metabolic Circadian Oscillations in Cancer and Promotes Enhanced Biosynthesis"<br /> DeRollo et al. attempt to find commonalities in how MYC affects transcriptional and metabolic programming by examining MYC-switchable U2OS, SHEP, and SKNAS cell lines. They claim that oncogenic MYC both represses transcriptional oscillation of many genes and supports rhythmic expression of other genes. They use RNA-Seq and UPLC-MS/MS with the appropriate bioinformatics analyses. In some cases, they employ qPCR and immunoblotting. In the three different cell lines, they observed that MYC either statically upregulates or downregulates oscillatory genes.

      (Significance (Required)):

      Myc is known to interfere with the rhythmic expression of core circadian clock genes. Myc seems to do this in order to rewire control-clock expression programs in favor of cell growth and proliferation.<br /> DeRollo et al intended to investigate which clock-controlled expression programs are deregulated by MYC. For this purpose, they investigated three cell lines. Unfortunately, it seems that clock-controlled genes are not really express with a (sufficiently) substantial amplitude in these cultured cells. It is therefore not possible to distinguish by RNA-seq the truly rhythmic genes from false positives. Therefore, it is not possible to reliably determine which metabolic rhythmic programs are deregulated by MYC.

      We thank Reviewer #3 for their important observations and suggestions with regards to the number of replicates employed, and the confidence in the oscillations we observe. We have responded to these comments in a detailed fashion by adding replicates to our RNA-sequencing and immunoblot, and by comparing the oscillations in our cell lines to established primary models of circadian oscillation to determine the amplitudes of oscillations we observed.

      As it stands, the work has technical and conceptual weaknesses.<br /> First, it is not clear how many replications the authors performed for RNA seq. For U2OS, this is explicitly stated. Replicate 1: four-hours sampling, ribosomal RNA depleted; replicate 2: two-hours sampling, polyA+ RNA. There do not appear to be replicates for the other two cell lines?

      Thank you for this observation. In the initial manuscript submission, we used n=2 replicates for U2OS and SHEP, and n=1 replicate for SKNAS (apologies for this not being clear). In response to your comment and others, we added a new biological replicate for RNA-sequencing for SKNAS, so each cell line now has n=2 replicates. This allowed us to more confidently identify oscillating genes across biological replicates for each cell line.

      U2OS cells are widely used in circadian research. These cells rhythmically express clock genes with a decent amplitude, which the authors confirmed by qPCR. However, the clock-controlled genes are generally expressed at a very low amplitude (in the range of standard deviation of RNA-Seq). It is therefore extremely difficult to identify and distinguish them from nonrhythmic genes by RNA seq. The fact that the authors find approximately the same number of rhythmic genes in MYC-OFF (Fig. 1B) as in MYC-ON (Supplemental Fig. 1B) and no overlap between the three cell lines tells that most of the genes shown in the heat maps are not truly rhythmic. Rather, they appear to represent those genes that are called rhythmic because a cosine wave happens to fit the data (better than a line). I suspect that true replicates (which are missing) would also show little or no overlap because most genes in these cells are probably not really rhythmic with any significant amplitude (and why would they be under constant conditions in a Petri dish?).

      Thus, if there are no rhythmic clock-controlled genes that can be clearly distinguished from non-rhythmic genes, there is no way to tell which rhythms are attenuated by MYC (apart from the core clock genes shown in Fig. 1A), or to identify potentially rhythmic pathways.

      Thank you for bringing up this important point about the confidence in the rhythmicity of genes examined. As is correctly noted and as brought up in response to Reviewer #2, cell lines such as U2OS, SKNAS, and SHEP have extensively been used for molecular clock studies (see Reviewer 2 Major Point 2). We also note that each cell line is now n=2 biological replicates, so all oscillating transcripts represent genes that were oscillating in both biological replicates. We take seriously the concern that the oscillating transcripts are not truly rhythmic, or of insufficient amplitude to be biologically significant. We employed a published algorithm, ECHO (De Los Santos et al, Bioinformatics 2019), which uses a conservative parametric approach to determine oscillations from sequencing data, and filters out genes that are too lowly expressed for oscillations to be determined, and those where a sharp increase or decrease in gene expression would preclude determining oscillations.

      To directly test the strength of our observed oscillations in MYC-OFF conditions, we downloaded and analyzed time-series RNA-sequencing data from two entrained primary cell models that are known to have robust transcriptional circadian oscillations: MEFs and macrophages. These two datasets were analyzed in the same fashion as our cell lines, using the ECHO parametric algorithm, and we plotted the median amplitude of oscillation for all transcripts that had significant circadian oscillation (Supplemental Figure 2A). We found that the median amplitude of oscillation was within the same range as those from primary cells: SHEP cells showed nearly identical median amplitude to MEFs, while U2OS and SKNAS had slightly higher median amplitudes than macrophages. These suggest that the oscillating transcripts we observe and measure represent true oscillations above background noise that are similar to those observed in primary cell models where transcriptomic oscillation has been extensively studied.

      With regards to oscillations observed in MYC-ON: we would first like to note that in some cases, there are fewer oscillating genes, especially in SKNAS, where there are less than half the number of oscillating genes in MYC-ON as compared to MYC-OFF. Nonetheless, the observation of emergent oscillations in MYC-ON cells is interesting, and we devote a paragraph to this in the Discussion (lines 457-482). We note that major perturbations to the molecular clock, such as DKO of REV-ERBα and β, result in emergent oscillations in the liver (Guan D et al, Science, 2020), and speculate that oscillations observed in MYC-ON may be from residual activity of CLOCK and BMAL1, which may occupy new sites when MYC is overexpressed.

      If there are no strong rhythmic clock-controlled genes, there are probably no strong rhythms in clock-controlled metabolism. Indeed, the authors found no overlap in rhythmic metabolites.

      Because metabolomics is far less sensitive than RNA-sequencing, we performed KEGG enrichment analysis on oscillating metabolites from both our replicates. We found that in MYC-OFF, the enriched metabolic pathways were identical in timing and identity of enriched pathways (Figure 8, Supplementary Figure 9), while these were quite divergent in MYC-ON. Thus, we concluded that common oscillating metabolic pathways in the absence of MYC are altered or disrupted by MYC activation.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Review of "MYC Disrupts Transcriptional and Metabolic Circadian Oscillations in Cancer and Promotes Enhanced Biosynthesis"<br /> DeRollo et al. attempt to find commonalities in how MYC affects transcriptional and metabolic programming by examining MYC-switchable U2OS, SHEP, and SKNAS cell lines. They claim that oncogenic MYC both represses transcriptional oscillation of many genes and supports rhythmic expression of other genes. They use RNA-Seq and UPLC-MS/MS with the appropriate bioinformatics analyses. In some cases, they employ qPCR and immunoblotting. In the three different cell lines, they observed that MYC either statically upregulates or downregulates oscillatory genes.<br /> As it stands, the work has technical and conceptual weaknesses.<br /> First, it is not clear how many replications the authors performed for RNA seq. For U2OS, this is explicitly stated. Replicate 1: four-hours sampling, ribosomal RNA depleted; replicate 2: two-hours sampling, polyA+ RNA. There do not appear to be replicates for the other two cell lines?<br /> U2OS cells are widely used in circadian research. These cells rhythmically express clock genes with a decent amplitude, which the authors confirmed by qPCR. However, the clock-controlled genes are generally expressed at a very low amplitude (in the range of standard deviation of RNA-Seq). It is therefore extremely difficult to identify and distinguish them from nonrhythmic genes by RNA seq. The fact that the authors find approximately the same number of rhythmic genes in MYC-OFF (Fig. 1B) as in MYC-ON (Supplemental Fig. 1B) and no overlap between the three cell lines tells that most of the genes shown in the heat maps are not truly rhythmic. Rather, they appear to represent those genes that are called rhythmic because a cosine wave happens to fit the data (better than a line). I suspect that true replicates (which are missing) would also show little or no overlap because most genes in these cells are probably not really rhythmic with any significant amplitude (and why would they be under constant conditions in a Petri dish?).<br /> Thus, if there are no rhythmic clock-controlled genes that can be clearly distinguished from non-rhythmic genes, there is no way to tell which rhythms are attenuated by MYC (apart from the core clock genes shown in Fig. 1A), or to identify potentially rhythmic pathways.<br /> If there are no strong rhythmic clock-controlled genes, there are probably no strong rhythms in clock-controlled metabolism. Indeed, the authors found no overlap in rhythmic metabolites.

      Significance

      Myc is known to interfere with the rhythmic expression of core circadian clock genes. Myc seems to do this in order to rewire control-clock expression programs in favor of cell growth and proliferation.<br /> DeRollo et al intended to investigate which clock-controlled expression programs are deregulated by MYC. For this purpose, they investigated three cell lines. Unfortunately, it seems that clock-controlled genes are not really express with a (sufficiently) substantial amplitude in these cultured cells. It is therefore not possible to distinguish by RNA-seq the truly rhythmic genes from false positives. Therefore, it is not possible to reliably determine which metabolic rhythmic programs are deregulated by MYC.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Major comments:

      1. The 3 cell lines used in this paper, what is the expression levels of MYC protein under -OFF and -ON conditions? It is important to demonstrate this information through the western blot data. Since the 4-hydroxy tamoxifen was used to activate MYC, what is the vehicle/control for MYC OFF cells? Otherwise, it will be difficult to assess with everything observed on this manuscript under MYC-ON could be due to 4-hydroxy tamoxifen treatment.
      2. In Fig. 1A, it is crucial to demonstrate that the circadian synchronization protocol is working by performing statistical analysis with at least 3 biological replicates. This should be performed by either cosinor analysis and/or JTK cycle analysis of all the canonical clock genes including BMAL1 (ARNTL), CLOCK, CRY1, CRY2, PER1, PER2, DBP and NR1D1. This reviewer would like to see both transcripts (qPCR) and protein levels (western blot data) of those clock genes expression pattern. Without these results, rest of the data will be hard to conclude the connection with the circadian/molecular clock.
      3. In Fig. 5A-C: It is important to repeat this western blot experiment at least 3 times and have the quantitation to demonstrate the circadian rhythmicity significance by probing to majority of the canonical clock proteins as discussed above.

      Minor Comments:

      1. For all of the western blot images, authors need to show the molecular weight of the corresponding protein bands detected on the blots.
      2. As a proof of concept, it would be interesting if knockout/knockout the MYC gene in these cell lines and look for the expression pattern of canonical clock gene expression levels to see whether it will help enhancing circadian rhythmicity. If authors cannot perform this experiment, it is important to address under discussion.
      3. It is not discussed, how many biological replicates were used for RNA-seq analysis?

      Referees cross-commenting

      I completely agree with reviewer #3. In fact, I have raised the similar points in my major comments #2 and # 3.

      Significance

      Using the circadian synchronized cancer cell lines, DeRollo and colleagues characterized the MYC oncoprotein role in metabolic role through the circadian clock disruption. Authors found that forced activation of MYC disrupts up to 85% of genes oscillation particularly nutrient transporter glycosylation and amino acid metabolism. This work addresses important questions in the circadian clock and cancer field through the oncogene activation, and the manuscript is well-written. However, there are a few concerns that should be addressed to improve the manuscript quality.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is an interesting paper from the Altman, Weljie and Dang labs that furthers their previous publications looking at the effect of "oncogenic" Myc levels have on circadian gene expression. They provide compelling data that in neuroblastoma and osteosarcoma models of Myc amplification circadian gene expression and metabolite fluctuation are lost. The data is convincing and comprehensive and should be of broad general interest. There are a few major issues that need to be addressed.

      1. Is there a control (done here or in prior literature) showing that simply adding tam to cells doesn't change circadian gene expression?
      2. All of the experiments rely on Myc:ER fusions. The authors should mine other datasets to determine of systems that rely on conditional expression of wt Myc drive a similar loss of circadian gene expression.
      3. The authors argue Myc:ER fusions mimic Myc amplification which is common in cancer and they discuss several previous papers that at least correlate the presence of amplified Myc with loss of circadian gene expression. It should be possible to test whether reduction of Myc by knockdown or using a Myc inhibitor restores circadian gene expression in a cell line known to be Myc amplified. This need not be an exhaustive RNA-seq experiment, but looking at a handful of circadian genes using a qPCR approach would be informative.
      4. The data pretty clearly shows that metabolites lose their periodicity MycON cells. Can these be linked back to loss of circadian expression of specific genes in those metabolic pathways? If so, are genes direct Myc transcriptional targets in other studies?
      5. The finding Myc activation "releases" metabolic and biosynthetic pathways from circadian control implies that this must have something to do with Myc-dependent transformation. A priori, it is not obvious why this should be the case. Do metabolic precursors and biosynthetic molecules become periodically limiting when their levels oscillate in MycOFF cells? In MycOn cells do the non-oscillating metabolites, provide a growth advantage? This is a difficult question to address and one that is certainly beyond the scope of this manuscript. The authors should address this issue in their discussion

      Minor points

      1. phrase "for the first time" is used multiple times in the discussion. Gets a bit redundant ( and loses impact). Consider revising.
      2. In figure 4, the periodicity in expression of the proteins in figure 4 is fairly clear, but it might be beneficial to bracket (or denote in some other way) the circadian fluctuation in expression.

      Referees cross-commenting

      I had not considered the important points raised by reviewer 3. The authors definitely need to address the concern over replicates and whether the gene expression of truly rhythmic. If not, this seems like a fatal flaw in the MS.

      Significance

      The Myc oncogene is dysregulated in many cancer types, so there is considerable interest in the mechanisms that underpin its function as a transforming oncogene. This group of authors has previously described that Myc can disrupt circadian gene expression, which is linked to several types of cancer. This paper extends the authors previously findings by performing careful timed RNA-seq analysis and metabolomic analysis. The work is well done, and the findings justify the conclusions. This paper should be of interest to those who study Myc, circadian gene expression and cancer. Two key limitations are noted: 1) the cells that are analyzed are grown entirely in vitro in serum and nutrient replete media, 2) there is no direct evidence that the blockage of circadian gene expression by Myc is important for Myc-dependent transformation, although it seems likely. These two limitations do not detract from the significance of the manuscript.

      My expertise is in cancer-centric gene regulatory mechanisms

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank the Review Commons editor and three reviewers for their enthusiastic response, including their constructive suggestions and appreciation of the high impact and originality of our study. We have completed the revisions and new analyses suggested by the reviewers, and we thank the reviewers for their suggestions to increase the impact and interest in this work and for guiding us towards this much improved manuscript.

      In this response letter, we present the response to each reviewer comment and associated revisions made to the text and figures as bullet points below the reviewers' text (black text).

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Yang et al. took advantage of recently published long-read-based genomic sequences of nearly homozygous genomes from complete hydatidiform moles to retrieve allelic sequences of LINE-1, the currently only active and autonomous retrotransposon of the human genome, and produced the repertoire of intact LINE-1 in a genome. The authors performed cell-culture-based retrotransposition assays measurements and in vivo fitness estimations of all identified intact LINE-1 to infer evolutionary dynamics. In this article, the authors further validate the major contribution of polymorphic LINE-1 to the de novo retrotransposition events in the human genome. They also described, at unprecedented resolution, allelic variations among LINE-1 loci and the potential impact of these variations to the interpretation of mutagenic potential of each LINE-1 locus.

      Major comments:

      1 - The key conclusions of the article are mostly convincing. However, it would be a substantial improvement to consolidate the data of the article with information about known active LINE-1s in germ cells or in cancer by using data from recent publications of the Devine and Tubio labs (for example PMID: 34772701, 32024998, 25082706). Across the article, no mention is made of the transductions generated during LINE-1 de novo retrotransposition, which is instrumental to monitor in vivo activity of a group of LINE-1 active copies. It would be of particular interest to make a link between in vitro activity from this study with LINE-1 classification based on their observed activity in cancer (PMID: 32024998, Figure 3b).

      • We thank this and the other reviewers for this suggestion. We agree that a more explicit comparison to the often-reported counts of 3’ transductions would be a valuable addition to our analyses. We have added the 3’ transduction counts from PMID:34772701, PMID:32024998 and PMID:25082706 to Table S2 (column Y, Z and AA), and made a comparison between these data and our Hamming-distance-based in vivo activity, as the new Figure S5. We found correlations between the two measurements in a significant proportion of LINE-1s, but some interesting exceptions exist which likely reflects the fact that most catalogued 3’ transductions come from cancer genomes, and cancer and germline cells represent distinct cellular environments in which distinct sets of LINE-1s are able to replicate (and leave 3’ transductions). In addition to the new figure (Figure S5), we have added a discussion paragraph focused on this interesting comparison.

      2 - The use of CHM1 BAC library Sanger sequencing validation and comparison with CHM13 and hg38 sequences is instrumental to support the building of LINE-1 repertoire in CHM1 genome, which is a valuable contribution of the article. The use of a distance-based metric to infer fitness of a LINE-1 is an interesting approach and allow to group LINE-1 copies based on their in vivo activity potential. Again, it would be beneficial to correlate the inferred fitness and retrotransposition activity of copies/alleles, when known, from the above-mentioned literature.

      • The sequence validation of LINE-1 sequences in CHM1 is an important point which we have addressed in the edited manuscript. Specifically, we used three forms of sequence validation including end-sequencing of one clone of each LINE-1 after it was cloned into the retrotransposition vector and whole-plasmid sequencing of select LINE-1s with discrepant activity amongst the three clones we assayed. In addition, we sequenced the entire LINE-1 sequence for four LINE-1s which had the largest number of mutations relative to their allelic counterpart in CHM13. Please see the above response to ‘Major comment 1’ for details of our new analysis comparing the previous literature to our data.

      3 - Some aspects of the writing of the article should be improved to better support the conclusions.

      • We thank the reviewer for providing these examples of parts of the text that were particularly difficult to read and comprehend. We have deeply streamlined and improved the text throughout the manuscript based upon detailed editing for readability and clarity by two experienced scientific writers. Below, we detail how we revised the particular sections presented by the reviewer, but we think the entire manuscript is now more succinct and clearer.

      • In general, the descriptions are dense, and details could be provided in a more direct way to lighten the results section. Several redundancies in the discussion can be combined to increase clarity.

      • We have spent considerable time tightening up the text, including removing several overlapping sections from the discussion which can be seen in the included version with changes tracked.

      • There is a lack of clarity in the description of how was handled each pair of alleles for which retrotransposition measurements vary between the study and the literature (last paragraph of the "Comprehensive measurement of LINE-1 in vitro activity in a human genome" section). It is not completely clear how the analysis was done and the way the data is presented in File S3 is not helping to support the conclusion. It could be useful to include some illustrative examples in a panel of Figure 2.

      • We agree that this description was hard to parse, and we have rewritten this and accompanying methods to simplify our explanation of these results. In addition, we have revised Figure 2 to show the data in much more detail. To further aid the logic flow related to this section, we moved the previous Figure 5B to Figure 2B, updated it with more suitable examples and edited the associated descriptions.

      • Regarding inferred in vivo activity, the text contains alternative description with the use of "fit" / "unfit", in vivo "active" / "inactive" or "no closely related LINE-1s" terms. The authors should find a way to clearly define and systematically use one set of terms to enhance clarity along the article. To parallel with in vitro active/inactive, it would be useful to use in vivo fit/unfit.

      • We thank the reviewer for this suggestion and agree with their suggested unified use of ‘in vivo fit/unfit’. To clarify and simplify these terms as much as possible, we added detailed explanations of in vivo / in vitro activity and systematically defined in vitro "active/inactive" (page 5, right column, line 50) and in vivo "fit/unfit" (page 8, left column, line 26) at their first appearance in the article, and we changed most instances of "in vivo activity" to "in vivo fitness" when context permits.

      4 - The authors suggest that in vitro activity can be predicted by integration of population frequency and in vivo activity (/fitness) (second paragraph of the "An analysis of LINE-1 evolutionary history [...] and in vivo activity" section). It would be beneficial to strengthen the writing of this section and ultimately validate/test the model by including data from some of the previous studies (e.g. Brouha 2003, Lutz 2003, Seleme 2006, Beck 2010, Rodriguez-Martin 2020, Chuang 2021).

      • We have thoroughly revised this section of the results (see response to ‘Major comment 3’ above), per the reviewers suggestion, to increase reader comprehension of this important data. In addition, we greatly appreciate the reviewer’s suggestion of a very interesting experimental direction – moving beyond a single long-read-based genome to many diverse genomes, and ultimately calculating the in vivo fitness of the LINE-1s from these diverse genomes. For a long time this has not been possible, but the recent publication of the Human Pangenome presents an opportunity to study this interesting question. Though beyond the scope of this paper, our lab is actively working on this fascinating question, and we appreciate the reviewer’s shared interest in this question.

      5 - The identification of adaptive mutations is only partially described and not strongly supported by experimental or analytical data. It would be interesting to explore the role of phylogenetically informative sites described in Figure 5B/C by testing non CHM1 alleles in retrotransposition assay (by introducing amino acid changes into the cloned CHM1 LINE-1 alleles) or by positioning the sites in ORF1p or ORF2p structure and/or domains to infer impact on functionality.

      • The reviewer rightly points out that this is one of the most interesting and novel findings of our manuscript. However, the testing of potentially adaptive mutations is potentially complicated and nuanced. Specifically, we don’t know the mechanism by which these mutations might be adaptive. It is possible that they simply increase in vivo germline retrotransposition activity and this increase would be reflected by an increase of in vitro retrotransposition activity. However, another possibility is that these adaptive phenotypes only show themselves in vivo or in the context of the host restriction factors expressed in the germline. We strongly agree with the reviewer that experimental and analytical data on the phylogenetic informative sites associated with the Figure 5 phylogeny is the key to finding out the mechanisms for these changes to affect LINE-1 activity/fitness, and we are, indeed, exploring this very question in the lab now with related projects. We respectfully suggest that these (extremely cool) experiments are beyond scope of this paper, but we have also added some more detailed description and analyses of the potentially adaptive LINE-1 variations from Figure 5 (from page 9, right column, line 50 to page 10, left column, line 5).

      Minor comments:

      1 - Regarding the in vitro retrotransposition assay, it would be beneficial to provide more data. The current Figure 2 could be enriched by the addition of data related to the variation in the replicates of the experiment (technical but mostly biological with the three clones per LINE-1 tested). Figure 2 could include a dashed line for 100% L1RP and 5% (since it is used as a threshold). It would be useful to provide an additional panel in Figure 2 to illustrate alleles of LINE-1 that are active in this study and compare the values obtained previously in other studies. Similarly, a supplemental table or alignment could be provided to document amino acid changes in the two alleles of each pair (see comment above in the Major Comment 5). The L1Hs subfamilies could also be included in the graph of Figure 2 to support the conclusions of remaining active old L1Hs at allelic forms in the human genome.

      • Upon consideration of this helpful comment, we now augment the presentation of our in vitro activity data with a remade Figure 2 with boxplots to show the variation of the data, as well as a horizontal dashed line showing the active-cutoffs and star signs showing which LINE-1s belong to L1Hs or L1PA2.

      2 - Also, the validation of cloning is not well described. The choice of PCR validation must be supported by more technical details on the design of the primers used to validate each copy. The authors should clearly state that the strategy chosen for retrotransposition assay does not rely on the transcription from LINE-1 5UTR but from an upstream strong promoter, ruling out the role of potential mutations in LINE-1 promoter.

      • As detailed above in the response to ‘Major Comment 1’, we used a combination of end sequencing, whole plasmid sequencing, and multi-read Sanger sequencing to validate the sequences of each LINE-1 cloned from a CHM1 clone. When cloning each LINE-1, we used a specific set of primers designed for the ends of the UTRs for each LINE-1. We have updated the methods and text to clarify this cloning step, and the sequences of these oligos are included in Table S2.
      • To clarify the fact that our retrotransposition assays use a common, strong promoter, we added text in several places stating this setup and discussing (paragraph that starts at page 11, right column, line 18) how 5'UTRs and other non-ORF factors can affect the rate of LINE-1 in vitro activity.

      3 - There are discrepancies with the reported numbers of LINE-1s between Figure 1A and Table S1: 154 vs. 151 in CHM1, 144 vs. 143 in CHM13, respectively.

      • We thank the reviewer for spotting this error on our part. The numbers in Figure 1 and the main text were correct, and we have revised Table S1 to reflect this data.

      4 - The choice of colors in Figure 3 is not perfectly clear and sometimes not as reported in the text (green highlight and orange highlight). Part of the Figure 3 legend is missing. It should include a description of the color code chosen for the right histogram.

      • We thank the reviewer for bringing this inconsistency to our attention. Based upon feedback from all reviewers, we have simplified the color scheme in Figure 3 and Figure 5 to focus on the core conclusions of these two figures. Specifically, in Figure 3, we have removed the quadrant shading and more clearly presented the cutoffs of ‘polymorphic/high frequency’ and ‘in vitro active/inactive’ as dashed lines in the scatter plot. In Figure 5, we have simplified to two colors – black for in vivo unfit and orange to show the in vivo fit LINE-1s which is also used in Figure 4 to show the definition of in vivo activity. These updated colors are now defined in the figure legends and main text, and we have made references to these colors consistent throughout.

      5 - For Figure 4, it would be useful to define in the legends the color code for the top histogram. To better read the scatter plot, the words "fit" and "unfit" could be added on each side of the vertical dashed line.

      • We thank the reviewer again for suggestions to improve the clarity of our figures. As mentioned above in ‘Minor comment 1’, we have removed unnecessary colors including the gradient of the histograms in Figure 3 and Figure 4, since the boundaries of each bin are already defined by the axis labels and tics. As suggested, we have also added ‘fit’ and ‘unfit’ labels to the dashed cutoff line in Figure 4 to clarify the meaning of this line.

      6 - In panel B of Figure 5, it seems that the color code and hot/cold description is not fully formatted.

      • This formatting error has been corrected.

      Reviewer #1 (Significance (Required)):

      In this article, Yang and colleagues present an unprecedented view of the allelic diversity of young LINE-1 copies related to variable retrotransposition activity in an individual genome. One key aspect of their work is the description of the presence of young active LINE-1 alleles that are absent or non-intact in other genome assemblies, while described at a lower scale in initial work from the Kazazian and Moran labs, cited in the manuscript. The work of Yang et al. demonstrates the requirement of multiple approaches and long-read-based sequencing of individual genomes to fully infer the mutagenesis risk of LINE-1 activity.

      The data and methods provided by the authors open the door to a more systematic analysis of mutations and rare allelic forms to understand both mechanistic aspects and evolution of LINE-1 retrotransposition in the human genome. The identification of rare allelic forms of old LINE-1 that retain activity despite previously being considered as inactive is particularly interesting in the light of LINE-1 evolution in the human genome. The authors also describe allelic diversity inside of the Ta1d subfamily, suggesting further diversification and emergence of LINE-1 subgroups. Together with the identification of nucleotide polymorphism among LINE-1 copies, these findings strengthen the notion of individual genomes with individual set of potentially mutagenic LINE-1 alleles.

      The findings and methods described in this article are of great interest to a wide audience including the fields of research focusing on human genome evolution, transposable elements, genomic instability, human genetic variation, and personalized medical diagnostic.

      Aurélien J. Doucet CNRS - Université Côte d'Azur

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      This manuscript is an interesting and well-crafted study of LINE-1 activity at the single genome human genome level using long read-based haploid assemblies. The manuscript has some real gems and address critical aspects of LINE- biology that are typically not rigorously examined. The authors are to be commended for undertaking this exercise and for providing interesting perspectives that challenge the dogma that dominates the field in several areas. Despite the noted strengths of the contributions, the manuscript ignores the clear limitations inherent to the approaches taken and at times appears as dogmatic as the dogma that they themselves are trying to challenge. These deficiencies should be addressed before this manuscript is published.

      • We thank Reviewer 2 for their enthusiastic appreciation of the value and innovation of our manuscript. We also thank the reviewer for encouraging us to make careful consideration of the missing references relevant to our findings. We have had two researchers with experience in relevant fields edit our text for both readability, clarity, and proper inclusion of relevant references. We have added these throughout and taken careful effort to replace ‘dogmatic’ statements with clear presentations of the data and thorough referencing of the relevant literature.

      Several major and minor points to consider during revision include:

      Major:

      1. Several strategies have been published in the past that have confidently assign LINE-1s to specific loci despite use of shorter reads. These works should be acknowledged, even if as stated in the manuscript, use of longer reads will only continue to add confidence and validity to future assignments.

      2. We thank the reviewer for this suggestion, and we apologize for the omission of these important publications. As noted above, we have added numerous relevant references (reference 17-27 in the revised text) throughout the text including previous work that used short reads to confidently assign polymorphic/non-reference LINE-1s to specific loci. For example, we now cite the MELT pipeline to detect de novo L1 insertions with short reads (PMID: 28855259), and Iskow et al. 2010, which detects LINE-1s with junction fragment sequencing (PMID: 20603005). We have also added additional text to clarify that short reads are, indeed, often sufficient to place new LINE-1 insertions, while long reads are especially useful for resolving the sequence and location of these insertions. The new text (page 2, left column, line 22-30) presents the advantages/disadvantages of both short reads and long reads.

      3. One of the important requirements for precise quantification of LINE-1 activity and predicted risk scores cited in the manuscript was the need to predict activity based on sequence and location. This requirement, as posited in the manuscript, ignores the critical role of epigenetic control in the regulation of LINE-1 activity. As such, a discussion that acknowledges the critical roles of histone and DNA covalent modifications, and that integrates epigenomic insight into predictions of LINE-1 activity must be included in the manuscript.

      4. We thank the reviewer for suggesting this important discussion point. In response, we have expanded our discussion of this topic to place our data in the context of other literature on the effects of epigenomic regulation on in vivo LINE-1 activity, including histone and DNA modifications, as well as the effects of post transcriptional restriction factors (paragraph starting at page 11, right column, line 42).

      5. The limitations associated with the use of the CHMI were not addressed in the manuscript. While CHMI contain a paternal only genome, with no maternal contribution, the moles may arise from fertilization of an anuclear empty ovum by a haploid 23,X sperm or fertilization by two sperm giving rise to 46,XX or 46,XY karyotype. As such, generalizable conclusions about CHMI genetics should be carefully made given that the loss of maternal epigenetic imprinting and gain of paternally imprinted expression may result in abnormal gene expression, including that of LINE-1s. These variances will in turn impact LINE-1 activity profiles.

      6. We thank the reviewer for pointing out this confusingly written section of our manuscript, and we agree with the reviewer that LINE-1 activity measurements could be complicated in the CHM cell lines; however, all of our retrotransposition assays were carried out in the common background of 293T cells (chosen because of their low expression of know LINE-1 restriction factors (PMID: 25182477). We have modified the text (page 11, right column, line 52) to clarify these points.

      Minor

      1. Important citations of previously published work are not properly referenced throughout the manuscript. These are too numerous to identify individually, but the authors should carefully read the manuscript to ensure that proper documentation and reference to previous work is duly acknowledged.

      2. Please see our above response to ‘Major point 1’.

      There are several typos and missing prepositions that should be corrected. For instance, on page 7, the word "great" should be "greater".

      • Please see our above response to ‘Major point 1’ and Reviewer 1’s ‘Major comment 3’ for details on our in depth editing of the manuscript.

      Reviewer #2 (Significance (Required)):

      The contribution is highly significant as it challenges previously held concepts and advances our understanding of critical structure and function relationships of Line-1s.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Yang et al. perform an in-depth analysis of potentially mobile source L1 alleles in a single human genome (CHM1) previously subjected to Pacbio whole genome sequencing. The retrotransposition efficiencies of source L1 alleles with intact ORFs were tested in vitro, and these efficiencies compared to a model of in vivo activity based on Hamming distance to other ORF-intact L1 alleles. Comparisons of CHM1 L1 alleles are made to CHM13 (used for the recent T2T reference assembly), and also to population-scale sequencing efforts to establish how widespread each source L1 allele is. These data showcase the advantages of being able to resolve L1 alleles with long-read sequencing, allowing the field to make much more accurate predictions of retrotransposition potential in a given genome. The core analyses appear robust and for the most part enough detail is provided to follow what was done.

      • We thank Reviewer 3 for their in depth reading and analysis of our manuscript and data, and for their enthusiasm about the importance of this work in the context of foundational research from their lab and many others in the field. We have carefully considered each comment and completed several new analyses of our data and related data from other publications. We feel that our manuscript is much improved with this new data, as detailed below. Comments:

      1) The text overlooks the potential importance of L1 5'UTR mutations in L1 activity and evolution, as per PMID:25274305, PMID:1701022, and other studies, as well as the impact of genomic context on source L1 activity, as per PMID:27016617, PMID: 33186547 etc. L1 promoter evolution is arguably a major driver of L1 lineage emergence.

      • We thank the reviewer for suggesting these important additions. To present the relevance of 5'UTR mutations on LINE-1 activity and evolution, we added a discussion paragraph (paragraph starting at page 11, right column, line 16) to address how 5'UTRs and other non-ORF factors can affect the rate of LINE-1 in vitro activity. Several key references have been added and discussed in the paragraph: PMID:25274305 reported the regulation of human LINE-1 by the evolution of its 5'UTR; PMID:1701022 was one of the earliest papers that found the effect the 5'UTR promoters on human LINE-1 retrotransposition; PMID: 27016617 and PMID: 33186547 reported specific L1 loci regulated by different promoters and was included in the discussion; PMID:9430649 was one of the examples of non-human LINE-1 lineages emerging because of different promoters and was cited in the added discussion paragraph. We have also added discussion points to make clear that genomic content has a clear role in the activity of source LINE-1s (paragraph starting at page 11, right column, line 42).

      2) The way the retrotransposition assay is done here (I think) removes parts of the UTRs as part of introducing L1s into retrotransposition vectors, meaning that the assay tests the biochemical activity of the ORFs. It would be helpful to readers to have a more detailed method for this assay, including the origins of the reporter plasmids, whether there is a CMVp boosting the L1 promoter etc, and some clarity about how much of each L1 was cloned into the assay.

      • We have added relevant details to the results (page 6, left column, line 5), discussion (page 11, right column, line 52), and methods (page 13, right column, line 16 and 30) sections to clarify the reviewer’s important points. The LINE-1s tested for in vitro activity were cloned in their entirety (UTRs and ORFs) but driven by both their native promoters in the 5'UTR as well as an upstream CMV promoter. Also, please see our response to Reviewer 1 ‘Minor comment 2’ above.

      3) Pacbio long-read sequencing has been used previously to locate and characterise L1 alleles in human DNA. The Introduction states: "These represent the first scalable methods to catalog LINE-1 locations and sequences in individual human genomes". The "first" here is questionable. Citations to PMID:31853540 and PMID:34772701 should be included. The latter is particularly relevant at it not only resolves source L1 sequences with PacBio sequencing but also summarises their retrotransposition efficiencies in vitro and population frequencies.

      • We apologize for leaving out these and other important references, and we agree that the “first” claim is unnecessary. We have added the references suggested for the reviewer as well as several other important references as detailed in the above response to Reviewer 2 ‘Major point 1’. In addition, we have revised the adjacent text and deleted any references to our work as the “first” in these approaches.

      4) I am very interested in the two source L1s (on chr7 and chr9) that were found here to be more active in vitro than L1RP (to my knowledge the most active such element isolated to date, or close to it). Is there anything unusual about these two L1s? A quick look at the supplemental suggested the chr9 element was 5' truncated, was it tested as such in vitro? Also I think it would be worth contrasting the assay (all in HEKs) used here to test efficiency with the assay used by Brouha ... I feel readers may be surprised to find two L1s more mobile than L1RP in one genome.

      • To provide more details about the two active L1s (chr7 and chr9), we investigated key changes that could be related to the in vitro activity of these elements and now show them in Figure 2B and File S3. In the process of this updated analysis and suggested modifications to Figure 2 by this reviewer and Reviewer 1, we saw that the chr7 L1, mentioned here, had one very high activity measurement pulling its activity above L1RP. As such, we decided to more rigorously normalize our data by using the positive and negative controls across all plates of each day instead of normalizing to the controls of individual plates, as we had previously done. In addition, for any L1 with discrepant activity among the three clones we assayed, we used whole plasmid sequencing to confirm the identity and consistency of all three clones. In three cases, we found that one or two of the three clones was the wrong L1, and hence excluded them for the in vitro activity calculation. After this validation and testing of additional clones, all clones from the same L1 have consistent in vitro activity (see updated Figure 2). The updated in vitro activity of the chr7 L1 is at 86.7% L1RP, and the chr9 L1 is at 261.4% L1RP in addition to the chr17 LINE-1 with 117% L1RP and two additional LINE-1s that have near-L1RP activity levels (Table S2, column S). These changes in L1 activity were updated in the text, figures, and supplemental materials. Also, we note that the chr9 element is 6019bp in length and was tested as such in vitro. Current work in the lab is attempting to understand the mechanisms of increased LINE-1 in vitro and in vivo activity, as described in detail in response to Reviewer 1’s ‘Major comment 5’.

      5) In several places it is mentioned how L1 alleles may differ from sequences provided in reference assemblies, and may therefore explain discrepancies between assay results here and in other studies (e.g. Brouha). The Seleme and Lutz papers are correctly mentioned here, but arguably the most complete demonstration of this concept, from PMID:31230816, is overlooked. This study reports a chr13 source L1 that was previously found to be inactive by Brouha, and with broken ORFs in the reference genome, has both mobile and immobile alleles in the human population. This L1 is actually in CHM13, but not CHM1, and is "hot" in some individuals and not others. There are several places in the manuscript where this earlier study is very relevant and it would be fair to ask it to be mentioned, especially as the results are concordant. The same concept is reinforced by an even more recent paper (PMID:35728967), except in macaque, showing that this is a general consideration for primate L1 lineages, and actually that source L1 is relatively old and yet jumps extremely well in vitro, which fits an observation made in the present study. Mutually supporting observations like these really add confidence that what is reported in the present study is robust.

      • We thank the reviewer for their suggestion to include these highly relevant and important papers; we apologize for this initial omission. We have now added several sentences to the introduction and discussion (top left paragraph page 11) in addition to citations of these papers.

      6) Hamming distance between ORF-intact source L1 alleles is used to assess in vivo activity. This seems reasonable. However, in other works, transductions have been used to identify families of very closely related L1s. I realise that many highly mobile source L1s will rarely generate insertions carrying transductions, and yet I wonder if any of the youngest L1s in the present study form transduction families, and whether estimates of in vivo activity based on transductions found in population-scale data would reconcile better with in vitro retrotransposition assay data.

      • We thank the reviewer for pointing out our exclusion of data on 3' transductions, the most commonly used surrogates of in vivo activity, while also acknowledging that only a small percent of new L1 retrotranspositions carry 3' transduction. Please see our above response to Reviewer 1’s ‘Major comment 1’ for details on our newly added comparison of our in vivo activity data to the 3' transduction-based somatic LINE-1 retrotransposition landscape of those reported in PMID:34772701, PMID:32024998 and PMID:25082706.

      7) In the Introduction, it is stated that L1 only transmits vertically. It may be prudent to mildly qualify this position, based on PMID:29983116.

      • The referenced text in the introduction has been changed from "LINE-1s only transmit vertically" to "LINE-1s generally transmit vertically with few exceptions", with the addition of the suggested citation.

      8) A column in Table S2 looks mislabelled: Column R should be CHM1 not CHM13?

      • We thank the reviewer for seeing this error. Column P (Column R in the previous version) of Table S2 is now correctly labeled as "CHM1 L1 intactness".

      Geoff Faulkner (University of Queensland)

      Reviewer #3 (Significance (Required)):

      This is a well-executed study of considerable interest to the mobile DNA field, and anyone working with long-read DNA sequencing. Its strengths are the genomic and bioinformatic analysis, leveraging the PacBio long-read data and BAC library available for CHM1 to full effect. One limitation (in current form) is its near-exclusive focus on ORFs to encapsulate how mobile a given L1 allele is, when genomic context and L1 promoter mutations could also contribute heavily. Although I liked the manuscript very much and enjoyed reviewing it, some of the conceptual advances are encroached upon by other work (including some very relevant and yet uncited literature). These issues can very likely be addressed via a revision, additional analyses may be required but not new experiments.

      Geoff Faulkner (University of Queensland)

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Yang et al. perform an in-depth analysis of potentially mobile source L1 alleles in a single human genome (CHM1) previously subjected to Pacbio whole genome sequencing. The retrotransposition efficiencies of source L1 alleles with intact ORFs were tested in vitro, and these efficiencies compared to a model of in vivo activity based on Hamming distance to other ORF-intact L1 alleles. Comparisons of CHM1 L1 alleles are made to CHM13 (used for the recent T2T reference assembly), and also to population-scale sequencing efforts to establish how widespread each source L1 allele is. These data showcase the advantages of being able to resolve L1 alleles with long-read sequencing, allowing the field to make much more accurate predictions of retrotransposition potential in a given genome. The core analyses appear robust and for the most part enough detail is provided to follow what was done.

      Comments:

      1. The text overlooks the potential importance of L1 5'UTR mutations in L1 activity and evolution, as per PMID:25274305, PMID:1701022, and other studies, as well as the impact of genomic context on source L1 activity, as per PMID:27016617, PMID: 33186547 etc. L1 promoter evolution is arguably a major driver of L1 lineage emergence.
      2. The way the retrotransposition assay is done here (I think) removes parts of the UTRs as part of introducing L1s into retrotransposition vectors, meaning that the assay tests the biochemical activity of the ORFs. It would be helpful to readers to have a more detailed method for this assay, including the origins of the reporter plasmids, whether there is a CMVp boosting the L1 promoter etc, and some clarity about how much of each L1 was cloned into the assay.
      3. Pacbio long-read sequencing has been used previously to locate and characterise L1 alleles in human DNA. The Introduction states: "These represent the first scalable methods to catalog LINE-1 locations and sequences in individual human genomes". The "first" here is questionable. Citations to PMID:31853540 and PMID:34772701 should be included. The latter is particularly relevant at it not only resolves source L1 sequences with PacBio sequencing but also summarises their retrotransposition efficiencies in vitro and population frequencies.
      4. I am very interested in the two source L1s (on chr7 and chr9) that were found here to be more active in vitro than L1RP (to my knowledge the most active such element isolated to date, or close to it). Is there anything unusual about these two L1s? A quick look at the supplemental suggested the chr9 element was 5' truncated, was it tested as such in vitro? Also I think it would be worth contrasting the assay (all in HEKs) used here to test efficiency with the assay used by Brouha ... I feel readers may be surprised to find two L1s more mobile than L1RP in one genome.
      5. In several places it is mentioned how L1 alleles may differ from sequences provided in reference assemblies, and may therefore explain discrepancies between assay results here and in other studies (e.g. Brouha). The Seleme and Lutz papers are correctly mentioned here, but arguably the most complete demonstration of this concept, from PMID:31230816, is overlooked. This study reports a chr13 source L1 that was previously found to be inactive by Brouha, and with broken ORFs in the reference genome, has both mobile and immobile alleles in the human population. This L1 is actually in CHM13, but not CHM1, and is "hot" in some individuals and not others. There are several places in the manuscript where this earlier study is very relevant and it would be fair to ask it to be mentioned, especially as the results are concordant. The same concept is reinforced by an even more recent paper (PMID:35728967), except in macaque, showing that this is a general consideration for primate L1 lineages, and actually that source L1 is relatively old and yet jumps extremely well in vitro, which fits an observation made in the present study. Mutually supporting observations like these really add confidence that what is reported in the present study is robust.
      6. Hamming distance between ORF-intact source L1 alleles is used to assess in vivo activity. This seems reasonable. However, in other works, transductions have been used to identify families of very closely related L1s. I realise that many highly mobile source L1s will rarely generate insertions carrying transductions, and yet I wonder if any of the youngest L1s in the present study form transduction families, and whether estimates of in vivo activity based on transductions found in population-scale data would reconcile better with in vitro retrotransposition assay data.
      7. In the Introduction, it is stated that L1 only transmits vertically. It may be prudent to mildly qualify this position, based on PMID:29983116.
      8. A column in Table S2 looks mislabelled: Column R should be CHM1 not CHM13?

      Geoff Faulkner (University of Queensland)

      Significance

      This is a well-executed study of considerable interest to the mobile DNA field, and anyone working with long-read DNA sequencing. Its strengths are the genomic and bioinformatic analysis, leveraging the PacBio long-read data and BAC library available for CHM1 to full effect. One limitation (in current form) is its near-exclusive focus on ORFs to encapsulate how mobile a given L1 allele is, when genomic context and L1 promoter mutations could also contribute heavily. Although I liked the manuscript very much and enjoyed reviewing it, some of the conceptual advances are encroached upon by other work (including some very relevant and yet uncited literature). These issues can very likely be addressed via a revision, additional analyses may be required but not new experiments.

      Geoff Faulkner (University of Queensland)

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript is an interesting and well-crafted study of LINE-1 activity at the single genome human genome level using long read-based haploid assemblies. The manuscript has some real gems and address critical aspects of LINE- biology that are typically not rigorously examined. The authors are to be commended for undertaking this exercise and for providing interesting perspectives that challenge the dogma that dominates the field in several areas. Despite the noted strengths of the contributions, the manuscript ignores the clear limitations inherent to the approaches taken and at times appears as dogmatic as the dogma that they themselves are trying to challenge. These deficiencies should be addressed before this manuscript is published.

      Several major and minor points to consider during revision include:

      Major:

      1. Several strategies have been published in the past that have confidently assign LINE-1s to specific loci despite use of shorter reads. These works should be acknowledged, even if as stated in the manuscript, use of longer reads will only continue to add confidence and validity to future assignments.
      2. One of the important requirements for precise quantification of LINE-1 activity and predicted risk scores cited in the manuscript was the need to predict activity based on sequence and location. This requirement, as posited in the manuscript, ignores the critical role of epigenetic control in the regulation of LINE-1 activity. As such, a discussion that acknowledges the critical roles of histone and DNA covalent modifications, and that integrates epigenomic insight into predictions of LINE-1 activity must be included in the manuscript.
      3. The limitations associated with the use of the CHMI were not addressed in the manuscript. While CHMI contain a paternal only genome, with no maternal contribution, the moles may arise from fertilization of an anuclear empty ovum by a haploid 23,X sperm or fertilization by two sperm giving rise to 46,XX or 46,XY karyotype. As such, generalizable conclusions about CHMI genetics should be carefully made given that the loss of maternal epigenetic imprinting and gain of paternally imprinted expression may result in abnormal gene expression, including that of LINE-1s. These variances will in turn impact LINE-1 activity profiles.

      Minor

      1. Important citations of previously published work are not properly referenced throughout the manuscript. These are too numerous to identify individually, but the authors should carefully read the manuscript to ensure that proper documentation and reference to previous work is duly acknowledged.
      2. There are several typos and missing prepositions that should be corrected. For instance, on page 7, the word "great" should be "greater".

      Significance

      The contribution is highly significant as it challenges previously held concepts and advances our understanding of critical structure and function relationships of Line-1s.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Yang et al. took advantage of recently published long-read-based genomic sequences of nearly homozygous genomes from complete hydatidiform moles to retrieve allelic sequences of LINE-1, the currently only active and autonomous retrotransposon of the human genome, and produced the repertoire of intact LINE-1 in a genome. The authors performed cell-culture-based retrotransposition assays measurements and in vivo fitness estimations of all identified intact LINE-1 to infer evolutionary dynamics. In this article, the authors further validate the major contribution of polymorphic LINE-1 to the de novo retrotransposition events in the human genome. They also described, at unprecedented resolution, allelic variations among LINE-1 loci and the potential impact of these variations to the interpretation of mutagenic potential of each LINE-1 locus.

      Major comments:

      1. The key conclusions of the article are mostly convincing. However, it would be a substantial improvement to consolidate the data of the article with information about known active LINE-1s in germ cells or in cancer by using data from recent publications of the Devine and Tubio labs (for example PMID: 34772701, 32024998, 25082706). Across the article, no mention is made of the transductions generated during LINE-1 de novo retrotransposition, which is instrumental to monitor in vivo activity of a group of LINE-1 active copies. It would be of particular interest to make a link between in vitro activity from this study with LINE-1 classification based on their observed activity in cancer (PMID: 32024998, Figure 3b).
      2. The use of CHM1 BAC library Sanger sequencing validation and comparison with CHM13 and hg38 sequences is instrumental to support the building of LINE-1 repertoire in CHM1 genome, which is a valuable contribution of the article. The use of a distance-based metric to infer fitness of a LINE-1 is an interesting approach and allow to group LINE-1 copies based on their in vivo activity potential. Again, it would be beneficial to correlate the inferred fitness and retrotransposition activity of copies/alleles, when known, from the above-mentioned literature.
      3. Some aspects of the writing of the article should be improved to better support the conclusions.
        • In general, the descriptions are dense, and details could be provided in a more direct way to lighten the results section. Several redundancies in the discussion can be combined to increase clarity.
        • There is a lack of clarity in the description of how was handled each pair of alleles for which retrotransposition measurements vary between the study and the literature (last paragraph of the "Comprehensive measurement of LINE-1 in vitro activity in a human genome" section). It is not completely clear how the analysis was done and the way the data is presented in File S3 is not helping to support the conclusion. It could be useful to include some illustrative examples in a panel of Figure 2.
        • Regarding inferred in vivo activity, the text contains alternative description with the use of "fit" / "unfit", in vivo "active" / "inactive" or "no closely related LINE-1s" terms. The authors should find a way to clearly define and systematically use one set of terms to enhance clarity along the article. To parallel with in vitro active/inactive, it would be useful to use in vivo fit/unfit.
      4. The authors suggest that in vitro activity can be predicted by integration of population frequency and in vivo activity (/fitness) (second paragraph of the "An analysis of LINE-1 evolutionary history [...] and in vivo activity" section). It would be beneficial to strengthen the writing of this section and ultimately validate/test the model by including data from some of the previous studies (e.g. Brouha 2003, Lutz 2003, Seleme 2006, Beck 2010, Rodriguez-Martin 2020, Chuang 2021).
      5. The identification of adaptive mutations is only partially described and not strongly supported by experimental or analytical data. It would be interesting to explore the role of phylogenetically informative sites described in Figure 5B/C by testing non CHM1 alleles in retrotransposition assay (by introducing amino acid changes into the cloned CHM1 LINE-1 alleles) or by positioning the sites in ORF1p or ORF2p structure and/or domains to infer impact on functionality.

      Minor comments:

      1. Regarding the in vitro retrotransposition assay, it would be beneficial to provide more data. The current Figure 2 could be enriched by the addition of data related to the variation in the replicates of the experiment (technical but mostly biological with the three clones per LINE-1 tested). Figure 2 could include a dashed line for 100% L1RP and 5% (since it is used as a threshold). It would be useful to provide an additional panel in Figure 2 to illustrate alleles of LINE-1 that are active in this study and compare the values obtained previously in other studies. Similarly, a supplemental table or alignment could be provided to document amino acid changes in the two alleles of each pair (see comment above in the Major Comment 5). The L1Hs subfamilies could also be included in the graph of Figure 2 to support the conclusions of remaining active old L1Hs at allelic forms in the human genome.
      2. Also, the validation of cloning is not well described. The choice of PCR validation must be supported by more technical details on the design of the primers used to validate each copy. The authors should clearly state that the strategy chosen for retrotransposition assay does not rely on the transcription from LINE-1 5UTR but from an upstream strong promoter, ruling out the role of potential mutations in LINE-1 promoter.
      3. There are discrepancies with the reported numbers of LINE-1s between Figure 1A and Table S1: 154 vs. 151 in CHM1, 144 vs. 143 in CHM13, respectively.
      4. The choice of colors in Figure 3 is not perfectly clear and sometimes not as reported in the text (green highlight and orange highlight). Part of the Figure 3 legend is missing. It should include a description of the color code chosen for the right histogram.
      5. For Figure 4, it would be useful to define in the legends the color code for the top histogram. To better read the scatter plot, the words "fit" and "unfit" could be added on each side of the vertical dashed line.
      6. In panel B of Figure 5, it seems that the color code and hot/cold description is not fully formatted.

      Significance

      In this article, Yang and colleagues present an unprecedented view of the allelic diversity of young LINE-1 copies related to variable retrotransposition activity in an individual genome. One key aspect of their work is the description of the presence of young active LINE-1 alleles that are absent or non-intact in other genome assemblies, while described at a lower scale in initial work from the Kazazian and Moran labs, cited in the manuscript. The work of Yang et al. demonstrates the requirement of multiple approaches and long-read-based sequencing of individual genomes to fully infer the mutagenesis risk of LINE-1 activity. The data and methods provided by the authors open the door to a more systematic analysis of mutations and rare allelic forms to understand both mechanistic aspects and evolution of LINE-1 retrotransposition in the human genome. The identification of rare allelic forms of old LINE-1 that retain activity despite previously being considered as inactive is particularly interesting in the light of LINE-1 evolution in the human genome. The authors also describe allelic diversity inside of the Ta1d subfamily, suggesting further diversification and emergence of LINE-1 subgroups. Together with the identification of nucleotide polymorphism among LINE-1 copies, these findings strengthen the notion of individual genomes with individual set of potentially mutagenic LINE-1 alleles. The findings and methods described in this article are of great interest to a wide audience including the fields of research focusing on human genome evolution, transposable elements, genomic instability, human genetic variation, and personalized medical diagnostic.

      Aurélien J. Doucet CNRS - Université Côte d'Azur

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Responses to Reviewers’ Comments

      __Reviewer_ #1 (Evidence, reproducibility and clarity (Required)):_ __ *Summary In this study, using genetic labeling and deletion modeling, the authors discovered that Tbx1 myoprogenitors, rather than Pax3+/Myf5+ cells, give rise to supraclavicular brown adipose tissue (scBAT). This finding is both intriguing and significant. Furthermore, the genetic ablation of PPARγ or PRDM16, driven by Tbx1-Cre, reduced the size/weight of scBAT and its thermogenesis function capacity, supporting the importance of Tbx-1 cells. Interestingly, the authors found that human scBAT, located in the deep neck region, exhibits higher Tbx1 expression than subcutaneous neck WAT, potentially indicating a similar origin of scBAT in rodents and humans. Overall, this novel finding is exciting and could push the BAT field into a new phase. The manuscript is also well-written and organized.

      Comments 1. The authors assert that Myf5+ progenitors do not contribute to scBAT adipocytes. However, Figure 2C shows that 7% of medial scBAT are mG positive, suggesting a minor contribution of Myf5+ progenitors to scBAT. The conclusion that scBAT does not originate from Myf5+ precursor cells may be overly strong.*

      __Response____: __We thank the reviewer for pointing out the overstatement. We have rephased the conclusion from Figure 2 as “Myf5+ progenitors seldom give rise to scBAT adipocytes”.

      The authors claim that "prdm16 is dispensable for the development of scBAT." However, supporting data seems insufficient. The absence of a difference in scBAT weight does not guarantee that development was unaffected. Additional experiments, like H&E staining and immunofluorescence (IF) staining of RFP/GFP, could help demonstrate a similar number of GFP+ brown adipocytes in scBAT, thereby supporting this statement.

      Response____: __We do not have direct evidence that Prdm16 is dispensable for scBAT development; therefore, we have removed such statement in the revised manuscript. In addition, we provided H&E staining of scBAT, demonstrating evident adipose whitening and fibrosis in Prdm16ΔTbx1 mice (__Figure 7C). These results are consistent with previous findings that PRDM16 is required for the maintenance of brown adipocyte identity (PMID: 24703692).

      • In the "scBAT contributes to temperature maintenance in mice" section, this phenomenon only seems to apply to female mice (Figure 5). This conclusion may need adjustment to account for this sex difference. Moreover, for females, are there H&E staining results available for scBAT? For males, is there a change in Ucp1 expression? It could be beneficial to examine mRNA expression levels for additional thermogenic genes, such as Dio2, Prdm16, Cidea, and PGC-1a.*

      Response____: __Following the reviewer’s suggestion, we rephased our conclusion from Figure 5 to "scBAT contributes to temperature maintenance in __female mice". Also as requested, we provided the H&E staining of female scBAT (Figure 5E), showing a partial loss of brown adipose identity in PpargΔTbx1 mice. For males, we performed RT-qPCR and calculated total gene expression based on tissue weight. While Pparg, Prdm16 and Dio2 total expression was downregulated in Pparg-deficient scBAT, total Ucp1 expression was unchanged in scBAT and iBAT of male mice (Figure 5N, O). This might explain the similar cold tolerance between wildtype and PpargΔTbx1 males (Figure 5P).

      • Considering that mutant mice have smaller scBAT, how does this difference influence glucose homeostasis between WT and mutants?*

      Response____: __We performed glucose tolerance test and found no difference in glucose homeostasis of HFD PpargΔTbx1 mice (__Figure 6C). This could be due to intact iBAT in these animals, which is a much larger glucose consumer than scBAT.

      • In Figures 5G & M, it seems more accurate for the y-axis label to read "Body temperature change."*

      __Response____: __We changed the y-axis label. Thank you for the suggestions.

      • There is inconsistency in scale bar labeling: o Figure 3A vs 3D o Figure 4B vs 4C o Figure 5L, where scale bars are missing in the H&E staining images o Figure S1, where the scale bar is in the middle of the image*

      __Response____: __In the revised manuscript, we have provided consistent scale bars in all figures, including all these pointed by the reviewer. Thank you.

      • In Figure 3, the western blot data should be quantified, and the molecular weight (kDa) should be included to clarify the band's position.*

      Response____: __We have now indicated molecular weights in __Figure 3C, F and provided quantified UCP1 and PPARγ expression levels to the right of Western blots.

      • The statement "RT-qPCR revealed much higher levels of TBX expression in total lysates from deep neck BAT (Figure 8A)" could be clarified by adding "than neck WAT" at the end. *

      __Response____: __Done. Thank you for your suggestion.

      Reviewer____ #1 (Significance (Required)): Significance The major advancement of this study lies in the authors' novel finding of the embryonic lineage of brown adipocytes in the supraclavicular brown adipose tissue (scBAT) depot. They demonstrated that this lineage differs from those in the dorsal BAT depots by utilizing Myf5-Cre, Pax3-Cre, and TBX1-Cre reporter mouse models.

      __Reviewer____ #2 (Evidence, reproducibility and clarity (Required)): __ *This study aimed to investigate the developmental origin of brown adipose tissue (BAT) in the supraclavicular region and its implications for metabolic health. The authors have used genetic fate mapping in mice to trace the lineage of brown adipocytes in the supraclavicular region. The findings revealed that supraclavicular brown adipocytes do not originate from the Pax3+/Myf5+ epaxial dermomyotome, which is responsible for the development of interscapular BAT. Instead, most supraclavicular brown adipocytes were marked by the Tbx1+ lineage, indicating that the pharyngeal mesoderm is involved in their development. This work provides the first evidence that scBAT adipocytes do not share the same embryonic origins as iBAT fat cells. By identifying the location-specific myogenic progenitors for supraclavicular BAT versus interscapular BAT, the researchers shed light on the distinct developmental origins of different BAT depots. Overall, these findings provide new insights into the developmental origin of supraclavicular BAT and highlight the need to consider the anatomical locations and developmental origins in studying BAT development and function.

      Major comments: The manuscript addresses an important yet understudied area and provides convincing results that support the key conclusions. The authors also engage in extensive discussion, speculating on the broader implications of the findings and outlining the limitations of their study. The observation that the loss of Pparg in Tbx1-expressing cells leads to a reduction in supraclavicular BAT is intriguing. However, as the authors acknowledge, Tbx1 is expressed in inguinal white adipocytes as well. Therefore, it remains unclear how changes in Tbx1 in ingWAT in this model might affect BAT depots. The authors should provide further clarification on this matter and include additional discussion regarding the potential indirect effects and limitations of these models. *

      Response____: __We thank the reviewer for their overall enthusiasm about our manuscript, in which we primarily focus on Tbx1 lineage cells in BAT, the supraclavicular depot to be more specific. Tbx1-Cre mediated genetic deletion of the Pparg gene will inevitably perturb PPARγ function in other Tbx1-expressing tissues, such as scapular muscles and certain WAT depots. In a recent publication (PMID: 32240964), the Tbx1 gene was shown to be expressed in mouse inguinal WAT (iWAT) at a much higher level than interscapular BAT. TBX1 overexpression was sufficient to induce UCP1 protein in iWAT, while TBX1 deficiency rendered mice more sensitive to cold-induced body temperature drop. However, in our PpargΔTbx1 mice, we did not observe any significant loss of Pparg gene expression in iWAT (__Figure S3F), suggesting the possibility that Tbx1-positive cells are only a small proportion of iWAT. Nevertheless, we did not see any reduced expression of thermogenic genes in iWAT of PpargΔTbx1 mice (Figure S3F), indicating either that Tbx1-expressing cells are dispensable for thermogenesis or that other still-undefined mechanisms exist to compensate for the loss of PPARγ. We acknowledge the limitations of our animal models and have discussed these points in the second paragraph of Discussion.

      In Figure 5, it would be beneficial to include the expression of a broader range of thermogenic genes and proteins in both males and females. This would strengthen the argument that the loss of Pparg in Tbx1+ progenitors impairs scBAT function.

      Response____: __Thank you for the suggestion and we have now provided the expression of additional thermogenic genes including Ucp1, Prdm16, Ppargc1a, Cidea, and Dio2 in both males and females (__Figure 5F, G, N, and O). These results demonstrate that loss of PPARγ in Tbx1+ myoprogenitors impairs scBAT thermogenic function, with more profound impact on males than females.

      *Minor Comments: It would be more appropriate to present the results of the cold challenge studies as absolute body temperature rather than just the difference between groups. This is important because there could be baseline differences in basal body temperature among the experimental groups. *

      __Response____: __Thank you for the suggestion. The results summarized in Figure 5H and 5P were from several batches of independent experiments that were carried out at different times. Plotting absolute body temperature created a bigger variance that precluded us from drawing any conclusion. Therefore, we decided to plot the data as changes in body temperature in Figure 5H and 5P. As shown in Figure 8, we did not observe baseline differences in body temperature between wildtype and Prdm16ΔTbx1 mice, suggesting that loss of scBAT thermogenic function would not affect basal body temperature.

      *It is recommended to include representative images of cre-negative animals to validate the reagents and models used in the study. Including such images would enhance the reliability of the experimental approach and strengthen the overall validity of the findings. *

      Response____: __Representative images of Cre-negative mT/mG reporter animals have now been provided in __Figure S1. Thank you for the suggestion.

      The manuscript would benefit from placing the findings in the context of the broader field. For instance, discussing whether the loss of Prdm16 in Myf5 progenitors has a similar impact on interscapular BAT (iBAT) development or thermogenic function would provide valuable insights. Additionally, exploring the relationship between Tbx1 progenitors and adipocyte progenitors in adult BAT depots, such as Pdgfra+ and Trpv1+ progenitors, could further enhance our understanding of the developmental origins and functional characteristics of different adipocyte populations

      __Response____: __The reviewer raised an important point that we should place our finders in the context of the broader field. Indeed, it has been reported earlier that loss of PRDM16 in Myf5 progenitors leads to iBAT dysfunction in adult mice (PMID: 24703692), a similar phenotype to what we found in scBAT of Prdm16ΔTbx1 mice. The early embryonic development was not evidently perturbed in iBAT (using Myf5-cre) or scBAT (using Tbx1-cre), which might be compensated by other PRDM family protein, such as PRDM3. However, using Adipoq-Cre to knock out PRDM16 only leads to defects in beige adipocyte, but not classic BAT (PMID: 24439384). Together, these results demonstrate that PRDM16 in myoprogenitors is required for brown adipocyte identity maintenance during aging and similar transcriptional regulatory circuits control the differentiation and/or activity of both iBAT and scBAT. Further investigations are warranted to identify depot-specific regulations and functions of BAT. In adult BAT, progenitors that are marked by genes like Pdgfra and Trpv1 have been reported to contribute to cold-induced BAT recruitment and tissue homeostasis. While not the scope of our current research, future endeavors are needed to test if embryonic Tbx1+ myoprogenitors give rise to all or only some populations of adult BAT progenitors. These points have now been added to the revised Discussion section.

      Reviewer____ #2 (Significance (Required)): * Field of expertise: Adipose biology, Developmental biology, Thermogenesis

      The manuscript addresses an important and previously understudied area in the field of brown adipose tissue (BAT) development. The study challenges the long-standing assumption that Pax3+/Myf5+ progenitors in the dermomyotome are the sole developmental source of brown adipocytes in mice, including both interscapular BAT (iBAT) and other brown adipocytes.

      This work provides experimental evidence that the supraclavicular BAT (scBAT) adipocytes have a distinct embryonic origin compared to iBAT fat cells. While the developmental source of iBAT has been known for over a decade, this study demonstrates for the first time that scBAT adipocytes do not arise from the Pax3+/Myf5+ progenitors in the dermomyotome.

      The findings presented in this manuscript have the potential to make a lasting impact on the brown adipose tissue research community, particularly those interested in the developmental aspects of brown fat. *

      __Reviewer____ #3 (Evidence, reproducibility and clarity (Required)): __ *This manuscript explores the cell origin of supraclavicular brown adipose tissue (scBAT). UCP1-expressing brown/beige adipocytes are found in several anatomical locations but most studies have been focused on intrascapular brown adipose tissue (iBAT) in rodents, a depot that only exist in infants (but not adults) in humans. Very little is known about the biology (including the origin) of other BAT deposes. Here using multiple lineage tracing tools and conditional KO mice the authors convincingly demonstrate that about 50% of the scBAT cells originate from Tbx1+ myoprogenitors and provide functional evidence that ablation of Pparg or Prdm16 in Tbx1-cells blocks development of scBAT or affects thermogenic function of mice. The study design is straightforward, well carried out and the conclusion is supported by the data. The discovery has significant implication as brown adipocytes are mainly found in the supraclavicular region in the human body. I have few relative minor comments.

      Tbx-Cre lineage tracing indicates that 50% scBAT cells are marked by Tbx1. The authors discussed this may be either due to low recombination efficiency of Tbx1-Cre or due to multiple lineage origin of scBAT cells. The possibility of Cre recombination efficiency can be addressed easily by genomic DNA recombination analysis. *

      Response____: __To test recombination efficiency, we performed RT-PCR to detect wildtype and mutant Pparg mRNA using primers provided in the original report of the Pparg-floxed mice (PMID: 14660788). The mutant Pparg mRNA could only be detected in scBAT, not iBAT, of PpargΔTbx1 mice (__Figure S3A). The similar abundance of wildtype and mutant Pparg transcripts in PpargΔTbx1 scBAT indicates a 50% recombination rate. However, we currently cannot distinguish the following possible reasons for the incomplete recombination: 1) only half of scBAT adipocytes are Tbx1-lineage cells; 2) a portion of Tbx1-progeny cells have low Cre expression and thus no recombination; 3) The Tbx1 promoter does not fully recapitulate the endogenous Tbx1 gene expression. The identity of non-Tbx1 progeny adipocytes in the scBAT depot requires future investigations.

      *In the introduction the authors conclude that Myf5, Pax3 are "location markers" instead of cell identity markers. While I agree with this statement it is quite confusing. It implies that the study aims to identify cell identify markers but based on the analogy Tbx1 is also a location marker. *

      __Response____: __We apologize for the confusion caused by using “identity” to describe both lineage origins and functional cell types. Based on previous and our current studies, we propose Myf5, Pax3, and Tbx1 as location markers that distinguish BAT depots at different locations. They are not identity markers as these progenitors also give rise to other cell types, such as muscles, white adipocytes, fibroblasts, and others. We removed any statements indicating Tbx1 as an identity marker.

      *Figure 8 human expression compares TBX1 mRNA levels in neck fat to subcutaneous white fat. This is different from the comparison of dorsal and ventral BAT in mice. Therefore, the data are not particularly relevant to this study and should be moved to a supplemental figure. *

      __Response____: __We agree with the reviewer that the comparison of human TBX1 was not made between dorsal and ventral BAT. This is due the loss of dorsal BAT in adult humans, which make it almost impossible to make such direct comparison in humans. Nonetheless, the higher expression of TBX1 in deep neck BAT compared to subcutaneous neck WAT suggests that TBX1 might be functionally important for BAT thermogenesis. This is consistent with a recent report using transgenic Tbx1 mouse models that demonstrates the role of adipose TBX1 in thermogenic capacity (PMID: 32240964). We believe that data in Figure 8 is relevant to our current study, but we are also receptive to move it to the supplementary if the reviewer insists so.

      *Can the authors speculate/discuss on why there is no Pax3 labelled cells but 7% Myf5 labeled cells in scBAT? *

      __Response____: __We would like to clarify that both Pax3+ (6.7%) and Myf5+ (7.2%) progenitors label a similar percentage of brown adipocytes in the medial scBAT. These percentages are now provided in the revised text and can be visualized in Figure 1H and 2D. On the other hand, nearly none of lateral scBAT adipocytes were labelled by either Pax3+ or Myf5+ progenitors.

      Reviewer____ #3 (Significance (Required)): The discovery has significant implication as brown adipocytes are mainly found in the supraclavicular region in the human body.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript explores the cell origin of supraclavicular brown adipose tissue (scBAT). UCP1-expressing brown/beige adipocytes are found in several anatomical locations but most studies have been focused on intrascapular brown adipose tissue (iBAT) in rodents, a depot that only exist in infants (but not adults) in humans. Very little is known about the biology (including the origin) of other BAT deposes. Here using multiple lineage tracing tools and conditional KO mice the authors convincingly demonstrate that about 50% of the scBAT cells originate from Tbx1+ myoprogenitors and provide functional evidence that ablation of Pparg or Prdm16 in Tbx1-cells blocks development of scBAT or affects thermogenic function of mice. The study design is straightforward, well carried out and the conclusion is supported by the data. The discovery has significant implication as brown adipocytes are mainly found in the supraclavicular region in the human body. I have few relative minor comments.

      Tbx-Cre lineage tracing indicates that 50% scBAT cells are marked by Tbx1. The authors discussed this may be either due to low recombination efficiency of Tbx1-Cre or due to multiple lineage origin of scBAT cells. The possibility of Cre recombination efficiency can be addressed easily by genomic DNA recombination analysis.

      In the introduction the authors conclude that Myf5, Pax3 are "location markers" instead of cell identity markers. While I agree with this statement it is quite confusing. It implies that the study aims to identify cell identify markers but based on the analogy Tbx1 is also a location marker.

      Figure 8 human expression compares TBX1 mRNA levels in neck fat to subcutaneous white fat. This is different from the comparison of dorsal and ventral BAT in mice. Therefore, the data are not particularly relevant to this study and should be moved to a supplemental figure.

      Can the authors speculate/discuss on why there is no Pax3 labelled cells but 7% Myf5 labeled cells in scBAT?

      Significance

      The discovery has significant implication as brown adipocytes are mainly found in the supraclavicular region in the human body.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This study aimed to investigate the developmental origin of brown adipose tissue (BAT) in the supraclavicular region and its implications for metabolic health. The authors have used genetic fate mapping in mice to trace the lineage of brown adipocytes in the supraclavicular region. The findings revealed that supraclavicular brown adipocytes do not originate from the Pax3+/Myf5+ epaxial dermomyotome, which is responsible for the development of interscapular BAT. Instead, most supraclavicular brown adipocytes were marked by the Tbx1+ lineage, indicating that the pharyngeal mesoderm is involved in their development. This work provides the first evidence that scBAT adipocytes do not share the same embryonic origins as iBAT fat cells. By identifying the location-specific myogenic progenitors for supraclavicular BAT versus interscapular BAT, the researchers shed light on the distinct developmental origins of different BAT depots. Overall, these findings provide new insights into the developmental origin of supraclavicular BAT and highlight the need to consider the anatomical locations and developmental origins in studying BAT development and function.

      Major comments:

      The manuscript addresses an important yet understudied area and provides convincing results that support the key conclusions. The authors also engage in extensive discussion, speculating on the broader implications of the findings and outlining the limitations of their study.

      The observation that the loss of Pparg in Tbx1-expressing cells leads to a reduction in supraclavicular BAT is intriguing. However, as the authors acknowledge, Tbx1 is expressed in inguinal white adipocytes as well. Therefore, it remains unclear how changes in Tbx1 in ingWAT in this model might affect BAT depots. The authors should provide further clarification on this matter and include additional discussion regarding the potential indirect effects and limitations of these models.

      In Figure 5, it would be beneficial to include the expression of a broader range of thermogenic genes and proteins in both males and females. This would strengthen the argument that the loss of Pparg in Tbx1+ progenitors impairs scBAT function.

      Minor Comments:

      It would be more appropriate to present the results of the cold challenge studies as absolute body temperature rather than just the difference between groups. This is important because there could be baseline differences in basal body temperature among the experimental groups.

      It is recommended to include representative images of cre-negative animals to validate the reagents and models used in the study. Including such images would enhance the reliability of the experimental approach and strengthen the overall validity of the findings.

      The manuscript would benefit from placing the findings in the context of the broader field. For instance, discussing whether the loss of Prdm16 in Myf5 progenitors has a similar impact on interscapular BAT (iBAT) development or thermogenic function would provide valuable insights. Additionally, exploring the relationship between Tbx1 progenitors and adipocyte progenitors in adult BAT depots, such as Pdgfra+ and Trpv1+ progenitors, could further enhance our understanding of the developmental origins and functional characteristics of different adipocyte populations

      Significance

      Field of expertise: Adipose biology, Developmental biology, Thermogenesis

      The manuscript addresses an important and previously understudied area in the field of brown adipose tissue (BAT) development. The study challenges the long-standing assumption that Pax3+/Myf5+ progenitors in the dermomyotome are the sole developmental source of brown adipocytes in mice, including both interscapular BAT (iBAT) and other brown adipocytes.

      This work provides experimental evidence that the supraclavicular BAT (scBAT) adipocytes have a distinct embryonic origin compared to iBAT fat cells. While the developmental source of iBAT has been known for over a decade, this study demonstrates for the first time that scBAT adipocytes do not arise from the Pax3+/Myf5+ progenitors in the dermomyotome.

      The findings presented in this manuscript have the potential to make a lasting impact on the brown adipose tissue research community, particularly those interested in the developmental aspects of brown fat.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In this study, using genetic labeling and deletion modeling, the authors discovered that Tbx1 myoprogenitors, rather than Pax3+/Myf5+ cells, give rise to supraclavicular brown adipose tissue (scBAT). This finding is both intriguing and significant. Furthermore, the genetic ablation of PPARγ or PRDM16, driven by Tbx1-Cre, reduced the size/weight of scBAT and its thermogenesis function capacity, supporting the importance of Tbx-1 cells. Interestingly, the authors found that human scBAT, located in the deep neck region, exhibits higher Tbx1 expression than subcutaneous neck WAT, potentially indicating a similar origin of scBAT in rodents and humans. Overall, this novel finding is exciting and could push the BAT field into a new phase. The manuscript is also well-written and organized.

      Comments

      1. The authors assert that Myf5+ progenitors do not contribute to scBAT adipocytes. However, Figure 2C shows that 7% of medial scBAT are mG positive, suggesting a minor contribution of Myf5+ progenitors to scBAT. The conclusion that scBAT does not originate from Myf5+ precursor cells may be overly strong.
      2. The authors claim that "prdm16 is dispensable for the development of scBAT." However, supporting data seems insufficient. The absence of a difference in scBAT weight does not guarantee that development was unaffected. Additional experiments, like H&E staining and immunofluorescence (IF) staining of RFP/GFP, could help demonstrate a similar number of GFP+ brown adipocytes in scBAT, thereby supporting this statement.
      3. In the "scBAT contributes to temperature maintenance in mice" section, this phenomenon only seems to apply to female mice (Figure 5). This conclusion may need adjustment to account for this sex difference. Moreover, for females, are there H&E staining results available for scBAT? For males, is there a change in Ucp1 expression? It could be beneficial to examine mRNA expression levels for additional thermogenic genes, such as Dio2, Prdm16, Cidea, and PGC-1a.
      4. Considering that mutant mice have smaller scBAT, how does this difference influence glucose homeostasis between WT and mutants?
      5. In Figures 5G & M, it seems more accurate for the y-axis label to read "Body temperature change."
      6. There is inconsistency in scale bar labeling:
        • Figure 3A vs 3D
        • Figure 4B vs 4C
        • Figure 5L, where scale bars are missing in the H&E staining images
        • Figure S1, where the scale bar is in the middle of the image
      7. In Figure 3, the western blot data should be quantified, and the molecular weight (kDa) should be included to clarify the band's position.
      8. The statement "RT-qPCR revealed much higher levels of TBX expression in total lysates from deep neck BAT (Figure 8A)" could be clarified by adding "than neck WAT" at the end.

      Significance

      The major advancement of this study lies in the authors' novel finding of the embryonic lineage of brown adipocytes in the supraclavicular brown adipose tissue (scBAT) depot. They demonstrated that this lineage differs from those in the dorsal BAT depots by utilizing Myf5-Cre, Pax3-Cre, and TBX1-Cre reporter mouse models.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript by Lan et al. addresses the still incompletely resolved question as to how branching morphogenesis of the embryonic mammary epithelium is regulated at the molecular and cellular level. Using (combinatorial) primary explant cultures of wildtype and genetically engineered mouse embryos, in which the authors have developed a unique expertise over many years, together with imaging and RNAseq analyses, they (i) show that the timing of epithelial branching is dictated by the biological age of the epithelium, but that an epithelial-mesenchymal interaction is required to bestow branching ability on the mammary epithelium somewhere between E13.5 and E16.5, (ii) seek to determine if and how lineage and cell proliferation affect branching, (iii) show that while salivary mesenchyme can promote growth (i.e. branching density) of the E16.5 mammary epithelium, the mode of branching (i.e. lateral branching vs tip-clefting) is an intrinsic property of the mammary epithelium, (iv) use transcriptomics to identify genes that are likely to control either mammary- or salivary gland specific growth and/or branching patterns, (v) hypothesize that low levels of WNT signaling in the mammary gland mesenchyme (due to relatively high expression of WNT signaling inhibitors) are responsible for mammary specific branching, (vi) show that hyperactivation of WNT/CTNNB1 signaling in the mesenchyme indeed induces hyperbranching, (vii) identify Eda and Igf1 as putative mediators and paracrine signaling factors that regulate branching of the mammary epithelium upon secretion from the mesenchyme downstream of WNT/CTNNB1 signaling and (viii) show that mammary gland branching is impaired in Igfr1 null embryos.

      Major comments: 1. Overall, this is a solid study that is well controlled and technically of high quality. The materials and methods should allow follow up and replication by others and the transcriptomic data have been made available via NCBI GEO. I think the authors convincingly demonstrate points (i), (iii), (iv) and (vi) and (viii). I have some questions regarding (ii), (v) and (vii) and (viii) that I will pose below.

      Our response:

      We thank the reviewer for the careful assessment and recognition of our work. In the subsequent sections, we have tried to address all the concerns raised by the reviewer.

      Re: (ii): The authors try to study the link between basal cell fate and branching. They use position of the cells (which they describe clearly and which is a good choice), since they cannot use specific markers due to the fact that the basal and luminal linages have not yet segregated at this point. This part of the manuscript is not the most straightforward to follow. The most obvious experiment would have been to focus on the location of the cells and their associated cell cycle profile - but the authors themselves have just recently published a pre-print (their REF #54, now also out in JCB) that is an in-depth study of the link between cell proliferation + cell motility and branching, but this only becomes apparent in the discussion. In that sense, Fig2 of the current manuscript is less novel, although it is nice to see that it holds up in a slightly different analysis.

      Our response:

      We thank the reviewer for acknowledging our recently published work, which is focusing on the active branching phase during late embryogenesis/around birth. In the current proliferation analysis, however, our focus was on a different aspect of embryonic mammary gland development: understanding the mechanism underlying the ability to acquire competence to branch, i.e. how the epithelium changes between late bud and sprout stages. Our data obtained from tissue recombination and 3D culture experiments suggest that heterotypic mesenchymes or mesenchyme-free 3D organoid culture conditions do not provide sufficient signals to support branching of mammary epithelia before E16.5. We have rephrased the text to better emphasize this point.

      Instead of focusing on the cell cycle markers, the authors turn to a K14-Eda mouse model - which shows precocious branching and a temporary reduction in K8 expression. They also analyze Eda-KO embryos. Quite frankly, I find the authors' reasoning difficult to follow here and I cannot deduce how these experiments really address the question at hand (i.e. how lineage and cell proliferation affect branching), so I hope they can rewrite this section of the paper to make the arguments more clear and easy to follow for the reader who, at this point, knows little about Eda. For example, the authors present the argument that K14-Eda mice show a transient reduction in K8 expression - but we don't know if that also really means a (temporary?) change in (future?) luminal cell fate. In fact, since Eda later also makes an appearance as a candidate factor to be secreted by the mesenchyme together with Igf1, I wonder if their K14-Eda data would not be better suited to underscore that point instead and if the authors should perhaps eliminate this section altogether and just refer to their prior work in REF #45. If the authors think the current data add something more, than they need to be more explicit about this (and then also introduce the link to REF #45 in the results section).

      __Our response: __

      We agree with all the reviewers in that this part of the manuscript was not mature enough and provided only indirect evidence on the potential link between lineage segregation and branching ability. This is an important question in the field that merits a study of its own and should be addressed with better tools than those available to us at present. As suggested by reviewers #1 and #3, we have omitted this part in the revised manuscript.

      Re: (v): Do the authors have any WNT/CTNNB1 target genes that they can include in their transcriptomics analysis to show that the WNT/CTNNB1 signaling levels are indeed lower in the mammary mesenchyme? Axin2 comes to mind, but there are some other negative feedback targets that are often induced across tissues, e.g. Rnf43 and/or Znrf3 and/or Sp5?E.g. to include in FIg6E?

      __Our response: __

      In the original manuscript (lines 339-342), we have performed the GSVA analysis comparing the KEGG database, and the significantly altered pathways comparing different mammary mesenchymes with salivary gland mesenchyme have been pooled and displayed as heatmap in Supplementary Fig 4b. The WNT signaling pathway is lower in the mammary mesenchyme, especially at E16.5.

      As suggested by the reviewer, we have analyzed Axin2, the most commonly used readout of WNT/CTNNB1 signaling activity in our RNA-seq data that we include as a __new Supplemental Fig. 4c __in the preliminary revised manuscript. Axin2 data indicate that Wnt/β-catenin signaling activity is lower in the E16.5 fat pad, where branching takes place, compared to younger stages of mammary gland and the salivary gland.

      Plan for the final revision:

      Additionally, we will provide expression data of a transgenic Wnt reporter from the same developmental stages and tissues that were used to generate the RNA-seq data.

      Re: (vii) and (viii): The authors convincingly show the phenotype of the Igfr1 KO mice, but I hope the authors concur that an epithelial only Igfr1 KO (or alternatively a mesenchymal only Igf1 KO, or epithelial/mesenchymal recombination experiments with WT vs IGFR1 null or IGF1 null tissue, or experiments with small molecule inhibitors of IGF1/IGFR1 signaling) would have given more solid mechanistic evidence regarding the presumed paracrine effect of IGF1 signaling. I am not asking the authors to perform another mouse experiment or even generate or use these conditional strains, but if the authors agree, then I do think this would merit some attention in the discussion section. See also my comments regarding Eda in point 1.

      Our response:

      As shown in the current manuscript, Igf1 is expressed in the mammary and salivary gland mesenchyme. This finding is in line with E14 in situ expression data available in Genepaint (https://gp3.mpg.de/results/Igf1) showing that overall in embryonic tissues, Igf1 is mainly produced in mesenchymal tissues. Of note, in Genepaint, a clear signal can be detected in the salivary gland mesenchyme, not the epithelium. Published E16 and E18 datasets indicate low level of Igf1 expression in the mammary epithelium (https://wahl-lab-salk.shinyapps.io/Mammary_snATAC/). Hence, we conclude that Igf1 is mainly produced by mesenchymal cells. Instead, Igf1r appears to be rather ubiquitously expressed.

      A previous study assessed BrdU incorporation in Igf1r-/- mammary buds at E14.5, and reported a specific proliferation defect in the epithelium, while no difference was detected in the mesenchyme (Fig. 9, Heckman et al., 2007; PMID:17662267). However, we cannot exclude the possibility of autocrine, mesenchymal Igf1/Igf1r signaling, which in turn could lead to upregulation of a paracrine factor to regulate epithelial growth.

      We agree with the reviewer in that novel conditional mouse models are beyond the scope of the current study. However, we do not think that small molecule drugs could be used to block Igf1r activity in a tissue-specific manner neither.

      Plan for the final revision:

      To further delineate the paracrine and/or autocrine role of Igf1/Igf1r pathway during mammary epithelial growth and branching, we will perform tissue recombination experiments between Igf1r-/- and control mammary epithelium and mesenchyme, as suggested by the reviewer.

      Minor comments: - A few minor spelling/grammar errors, including a couple of "the"s missing (first line of the abstract, and also preceding "Majority" in line 148.

      Our response:

      We apologize for these slips. They have been corrected in the revised manuscript.

      • Line 517-518: please also include the details for the Eda mice.

      Our response:

      We apologize for missing this important information in materials and methods. We have included a short introduction of the K14-Eda mice, a new reference for the original publication producing them, as well as the Jackson Laboratories strain number for Eda-/- (a.k.a. Tabby) mice in the revised manuscript.

      • 1f spelling error: separation

      Our response:

      The spelling error has been corrected in the revised manuscript.

      **Referees cross-commenting**

      Having read all three review reports I think they are pretty much in agreement, with shared questions about the inclusion/meaning/discussion of the lineage specification data and also agreement about the overall technical solidity of the data and this approach.

      I gather that reviewer #2 asks for more controls than myself or reviewer #3 and while I think all of their points are valid, in principle, I don't think all of these are required. I should add that I am inclined to trust the authors on their ability to separate mesenchyme and epithelium as they have been developing and optimising this system over many years.

      Our response:

      We are grateful to the reviewer for the reliance on the technical aspect of our experiments. We do routinely monitor tissue purity in the recombinants (for more details, see our response to reviewer #2). To demonstrate this, we have included new data in new Supplementary Fig. 1a,b and new Supplementary Fig. 3. We believe these additions will further enhance the validity of our findings and effectively address the concerns raised by reviewer 2.

      Reviewer #1 (Significance (Required)):

      General assessment: This is a carefully executed study in which an impressive amount of (combinatorial) embryonic mammary tissue explant experiments are combined with quantitative imaging and transcriptomics analysis.

      The main limitations of the work lie in the fact that the investigation of a potential link between branching and the cell cycle is not entirely novel, as the authors themselves recently published an nice pre-print (now also out in JCB) describing similar analyses. In addition, the mechanistic link between WNT/CTNNB1 signaling in the mesenchyme and the paracrine signaling activities of the presumed downstream effectors EDA and IGF, while plausible, is not yet complete. The work also does not yet addresses what exactly the branching identity is that is bestowed upon the mammary epithelium between E13.5 and E16.5 and how this then becomes an intrinsic (epigenetic?) feature of the mammary gland.

      Advance: This work provides more insight into the embryonic branching of the mammary gland - a stage of mammary gland development that is still poorly understood and that is, in general, understudied. In part, the work confirms prior work in the literature (their REF #19) regarding mammary and salivary gland tissue recombination experiments. It supplements this with a more elaborate time series of heterochronic and heterologous epithelium/mesenchyme explant cultures, using genetically engineered (and fluorescently labeled) mouse tissues to allow better and quantitative imaging. The transcriptomic analysis of different mesenchyme populations is also informative and allows the researchers to propose a putative mechanism for why the mammary gland branches differently from the salivary gland. The advance is both technical and functional, as well as conceptual, with some advance in terms of mechanism.

      Audience: This works should appeal to mammary gland biologists interested in the molecular and cellular mechanisms of (early) mammary gland development, as well as to a broader community of developmental biologists studying branching morphogenesis in tissues such as lung, kidney and salivary gland.

      My expertise: WNT signaling and mammary gland biology, at the intersection of developmental, stem cell and cancer biology

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __

      The mammary gland is a branched structure that consists of a bilayered epithelium embedded in a specialized mesenchyme. In mice, at 11,5 days of embryogenesis, the ectoderm thickens forming 5 pairs of peculiar structures called placodes. During the following days, the placodes will grow and invaginate into the surrounding mammary mesenchyme and they will finally start to branch by the end of embryogenesis (E16). It has been suggested that the bidirectional communication between the growing mammary gland and the surrounding mesenchyme plays a pivotal role in the determination of each step of mammary gland development (placode formation, mammary bud invagination, gland outgrowth, branching). The role of different signalling molecules has already been shown, particularly for the placode growth and mammary bud invagination. Nevertheless, the pathways regulating embryonic mammary gland branching are still incompletely understood. In this manuscript, Lan and colleagues aim to decipher the correlation between different stages of mammary gland development such as proliferation, lineage segregation and ductal branching. Furthermore, they want to define which stage of mammary development is intrinsically determined by the epithelium and which one requires the supportive guidance of the mesenchyme. Lastly, they aim to discover the key signal for the growth and branching of mammary epithelium. To these purposes, they used an ex vivo model of heterochronic epithelial-mesenchymal recombination. In particular, they micro-dissected the epithelium and/or the mesenchyme from murine mammary glands at different stages of embryonic development (i.e. at E13,5 for the quiescent phase or 16,5 for branching phase) and explanted them together in different combinations using fluorescent reporters. To assess the role of the mesenchyme they also cultured the epithelium in a mesenchyme free 3d structure. Through this model they demonstrated that the presence of the mesenchyme is necessary for the priming of mammary epithelium for branching, since only E16,5 epithelial cells were able to grow and branch in a mesenchyme free 3D experiment. Nevertheless, intrinsic properties of the epithelium are necessary for the timing of branching, since E16,5 mesenchyme was not able to accelerate the outgrowth of E13,5 epithelia. In order to determine which epithelial properties are important, the authors correlated the beginning of cell proliferation in the embryonic mammary gland to the beginning of the branching phase. They indeed used the Fucci2a mouse model to carefully characterise the timing of mammary cells proliferation at different stages of embryonic development, concluding that the great majority of proliferating cells reside in the inner part of the mammary bud until E14,5, while in the external part at later stages. Regarding the importance of cell proliferation, Lan and colleagues claim that the beginning of the branching phase is not its direct consequence, thanks to the use of the K14Cre- Eda mouse model, known to have anticipated mammary gland development. Using this and the Eda-/- models, the authors also sustain that the branching occurs independently of the lineage specification of the epithelium. The use of salivary mesenchyme instead the mammary one was able to increase the number of branching of E16,5 mammary epithelium. Nevertheless, this model demonstrated that the branching pattern (side branching vs tip bifurcation) is an intrinsic feature of the epithelium. Lan and colleagues also defined the transcriptomic profiles of the mammary and salivary mesenchymes at different stages. In particular, they observed an increased expression of negative regulators of Wnt pathway in the mammary mesenchyme compared to the salivary mesenchyme. Moreover, using a mouse model where B-catenin is stabilised, they observed increased tip production in the mammary gland epithelium. They also showed that IGF1 production is increased after Wnt pathway activation and they tested its function, both treating their ex vivo cultures with exogenous IGF1 and using Igf1r-/- mouse models.

      Major comments 1- The great majority of the results of the manuscript are based on an ex vivo model of heterochronic epithelial-mesenchymal recombination. Since the authors are studying the effect of the mesenchyme of different stages on the epithelium (and vice versa), the purity of the two compartments after the dissection is particularly important. Although they said that the purity is evaluated (line 112), it would be important to show a control staining in which they use known markers of the mesenchyme with no colocalization with the fluorescent reporter of the epithelium.

      Our response:

      We agree with the reviewer that the purity of the separated tissues is very important for our conclusions. This is why we have used genetically labeled tissues in all recombination experiments: the epithelium and the mesenchyme were always isolated from embryos ubiquitously expressing GFP or tdTomato. We find this the most reliable way to assess the origin and purity of the isolated tissues. If there was any carry-over mesenchyme isolated with the GFP+ epithelium, this would be revealed as GFP+ mesenchymal cells in the recombinants consisting of otherwise tdTomato+ mesenchyme. And vice versa: any carry-over tdTomato+ epithelium isolated with the mesenchyme would be revealed as tdTomato+ epithelial cells in the recombinants. We apologize for not making this clear enough in the original manuscript. In the revised manuscript, we now provide confocal high-resolution images of the recombinant explants (new Supplementary Fig. 1a,b). The explants have been co-stained with the epithelial marker EpCAM, revealing a robust colocalization between the ubiquitously expressed florescent labels in the designated epithelial tissues and the EpCAM.

      2- Another important point for understanding the quality and impact of these findings is to assess the similarities and differences, if there are, between the in vivo mesenchyme and the ex vivo one. Indeed, once explanted and put in culture, mesenchymal cells could change their transcriptomic profile and consequently change their signals to the epithelium. The authors should assess the expression of the genes and pathways studied during embryonic development in vivo.

      Our response:

      The reviewer is correct in that the transcriptomes will likely undergo some changes when organs are cultured ex vivo. This is why RNA-seq was done on freshly isolated tissues. Regarding the potential changes taking place ex vivo, however, we do not consider them relevant with respect to the questions we are addressing in this study. The reason is (as reported in the manuscript) that all control recombinations (homochronic recombinations such as E13 epithelium + E13 mesenchyme, E16 epithelium + E16 mesenchyme etc.) branched essentially as in vivo. Therefore, we find the results and conclusions made from the tissue recombination experiments solid.

      3- The authors clearly showed that E16,5 epithelium is able to branch in a mesenchyme free 3D culture model, while epithelia from earlier stages don't. This led to the conclusion that mesenchyme is necessary for acquiring the branching ability. Nevertheless, the authors also said that early stages epithelia scarcely grow in the mesenchyme free 3D culture. Therefore, the lack of branching may be due to the lack of growth, if not the increase of death, of epithelial cells. The authors should quantify the size and the cell death of the epithelia in the different culture conditions and discuss better this point.

      Our response:

      The reviewer is correct in that one of the key functions of the mammary mesenchyme up to E16.5 may be to provide survival signals for the epithelium, and this might explain why epithelia younger than E16.5 fail to grow/branch when recombined with salivary gland mesenchyme and in mesenchyme-free organoid culture.

      Plan for the final revision:

      To address this issue, we will assess apoptosis in mammary epithelia cultured in the mesenchyme-free 3D culture organoid set-up.

      4- The Fucci2a model allowed to assess the proliferation of embryonic mammary epithelium, showing that the great majority of proliferating cells are basal, at late stages of development (line 182). As it has already been shown, lineage specification is a late process during mammary gland development. The fact that the proliferating cells reside at the external part of the bud does not mean that they are basal cells yet. A p63/K8 staining could be important to understand if the increased proliferation occured in already specified basal cells or not.

      __Our response: __

      Indeed, mammary lineage specification is a later process. As pointed out in the manuscript and by reviewer #1, the widely used basal and luminal lineage markers have not yet segregated to separate compartments at the developmental stages analyzed in our study, and therefore cannot be used as tools for this purpose. We would like to emphasize that in the manuscript, we analyzed the cells based on their position, and have used the term basal to indicate the basal position, not the prospective lineage. Accordingly, we used the term inner instead of luminal cells to indicate their location, not lineage. We have further clarified this point in the preliminary revised manuscript.

      5- The use of Fucci2a model showed that 20% of epithelial cells are proliferative at E13,5. This phase is considered as "quiescent" by the authors (line 120), but the moderate proliferation rate shown in this experiment demonstrated that it is not. A change of the nomenclature is needed.

      __Our response: __

      We have removed the word “quiescent” from the text.

      6- Through the use of K14-Eda and Eda-/- models, the authors claimed that the lineage specification is not a prerequisite for ductal branching. To support this point, they showed that the K14-Eda mice have an anticipated branching although the expression of K8 in the inner part of the bud is transitorily decreased. The authors link the K8 downregulation to a transient suppression of the luminal lineage, but this is clearly overclaimed. Although K8 is a known marker of luminal lineage, the downregulation of one marker is not sufficient to support their thesis. They should first check more markers and in particular critical regulators of luminal lineage as Notch1, Foxa1 and Elf5. Lately, the use of different models that drive embryonic epithelial cells to a forced lineage commitment (Notch1 or Δnp63 overexpression) would support more their claim. As additional evidence, the authors showed that Eda is able to promote basal cell signature. Firstly, the authors should better explain why this point would support their thesis. Secondly, the supplementary figure 2b does not show which genes are taken into account to define the basal signature. A list of these genes would be helpful, as well as staining for some representative proteins.

      Our response:

      We thank the reviewer for these constructive suggestions. We agree with all reviewers in that this part of the manuscript was not mature enough and provided only indirect evidence on the potential link between lineage segregation and branching ability. This is an important question in the field that merits a study of its own to be addressed with better tools than those available to us at present. As suggested by reviewers #1 and #3, we have omitted this part in the revised manuscript.

      7- The authors used the same mouse models to assess the importance of proliferation in the determination of ductal branching and they claimed that proliferation is not a sufficient feature. This conclusion was supported by two observations. The first one is the fact that the K14-Eda model shows an increased cell proliferation at early stages compared to wt, coupled with anticipated branching. Secondly, although having smaller glands compared to wt and showing a delay in ductal branching, Eda-/- mice have an epithelial proliferation rate very similar to wt. Again, the conclusion that proliferation is not sufficient for branching is overclaimed. Firstly, the authors should explain how the buds in wt and Eda-/- mice have different sizes although the similar proliferation (increased cell death?, cellular volume?). Secondly, to support the thesis that proliferation is not sufficient for branching, functional experiments should be performed (see point 12). For instance, the short-time treatments with inhibitors or promotors of proliferation may help to understand the effective role of proliferation in the determination of branching.

      Our response:

      We show that there is no direct link between onset of proliferation and acquisition of branching ability. However, we are not claiming that proliferation is not important for branching, as obviously new cells are needed as building blocks of growing tissues. In a recently published paper, we have assessed the role of proliferation in branch point formation in embryonic mammary glands. Using mitomycin C to block proliferation, we showed that initiation of new branches occurs even when proliferation is blocked (Myllymäki et al., JCB2023, PMID: 37367826).

      The reviewer was also asking why Eda-/- mammary primordia are smaller at E15.5-E16.5 despite similar proliferation rates. In the revised manuscript, we have quantified the volume of E13.5 Eda-/- and control mammary buds and show that Eda-/- buds are ~25% smaller (3.5 ± 0.8 x 105 µm3 in Eda-/- vs. 4.6 ± 0.7 x 105 µm3 in control, mean ± SD) already at the bud stage (new Supplementary Fig. 2c,d).

      We have also quantified the cellular size in Eda-/- and control mammary glands at E13.5 and E15.5 and found that mammary epithelial cells in Eda-/- embryos are ~15% smaller (new Supplementary Fig. 2e,f). Together, these data indicate that the smaller size of E15.5-E16.5 Eda-/- mammary glands is a combinatorial effect the smaller mammary anlage at E13.5 and smaller cell size. These findings, while interesting on their own, do not challenge our conclusions regarding the link between onset of proliferation and acquisition of branching ability.

      8- The heterotypic epithelial-mesenchymal recombination using the salivary gland is interesting. Nevertheless, some stainings to assess the purity of their systems are again required (e.g., marker of salivary epithelium to verify the purity of the mesenchyme and vice versa).

      __Our response: __

      As mentioned above, all tissue recombination experiments were performed so that the epithelium and the mesenchyme originated from genetically labelled embryos expressing different fluorescent proteins. In the revised manuscript, we provide confocal images of the salivary-mammary tissue recombinants (new Supplementary Fig. 3), confirming the purity of the tissue compartments used in these experiments.

      This model clearly showed that the mammary epithelium can form more branching when combined with the salivary mesenchyme. Moreover, the salivary epithelium preferentially branches through tip bifurcation, while mammary epithelium combined with the salivary mesenchyme has a mixed pattern of tip bifurcation and side branching (typical of the mammary gland). The authors thus concluded that the branching pattern is an intrinsic feature of the epithelium. However, a comparison between the percentage of tip bifurcation and side branching in the heterotypic combination and the homotypic combination between mammary epithelium and mammary mesenchyme is crucial to understand this point. Indeed, these results are not sufficient to exclude that the branching pattern is partially determined by intrinsic features and partially by extrinsic signals. The authors should carefully quantify the branching pattern in the homotypic combination and compare that to the heterotypic one. If the percentage of tip bifurcation do not change, their conclusion is correct; if this percentage increases in the heterotypic combination, it would be a sign of a partial effect of the signals of the mesenchyme.

      Our response:

      We thank the reviewer for raising this question. We have independently generated data on the type of mammary gland branching events in two papers with somewhat different culture and imaging conditions (Lindström et al., BiorXiv 2022 and Myllymäki et al., JCB, 2023, PMID: 37367826). Both analyses showed that in embryonic mammary glands, the majority of branching events (~70%) occurs by side-branching. These data are in line with the current study that we have now complemented to include also the mammary-mammary recombination experiments (revised Supplementary Video 1, revised Fig. 4b). Quantification of branching events revealed no significant difference in the type of branching events of mammary epithelia grown with salivary or mammary gland mesenchyme (revised Fig. 4c), further supporting our initial conclusions.

      9- Through the analysis of their transcriptomic data, Lan and colleagues found that the mammary mesenchyme expresses higher levels of negative regulators of Wnt pathway compared to the salivary mesenchyme. To demonstrate the value of their findings, they should confirm this in vivo, through staining of known Wnt proteins on the salivary and mammary mesenchymes at the embryonic stage.

      Our response:

      In mammals, there are 19 Wnt ligands, over a dozen secreted Wnt inhibitors, 10 Frizzled receptors, two Lrp co-receptors, and numerous other pathway modifiers that contribute to the net Wnt signaling activity in a complex manner. Furthermore, it has been “notoriously difficult to generate useful antibodies to vertebrate Wnt proteins...In general, these sera do not detect endogenous Wnt proteins in cell extracts, nor do they detect Wnt proteins in tissues by staining techniques. Hence, there are few data on Wnt protein distribution in intact vertebrate animals.” This is a direct citation from the Wnt Homepage, maintained by the Nusse Lab; https://web.stanford.edu/group/nusselab/cgi-bin/wnt/reagents#antibod.

      For all these reasons, we do not find this approach feasible nor informative.

      Instead, in the revised manuscript, we report the expression levels of Axin2, the most commonly used transcriptional readout of canonical Wnt activity in our RNA-seq data (new Supplementary Fig. 4c). Axin2 levels are lowest in the E16 fat pad where mammary branching takes place, much lower than in any other tissues analyzed in the study.

      Plan for the final revision:

      To complement these findings, we will additionally provide expression data of a transgenic Wnt reporter from the same developmental stages and tissues that were used to generate the RNA-seq data.

      10- Since the ability of the salivary mesenchyme to promote a higher rate of branching in the mammary epithelium, the authors wanted to assess what could be the role of Wnt signalling. To do so, they used a mouse model where B-catenin is stabilised, allowing an increased Wnt signalling in the mammary mesenchyme. As a result, they observed increased branching in the mammary epithelium. They also found that IGF1 is a ligand regulated by Wnt pathway in the mesenchyme. Therefore, the use of exogenous IGF1 in their ex vivo model was able to increase the branching of the mammary epithelium. Moreover, Igf1r-/- embryos showed a significant decrease of mammary gland branching. The conclusion based on these experiments was that the Wnt-Igf1-Igf1r axis plays a pivotal role in the promotion of mammary gland branching during embryogenesis. This conclusion is overclaimed for different reasons. Firstly, the normalization of the ductal branching to the body weight is insufficient to exclude that the impact of the Igf1r knockout may have severe consequences on the mammary gland formation, upstream of the ductal branching. Another parameter for this normalization is required (e.g., size of the bud before branching, proliferation status, etc).

      Our response:

      We agree with the reviewer in that Igf1r knockout may affect mammary gland formation in multiple ways, and also prior to onset of branching, as already indicated in the original manuscript: “…apart from one study reporting the smaller size of the E14 mammary bud in IGF-1R deficient embryos …” (line 398-399 in the revised version) and ‘…mammary gland 3 that was consistently absent.’ (line 414-415 in the revised version).

      To assess whether the reduced size and branching of E16.5/E18.5 Igf1r-/- mammary glands is merely a consequence of the smaller anlage, the revised manuscript includes new data reporting quantification of the volume of mammary gland 2 of Igf1r-/- and wild type littermate embryos at E13.5, E16.5, and E18.5 from 3D confocal images of whole mount EpCAM stained mammary glands. As can be seen from the new Fig. 7g-h, at E13.5, the mutant mammary buds are about 60% of the size of the controls, at E16.5, 25% and at E18.5 only 20 % revealing a progressive defect, indicative of a specific defect at the outgrowth and branching stage. This conclusion was validated by normalization to the body weight: at E13.5 the size of Igf1r-/- mammary anlage did not differ from that of the wild type embryos (p = 0.11), at E16.5 the sprouts were smaller in the mutants, though the difference did not reach statistical significance (p = 0.08), while at E18.5, the Igf1r-/- mammary glands were significantly smaller (p = 0.000021) (new Fig. 7i). We find these data compelling evidence for a specific role for Igf1r in outgrowth and branching of the embryonic mammary gland.

      The use of alternative models to specifically knockout the receptor in the epithelium or the ligand in the mesenchyme (e.g. viruses) would be even more useful to specifically focus on the role of this pathway for ductal branching excluding side effects.

      Our response:

      We thank the reviewer for this suggestion. Unfortunately, based on our experience, viral shRNA delivery is not sufficiently efficient for effective gene silencing, unlike Cre delivery for a gain-of-function approach (used in the current study to flox out exon 3 of beta-catenin) in case where the endogenous pathway activity is very low and therefore, targeting even a subset of cells is sufficient for upregulation of paracrine factors.

      Plan for the final revision:

      To address the question on the autocrine or paracrine role of Igf1r, we will perform tissue recombination experiments between Igf1r-/- and control mammary epithelium and mesenchyme.

      Another limit of this model is the fact that Igfr1 can be bound by Igf2 as well and we cannot exclude that this has an impact too (except if Igf2 is not expressed at this stage). A quantification of Igf2 expression may be useful.

      Our response:

      Indeed, we cannot exclude the possibility that Igf2 could also play a role (Igf2 expression was similar to Igf1 in our RNA-seq dataset, see Supplementary Fig. 5), but the connection of mesenchymal Wnt signaling activity was to Igf1, not Igf2 – in fact Igf2 was somewhat downregulated in Wnt3A treated sample reported by Wang et al. (Wang et al., 2021) (highlighted by an arrow in the revised Fig. 6). We have also clarified this point in the Discussion of the preliminary revised manuscript.

      11- From the experiments presented in this section it is clear that Wnt-Igf1-Igf1r axis has to be finely regulated to have the correct amount of ductal branching in the embryonic mammary epithelium. Nevertheless, the author just showed the RNA levels of Igf1 in the different compartments they have analysed. Stainings to see the effective presence of the ligand on the tissue is mandatory to clarify the role of this axis in the ductal branching in vivo.

      Our response:

      Igf1-Igf1r signaling plays a critical growth promoting function during embryonic and postnatal development. The expression of Igf1 at RNA and protein level has been detected in almost all tissues in humans (Daughaday et al., Endocr. Rev., 1989; PMID: 2666112). Given that Igf1 is a secreted protein and multiple Igf binding proteins (Igfpbs) (that regulate the bioactivity of Igf1 by sequestering it) are expressed in the mammary and salivary gland mesenchyme (Supplementary Fig. 5), we find it unlikely that Igf1 staining would provide any additional information to the current study, as they cannot be used to assess the source of Igf1, nor the location of the signaling activity.

      Furthermore, as underlined by the authors, this axis is specifically important and upregulated in the salivary gland. Due the limit of the Igf1R-/- model, we cannot exclude that, although Wnt-Igf1-Igf1r axis is able to increase the branching ability of mammary epithelium, the normal branching rate observed in wt mice is due to other pathways.

      Our response:

      We agree with the reviewer in that other pathways are also important in regulating normal mammary gland branching, for example, Eda/NF-κB and FGF pathways as we described in the Introduction. Our results do not exclude the possibility that also pathways other than Wnt regulate Igf1 expression. The reviewer is correct that if a paracrine factor is expressed in the salivary gland but not in the mammary mesenchyme, its physiological effect may be limited to the salivary gland. Indeed, cluster 5 identified by the mFuzzy analysis (Fig. 5f) is likely to include some genes like that. This is why we decided to focus on cluster 6 genes like Igf1. In the revised manuscript, we have better highlighted the difference between cluster 5 and 6 genes.

      Unfortunately, with the currently available tools, we cannot test the importance of the endogenous mesenchymal Wnt signaling activity by inactivating Wnt signaling activity specifically in the mesenchyme at the time point when branching begins. This would require an inducible mesenchymal Cre line (mesenchymal β-catenin is essential for the early fate specification of the primary mammary mesenchyme; Hiremath et al., 2012, PMID: 23034629), and conditional β-catenin null mouse. We do not have such mice available and we find that these experiments are beyond the scope of the current study.

      12- Lastly, once claimed to have found the key factor necessary for ductal branching promotion, the authors should also test if the proliferation and lineage segregations are unaffected in this context, confirming their dispensable role claimed in the initial part of the manuscript.

      __Our response: __

      Igf1/Igf1r is well-known for its growth promoting function via cell proliferation. We have no reasons to think that this would not be the case also in the mammary gland, and it was not our intention to give the impression that proliferation was not affected. In fact, Hiremath et al. (2012) already reported a defect in epithelial cell proliferation in Igf1rmammary buds at E14. Our key finding is that compared to other organs, the mammary gland is particularly sensitive to loss of Igf1r during branching morphogenesis. Finally, as pointed out earlier, better tools will be needed to assess the potential link between lineage segregation and onset of branching, a topic that we hope to address in the future.

      Minor comments: 1- An important paper on mammary gland ductal branching was published on Nature in 2017 by Scheele and colleagues and should be presented in the introduction, even though it is at later stages (after birth).

      Our response:

      We thank the reviewer for the suggestion. In the revised manuscript, we have added the findings from Scheele et al. 2017 in the introduction.

      2- In line 136 and 139 the authors referred to Fig 2 but it should be Fig 1

      Our response:

      We apologize for these slips. They have been corrected in the revised manuscript.

      3- The sentence on line 142 should be rephrased, since "advanced developmental stages" may be referred to pubertal development. The authors should specify that they are talking about embryonic development.

      Our response: We apologize for the potential misunderstanding. In the revised manuscript, we have used the phrase “advanced embryonic developmental stage” to describe our conclusion more precisely.

      Reviewer #2 (Significance (Required)):

      Overall, the authors concluded that embryonic mammary gland development and branching are extremely sensitive to the loss of IGF1, normally produced by the mesenchyme. The topic of the paper is interesting, the experimental approaches are well conceived, the data are convincing and the findings are of interest to developmental biologists. Nevertheless, there are some significant points that need to be further investigated before considering the manuscript suitable for publication:

      Our response:

      We thank the reviewer for the careful assessment and positive feedback of our manuscript. We have already addressed most of the points raised and most remaining ones will be addressed in the final revised manuscript.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __

      Here the authors use classical embryonic tissue recombination and pharmacological manipulation of explants in conjunction with cutting edge 3D imaging of tissue derived from highly sophisticated reporter and knock-out mouse models and state of the art transcriptomic analysis to masterfully delineate and dissect regulatory pathways critical for embryonic mammary development. Specifically, they set out to parse regulation of proliferation from that of branch patterning.

      While it has long been established that epithelial-mesenchymal interaction is necessary for mammary branching this work shows by heterochronic recombination that initiation mammary branching is not advanced by mesenchymal stage. By examining Fucci2a embryos the authors demonstrate that branching is preceded by a significant increase in basal cell-biased proliferation but, through further analysis of Eda gain and loss of function mice, conclude that proliferation per se does not cause branching. They show by heterotypic recombination with salivary tissue that early mammary epithelia rudiments require their own mesenchyme for survival and that although later E16.5 rudiments expand more robustly when in contact with salivary mesenchyme they nevertheless retain their characteristic mammary branch pattern. Thus, they establish that initiation and patterning are intrinsic properties of the epithelium but that early survival and later expansion/proliferation is regulated by the mesenchymal context. By transcriptomic comparison of mammary and salivary mesenchyme they reveal that genes encoding canonical Wnt attenuators and antagonists are highly expressed in early mammary mesenchyme and drop as branching ensues. The low expression of these negative regulators of Wnt signaling in salivary mesenchyme is proposed as an explanation for its growth and branch stimulating capability. In keeping with these observations, the authors show that experimental activation of mammary mesenchymal Wnt signaling augments both growth and branching. Lastly, they identify transcriptomic changes in IGF1 coincident with the initiation of mammary branching and confirm its role by extending analyses of the effects of gain and loss of function of IGF1 on embryonic mammary development.

      This is a thorough, well-constructed paper that adds new knowledge and important conceptual nuance and mechanistic insight to classical findings on branch patterning. This work is a technical tour de force and backed by solid quantitative and statistical analysis throughout. Their experimental approach is superb and the conclusions are sound. Their findings will be of great interest to the community of mammary gland biologists and to the wider field of embryologists focused on early development of a broad range of ectodermal appendages.

      I have some minor criticisms that I believe can be quickly remedied in a minor rewrite and suggestions for the authors consideration to improve the manuscript discussion as follows:

      Minor issues Abstract, line 37: The authors misuse the word "decompose" - it should be "deconstruct"

      __Our response: __ We thank the reviewer for pointing out our mistake, which we have corrected in the revised manuscript.

      Results, p7 line 48: Add "The" to the sentence: "The majority...."

      __Our response: __ Corrected it in the revised manuscript.

      P8 line 173 This sentence refers to Figure 2G which is a quantitative plot. I would suggest replacing the word "cluster" which implies a spatial organization with the word "subset" or "significant fraction" The spatial data in Fig 2d support basal bias but do NOT to my eye show any clustering - in fact the proliferative basal cells appear to be evenly dispersed within the basal layer.

      Our response:

      We thank the reviewer for highlighting this aspect. We agree that “significant fraction” is a more suitable term than “cluster”.

      P9 line 188: The statement on basal cell lineage specification needs a reference.

      __Our response: __

      Following the suggestions from reviewers #1 and 3, we have removed the content about lineage segregation in Results, together with this sentence.

      P10 line 201-216 I found the section on lineage specification (fig S2) weaker than the rest and a distraction from the main thrust of the paper making it difficult for the reader to focus. I suggest omitting this section and supplemental figures associated with it altogether.

      __Our response: __

      We agree with all reviewers in that this part of the manuscript was not mature enough and provided only indirect evidence on the potential link between lineage segregation and branching ability. This is an important question in the field that merits a study of its own that should be addressed with better tools than those available to us at present. As suggested by reviewers #1 and #3, we have omitted this part in the revised manuscript.

      P9 line 190: "displays precocious onset of branching" it is sufficient to say: displays precocious branching - the use of both "precocious" and "onset' is redundant.

      P10 line 229 Similarly, delete "the onset of branching was delayed" it is sufficient to say: branching was delayed.

      __Our response: __ Both sentences have been corrected it in the revised manuscript.

      P11 line 243: Delete "on the regulation of the" and substitute the word "to" in the sentence: "Next, we shifted our focus on the regulation of the branching pattern, which is thought to be determined by mesenchymal cues."

      __Our response: __ Corrected it in the revised manuscript.

      P11 line 241 subtitle and Figure 4 title: The disparity in titles here is jarring for the reader: Results text subtitle: "Salivary gland mesenchyme is rich in growth-promoting cues, but does not alter the mode of branch point formation of the mammary epithelium". Figure 4 Title: "Mammary mesenchyme is indispensable for the branching ability of the mammary gland". I suggest to the authors divide the figure as well as the text to make the two points indicated by their disparate titles separately.

      __Our response: __ We thank reviewer for the suggestion to clarify the Results part of the manuscript. As suggested, we have split the data under two separate subtitles, but due to limitations in figure numbers, we prefer to report these data in one figure panel.

      P12 line 279 From here on out the manuscript has a tendency to use the term "growth" ambiguously - in many instances it is unclear do they mean expansion, proliferation, increased branch number/ morphology?? Please try to clarify.

      __Our response: __

      Our aim is to use the term growth to mean tissue growth (expansion). We hope that this is clearer in the revised manuscript.

      P16 line 341 use word "prompted" instead of word "promoted"

      __Our response: __ We thank reviewer for spotting out the slip, which we have corrected in the revised manuscript.

      P16 line 382: include word "embryonic" before "mammary development"

      __Our response: __ We have modified the text in the revised manuscript.

      Discussion P18 line 416: Add the words "later stage (E16.5)" to the sentence: "Importantly, we demonstrate that salivary gland mesenchyme could only promote the growth of later stage (E16.5) mammary epithelium"

      __Our response: __ We thank reviewer for the suggestion. We have modified the text in the revised manuscript.

      P19 line 437: Given the authors statement "Instead, cell motility is critical for branch point formation in the mammary gland" they should consider a brief sentence mentioning their transcriptomic findings on cadherin 11 and Tenascin.

      __Our response: __ We thank the reviewer for appreciation of our transcriptomic data. In the revised manuscript, we have added the following text in discussion: “Accordingly, we observed significantly increased expression of cell migration promoting genes such as Cdh11 (encoding Cadherin 11), and Tnc (encoding Tenascin C) 60,61 in the E16.5 mesenchyme compared to E13.5 (Supplementary Table 2).”

      P19 line 451: Similarly, given their statement "This observation suggests that mammary epithelium itself carries the instructions dictating the mode of branching" they could consider their transcriptomic data on Ltbp1 in "mammary specific" clusters 7,8,9 as a matrix molecule initially expressed by mammary mesenchyme but which becomes expressed by luminal epithelial cells at precisely the time they acquire lineage specification and intrinsic branching capability.

      __Our response: __ This is an excellent suggestion. We have added following text in discussion: “It is worth noting that certain mesenchymal factors, such as Ltbp1, began transitioning towards epithelium-specific expression around E16.5 69. Exploring the potential impact of these factors on the self-instructed branching capacity of the mammary epithelium could yield valuable insights.”

      P20 lines 462-470 The authors should address their theory of Wnt suppression in the mammary mesenchyme in the context, albeit conflictingly, of earlier studies showing expression of Wnt signaling reporters, in either epithelial or mesenchymal locations during early stages.

      Our response: __ We thank reviewer for the suggestion. In the preliminary revised manuscript, we report Axin2 expression data as __new Supplementary Fig. 4c. Axin2 expression data suggest that Wnt/β-catenin activity is lowest in the E16.5 fat pad (where branching takes place) compared to all other tissues analyzed in the study.

      Plan for the final revision:

      For the final revised manuscript, we will additionally generate transgenic Wnt reporter expression data (see also our response to point 3 of Reviewer #1). These results will be discussed in light of the published Wnt reported literature in the final revised manuscript.

      Reviewer #3 (Significance (Required)):

      Here the authors use classical embryonic tissue recombination and pharmacological manipulation of explants in conjunction with cutting edge 3D imaging of tissue derived from highly sophisticated reporter and knock-out mouse models and state of the art transcriptomic analysis to masterfully delineate and dissect regulatory pathways critical for embryonic mammary development. Specifically, they set out to parse regulation of proliferation from that of branch patterning.

      This is a thorough, well-constructed paper that adds new knowledge and important conceptual nuance and mechanistic insight to classical findings on branch patterning. This work is a technical tour de force and backed by solid quantitative and statistical analysis throughout. Their experimental approach is superb and the conclusions are sound. Their findings will be of great interest to the community of mammary gland biologists and to the wider field of embryologists focused on early development of a broad range of ectodermal appendages.

      Our response:

      We much appreciate the positive evaluation of our manuscript. We have addressed all the feedback provided by the reviewer 3 in the preliminary revised manuscript, except the last point, which will be included in the final revision along with the new data on the Wnt reporter expression.

      Field of expertise: Embryonic and adult mammary development, Wnt signaling, cell adhesion

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Here the authors use classical embryonic tissue recombination and pharmacological manipulation of explants in conjunction with cutting edge 3D imaging of tissue derived from highly sophisticated reporter and knock-out mouse models and state of the art transcriptomic analysis to masterfully delineate and dissect regulatory pathways critical for embryonic mammary development. Specifically, they set out to parse regulation of proliferation from that of branch patterning.

      While it has long been established that epithelial-mesenchymal interaction is necessary for mammary branching this work shows by heterochronic recombination that initiation mammary branching is not advanced by mesenchymal stage. By examining Fucci2a embryos the authors demonstrate that branching is preceded by a significant increase in basal cell-biased proliferation but, through further analysis of Eda gain and loss of function mice, conclude that proliferation per se does not cause branching. They show by heterotypic recombination with salivary tissue that early mammary epithelia rudiments require their own mesenchyme for survival and that although later E16.5 rudiments expand more robustly when in contact with salivary mesenchyme they nevertheless retain their characteristic mammary branch pattern. Thus, they establish that initiation and patterning are intrinsic properties of the epithelium but that early survival and later expansion/proliferation is regulated by the mesenchymal context. By transcriptomic comparison of mammary and salivary mesenchyme they reveal that genes encoding canonical Wnt attenuators and antagonists are highly expressed in early mammary mesenchyme and drop as branching ensues. The low expression of these negative regulators of Wnt signaling in salivary mesenchyme is proposed as an explanation for its growth and branch stimulating capability. In keeping with these observations, the authors show that experimental activation of mammary mesenchymal Wnt signaling augments both growth and branching. Lastly, they identify transcriptomic changes in IGF1 coincident with the initiation of mammary branching and confirm its role by extending analyses of the effects of gain and loss of function of IGF1 on embryonic mammary development.

      This is a thorough, well-constructed paper that adds new knowledge and important conceptual nuance and mechanistic insight to classical findings on branch patterning. This work is a technical tour de force and backed by solid quantitative and statistical analysis throughout. Their experimental approach is superb and the conclusions are sound. Their findings will be of great interest to the community of mammary gland biologists and to the wider field of embryologists focused on early development of a broad range of ectodermal appendages.

      I have some minor criticisms that I believe can be quickly remedied in a minor rewrite and suggestions for the authors consideration to improve the manuscript discussion as follows:

      Minor issues

      Abstract, line 37: The authors misuse the word "decompose" - it should be "deconstruct"

      Results, p7 line 48: Add "The" to the sentence: "The majority...."

      P8 line 173 This sentence refers to Figure 2G which is a quantitative plot. I would suggest replacing the word "cluster" which implies a spatial organization with the word "subset" or "significant fraction" The spatial data in Fig 2d support basal bias but do NOT to my eye show any clustering - in fact the proliferative basal cells appear to be evenly dispersed within the basal layer.

      P9 line 188: The statement on basal cell lineage specification needs a reference.

      P10 line 201-216 I found the section on lineage specification (fig S2) weaker than the rest and a distraction from the main thrust of the paper making it difficult for the reader to focus. I suggest omitting this section and supplemental figures associated with it altogether.

      P9 line 190: "displays precocious onset of branching" it is sufficient to say: displays precocious branching - the use of both "precocious" and "onset' is redundant.

      P10 line 229 Similarly, delete "the onset of branching was delayed" it is sufficient to say: branching was delayed.

      P11 line 243: Delete "on the regulation of the" and substitute the word "to" in the sentence: "Next, we shifted our focus on the regulation of the branching pattern, which is thought to be determined by mesenchymal cues."

      P11 line 241 subtitle and Figure 4 title: The disparity in titles here is jarring for the reader: Results text subtitle: "Salivary gland mesenchyme is rich in growth-promoting cues, but does not alter the mode of branch point formation of the mammary epithelium". Figure 4 Title: "Mammary mesenchyme is indispensable for the branching ability of the mammary gland". I suggest to the authors divide the figure as well as the text to make the two points indicated by their disparate titles separately.

      P12 line 279 From here on out the manuscript has a tendency to use the term "growth" ambiguously - in many instances it is unclear do they mean expansion, proliferation, increased branch number/ morphology?? Please try to clarify.

      P16 line 341 use word "prompted" instead of word "promoted"

      P16 line 382: include word "embryonic" before "mammary development"

      Discussion P18 line 416: Add the words "later stage (E16.5)" to the sentence: "Importantly, we demonstrate that salivary gland mesenchyme could only promote the growth of later stage (E16.5) mammary epithelium"

      P19 line 437: Given the authors statement "Instead, cell motility is critical for branch point formation in the mammary gland" they should consider a brief sentence mentioning their transcriptomic findings on cadherin 11 and Tenascin.

      P19 line 451: Similarly, given their statement "This observation suggests that mammary epithelium itself carries the instructions dictating the mode of branching" they could consider their transcriptomic data on Ltbp1 in "mammary specific" clusters 7,8,9 as a matrix molecule initially expressed by mammary mesenchyme but which becomes expressed by luminal epithelial cells at precisely the time they acquire lineage specification and intrinsic branching capability.

      P20 lines 462-470 The authors should address their theory of Wnt suppression in the mammary mesenchyme in the context, albeit conflictingly, of earlier studies showing expression of Wnt signaling reporters, in either epithelial or mesenchymal locations during early stages.

      Significance

      Here the authors use classical embryonic tissue recombination and pharmacological manipulation of explants in conjunction with cutting edge 3D imaging of tissue derived from highly sophisticated reporter and knock-out mouse models and state of the art transcriptomic analysis to masterfully delineate and dissect regulatory pathways critical for embryonic mammary development. Specifically, they set out to parse regulation of proliferation from that of branch patterning.

      This is a thorough, well-constructed paper that adds new knowledge and important conceptual nuance and mechanistic insight to classical findings on branch patterning. This work is a technical tour de force and backed by solid quantitative and statistical analysis throughout. Their experimental approach is superb and the conclusions are sound. Their findings will be of great interest to the community of mammary gland biologists and to the wider field of embryologists focused on early development of a broad range of ectodermal appendages.

      Field of expertise: Embryonic and adult mammary development, Wnt signaling, cell adhesion

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The mammary gland is a branched structure that consists of a bilayered epithelium embedded in a specialized mesenchyme. In mice, at 11,5 days of embryogenesis, the ectoderm thickens forming 5 pairs of peculiar structures called placodes. During the following days, the placodes will grow and invaginate into the surrounding mammary mesenchyme and they will finally start to branch by the end of embryogenesis (E16). It has been suggested that the bidirectional communication between the growing mammary gland and the surrounding mesenchyme plays a pivotal role in the determination of each step of mammary gland development (placode formation, mammary bud invagination, gland outgrowth, branching). The role of different signalling molecules has already been shown, particularly for the placode growth and mammary bud invagination. Nevertheless, the pathways regulating embryonic mammary gland branching are still incompletely understood. In this manuscript, Lan and colleagues aim to decipher the correlation between different stages of mammary gland development such as proliferation, lineage segregation and ductal branching. Furthermore, they want to define which stage of mammary development is intrinsically determined by the epithelium and which one requires the supportive guidance of the mesenchyme. Lastly, they aim to discover the key signal for the growth and branching of mammary epithelium.

      To these purposes, they used an ex vivo model of heterochronic epithelial-mesenchymal recombination. In particular, they micro-dissected the epithelium and/or the mesenchyme from murine mammary glands at different stages of embryonic development (i.e. at E13,5 for the quiescent phase or 16,5 for branching phase) and explanted them together in different combinations using fluorescent reporters. To assess the role of the mesenchyme they also cultured the epithelium in a mesenchyme free 3d structure. Through this model they demonstrated that the presence of the mesenchyme is necessary for the priming of mammary epithelium for branching, since only E16,5 epithelial cells were able to grow and branch in a mesenchyme free 3D experiment. Nevertheless, intrinsic properties of the epithelium are necessary for the timing of branching, since E16,5 mesenchyme was not able to accelerate the outgrowth of E13,5 epithelia. In order to determine which epithelial properties are important, the authors correlated the beginning of cell proliferation in the embryonic mammary gland to the beginning of the branching phase. They indeed used the Fucci2a mouse model to carefully characterise the timing of mammary cells proliferation at different stages of embryonic development, concluding that the great majority of proliferating cells reside in the inner part of the mammary bud until E14,5, while in the external part at later stages.

      Regarding the importance of cell proliferation, Lan and colleagues claim that the beginning of the branching phase is not its direct consequence, thanks to the use of the K14Cre- Eda mouse model, known to have anticipated mammary gland development. Using this and the Eda-/- models, the authors also sustain that the branching occurs independently of the lineage specification of the epithelium. The use of salivary mesenchyme instead the mammary one was able to increase the number of branching of E16,5 mammary epithelium. Nevertheless, this model demonstrated that the branching pattern (side branching vs tip bifurcation) is an intrinsic feature of the epithelium. Lan and colleagues also defined the transcriptomic profiles of the mammary and salivary mesenchymes at different stages. In particular, they observed an increased expression of negative regulators of Wnt pathway in the mammary mesenchyme compared to the salivary mesenchyme. Moreover, using a mouse model where B-catenin is stabilised, they observed increased tip production in the mammary gland epithelium. They also showed that IGF1 production is increased after Wnt pathway activation and they tested its function, both treating their ex vivo cultures with exogenous IGF1 and using Igf1r-/- mouse models.

      Major comments

      1. The great majority of the results of the manuscript are based on an ex vivo model of heterochronic epithelial-mesenchymal recombination. Since the authors are studying the effect of the mesenchyme of different stages on the epithelium (and vice versa), the purity of the two compartments after the dissection is particularly important. Although they said that the purity is evaluated (line 112), it would be important to show a control staining in which they use known markers of the mesenchyme with no colocalization with the fluorescent reporter of the epithelium.
      2. Another important point for understanding the quality and impact of these findings is to assess the similarities and differences, if there are, between the in vivo mesenchyme and the ex vivo one. Indeed, once explanted and put in culture, mesenchymal cells could change their transcriptomic profile and consequently change their signals to the epithelium. The authors should assess the expression of the genes and pathways studied during embryonic development in vivo .
      3. The authors clearly showed that E16,5 epithelium is able to branch in a mesenchyme free 3D culture model, while epithelia from earlier stages don't. This led to the conclusion that mesenchyme is necessary for acquiring the branching ability. Nevertheless, the authors also said that early stages epithelia scarcely grow in the mesenchyme free 3D culture. Therefore, the lack of branching may be due to the lack of growth, if not the increase of death, of epithelial cells. The authors should quantify the size and the cell death of the epithelia in the different culture conditions and discuss better this point.
      4. The Fucci2a model allowed to assess the proliferation of embryonic mammary epithelium, showing that the great majority of proliferating cells are basal, at late stages of development (line 182). As it has already been shown, lineage specification is a late process during mammary gland development. The fact that the proliferating cells reside at the external part of the bud does not mean that they are basal cells yet. A p63/K8 staining could be important to understand if the increased proliferation occured in already specified basal cells or not.
      5. The use of Fucci2a model showed that 20% of epithelial cells are proliferative at E13,5. This phase is considered as "quiescent" by the authors (line 120), but the moderate proliferation rate shown in this experiment demonstrated that it is not. A change of the nomenclature is needed.
      6. Through the use of K14-Eda and Eda-/- models, the authors claimed that the lineage specification is not a prerequisite for ductal branching. To support this point, they showed that the K14-Eda mice have an anticipated branching although the expression of K8 in the inner part of the bud is transitorily decreased. The authors link the K8 downregulation to a transient suppression of the luminal lineage, but this is clearly overclaimed. Although K8 is a known marker of luminal lineage, the downregulation of one marker is not sufficient to support their thesis. They should first check more markers and in particular critical regulators of luminal lineage as Notch1, Foxa1 and Elf5. Lately, the use of different models that drive embryonic epithelial cells to a forced lineage commitment (Notch1 or Δnp63 overexpression) would support more their claim. As additional evidence, the authors showed that Eda is able to promote basal cell signature. Firstly, the authors should better explain why this point would support their thesis. Secondly, the supplementary figure 2b does not show which genes are taken into account to define the basal signature. A list of these genes would be helpful, as well as staining for some representative proteins.
      7. The authors used the same mouse models to assess the importance of proliferation in the determination of ductal branching and they claimed that proliferation is not a sufficient feature. This conclusion was supported by two observations. The first one is the fact that the K14-Eda model shows an increased cell proliferation at early stages compared to wt, coupled with anticipated branching. Secondly, although having smaller glands compared to wt and showing a delay in ductal branching, Eda-/- mice have an epithelial proliferation rate very similar to wt. Again, the conclusion that proliferation is not sufficient for branching is overclaimed. Firstly, the authors should explain how the buds in wt and Eda-/- mice have different sizes although the similar proliferation (increased cell death?, cellular volume?). Secondly, to support the thesis that proliferation is not sufficient for branching, functional experiments should be performed (see point 12). For instance, the short-time treatments with inhibitors or promotors of proliferation may help to understand the effective role of proliferation in the determination of branching.
      8. The heterotypic epithelial-mesenchymal recombination using the salivary gland is interesting. Nevertheless, some stainings to assess the purity of their systems are again required (e.g., marker of salivary epithelium to verify the purity of the mesenchyme and vice versa). This model clearly showed that the mammary epithelium can form more branching when combined with the salivary mesenchyme. Moreover, the salivary epithelium preferentially branches through tip bifurcation, while mammary epithelium combined with the salivary mesenchyme has a mixed pattern of tip bifurcation and side branching (typical of the mammary gland). The authors thus concluded that the branching pattern is an intrinsic feature of the epithelium. However, a comparison between the percentage of tip bifurcation and side branching in the heterotypic combination and the homotypic combination between mammary epithelium and mammary mesenchyme is crucial to understand this point. Indeed, these results are not sufficient to exclude that the branching pattern is partially determined by intrinsic features and partially by extrinsic signals. The authors should carefully quantify the branching pattern in the homotypic combination and compare that to the heterotypic one. If the percentage of tip bifurcation do not change, their conclusion is correct; if this percentage increases in the heterotypic combination, it would be a sign of a partial effect of the signals of the mesenchyme.
      9. Through the analysis of their transcriptomic data, Lan and colleagues found that the mammary mesenchyme expresses higher levels of negative regulators of Wnt pathway compared to the salivary mesenchyme. To demonstrate the value of their findings, they should confirm this in vivo, through staining of known Wnt proteins on the salivary and mammary mesenchymes at the embryonic stage.
      10. Since the ability of the salivary mesenchyme to promote a higher rate of branching in the mammary epithelium, the authors wanted to assess what could be the role of Wnt signalling. To do so, they used a mouse model where B-catenin is stabilised, allowing an increased Wnt signalling in the mammary mesenchyme. As a result, they observed increased branching in the mammary epithelium. They also found that IGF1 is a ligand regulated by Wnt pathway in the mesenchyme. Therefore, the use of exogenous IGF1 in their ex vivo model was able to increase the branching of the mammary epithelium. Moreover, Igf1r-/- embryos showed a significant decrease of mammary gland branching. The conclusion based on these experiments was that the Wnt-Igf1-Igf1r axis plays a pivotal role in the promotion of mammary gland branching during embryogenesis. This conclusion is overclaimed for different reasons. Firstly, the normalization of the ductal branching to the body weight is insufficient to exclude that the impact of the Igf1r knockout may have severe consequences on the mammary gland formation, upstream of the ductal branching. Another parameter for this normalization is required (e.g., size of the bud before branching, proliferation status, etc). The use of alternative models to specifically knockout the receptor in the epithelium or the ligand in the mesenchyme (e.g. viruses) would be even more useful to specifically focus on the role of this pathway for ductal branching excluding side effects. Another limit of this model is the fact that Igfr1 can be bound by Igf2 as well and we cannot exclude that this has an impact too (except if Igf2 is not expressed at this stage). A quantification of Igf2 expression may be useful.
      11. From the experiments presented in this section it is clear that Wnt-Igf1-Igf1r axis has to be finely regulated to have the correct amount of ductal branching in the embryonic mammary epithelium. Nevertheless, the author just showed the RNA levels of Igf1 in the different compartments they have analysed. Stainings to see the effective presence of the ligand on the tissue is mandatory to clarify the role of this axis in the ductal branching in vivo. Furthermore, as underlined by the authors, this axis is specifically important and upregulated in the salivary gland. Due the limit of the Igf1R-/- model, we cannot exclude that, although Wnt-Igf1-Igf1r axis is able to increase the branching ability of mammary epithelium, the normal branching rate observed in wt mice is due to other pathways.
      12. Lastly, once claimed to have found the key factor necessary for ductal branching promotion, the authors should also test if the proliferation and lineage segregations are unaffected in this context, confirming their dispensable role claimed in the initial part of the manuscript.

      Minor comments:

      1. An important paper on mammary gland ductal branching was published on Nature in 2017 by Scheele and colleagues and should be presented in the introduction, even though it is at later stages (after birth).
      2. In line 136 and 139 the authors referred to Fig 2 but it should be Fig 1
      3. The sentence on line 142 should be rephrased, since "advanced developmental stages" may be referred to pubertal development. The authors should specify that they are talking about embryonic development.

      Significance

      Overall, the authors concluded that embryonic mammary gland development and branching are extremely sensitive to the loss of IGF1, normally produced by the mesenchyme. The topic of the paper is interesting, the experimental approaches are well conceived, the data are convincing and the findings are of interest to developmental biologists. Nevertheless, there are some significant points that need to be further investigated before considering the manuscript suitable for publication:

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript by Lan et al. addresses the still incompletely resolved question as to how branching morphogenesis of the embryonic mammary epithelium is regulated at the molecular and cellular level. Using (combinatorial) primary explant cultures of wildtype and genetically engineered mouse embryos, in which the authors have developed a unique expertise over many years, together with imaging and RNAseq analyses, they (i) show that the timing of epithelial branching is dictated by the biological age of the epithelium, but that an epithelial-mesenchymal interaction is required to bestow branching ability on the mammary epithelium somewhere between E13.5 and E16.5, (ii) seek to determine if and how lineage and cell proliferation affect branching, (iii) show that while salivary mesenchyme can promote growth (i.e. branching density) of the E16.5 mammary epithelium, the mode of branching (i.e. lateral branching vs tip-clefting) is an intrinsic property of the mammary epithelium, (iv) use transcriptomics to identify genes that are likely to control either mammary- or salivary gland specific growth and/or branching patterns, (v) hypothesize that low levels of WNT signaling in the mammary gland mesenchyme (due to relatively high expression of WNT signaling inhibitors) are responsible for mammary specific branching, (vi) show that hyperactivation of WNT/CTNNB1 signaling in the mesenchyme indeed induces hyperbranching, (vii) identify Eda and Igf1 as putative mediators and paracrine signaling factors that regulate branching of the mammary epithelium upon secretion from the mesenchyme downstream of WNT/CTNNB1 signaling and (viii) show that mammary gland branching is impaired in Igfr1 null embryos.

      Major comments:

      1. Overall, this is a solid study that is well controlled and technically of high quality. The materials and methods should allow follow up and replication by others and the transcriptomic data have been made available via NCBI GEO. I think the authors convincingly demonstrate points (i), (iii), (iv) and (vi) and (viii). I have some questions regarding (ii), (v) and (vii) and (viii) that I will pose below.
      2. Re: (ii): The authors try to study the link between basal cell fate and branching. They use position of the cells (which they describe clearly and which is a good choice), since they cannot use specific markers due to the fact that the basal and luminal linages have not yet segregated at this point. This part of the manuscript is not the most straightforward to follow. The most obvious experiment would have been to focus on the location of the cells and their associated cell cycle profile - but the authors themselves have just recently published a pre-print (their REF #54, now also out in JCB) that is an in-depth study of the link between cell proliferation + cell motility and branching, but this only becomes apparent in the discussion. In that sense, Fig2 of the current manuscript is less novel, although it is nice to see that it holds up in a slightly different analysis. Instead of focusing on the cell cycle markers, the authors turn to a K14-Eda mouse model - which shows precocious branching and a temporary reduction in K8 expression. They also analyze Eda-KO embryos. Quite frankly, I find the authors' reasoning difficult to follow here and I cannot deduce how these experiments really address the question at hand (i.e. how lineage and cell proliferation affect branching), so I hope they can rewrite this section of the paper to make the arguments more clear and easy to follow for the reader who, at this point, knows little about Eda. For example, the authors present the argument that K14-Eda mice show a transient reduction in K8 expression - but we don't know if that also really means a (temporary?) change in (future?) luminal cell fate. In fact, since Eda later also makes an appearance as a candidate factor to be secreted by the mesenchyme together with Igf1, I wonder if their K14-Eda data would not be better suited to underscore that point instead and if the authors should perhaps eliminate this section altogether and just refer to their prior work in REF #45. If the authors think the current data add something more, than they need to be more explicit about this (and then also introduce the link to REF #45 in the results section).
      3. Re: (v): Do the authors have any WNT/CTNNB1 target genes that they can include in their transcriptomics analysis to show that the WNT/CTNNB1 signaling levels are indeed lower in the mammary mesenchyme? Axin2 comes to mind, but there are some other negative feedback targets that are often induced across tissues, e.g. Rnf43 and/or Znrf3 and/or Sp5?E.g. to include in FIg6E?
      4. Re: (vii) and (viii): The authors convincingly show the phenotype of the Igfr1 KO mice, but I hope the authors concur that an epithelial only Igfr1 KO (or alternatively a mesenchymal only Igf1 KO, or epithelial/mesenchymal recombination experiments with WT vs IGFR1 null or IGF1 null tissue, or experiments with small molecule inhibitors of IGF1/IGFR1 signaling) would have given more solid mechanistic evidence regarding the presumed paracrine effect of IGF1 signaling. I am not asking the authors to perform another mouse experiment or even generate or use these conditional strains, but if the authors agree, then I do think this would merit some attention in the discussion section. See also my comments regarding Eda in point 1.

      Minor comments:

      • A few minor spelling/grammar errors, including a couple of "the"s missing (first line of the abstract, and also preceding "Majority" in line 148.
      • Line 517-518: please also include the details for the Eda mice.
      • 1f spelling error: separation

      Referees cross-commenting

      Having read all three review reports I think they are pretty much in agreement, with shared questions about the inclusion/meaning/discussion of the lineage specification data and also agreement about the overall technical solidity of the data and this approach.

      I gather that reviewer #2 asks for more controls than myself or reviewer #3 and while I think all of their points are valid, in principle, I don't think all of these are required. I should add that I am inclined to trust the authors on their ability to separate mesenchyme and epithelium as they have been developing and optimising this system over many years.

      Significance

      General assessment:

      This is a carefully executed study in which an impressive amount of (combinatorial) embryonic mammary tissue explant experiments are combined with quantitative imaging and transcriptomics analysis.

      The main limitations of the work lie in the fact that the investigation of a potential link between branching and the cell cycle is not entirely novel, as the authors themselves recently published an nice pre-print (now also out in JCB) describing similar analyses. In addition, the mechanistic link between WNT/CTNNB1 signaling in the mesenchyme and the paracrine signaling activities of the presumed downstream effectors EDA and IGF, while plausible, is not yet complete. The work also does not yet addresses what exactly the branching identity is that is bestowed upon the mammary epithelium between E13.5 and E16.5 and how this then becomes an intrinsic (epigenetic?) feature of the mammary gland.

      Advance:

      This work provides more insight into the embryonic branching of the mammary gland - a stage of mammary gland development that is still poorly understood and that is, in general, understudied. In part, the work confirms prior work in the literature (their REF #19) regarding mammary and salivary gland tissue recombination experiments. It supplements this with a more elaborate time series of heterochronic and heterologous epithelium/mesenchyme explant cultures, using genetically engineered (and fluorescently labeled) mouse tissues to allow better and quantitative imaging. The transcriptomic analysis of different mesenchyme populations is also informative and allows the researchers to propose a putative mechanism for why the mammary gland branches differently from the salivary gland. The advance is both technical and functional, as well as conceptual, with some advance in terms of mechanism.

      Audience: This works should appeal to mammary gland biologists interested in the molecular and cellular mechanisms of (early) mammary gland development, as well as to a broader community of developmental biologists studying branching morphogenesis in tissues such as lung, kidney and salivary gland.

      My expertise:

      WNT signaling and mammary gland biology, at the intersection of developmental, stem cell and cancer biology

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, Hwang et al. develop an inducible Cas9 hiPSC line and perform with it a pooled CRISPR knockout screen using a custom sgRNA library to identify novel genes involved in human primordial germ cell like cell (hPGCLC) differentiation in vitro. Thereby they find the AKT coactivator TCL1A to be important in the proliferation/survival of hPGCLCs after specification.

      Specific Comments:

      1.) p.7-p.8: "Using the MAGeCK algorithm (Li et al, 2014) to call hits on merged replicates, 25 genes scored as significantly depleted from the AG+ population at p < 0.05. Among the top hits was SOX17, and near-hits included TFAP2C, both of which are well-known drivers of the hPGCLC state (Fig. S2E)."

      When looking at Table S3, it appears that only 2 genes were significantly depleted (P < 0.05) in replicate 1, 23 in replicate 2 and 10 genes when rep 1 and rep 2 were analyzed together. The essential germ cell genes SOX17 and TFAP2C were not significantly depleted in replicate 1 and only TFAP2C but not SOX17 was depleted significantly in replicate 2. Also the main hits discussed in this paper, METTL7 and TCL1A were not significantly depleted in replicate 1 and only METTL7 but not TCL1A was significantly depleted in replicate 2. This indicates that replicate 1 might not have been robust enough to reliably detect depleted genes and that TCL1A was not among the significant hits. A potential explanation could be that not enough cells have been used to ensure a sufficient representation of sgRNAs to provide significant results in a depletion screen. Ideally the screen would need to be repeated to provide another informative replicate, or the authors should at least correct the sentences above and openly state that their hits are only based on one replicate of the screen and that the list of their hits might therefore not be fully reliable. Also the statement on page 8 that "the screen was both technically and biologically successful" might need to be toned down.

      2.) p.8 and Fig. 2E: The authors do not clearly describe, what are the 25 top hits in PGCLC(+) and PGCLC(-) cells and how they were chosen (Score, p-value or fold change), which they compared in the gene set enrichment analysis to the RNA-Seq data.

      3.) Fig. S3I-K: The authors mention in the text on p. 9 a significant reduction in hPGCLC induction efficiency for both TCL1A and METTL7A KO cells, but they do not provide statistics and do not mention, how many biological replicates have been used. As hiPSCs generally show a high clone to clone variability in hPGCLC induction efficiency, results from a single KO clone can not be considered as a reliable result. The authors should provide results from additional wt and KO clones (they are showing in S3E multiple for each gene) to ensure reliable effects, especially for METTL7A KO cells, where the reduction in PGCLC induction efficiency is more modest and might not be significant (Fig. S3I). Another way of validating the phenotype would be to use individual control, TCL1A and METTL7A sgRNAS from the screen and compare induction efficiencies with or without DOX-induced Cas9 expression.

      4.) Fig. 3F, Supp Tables S4, S5, S6: It is not clearly described, what was the criteria to define DEGs for the GO term analysis of TCL1A KO cells, as more genes have been used than the relatively few significant DEGs reported in Table S4. Furthermore, only FDRs or otherwise adjusted p-values (not raw p-values as done in the figure and tables) should be used to determine significantly enriched GO terms. Also no representative gene names are displayed in Fig.3F, as stated in the figure legend.

      5.) Fig.4: p-AKT and p-mTOR signaling are represented as the mechanism, by which TCLA1 KO affects hPGCLC maintenance/proliferation. It is not clear from the presented data, what is meant with biological triplicates (different germ cell inductions, different subclones) mentioned in the figure legend. As some of the effects observed are quite small (e.g. p-mTOR differences in 4E, cell cycle differences in 4J), biological replicates with a different KO clone should be performed (see point 3). Otherwise it is hard to judge how robust the data and conclusions derived are.

      CROSS-CONSULTATION COMMENTS

      Apart from my points regarding the weakness in statistical confidence I agree with the other reviewers that it needs to be shown whether the effect of TCL1A KO is based on a general proliferation defect of the entire aggregation body or if the effect is really hPGCLC-specific.

      Significance

      The study is the first CRISPR screen performed during hPGCLC differentiation and provides a proof of principle for a useful tool to allow dissection of gene regulatory networks during human germ cell development. Overall this is a technical advancement and an ambitious study and will generate interest in the human germ cell field. Generally it is easy to follow but could be improved in the description of some of the methodology (points 2+4). Overall the study suffers from weaknesses in the statistical robustness, as the screen itself did not provide many significant hits (major point 1) and the follow up was only performed using a single KO clone (points 3+5). Therefore adding replicates would be necessary to strengthen the confidence in the drawn conclusions.

      Reviewers' relevant expertise: in vitro germ cell differentiation, pluripotency, CRISPR screening

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors conducted a custom CRISPR screening including 422 coding-genes in hPGC-like cells (hPGCLCs) to identify genes important to hPGCLC. Based on the screen, they found two candidates, TCL1A and METTL7A, that regulate hPGCLC specification. They concluded that TCL1A, an AKT coactivator, is critical for hPGCLC specification through regulating AKT-mTOR signaling. Unfortunately, we found that the evidences for the key conclusions are not quite convincing. Reasons are as below:

      1. The results to demonstrate the key role of TCL1A on hPGCLC specification is not convincing. Fig. 3B, C & D. the cell number per aggregate is also significantly reduced in TCL1A KO (2843/20.7% =13734) compared to that in WT cells (545/16.4%=3323). Despite that, the percentage of AG+ cells per aggregate is significantly, but not dramatically decreased in TCL1A KO (20.7%) vs WT (16.4%) cells. Thus, the effect of TCL1A KO may not be specific on the AG+ cells, but on the whole aggregate.
      2. The overall effect of METTL7A KO on hPGCLC development is too moderate to conclude it as a key regulator for hPGCLC specification.

      Significance

      a CRISPR screening for key regulator for human germline cell apecification was not reported before.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, the authors performed CRISPR/Cas9-based screening to identify genes involved in human germ cell development. Using human PGCLC system, the authors found that sgRNAs targeting TCL1A and METTL7A genes were enriched in non-PGCLC population. Gene-disruption of each gene resulted in a significant reduction in PGCLC differentiation. Moreover, TCL1A-knockout PGCLCs failed to proliferation during PGCLC induction, possibly due to attenuation of AKT signaling. Further analyses showed that protein synthesis and cell-cycle progression, especially in S-phase, were impaired in TCL1A-knockout PGCLCs. Thus, the authors provided remarks on successful identification of genes functionally important for human PGCLC differentiation and the importance of translational control in human germ line.

      This manuscript demonstrates a genetic screening for genes involved in hPGCLC differentiation. Although the number of genes targeted by sgRNAs were limited (422 genes), it is still valuable to show such genetic screening can be applied to the hPGCLC system. Overall, statements and data in the manuscript are convincing, except for following points, and the novelty is sufficient for publication. It would be further improved, considering following points with additional experiment, if feasible.

      1. A major concern in this manuscript is whether TCL1A function is specifically involved in hPGC development or generally important for other cell type. As AKT signaling plays multiple roles on many cell contexts, this is important to verify the author's conclusion. For example, is there any defect in proliferation of TCL1A-knockout iPS lines? The authors should quantify the doubling rate of TCL1A-knockout iPS cells, the number of iMeLCs yielded, and the cell number included in the aggregates. Looking at Figure S3K and K, the total number of the cells in the aggregates seems lower in TCL1A-knockout aggregates than in WT.

      2. Related to the comment above, the author should add a statement describing expression pattern and level of TCL1A and METTL7A in tissue. Are they preferentially expressed in the germ line, or generally expressed in a broad range of tissue?

      3. The quantification of pAKT and p-mTOR is vague. The authors should quantify in a different way. Although the author claimed that Western blot analysis was not able to detect pAKT and p-mTOR in PGCLCs, there are a number of reports that detect these proteins. As an advantage of PGCLC system is to handle a large number of cells in culture, the authors should perform rigorously the quantification.

      4. In the abstract, there is a statement "demonstrate the importance of translational control in human reproduction". Isn't this too general? Is it a new finding? With critical examination for cell-specificity of the translational control by TCL1A as described above, this should be refined.

      Significance

      As authors performed CRISPR/Cas9-based screening to identify genes involved in human germ cell development using human PGCLC system, and then successfully isolated TCL1A and METTL7A genes that are previously known as drives for PGCLC induction. Therefore, from both technological and scientific viewpoints, there is a significant advance shown in this manuscript.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to Reviewer comments

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      __Summary __

      The manuscript by Parker et al addresses the important question of how different organisms have evolved pre-messenger RNA systems that are either more or less complex. This question underlies the evolution of complex organisms and the genome adaptation of simple organisms to their specific environments, so is an important question to answer. This manuscript now provides the underlying molecular mechanisms of how 5' splice site sequence preference may have evolved which is both an interesting and exciting advance for the field.

      We thank the reviewer for these kind comments.

      __Major comments __

      __This manuscript builds on the previous work from this group where they identified the role of adenosine N6 methylation (m6A) of the U6 small nuclear RNA (snRNA) of the spliceosome by METTL16 as being important for 5' splice site selection. This work led to the speculation that loss of a METTL16 ortholog, or potentially other splicing factors, in some species could contribute to an evolutionary change in 5' splice site sequence preference. Here the authors now use the power of phylogenetics, interspecies association mapping and the available spliceosome structures to provide convincing conclusions that 5' splice site sequence preferences in the extensive number of organisms examined correlate with the presence of the U6 snRNA methyltransferase METTL16 and the splicing factor SNRNP27K. __

      __An analysis of METTL16 conservation was first carried out by comparing the METTL16 methyltransferase domain (MTD) in 29 diverse eukaryotic species. All the METTL16 orthologs were found to have either one or two globular domains. Three domain types were identified and compared in detail. What was not clear from this analysis was the functional significance of orthologs having either one or two domains. __

      We identified several species, including Drosophila melanogaster, whose METTL16 orthologs do not contain a VCR domain. However, in this study we do not draw specific conclusions about the functional significance of orthologs having different domain topologies.

      __In addition, while this analysis provides important new information on the domain structure of METTL16 orthologs, especially where these domains had not been identified previously, the link between this section of the results and the following sections is not that apparent. __

      We agree that there is a significant difference in approach between the first section of the Results and the following sections. However, we are keen to keep this part of the manuscript because it provides an orthogonal line of evidence suggesting that the ancestral role of METTL16 in eukaryotes is specifically the methylation of U6 snRNA.

      __Next novel bioinformatics pipelines were developed to compare both introns and orthologous groupings of protein coding genes between 227 Sacchromycotina genomes as well as 13 well-annotated eukaryote genomes. First, the 5' splice site sequence preference was compared and clearly indicates that the +4 position has the greatest variation in preferences within the Sacchromycotina. The ability to now compare a large number of genomes has provided novel information on the evolution of the 5' splice site sequence and the conclusion that there is more complexity to the 5' splice site in fungi that previously recognized. While it is apparent why only the 5' splice site signal was investigated here, with its relationship to the U6 snRNA and METTL16, it seems a shame the other splice site sequences were not analyzed using this novel pipeline. In any case, the complexity of the 5' splice site +4 position now allows, for the first time, interesting interspecies association studies. __

      We have now included the variance plots for 3’SS motifs (analogous to the 5’SS variance plots shown in Figure 2B) as Figure 2 supplementary figure 4A, and a traitgram for 3’SS -3C to U ratio as Figure 2 supplementary figure 4B. We have included a short section of text in the Results section to describe these additional findings.

      __With ____the 5' splice site +4 variation identified, the next step was to determine the underlying molecular mechanisms that dictate the evolution of the various sequence preferences. Some obvious players here are the U1 and U6 snRNAs which directly interact with the 5' splice site during splicing. However, no association was found between these snRNAs and the 5' splice site +4 sequence. __

      __The powerful interspecies association mapping was then used to determine whether the presence or absence of METTL16 ortholog or a splicing factor correlated with the 5' splice site +4 sequence variation. Interestingly, a clear association was found between METTL16 and the 5' splice site +4 position; METTL16 presence was associated with +4A at the 5' splice site and METTL16 absence was associated with +4U at the 5' splice site. This is an exciting and significant finding. __

      We thank the reviewer for these comments on the importance of this study.

      __Interestingly, the next most significant association with the 5' splice site +4 position was with SNRNP27K. This result makes sense as in the cryo-EM structure of the pre-B spliceosome complex the C-terminal domain of SNRNP27K is found near the region of the U6 snRNA that will interact with the 5' splices site. Absence of SNRNP27K was associated with an increased preference for +4U at the 5' splice site. Now the real power of the interspecies association mapping was demonstrated by investigating whether any association could be determined specifically within the C-terminus of SNRNP27K. Significantly, the methionine 141 position in SNRNP27K was found to be associated with the +4 position of the 5' splice site. This finding fits nicely with previous studies where mutation of M141 caused a shift in 5' splice site selection away from +4A 5' splice sites, to 5' splice sites without +4A. What is not clear is whether M141 is conserved or invariant between all the species that were compared? __

      M141 is not completely conserved across the species that were compared for the SNRNP27K C-terminus analysis. We did not test positions with very strong sequence conservation, because without variation in both the genotype and phenotype it is not possible to test for an association. We have rephrased the relevant Results and Methods sections to make this point clearer. In addition, we have incorporated a sequence logo to illustrate the degree of conservation of each position in the SNRNP27K C-terminal domain as Figure 5 -figure supplement 1A. Finally, we have included an additional box-plot to illustrate the finding that species which have lost SNRNP27K or have only lost the Methionine equivalent to human SNRNP27K position 141, show a similar preference for +4U at 5’ SSs. This is now included as Figure 5 - figure supplement 1B.

      Overall, this result reveals the power of the interspecies association approach and provides interesting and exciting information on the molecular determinants of 5' splice site evolution.

      We are grateful to the reviewer for these comments.

      __The final analysis was to investigate the interaction potentials of the U5 and U6 snRNAs with the 5' splice site in the Sacchromycotina genomes and try to relate this to species with fewer introns and less alternative splicing. Species with low intron numbers and low splicing complexity were revealed to have weaker U5 and U6 anti-correlation potentials and favor +4U at the 5' splice site. On the other hand, species with high intron number and presumably higher splicing complexity featured anti-correlated U5 and U6 snRNA interaction potentials and favored +4A 5' splice sites. This extensive analysis provides novel information on the interactions and splice site properties of species with simple and complex splicing. Again, I see why there is emphasis on the 5' splice site here but a similar analysis with the U2 snRNA and the branch site could also be informative. __

      We absolutely agree that inter-species association mapping could be applied to other splicing signal phenotypes including 3’ splice sites and intron branchpoints. Accordingly, we raise this subject in the final section of the Discussion. However, branchpoint sequences are challenging to predict with genomic data. Because preliminary analyses suggest independent variation in these other splicing signal phenotypes, we feel a separate focused study is required to properly explain (and substantiate) even the analytical approaches involved. We hope the reviewer would agree that incorporating U2 snRNA and branchpoint variation analyses into this manuscript as well, could detract from the clarity of the conceptual advances that we make here.

      __Minor comments __

      __Should the Title include SNRNP27K? __

      We have included SNRNP27K into the revised title.

      Should the title specify that it is the evolution of only the 5’ splice site sequence preference being studied here?

      Because apostrophes in titles can compromise some scholarly online search engines (https://insights.uksg.org/articles/10.1629/uksg.534), we would prefer not to include 5’ in the title.

      Include information on intron number and 5’ splice site interaction potential of U5 and U6 snRNA in the Summary?

      We thank the reviewer for this suggestion. We have updated the Summary to include our findings on U5 and U6 interaction potential in species with reduced intron number.

      __Figure 1C is not referred to in the text? __

      We apologise for this oversight. We have added references to figure 1C in the appropriate Results section.

      Page 8, line 5 – better to say “splicing signal phenotypes”.

      We have amended this statement on Page 8 and at other places in the text where related phrasing was made.

      __What are the other points on Figure 3B? What is the next point below SNRNP27K? Is it U2A’? __

      The other points on Figure 3B represent Orthofinder orthogroups which contain human orthologs that are known components of the spliceosome. The list of spliceosomal components was taken from Sales-Lee et al. 2021. The third most significant point is indeed the orthogroup containing the human ortholog of U2A’. As we state in the text, however, the correlation of U2A’ with the 5’SS+4 A to U ratio phenotype is no longer significant once METTL16 presence/absence is controlled for, indicating that the correlation of U2A’ with the +4A phenotype is likely explained by similarity in the patterns of gene loss of U2A’ and METTL16.

      __The second paragraph of the Discussion is vague and lacks a reference. “we could also identify an association with a methionine residue in the conserved C-terminal domain of SNRNP27K orthologs.” There are a few methionines in the C-terminus, which one? Please reference the statement “transcriptome analysis of C. elegans SNRP-27 M141T mutants..” __

      We apologise for the lower quality of writing in this section of the Discussion. We have updated the text, made the statements about the SNRNP27K C-terminus less ambiguous, and added the relevant citations as appropriate.

      Reviewer #1 (Significance (Required)):

      Overall, this is a well written and clearly presented study that provides some key molecular information on the splicing factors involved in the evolution of 5’ splice sites and shows the power of interspecies association studies. Some important conceptual principles have now been defined for the field going forward.

      With thank the reviewer for this kind comment on the importance of this work.

      __The question remains as to whether METTL16 and SNRNP27K are the sole determinants of 5’ splice site preference evolution at +4? __

      We cannot say for certain that METTL16 and/or SNRNP27K determine the 5’SS +4 phenotype – only that they are correlated with it. In our response to reviewer 3, and in a new Discussion section, we have detailed some of the scenarios that could explain these correlations. We also cannot rule out whether there are changes in the presence/absence (or domain/sequence-level changes) of other, untested proteins that correlate with the 5’SS +4 phenotype and we allude to this in the final section of the Discussion.

      One splicing factor that immediately comes to mind is Prp8 where there is extensive evidence for involvement in splice site selection and is clearly in the right location throughout splicing to be involved. This question should at least be discussed but Prp8 would also be a very interesting candidate for the interspecies association mapping.

      Prp8 is a core component of spliceosomes and is conserved throughout the Saccharomycotina. For this reason, we were unable to associate splicing phenotypes with Prp8 presence or absence variation at the level of orthogroups. However, we revisited this question posed by the reviewer. Our experience with inter-species association mapping, so far, indicates it works well with orthogroup presence/absence or when straightforward amino acid substitutions can be detected in conserved and hence alignable protein sequence domains. We analysed the conserved U6 snRNA-interacting region of the Prp8 linker domain, which maps close to the 5’ splice site in cryo-EM models, using the profile HMM PF10596 available from Pfam. We found that the majority of this domain was extremely highly conserved with variation in only a few species and positions. The strongest correlation with the +4A to U ratio phenotype was at position 58, which is conserved as a Glycine in all but 8 species (6 Dipodascaceae, 2 CUG-Ser1), that also tend to have a stronger preference for +4A. However, examination of the species contributing to this result (and to similar results at other positions) indicated that in the 6 Dipodascaceae species, this change is part of a larger deletion or replacement that makes the whole linker region align poorly to the model. Hence, the G58 position itself may not be specifically important for the +4 phenotype. Although the wholesale loss or replacement of the U6 snRNA-interacting region in these species is potentially interesting, these larger scale structural changes in a small number of species are difficult to interpret. Therefore, to maintain the focus of the manuscript and the clear links to METTL16 and SNRNP27K that have orthogonal support, we have decided not to add these results to the manuscript but present them here (Figure not available on biorXiv commenting window).

      Also, as mentioned previously, only the 5’ splice site was investigated here and the manuscript could become a more substantial piece of work if the other splice sites were included in some way.

      We agree that it will be exciting to apply this approach to other splicing signal phenotypes and in other phylogenetic clades with emerging tree-of-life-scale genomics data. We have included variation in 3’ splice sites in the revised manuscript. As the first of its kind, this study should pioneer a wider use of this approach, by us and others, to understand the mechanisms and functions of molecular interactions not only in splicing but in other areas of biology too.

      __The obvious audience here are those directly in the splicing field but the overall principles are relevant for evolutionary biologists and those studying organismal complexity. __

      We thank the reviewer for recognising the broad importance of this work.

      My expertise is in yeast and human splicing mechanisms. I do not have the expertise to critically evaluate the bioinformatic pipelines but they were clearly explained and presented.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In their manuscript, Parker et al. investigate the evolutionary patterns of splice site preference, focusing on the A/U ratio at position A+4 on the 5´ splice site. Building upon prior studies in S. pombe and A. thaliana, the authors establish a strong correlation between this preference and the co-evolution of the METTL16 U6 snRNA methyltransferase. Furthermore, through inter-species association mapping, they identify the involvement of the splicing factor SNRNP27K in altered A/U ratios and highlight the significance of the residue Met-141 in SNRNP27K for this function. Overall, the paper effectively presents impactful new findings on the evolution of METTL16, U6 snRNA, and splicing.

      We thank the reviewer for these kind comments on the importance of our study.

      The computational analyses employed in this study are situated outside our field of expertise, preventing us from offering a comprehensive evaluation of the methodology’s appropriateness and rigor. Nonetheless, the identification of METTL16 through the authors’ methods, which aligns with previous research in S. pombe and A. thaliana, lends support to the validity of their approach. Notably, the close proximity between SNRNP27K and the methylated A43 residue in U6 snRNA within the spliceosome, particularly near Met-141, is an impressive finding. Previous studies have shown that a mutation at position M141T affects splicing at +4A introns, thus providing robust validation for their methods.

      We thank the reviewer for these kind comments on our work.

      The data presented in this study furnish crucial insights into the role of METTL16, U6 snRNA methylation, and splice site recognition. The authors expand upon recent observations that the “vertebrate conserved region” exists in non-vertebrates, despite the absence of primary sequence homology. These results will serve as a valuable guide for future molecular investigations into U6 snRNA methylation and its mechanisms in splicing. Furthermore, the implications of this paper extend to human evolution, as the plasticity in splicing is an essential factor in the evolution of developmental complexity.

      We thank the reviewer for these kind comments.

      Minor suggestions for improvement:

      1. __ Given the significance of the interaction between U6 snRNA and the intron for understanding the data, it would be beneficial to include a figure illustrating the RNA-RNA base-pairing interactions between U6 snRNA and the 5´ splice site. This addition is particularly important if the paper is intended for publication in a journal with a general readership.__  We thank the reviewer for this excellent suggestion. We have included this as Figure 3A.

      __ Similarly, the section on U1 snRNA would be more comprehensible with the inclusion of U1 RNA-RNA intron diagrams and improved descriptions of both the figures and the assay. Despite being negative data in the supplement, clarifying this section is essential. As currently written, it is challenging to follow.__ 

      We agree that this section is difficult to follow. We have updated the text to improve the readability and included a figure of U1 snRNA:5’SS basepairing as Figure 3 – figure supplement 1A.

      __ Whenever possible, consider increasing the figure and font sizes to enhance readability for readers.__ 

      We agree that some of the more complex figures can be difficult to read when embedded into a Word document/pdf. We hope that providing high-resolution figures for reading online will mitigate this.

      __ In the text, there is no reference to Figure 1C.__ 

      We apologise for this oversight. We have resolved this issue with the appropriate references in the Results text.

      __ In Figure 5B, the y-axis in the top panel is labelled “species,” but the legend only mentions U5/6p as the y-axis. Please revise the legend to include the appropriate information.__ 

      We apologise for the confusion caused by our poorly written legend for this plot. We have updated the legend so that the text clearly refers to either the scatter plot or the marginal histograms.

      Reviewer #2 (Significance (Required)):

      The data presented in this study furnish crucial insights into the role of METTL16, U6 snRNA methylation, and splice site recognition. The authors expand upon recent observations that the “vertebrate conserved region” exists in non-vertebrates, despite the absence of primary sequence homology. These results will serve as a valuable guide for future molecular investigations into U6 snRNA methylation and its mechanisms in splicing. Furthermore, the implications of this paper extend to human evolution, as the plasticity in splicing is an essential factor in the evolution of developmental complexity.

      We are grateful to the reviewer for these kind comments on the importance of this work.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Parker et al present a nice exploration of the evolutionary and mechanistic relationships between 5′ splice site consensus sequences, intron numbers and METTL16/SNRNP27K. By performing inter-species association mapping in Saccharomycotina species, they found that a T in position +4 is strongly associated with the absence of METTL16 (and/or in some cases SNRNP27K or mutations in it). They also provide solid structural modelling data in support of this association.

      In general, I think this is a very nice manuscript. I only have a few comments, which could be addressed by rewording specific parts and/or improving the current figures.

      We are grateful to the reviewer for the kind comments on this work.

      1) As the authors acknowledge, a key issue that cannot be fully resolved in this study is causality between the different events investigated. Overall, the authors are careful about this, but there are some exceptions that should be corrected. Probably the most important is in the abstract, where they write: “We conclude that variation in concerted processes of 5’ splice site selection by U6 snRNA is crucial to evolutionary change in splicing complexity”. I suggest they write something more open (and correct), such as: “We conclude that variation in concerted processes of 5’ splice site selection by U6 snRNA is associated with evolutionary changes in splicing complexity”. Similarly, other plausible scenarios should be discussed in the corresponding Discussion section.

      We agree with the reviewer that it is not possible to infer the causal relationship between METTL16 absence and 5’SS+4 preference change from the current data. We, therefore, apologise for failing to be more careful in the Summary and Introduction. We have reworded these statements to better reflect what we can currently say about the evolutionary relationship between METTL16 and 5’SS sequence preference.

      The correlation between METTL16 absence and 5'SS+4 sequence preference change could most likely be explained by one of several scenarios: (a) sudden loss of METTL16 causes a rapid necessity to change 5'SS sequence preferences. This is unlikely as such rapid change without widespread corresponding 5'SS changes would likely impose a high fitness cost. (b) Changes in 5'SS sequence preference occur first, driven by some other selective pressure, until there is no longer a benefit to retaining the METTL16 gene. (c) Gradual changes in the expression or catalytic efficiency of METTL16 reduce the stoichiometry of U6 snRNA m6A modification, which permits gradual change in 5'SS+4 sequence preference until complete loss of the METTL16 no longer imposes a major fitness cost. As we suggest in the Discussion, future work could examine this question by determining whether the METTL16 orthologs found in Zygosaccharomyces and Eremothecium species, which have altered their 5'SS+4 preference to a U, are expressed and functional. We have updated the Discussion to include a new section that addresses these scenarios.

      2) I do not agree with the statement that "The extent of alternative splicing is the best genomic predictor of developmental complexity". To start with, there are many ways to quantify "extent of alternative splicing" and there are also different types of alternative splicing that might have different prevalence and biological impact. Then, this claim is usually related with exon skipping, which is tightly linked with intron length, and that is likely a better prediction of complexity (yet clearly not causative). My concern is: to what extent has this claim been formally and properly assessed by comparing splicing prevalence with other genomic features, such as intergenic region length, intron length, or average distance between enhancer-promoter interactions (arguably the most relevant predictor, in light of many other studies)? Moreover, I found it a bit misleading to frame the work presented in this study as directly related with developmental (or even splicing) complexity. The work is very interesting on its own, and I doubt their findings on +4 position preference in Saccharomycotina has anything to do with developmental complexity (as the Abstract and Introduction seem to imply).

      On reflection, we agree with the reviewer. Some of our framing of the text isn’t balanced with other studies on the scaling of alternative splicing with developmental complexity. We have edited the Summary and Introduction sections accordingly and cited other references that broaden the consideration of this subject. We are grateful to the reviewer for this suggestion because the changes we make improve the focus of the manuscript since our findings relate more to splicing simplification than to an understanding of increased developmental complexity.

      __3) I found Figure 2 and its associated supplementary figure very difficult to follow. I suggest the authors try to improve it and make it clearer. Also, other trees summarizing the results might be helpful. __

      We apologise for the complexity of these figures. We opted to show phylogenetic trees with phenotypes plotted on the y axis, rather than simply trait histograms or box-plots, because the underlying structure of the tree is important for demonstrating that multiple independent changes in the 5’SS phenotype have occurred in the Saccharomycotina. We have tried to improve the comprehensibility of the figures in the following ways: (a) We have added 5’SS sequence motifs to the x-axis of figure 2B to make what the plot represents clearer, (b) as suggested by the reviewer, we have created a pruned tree showing the 5’SS motifs of a selection of Saccharomycotina species, which demonstrates that the changes in 5’SS+4 position preferences seen in S. cerevisiae and C. albicans are likely to be a result of convergent evolution. We have added this tree as Figure 2 - figure supplement 3.

      __4) I also found the Results section corresponding to Figure 5B a bit confusing. I would argue (as I think the authors do) that there are two main patterns here: below 500 introns, there is no association, while above 500 introns there is an increasingly negative association (correlation). I think it would help to more explicitly distinguishing these two patterns. Then, for the intron-poor species: is the correlation (or lack of) for species with a T or an A in position +4 different? __

      We do indeed think that there are two patterns here, as indicated by the reviewer. In the previous version of the manuscript, we separated species into those having an overall preference for A at the +4 position, and those having +4U. By showing regression lines for these two classes, rather than for the general relationship between intron number and U5/6rho, we somewhat imply that the switch in +4 base preference might be causing the loss of correlation between U5/6rho and intron number. However, since essentially all species with a 5'SS +4U preference are intron poor, it seems more likely that these trends are the result of a loss of the negative correlation between intron number and U5/6rho in intron poor species, as suggested by the reviewer. To address this issue, we have replaced the regression lines on Figure 6B with a single loess (locally estimated scatterplot smoothing) regression line for all species and updated the text to make it clearer that we think loss of U5/6rho and +4A preference are separate traits of intron poor species. Although this is not exactly what the reviewer requested, we hope that it satisfies their issue with the analysis.

      __Reviewer #3 (Significance (Required)): __

      __This is a very interesting study that sheds light on an intriguing evolutionary pattern: the change in consensus sequence at position +4 of the 5' splice site. This topic is relevant since it is closely associated with intron loss and splicing efficiency and evolution. __

      We thank the reviewer for the kind and constructive comments on this study.

    2. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary

      The manuscript by Parker et al addresses the important question of how different organisms have evolved pre-messenger RNA systems that are either more or less complex. This question underlies the evolution of complex organisms and the genome adaptation of simple organisms to their specific environments, so is an important question to answer. This manuscript now provides the underlying molecular mechanisms of how 5' splice site sequence preference may have evolved which is both an interesting and exciting advance for the field.

      We thank the reviewer for these kind comments.

      Major comments

      This manuscript builds on the previous work from this group where they identified the role of adenosine N6 methylation (m6A) of the U6 small nuclear RNA (snRNA) of the spliceosome by METTL16 as being important for 5' splice site selection. This work led to the speculation that loss of a METTL16 ortholog, or potentially other splicing factors, in some species could contribute to an evolutionary change in 5' splice site sequence preference. Here the authors now use the power of phylogenetics, interspecies association mapping and the available spliceosome structures to provide convincing conclusions that 5' splice site sequence preferences in the extensive number of organisms examined correlate with the presence of the U6 snRNA methyltransferase METTL16 and the splicing factor SNRNP27K. 

      An analysis of METTL16 conservation was first carried out by comparing the METTL16 methyltransferase domain (MTD) in 29 diverse eukaryotic species. All the METTL16 orthologs were found to have either one or two globular domains. Three domain types were identified and compared in detail. What was not clear from this analysis was the functional significance of orthologs having either one or two domains.

      We identified several species, including Drosophila melanogaster, whose METTL16 orthologs do not contain a VCR domain. However, in this study we do not draw specific conclusions about the functional significance of orthologs having different domain topologies.

      In addition, while this analysis provides important new information on the domain structure of METTL16 orthologs, especially where these domains had not been identified previously, the link between this section of the results and the following sections is not that apparent.

      We agree that there is a significant difference in approach between the first section of the Results and the following sections. However, we are keen to keep this part of the manuscript because it provides an orthogonal line of evidence suggesting that the ancestral role of METTL16 in eukaryotes is specifically the methylation of U6 snRNA.

      Next novel bioinformatics pipelines were developed to compare both introns and orthologous groupings of protein coding genes between 227 Sacchromycotina genomes as well as 13 well-annotated eukaryote genomes. First, the 5' splice site sequence preference was compared and clearly indicates that the +4 position has the greatest variation in preferences within the Sacchromycotina. The ability to now compare a large number of genomes has provided novel information on the evolution of the 5' splice site sequence and the conclusion that there is more complexity to the 5' splice site in fungi that previously recognized. While it is apparent why only the 5' splice site signal was investigated here, with its relationship to the U6 snRNA and METTL16, it seems a shame the other splice site sequences were not analyzed using this novel pipeline. In any case, the complexity of the 5' splice site +4 position now allows, for the first time, interesting interspecies association studies.

      We have now included the variance plots for 3’SS motifs (analogous to the 5’SS variance plots shown in Figure 2B) as Figure 2 supplementary figure 4A, and a traitgram for 3’SS -3C to U ratio as Figure 2 supplementary figure 4B. We have included a short section of text in the Results section to describe these additional findings.

      With the 5' splice site +4 variation identified, the next step was to determine the underlying molecular mechanisms that dictate the evolution of the various sequence preferences. Some obvious players here are the U1 and U6 snRNAs which directly interact with the 5' splice site during splicing. However, no association was found between these snRNAs and the 5' splice site +4 sequence. 

      The powerful interspecies association mapping was then used to determine whether the presence or absence of METTL16 ortholog or a splicing factor correlated with the 5' splice site +4 sequence variation. Interestingly, a clear association was found between METTL16 and the 5' splice site +4 position; METTL16 presence was associated with +4A at the 5' splice site and METTL16 absence was associated with +4U at the 5' splice site. This is an exciting and significant finding.

      We thank the reviewer for these comments on the importance of this study.

      Interestingly, the next most significant association with the 5' splice site +4 position was with SNRNP27K. This result makes sense as in the cryo-EM structure of the pre-B spliceosome complex the C-terminal domain of SNRNP27K is found near the region of the U6 snRNA that will interact with the 5' splices site. Absence of SNRNP27K was associated with an increased preference for +4U at the 5' splice site. Now the real power of the interspecies association mapping was demonstrated by investigating whether any association could be determined specifically within the C-terminus of SNRNP27K. Significantly, the methionine 141 position in SNRNP27K was found to be associated with the +4 position of the 5' splice site. This finding fits nicely with previous studies where mutation of M141 caused a shift in 5' splice site selection away from +4A 5' splice sites, to 5' splice sites without +4A. What is not clear is whether M141 is conserved or invariant between all the species that were compared?

      M141 is not completely conserved across the species that were compared for the SNRNP27K C-terminus analysis. We did not test positions with very strong sequence conservation, because without variation in both the genotype and phenotype it is not possible to test for an association. We have rephrased the relevant Results and Methods sections to make this point clearer. In addition, we have incorporated a sequence logo to illustrate the degree of conservation of each position in the SNRNP27K C-terminal domain as Figure 5 -figure supplement 1A. Finally, we have included an additional box-plot to illustrate the finding that species which have lost SNRNP27K or have only lost the Methionine equivalent to human SNRNP27K position 141, show a similar preference for +4U at 5’ SSs. This is now included as Figure 5 - figure supplement 1B.

      Overall, this result reveals the power of the interspecies association approach and provides interesting and exciting information on the molecular determinants of 5' splice site evolution.

      We are grateful to the reviewer for these comments.

      The final analysis was to investigate the interaction potentials of the U5 and U6 snRNAs with the 5' splice site in the Sacchromycotina genomes and try to relate this to species with fewer introns and less alternative splicing. Species with low intron numbers and low splicing complexity were revealed to have weaker U5 and U6 anti-correlation potentials and favor +4U at the 5' splice site. On the other hand, species with high intron number and presumably higher splicing complexity featured anti-correlated U5 and U6 snRNA interaction potentials and favored +4A 5' splice sites. This extensive analysis provides novel information on the interactions and splice site properties of species with simple and complex splicing. Again, I see why there is emphasis on the 5' splice site here but a similar analysis with the U2 snRNA and the branch site could also be informative.

      We absolutely agree that inter-species association mapping could be applied to other splicing signal phenotypes including 3’ splice sites and intron branchpoints. Accordingly, we raise this subject in the final section of the Discussion. However, branchpoint sequences are challenging to predict with genomic data. Because preliminary analyses suggest independent variation in these other splicing signal phenotypes, we feel a separate focused study is required to properly explain (and substantiate) even the analytical approaches involved. We hope the reviewer would agree that incorporating U2 snRNA and branchpoint variation analyses into this manuscript as well, could detract from the clarity of the conceptual advances that we make here.

      Minor comments

      Should the Title include SNRNP27K?

      There is certainly a case that the title should include SNRNP27K. Our aim was to make the title as short and informative as possible without too many acronyms that need explaining. Since the clearest correlation is with METTL16 and this has broader implications for understanding the role of this enzyme not only in splicing but in possibly modifying other RNA targets too, we think not including SNRNP27K is a suitable compromise. In addition, retaining the current title simplifies the tracking of the manuscript from pre-print through to journal publication.

      Should the title specify that it is the evolution of only the 5’ splice site sequence preference being studied here?

      Because apostrophes in titles can compromise some scholarly online search engines (https://insights.uksg.org/articles/10.1629/uksg.534), we would prefer not to include 5’ in the title.

      Include information on intron number and 5’ splice site interaction potential of U5 and U6 snRNA in the Summary?

      We thank the reviewer for this suggestion. We have updated the Summary to include our findings on U5 and U6 interaction potential in species with reduced intron number.

      Figure 1C is not referred to in the text?

      We apologise for this oversight. We have added references to figure 1C in the appropriate Results section.

      Page 8, line 5 – better to say “splicing signal phenotypes”.

      We have amended this statement on Page 8 and at other places in the text where related phrasing was made.

      What are the other points on Figure 3B? What is the next point below SNRNP27K? Is it U2A’? 

      The other points on Figure 3B represent Orthofinder orthogroups which contain human orthologs that are known components of the spliceosome. The list of spliceosomal components was taken from Sales-Lee et al. 2021. The third most significant point is indeed the orthogroup containing the human ortholog of U2A’. As we state in the text, however, the correlation of U2A’ with the 5’SS+4 A to U ratio phenotype is no longer significant once METTL16 presence/absence is controlled for, indicating that the correlation of U2A’ with the +4A phenotype is likely explained by similarity in the patterns of gene loss of U2A’ and METTL16.

      The second paragraph of the Discussion is vague and lacks a reference. “we could also identify an association with a methionine residue in the conserved C-terminal domain of SNRNP27K orthologs.” There are a few methionines in the C-terminus, which one? Please reference the statement “transcriptome analysis of C. elegans SNRP-27 M141T mutants..”

      We apologise for the lower quality of writing in this section of the Discussion. We have updated the text, made the statements about the SNRNP27K C-terminus less ambiguous, and added the relevant citations as appropriate.

      Reviewer #1 (Significance):

      Overall, this is a well written and clearly presented study that provides some key molecular information on the splicing factors involved in the evolution of 5’ splice sites and shows the power of interspecies association studies. Some important conceptual principles have now been defined for the field going forward.

      With thank the reviewer for this kind comment on the importance of this work.

      The question remains as to whether METTL16 and SNRNP27K are the sole determinants of 5’ splice site preference evolution at +4?

      We cannot say for certain that METTL16 and/or SNRNP27K determine the 5’SS +4 phenotype – only that they are correlated with it. In our response to reviewer 3, and in a new Discussion section, we have detailed some of the scenarios that could explain these correlations. We also cannot rule out whether there are changes in the presence/absence (or domain/sequence-level changes) of other, untested proteins that correlate with the 5’SS +4 phenotype and we allude to this in the final section of the Discussion.

      One splicing factor that immediately comes to mind is Prp8 where there is extensive evidence for involvement in splice site selection and is clearly in the right location throughout splicing to be involved. This question should at least be discussed but Prp8 would also be a very interesting candidate for the interspecies association mapping.

      Prp8 is a core component of spliceosomes and is conserved throughout the Saccharomycotina. For this reason, we were unable to associate splicing phenotypes with Prp8 presence or absence variation at the level of orthogroups. However, we revisited this question posed by the reviewer. Our experience with inter-species association mapping, so far, indicates it works well with orthogroup presence/absence or when straightforward amino acid substitutions can be detected in conserved and hence alignable protein sequence domains. We analysed the conserved U6 snRNA-interacting region of the Prp8 linker domain, which maps close to the 5’ splice site in cryo-EM models, using the profile HMM PF10596 available from Pfam. We found that the majority of this domain was extremely highly conserved with variation in only a few species and positions. The strongest correlation with the +4A to U ratio phenotype was at position 58, which is conserved as a Glycine in all but 8 species (6 Dipodascaceae, 2 CUG-Ser1), that also tend to have a stronger preference for +4A. However, examination of the species contributing to this result (and to similar results at other positions) indicated that in the 6 Dipodascaceae species, this change is part of a larger deletion or replacement that makes the whole linker region align poorly to the model. Hence, the G58 position itself may not be specifically important for the +4 phenotype. Although the wholesale loss or replacement of the U6 snRNA-interacting region in these species is potentially interesting, these larger scale structural changes in a small number of species are difficult to interpret. Therefore, to maintain the focus of the manuscript and the clear links to METTL16 and SNRNP27K that have orthogonal support, we have decided not to add these results to the manuscript but present them here (Figure not available on biorXiv commenting window).

      Also, as mentioned previously, only the 5’ splice site was investigated here and the manuscript could become a more substantial piece of work if the other splice sites were included in some way.

      We agree that it will be exciting to apply this approach to other splicing signal phenotypes and in other phylogenetic clades with emerging tree-of-life-scale genomics data. We have included variation in 3’ splice sites in the revised manuscript. As the first of its kind, this study should pioneer a wider use of this approach, by us and others, to understand the mechanisms and functions of molecular interactions not only in splicing but in other areas of biology too.

      The obvious audience here are those directly in the splicing field but the overall principles are relevant for evolutionary biologists and those studying organismal complexity.

      We thank the reviewer for recognising the broad importance of this work.

      My expertise is in yeast and human splicing mechanisms. I do not have the expertise to critically evaluate the bioinformatic pipelines but they were clearly explained and presented.

      Reviewer #2 (Evidence, reproducibility and clarity):

      In their manuscript, Parker et al. investigate the evolutionary patterns of splice site preference, focusing on the A/U ratio at position A+4 on the 5´ splice site. Building upon prior studies in S. pombe and A. thaliana, the authors establish a strong correlation between this preference and the co-evolution of the METTL16 U6 snRNA methyltransferase. Furthermore, through inter-species association mapping, they identify the involvement of the splicing factor SNRNP27K in altered A/U ratios and highlight the significance of the residue Met-141 in SNRNP27K for this function. Overall, the paper effectively presents impactful new findings on the evolution of METTL16, U6 snRNA, and splicing.

      We thank the reviewer for these kind comments on the importance of our study.

      The computational analyses employed in this study are situated outside our field of expertise, preventing us from offering a comprehensive evaluation of the methodology’s appropriateness and rigor. Nonetheless, the identification of METTL16 through the authors’ methods, which aligns with previous research in S. pombe and A. thaliana, lends support to the validity of their approach. Notably, the close proximity between SNRNP27K and the methylated A43 residue in U6 snRNA within the spliceosome, particularly near Met-141, is an impressive finding. Previous studies have shown that a mutation at position M141T affects splicing at +4A introns, thus providing robust validation for their methods.

      We thank the reviewer for these kind comments on our work.

      The data presented in this study furnish crucial insights into the role of METTL16, U6 snRNA methylation, and splice site recognition. The authors expand upon recent observations that the “vertebrate conserved region” exists in non-vertebrates, despite the absence of primary sequence homology. These results will serve as a valuable guide for future molecular investigations into U6 snRNA methylation and its mechanisms in splicing. Furthermore, the implications of this paper extend to human evolution, as the plasticity in splicing is an essential factor in the evolution of developmental complexity.

      We thank the reviewer for these kind comments.

      Minor suggestions for improvement:

      1. Given the significance of the interaction between U6 snRNA and the intron for understanding the data, it would be beneficial to include a figure illustrating the RNA-RNA base-pairing interactions between U6 snRNA and the 5´ splice site. This addition is particularly important if the paper is intended for publication in a journal with a general readership.

      We thank the reviewer for this excellent suggestion. We have included this as Figure 3A.

      1. Similarly, the section on U1 snRNA would be more comprehensible with the inclusion of U1 RNA-RNA intron diagrams and improved descriptions of both the figures and the assay. Despite being negative data in the supplement, clarifying this section is essential. As currently written, it is challenging to follow.

      We agree that this section is difficult to follow. We have updated the text to improve the readability and included a figure of U1 snRNA:5’SS basepairing as Figure 3 – figure supplement 1A.

      1. Whenever possible, consider increasing the figure and font sizes to enhance readability for readers.

      We agree that some of the more complex figures can be difficult to read when embedded into a Word document/pdf. We hope that providing high-resolution figures for reading online will mitigate this.

      1. In the text, there is no reference to Figure 1C.

      We apologise for this oversight. We have resolved this issue with the appropriate references in the Results text.

      1. In Figure 5B, the y-axis in the top panel is labelled “species,” but the legend only mentions U5/6p as the y-axis. Please revise the legend to include the appropriate information.

      We apologise for the confusion caused by our poorly written legend for this plot. We have updated the legend so that the text clearly refers to either the scatter plot or the marginal histograms.

      Reviewer #2 (Significance):

      The data presented in this study furnish crucial insights into the role of METTL16, U6 snRNA methylation, and splice site recognition. The authors expand upon recent observations that the “vertebrate conserved region” exists in non-vertebrates, despite the absence of primary sequence homology. These results will serve as a valuable guide for future molecular investigations into U6 snRNA methylation and its mechanisms in splicing. Furthermore, the implications of this paper extend to human evolution, as the plasticity in splicing is an essential factor in the evolution of developmental complexity.

      We are grateful to the reviewer for these kind comments on the importance of this work.

      Reviewer #3 (Evidence, reproducibility and clarity):

      In this manuscript, Parker et al present a nice exploration of the evolutionary and mechanistic relationships between 5′ splice site consensus sequences, intron numbers and METTL16/SNRNP27K. By performing inter-species association mapping in Saccharomycotina species, they found that a T in position +4 is strongly associated with the absence of METTL16 (and/or in some cases SNRNP27K or mutations in it). They also provide solid structural modelling data in support of this association.

      In general, I think this is a very nice manuscript. I only have a few comments, which could be addressed by rewording specific parts and/or improving the current figures.

      We are grateful to the reviewer for the kind comments on this work.

      1) As the authors acknowledge, a key issue that cannot be fully resolved in this study is causality between the different events investigated. Overall, the authors are careful about this, but there are some exceptions that should be corrected. Probably the most important is in the abstract, where they write: “We conclude that variation in concerted processes of 5’ splice site selection by U6 snRNA is crucial to evolutionary change in splicing complexity”. I suggest they write something more open (and correct), such as: “We conclude that variation in concerted processes of 5’ splice site selection by U6 snRNA is associated with evolutionary changes in splicing complexity”. Similarly, other plausible scenarios should be discussed in the corresponding Discussion section.

      We agree with the reviewer that it is not possible to infer the causal relationship between METTL16 absence and 5’SS+4 preference change from the current data. We, therefore, apologise for failing to be more careful in the Summary and Introduction. We have reworded these statements to better reflect what we can currently say about the evolutionary relationship between METTL16 and 5’SS sequence preference.

      The correlation between METTL16 absence and 5'SS+4 sequence preference change could most likely be explained by one of several scenarios: (a) sudden loss of METTL16 causes a rapid necessity to change 5'SS sequence preferences. This is unlikely as such rapid change without widespread corresponding 5'SS changes would likely impose a high fitness cost. (b) Changes in 5'SS sequence preference occur first, driven by some other selective pressure, until there is no longer a benefit to retaining the METTL16 gene. (c) Gradual changes in the expression or catalytic efficiency of METTL16 reduce the stoichiometry of U6 snRNA m6A modification, which permits gradual change in 5'SS+4 sequence preference until complete loss of the METTL16 no longer imposes a major fitness cost. As we suggest in the Discussion, future work could examine this question by determining whether the METTL16 orthologs found in Zygosaccharomyces and Eremothecium species, which have altered their 5'SS+4 preference to a U, are expressed and functional. We have updated the Discussion to include a new section that addresses these scenarios.

      2) I do not agree with the statement that "The extent of alternative splicing is the best genomic predictor of developmental complexity". To start with, there are many ways to quantify "extent of alternative splicing" and there are also different types of alternative splicing that might have different prevalence and biological impact. Then, this claim is usually related with exon skipping, which is tightly linked with intron length, and that is likely a better prediction of complexity (yet clearly not causative). My concern is: to what extent has this claim been formally and properly assessed by comparing splicing prevalence with other genomic features, such as intergenic region length, intron length, or average distance between enhancer-promoter interactions (arguably the most relevant predictor, in light of many other studies)? Moreover, I found it a bit misleading to frame the work presented in this study as directly related with developmental (or even splicing) complexity. The work is very interesting on its own, and I doubt their findings on +4 position preference in Saccharomycotina has anything to do with developmental complexity (as the Abstract and Introduction seem to imply).

      On reflection, we agree with the reviewer. Some of our framing of the text isn’t balanced with other studies on the scaling of alternative splicing with developmental complexity. We have edited the Summary and Introduction sections accordingly and cited other references that broaden the consideration of this subject. We are grateful to the reviewer for this suggestion because the changes we make improve the focus of the manuscript since our findings relate more to splicing simplification than to an understanding of increased developmental complexity.

      3) I found Figure 2 and its associated supplementary figure very difficult to follow. I suggest the authors try to improve it and make it clearer. Also, other trees summarizing the results might be helpful. 

      We apologise for the complexity of these figures. We opted to show phylogenetic trees with phenotypes plotted on the y axis, rather than simply trait histograms or box-plots, because the underlying structure of the tree is important for demonstrating that multiple independent changes in the 5’SS phenotype have occurred in the Saccharomycotina. We have tried to improve the comprehensibility of the figures in the following ways: (a) We have added 5’SS sequence motifs to the x-axis of figure 2B to make what the plot represents clearer, (b) as suggested by the reviewer, we have created a pruned tree showing the 5’SS motifs of a selection of Saccharomycotina species, which demonstrates that the changes in 5’SS+4 position preferences seen in S. cerevisiae and C. albicans are likely to be a result of convergent evolution. We have added this tree as Figure 2 - figure supplement 3.

      4) I also found the Results section corresponding to Figure 5B a bit confusing. I would argue (as I think the authors do) that there are two main patterns here: below 500 introns, there is no association, while above 500 introns there is an increasingly negative association (correlation). I think it would help to more explicitly distinguishing these two patterns. Then, for the intron-poor species: is the correlation (or lack of) for species with a T or an A in position +4 different? 

      We do indeed think that there are two patterns here, as indicated by the reviewer. In the previous version of the manuscript, we separated species into those having an overall preference for A at the +4 position, and those having +4U. By showing regression lines for these two classes, rather than for the general relationship between intron number and U5/6rho, we somewhat imply that the switch in +4 base preference might be causing the loss of correlation between U5/6rho and intron number. However, since essentially all species with a 5'SS +4U preference are intron poor, it seems more likely that these trends are the result of a loss of the negative correlation between intron number and U5/6rho in intron poor species, as suggested by the reviewer. To address this issue, we have replaced the regression lines on Figure 6B with a single loess (locally estimated scatterplot smoothing) regression line for all species and updated the text to make it clearer that we think loss of U5/6rho and +4A preference are separate traits of intron poor species. Although this is not exactly what the reviewer requested, we hope that it satisfies their issue with the analysis.

      Reviewer #3 (Significance):

      This is a very interesting study that sheds light on an intriguing evolutionary pattern: the change in consensus sequence at position +4 of the 5' splice site. This topic is relevant since it is closely associated with intron loss and splicing efficiency and evolution. 

      We thank the reviewer for the kind and constructive comments on this study.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, Parker et al present a nice exploration of the evolutionary and mechanistic relationships between 5′ splice site consensus sequences, intron numbers and METTL16/SNRNP27K. By performing inter-species association mapping in Saccharomycotina species, they found that a T in position +4 is strongly associated with the absence of METTL16 (and/or in some cases SNRNP27K or mutations in it). They also provide solid structural modeling data in support of this association.

      In general, I think this is a very nice manuscript. I only have a few comments, which could be addressed by rewording specific parts and/or improving the current figures.

      1. As the authors acknowledge, a key issue that cannot be fully resolved in this study is causality between the different events investigated. Overall, the authors are careful about this, but there are some exceptions that should be corrected. Probably the most important is in the abstract, where they write: "We conclude that variation in concerted processes of 5' splice site selection by U6 snRNA is crucial to evolutionary change in splicing complexity". I suggest they write something more open (and correct), such as: "We conclude that variation in concerted processes of 5' splice site selection by U6 snRNA is associated with evolutionary changes in splicing complexity". Similarly, other plausible scenarios should be discussed in the corresponding Discussion section.
      2. I do not agree with the statement that "The extent of alternative splicing is the best genomic predictor of developmental complexity". To start with, there are many ways to quantify "extent of alternative splicing" and there are also different types of alternative splicing that might have different prevalence and biological impact. Then, this claim is usually related with exon skipping, which is tightly linked with intron length, and that is likely a better prediction of complexity (yet clearly not causative). My concern is: to what extent has this claim been formally and properly assessed by comparing splicing prevalence with other genomic features, such as intergenic region length, intron length, or average distance between enhancer-promoter interactions (arguably the most relevant predictor, in light of many other studies)? Moreover, I found it a bit misleading to frame the work presented in this study as directly related with developmental (or even splicing) complexity. The work is very interesting on its own, and I doubt their findings on +4 position preference in Saccharomycotina has anything to do with developmental complexity (as the Abstract and Introduction seem to imply).
      3. I found Figure 2 and its associated supplementary figure very difficult to follow. I suggest the authors try to improve it and make it clearer. Also, other trees summarizing the results might be helpful.
      4. I also found the Results section corresponding to Figure 5B a bit confusing. I would argue (as I think the authors do) that there are two main patterns here: below 500 introns, there is no association, while above 500 introns there is an increasingly negative association (correlation). I think it would help to more explicitly distinguishing these two patterns. Then, for the intron-poor species: is the correlation (or lack of) for species with a T or an A in position +4 different?

      Significance

      This is a very interesting study that sheds light on an intriguing evolutionary pattern: the change in consensus sequence at position +4 of the 5' splice site. This topic is relevant since it is closely associated with intron loss and splicing efficiency and evolution.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their manuscript, Parker et al. investigate the evolutionary patterns of splice site preference, focusing on the A/U ratio at position A+4 on the 5´ splice site. Building upon prior studies in S. pombe and A. thaliana, the authors establish a strong correlation between this preference and the co-evolution of the METTL16 U6 snRNA methyltransferase. Furthermore, through inter-species association mapping, they identify the involvement of the splicing factor SNRNP27K in altered A/U ratios and highlight the significance of the residue Met-141 in SNRNP27K for this function. Overall, the paper effectively presents impactful new findings on the evolution of METTL16, U6 snRNA, and splicing.

      The computational analyses employed in this study are situated outside our field of expertise, preventing us from offering a comprehensive evaluation of the methodology's appropriateness and rigor. Nonetheless, the identification of METTL16 through the authors' methods, which aligns with previous research in S. pombe and A. thaliana, lends support to the validity of their approach. Notably, the close proximity between SNRNP27K and the methylated A43 residue in U6 snRNA within the spliceosome, particularly near Met-141, is an impressive finding. Previous studies have shown that a mutation at position M141T affects splicing at +4A introns, thus providing robust validation for their methods.

      The data presented in this study furnish crucial insights into the role of METTL16, U6 snRNA methylation, and splice site recognition. The authors expand upon recent observations that the "vertebrate conserved region" exists in non-vertebrates, despite the absence of primary sequence homology. These results will serve as a valuable guide for future molecular investigations into U6 snRNA methylation and its mechanisms in splicing. Furthermore, the implications of this paper extend to human evolution, as the plasticity in splicing is an essential factor in the evolution of developmental complexity.

      Minor suggestions for improvement:

      1. Given the significance of the interaction between U6 snRNA and the intron for understanding the data, it would be beneficial to include a figure illustrating the RNA-RNA base-pairing interactions between U6 snRNA and the 5´ splice site. This addition is particularly important if the paper is intended for publication in a journal with a general readership.
      2. Similarly, the section on U1 snRNA would be more comprehensible with the inclusion of U1 RNA-RNA intron diagrams and improved descriptions of both the figures and the assay. Despite being negative data in the supplement, clarifying this section is essential. As currently written, it is challenging to follow.
      3. Whenever possible, consider increasing the figure and font sizes to enhance readability for readers.
      4. In the text, there is no reference to Figure 1C.
      5. In Figure 5B, the y-axis in the top panel is labeled "species," but the legend only mentions U5/6p as the y-axis. Please revise the legend to include the appropriate information.

      Significance

      The data presented in this study furnish crucial insights into the role of METTL16, U6 snRNA methylation, and splice site recognition. The authors expand upon recent observations that the "vertebrate conserved region" exists in non-vertebrates, despite the absence of primary sequence homology. These results will serve as a valuable guide for future molecular investigations into U6 snRNA methylation and its mechanisms in splicing. Furthermore, the implications of this paper extend to human evolution, as the plasticity in splicing is an essential factor in the evolution of developmental complexity.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      The manuscript by Parker et al addresses the important question of how different organisms have evolved pre-messenger RNA systems that are either more or less complex. This question underlies the evolution of complex organisms and the genome adaptation of simple organisms to their specific environments, so is an important question to answer. This manuscript now provides the underlying molecular mechanisms of how 5' splice site sequence preference may have evolved which is both an interesting and exciting advance for the field.

      Major comments

      This manuscript builds on the previous work from this group where they identified the role of adenosine N6 methylation (m6A) of the U6 small nuclear RNA (snRNA) of the spliceosome by METTL16 as being important for 5' splice site selection. This work led to the speculation that loss of a METTL16 ortholog, or potentially other splicing factors, in some species could contribute to an evolutionary change in 5' splice site sequence preference. Here the authors now use the power of phylogenetics, interspecies association mapping and the available spliceosome structures to provide convincing conclusions that 5' splice site sequence preferences in the extensive number of organisms examined correlate with the presence of the U6 snRNA methyltransferase METTL16 and the splicing factor SNRNP27K.

      An analysis of METTL16 conservation was first carried out by comparing the METTL16 methyltransferase domain (MTD) in 29 diverse eukaryotic species. All the METTL16 orthologs were found to have either one or two globular domains. Three domain types were identified and compared in detail. What was not clear from this analysis was the functional significance of orthologs having either one or two domains. In addition, while this analysis provides important new information on the domain structure of METTL16 orthologs, especially where these domains had not been identified previously, the link between this section of the results and the following sections is not that apparent.

      Next novel bioinformatics pipelines were developed to compare both introns and orthologous groupings of protein coding genes between 227 Sacchromycotina genomes as well as 13 well-annotated eukaryote genomes. First, the 5' splice site sequence preference was compared and clearly indicates that the +4 position has the greatest variation in preferences within the Sacchromycotina. The ability to now compare a large number of genomes has provided novel information on the evolution of the 5' splice site sequence and the conclusion that there is more complexity to the 5' splice site in fungi that previously recognized. While it is apparent why only the 5' splice site signal was investigated here, with its relationship to the U6 snRNA and METTL16, it seems a shame the other splice site sequences were not analyzed using this novel pipeline. In any case, the complexity of the 5' splice site +4 position now allows, for the first time, interesting interspecies association studies.

      With the 5' splice site +4 variation identified, the next step was to determine the underlying molecular mechanisms that dictate the evolution of the various sequence preferences. Some obvious players here are the U1 and U6 snRNAs which directly interact with the 5' splice site during splicing. However, no association was found between these snRNAs and the 5' splice site +4 sequence.

      The powerful interspecies association mapping was then used to determine whether the presence or absence of METTL16 ortholog or a splicing factor correlated with the 5' splice site +4 sequence variation. Interestingly, a clear association was found between METTL16 and the 5' splice site +4 position; METTL16 presence was associated with +4A at the 5' splice site and METTL16 absence was associated with +4U at the 5' splice site. This is an exciting and significant finding.

      Interestingly, the next most significant association with the 5' splice site +4 position was with SNRNP27K. This result makes sense as in the cryo-EM structure of the pre-B spliceosome complex the C-terminal domain of SNRNP27K is found near the region of the U6 snRNA that will interact with the 5' splices site. Absence of SNRNP27K was associated with an increased preference for +4U at the 5' splice site. Now the real power of the interspecies association mapping was demonstrated by investigating whether any association could be determined specifically within the C-terminus of SNRNP27K. Significantly, the methionine 141 position in SNRNP27K was found to be associated with the +4 position of the 5' splice site. This finding fits nicely with previous studies where mutation of M141 caused a shift in 5' splice site selection away from +4A 5' splice sites, to 5' splice sites without +4A. What is not clear is whether M141 is conserved or invariant between all the species that were compared? Overall, this result reveals the power of the interspecies association approach and provides interesting and exciting information on the molecular determinants of 5' splice site evolution.

      The final analysis was to investigate the interaction potentials of the U5 and U6 snRNAs with the 5' splice site in the Sacchromycotina genomes and try to relate this to species with fewer introns and less alternative splicing. Species with low intron numbers and low splicing complexity were revealed to have weaker U5 and U6 anti-correlation potentials and favor +4U at the 5' splice site. On the other hand, species with high intron number and presumably higher splicing complexity featured anti-correlated U5 and U6 snRNA interaction potentials and favored +4A 5' splice sites. This extensive analysis provides novel information on the interactions and splice site properties of species with simple and complex splicing. Again, I see why there is emphasis on the 5' splice site here but a similar analysis with the U2 snRNA and the branch site could also be informative.

      Minor comments

      Should the Title include SNRNP27K?

      Should the title specify that it is the evolution of only the 5' splice site sequence preference being studied here?

      Include information on intron number and 5' splice site interaction potential of U5 and U6 snRNA in the Summary?

      Figure 1C is not referred to in the text?

      Page 8, line 5 - better to say "splicing signal phenotypes".

      What are the other points on Figure 3B? What is the next point below SNRNP27K? Is it U2A'?

      The second paragraph of the Discussion is vague and lacks a reference. "we could also identify an association with a methionine residue in the conserved C-terminal domain of SNRNP27K orthologs." There are a few methionines in the C-terminus, which one? Please reference the statement "transcriptome analysis of C. elegans SNRP-27 M141T mutants.."

      Significance

      Overall, this is a well written and clearly presented study that provides some key molecular information on the splicing factors involved in the evolution of 5' splice sites and shows the power of interspecies association studies. Some important conceptual principles have now been defined for the field going forward.

      The question remains as to whether METTL16 and SNRNP27K are the sole determinants of 5' splice site preference evolution at +4? One splicing factor that immediately comes to mind is Prp8 where there is extensive evidence for involvement in splice site selection and is clearly in the right location throughout splicing to be involved. This question should at least be discussed but Prp8 would also be a very interesting candidate for the interspecies association mapping.

      Also, as mentioned previously, only the 5' splice site was investigated here and the manuscript could become a more substantial piece of work if the other splice sites were included in some way.

      The obvious audience here are those directly in the splicing field but the overall principles are relevant for evolutionary biologists and those studying organismal complexity.

      My expertise is in yeast and human splicing mechanisms. I do not have the expertise to critically evaluate the bioinformatic pipelines but they were clearly explained and presented.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1*. This is a good paper dealing with gap of our knowledge in understanding reason of ICB failures. Subject being difficult it is expected that the design and content of such experiment will be complex.But the authors forget practicality of readers attention and making paper apear interesting. They need to organise and may be classify the varied information in such a way that reader can find a rhythm in excavating data more easily. It appears confusing at time, so they may try to make it more simple. In this way they may concentrate more on methods and classify results too. A thorough revision is suggested, to make it consize. *

      __Authors’ answer: __We thank the Reviewer for his positive evaluation and constructive feedback. We appreciate the complexity of single-cell RNA-sequencing analyses. In order to simplify our manuscript, our revised manuscript now focuses on the transitional states of tumor-resident and circulating T cells found in ovarian cancer patients. Our study is timely as it is the first to report the developmental relationship of TILs in ovarian cancer. We substantially edited our manuscript to make it clear that our findings suggest a gradual acquisition of the exhaustion program initiated by effector-like cells (cluster CD8_GZMH) that eventually gives rise to more terminal states with features of tissue residency and chemotaxis (clusters CD8_CCL4, CD8_XCL1, and CD8_CXCL13). We also include new analyses revealing the presence and proportion of these T cell states in different cancer patients (New Fig. 4A-B), and how these T cell states associate with clinical responses to immune checkpoint blockade (ICB). We hope the Reviewer will find our revised manuscript easier to read.

      Reviewer #2. I think the first half of the article, in which the GZMH-CD8 cluster is considered to be in an intermediate state of transition to exhaustion, is interesting, and I feel that the single-cell seq and TCR data are well analyzed to make the point. On the other hand, I feel that the latter part of the paper may not be anything more than a hypothesis. In particular, the part claiming that it is related to prognosis or applicable to the prediction of the effect of ICB is insufficient, since their gene signature is not described in detail and the contents of the Figure are not mentioned in the manuscript. In the latter part, the effects of GPR184 and 25-HC, or the effects of IL21, would require experiments to verify (to verify whether the addition of chemokine or the inhibition of the receptor changes the specific CD8 population).

      Author’s answer: Thank you for discussing the limitation of the signature employed. We agree with the reviewer’s comment. Old Figure 5 has been removed from the revised manuscript.

      Reviewer #2. Minor point: In particular, there is little mention of Figure 5 in the text, making it difficult to understand.

      Author’s answer: Thank you for your comment. As we previously discussed, we have removed Figure 5 from the revised manuscript. The method used to generate the signature was found to be inappropriate.

      Reviewer #2. The latter part is difficult to understand. To begin with, it is already known that ovarian cancer does not contribute much to ICB, so what does it mean to analyze the CD8 population, which is known as a marker of ICB response in other carcinomas, as an indicator? Especially for clinicians like us, it is hard to imagine that the results will lead to clinical trials that will attempt to sort out the population that ICB is favored in.

      Author’s answer: Although immune checkpoint blockade has demonstrated limited effectiveness against ovarian cancer, subset analyses suggest superior efficacy for some patients and according to subtype. Combination anti-PD-1/CTLA-4 therapy for instance achieved response rates up to 31% (Zamarin et al., 2020), and superior benefit for single agent PD-1 blockade has been reported in clear cell ovarian cancer. Moreover, encouraging clinical results have recently been reported in studies exploring combinations with PARP and VEGF inhibitors. As example, interim analysis of the phase 3 DUO-O trial (NCT03737643) showed a statistically significant and clinically meaningful improvement in PFS in patients with newly diagnosed advanced ovarian cancer without a BRCA1/2 mutation (Harter et al., 2023).

      Our study aimed to better understand how ovarian tumor-infiltrating T cells acquire their exhaustion program after migrating from the periphery and whether these mechanisms are unique or shared amongst cancer types. Recent studies in other cancer types had shown the dynamics of T cells and demonstrated the clonal replacement of intratumoral T cells after ICB and emphasized the role of peripheral clones in this process (Wu et al., 2020; Yost et al., 2019). In lung cancer, it has been proposed a transitional state between precursor and terminally differentially cells (Gueguen et al., 2021). Our study demonstrates, for the first time in ovarian cancer, the presence of similar transitional states of CD8 T cells. Our revised manuscript also now includes new data revealing that pre-effector GZMK- and intermediary GZMH-expressing CD8 cells are better biomarkers of ICB response than terminally differentiated XCL1 and CXCL13 expressing CD8 T cells (New Figure 4). Altogether, our study provides important and novel insights on the development of tumor-infiltrating T cells in ovarian cancer patients, which may serve to better select ovarian cancer patients for ICB therapy.

      Reviewer #2. Since the first half of the study is very interesting, we feel that it is more important to confirm the mechanism of exhaustion from the blood via the intermediate (GZMH_CD8), including functional experiments. Also, as a clinician, we are very interested in the perspective of whether some of the fractions identified in this study are different in proportion in different patients and whether they correlate with the clinical course of the disease since the study only analyzed a sample of 5 patients.

      Author’s answer: We thank the reviewer for proposing to extend our analysis. As suggested, our revised manuscript now includes new analyses which reveals the different proportions of our identified T cells states in different cancer patients (New Figure 4). We further investigated whether these T cell states associate with clinical responses and observed that pre-effector GZMK- and intermediary GZMH-expressing CD8 T cells are better biomarkers of ICB response than terminally exhausted XCL1- and CXCL13-expressing CD8 T cells (New Figure 4).

      Reviewer #3. Question 1: Whether the distribution patterns of CD4+ and CD8+ T cell clusters in Figure 1B were comparable among the 5 patient samples? Whether the proportion of five types of clones in Figure 3C are comparable among the 5 patient samples?

      Author’s answer: Thank you for the question. We included the results to answer these questions in the supplementary material (fig. S1C-D). For each patient, we calculated the proportion of a cluster among T cells in the blood or tumor. As observed in the boxplot (fig. S1C), the proportion of some subsets were higher in certain patients, such as the higher proportion of CD8_GZMK in the tumor of patient p09454. A recent study classified patients’ tumors based on the spatial distribution of CD8 T cells and performed scRNA-seq to identified cell subsets enriched in the groups inflamed/infiltrated (characterized by the distribution of CD8 T cells within the tumor epithelium), excluded (infiltrating CD8 T cells are restricted to the tumor stroma) or desert (T cells are not present or have low frequency) (Hornburg et al., 2021). Interestingly, this subset of CD8_GZMK cells were enriched in desert tumors, suggesting that the difference we observed in our dataset might reflect the spatial distribution of CD8 T cells in patient p09454. Regarding the TCR-seq data, the frequency of the five types of clones was different among patients. To show this data, we included a barplot (fig. S2D), showing for example, a higher proportion of tumor-expanded clones in patient p10329.

      Reviewer #3. Question 2: In Figure S2C, only a very small number of cells in the CD8-GZMK K-22 population. Are these cells representative? Do they generally exist in multiple samples or only in one sample?

      Author’s answer: Thank you for your comment. The subcluster k_22 indeed has a smaller number of cells compared to other subclusters. Nevertheless, the K_22 cluster was found in every patient and in every healthy donor. To clarify, we edited our revised manuscript to include a statement that cluster k_22 was composed of fewer cells compared to other clusters.

      Reviewer #3. Question 3: In the Fig.S6 legend, the authors stated "Our results suggest the differentiation of cluster CD8-GZMK into the effector-like subset CD8-GZMH." However, there seems to be no corresponding analysis in the main text to support this conclusion.

      Author’s answer: We appreciate your attention to this statement. We agree the results of our study doesn’t sustain this statement and so we have excluded it in the revised manuscript.

      Reviewer #3____. Question 4: Is there more detailed clinical information that can be provided for the 5 patients included in the study? Per the methods all patients were receiving debulking surgery and were treatment naïve, but did they differ in stage, age, comorbidities, etc.?

      Author’s answer: Thank you for your comment on this. We have included a table with clinical information on the stage, age, and menopause status of the five patients.

      Reviewer #3. Question 5: Were any cells included for sequencing from adjacent 'normal' tissue uninvolved with tumor (these samples are from surgical debulking of primary tumors, which may include such areas of non-involved tissue.) While shared TCR clonotypes between blood and intratumoral T cells strongly suggests the tumor-resident populations are recruited from the blood, the degree of sharing with normal tissue-resident T cells would be of interest as well.

      Author’s answer: Thank you for your comment. Samples were provided for sc-RNA-seq after pathology review and validation of tumor histology. We did not perform sc-RNA-seq on normal adjacent tissue (NAT) We agree this would be interesting as a follow up study, since in other cancer types (renal, colon and lung) it has been demonstrated that T clones expanded in the tumor and NAT are also present in peripheral blood (Wu et al., 2020).

      Reviewer #3. Question 6: Very little is discussed about HGSOC itself in the main text (eg clinical background, prior literature on the composition of infiltrating immune populations and potential reasons for at best modest poor responses to IO) until the first sentence of the discussion. As the entirety of the new data produced in this study is from HGSOC tumors there should be more focus on this tumor type and conversation with the prior literature on it (mainly from prior studies on the immune environment of HGSOC). Further, how distinct do the authors suspect the cell populations found in their study to be to ovarian as opposed to other epithelial tumor types?

      Author’s answer: Thank you for the suggestion. We now included more background information on immunotherapy of HGSOC. Specifically, we added the following paragraph in our introduction: “In ovarian cancer, the presence of both T and B cells improves patients' survival (Nelson, 2015; Nielsen et al, 2012). They are usually organized in lymphoid aggregates ranging from a small group of cells to a well-organized TLS (Kroeger et al, 2016). Organized TLSs correlate with better survival, such as observed in patients treated with ICB. Although immunotherapy has demonstrated limited effectiveness against ovarian cancer, subsets of patients may thus benefit from ICB. In support of this, combination anti-PD-1/CTLA-4 therapy can achieve response rates above 30% (Zamarin et al., 2020), and encouraging clinical results have recently been reported when combining ICB with with PARP and VEGF inhibitors (Harter et al., 2023)”.

      Reviewer #3. Question 7: Were the signature genes used for analysis in figure 5 remove chosen in a formal, unbiased manner, or simply hand-picked as representative of the respective cell types? This information is not provided in the supplement.

      Author’s answer: Another reviewer has also expressed similar concerns. The genes selected to represent cell types were chosen manually, which we acknowledge is not the best method for defining a signature. As a result, we have decided to exclude Figure 5 from the manuscript under review. We believe an unbiased approach is more suitable for characterizing the cell network proposed in our study.

      Reviewer #3. Question 8: While the NicheNet analysis of potential interactions among lymphocyte populations raises some strong hypotheses, it would be interesting to extend the interaction analysis to all CD45+ populations, given the sequencing was done on CD45+ immune cells.

      Author’s answer: Thank you for suggesting analysis. We have included the results of cell interaction including all CD45+ cells (fig. S3). We observed CD40L as one of the top predicted ligands highly expressed in CD4_CXCL13 subset mediating a response in subsets of antigen-presenting cells, such as B cells (cluster B), plasma cells (cluster PC_2), and plasmacytoid dendritic cells (cluster pDC). Interestingly, this result also support the hypothesis of Tfh-like cells (cluster CD4_CXCL13) coordinating the action of intratumoral immune cells involved in the antitumor immune response.

      Reviewer #3. Question 9: A sample size of 5 patients is relatively small for current single cell RNAseq studies of human tumor patients.

      Author’s answer: We agree with the reviewer that a sample size of 5 patients is relatively small. Thus, to validate our results in other patients, we included in the reviewed manuscript the analysis of scRNA-seq of 47 patients across10 cancer types (dataset from (Zheng et al., 2021). As demonstrated in figure 3 and figure 5, we could identify subsets of CD8 and CD4 T cells from our ovarian cancer patients in those 10 cancer types dataset.

      Reviewer #3.____ Minor

      *1. In lines 96-97, "CD8-GZMB" was mentioned twice in the description. *

      2. In line 126, this section did not discuss residency markers, yet a conclusion about residency was made in this sentence.

      Author’s answer: We appreciate you bringing these errors to our attention. We fixed them in the updated version of the manuscript.

      References:

      Gueguen, P., Metoikidou, C., Dupic, T., Lawand, M., Goudot, C., Baulande, S., … Amigorena, S. (2021). Contribution of resident and circulating precursors to tumor-infiltrating CD8 T cell populations in lung cancer. Science Immunology, Vol. 6, p. eabd5778. doi:10.1126/sciimmunol.abd5778

      Harter, P., Trillsch, F., Okamoto, A., Reuss, A., Kim, J.-W., Rubio-Pérez, M. J., … Aghajanian, C. (2023). Durvalumab with paclitaxel/carboplatin (PC) and bevacizumab (bev), followed by maintenance durvalumab, bev, and olaparib in patients (pts) with newly diagnosed advanced ovarian cancer (AOC) without a tumor BRCA1/2 mutation (non-tBRCAm): Results from the randomized, placebo (pbo)-controlled phase III DUO-O trial. Journal of Clinical Orthodontics: JCO, 41(17_suppl), LBA5506–LBA5506.

      Hornburg, M., Desbois, M., Lu, S., Guan, Y., Lo, A. A., Kaufman, S., … Wang, Y. (2021). Single-cell dissection of cellular components and interactions shaping the tumor immune phenotypes in ovarian cancer. Cancer Cell. doi:10.1016/j.ccell.2021.04.004

      Wu, T. D., Madireddi, S., de Almeida, P. E., Banchereau, R., Chen, Y.-J. J., Chitre, A. S., … Grogan, J. L. (2020). Peripheral T cell expansion predicts tumour infiltration and clinical response. Nature. doi:10.1038/s41586-020-2056-8

      Yost, K. E., Satpathy, A. T., Wells, D. K., Qi, Y., Wang, C., Kageyama, R., … Chang, H. Y. (2019). Clonal replacement of tumor-specific T cells following PD-1 blockade. Nature Medicine. doi:10.1038/s41591-019-0522-3

      Zamarin, D., Burger, R. A., Sill, M. W., Powell, D. J., Jr, Lankes, H. A., Feldman, M. D., … Aghajanian, C. (2020). Randomized Phase II Trial of Nivolumab Versus Nivolumab and Ipilimumab for Recurrent or Persistent Ovarian Cancer: An NRG Oncology Study. Journal of Clinical Oncology: Official Journal of the American Society of Clinical Oncology, 38(16), 1814–1823.

      Zheng, L., Qin, S., Si, W., Wang, A., Xing, B., Gao, R., … Zhang, Z. (2021). Pan-cancer single-cell landscape of tumor-infiltrating T cells. Science, 374(6574), abe6474.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Below are some questions as well as suggestions for revision and to strengthen the manuscript.

      Major:

      1. Whether the distribution patterns of CD4+ and CD8+ T cell clusters in Figure 1B were comparable among the 5 patient samples? Whether the proportion of five types of clones in Figure 3C are comparable among the 5 patient samples?
      2. In Figure S2C, only a very small number of cells in the CD8-GZMK K-22 population. Are these cells representative? Do they generally exist in multiple samples or only in one sample?
      3. In the Fig.S6 legend, the authors stated "Our results suggest the differentiation of cluster CD8-GZMK into the effector-like subset CD8-GZMH." However, there seems to be no corresponding analysis in the main text to support this conclusion.
      4. Is there more detailed clinical information that can be provided for the 5 patients included in the study? Per the methods all patients were receiving debulking surgery and were treatment naïve, but did they differ in stage, age, comorbidities, etc.?
      5. Were any cells included for sequencing from adjacent 'normal' tissue uninvolved with tumor (these samples are from surgical debulking of primary tumors, which may include such areas of non-involved tissue.) While shared TCR clonotypes between blood and intratumoral T cells strongly suggests the tumor-resident populations are recruited from the blood, the degree of sharing with normal tissue-resident T cells would be of interest as well.
      6. Very little is discussed about HGSOC itself in the main text (eg clinical background, prior literature on the composition of infiltrating immune populations and potential reasons for at best modest poor responses to IO) until the first sentence of the discussion. As the entirety of the new data produced in this study is from HGSOC tumors there should be more focus on this tumor type and conversation with the prior literature on it (mainly from prior studies on the immune environment of HGSOC). Further, how distinct do the authors suspect the cell populations found in their study to be to ovarian as opposed to other epithelial tumor types?
      7. Were the signature genes used for analysis in figure 5 chosen in a formal, unbiased manner, or simply hand-picked as representative of the respective cell types? This information is not provided in the supplement.
      8. While the NicheNet analysis of potential interactions among lymphocyte populations raises some strong hypotheses, it would be interesting to extend the interaction analysis to all CD45+ populations, given the sequencing was done on CD45+ immune cells.
      9. A sample size of 5 patients is relatively small for current single cell RNAseq studies of human tumor patients.

      Minor

      1. In lines 96-97, "CD8-GZMB" was mentioned twice in the description.
      2. In line 126, this section did not discuss residency markers, yet a conclusion about residency was made in this sentence.

      Significance

      In this manuscript titled "Predicting Developmental Relationships of Tumor-Resident and Circulating T Cells in Ovarian Cancer," Carneiro and colleagues employed single-cell transcriptomics and T cell receptor profiling of immune cells sorted from paired peripheral blood and tumor tissue in a small cohort of ovarian cancer patients to investigate the developmental relationships of T cell populations and their potential interactions. They identified a possible differentiation pathway involving GZMH-expressing CD8+ T cells that progresses towards tissue residency and exhaustion. The researchers suggested the effector function of intermediate GZMH-expressing CD8+ T cells could be sustained through interaction with CXCL13-expressing CD4+ Tfh-like cells. Moreover, they proposed that CD4+ Tfh-like cells could attract GPR183-expressing pre-effector GZMK-expressing CD8+ T cells and plasmacytoid dendritic cells via the production of 7α,25 dihydroxycholesterol (7α,25-HC). Ultimately, the study hypothesized that CD4+ Tfh-like cells expressing IL-21 among other molecules might enhance antitumor immunity against ovarian tumors by coordinating the actions of multiple immune populations. Strengths of the study include detailed, combined analysis of inferred developmental trajectories via shared TCR clonotypes across tissue as well as potential crosstalk between cellular populations, as well as association of signature genes with clinical outcomes. Weaknesses include a small number of patients and the dataset being limited only to single cell RNAseq and thus providing descriptive findings without functional validation or perturbation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: This study used single-cell transcriptomics and T cell receptor profiling to identify the developmental relationships of T cell populations in ovarian cancer patients. The researchers proposed a model of differentiation pathway that showed how an intermediate GZMH-expressing CD8 T cell subset progressively reinforces exhaustion and tissue residency programs towards terminally exhausted cells. Then they also focus on the nature of TPEX, dual-expanded clone, which is considered an important indicator for the efficacy of ICB, and argue that it is strongly related to GPR183, 25-OHC, and IL21. Based on the analysis of clinical samples, they argue that their proposed gene signature may also be prognostically relevant and predictive of ICB efficacy.

      Major comment: I think the first half of the article, in which the GZMH-CD8 cluster is considered to be in an intermediate state of transition to exhaustion, is interesting, and I feel that the single-cell seq and TCR data are well analyzed to make the point. On the other hand, I feel that the latter part of the paper may not be anything more than a hypothesis. In particular, the part claiming that it is related to prognosis or applicable to the prediction of the effect of ICB is insufficient, since their gene signature is not described in detail and the contents of the Figure are not mentioned in the manuscript. In the latter part, the effects of GPR184 and 25-HC, or the effects of IL21, would require experiments to verify (to verify whether the addition of chemokine or the inhibition of the receptor changes the specific CD8 population).

      Minor point: In particular, there is little mention of Figure 5 in the text, making it difficult to understand.

      Significance

      It is interesting to note that the authors simultaneously analyze immune cells in the blood and in the tumor, and examine in detail what is characteristic of the blood, what is characteristic of the tumor, and what is seen in both. And it is very interesting that they specifically proposes an intermediate group that is recruited from the blood to the tumor and is in the process of becoming exhausted. I am sure there are many studies on TILs and TLSs, but this study would be helpful to understand how they are concentrated locally (near the tumor) in comparison with immune cells in the blood as well.

      However, the latter part is difficult to understand. To begin with, it is already known that ovarian cancer does not contribute much to ICB, so what does it mean to analyze the CD8 population, which is known as a marker of ICB response in other carcinomas, as an indicator? Especially for clinicians like us, it is hard to imagine that the results will lead to clinical trials that will attempt to sort out the population that ICB is favored in.

      Since the first half of the study is very interesting, we feel that it is more important to confirm the mechanism of exhaustion from the blood via the intermediate state, including functional experiments. Also, as a clinician, we are very interested in the perspective of whether some of the fractions identified in this study are different in proportion in different patients and whether they correlate with the clinical course of the disease, since the study only analyzed a sample of 5 patients.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This is a good paper dealing with gap of our knowledge in understanding reason of ICB failures. Subject being difficult it is expected that the design and content of such experiment will be complex.But the authors forget practicality of readers attention and making paper apear interesting.

      They need to organise and may be classify the varied information in such a way that reader can find a rhythm in excavating data more easily.

      It appears confusing at time, so they may try to make it more simple.

      In this way they may concentrate more on methods and classify results too.

      A thorough revision is suggested, to make it consize.

      Significance

      This is a good paper dealing with gap of our knowledge in understanding reason of ICB failures. Subject being difficult it is expected that the design and content of such experiment will be complex.But the authors forget practicality of readers attention and making paper apear interesting.

      They need to organise and may be classify the varied information in such a way that reader can find a rhythm in excavating data more easily.

      It appears confusing at time, so they may try to make it more simple.

      In this way they may concentrate more on methods and classify results too.

      A thorough revision is suggested, to make it consize.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript entitled "Aurora A mediated new phosphorylation of RAD51 is observed in Nuclear Speckles", the authors unveil the Serine S97 as a novel phosphorylation site of the RAD51 recombinase and that this phosphorylation is mediated by the Aurora A kinase using a set of in vitro and in cellulo experiments. The authors also describe this phosphorylation being in the nucleus specifically in nuclear speckles where mRNA maturation and splicing occurs suggesting a role of RAD51 in the latter. The confocal microscopy images provided to test this hypothesis are convincing. However, using confocal images as well, the authors claim that RAD51 phosphorylated at S97 foci do not colocalize with the DNA damage marker -H2AX, hence a function not related to DNA damage, however the data provided does not fully support this statement. In this study, Alaouid et al, utilize mutants of RAD51 that alter S97 phosphorylation to further study its function and provide data that support RAD51 as an RNA binding protein. Overall, the manuscript shows some interesting observations that are worth pursuing however the in vitro and in cellulo results are not aligned, lack some controls, and many points should be reconsidered.

      Major comments:

      • Are the key conclusions convincing?

      Not as stated.

      Fig. 1A. The authors conclude that pS97-RAD51 favors RAD51 strand invasion capacity using the D-loop assay. Indeed, the S97D phosphomimic increased the D-loop activity 2.5-fold compared to WT RAD51. However, the S97A mutant, which is the non-phosphorylated form also increased the D-loop activity by 2-fold compared to WT (figure 1C). So, the phosphorylation or the absence of it seem to promote strand invasion. So, what is the role of the phosphorylation? There is no discussion about this. Besides, no representative image of the D-loop assay is shown, this is very important as these experiments need to be run with the relevant controls to be meaningful.

      Fig. 1D. The polymerization rate of RAD51 is probably irrelevant for its function in the absence of DNA. What do they want to get at with this assay?

      In figure 2B, the authors conclude that RAD51 phosphorylation at S97 is dynamically regulated throughout the cell cycle. Indeed, the pS97-RAD51 is well observed in asynchronous cells, and the double thymidine block time course experiment followed by PI staining shows the oscillation of the pS97-RAD51 from G1 to G2/M stage. The authors quantified the ratio of pS97-RAD51/total RAD51 to conclude this. However, it would be more accurate to also divide the above over the intensity of the loading control (tubulin) because in figure 3A for example, they quantified the ratio of pS97-RAD51/tubulin but did not consider the levels of RAD51 in their quantifications.

      In figure 3B, the authors state that pS97-RAD51 is decreased after CPT treatment and that the pS97-RAD51 foci do not localize with the DNA damage marker -H2AX. The signal of gH2AX is already weird as it does not change from Ctrl to CPT conditions (especially in HCC1806 cells). A pre-extraction of soluble protein with CSK should be used to then look at the co-localization, with the pan-staining of the two signals is difficult to draw any conclusions of colocalization. Nevertheless, the signal of RAD51 seems equal in all conditions in the images shown and it does not seem to be reduced after CPT.

      In figure 4A, the authors show that Aurora A is responsible for the S97-RAD51 phosphorylation in cellulo. Indeed, the use of an Aurora A inhibitor reduces the pS97-RAD51 signal, however, this is only true in one cell line (HCC1806) but no effect was observed in HeLa cells. Is this effect cell-specific?

      The authors find that RAD51 binds both DNA and RNA and measure the affinities of the RAD51 bearing the S97D and S97A mutations. S97D shows the highest affinity for ssDNA and RNA in Fig. 7A, B, however the opposite is true for dsDNA in Fig 7C, D. All three forms of RAD51 bind RNA although with different affinities however no error bars are shown. The description of the results does not seem accurate. Importantly, these data should somehow correlate/be discussed with respect to the D-loop assay performed in Fig. 1. The authors conclude that the binding to RNA is reduced in S97D-RAD51 suggesting that the pRAD51 that they observe at nuclear speckles would be probably not associated with RNA at these nuclear speckles, right? this goes against their idea of this phosphorylated form being related to RNA splicing... - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The manuscript seems to be in early days and requires lots of editing, rewriting to relate the in vitro and in cell data and make a coherent story - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      The authors performed chromatin fractionation to determine the correct localization of the pS97-RAD51 and looked for the phosphorylated form by western blots. But then they confirmed the finding using immunofluorescence. I think it would be more convincing and consistent if the authors do a pre-extraction before the use the antibody because as such, they would be indeed confirming the localization of the protein they are looking at that is specifically in the nucleus.

      As well, in order to test the specificity of the pS97-RAD51 antibody they generated, a simple treatment of the lysates with phosphatases would be a good control for the specificity of their antibody These and the critics mentioned above need to be address. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      This manuscript is not ready for submission - Are the data and the methods presented in such a way that they can be reproduced?

      Yes. However, the legends of the images are way too concise. - Are the experiments adequately replicated and statistical analysis adequate?

      In Fig. 2B, the authors performed a double thymidine block followed by a time course release to track cell cycle progression of the cells and phosphorylation of RAD51 at S97. They do not indicate the biological replicates they performed. There are no error bars in the estimated KD shown in Fig.7.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      The authors conclude that the S97 is specifically phosphorylated by the Aurora A kinase. How? Have they looked at other documented kinases known to phosphorylate RAD51?

      In figure 6 the authors overexpress HA-tagged RAD51 proteins corresponding to WT, S97D and S97A mutants in cells and label them for immunofluorescence. Maybe it would be better to downregulate the endogenous RAD51 to discard possible combined effects.

      In figure - Are prior studies referenced appropriately?

      The authors show in their manuscript that RAD51 protein CAN interact with RNA in vitro, a finding not previously described to my knowledge. However, a recent study entitled "RAD51-dependent recruitment of TERRA lncRNA to telomeres through R-loops, Nature, 2020" provides in vitro data showing the binding of RAD51 to TERRA, a LncRNA, which I think would be worth mentioning their manuscript.

      The authors should mention previous contributions in the field especially when it comes to RAD51 in the HR pathway post DNA damage, which is quite documented and updated. For example, in this section of the introduction, "RAD51 is a recombinase protein implicated in the strand exchange mechanism during the DSB repair by the Homologous Recombination (HR) pathway. In the absence of DNA Damage (DD), RAD51 is predominantly cytoplasmic and translocates to the nucleus during the DNA Damage Response (DDR) to manage HR repair. As it needs the undamaged sister chromatid as a template, the HR repair pathway occurs mainly in the late S, G2 phases of the cell cycle. However, it has been documented that HR repair can also occur during G1 and early S phases, and in this case, the undamaged template used for the repair could be the homologous chromosome or an RNA transcript2". This statement is definitely worth more references.

      The same problem is recurrent in the rest of the introduction; therefore, it needs to be updated and better referenced. - Are the text and figures clear and accurate?

      The text needs a lot of editing to accurately describe the results, see for example: "The resulting KD evaluation shows that the S97D mutant had a dsDNA binding affinity lower to that of the WT (a KD of 2.26 μM for the S97D-RAD51 vs a KD of 0.38 μM for the WT RAD51). Concerning, the S97A mutant comparison to the WT RAD51, we observed modified association and dissociation curves that resulted in an identical affinity to dsDNA (a KD of 0.33 μM for the S97A-RAD51 vs a KD of 0.38 μM for the WT RAD51). We can conclude that in our in vitro conditions, the Ser97 phosphorylation has a high impact on RAD51 affinity for DNA by dividing its affinity by 5.8." Besides, the figures are of low quality and should be more carefully crafted and presented. Some experiments (such as the D-loop) are not represented in the figures.<br /> - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      Using a different representation for the graphs would be a plus (also see previous comments)

      Referees cross-commenting

      I think the other reviewers and I have raised very important and complementary points that will help the authors improve the quality of the manuscript substantially.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      The discovery of a new phosphorylation site in RAD51 (S97) by Aurora A is potentially interesting for the field of the maintenance of genome stability as it could broaden the understanding of how such an important recombinase may be regulating the maintenance of genome integrity throughout the cell cycle. Also, the idea of RAD51 being involved in splicing and mRNA maturation seems very attractive and a very important conceptual advance. However, given the premature status of the text and the figures, the manuscript falls short to show convincing evidence. - Place the work in the context of the existing literature (provide references, where appropriate).

      Many works are highlighting the role of RNA binding proteins as an integral part of the DNA damage response. In addition, a wealth of evidence in the literature suggest that many DNA repair proteins are RNA binding proteins, and that RNA is an important player in the DDR. The possible finding that RAD51 interacts with RNA and localize to nuclear speckles possibly acting in splicing is very interesting and attractive. How is Aurora A involved in this, what is the trigger, and whether RAD51 is binding RNA at these sites is still unclear. - State what audience might be interested in and influenced by the reported findings.

      Labs working in genome integrity mechanisms and the crosstalk between transcription and DNA repair would be interested. - Define your field of expertise with a few keywords to help the authors contextualize your point of view.

      Genome Instability, homologous recombination

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Homologous recombination is central to stabilizing the genome where Rad51 recombinase plays a pivotal role. Authors found unexpected localization of Rad51 to nuclear speckles. This localization is associated with a novel phosphorylation site of Rad51 at Ser97, which is phosphorylated by Aurora A. Because nuclear speckles are where RNA maturation occurs, authors argue for the possible involvement of Rad51 in modulating splicing, a previously unsuspected role for this important recombinase.

      Major points:

      1. The discoveries made in this paper heavily rely on the Phospho-S97 specific antibody (PS97 antibody). The biggest concern of this reviewer is that the validation of this material is not rigid enough. The specificity of this antibody against PS97 is validated only by PS97 peptide competition. The outcome is not convincing either; the PS97 signal seems quite resistant to Si-RAD51 in Fig2A. Furthermore, in Fig2B, PSer97 signal seems rather constant throughout the cell cycle while Rad51 signal fluctuates.

      These observations make this reviewer wonder if the authors are really detecting the phosphorylation of Rad51 with this material (i.e., PSer97 antibody).

      I suggest the authors validate this antibody by doing the following experiments:

      1-1. Do phosphatase treatment to see if the western blotting signal depends on phospho-S97.

      1-2. Do competition experiments using the non-phospho peptide (i.e., the same polypeptides carrying a regular unmodified Ser at 97).

      1-3. Try western blotting using purified Rad51 proteins, one treated with AuroraA and another without the treatment.

      1-4. Do western blotting with cell extract from the cell line producing Rad51-S97A, S97D and compare with wild-type Rad51.

      1. P.10, line 4 The purity of the purified protein should be included (Rad51 and two other mutant proteins) by showing CBB-stained SDS-page gel.
      2. P. 10, line 7 (Fig1C). D-loop assay with Rad51 and its mutants. The actual data should be presented with the actual D-loop formation efficiency. Comparison with wild type value is not enough.
      3. Fig5AB There are lots of PSer97 signals that do not even overlap with DAPI (Fig5A) or Sc35 (Fig5B). How do authors explain that? Also, quantification needs to be done regarding colocalization between PSer97 and Sc35.
      4. Fig5D I do not know what to look for here. At least authors should employ proper negative controls such as siRad51 extract and WCE supplemented with PSer peptides.
      5. Fig6AB Quantification of the results needs to be presented. This reviewer is wondering if there is any explanation regarding the difference in the localization of overproduced HA-Rad51 between HeLa and HCC1806; HA-Rad51 goes into a nucleus in HeLa while it stays in the cytoplasm in HCC1806. Any explanation?

      Minor points:

      1. Please include line numbers.
      2. P.2, line 11 Could you cite the literature showing Rad51 is predominantly cytoplasmic?
      3. P.10, line 15 The authors are not measuring the polymerization rate here. The title is misleading.
      4. Fig2A What do NT and Si-Sc stand for? How come Pser97 signal is resistant to Si-RAD51?
      5. Fig2B P-Rad51/Rad51 ratio graph does not have error bars, making it difficult to assess its reproducibility.

      Referees cross-commenting

      I am pretty much in agreement with the comments/criticisms raised by the other two reviewers.

      Significance

      If Rad51 is indeed involved in RNA maturation, that will be a very novel and exciting discovery. The observations presented in this work, however, seem a bit too inconclusive to support the idea, at least, to this reviewer.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper by Alaouid et al. describes the role of Aurora A-mediated RAD51 phosphorylation in RNA metabolism in the sub-nuclear organelle such as Nuclear Speckles (NS). By the combination of biochemistry and cell biology methods, the authors showed that Aurora A phosphorylates Ser97 of human RAD51 in vitro. The antibody against the phosphor-Ser97 RAD51 seems to recognize NS. Moreover, RAD51 binds not only DNA, but also RNA in vitro. These suggest the role of RAD51 in RNA processing in NS. Moreover, the authors analyzed the biochemical properties of a phosphor-mimetic version of RAD51 (RAD51-S97D) and a phosphor-defective version (RAD51-S97A) and the effect of the over-expression of these mutants in NS dynamics in a cell. The idea of the link to RAD51 in RNA processing in a specific nuclear organelle sounds very interesting and opens a new area of DNA damage response (DDR). However, this paper did not show any functional evidence on the link between RAD51 phosphorylation and RNA processing in NS. Moreover, there are lots of technical issues in the results reported in the manuscript. In some experiments, the authors have to check the reproducibility and be careful about statistics. More importantly, the specificity of the anti-PSer97-RAD51 antibody raised in this study was not properly evaluated in vivo, which makes it hard to interpret the results using this antibody such as western blotting and immuno-fluorescent analyses.

      Major comments:

      1. Because of the poor description, it is hard to evaluate the content. This manuscript needs a more detailed description of the results in a scientifically valid way.
      2. Please describes the basic biochemical activities (ssDNA (Figure, dsDNA binding, and ATP hydrolysis activities) of mutant RAD51 proteins used in this study; RAD51-S97A and RAD51-S97D proteins. See also minor comments in this respect.
      3. It is essential to check the localization signal of PS97 signal in cells with RAD51 depletion by siRNA. Alternatively, the authors used chicken DT40 RAD51 cKO cells (Sonoda et al. EMBO J. 1998) to check the specificity of the antibody (RAD51 is conserved between humans and chicken, and the antibody seems to work in DT40).
      4. Data not shown: Please show the data in Supplemental Figures or deposit it in a public database.
      5. Please cite original references to cite the previous results.
      6. Please make a single composite Figure.

      Minor comments:

      1. Page 4, the second paragraph, line 1: Please add the reference number of Chabot et al. (4).
      2. Page 4, last sentence, DNA/RNA binding activity: For the binding of RAD51 in the presence of ATP, Mg2+ ion or divalent ion is essential. However, there is no description on how much concentration of the divalent ion was used in the assay.
      3. Page 10, the first paragraph, line 4: Please show a Coomassie gel of purified RAD51, RAD51-S97A, and RAD51-S97D proteins.
      4. Figure 1C, D-loop assay: Please show the gel of the products in this assay. It would be nice to show the kinetics of the reactions by these RAD51 mutant proteins. Or the effect of a different RAD51 concentration was tested.
      5. Page 10, the third paragraph, line 1: Please explain what is "BS3"; how this chemical stabilizes the oligomer and the references related to the drug.
      6. P values: Please describe the method to calculate the value in Figure legends.
      7. Figure 1D: Since this assay cannot quantitatively measure the oligomerization status of RAD51, the authors' claims are not convincing. Electron microscopic observation, which is the best, and/or ultra-centrifugation or gel filtration would be recommended to see the difference in the oligomeric status of the RAD51.
      8. Figure 2A: This result is not convincing. Although siRNA for RAD51 largely decreases the amount of RAD51 in cell lysates (bottom, ~80%), a modest decrease of the signal is seen for Phospho-Ser97-RAD51 (top, ~50%)). The authors need to explain this discrepancy. More importantly, this phosphorylation is mediated by Aurora A kinase. It is important to show the signals detected with this antibody decrease in the treatment of the Aurora A inhibitor or siRNA for Aurora A subunits. The inhibition experiments shown in Figure 4A are not convincing because the effect of the inhibitor is very small.
      9. Figure 2B: How did the authors determine each stage of G1, S, G2, and M phases (bottom right graph)? There are no markers of the S phase in western blots such as Cyclin A. Moreover, FACS analysis would be recommended.
      10. Figure 2B, graphs: Please add error bars of quantification of the bands by doing multiple experiments to support the authors' claim on the increase of RAD51 S97 phosphorylation from G1/S to G2/M transition.
      11. Figure 2C top, fractionation assay: Please include a western blot of RAD51 as a control like the ones in the middle.
      12. Figure 2D top: Please include images of RAD51 as a control.
      13. Page 12, first paragraph, line 2: Please show representative images of immunostaining of different cell lines in the Supplementary Figure and quantify the size of the foci. Do show all the data in either main Figures or Supplemental Figures without "data not shown".
      14. Figure 3A: Please explain gammaH2AX blot in either text or legend.
      15. Page 12, the second paragraph, line 8, data not shown: Please show the data in Supplemental Figure.
      16. Page 12, the third paragraph, line 3: The authors need to explain what is the gammaH2AX to readers.
      17. Figure 3B and C: Please check BRCA1/2 or RPA32 (or other DNA repair center markers) localization rather than gammaH2AX for the marker for DNA damage focus. As shown in these figures, gammaH2AX signals spread over the DSB sites, make it difficult to check the colocalization. Why number and intensity of gammaH2AX signals are so different between B and C? In Figure 3C, did the authors use non-treated cells?
      18. Figure 3B, western blots: The top panel is over-exposed.
      19. Page 12 last paragraph-page 13 first paragraph: The short summary is not necessary. These sentences should be moved to Discussion.
      20. Figure 4A: Please include any positive marker for the Aurora-A inhibition such as histone H3S10-phosphorylation.
      21. Figure 4B: Did Aurora A overexpression induce any cell cycle arrest? If it induces G2/M arrest, this increased phosphorylation is simply due to the arrest (in Figure 2B, the authors showed an increase of the phosphorylation in G2/M phase).
      22. Figure 4B pSer97-RAD51/RAD51 ratio: This reviewer is not convincing the quantification. What is the dynamic range of this western? Do they try different cell lysate volumes to adjust constant RAD51 signals to compare the pSer97-RAD51 signals?
      23. Page 13, third paragraph, lines 2-3 and Figure 4B left graph: Is this statistically significant? Please show what statistical method was used here (show it in Legend).
      24. Figure 5B, PlaB treatment: The Images show a decreased focus number of PSer97-RAD51. This is more obvious than the formation of larger foci. The authors need a more precise description of the result in the text.
      25. Figure 5C: Please show the position of the full-length of RAD51 protein by an arrow. The position of RAD51 and pS97 are different-pS97 signal migrates faster than the RAD51 (opposed to the result in Figure 1A).
      26. Figure 5D, IE: What is "NIP"?
      27. Figure 5D, IP: Where is a band of Sc-35? In IP fraction (bottom), there is little band corresponding with the band in lysates. Three thick band are not specific.
      28. Figure 5E, page 14 last sentence, "improved this experiment": Without the quantification, the authors do not conclude this.
      29. Figure 5 experiments: It is not clear why the improvement of Rad51-IP by RNA treatment could explain the role of pSer97-RAD51 points out the RAD51-binding to RNA. Rather the opposite interpretation would be possible. If pSer97-RAD51 is tightly bound to an RNA-containing nuclear structure, the authors may try chromatin fractionation with RNAase treatment.
      30. Figure 6: Please quantify the number and size of Nuclear speckles in different conditions.
      31. Figure label of B and D: "B" and "D" should be placed on the graph for RNA binding.
      32. Page 15-16, DNA/RNA binding assay: Please indicate the length of DNA/RNA in the text. Moreover, it is well established that ATP analogs modulate RAD51-binding to DNA. It is important for the authors to check the effect of ATP and ADP et on DNA/RNA.
      33. Page 16, the second paragraph: In this paragraph, the authors mentioned about "ds"DNA rather than ssDNA described above. Which is true?
      34. Supplemental Figure 1: Please explain what the purple circle means. Moreover, how this result shows the phosphorylation of Ser97. The two spectra look very different. Do they have any other phosphorylation?

      Referees cross-commenting

      I also agree with the other two reviewers. My concern is that we need a re-review of the revised version. I am not familiar with how the Review Common works. Hope that the journal will take care of the re-reviewing after the authors address our concerns on this paper

      Significance

      This paper may offer a new idea in the biology of nuclei by providing a possible link between proteins involved in homologous recombination such as RAD51 and RNA processing in subnuclear compartments, which is regulated by Aurora A-phosphorylation.

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. This paper might provide a possible link of RAD51 protein involved in homologous recombination with RNA processing in subcompartments in nuclei.
      • Place the work in the context of the existing literature (provide references, where appropriate). The concept on the role of RAD51 in nuclear RNA processing sounds interesting.
      • State what the audience might be interested in and influenced by the reported findings. The results in the paper are of interest to researchers who work on DNA damage response and DNA repair as well as RNA metabolism.
      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. I am a researcher on DNA damage response and DNA repair but is not working of RNA biology.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript reports the interaction between poorly characterized FAM104 proteins to ubiquitin-dependent segregase VCP. VCP functions in protein and organelle quality control as well as in extraction of ubiquitylated proteins from chromatin to regulate DNA repair, replication and transcription. In addition, VCP mutations are causative for several human neurological disorders. The authors demonstrate that FAM104 proteins promote the nuclear localization of VCP and that their loss causes impaired growth and hypersensitivity to chemical inhibition of VCP. They show that FAM104 proteins bind to VCP directly via a non-canonical helical motif and model the interaction with AlphaFold Miner, which allowed the identification of critical amino acids that mediate the interaction, which was then validated in vivo and in vitro.

      The conclusions are supported with well-designed experiments and data of high quality, manuscript is written in a clear and precise way.

      Minor points

      • P3: The authors write that mutations in VCP are causative for cancer. This should be rephrased.
      • P3: I would suggest to add a reference to the new study that also shows that VCP is also exploited by bacteria rand not only viruses (
      • Could the authors better illustrate the difference between FAM104A and B, and provide some explanations of why A seems to interact better with VCP compared to B. Is it just matter of higher expression of FAM104A in the cells where the interaction has been tested?
      • The authors should quantify the IF results in Figure 4 and include the quantification in the main figure
      • UBXN2B interaction with FAM104A was found in HT affinity-MS (Huttlin et al) and Y2H (Luck et al) studies. Can the authors validate this interaction of UBXN2B with FAM104 proteins? This would help to understand whether FAM104 interacts mainly with nuclear adaptors.

      Significance

      The results presented in this manuscript will be of interest to the borad field of protein quality control and lay the foundation to study the functions of FAM104 proteins in chromatin-associated degradation.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their manuscript ‚FAM104 proteins promote the nuclear localization of p97/VCP' Maria Körner, Susanne Meyer, and coauthors describe the identification of FAM104 proteins as cofactors of p97/VCP, a central factor in ubiquitin-mediated cellular proteolytic processes. Function as p97/VCP cofactors has not previously been clearly attributed to FAM104 proteins.

      The initial observation that FAM104 proteins interact directly with p97/VCP in yeast two-hybrid assays is confirmed by in vitro pulldown experiments with recombinantly purified proteins as well as in lysates of cultured mammalian cells. Using truncated proteins and structure predictions, the interaction interface of p97/VCP and FAM104 proteins is further narrowed down to single amino acids of a characteristic C-terminal alpha helix in FAM104. Overall, these interaction studies are technically sound and include meaningful control conditions to postulate FAM104 proteins as p97/VCP binders. Subsequent functional analyses using colocalization studies and cellular fractionation suggest that FAM104 proteins determine the nuclear/chromatin-associated fraction of p97/VCP. Based on this observation, the authors speculate that FAM104 proteins are of particular importance given the established nuclear/chromatin-associated processes involving p97/VCP activity. This hypothesis is supported by the observation that FAM104A knockout cells exhibit an impaired growth phenotype that is exacerbated in the presence of a p97/VCP inhibitor and in combination with a DNA damage trigger.

      Points of concern:

      1. The authors hypothesize that FAM104 proteins enhance the nuclear/chromatin-associated function of p97/VCP by sequestering it from the cytosol into nuclear/chromatin. In the corresponding experiments, overexpression of FAM104 species (Figures 4 and 5) in otherwise unperturbed cells is used. Because recruitment of p97/VCP to client proteins is thought to depend in large part on ubiquitylation, it is unclear how overexpression of FAM104 is sufficient to enhance nuclear/chromatin localization of VCP. Is nuclear/chromatin localization accompanied by changes in ubiquitylation and/or turnover of the corresponding proteins? In other words, does enhanced localization also correlate with increased activity, or could the enhanced nuclear/chromatin association also be explained by inhibited/captured p97/VCP?
      2. The authors link the function of FAM104 proteins in nuclear targeting of p97/VCP to the absence of a unique NLS peptide. Therefore, it would be interesting to determine whether the appearance of FAM104 proteins at the evolutionary level correlates with the strength/presence of NLS peptides in p97/VCP and/or its cofactors UFD1/NPL4/FAF1/UBXN3. Do FAM104 proteins compensate for the loss of NLS peptides in p97/cofactor complexes?
      3. Re 2) It remains unclear whether FAM104 proteins are responsible for the mere sequestration of p97/VCP in the nucleus or whether FAM104 proteins also contribute to process/client specificity in other ways. In this context, the authors could investigate a possible compensation of the reduced nuclear targeting of p97/VCP in FAM104 knock-out cells by fusion with an efficient cNLS peptide. Does this compensate for both nuclear/chromatin localization and growth/drug sensitivity?
      4. Re 3) How does overexpression of FAM104 alter drug sensitivity compared to knock-out cell lines (Figure 7)?
      5. Is there experimental evidence on how FAM104 proteins can bind p97/VCP to chromatin in this context and the proposed targeting of p97/VCP to the nucleus/chromatin? Does FAM104 mRNA/protein expression increase when p97/VCP-mediated processes are disrupted (e.g., in the presence of p97/VCP inhibition or DNA damage)? Are FAM104 protein levels stabilized under these conditions? Are FAM104 proteins differentially regulated (e.g., in terms of localization) under these conditions? Figure 3A suggests that FAM104 proteins may have a different function in relation to p97/VCP protein levels: FAM104A iso1/2 have lower p97/VCP protein levels than FAM104A iso5 and B iso3. The authors suggest that this is due to the solubility of p97/VCP. It should be clarified whether lower solubility equates to increased chromatin association.
      6. It remains unclear whether a FAM104-dependent shift in nuclear/chromatin-associated p97/VCP could also be a secondary compensatory effect versus functional impairment in FAM104 overexpression/depletion. The authors might include this in their discussion.

      Significance

      In summary, the author's conclusion that FAM104 proteins represent a previously underappreciated class of p97/VCP cofactors is well supported. Given the versatile and important role of p97/VCP and cellular protein homeostasis pathways, this finding is of interest to a broad audience. However, the functional role of FAM104 proteins in p97/VCP biology remains unclear. Therefore, the authors need to further elaborate the physiological contribution of FAM104 proteins to p97/VCP function in additional experiments. The suggestions are largely based on modifications of experiments already performed in this manuscript.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript submitted by Korner et al. presents data regarding the interaction of p97 with FAM104 protein family. This part of the manuscript is performed by Y2H as well as in-vitro and in-vivo pull-down assays. After molecular mapping and characterizations, the authors continue and address the role of FAM104-p97 interaction on nuclear localization of p97 as well cell health in respect to p97 dependent activity. Provide a short summary of the findings and key conclusions (including methodology and model system(s) where appropriate). Please place your comments about significance in section 2.

      Major comments:

      • Are the claims and the conclusions supported by the data or do they require additional experiments or analyses to support them?

      Claims concerning the mapping of FAM104 and p97 (Figures 1&2) are generally well concluded. Yet, minor issues concerning Fig2D (FAM104Aiso1 cdel26) as well as Figure 2E (p97-deltaN pull down) lack of interaction-are not supported by the presented data (both show weak interactions). Claims concerning nuclear/cytosol p97 distribution impact upon FAM104 manipulations (over-expression or KO) need to be further evaluated by additional methodologies. For example, the distribution impact using the FAM104 mutants in 4B should be evaluated by cell fractionation experiments (as performed in figure 5). Cell fractionation performed for FAM104A isoforms 1&2 should be performed on isoforms 5&3, the fact that they are expressed at lower levels has no impact, as the evaluation is on p97 and they were able to show in figure 3A an impact on p97 levels. Impact on distribution performed in Figure 6 using FAM104 KO cells should also include cell fractionation experiments in order to enable clear conclusion regarding FAM104 impact on p97 nuclear distribution.

      Also, statistics presented are somewhat problematic at several points. In figure S4C the ** difference between vector and deNLS mutant make no sense (I think they should have been non-significant). Figure 7 make no sense to compare WT and KO cells (in panels b&C) if their original growth was different. One should compare the differences in respect to the drug concentration in each cell type. Also, it may be useful for statistical purposes to evaluate cell numbers rather than growth% and this may enable to obtain better statistical significances. - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated time investment for substantial experiments.

      The suggested experiments are all in reasonable time frame. - Are the data and the methods presented in such a way that they can be reproduced?

      Y2H is not explained at all in methods, furthermore, it would be useful to present in a table the entire list of p97 interactome obtained in this screen. - Are the experiments adequately replicated and statistical analysis adequate?

      See comments above

      Minor comments:

      • Specific experimental issues that are easily addressable.
      • Are prior studies referenced appropriately?

      Previous reports regarding FAM104 interaction with p97 have been reported in two papers (PMID 32296183 sup. Table9 therein and PMID 32814053 S2 therein) this has not been stated at all. Furthermore, no data concerning previous knowledge of FAM104 is referred to in the introduction. - Are the text and figures clear and accurate?

      The text is written well and one can easily follow<br /> - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      See major suggestion above

      Referee cross-commenting

      It seems reviewer #2 concerns are also situated close to our comments regarding nuclear function of FAM104 on p97 function. Reviewers 3 comment regarding UBXN2B possible tertiary complex with p97 and FAM104 should be attempted as it would help put p97 function in a slightly more specific context

      Significance

      Provide contextual information to readers (editors and researchers) about the novelty of the study, its value for the field and the communities that might be interested.

      The following aspects are important:

      • General assessment: provide a summary of the strengths and limitations of the study. What are the strongest and most important aspects? What aspects of the study should be improved or could be developed? The authors deal with a specific interaction of p97 with FAM104 protein family. While this interaction has been previously reported, their mapping of domains required for interaction is new. Conclusions regarding the additional binding partners of the FAM104-p97 complex would require additional double affinity and mass-spectrometry identifications (as well as possible substrates identification).
      • Advance: compare the study to the closest related results in the literature or highlight results reported for the first time to your knowledge; does the study extend the knowledge in the field and in which way? Describe the nature of the advance and the resulting insights (for example: conceptual, technical, clinical, mechanistic, functional,...). The study advances the repertoire of p97 adaptors and interacting domains.
      • Audience: describe the type of audience ("specialized", "broad", "basic research", "translational/clinical", etc...) that will be interested or influenced by this research; how will this research be used by others; will it be of interest beyond the specific field? The suitable audience would be p97 basic researchers
      • Please define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. p97 role in protein quality control and cellular homeostasis.
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2023-01991

      Corresponding author(s): Chaitanya A. Athale

      1. General Statements [optional]

      *We are grateful to the editors sending our manuscript out to review, and the reviewers for the careful reading and critical comments. In the following sections we describe our plan for revisions that will address the comments of the reviewers. We have added these in a point-wise manner. In summary most of the comments are addressable with additional experiments, simulations and data analysis. These will indeed serve to strengthen the findings without altering the fundamental findings. However, we would require upto 90 days to make these changes. *

      Description of the planned revisions

      Reviewer #1 Evidence, reproducibility and clarity

      Summary: This work combines in-vitro experiments and numerical modeling to study the dynamics ofmicrotubules, driven by molecular motors. In this bottom-up approach, molecular motors areimmobilized on the surface and microtubule filaments are anchored to the surface from one end. The dynamics results in "beating like" motion of the anchored microtubules. The authors establish aphase diagram of the different dynamical patterns of "beating like" motions by varying the molecular motor density and the length of the microtubule anchored to the surface. They use a numerical framework that captures the observed patterns.

      Our response: We are grateful to the reviewer for the careful reading and agree with the summary of our work. In the following sections we detail how we plan to address the specific comments.

      Major comments:

      1. Overall the experiments and results are well described and claims are supported by the data.

      Both experimental and numerical methods are presented in a way that they can be reproduced.

      Our response: We are grateful for the reviewer’s assessment about our findings and presentation of the results.

      Minor comments:

      1. A key feature of beating cilia is the asymmetry of the beat pattern (fast stroke and slow recovery). It might be interesting to use the kymographs or the Phy vs time analysis to see whether or not this feature exists in this simplified experimental model.

      Our response: We agree with the reviewer that it could be of interest to examine whether the dynamics of the tip-angle phi (φ) shows a difference between the strokes at onset and return, to compare to the fast-slow asymmetry observed in cilia. This will be approached in two ways:

      1) We will obtain more data from more fields of view

      2) Use the time-derivative of the tip-angle, phi (φ) dynamics to examine whether the onset and return strokes are asymmetric and how this compares to ciliary dynamics.

      3) We will also analyze the tangent angle to the contour, psi (ψ) plots with time (y-axis) and MT length (x-axis).

      A qualitative analysis of a few time-series suggests indeed that the onset v/s return stroke of the ‘beating’ is likely to be asymmetric in the manner qualitatively distinct from cilia and flagella, that appear to be symmetric. This would suggest we avoid the term flagella-like to describe the dynamics.

      2. Also, the beating frequency is very low (mHz) compared to real cilia/flagella (~Hz). Would it be possible to use the model to predict which parameter would need to be tuned to reach more

      physiologically relevant beating frequencies?

      Our response: We agree that the oscillations we observe have a frequency thousand fold lower as compared toflagella and cilia and have highlighted this in our discussion. When we modified motor velocity and stall forces, we found only a marginal increase in frequency of oscillations by a factor of 2-5, but not 10-fold or more. We also attempted in simulations to mimic kinesin-like properties. However we do not see a dramatic improvement. This suggests an involvement of higher-order organization of the filament. Indeed we plan to perform simulations that test the following scenarios not already tested:

      1) the role of microtubule bundling factors resulting in 2-, 3-, and higher order complexes of MTs

      2) varying the bending rigidity of the microtubules within ranges of what may be experimentally feasible with differences reported for taxol and GMPCPP filaments

      3) altering the duty ratio of the motor

      These will be in the nature of “what if” simulations that could provide the basis of future experimental design to test such predictions. This comment is similar to one by the other reviewers.

      Significance

      This study is part of the field of in-vitro reconstitution, from a minimal set of components, that aims to reproduce a biological function to identify and understand the minimal physical/biophysical mechanisms underlying a function. This study might be of interest for the people who address questions of the self-organization of cytoskeletal elements in minimal systems.

      Our response: We agree with the assessment of the reviewer of the significance of the study and the readership that might be most interested in this work.

      *The main limitation of this study relies on the claim of reproducing a flagella-like motion. Indeed, the frequency of the described oscillations is in the mHz range while the frequency of cilia is in the range of few Hz to tens of Hz. This suggests that the mechanism at play in such a reconstituted system is not the one that drives beating in real cilia/flagella. Yet, this limitation also applies to other studies in the field (Vilfan et al. 1999, Guido et al. 2022 ...). *

      Our response: We agree with the reviewer that the 10^3 to 10^4 difference in oscillation frequency with that observed in cilia is striking. Indeed our claim was limited to the wavelike nature of the oscillation of the free end of a clamped microtubule driven by molecular motors producing a buckling instability, release and re-engagement of motors. Therefore it is evident we are missing many components in our minimal system as compared to cilia. However, we would like to emphasize for now that the beating is only qualitatively comparable to cilia and flagella. So far we have not compared the two waveforms. As a part of our revision plan, we aim to objectively describe the quantitativeaspects that could strengthen our claim of a similarity or lack thereof in wave-forms.

      Indeed this limitation is also observed in the work of Vilfan et al. (2019) and Guido et al. (2022). However, we believe with changes to the experimental setup and a robust and tractable model we have improved on these studies.

      References:

      Vilfan A, Subramani S, Bodenschatz E, Golestanian R, Guido I (2019). Flagella- like Beating of a Single Microtubule. Nano Lett 19(5), 3359–3363.

      Guido I, Vilfan A, Ishibashi K, Sakakibara H, Shiraga M, Bodenschatz E, Golestanian R, Oiwa K (2022). A synthetic minimal beating axoneme. Small e2107854.

      My second concern is that the added value with regards to state of art is not clearly explicit. I'm thinking about the work of the Isabelle Guido's team where they have more complex reconstituted systems (a pair of 2 microtubules); or the work of Pascal Martin's lab where the design of the system allows to capture more complex mechanisms such as myosin density waves, which result infrequency beat of 0.1Hz.

      Our response: We agree that the advances of our study can be highlighted. In the following points we highlight the value added to prior art:

      1. In previous work, MT bundles have been shown to produce synchronized base-to-tip oscillations in vitro driven by kinesin in presence of crowdants (Sanchez et al., 2011??). However, the study lacked control over MT length,, something we have addressed in our study.

      2. Cilia reconstitution with MT length and motor density control (Sasaki et al., 2018) are closer to control of the system but because of the complexity it is hard to distinguish what effect emerged from which componen.

      3. The generation of a bending wave driven by outer dynein arm (ODA) combined with pairs of MTs nucleated from Chlamydomonas axonemal fragments (Guido et al., 2022) was probably a close mimic of a minimal system; it not only lacked ??lacked variation in motor density and length but failed to show oscillations, with S-shaped buckling patterns observed.

      As a result it is reasonable to state that this work is a distinct improvement on previous work. In some senses it provides a consistency check on the previous results and at the other with a model and novel order-parameter an opportunity to improve our understanding.

      References:

      1. Vilfan A, Subramani S, Bodenschatz E, Golestanian R, Guido I (2019). Flagella-like Beating of a Single Microtubule. Nano Lett 19(5), 3359–3363.

      2. Sanchez T, Welch D, Nicastro D, Dogic Z (2011). Cilia-like beating of active microtubule bundles. Science 333(6041), 456–9.

      3. Sasaki R, Kabir AMR, Inoue D, Anan S, Kimura AP, Konagaya A, Sada K, Kakugo A (2018). Construction of artificial cilia from microtubules and kinesins through a well-designed bottom-up approach. Nanoscale 10(14), 63236332.

      4. Guido I, Vilfan A, Ishibashi K, Sakakibara H, Shiraga M, Bodenschatz E, Golestanian R, Oiwa K (2022). A synthetic minimal beating axoneme. Small e2107854.

      Reviewer #2 Evidence, reproducibility and clarity Summary:

      The authors use a modified version of conventional gliding assays to induce microtubule bending, buckling, looping and cyclic beating (which they term "flagella-like") via clamping the plus ends of gliding microtubules to the surface. They find that the pattern of motion depends on different factors such as microtubule length and motor density. They build a simple computational model that predicts transitions between microtubule motion patterns depending on these parameters.

      Our response: We agree with the assessment of the reviewer summarizing our work in terms of the approach taken and the inferences.

      Major comments:

      - Overall, the experimental data is extremely sparse. As far as I can see, there are only two replicas for the lower motor density. It is not clear to me how the authors define the boundaries in the

      experimental phase diagram in Fig. 7. To build a phase diagram - where one axis corresponds to the motor density - on just two experiments is not convincing. I would need to see more experiments covering a larger range of motor densities and at least three replica per condition.

      Our response: The comment refers to Fig. 7, whose purpose was to answer the question- can we test the phase diagram predicted in simulations by comparing to experiment? The answer was provided with representative data, in order to demonstrate that the model is qualitatively validated.

      The reviewer is asking for a systematic experimental test that rigorously demonstrates such a match between simulation and experiment. To this end, the phase diagram may not be the ideal form for such a test. We will attempt to examine the beating frequency and wave-transition in line with a comment by reviewer #3, as a measure of experiment-theory validation.

      We agree with the reviewer that our data could be enriched with replicates, with more densities of the motor. We will then analyze all the experimental data using common metrics to compare to simulations.

      - It is not clear to me why the proportion of pinned vs. free microtubule segments should affect the beating pattern. I would expect that the free microtubule segment does not "feel" the length of the clamped segment, if it is indeed fixed all along its length and unable to move / bend. The simulations use only two anchor points at the pinned tip. The segment in between the anchor points bends, which could affect how the free microtubule segment behaves. To support the claim that it is indeed the proportion of the lengths of the pinned vs. free segments and not simply the length of the free segment alone that influence the beating pattern, I would expect to

      (1) see the corresponding and thoroughly quantified experimental data that verifies this simulation-based prediction. Fig. 5C is based on only three microtubules and it is not clear how long the segments are.

      (2) the entire pinned segments in the simulation should be fixed. This should also be compared to experimental data, where the lengths of the free segments are the same and only the lengths of the pinned segments

      vary.

      Our response: Originally the intention of comparing pinned length changes was based on experimental design, in which we incubated biotinylated tubulin to obtain longer or shorter clamped plus-ends. The contrast between a point-pinning and a longer segment is based on beam bending and buckling theory, corresponding to the difference between a swivelling point of immobilization (pinning) and a clamped end (clamped). However, we agree with the reviewer that beyond the pining scenario, once a segment is pinned the only thing really driving the beating is the free length. To address the specific comment we aim to add simulation calculations that will include a fixed clamp and increasing free length demonstrating that the primary driver in changing dynamics (so long as a segment is clamped) is the free length.

      (1) To address the question of experimental comparison we will examine more data with increasing free segment lengths for the same density of motors and plot the dynamics, as well as characterize the oscillations with frequency estimation.

      (2) This relates to the earlier part of this comment and we aim to re-run the filament clamped segment simulations to make it consistent with expectation and theory from related papers in the field, with only the free-segment length varied.

      - In relation to my previous comments: I would expect a direct comparison between the simulation-based prediction that the beating pattern changes with microtubule length and motor density in a quantitative manner, where all pinned microtubules observed experimentally are analyzed. The figures are often based on single observations.

      Our response: The experimental phase diagram had representative beating MTs, as compared to simulations. We agree that showing more statistics on these patterns could help. We aim to perform more experiments and analyze more data, which will be systematically plotted to make statistically relevant inferences of patterns as a function of density and length of MT.

      - The authors report that the pinned microtubules typically undergo 2-3 cycles of beating. This

      number is very low, and I am hesitant to call it "flagella-like" cyclic beating. Is this due to the dynein motors being much slower than e.g. kinesis? To confirm this and support the generality claimed by the authors, I would like to see experiments with a different, faster motor. If other motors are not readily available to the authors, this would imply a substantial amount of time and effort though.

      Our response: The slower velocity of yeast cytoplasmic dynein is indeed one the contributing factors for the slow oscillations seen. In preliminary experiments with kinesin we indeed see a faster oscillation, but still in the 10 mHz range. These experiments will be added to the revised manuscript.

      - Please perform statistical analysis of the experimental data.

      Our response: Most of the data, while statistical, is not being compared for means (e.g. simulation v/s experiment). However we will analyze the frequency as a function of length and density and examine differences based on standard statistical tests.

      Minor comments:

      - Number of replicates and samples should be indicated in the figures.

      Our response: With additional analyzed data and new experiments we will have more datasets and

      Significance

      - The approach to clamp the plus ends of gliding microtubules in order to induce buckling, bending and beating is elegant and should be easily transferable to other groups who may be interested in this method, since it is straightforward to adapt conventional gliding assays to induce pinning.

      Our response: We agree with this assessment of the reviewer.

      - The study could potentially be interesting to an audience studying flagella-like systems. Since the system is simple and based on in vitro components with defined parameters, it could serve as a basis for studying more complex systems or testing the influence of particular proteins associated with flagella. However, I do not see a major advance regarding our understanding of flagella or similar structures based on the manuscript. In combination with the model, I see it majorly as a useful tool, providing methodological advance. It would be desirably to make the computational model available to the public.

      Our response: We agree that this system of minimal in vitro components could in future be made more complex in a step-wise manner. Once the manuscript is accepted after review, we have intended to make the code available in OpenSource. The source code of Cytosim already is OpenSource and can be downloaded here: https://gitlab.com/f-nedelec/cytosim.

      - The computational model seems useful and straightforward to me, yet my background is purely experimental and I cannot judge the model in detail.

      Our response: The computational model is indeed straightforward, and is based on a set of C++ codes that are OpenSource and those with a computational training have tested it in multiple studies both by us and other labs.

      - In my view, the most important limitation of the manuscript is its lack of thorough experimental data to support the claims made by the authors. In its current state, the manuscript seems rather preliminary and I see the need for significant additional experimental evidence.

      Our response: We plan to take the reviewers criticism on board and perform new experiments, analyses and simulations to address this gap of additional experiments. These experiments we believe will go to strengthen the manuscript, but not fundamentally alter the result.

      Reviewer #3 *(Evidence, reproducibility and clarity (Required)): *

      -Summary:

      *This manuscript reports experimental in vitro gliding assays demonstrating bending oscillations when single microtubules are anchored at their plus end and compressed beyond a buckling threshold by dynein molecular motors immobilized on a solid substrate. Together with numerical simulations based on the well-established Cytosim software, the authors identify three main classes of motile behavior under the control of microtubule length and motor density: aperiodic fluctuations (flapping), periodic beating with bending traveling waves over at least part of the filament length, and looping behaviors where the microtubule can curl on itself near its free tip. The authors claim that these movements are reminiscent of the beating movements of eukaryotic cilia and flagella and may provide useful information of the mechanism underlying the oscillatory instability. *

      Our response: We are grateful to the reviewer for a careful reading and have in the following sections outlined our plan for revision in response to the specific comments.

      *- Major comments: *

      1. The observed oscillations show only a few cycles (up to only 4, but often 2-3 (Fig. 1-2)) and are in addition very noisy. Oscillations thus appear to happen only transiently, i.e. do not show a dynamic steady state on timescale much larger than the oscillation period. Demonstrating the emergence of true (and stable) regular oscillations thus remains a challenge, in contrast to the authors' claim. The large variability of behavior from filament to filament (as seen in SV3), as well as in a single filament over time, also makes it difficult to achieve a robust quantitative description of these movements (see below).

      Our response: We have observed at times 4 and at times more cycles but we believe this is limited by the fact that mechanical pulling on the streptavidin-biotin linkage could result in occasional detachment of the filament from the surface. Stable oscillations of the form that the reviewer is pointing to may not emerge due to practical challenges and may require an alternative experiment such as optical tweezer to clamp the filament for a longer period. This is currently beyond the scope of the study, but could be attempted in future.

      Regarding the variability, we are aiming to analyze more data that has already been recorded and is also being acquired. These additional datapoints will allow for more representative statistics. The variability should tell us more about the nature of the system. We will estimate frequency of oscillations as a parameter for comparison along with our order parameter (span). This is similar to the comment by reviewer #2.

      2. Overall, the amount of experimental control seems relatively limited, for there is systematic variation of microtubule length (free or pinned) and only two motor densities have been explored.

      Our response: We will address this shortcoming by performing more experiments, with a few more motor densities of intermediate value. This will be supplemented with additional data analysis.

      • *One wonders why the motor density has not been more extensively varied and what determines the range of densities that can be achieved. What happens if the density gets larger than 50/µm^2? Do the filaments fail to remain anchored? Is buckling still permitted at high motor density? *

      Our response: The range of densities are obtained after the experiment, since this is not a patterning system. At times the density is either too low, and the filaments do not beat, or too high and they detach. This results in only two reported densities, less than perhaps desirable as pointed out by the reviewer. Now that we know what densities work, we aim for a fine-grained scan in the same range expected to produce regular oscillations.

      We will titrate the motors to obtain intermediate densities in the range that we have already found to result in stable oscillations with between 4-5 periods and hope to address this question.

      • *Important fundamental issues remain here unfortunately untouched in experiments and are also only qualitatively discussed in simulations (bottom of p11 and Fig. 4), namely the dependence of the frequency and wavelength of wave-like beating as a function of motor density and microtubule length. These limitations result from a lack of control over the microtubule lengths and that only two motor densities have been tested. Using the natural variability in length of the anchored filaments may be potentially used to study length effects but then a relatively large amount of data will be required to reliably conclude that filaments ensembles of different mean lengths reliably show different behaviors. Similarly, I do not see where in the data one can see that increasing motor density actually controls the oscillation frequency, as concluded from simulation data (but not analyzed quantitatively). *

      Our response: We plan to systematically analyze the frequency, which we have already demonstrated we can measure. The dependence on MT length and density will be tested and added as additional data. We will perform experiments with more motor densities to address that aspect too. We will also run additional simulations and compare outputs. This will help to address the comment and is in line with suggestions by the other reviewers too.

      3. The authors repeatedly claim that the movements they observe are "flagella-like". However, the comparison remains vague as there is no quantitative assessment of the similarity or dissimilarity between the movements observed here and biological beating of flagella or cilia (e.g. using data in Riedel-Kruse et al HSFP Journal 2007. DOI:10.2976/1.2773861 as a reference).

      Our response: We have compared frequency of oscillations from previous literature but find them to be extremely disparate – by a factor of 1,000. We will use the suggested references to find geometric properties that could test our claim of flagella-like in terms either of waveforms, symmetry of beating or the dynamics or tip-behavior.

      • *What does it mean to resemble flagellar beating? It would be desirable to be more explicit/quantitative and not be ashamed to point to differences (could be event more instructive) as well as to similarities. Note that oscillations of the tangent angle in flagella of the bull sperm are nearly sinusoidal, and are thus smooth, with no snaps (Riedel-Kruse et al HSFP 2007), thus challenging the claimed resemblance between bending oscillations in this work and the flagellar beat. *

      Our response: This is similar to the previous point. We agree that a quantitative comparison between the dynamics we observe of single filaments and of bonafide flagella, could strengthen the findings of this manuscript. We will use multiple metrics such as the tangent angle-with time of the free end, and the average angle along the flagella (as reported by Riedel-Kruse et al.) to make a more concrete comparison.

      • *In my opinion, the authors should tone done the resemblance of their system with cilia and flagella and be much more quantitative about the detailed features of the observed movements in their in-vitro assay. *

      Our response: We will take the reviewers comment on board and discuss the work in the absence of the flagellar connection since indeed there is no direct link so far- our comparison with flagella-like systems will be moved to the discussion section with a qualitative comparison of waveforms as this reviewer and others have suggested.

      • *In the present gliding assay, motors produce compressive tangential forces on the microtubule, which can result in buckling and thus in an elastic load applied by the filament to the motor with a component perpendicular to the filament. Instead, flagellar motors produce force dipoles that result in neighboring-filament sliding which is then converted in bending of the filament bundle as a result of elastic constrains. Symmetries of the problem thus seem very different. It is also worth noting that many (but not all) models of the flagellar beat actually assume a constant inter-filament distance so that there is no effective normal force acting on the motors to detach them, yet faithfully reproduce beating waveforms (e.g. Camalet and Jülicher New J of Phys (2000) DOI: 10.1088/1367-2630/2/1/324; Riedel-Kruse et al HSFP J (2010)). More generally, whether the present study provides any useful information to inform our current understanding of the flagellar beat is not clear to me and the authors' claim that it may be the case is not motivated enough. Accordingly, the statement (P19) "qualitative transitions (...) expected from not just the minimal but even the potentially complex flagellum" is not justified. *

      Our response: This distinction will be more elaborately discussed in the revised discussions section and similar to the previous point, we will avoid reference to flagella-like behavior.

      4. I could not find a detailed statistical account of the total number of filaments that was used for the paper, how many fell in the four classes of movement (swiveling, fluctuations, beating, and looping) identified by the authors, and whether the population in each class could actually be controlled experimentally, e.g. by varying motor density or microtubule length. This gave the unfortunate impression that the conclusions were based on cherry picking, which is troublesome considering the large variability in behavior between filaments and the ambition of the authors to provide a state diagram of the dynamics (Fig. 7). To reach clear conclusions, one parameter must be changed while the others remain fixed. For instance, to discuss the effects of the pinned length, one would like to fix the total microtubule length (but then the free length varies) or vary the pinned length with constant free length (thus changing the total microtubule length). I understand that this might be difficult (in experiments), but the authors should then acknowledge these limitations and mitigate their conclusions. In principle, if the yield of the experiment (number of anchored filaments per slide) were sufficient, one could to address these issues by classifying the filaments in ensembles of a given properties (e.g. same total length by variable pinned length). To reach this goal, there is a need to obtain a sufficiently large quantity of data. The reader gets an estimate on the order of 10 usable filaments per slide (video SV3 and inset in Fig. 2D), with only a few replicates (4 experiments at 46/µm^2 and 2 experiments at 27/µm^2). The authors talk about "representative filaments" throughout the text but there is no detail about the ensemble of filaments that show a given behavior and the number of filaments that are used to reach a given conclusion is not given. Length distributions for the free and pined ends of the microtubules, for the maximal amplitude of tangent-angle oscillations, and other measures that characterize the microtubule movements (curvature, wave speed) ought to be given, provided that enough data has been collected to compute reliable ensemble averages.

      Our response: For now we have only considered the average behavior with the dynamics observed from multiple fields of view, combined in terms of MT lengths and motor densities. Since Fig. 7 was meant to be representative and therefore a qualitative comparison with simulation predictions, replicates were not added. However, in response to reviewer’s question, we will analyze more data and add it in the supplementary material, in order to support the statistical validity of our claims- that are not based on purely selective evidence.

      *5. The effect of motor density on beating properties, in particular frequency, is discussed in simulations but not clearly demonstrated in experiments. One cannot conclude that experiments confirm the prediction of the theory in this respect. *

      Our response: Currently we have used a novel metric for the type of oscillation and pattern, the span-parameter (S). However, this was meant to capture large qualitative changes observed in experiment and simulation in terms of patterns.

      In response to this comment, we will also analyze the dominant frequency of filaments using FFT on the tip-angles from multiple conditions of MT length and motor density. The scaling of frequency with length and motor density will be compared to simulation predictions. The comparison will then allow an additional quantitative comparison between experiment and simulation.

      *- Minor comments: *

      6. More extensive quantitative analysis of the waveform of oscillation (noisy sinusoid vs. sawtooth or relaxation oscillations?) and bending wave propagation (speed and curvature vs position along the filament) is needed. In particular, it is claimed that the filaments "snap" and thus evince a "recovery stroke" (e.g. p7). I agree that snaps are evident in some of the videos, and are expected at low motor density. However, I would expect the movements to get smoother at higher motor density, as shown in simulations (looping regime). In any case, one could use the analysis of the tip or (better) tangent angle as a function of time to assess whether 'snaps' indeed occur; due to noise, snapping behavior is not so clear in the data provided in Fig. 1D-E.

      Our response: We agree with the reviewer that “snap-back” movement arising from potentially low motor density scenarios changes when the motor density is increased to a more smooth motion. We have observed this, and will characterize it quantitatively to make this point more clear. The tip-dynamics will be analyzed for velocity and symmetry to make this point more apparent.

      *7. Because the tips of the microtubules are "sticky" due to their biotinylated tips, I wonder whether the histogram of gliding velocity of the microtubules that are not anchored is modified, i.e shifted toward lower velocities, as compared to that of bare gliding microtubules. This is assuming that a majority of the microtubules are equipped with biotinylated "heads"; this information ought to be provided in the Methods if, as the author claim, the biotinylated tips can be visualized. Analysis of gliding velocities (e.g. in video SV3) could potentially reveal the enhanced interaction between the microtubules and the surface. *

      Our response: We will analyze the instantaneous gliding velocity and test the hypothesis that some filaments may be transiently immobile, while others may move unhindered at typical gliding assay velocities (50 to 80 nm/s).

      *8. Demonstrating that the anchoring strategy has actually improved the chance to anchor a microtubule, as compared to random anchoring to surface defects that occur naturally in gliding assays, would be welcome. *

      Our response: We will analyze the frequency histogram of gliding assay velocities and compare them to the filament-oscillation scenario with biotinylated filaments. We expect to see a zero-velocity mode in the clamped filament scenario and only transient (and therefore less frequent) pinned or clamped filaments. This is already our qualitative observation but we will seek to quantify it.

      *9. The simulations should be analyzed more quantitatively and extensively to study how motor density and microtubule length affect the wavelength and frequency of oscillations in the wave-like beating regime, going beyond what can be achieved experimentally. In particular, one could compute the speed of the bending waves, asses how it varies during wave progression from base to tip of the microtubule, describe the increase in the magnitude of tangent-angle and curvature oscillation as a function of curvilinear abscissa. *

      Our response: We have now analyzed the frequency and amplitude of filament oscillations in simulations. This will in the next step be used to look for trends as a function of MT length and motor density. We hope indeed to look beyond what experimentally achievable ranges might be, including measuring the propagation of the bending wave along the contour as suggested by the reviewer.

      *- Suggestions to help improving the presentation: *

      *1. First section of the results (p5-7): this section is full of methological details that get in the way of the description of the actual result (Fig. 1). I would suggest moving these details (e.g. there is not need here to explain how the motors are attached to the substrate, which you use cytoplasmic yeast dynein, and other details). *

      Our response: We will rewrite the manuscript to improve the clarity and move the methods to the section dedicated to the methodology.

      *2. The top of P9 could also be moved to Discussion section. *

      Our response: We will move the page 9 text referred to into the discussion.

      *3. P12-13: I also find that the Results section mixes results with discussion, which is not very effective. I would again move elements of discussion (here associated with bending energies) to the Discussion section and focus on results only. *

      Our response: We have done so due perhaps to a requirement from an earlier round of reviews. However, we will be happy to separate results from discussion- for example the reference to bending energies.

      *4. Throughout the result section: Move any comparison to actual flagellar dynamics to a dedicated section in Discussion. *

      Our response: The flagellar discussion will be moved out of the results section entirely unless we invoke the analysis of bonafide flagella.

      5. P12: doesn't the increase of the clamped length reduce the length of the free length, moving in the state diagram toward regions of shorter filaments. One wonders whether the clamped length really matter as long as the filament is clamped near the plus end. I would naively expect that it is the free-filament length that maters rather than the total length or the faction of the filament that is clamped.

      Our response: We agree with the reviewer with one caveat- filaments pinned at one end (point pinning) are distinct from those with long segments clamped. However, the reviewer is correct in pointing out cases where filaments have a substantial clamp, the free length is more important. We will revise our figures and results section to clarify this.

      *6. Figure 4: this figure shows very interesting simulation data that, in my opinion could be much more extensively studied. In particular, one could plot the oscillation frequency, the bending-wave speed, and wavelength as a function of the filament length and the motor density. Also, to characterize the beating waveform more in detail, it would be worth computing how the magnitude of tangent-angle oscillation increases with the curvilinear abscissa for representative examples of waveforms in the three regimes (see again in Riedel-Kruse et al HSFP 2007. DOI: 10.2976/1.2773861 or Pochitaloff et al Nat Phys 2022 DOI: 10.1038/s41567-022-01688-8). *

      Our response: We have performed fourier series analysis to obtain dominant frequencies. This will indeed be applied to the simulations in Fig. 4 in order to examine the rich dynamics, as well as provide a point of quantitative comparison to exepriments.

      *7. Figure 5: the way to display the data in A-B (simulations) and C-D (experiments) does not allow for an easy comparison between simulations and experiments. I would use beating patterns and kymographs of the tangent angle for both. *

      Our response: We are in the process of revising Fig. 5 in order to examine the effect of free MT length on oscillations and will put experimental and simulation analyses that match each other in the nature of the analysis. The analysis itself will be elaborated to include aspects such as average tangent angle as a function of arc-length (Riedel-Kruse et al., 2007) along with frequency.

      *8. Figure 6: the way to present the experimental beating patterns is no so clear (thick colored lines). I would recommend showing black lines resulting from automatic tracking of the microtubule. *

      Our response: The data in Fig. 6 is raw data projected in order to provide a picture within the limits of magnification. In order to address this comment we will project the tracked contour of the filament and that will result in a finer and better resolved image.

      *9. Legend of Fig. S1: the panels (D) and (E) of the figure are not called properly. *

      Our response: We will rectify the issue of sub-figure callouts.

      *10. Fig. S3: use the same scale in the different panels of (a) and (b) to allow for an easier comparison. It would also be nice to show videos of the simulated motion. *

      Our response: The current differences were in order for visual clarity and the modified axis values are mentioned. We will revise Fig. S3 simulation outputs where filaments are projected on the same axis for consistency.

      *11. Fig. S4: Hard to read, in particular the motors are not visible. Would be better to have the patterns in black on a white background. The panels look like screen shots. *

      Our response: The unbound motors have been deliberately made invisible for clarity. We can provide a figure update with the motors made visible again.

      *12. Fig.S5: indicate in the title that this figure deals with results of simulations. The legend refers to color bars but the figure is in grey scale. *

      Our response: Colorbar is indeed in grayscale. The legend entry will be modified to read “grayscale bar”.

      *Reviewer #3 (Significance (Required)): *

      *The motile phenomena reported here are qualitatively already well known in the field. Indeed, anyone who has performed a gliding assay, with microtubules or actin filaments has probably seen undulating or spiraling filaments accidentally anchored on surface defects. Accordingly, the topic has already been somewhat adressed in previous publications (e.g. Bourdieu et al Phys Rev Lett 1995; Sekimoto et al Phys rev Lett 1995; Vilfan et al Nanoletter 2019). As a matter of fact, microtubules anchored on defects in standard gliding assay can show oscillations very similar to those shown here. However, the lack of control over filament anchoring has precluded a systematic experimental study of the oscillatory filament dynamics. It is worth noting that ther bottom-up approaches have used filament bundles instead of single filaments, either with microtubules and kinesin motors (Sanchez et al Science 2011) or actin filaments and myosin motors (Pochitaloff et al Nature Phys 2022). These assays evince more regular oscillations (over tens of cycles) and waveforms that more closely resemble those of eukaryotic flagella than reported here. *

      Our response: We agree with this summary of our work, and will highlight the possible reasons why it differs from the work of Pochitaloff et al.

      *Here, the authors have developed an experimental strategy to increase the chance of anchoring single filaments' plus end to the substrate, potentially allowing for more control of the experimental conditions that lead to the emergence of oscillations (but see my criticisms above). Anchoring is made more likely, because short segments of biotinylated tubulin are added to the end of bare microtubules to make them stick to the substrate, which has been functionalized with streptavidin. A similar protocol had been reported before in the literature to study buckling of single microtubules by single kinesin motors (Gittes et al Biophys J 1996), but is here used at larger motor densities on the substrate. There is unfortunately no quantification of the success of the approach. *

      Our response: We propose to perform more experiments and analyze the data more quantitatively using multiple measures described in the literature and cited by this reviewer. We believe these changes will adequately address the concerns.

      *The comparison of the experimental data to Cytosim simulations is, to my knowledge, novel and a clear asset of the work, although this comparison could be more effective, as detailed above. *

      Our response: We will add a more complete quantitative comparison to supplement the already provided qualitative comparison to address the comments.

      *The emergence of periodic wave-like beating oscillations in motor-filament systems is a classical problem in biophysics. This problem is particularly relevant in the context of eukaryotic cilia and flagellar beating in biology. The audience for the present work is thus potentially broad, although the simplistic and artificial nature of the in-vitro system, with only one microtubule, will probably appeal more to biophysicists and theoretical physicists than biologists. *

      Our response: We appreciate the effort of this reviewer to evaluate our work. We however believe that the relevance of this work could extend beyond purely biophysics and theoretical physics as claimed by the reviewer.

      Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      -

      Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      -

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This manuscript reports experimental in vitro gliding assays demonstrating bending oscillations when single microtubules are anchored at their plus end and compressed beyond a buckling threshold by dynein molecular motors immobilized on a solid substrate. Together with numerical simulations based on the well-established Cytosim software, the authors identify three main classes of motile behavior under the control of microtubule length and motor density: aperiodic fluctuations (flapping), periodic beating with bending traveling waves over at least part of the filament length, and looping behaviors where the microtubule can curl on itself near its free tip. The authors claim that these movements are reminiscent of the beating movements of eukaryotic cilia and flagella and may provide useful information of the mechanism underlying the oscillatory instability.

      Major comments:

      1. The observed oscillations show only a few cycles (up to only 4, but often 2-3 (Fig. 1-2)) and are in addition very noisy. Oscillations thus appear to happen only transiently, i.e. do not show a dynamic steady state on timescale much larger than the oscillation period. Demonstrating the emergence of true (and stable) regular oscillations thus remains a challenge, in contrast to the authors' claim. The large variability of behavior from filament to filament (as seen in SV3), as well as in a single filament over time, also makes it difficult to achieve a robust quantitative description of these movements (see below).

      2. Overall, the amount of experimental control seems relatively limited, for there is systematic variation of microtubule length (free or pinned) and only two motor densities have been explored.

      a) One wonders why the motor density has not been more extensively varied and what determines the range of densities that can be achieved. What happens if the density gets larger than 50/µm^2? Do the filaments fail to remain anchored? Is buckling still permitted at high motor density?

      b) Important fundamental issues remain here unfortunately untouched in experiments and are also only qualitatively discussed in simulations (bottom of p11 and Fig. 4), namely the dependence of the frequency and wavelength of wave-like beating as a function of motor density and microtubule length. These limitations result from a lack of control over the microtubule lengths and that only two motor densities have been tested. Using the natural variability in length of the anchored filaments may be potentially used to study length effects but then a relatively large amount of data will be required to reliably conclude that filaments ensembles of different mean lengths reliably show different behaviors. Similarly, I do not see where in the data one can see that increasing motor density actually controls the oscillation frequency, as concluded from simulation data (but not analyzed quantitatively).

      1. The authors repeatedly claim that the movements they observe are "flagella-like". However, the comparison remains vague as there is no quantitative assessment of the similarity or dissimilarity between the movements observed here and biological beating of flagella or cilia (e.g. using data in Riedel-Kruse et al HSFP Journal 2007. DOI: 10.2976/1.2773861 as a reference).

      a) What does it mean to resemble flagellar beating? It would be desirable to be more explicit/quantitative and not be ashamed to point to differences (could be event more instructive) as well as to similarities. Note that oscillations of the tangent angle in flagella of the bull sperm are nearly sinusoidal, and are thus smooth, with no snaps (Riedel-Kruse et al HSFP 2007), thus challenging the claimed resemblance between bending oscillations in this work and the flagellar beat.

      b) In my opinion, the authors should tone done the resemblance of their system with cilia and flagella and be much more quantitative about the detailed features of the observed movements in their in-vitro assay.

      c) In the present gliding assay, motors produce compressive tangential forces on the microtubule, which can result in buckling and thus in an elastic load applied by the filament to the motor with a component perpendicular to the filament. Instead, flagellar motors produce force dipoles that result in neighboring-filament sliding which is then converted in bending of the filament bundle as a result of elastic constrains. Symmetries of the problem thus seem very different. It is also worth noting that many (but not all) models of the flagellar beat actually assume a constant inter-filament distance so that there is no effective normal force acting on the motors to detach them, yet faithfully reproduce beating waveforms (e.g. Camalet and Jülicher New J of Phys (2000) DOI: 10.1088/1367-2630/2/1/324; Riedel-Kruse et al HSFP J (2010)). More generally, whether the present study provides any useful information to inform our current understanding of the flagellar beat is not clear to me and the authors' claim that it may be the case is not motivated enough. Accordingly, the statement (P19) "qualitative transitions (...) expected from not just the minimal but even the potentially complex flagellum" is not justified.

      1. I could not find a detailed statistical account of the total number of filaments that was used for the paper, how many fell in the four classes of movement (swiveling, fluctuations, beating, and looping) identified by the authors, and whether the population in each class could actually be controlled experimentally, e.g. by varying motor density or microtubule length. This gave the unfortunate impression that the conclusions were based on cherry picking, which is troublesome considering the large variability in behavior between filaments and the ambition of the authors to provide a state diagram of the dynamics (Fig. 7). To reach clear conclusions, one parameter must be changed while the others remain fixed. For instance, to discuss the effects of the pinned length, one would like to fix the total microtubule length (but then the free length varies) or vary the pinned length with constant free length (thus changing the total microtubule length). I understand that this might be difficult (in experiments), but the authors should then acknowledge these limitations and mitigate their conclusions. In principle, if the yield of the experiment (number of anchored filaments per slide) were sufficient, one could to address these issues by classifying the filaments in ensembles of a given properties (e.g. same total length by variable pinned length). To reach this goal, there is a need to obtain a sufficiently large quantity of data. The reader gets an estimate on the order of 10 usable filaments per slide (video SV3 and inset in Fig. 2D), with only a few replicates (4 experiments at 46/µm^2 and 2 experiments at 27/µm^2). The authors talk about "representative filaments" throughout the text but there is no detail about the ensemble of filaments that show a given behavior and the number of filaments that are used to reach a given conclusion is not given. Length distributions for the free and pined ends of the microtubules, for the maximal amplitude of tangent-angle oscillations, and other measures that characterize the microtubule movements (curvature, wave speed) ought to be given, provided that enough data has been collected to compute reliable ensemble averages.

      2. The effect of motor density on beating properties, in particular frequency, is discussed in simulations but not clearly demonstrated in experiments. One cannot conclude that experiments confirm the prediction of the theory in this respect.

      Minor comments:

      1. More extensive quantitative analysis of the waveform of oscillation (noisy sinusoid vs. sawtooth or relaxation oscillations?) and bending wave propagation (speed and curvature vs position along the filament) is needed. In particular, it is claimed that the filaments "snap" and thus evince a "recovery stroke" (e.g. p7). I agree that snaps are evident in some of the videos, and are expected at low motor density. However, I would expect the movements to get smoother at higher motor density, as shown in simulations (looping regime). In any case, one could use the analysis of the tip or (better) tangent angle as a function of time to assess whether 'snaps' indeed occur; due to noise, snapping behavior is not so clear in the data provided in Fig. 1D-E.

      2. Because the tips of the microtubules are "sticky" due to their biotinylated tips, I wonder whether the histogram of gliding velocity of the microtubules that are not anchored is modified, i.e shifted toward lower velocities, as compared to that of bare gliding microtubules. This is assuming that a majority of the microtubules are equipped with biotinylated "heads"; this information ought to be provided in the Methods if, as the author claim, the biotinylated tips can be visualized. Analysis of gliding velocities (e.g. in video SV3) could potentially reveal the enhanced interaction between the microtubules and the surface.

      3. Demonstrating that the anchoring strategy has actually improved the chance to anchor a microtubule, as compared to random anchoring to surface defects that occur naturally in gliding assays, would be welcome.

      4. The simulations should be analyzed more quantitatively and extensively to study how motor density and microtubule length affect the wavelength and frequency of oscillations in the wave-like beating regime, going beyond what can be achieved experimentally. In particular, one could compute the speed of the bending waves, asses how it varies during wave progression from base to tip of the microtubule, describe the increase in the magnitude of tangent-angle and curvature oscillation as a function of curvilinear abscissa.

      Suggestions to help improving the presentation:

      1. First section of the results (p5-7): this section is full of methological details that get in the way of the description of the actual result (Fig. 1). I would suggest moving these details (e.g. there is not need here to explain how the motors are attached to the substrate, which you use cytoplasmic yeast dynein, and other details).

      2. The top of P9 could also be moved to Discussion section.

      3. P12-13: I also find that the Results section mixes results with discussion, which is not very effective. I would again move elements of discussion (here associated with bending energies) to the Discussion section and focus on results only.

      4. Throughout the result section: Move any comparison to actual flagellar dynamics to a dedicated section in Discussion.

      5. P12: doesn't the increase of the clamped length reduce the length of the free length, moving in the state diagram toward regions of shorter filaments. One wonders whether the clamped length really matter as long as the filament is clamped near the plus end. I would naively expect that it is the free-filament length that maters rather than the total length or the faction of the filament that is clamped.

      6. Figure 4: this figure shows very interesting simulation data that, in my opinion could be much more extensively studied. In particular, one could plot the oscillation frequency, the bending-wave speed, and wavelength as a function of the filament length and the motor density. Also, to characterize the beating waveform more in detail, it would be worth computing how the magnitude of tangent-angle oscillation increases with the curvilinear abscissa for representative examples of waveforms in the three regimes (see again in Riedel-Kruse et al HSFP 2007. DOI: 10.2976/1.2773861 or Pochitaloff et al Nat Phys 2022 DOI: 10.1038/s41567-022-01688-8).

      7. Figure 5: the way to display the data in A-B (simulations) and C-D (experiments) does not allow for an easy comparison between simulations and experiments. I would use beating patterns and kymographs of the tangent angle for both.

      8. Figure 6: the way to present the experimental beating patterns is no so clear (thick colored lines). I would recommend showing black lines resulting from automatic tracking of the microtubule.

      9. Legend of Fig. S1: the panels (D) and € of the figure are not called properly.

      10. Fig. S3: use the same scale in the different panels of (a) and (b) to allow for an easier comparison. It would also be nice to show videos of the simulated motion.

      11. Fig. S4: Hard to read, in particular the motors are not visible. Would be better to have the patterns in black on a white background. The panels look like screen shots.

      12. Fig.S5: indicate in the title that this figure deals with results of simulations. The legend refers to color bars but the figure is in grey scale.

      Significance

      • The motile phenomena reported here are qualitatively already well known in the field. Indeed, anyone who has performed a gliding assay, with microtubules or actin filaments has probably seen undulating or spiraling filaments accidentally anchored on surface defects. Accordingly, the topic has already been somewhat adressed in previous publications (e.g. Bourdieu et al Phys Rev Lett 1995; Sekimoto et al Phys rev Lett 1995; Vilfan et al Nanoletter 2019). As a matter of fact, microtubules anchored on defects in standard gliding assay can show oscillations very similar to those shown here. However, the lack of control over filament anchoring has precluded a systematic experimental study of the oscillatory filament dynamics. It is worth noting that ther bottom-up approaches have used filament bundles instead of single filaments, either with microtubules and kinesin motors (Sanchez et al Science 2011) or actin filaments and myosin motors (Pochitaloff et al Nature Phys 2022). These assays evince more regular oscillations (over tens of cycles) and waveforms that more closely resemble those of eukaryotic flagella than reported here.

      • Here, the authors have developed an experimental strategy to increase the chance of anchoring single filaments' plus end to the substrate, potentially allowing for more control of the experimental conditions that lead to the emergence of oscillations (but see my criticisms above). Anchoring is made more likely, because short segments of biotinylated tubulin are added to the end of bare microtubules to make them stick to the substrate, which has been functionalized with streptavidin. A similar protocol had been reported before in the literature to study buckling of single microtubules by single kinesin motors (Gittes et al Biophys J 1996), but is here used at larger motor densities on the substrate. There is unfortunately no quantification of the success of the approach.

      • The comparison of the experimental data to Cytosim simulations is, to my knowledge, novel and a clear asset of the work, although this comparison could be more effective, as detailed above.

      • The emergence of periodic wave-like beating oscillations in motor-filament systems is a classical problem in biophysics. This problem is particularly relevant in the context of eukaryotic cilia and flagellar beating in biology. The audience for the present work is thus potentially broad, although the simplistic and artificial nature of the in-vitro system, with only one microtubule, will probably appeal more to biophysicists and theoretical physicists than biologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The authors use a modified version of conventional gliding assays to induce microtubule bending, buckling, looping and cyclic beating (which they term "flagella-like") via clamping the plus ends of gliding microtubules to the surface. They find that the pattern of motion depends on different factors such as microtubule length and motor density. They build a simple computational model that predicts transitions between microtubule motion patterns depending on these parameters.

      Major comments:

      • Overall, the experimental data is extremely sparse. As far as I can see, there are only two replicas for the lower motor density. It is not clear to me how the authors define the boundaries in the experimental phase diagram in Fig. 7. To build a phase diagram - where one axis corresponds to the motor density - on just two experiments is not convincing. I would need to see more experiments covering a larger range of motor densities and at least three replica per condition.

      • It is not clear to me why the proportion of pinned vs. free microtubule segments should affect the beating pattern. I would expect that the free microtubule segment does not "feel" the length of the clamped segment, if it is indeed fixed all along its length and unable to move / bend. The simulations use only two anchor points at the pinned tip. The segment in between the anchor points bends, which could affect how the free microtubule segment behaves. To support the claim that it is indeed the proportion of the lengths of the pinned vs. free segments and not simply the length of the free segment alone that influence the beating pattern, I would expect to (1) see the corresponding and thoroughly quantified experimental data that verifies this simulation-based prediction. Fig. 5C is based on only three microtubules and it is not clear how long the segments are. (2) the entire pinned segments in the simulation should be fixed. This should also be compared to experimental data, where the lengths of the free segments are the same and only the lengths of the pinned segments vary.

      • In relation to my previous comments: I would expect a direct comparison between the simulation-based prediction that the beating pattern changes with microtubule length and motor density in a quantitative manner, where all pinned microtubules observed experimentally are analyzed. The figures are often based on single observations.

      • The authors report that the pinned microtubules typically undergo 2-3 cycles of beating. This number is very low, and I am hesitant to call it "flagella-like" cyclic beating. Is this due to the dynein motors being much slower than e.g. kinesis? To confirm this and support the generality claimed by the authors, I would like to see experiments with a different, faster motor. If other motors are not readily available to the authors, this would imply a substantial amount of time and effort though.

      • Please perform statistical analysis of the experimental data.

      Minor comments:

      • Number of replicates and samples should be indicated in the figures.

      Significance

      • The approach to clamp the plus ends of gliding microtubules in order to induce buckling, bending and beating is elegant and should be easily transferable to other groups who may be interested in this method, since it is straightforward to adapt conventional gliding assays to induce pinning.

      • The study could potentially be interesting to an audience studying flagella-like systems. Since the system is simple and based on in vitro components with defined parameters, it could serve as a basis for studying more complex systems or testing the influence of particular proteins associated with flagella. However, I do not see a major advance regarding our understanding of flagella or similar structures based on the manuscript. In combination with the model, I see it majorly as a useful tool, providing methodological advance. It would be desirably to make the computational model available to the public.

      • The computational model seems useful and straightforward to me, yet my background is purely experimental and I cannot judge the model in detail.

      • In my view, the most important limitation of the manuscript is its lack of thorough experimental data to support the claims made by the authors. In its current state, the manuscript seems rather preliminary and I see the need for significant additional experimental evidence.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This work combines in-vitro experiments and numerical modeling to study the dynamics of microtubules, driven by molecular motors. In this bottom-up approach, molecular motors are immobilized on the surface and microtubule filaments are anchored to the surface from one end. The dynamics results in "beating like" motion of the anchored microtubules. The authors establish a phase diagram of the different dynamical patterns of "beating like" motions by varying the molecular motor density and the length of the microtubule anchored to the surface. They use a numerical framework that captures the observed patterns.

      Major comments:

      Overall the experiments and results are well described and claims are supported by the data. Both experimental and numerical methods are presented in a way that they can be reproduced.

      Minor comments:

      A key feature of beating cilia is the asymmetry of the beat pattern (fast stroke and slow recovery). It might be interesting to use the kymographs or the Phy vs time analysis to see whether or not this feature exists in this simplified experimental model.

      Also, the beating frequency is very low (mHz) compared to real cilia/flagella (~Hz). Would it be possible to use the model to predict which parameter would need to be tuned to reach more physiologically relevant beating frequencies ?

      Significance

      This study is part of the field of in-vitro reconstitution, from a minimal set of components, that aims to reproduce a biological function to identify and understand the minimal physical/biophysical mechanisms underlying a function. This study might be of interest for the people who address questions of the self-organization of cytoskeletal elements in minimal systems.

      The main limitation of this study relies on the claim of reproducing a flagella-like motion. Indeed, the frequency of the described oscillations is in the mHz range while the frequency of cilia is in the range of few Hz to tens of Hz. This suggests that the mechanism at play in such a reconstituted system is not the one that drives beating in real cilia/flagella. Yet, this limitation also applies to other studies in the field (Vilfan et al. 1999, Guido et al. 2022 ...).

      My second concern is that the added value with regards to state of art is not clearly explicit. I'm thinking about the work of the Isabelle Guido's team where they have more complex reconstituted systems (a pair of 2 microtubules); or the work of Pascal Martin's lab where the design of the system allows to capture more complex mechanisms such as myosin density waves, which result in frequency beat of 0.1Hz.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The manuscript describes that simultaneous inhibition of LOXL2 and BRD4 reduces proliferation of TNBC in vitro and reduces growth in vivo.

      This observation is followed by extensive mechanistic studies that suggest physical interaction between LOXL2 and short isoform of BRD4-MED1. Inferences from Chip-seq analyses suggest that this interaction is involved in regulation of multiple transcriptional programs. Authors focus on differential activation of DREAM complex, to claim that this interaction "is fundamental for proliferation of TNBC". The manuscript is very well written and mechanistic inferences are based on a set of sophisticated epigenetic analyses and bioinformatical inferences. The phenotypic effects from LoxL2 inhibition by itself, or in combination with BRD4 inhibition are relatively modest. These modest effects, as well as many of the reported changes in gene expression are clearly inconsistent with the frequently used adjectives as "dramatic", "fundamental", "deeply affected", "drastically hampered" etc. Given the modest phenotypic effects, many of the key claims and conclusions are not supported by the data.

      We thank the reviewer for appreciating our work, defining the manuscript as well-written, and saying that it comprises extensive mechanistic studies as well as sophisticated epigenetic analysis.

      We apologize if some of our statements seemed exaggerated. In this revised version, we revisited some of our conclusion to moderate them.

      Moreover, we took the reviewer's criticism as an opportunity to strengthen our findings. In the revised version of the manuscript, we included an additional TNBC PDX (PDX-127), and results from this experiment clearly reinforce our claims (Fig. 6D and Fig. EV9E-F). In this new in vivo experiment, we selected a PDX model in which the expression of BRD4L is not detectable, while BRD4S is clearly expressed. Therefore, the treatment with JQ1 would specifically affect the activity of BRD4S, making the treatment selective. Additionally, we reduced by half the dose of JQ1 administrated to limit the effect of BRD4S inhibition alone on tumor growth. The combinatorial treatment (JQ1+PXS) induced a clear superior effect in this setting as compared with single-agent treatments. In addition to this, we discarded that the observed growth reduction is not the result of the sole inhibition of LOXL2, which could affect FAK/Src activity or extracellular Collagen crosslinking. In conclusion, our data show that the combinatorial inhibition of LOXL2 and BRD4S is effective in reducing tumor proliferation in TNBC in vivo models, independently of the inhibition of BRD4S and of other pathways known to be regulated by LOXL2.

      Specifically:

      1) It is unclear why authors generalize their conclusions to TNBC. Figure 1B demonstrates synergy for 1/3 cell lines, which is chosen for the follow up study. Even for MDA231, the synergy is confined to low concentrations of BRD4i (S1c). While MDA231 cell line is frequently used in experimental studies of TNBC, it is quite dissimilar to majority of clinical TNBC, and contains mutant RAS, which is rare in this disease.

      The synergistic effect is observed in MDA-MB-231 cells because only this cell line expresses both BRD4S and LOXL2. Indeed, in Fig. 1C we show that MDA-MB-468 cells do not express LOXL2, while BT549 only express minimal BRD4 levels.

      To corroborate this hypothesis, in the revised version of the manuscript we added:

      1. A new cell line (Cal51) expressing the same LOXL2 and BRD4 levels (Fig. EV8C) but showing greater resistance to JQ1 than MDA-MB-231 (Fig. EV8D). Also, in this cell line, we could show that the combinatorial treatment had a superior effect on cell viability than the single agents’ treatment (Fig. EV8E).
      2. A western blot panel of different TNBC PDXs shows that the majority of them express medium to high levels of both BRD4S and LOXL2 proteins, as is the case of MDA-MB-231 (Fig. EV9E) and Cal51 (Fig. EV8C). This result suggests that the combinatorial treatment could be used in the majority of TNBC patients as they are expected to express both BRD4S and LOXL2.
      3. Finally, as explained above, we performed another in vivo choosing a PDX that expresses BRD4S (but not BRD4L) and LOXL2 (PDX-127) (Fig. 6D and Fig. EV9E-F). Also, in this new model, we could observe that the combinatorial inhibition had a superior effect than single treatments.

        2) In vivo, the effect appears to be modest even in the MDA231 model, selected for evidence of synergy in vitro. In vivo, the combination appears to have an additive effect. Tumor growth rates are reduced, but no shrinkage is occurring. In the PDX model, LOXL2i does not have an effect as a monotherapy, while modestly enhancing the impact of BRD4i. These results are at odds with the claim of the interaction being fundamental for proliferation.

      We agree with the reviewer that the combinatorial inhibition appears to have an additive effect in vivo using the MDA-MB-231 model.

      1. For that reason, we have now performed the in vivo PDX experiment mentioned above (PDX-127; Fig. 6D and Fig. EV9E-F) in which we decreased the dose of JQ1 by half to avoid strong tumor growth effect due to BRD4 inhibition alone. In this new experiment, the synergistic effect is evident. While single-agent treatment showed a very moderate effect (0% or 20% tumor growth reduction for LOXL2 and JQ1, respectively), the combinatorial treatment showed a 50% reduction in tumor volume, further supporting our conclusions.
      2. We also performed either BRD4 or MED1 pull-down experiments in the presence of PXS and JQ1. We show that upon PXS treatment, the interaction between LOXL2 and BRD4S is maintained while the interaction with MED1 is reduced (Fig. 5A-C). However, in the presence of JQ1, the interaction between LOXL2 and MED1 is maintained while BRD4S-LOXL2 and BRD4S-MED1 interactions are impaired (Fig. 5D-F). These new results explain why monotherapy does not have a sufficient effect in vivo and set the rationale for the use of the combinatorial treatment. We believe that these new results corroborate our initial findings and we hope to have been able to satisfy the reviewer comments.

      3) No analysis of cell proliferation was shown in vivo. Authors should have performed BrdU or KI67 staining to support the claim. For in vitro analyses, authors also used indirect assays for proliferation. PI staining by itself does not have sufficient resolution to clearly capture modest effects that authors demonstrate. BrdU-PI double staining would have been much more useful.

      We appreciate the reviewer’s comment. In the revised manuscript we have added Ki67 and H3S10p staining in the tumor samples for the new in vivo PDX experiment (Fig. 6E and Fig. EV10A-C). We show that the combinatorial treatment significantly induces a reduction of both proliferation markers, which is in agreement with a reduced tumor volume. Regarding the in vitro analysis, we did not only use PI staining to show a reduced proliferation state but also H3S10p staining (Fig. 4B) and an SLBP1 fluorescent reporter MDA-MB-231 cell line (Fig. 4D, Fig. EV6B, E, and Movie EV). In the revised version of the manuscript, we included a new FACS-PI analysis (Fig. 4A, C) to better represent the effects we see on the cell cycle.

      Minor points:

      Dose dependent decrease in phosphorylated H3 is not at all obvious from eyeballing the data in S1A; the only effect that I see is a modest reduction at the highest concentration of the inhibitor. Authors need to quantify the results to support the claim.

      We agree with the reviewer and we apologize for the misinterpretation. We have changed the revised manuscript as follows: “The selective LOXL2 inhibitor PXS-538224 (hereafter, PXS) efficiently reduced the levels of oxidized histone H3 (H3K4ox) in MDA-MB-231 cells at 40 μM (Fig. EV6C), indicating an efficient inhibition of LOXL2 catalytic activity in the nucleus.”

      Most of breast cancer cell lines are derived from metastatic disease, including pleural effusion, thus the point that because MDA231 cell line is derived from pleural effusion, it is metastatic does not have sufficient logical foundation.

      Many publications have shown the high metastatic capacity of MDA-MB-231 (e.g. https://doi.org/10.1016/j.bbabio.2011.04.015, doi: 10.1038/s41467-017-01829-1), which are therefore used as TNBC metastatic model. The scope of the analysis reported in Fig. 6C was just to show whether any of the used treatments could reduce the metastatic capacity of this cell line. We believe we do not overstate the results but just report them as they are.

      How is loss of cell-cell junction in vitro consistent with LOXL2 role in modulating ECM? There is no evidence of ECM production in MDA231 in vitro. On the other hand, this loss is associated with EMT.

      We thank the reviewer for identifying this mistake. In the revised manuscript we changed the text as follows: “Gene set enrichment analysis (GSEA) revealed that LOXL2 KD induced upregulation of processes involved in cell morphology, secretion, membrane trafficking, and cell differentiation, with cell-cell junction being one of the most significantly affected pathways (Fig. EV5E). These results agree with the role of LOXL2 in regulating epithelial-to-mesenchymal transition, corroborating the high quality of our dataset.”

      Reviewer #1 (Significance (Required)):

      Discovery and characterization of LOXL2-BRD4 interaction is advancing the ever-deepening understanding of molecular mechanisms of regulation of gene expression. The studies and analyses appear to be sufficiently rigorous and reported with clarity, and the claimed discovery of the biological interaction between LOXL2 and BRD4 is well supported. However, given the magnitude of the reported (rather than claimed) effects of this interaction, and concerns about generalizability of authors conclusions, it is not clear how these results are promising for the development of new therapies in TNBC. Moreover, in contrast to luminal BC, there is no clear evidence for utility of cytostatic drugs in constraining TNBC. Therefore, biological and clinical significance of the authors discovery is unclear and claims in this regard appear to be overblown

      We thank the reviewer for stating that our analysis is rigorous and reported with clarity. We really took the criticisms as an opportunity to strengthen our findings, as explained above.

      For the newly presented in vivo PDX model, we performed immunohistochemistry of Ki67, H3S10p and Cleaved Caspase 3 to check whether the reduction of tumor volume observed in the combinatorial treatment was a result of a cytotoxic and/or a cytostatic effect (Fig. 6E and Fig. EV10A-C). As shown in the figure, the combination of the two inhibitors induced a superior decrease of Ki67, H3S10p, and a clear increase of Cleaved Caspase 3. Therefore, these new data indicate that the combinatorial treatment does not only have a cytostatic effect but also cytotoxic, suggesting a clinical exploitability for the treatment of TNBC patients.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In their study, Pascual-Reguant et al. show that combined inhibition of BRD4 and LOXL2 can synergize to restrict triple-negative breast cancer (TNBC) proliferation. BRD4 and LOXL2 are transcription regulators that can read and write epigenetic information, respectively. The authors employ three distinct breast cancer cell lines and mouse models with cell line-derived xenografts, and they show that combined inhibition of BRD4 and LOXL2 can be superior to single BRD4/LOXL2 inhibition in these model systems. In an attempt to identify a connection between BRD4 and LOXL2, the authors find that the two proteins can bind to each other. The authors performed most of the experiments in the breast cancer cell line MDA-MB-231. To assess the impact of LOXL2-inhibition on transcription, the authors assessed changes of the transcriptome in MDA-MB-231 cells following LOXL2 knockdown. They found that genes related to cell differentiation and morphology were upregulated, while genes related to the cell cycle were downregulated. ChIP-seq data of BRD4 showed that BRD4 can bind to cell cycle gene promoters and that this binding was enhanced upon loss of LOXL2. The authors found that LOXL2 and BRD4 interacted with the transcriptional cell cycle regulators B-MYB, FOXM1, and LIN9, which are components of the MYB-MuvB-FOXM1 (MMB-FOXM1) complex that is known to promote the expression of late cell cycle genes with important functions during mitosis. The authors conclude that LOXL2/BRD4 interact with each other and with the MMB-FOXM1 complex to drive the expression of cell cycle genes and cell proliferations. Vice versa, they conclude that inhibition of LOXL2/BRD4 reduces cell proliferation through inhibiting the expression of cell cycle genes.

      Major:

      • The data and methods are presented well. The experiments are adequately replicated and analyzed. However, except for the first section, all experiments were performed using only one cell line. It is important to validate key findings in at least a second cell line.

      We thank the reviewer for valuing our work.

      To address the reviewer’s comment, in the revised manuscript we added an additional cell line (Cal-51), that expresses similar levels of LOXL2 and BRD4 as compared to MDA-MB-231 (Fig. EV8C). Even though this cell line is clearly more resistant to JQ1 than the MDA-MB-231 cell line (Fig. EV8D), the combinatorial treatment is significantly more effective as compared with single agents’ treatment (Fig. EV8E).

      Moreover, we have also performed an additional in vivo experiment using another TNBC PDX (PDX-127) that expresses LOXL2 and BRD4S, but not BRD4L. Given that JQ1 can inhibit both BRD4 isoforms, this in vivo system allowed us to demonstrate that the tumor antiproliferative capacity of the combinatorial treatment is due to the simultaneous inhibition of LOXL2 and BRD4S (rather than BRD4S and L) (Fig. 6D and Fig. EV9E-F).

      • There appears to be a misunderstanding of the concept of cell cycle-dependent gene regulation by the DREAM complex and its related factors. Early (G1/S) cell cycle genes contain E2F promoter motifs, while late (G2/M) cell cycle genes contain CHR promoter motifs. The DREAM complex can bind both, while RB-E2F and MuvB recognize only E2F and CHR motifs, respectively. B-MYB and FOXM1 bind to MuvB and regulate late cell cycle genes, but they do not bind to early cell cycle genes. Given this concept, the authors' rationale to connect BRD4/LOXL2 through MuvB/B-MYB/FOXM1 with E2F promoter sequences and early cell cycle genes and the subsequent conclusions must be corrected.

      We thank the reviewer for their expert explanation. We corrected our conclusion in the revised version of the manuscript following the reviewer’s comment.

      • I felt that the suggested functional connection between LOXL2/BRD4 and DREAM is not strongly supported by the authors' data. Figure S6E: A similarity score of Fig. EV6E: We agree with the reviewer that a similarity score of Fig. 4E: We thank the reviewer for this comment. The performed pulldown showed that BRD4S, LOXL2, and MED1 interact with Lin9 and B-Myb, but not with FOXM1, thus FOXM1 itself is an internal negative control of the pulldown. Additionally, BRD4L does not show the same interaction pattern as BRD4S, LOXL2, and MED1, again acting as an internal negative control. We, therefore, believe that the pulldown is properly controlled and that the observed interaction is trustful. We furthermore agree with the reviewer that it would be interesting to characterize the interactions between the DREAM complex and BRD4S, LOXL2, and MED1. However, we believe that the dissection of these interactions at the mechanistic levels would require a deeper study, which can be a project in itself that we aim to explore in the future. For example, it would be interesting to investigate whether either the inhibition or the downregulation of LOXL2 and/or BRD4S specifically impairs the formation of the DREAM complex or the recruitment of specific DREAM complex subunits, as well as how these effects impair the DREAM complex chromatin binding. We are afraid that the suggested pulldowns would not be sufficient to answer these questions, which would require extensive cross-interaction studies in either BRD4/LOXL2 and BRD4+LOXL2 inhibition or downregulation followed by ChIP-seq and transcriptomics for all the conditions. We believe that the provided data, together with the functional characterization (both, in vitro and in vivo), of the phenotypes triggered by BRD4S and LOXL2 inhibition make a strong case for our manuscript and leave out of scope the suggested experiments. We hope the reviewer will understand our explanation and will appreciate that we are planning to pursue this further in the future.

      Fig. 3: We thank the reviewer for this important comment. The ChIP-seq technique very often does not provide exhaustive results due to sequencing depth limits and antibody performance. We believe that the fraction of DREAM target genes found in our dataset as bound by BRD4S is not exhaustive and that the analysis proposed by the reviewer would not lead to clear conclusive results. However, we understand the importance of verifying that DREAM target genes whose promoter is bound by BRD4 are indeed downregulated when LOXL2 is inhibited. To give an answer to this question, in the revised manuscript we added gene expression analysis of selected DREAM target genes upon treatment with JQ1, PXS their combination. We could successfully show that both JQ1 and PXS treatment impairs the transcription of the selected DREAM target genes, however, the combinatorial treatment almost shut down their expression, in agreement with our hypothesis (Fig. 5J).

      • The authors state that it is surprising to find that LOXL2 can promote target gene transcription because it is rather known as a transcriptional repressor. To this point, the authors should perform standard analyses using their RNA-seq and ChIP-seq data. Compare differential expression of genes that are bound by BRD4S/L/S+L and genes not bound by BRD4. Perform motif search and enrichment analyses for transcription factor and co-factor binding data (public ChIP-seq repositories). Such analyses may suggest what gene sets are up- and downregulated by LOXL2 through BRD4S/L and what other factors could be involved in LOXL2-dependent up- and downregulation of gene transcription.

      We thank the reviewer for this valuable comment that certainly provides the rationale for a follow-up project. However, we believe that the proposed study goes beyond the scope of our work at this moment.

      Minor:

      • I felt that background information on the BRD4 isoforms was missing. The short and long isoforms of BRD4 should be introduced briefly.

      We agree with the reviewer. In the revised manuscript, we addressed this by presenting BRD4 isoforms in the introduction part of the manuscript.

      • Given that BRD4 inhibition is known to activate p53 (e.g., PMID 23317504 and 33431824) and p21 (PMID 31265875), the authors should discuss the p53 status of their cell lines (largely mutant). In general, I felt that the authors could better cite and discuss the current literature on BRD4 and LOXL2.

      We appreciate the comment of the reviewer regarding p53. Given the fact that p53 is mutant in MDA-MB-231, we believe that the proliferation defect observed with the combinatorial treatment may be due to the activation of alternative cytostatic or cytotoxic signaling cascades, independently of P53 activation. We have now briefly mentioned this point in the manuscript discussion.

      • It was unclear to me why the authors did not actually test experimentally whether their predicted interaction models 2 or 4 are likely true (Figure 2E+G).

      We understand the reviewer’s comment. The fact that JQ1 treatment almost abrogates the interaction between LOXL2 and BRD4S strongly suggests that models 1 and 3 are likely wrong, therefore pointing towards models 2 and 4 as the correct ones. To test whether models 2 and 4 are indeed the correct models we are now performing extensive mutagenesis studies, which are producing preliminary results suggesting indeed that models 2 and 4 are correct. The reason why we did not include this study in the current manuscript, is that we started a parallel line of investigation aimed at identifying residues fundamental for the interaction that can be exploited in compound screening campaigns to identify molecules able to block the described interaction and thus cancer proliferation. Publishing these preliminary results at this stage could jeopardize the drug discovery campaign and we hope that the reviewer will understand our constraints.

      • The transcription of cell cycle genes depends on the cell cycle (i.e., reduced cell cycle entry correlates with reduced cell cycle gene expression). Given that the authors showed LOXL2 inhibition reduce MDA-MB-231 cell proliferation, they should note that reduced expression of cell cycle-related genes is expected upon LOXL2 knockdown.

      We understand the reviewer’s comment. We believe that we provide sufficient data supporting our hypothesis that LOXL2 controls the expression of cell cycle genes at the transcriptional level together with BRD4S. In addition, the sole inhibition of LOXL2 has practically no effect on tumor proliferation in vivo but largely enhances the antiproliferative effect of low-dose JQ1 (Fig. 6D). We hope these clarifications would satisfy the reviewer.

      • The authors specify in their discussion that their data show a function of LOXL2/BRD4 in the cell cycle interphase, while there were no experiments that support that specific conclusion. At least it is unclear to me why the authors rule out a function in mitosis?

      We thank the reviewer for this comment. We referred to interphase genes because these are the early cell cycle genes, while mitotic genes are the late ones. We do not discard a possible function for BRD4S and LOX2 regulating mitotic progression, however, we believe this would be a consequence of dysregulated G1-S-G2 gene expression, rather than a direct transcriptional effect. This conclusion derives from the fact that while we observe interactions between LOXL2, BRD4S, and MED1 with Lin9 and B-Myb, these are not fully conserved with FOXM1, which is typically required for the transcription of mitotic genes. To avoid confusion, we have now anyway removed the word “interphase” from the text.

      • I felt that the first part of the manuscript (combination of BRD4 and LOXL2 inhibitors in TNBC) was a bit uncoupled from the functional studies on LOXL2 and its connection to BRD4. The transition between these parts and the final discussion on why the joint control of cell cycle genes by LOXL2/BRD4 may be important for the synergistic effect of LOXL2/BRD4 inhibitors. To this point, the authors' model was not clear to me.

      We really appreciate the reviewer’s comment. To better connect the functional studies with the clinical significance of the proposed combinatorial treatment, we restructured the manuscript. In the revised version, the use of the combinatorial treatment is shown in Figure 6. Moreover, to better explain why we focused all the studies on BRD4 and LOXL2, we also included data from the Cancer Cell Line Encyclopedia (CCLE)-associated chemotherapeutics sensitivity (Fig. 1A and Fig. EV1) showing that LOXL2 expression levels can predict the response to BRD4 inhibition, suggesting a functional interaction between BRD4 and LOXL2 and the possibility to exploit it for therapeutical purposes. We believe that these data set the rationale to further explore the connection between LOXL2 and BRD4, both at the mechanistic and functional levels.

      Reviewer #2 (Significance (Required)):

      The study by Pascual-Reguant et al. shows that inhibitors of BRD4 and LOXL2 can be combined to achieve better efficacy in reducing proliferation of breast cancer cell lines and breast tumor growth in xenograft models. They provide strong evidence for a functional interaction between LOXL2 and BRD4 and investigate their common transcriptional targets. Intriguingly, some evidence points towards a direct regulation of the DREAM complex and its cell cycle gene targets.

      The findings are novel and can be the basis for further research on TNBC combination therapy using BRD4 and LOXL2 inhibitors. The link to the DREAM complex is preliminary.

      The study is of interest for a basic research audience with some translational aspects.

      I reviewed this manuscript as a researcher in gene regulatory mechanisms, with cell cycle genes as one focus area. I have no expertise in the computational modeling of protein-protein interactions and I am no expert for breast cancer.

      We thank the reviewer for the positive comments. We also would really like to thank the reviewer for their criticism, which, we believe, contributed to a new and improved manuscript version.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this manuscript, Laura Pascual-Reguant et al. identified a novel role of the LOXL2 oxidase in sustaining cell cycle progression through a so far uncharacterized gene-activating function is mediated by the BRD4S epigenetic reader and exerted on key DREAM-target genes in TNBC. Moreover, the authors showed that combinatorial treatment of TNBC with LOXL2- and BRD4-specific inhibitors result in a tremendous anti-tumorigenic effect. For all findings, they leveraged in vitro and in vivo settings as well as high-throughput sequencing approaches. However, the following points should be addressed and explained.

      Major points:

      -The authors on their working hypothesis propose that dual inhibition of BRD4 and LOXL2 is a novel strategy for curing TNBC. For my taste, just because both targets are quite promising for TNBC, the jump to this combinatorial treatment is kind of abrupt. Knowing the difficulty and time-/financial- investment, authors could optionally perform a mass spectrometry analysis on nuclei lysates with LOXL2 pull down to identify physical interactors. Due to the augmented resources and analysis of raw data, authors may necessitate a generous revision period (approx. 4 months for starters). By that, this can provide a more unbiased approached to look at nucleus-specific gene-regulatory functions and particularly at epigenetic readers. It would be also interesting to see if LOXL2 interacts with other members of the BRD family. Selecting BRD4 and no other members of the bromodomain family cannot be the only choice given that other BRD members can also interact with several of these mediator subunits.

      We thank the reviewer for the suggestion and we agree with the fact that the rationale for combining BRD4 and LOXL2 inhibitors was not sufficiently argued in the first version of the manuscript. For that reason, in the revised manuscript, we added new data to explain why we explored this topic. In particular, to better explain why we focused all the studies on BRD4 and LOXL2, we included data from the Cancer Cell Line Encyclopedia (CCLE)-associated chemotherapeutics sensitivity (Fig. 1A and Fig. EV1) showing that LOXL2 expression levels can predict the response to BRD4 inhibition (but not to other approved chemotherapeutic drug), suggesting a functional interaction between BRD4 and LOXL2 and the possibility to exploit it for therapeutical purposes. Moreover, we restructured the manuscript to make the story more linear, explaining first the functionality of BRD4S-LOXL2 interaction at the molecular and cellular levels, and then presenting the in vivo systems in the last part of the manuscript.

      We agree with the reviewer that it may be interesting to explore whether LOXL2 interacts with other BRD family members. However, given the prominent role of BRD4 in promoting cancer proliferation, we believe that understanding the relevance of BRD4S-LOXL2 interaction in TNBC is, per se, of great interest and provide a novel mechanistic understanding of how TNBC proliferation is controlled at the transcription level. In the specific case of TNBC, it has been shown that BRD4S has an oncogenic effect, while BRD4L is an oncosuppressor. In the manuscript, we now showed that LOXL2 downregulation sensitizes cells to JQ1 treatment (Fig. 1D). Additionally, while the downregulation of BRD4L does not have any additional effect on cell treated with PXS, the downregulation of BRD4S sensitize them to LOXL2 inhibition (Fig. EV8B). These results, once again, indicate the relevance of studying the functional interaction between BRD4S and LOXL2.

      -LOXL enzymes have been shown to promote collagen and fibronectin assembly, thereby sustaining the pro-survival effect of the ITG5A/FN1/FAK/SRC signaling cascade and shielding TNBC cells against chemotherapy treatment (32415208). Did authors observe if LOXL2 loss or inhibition decreased the active status of FAK and SRC, which are well known to promote G1-S transition (25381661)?

      Probably the cell cycle defects upon LOXL2 loss may also partially arise from the impairment of this cascade.

      We really appreciate the reviewer’s suggestions. In the revised version of the manuscript, we checked FAK and Src activation status in tumor samples from one of our in vivo experiments (Fig. EV10D). We did not observe any difference in phospho-FAK or phospho-Src upon treatment either with PXS, JQ1, or their combinations, suggesting that alterations in the activity of these factors were not driving the observed proliferation defects.

      -Authors exclusively use JQ1 as a BRD4 inhibitor. As JQ1 may have an unspecific effect on BRD2 as well, authors should consider reproducing key experiments with siControl- and siBRD4-treated cells and increasing doses of PSX as well as repeating the JQ1 dose response assay in Figure 1B using siRNA-mediated silencing of LOXL2. Given that both players are part of the same complex, silencing of one and inhibition of the other should sensitize cells compared to their control counterparts.

      We agree with the reviewer and we addressed this comment in the revised manuscript. In particular, we have added two additional experiments:

      • We transduced MDA-MB-231 cells with isoform-specific shBRD4s (shBRD4L and shBRD4S) (Fig. EV5H) and checked cell sensitivity to PXS treatment (Fig. EV8B). As explained also above, we observed that only when the short isoform of BRD4 was downregulated cells displayed higher sensitivity to PXS treatment. This result corroborates that BRD4S and LOXL2 are required for TNBC proliferation.

      • We transduced MDA-MB-231 cells with shLOXL2 and assessed JQ1 sensitivity (Fig. 1D). We showed that upon LOXL2 downregulation, cells became more sensitive to JQ1 treatment, again corroborating the fact that TNBC proliferation requires BRD4S and LOXL2.

      -Moreover, in Figures 1G and S3D the differential sensitivity of low and high LOXL2 cell lines is unclear. Do authors know if any of these growth kinetic lines represent one of the tested cell lines in Figure 1A-B? Authors should provide respective legends. In addition, authors should take advantage of their homemade data given that they have already selected a panel of TNBC cell lines with various LOXL2 expression at basal state (Figure 1A) for which dose response assays have been performed (Figure 1B). Therefore, I would perform an IC50 graph for JQ1 (without PSX treatment) using the existing data from Figure 1B.

      We apologize if our representation was confusing. In the revised manuscript we have changed the sensitivity plots (Fig. 1A and Fig. EV1) to make them easier to grasp. Additionally, in Figure 1A we included the analysis of CCLE cell lines stratified based on their LOXL2 expression levels. This analysis showed that LOXL2 expression levels could overall predict the response to BETi treatment. As suggested by the reviewer, we also plotted the IC50 of the 3 cell lines tested. However, their JQ1 sensitivity curves did not show any difference that could be attributed to their different LOXL2 levels. Our speculation is that only 3 cell lines do not provide a sufficient size to reach a meaningful conclusion, which, in contrast, can be achieved by comparing the CCLE BETi sensitivity.

      -In Figure 2D, the pull-down assay is inconclusive, as the molecular weight for each construct is not mentioned. I would probably add this information also in all performed western blots. Also, the overexpression of the BD1/BD2-mutated and especially the BD1/BD2-lacking construct is unclear if it still interacts with LOXL2, probably because of the lack of molecular weight reference of each band. Therefore, the authors should make this pull-down assay more descriptive regarding the size of the bands. Also, BD1 mutagenesis at N140 was shown to dislodge the binding of JQ1 to BRD4 (24497639), which implies that BD1 mutagenesis or overexpression of the BD1-deficient construct should abrogate the interaction of LOXL2 with BRD4, reminiscent to the abrogated interaction of BRD4/LOXL2 upon JQ1 that binds to both BDs (Figure 2F). And, what happens if a BD2-deficient construct is expressed?

      We thank the reviewer for spotting this distraction. We apologize for this and in the revised version of the manuscript we included molecular weights for all western blots.

      We acknowledge that BD1 mutagenesis displaces JQ1 binding, however, we respectfully disagree that because of this BD1-N140 mutant should not bind to LOXL2. Our docking analysis indeed showed that none of the poses is impaired either by BD1 or BD2 mutagenesis (Fig. EV4D). The fact that JQ1 disrupts the interaction between BRD4S and LOXL2 (Fig. 2F, G) is not due to the fact that they compete for the same binding residue, but rather for the space occupied by JQ1 inside the AcK binding pocket of either BD1 or BD2, which impedes proper binding to LOXL2. Our pulldown data indeed showed that mutant BD1 and BD2 retain the ability to bind to LOXL2 (Fig. 2C), as predicted by the docking.

      We did not try to express constructs either lacking BD1 or BD2 and we cannot speculate what could happen to the BRD4S-LOXL2 interaction in this scenario. Even though this experiment could help dissect the interaction between LOXL2 and BRD4S, we decided to rather perform mutagenesis of specific residues that have been predicted to be important for the interaction. The reason why we did not include this study in the current manuscript, is that we started a parallel line of investigation aimed at identifying residues fundamental for the interaction that can be exploited in compound screening campaigns to identify molecules able to block the described interaction and thus cancer proliferation. Publishing these preliminary results at this stage could jeopardize the drug discovery campaign and we hope that the reviewer will understand our constraints.

      -If authors support that BRD4S is the predominant isoform driving the expression of DREAM-targets, this means that DREAM-targets are mainly bound by BRD4S, relying on Figure 3E-F. However, based on the author's ChIPseq tracks in Figure 3H, DREAM targets such as EZH2 and HMGB2 are co-occupied by both BRD4 isoforms at the basal state on their promoter region. Also, especially for EZH2 and PLK4, authors should set to 'group auto-scale' both conditions in a smaller scale range for ChIPseq- and RNAseq tracks, although I do not these two genes as good candidates representing your analysis. Therefore, authors should initially show all genes (e.g in a table format) that enrich the 'DREAM-targets' signature and select for a greater panel of genes (like for AURKB and HMGB2) demonstrating a preferential occupancy of the BRD4S at their promoter region. Finally, authors are recommended to perform a ChIP-qPCR on these genomic regions at basal state (no LOXL2 silencing) to validate the predominant occupancy of BRD4S and the low/absent occupancy of BRD4L at these genomic sites.

      We apologize for the confusion. To make the figure more understandable, we now scaled all the panels to the same scale and highlighted in grey the promoter region of each selected DREAM target gene. As the reviewer can appreciate, none of these genes is bound by BRD4L in basal conditions (Fig. 3F).

      To better characterize the differential binding, following the reviewer’s suggestion, we performed ChIP-qPCR using Ab2 (which recognizes both BRD4 isoforms), in cells either downregulated for BRD4L or BRD4S with isoform-specific shRNAs (Fig. EV5H). Results showed that only the downregulation of BRD4S reduced the binding of Ab2 to the promoter of the selected DREAM target genes (Fig. 3D), corroborating our hypothesis and validating our ChIPseq strategy.

      -Authors in Figure 3G should select an equal-sized population of randomly chosen non-DREAM-target genes, otherwise, the comparison of log2FC difference between these two gene cohorts is unreliable and difficult to make. Mann-Whitney test should also be performed.

      We thank the reviewer for this suggestion, which was added to the revised version of the manuscript (Fig. 3E, lower panel).

      -Authors should repeat the cell cycle analysis (Figure 4A) as the number of cells subjected to flow cytometry is quite discrepant between the conditions. Also, it is not clear if the experiment was performed in at least biological triplicates (although in the respective legend, it is stated so). If performed in biological triplicates, authors should make a new graph where each cell cycle phase cell population differs between the two conditions. Moreover, the difference in cell cycle defects in LOXL2-inhibited cells (Figure 4C) is indifferent compared to their control counterpart. Therefore, authors should address these inconsistencies.

      We thank the reviewer for the suggestion. In the revised version of the manuscript, we represent the cell cycle also as a bar plot with statistical analysis (Fig. 4A, C). Even though the number of cells was the same across conditions, the sub-G1 population of the LOXL2 KD cells may have distorted the profile of the cell cycle. To avoid misinterpretations, we repeated the analysis in the revised version of the manuscript. Statistical analysis supports that LOXL2 inhibition or downregulation has a significant effect on cell cycle progression (Fig. 4A, C, right panel).

      -Furthermore, authors should explain what was the rational selecting a mediator subunit and specifically MED1 as a possible interacting partner of LOXL2 and BRD4s since MED12 and MED24 were also highly essential (Figure 4F).

      We selected MED1 as a Mediator Complex proxy. In our essentiality analysis MED 1, 9, 10, 12, 15, 16, 19, 23, 24, 25 score as significant, suggesting a functional interaction between LOXL2 and the Mediator Complex, rather than a specific subunit. MED1 has been previously described as a BRD4 partner and it is often used in immunofluorescence to visualize transcriptional foci, which made it the best candidate for follow-up study in our project.

      -Moreover, do authors also observe this functional relationship of LOXL2 and BRD4S in cell cycle progression in other breast cancer subtypes presenting a high proliferation index e.g HER2+?

      Presumably, the author's proposed mechanism applies to a wide panel of breast cancer entities, for which, only key experiments could be performed.

      We thank the reviewer for the suggestion. We hypothesized that other cancer types expressing LOXL2 and BRD4S could also benefit from the combinatorial treatment. Indeed, the CCLE drug sensitivity panel in Fig. 1A comprises cancer cell lines of different origins, not just TNBC, and corroborates that the relationship between LOXL2 expression levels and BRD4 sensitivity exist also beyond TNBC. Even though it is important to experimentally verify this hypothesis, we decided to pursue it in the future to broaden the applicability of the proposed strategy in preclinical settings.

      -Authors in Figure 5H represent LOXL2 and BRD4s as integral chromatin looping factors together with MED1 at promoter and enhancer regions. However, this illustration is an overrepresentation of their finding because authors did not address the differential occupancy of BRD4S upon LOXL2 loss in DREAM-target-specific enhancer regions. If they wish to do so, they may use the RANK ORDERING OF SUPER-ENHANCERS (ROSE) package to call for super-enhancer regions in the proximity of DREAM-targets and confirm similar results as for their TSS-proximal sites.

      We thank the reviewer for the useful suggestion. In the new version of the manuscript, we have simplified the representation, which now does not show super-enhancers. However, following the reviewer’s suggestion, we performed super enhancer analysis using ROSE. Results showed that BRD4S binds to super-enhancers more than BRD4L, including DREAM target gene super-enhancers. Additionally, while LOXL2 KD did not alter the binding of LOXL2 to DREAM target gene super-enhancers, it decreased the binding of BRD4S to them (Fig. EV7D, E). Overall, these data are in agreement with our hypothesis that BRD4S together with LOXL2 controls the expression of DREAM target genes.

      -In the current manuscript, authors did not address the translational relevance of their proposed mechanism in the context of conventional therapies. Knowing that several BRD-specific compounds currently undergo clinical trials, authors should address if LOXL2 low (MDAMB468) and high (BT549) cells demonstrate a differential sensitivity to increasing doses of chemotherapy, in the presence or absence of BRD4. By doing that, LOXL2 apart from being a therapeutic target could be also used as a prognostic marker to stratify patients and achieve better response to standard therapies.

      We really appreciate the reviewer’s suggestion and we think this is a fundamental point. In the new version of the manuscript, we have performed further analysis using a greater panel of chemotherapeutic agents from the CCLE sensitivity database. We now show that LOXL2 low-expressing cells show significantly more sensitivity to BETi treatments, but not to conventional chemotherapeutic agents (e.g. doxorubicin, Olaparib, 5-fluorouracil, paclitaxel, etc.) (Fig. 1A and Fig. EV1), which set the rationale to further explore the functional relationship between BRD4 and LOXL2.

      Minor points:

      -In Figure 1D, the authors should convert the y-axis to a logarithmic scale to better represent the differences between JQ1, PXS, and combo. Also, One-way Anova should be performed between JQ1, PXS and combo.

      We don’t understand the reviewer’s suggestion since Fig. 1D (Fig. 6B, right panel in the revised version) is a tumor picture for which the y-axis cannot be converted to a logarithmic scale.

      -In Figure S6F, authors did not show the sensitivity of LOXL2 low and high cell lines for BRD4 KO. If LOXL2-proficient cells are less sensitive to JQ1, based on Figure 1B, authors should consider showing something similar from the gene essentiality database.

      We agree with the reviewer and we apologize for this mistake. We have included the sensitivity of LOXL2 low and high cell lines for BRD4 KO and also for MYC KO (Fig. EV6G).

      -Authors failed to discuss the work from Ozge Saatci et al (PMID: 32415208) regarding LOXL2 in TNBC and ECM reorganization as well as in other cancer entities (PMID: 35428659) in the context of ECM remodeling. Authors should realize that these published works and the current ones are not conflicting but complement each other.

      We thank the reviewer for the suggestion. In the revised version of the manuscript, we discussed this work.

      Reviewer #3 (Significance (Required)):

      SIGNIFICANCE

      The conception and findings are of enlightening significance for TNBC therapy, especially given the lack of targeted therapies in this particularly aggressive breast cancer subtype. Hence, I posit this work as highly relevant for the cancer epigenetics research community interested in characterizing unknown factors that facilitate the gene-activating function of epigenetic readers in health and disease.

      My field of expertise is to uncover epigenetic vulnerabilities responsible for transcriptional plasticity driving drug tolerance in aggressive forms of breast cancer.

      We would like to take the opportunity to thank the reviewer for the relevant suggestions. We strongly believe the revised version of the manuscript has been substantially improved by addressing the comments the reviewer made.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Laura Pascual-Reguant et al. identified a novel role of the LOXL2 oxidase in sustaining cell cycle progression through a so far uncharacterized gene-activating function is mediated by the BRD4S epigenetic reader and exerted on key DREAM-target genes in TNBC. Moreover, the authors showed that combinatorial treatment of TNBC with LOXL2- and BRD4-specific inhibitors result in a tremendous anti-tumorigenic effect. For all findings, they leveraged in vitro and in vivo settings as well as high-throughput sequencing approaches. However, the following points should be addressed and explained.

      Major points:

      • The authors on their working hypothesis propose that dual inhibition of BRD4 and LOXL2 is a novel strategy for curing TNBC. For my taste, just because both targets are quite promising for TNBC, the jump to this combinatorial treatment is kind of abrupt. Knowing the difficulty and time-/financial- investment, authors could optionally perform a mass spectrometry analysis on nuclei lysates with LOXL2 pull down to identify physical interactors. Due to the augmented resources and analysis of raw data, authors may necessitate a generous revision period (approx. 4 months for starters). By that, this can provide a more unbiased approached to look at nucleus-specific gene-regulatory functions and particularly at epigenetic readers. It would be also interesting to see if LOXL2 interacts with other members of the BRD family. Selecting BRD4 and no other members of the bromodomain family cannot be the only choice given that other BRD members can also interact with several of these mediator subunits.
      • LOXL enzymes have been shown to promote collagen and fibronectin assembly, thereby sustaining the pro-survival effect of the ITG5A/FN1/FAK/SRC signaling cascade and shielding TNBC cells against chemotherapy treatment (32415208). Did authors observe if LOXL2 loss or inhibition decreased the active status of FAK and SRC, which are well known to promote G1-S transition (25381661)? Probably the cell cycle defects upon LOXL2 loss may also partially arise from the impairment of this cascade.
      • Authors exclusively use JQ1 as a BRD4 inhibitor. As JQ1 may have an unspecific effect on BRD2 as well, authors should consider reproducing key experiments with siControl- and siBRD4-treated cells and increasing doses of PSX as well as repeating the JQ1 dose response assay in Figure 1B using siRNA-mediated silencing of LOXL2. Given that both players are part of the same complex, silencing of one and inhibition of the other should sensitize cells compared to their control counterparts.
      • Moreover, in Figures 1G and S3D the differential sensitivity of low and high LOXL2 cell lines is unclear. Do authors know if any of these growth kinetic lines represent one of the tested cell lines in Figure 1A-B? Authors should provide respective legends. In addition, authors should take advantage of their homemade data given that they have already selected a panel of TNBC cell lines with various LOXL2 expression at basal state (Figure 1A) for which dose response assays have been performed (Figure 1B). Therefore, I would perform an IC50 graph for JQ1 (without PSX treatment) using the existing data from Figure 1B.
      • In Figure 2D, the pull-down assay is inconclusive, as the molecular weight for each construct is not mentioned. I would probably add this information also in all performed western blots. Also, the overexpression of the BD1/BD2-mutated and especially the BD1/BD2-lacking construct is unclear if it still interacts with LOXL2, probably because of the lack of molecular weight reference of each band. Therefore, the authors should make this pull-down assay more descriptive regarding the size of the bands. Also, BD1 mutagenesis at N140 was shown to dislodge the binding of JQ1 to BRD4 (24497639), which implies that BD1 mutagenesis or overexpression of the BD1-deficient construct should abrogate the interaction of LOXL2 with BRD4, reminiscent to the abrogated interaction of BRD4/LOXL2 upon JQ1 that binds to both BDs (Figure 2F). And, what happens if a BD2-deficient construct is expressed?
      • If authors support that BRD4S is the predominant isoform driving the expression of DREAM-targets, this means that DREAM-targets are mainly bound by BRD4S, relying on Figure 3E-F. However, based on the author's ChIPseq tracks in Figure 3H, DREAM targets such as EZH2 and HMGB2 are co-occupied by both BRD4 isoforms at the basal state on their promoter region. Also, especially for EZH2 and PLK4, authors should set to 'group auto-scale' both conditions in a smaller scale range for ChIPseq- and RNAseq tracks, although I do not these two genes as good candidates representing your analysis. Therefore, authors should initially show all genes (e.g in a table format) that enrich the 'DREAM-targets' signature and select for a greater panel of genes (like for AURKB and HMGB2) demonstrating a preferential occupancy of the BRD4S at their promoter region. Finally, authors are recommended to perform a ChIP-qPCR on these genomic regions at basal state (no LOXL2 silencing) to validate the predominant occupancy of BRD4S and the low/absent occupancy of BRD4L at these genomic sites.
      • Authors in Figure 3G should select an equal-sized population of randomly chosen non-DREAM-target genes, otherwise, the comparison of log2FC difference between these two gene cohorts is unreliable and difficult to make. Mann-Whitney test should also be performed.
      • Authors should repeat the cell cycle analysis (Figure 4A) as the number of cells subjected to flow cytometry is quite discrepant between the conditions. Also, it is not clear if the experiment was performed in at least biological triplicates (although in the respective legend, it is stated so). If performed in biological triplicates, authors should make a new graph where each cell cycle phase cell population differs between the two conditions. Moreover, the difference in cell cycle defects in LOXL2-inhibited cells (Figure 4C) is indifferent compared to their control counterpart. Therefore, authors should address these inconsistencies.
      • Furthermore, authors should explain what was the rational selecting a mediator subunit and specifically MED1 as a possible interacting partner of LOXL2 and BRD4s since MED12 and MED24 were also highly essential (Figure 4F).
      • Moreover, do authors also observe this functional relationship of LOXL2 and BRD4S in cell cycle progression in other breast cancer subtypes presenting a high proliferation index e.g HER2+? Presumably, the author's proposed mechanism applies to a wide panel of breast cancer entities, for which, only key experiments could be performed.
      • Authors in Figure 5H represent LOXL2 and BRD4s as integral chromatin looping factors together with MED1 at promoter and enhancer regions. However, this illustration is an overrepresentation of their finding because authors did not address the differential occupancy of BRD4S upon LOXL2 loss in DREAM-target-specific enhancer regions. If they wish to do so, they may use the RANK ORDERING OF SUPER-ENHANCERS (ROSE) package to call for super-enhancer regions in the proximity of DREAM-targets and confirm similar results as for their TSS-proximal sites.
      • In the current manuscript, authors did not address the translational relevance of their proposed mechanism in the context of conventional therapies. Knowing that several BRD-specific compounds currently undergo clinical trials, authors should address if LOXL2 low (MDAMB468) and high (BT549) cells demonstrate a differential sensitivity to increasing doses of chemotherapy, in the presence or absence of BRD4. By doing that, LOXL2 apart from being a therapeutic target could be also used as a prognostic marker to stratify patients and achieve better response to standard therapies.

      Minor points:

      • In Figure 1D, the authors should convert the y-axis to a logarithmic scale to better represent the differences between JQ1, PXS, and combo. Also, One-way Anova should be performed between JQ1, PXS and combo.
      • In Figure S6F, authors did not show the sensitivity of LOXL2 low and high cell lines for BRD4 KO. If LOXL2-proficient cells are less sensitive to JQ1, based on Figure 1B, authors should consider showing something similar from the gene essentiality database.
      • Authors failed to discuss the work from Ozge Saatci et al (PMID: 32415208) regarding LOXL2 in TNBC and ECM reorganization as well as in other cancer entities (PMID: 35428659) in the context of ECM remodeling. Authors should realize that these published works and the current ones are not conflicting but complement each other.

      Significance

      Minor points:

      • In Figure 1D, the authors should convert the y-axis to a logarithmic scale to better represent the differences between JQ1, PXS, and combo. Also, One-way Anova should be performed between JQ1, PXS, and combo.
      • In Figure S6F, the authors did not show the sensitivity of LOXL2 low and high cell lines for BRD4 KO. If LOXL2-proficient cells are less sensitive to JQ1, based on Figure 1B, authors should consider showing something similar from the gene essentiality database.
      • Authors failed to discuss the work from Ozge Saatci et al (PMID: 32415208) regarding LOXL2 in TNBC and ECM reorganization as well as in other cancer entities (PMID: 35428659) in the context of ECM remodeling. Authors should realize that these published works and the current ones are not conflicting but complement each other.

      Significance

      The conception and findings are of enlightening significance for TNBC therapy, especially given the lack of targeted therapies in this particularly aggressive breast cancer subtype. Hence, I posit this work as highly relevant for the cancer epigenetics research community interested in characterizing unknown factors that facilitate the gene-activating function of epigenetic readers in health and disease.

      My field of expertise is to uncover epigenetic vulnerabilities responsible for transcriptional plasticity driving drug tolerance in aggressive forms of breast cancer.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their study, Pascual-Reguant et al. show that combined inhibition of BRD4 and LOXL2 can synergize to restrict triple-negative breast cancer (TNBC) proliferation. BRD4 and LOXL2 are transcription regulators that can read and write epigenetic information, respectively. The authors employ three distinct breast cancer cell lines and mouse models with cell line-derived xenografts, and they show that combined inhibition of BRD4 and LOXL2 can be superior to single BRD4/LOXL2 inhibition in these model systems. In an attempt to identify a connection between BRD4 and LOXL2, the authors find that the two proteins can bind to each other. The authors performed most of the experiments in the breast cancer cell line MDA-MB-231. To assess the impact of LOXL2-inhibition on transcription, the authors assessed changes of the transcriptome in MDA-MB-231 cells following LOXL2 knockdown. They found that genes related to cell differentiation and morphology were upregulated, while genes related to the cell cycle were downregulated. ChIP-seq data of BRD4 showed that BRD4 can bind to cell cycle gene promoters and that this binding was enhanced upon loss of LOXL2. The authors found that LOXL2 and BRD4 interacted with the transcriptional cell cycle regulators B-MYB, FOXM1, and LIN9, which are components of the MYB-MuvB-FOXM1 (MMB-FOXM1) complex that is known to promote the expression of late cell cycle genes with important functions during mitosis. The authors conclude that LOXL2/BRD4 interact with each other and with the MMB-FOXM1 complex to drive the expression of cell cycle genes and cell proliferations. Vice versa, they conclude that inhibition of LOXL2/BRD4 reduces cell proliferation through inhibiting the expression of cell cycle genes.

      Major:

      • The data and methods are presented well. The experiments are adequately replicated and analyzed. However, except for the first section, all experiments were performed using only one cell line. It is important to validate key findings in at least a second cell line.
      • There appears to be a misunderstanding of the concept of cell cycle-dependent gene regulation by the DREAM complex and its related factors. Early (G1/S) cell cycle genes contain E2F promoter motifs, while late (G2/M) cell cycle genes contain CHR promoter motifs. The DREAM complex can bind both, while RB-E2F and MuvB recognize only E2F and CHR motifs, respectively. B-MYB and FOXM1 bind to MuvB and regulate late cell cycle genes, but they do not bind to early cell cycle genes. Given this concept, the authors' rationale to connect BRD4/LOXL2 through MuvB/B-MYB/FOXM1 with E2F promoter sequences and early cell cycle genes and the subsequent conclusions must be corrected.
      • I felt that the suggested functional connection between LOXL2/BRD4 and DREAM is not strongly supported by the authors' data. Figure S6E: A similarity score of <0.7 is poor support for a 'consensus E2F sequence' and indicates very limited specificity. Figure 4E: IP with BRD4 and LOXL2 is missing as important control. A chromatin-binding control is missing that does not bind to DREAM/LOXL2/BRD4. To test for binding to the actual DREAM complex, the authors should include E2F4 and p130 in their IPs and western blots, perhaps following LOXL2 inhibition/knockdown. Figure 3: The authors' ChIP-seq data indicate that only a fraction of DREAM targets is bound by BRD4. To provide more evidence that LOXL2/BRD4 may be directly involved in regulating DREAM targets, the authors should compare the differential regulation of BRD4-bound DREAM targets upon LOXL2 knockdown with DREAM targets which are not bound by BRD4. If LOXL2/BRD4 acted in a direct manner on those targets, one would expect that loss of LOXL2 affected their transcription more strongly than the other DREAM targets which are affected only indirectly. Such an analysis can be performed readily using the available data.
      • The authors state that it is surprising to find that LOXL2 can promote target gene transcription because it is rather known as a transcriptional repressor. To this point, the authors should perform standard analyses using their RNA-seq and ChIP-seq data. Compare differential expression of genes that are bound by BRD4S/L/S+L and genes not bound by BRD4. Perform motif search and enrichment analyses for transcription factor and co-factor binding data (public ChIP-seq repositories). Such analyses may suggest what gene sets are up- and downregulated by LOXL2 through BRD4S/L and what other factors could be involved in LOXL2-dependent up- and downregulation of gene transcription.

      Minor:

      • I felt that background information on the BRD4 isoforms was missing. The short and long isoforms of BRD4 should be introduced briefly.
      • Given that BRD4 inhibition is known to activate p53 (e.g., PMID 23317504 and 33431824) and p21 (PMID 31265875), the authors should discuss the p53 status of their cell lines (largely mutant). In general, I felt that the authors could better cite and discuss the current literature on BRD4 and LOXL2.
      • It was unclear to me why the authors did not actually test experimentally whether their predicted interaction models 2 or 4 are likely true (Figure 2E+G).
      • The transcription of cell cycle genes depends on the cell cycle (i.e., reduced cell cycle entry correlates with reduced cell cycle gene expression). Given that the authors showed LOXL2 inhibition reduce MDA-MB-231 cell proliferation, they should note that reduced expression of cell cycle-related genes is expected upon LOXL2 knockdown.
      • The authors specify in their discussion that their data show a function of LOXL2/BRD4 in the cell cycle interphase, while there were no experiments that support that specific conclusion. At least it is unclear to me why the authors rule out a function in mitosis?
      • I felt that the first part of the manuscript (combination of BRD4 and LOXL2 inhibitors in TNBC) was a bit uncoupled from the functional studies on LOXL2 and its connection to BRD4. The transition between these parts and the final discussion on why the joint control of cell cycle genes by LOXL2/BRD4 may be important for the synergistic effect of LOXL2/BRD4 inhibitors. To this point, the authors' model was not clear to me.

      Significance

      The study by Pascual-Reguant et al. shows that inhibitors of BRD4 and LOXL2 can be combined to achieve better efficacy in reducing proliferation of breast cancer cell lines and breast tumor growth in xenograft models. They provide strong evidence for a functional interaction between LOXL2 and BRD4 and investigate their common transcriptional targets. Intriguingly, some evidence points towards a direct regulation of the DREAM complex and its cell cycle gene targets.

      The findings are novel and can be the basis for further research on TNBC combination therapy using BRD4 and LOXL2 inhibitors. The link to the DREAM complex is preliminary.

      The study is of interest for a basic research audience with some translational aspects.

      I reviewed this manuscript as a researcher in gene regulatory mechanisms, with cell cycle genes as one focus area. I have no expertise in the computational modeling of protein-protein interactions and I am no expert for breast cancer.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript describes that simultaneous inhibition of LOXL2 and BRD4 reduces proliferation of TNBC in vitro and reduces growth in vivo. This observation is followed by extensive mechanistic studies that suggest physical interaction between LOXL2 and short isoform of BRD4-MED1. Inferences from Chip-seq analyses suggest that this interaction is involved in regulation of multiple transcriptional programs. Authors focus on differential activation of DREAM complex, to claim that this interaction "is fundamental for proliferation of TNBC". The manuscript is very well written and mechanistic inferences are based on a set of sophisticated epigenetic analyses and bioinformatical inferences. The phenotypic effects from LoxL2 inhibition by itself, or in combination with BRD4 inhibition are relatively modest. These modest effects, as well as many of the reported changes in gene expression are clearly inconsistent with the frequently used adjectives as "dramatic", "fundamental", "deeply affected", "drastically hampered" etc. Given the modest phenotypic effects, many of the key claims and conclusions are not supported by the data.

      Specifically:

      1. It is unclear why authors generalize their conclusions to TNBC. Figure 1B demonstrates synergy for 1/3 cell lines, which is chosen for the follow up study. Even for MDA231, the synergy is confined to low concentrations of BRD4i (S1c). While MDA231 cell line is frequently used in experimental studies of TNBC, it is quite dissimilar to majority of clinical TNBC, and contains mutant RAS, which is rare in this disease.
      2. In vivo, the effect appears to be modest even in the MDA231 model, selected for evidence of synergy in vitro. In vivo, the combination appears to have an additive effect. Tumor growth rates are reduced, but no shrinkage is occurring. In the PDX model, LOXL2i does not have an effect as a monotherapy, while modestly enhancing the impact of BRD4i. These results are at odds with the claim of the interaction being fundamental for proliferation.
      3. No analysis of cell proliferation was shown in vivo. Authors should have performed BrdU or KI67 staining to support the claim. For in vitro analyses, authors also used indirect assays for proliferation. PI staining by itself does not have sufficient resolution to clearly capture modest effects that authors demonstrate. BrdU-PI double staining would have been much more useful.

      Minor points:

      1. Dose dependent decrease in phosphorylated H3 is not at all obvious from eyeballing the data in S1A; the only effect that I see is a modest reduction at the highest concentration of the inhibitor. Authors need to quantify the results to support the claim.
      2. Most of breast cancer cell lines are derived from metastatic disease, including pleural effusion, thus the point that because MDA231 cell line is derived from pleural effusion, it is metastatic does not have sufficient logical foundation.
      3. How is loss of cell-cell junction in vitro consistent with LOXL2 role in modulating ECM? There is no evidence of ECM production in MDA231 in vitro. On the other hand, this loss is associated with EMT.

      Significance

      Discovery and characterization of LOXL2-BRD4 interaction is advancing the ever-deepening understanding of molecular mechanisms of regulation of gene expression. The studies and analyses appear to be sufficiently rigorous and reported with clarity, and the claimed discovery of the biological interaction between LOXL2 and BRD4 is well supported. However, given the magnitude of the reported (rather than claimed) effects of this interaction, and concerns about generalizability of authors conclusions, it is not clear how these results are promising for the development of new therapies in TNBC. Moreover, in contrast to luminal BC, there is no clear evidence for utility of cytostatic drugs in constraining TNBC. Therefore, biological and clinical significance of the authors discovery is unclear and claims in this regard appear to be overblown.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors show expression of the thyroid hormone transporter MCT8 in the human placenta. The MCT8-inhibiting compound sylchristine reduces the transfer of T4 from the maternal to the fetal side and accordingly reduces T4 degradation by DIO3. Since maternal thyroid hormones are relevant for neurodevelopment before the onset of fetal thyroid gland function, disruption of MCT8 transport, as occurs in the Allan-Herdon-Dudley syndrome, may contribute to the neurodevelopment failure present in these patients.

      The results of the transfer experiments are clear and support the authors' conclusions.

      We thank the reviewer for this positive statement.

      Minor comments:

      1. Line 71: please provide some references on the immaturity of the blood.brain barrier before 18 months. The endothelial cells may have tight junctions when the vessels sprout in the CNS. "Maturity" implies the full complement of the neurovascular unit, i.e., pericytes and astrocytes. So, please clarify this point, even if it does not contradict the experimental results showing the role of placental MCT8.

      We acknowledge that there is debate when the human blood-brain barrier is regarded as mature. Tight junctions of endothelial cells are functional from week 14 in fetal development (Saili et al, DOI: 10.1002/bdr2.1180) and reach functionality comparable with adult blood-brain barrier from 18 weeks onwards (DOI: 10.1016/j.placenta.2016.12.005). A fully functional blood-brain barrier requires interaction with a range of cells, including pericytes, astrocytes, microglia and neurons (DOI: 10.1016/j.placenta.2016.12.005), which matures during the entire pregnancy (e.g. cortical astrocytes start to appear from 30 weeks onwards).

      In our manuscript, we do not intend to overstate the relevance of our findings. Hence, in the revised manuscript, we changed the wording in lines 70-71 from ‘’mature’’ to ‘’functional’’. This avoids the discussion on when the human blood-brain barrier is mature, while conveying the message that the placental barrier is key in determining bioavailability of thyroid hormone for the fetal brain.

      It is unclear if the partial effect of sylchristine on T4 transport means that MCT8 contribution is also partial and other transporters contribute.

      We thank the reviewer for raising this point. Our in vitro data (Figure Expanded View 2) showed that at 10 µM concentration silychristin fully inhibits MCT8, agreeing with previous data by others (Johannes et al, DOI: 10.1210/en.2015-1933). As MCT8 is expressed at the apical membrane of the syncytiotrophoblasts which is in direct contact of maternal circulation, it can be inferred that T4 entering the placenta via MCT8 is fully inhibited. In our manuscript, we show that the application of silychristin on the maternal circulation leads to a 60% reduction of T4 accumulation at the fetal side, with the remaining 40% of fetal T4 corresponding to an absolute concentration of ~ 4 nM T4. Of note, we previously showed in the same placenta model that there is ~4 nM T4 endogenously present in the placenta (see Figure 3, DOI: 10.1089/thy.2022.0406). This endogenous placental T4 can be transferred to the fetal circulation; this latter process is not blocked by silychristin, which is only present in the maternal circulation. As adding silychristin results in only ~ 4 nM T4 appearing at the fetal side, equal to the endogenous concentration, it is likely that the contribution by other transporters is minimal.

      To clarify this, we have added this in the Discussion of the revised manuscript (lines 117-120).

      Lines 113-114. I missed the controls measuring TRIAC transport without sylchristine, or do the authors have strong reasons to assume that sylchristine does not affect TRIAC transport? If so, it should be stated.

      We thank the reviewer for raising this question. No human TRIAC transporters have been published and, hence, we cannot exclude the possibility that silychristin may inhibit TRIAC transport. However, we previously showed that human MCT8 does not induce TRIAC uptake (Figure 7, DOI: 10.1089/thy.2019.0009), indicating that TRIAC transport is MCT8 independent. In a previous study, we tested the specificity of silychristin for the thyroid hormone transporters expressed in human term placenta (Figure 2 in DOI: 10.1089/thy.2021.0503)). Silychristin potently inhibited MCT8 with an IC50 of 0.12 µM and at a much higher concentration (10 µM) it also inhibited OATP1A2 by ~40% but none of the other transporters. However, OATP1A2 does not transport TRIAC (unpublished data). Therefore, we feel that silychristin is unlikely to have relevant effects on other placental thyroid hormone transporters that may facilitate TRIAC transport. Hence, we did not include experiments of TRIAC transport in the absence of silychristin.

      Our aim was to provide a proof-of-concept that TRIAC is very efficiently transported across human placenta when MCT8 is inhibited. Should the reviewer insist to perform TRIAC transport in the absence of silychristin, we would be happy to do so.

      In the revised manuscript, we have included this point as a limitation in our study (lines 146-151).

      Reviewer #1 (Significance (Required)):

      This study confirms the presence of MCT8 in the human placenta and adds additional data to demonstrate its functionality with experiments using placental perfusion.

      We are pleased to see that the reviewer agrees that our data are a relevant addition to the field.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Short summary of findings:

      The authors used silychristin to selectively block the thyroid hormone transporter MCT8 in perfused human placenta. This was done to model the MCT8 deficient placenta in the rare genetic condition called Allan Herndon Dudley Syndrome. They then showed that the thyroid hormone analogue TRIAC can still cross the placenta and may be a potential treatment to prevent some of the effects of thyroid hormone deficiency in the affected fetus.

      Major comments:

      The absence of binding proteins in the maternal circulation is an issue since protein bound thyroid hormones can also be taken up by trophoblasts. Additionally, the placenta produces and secretes transthyretin and albumin into the maternal circulation - was this taken into account?

      We thank the reviewer for raising this point. We are aware of thyroid binding proteins such as transthyretin (TTR) (eg. DOI: 10.1016/j.placenta.2013.05.005; DOI: 10.1016/j.placenta.2012.01.006; DOI: 10.1210/jc.2009-0048). From such data, it has been established that the placenta secretes TTR. Also, it has been shown that the TTR-T4 complex can be internalized into Jeg3 cells. However, to our knowledge, there is no direct evidence showing that the TTR-T4 complex is transported across human placenta reaching the fetal circulation.

      As we have mentioned in the response to the comment 2 of Reviewer 1, adding silychristin results in only ~ 4 nM T4 appearing at the fetal side, equal to the endogenous concentration present in the placenta. Therefore it is likely that the contribution by other transporters or transport mechanisms such as TTR-T4 is minimal.

      A caveat to the abovementioned arguments is that our perfusion only lasted for 3 hours because longer perfusions will lead to loss of intactness of the placenta and, hence, less functionality. Therefore, it cannot be excluded that during longer exposures TTR might have a role.

      Following this reviewer’s comment, we added this as a limitation to our model in the revised manuscript (lines 151-153).

      Other studies have suggested that T4 cannot cross the placenta unless the type 3 deiodinase is blocked which differs from this study. This paper should be referenced and discussed.

      We are aware of the study by Mortimer et al (DOI: 10.1210/jcem.81.6.8964859) that showed T4 could transport across human term placenta only when D3 was blocked by iopanoic acid. In our previous study (DOI: 10.1089/thy.2022.0406), we confirmed their findings in perfusion experiments with and without iopanoic acid on maternal-to-fetal T4 transfer (Figure 1), which we discussed in the discussion of our previous publication.

      However, in that previous study we also found that transport of T4 in human term placenta is asymmetrical with fetal-to-maternal transfer being more rapid than maternal-to-fetal transfer. However, when adding albumin (BSA) to the fetal circulation (which was not done by Mortimer et al), we prevented re-uptake of T4 and were able to show fetal T4 accumulation in the absence of iopanoic acid. Therefore, we optimized the model by maintaining the physiological conditions in which the type 3 deiodinase is present.

      Following the reviewer’s suggestion, we discussed this paper between lines 144-146.

      How specific is the action of silychristin? Does it have effects on other thyroid hormone transporters that may facilitate TRIAC transport?

      In a previous study, we tested the specificity of silychristin for the thyroid hormone transporters expressed in human term placenta (Figure 2 in DOI: 10.1089/thy.2021.0503). Silychristin potently inhibited MCT8 with an IC50 of 0.12 µM and at a much higher concentration 10 µM it also inhibited OATP1A2 by ~40%. However, OATP1A2 does not transport TRIAC (unpublished data). Therefore, we feel that silychristin is unlikely to have relevant effects on other placental thyroid hormone transporters that may facilitate TRIAC transport.

      We have added discussion about this between lines 146-151.

      Although TRIAC is able to cross the placenta, it is likely that it still would not be able to cross into the fetal brain making its use somewhat limited. Additionally, it has been suggested that TRIAC exposure may also be a neurodevelopmental risk ((Barez-Lopez et al., 2016) and (Yamauchi et al. 2022 TRIAC disrupts cerebral thyroid hormone action via a negative feedback loop and heterogenous distribution among organs. BioRXiv). This should be discussed.

      We thank the reviewer for raising this important point. We would like to respectfully point out that TRIAC in different animal models for MCT8 deficiency has been able to restore abnormal brain development (DOI: 10.1210/me.2014-1135, DOI: 10.3390/ijms232415547, DOI: 10.1530/JOE-16-0323, DOI: 10.1242/dmm.027227).

      Specifically, TRIAC administration between postnatal day 1 and 12 restored T3-dependent neural differentiation in the cerebral and cerebellar cortex in Mct8/Oatp1c1 double knockout mice, which represents a relevant mouse model recapitulating the neurological phenotype in patients with MCT8 deficiency (doi: 10.1210/me.2014-1135).

      Barez-Lopez (2016) administered TRIAC to Mct8 single knock-out mice. As Oatp1c1 is a redundant T4 transporter in mice, brains of these animals are only mildly hypothyroid and do not recapitulate the severity of the phenotype seen in humans. Hence, we disagree with the conclusion of these authors as they utilized a non-optimal mouse model (DOI: 10.1210/jc.2012-3759). Yamauchi et al. (doi: 10.1016/j.isci.2023.107135) showed that TRIAC content in cerebral cortex did not increase after oral administration of TRIAC after postnatal day 21 in euthyroid and hypothyroid mice. Moreover, they utilized the same dose as T3 as comparison, while TRIAC should be dosed 10-times higher ( DOI: 10.1210/me.2014-1135). Using such a dose, it is very much understandable that TRIAC only affects the hypothalamus-pituitary-thyroid axis, but is insufficient to exert thyroid hormone action in the brain.

      We would like to emphasize that there is >70-year experience with TRIAC in humans for other conditions. Neurotoxicity has never been observed. Currently, TRIAC is being studied in high dosages in young children with MCT8 deficiency. The study protocols have been approved by different Ethics Committees as well has been discussed with regulatory authorities.

      In the revised manuscript, we have incorporated some information that there is sufficient data in different animal models showing that TRIAC is able to enter the brain.

      The data for the control group was used in a previous publication - were the data collected at the same time? Ideally, the control vs silichrystin treated placental cotyledons should be from the same placental samples.

      We agree with the reviewer that ideally the control and silychristin treated cotyledons should be matched from the same placenta. However, in practice, it is extremely difficult to realize this for many reasons. In our perfusion experiments, only ~30% of the perfusions succeeded as determined by the criteria of the quality controls (antipyrine and FITC-dextran). Moreover, using two cotyledons from one placenta is not feasible for different reasons (e.g. absence of two intact cotyledons due to damage during delivery; one cotyledon is intact during perfusion, whereas the other is not; one cotyledon has maternal antipyrine diffused to the fetal circulation whereas the other one is not. Therefore, due to practical obstacles and strict quality control criteria, it is not likely to obtain such data of these from different placentas.

      Minor comments

      T4 and TRIAC were prepared in 0.1N NaOH and silychristin in DMSO. Do NaOH or DMSO affect membrane transporters? Were vehicle controls used in the perfusion experiment?

      We dissolved T4 and TRIAC in 0.1 N NaOH and silychristin in DMSO and added them to the perfusion buffer at a 1000 times dilution. For NaOH, it is commonly used in transport assays. Such NaOH and DMSO dilutions did not affect thyroid hormone transport in COS1 cells; therefore we did not include vehicle control DMSO in perfusion experiments.

      Were the silichrystin vs control samples matched from the same placentas?

      The silychristin and control samples were not matched from the same placentas for the reasons mentioned above.

      In Figure 1C, why is there an increase in TRIAC on the maternal side between the first and second time points?

      As we sometimes observed in our perfusion experiments with other compounds, at t=0 min (the first time point), the buffer is not aerated and still heterogeneous, leading to differences in the measured concentrations of the TRIAC. Therefore we also included t=6 min (the second time point) to get a more accurate starting concentration.

      In figure 1C, TRIAC movement from maternal to fetal side in silichrystin treated placenta is shown but there is no data from untreated placenta? TRIAC transport may be reduced but we cannot tell without a control to compare it to. This should be included.

      We thank the reviewer for raising this question, which is similar to comment 3 of Reviewer 1. Hence, we would like to refer to our response to the comment 3 of Reviewer 1.

      Reviewer #2 (Significance (Required)):

      This study is interesting and adds to the current gaps in the knowledge of transplacental thyroid hormone transport. There is still very limited information around how thyroid hormones cross the placenta despite many groups working on this over the years. However, the manuscript would be more interesting if the placental transporter for TRIAC was identified. There are several clinical trials already underway looking at how TRIAC therapy may be useful in this condition and it may even be detrimental. If TRIAC can cross the placenta its use may still be problematic since others have shown that it cannot cross the into the MCT8 deficient brain from the circulation and must be delivered directly into the brain (Barez-Lopez et al., 2019). This is a small study and is fairly limited however it would be interesting to those, like me, with an interest in endocrinology, placental biology and pregnancy.

      Barez-Lopez, S., Grijota-Martinez, C., Liao, X.H., Refetoff, S. and Guadano-Ferraz, A., 2019. Intracerebroventricular administration of the thyroid hormone analog TRIAC increases its brain content in the absence of MCT8, PLoS One. 14, e0226017.

      Barez-Lopez, S., Obregon, M.J., Martinez-de-Mena, R., Bernal, J., Guadano-Ferraz, A. and Morte, B., 2016. Effect of Triiodothyroacetic Acid Treatment in Mct8 Deficiency: A Word of Caution, Thyroid. 26, 618-26.

      We are pleased to see that the reviewer agrees that our data are a relevant addition to the field. We have alluded to the discussion in the field in the revised manuscript (lines 129-132).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Short summary of findings:

      The authors used silychristin to selectively block the thyroid hormone transporter MCT8 in perfused human placenta. This was done to model the MCT8 deficient placenta in the rare genetic condition called Allan Herndon Dudley Syndrome. They then showed that the thyroid hormone analogue TRIAC can still cross the placenta and may be a potential treatment to prevent some of the effects of thyroid hormone deficiency in the affected fetus.

      Major comments:

      The absence of binding proteins in the maternal circulation is an issue since protein bound thyroid hormones can also be taken up by trophoblasts. Additionally, the placenta produces and secretes transthyretin and albumin into the maternal circulation - was this taken into account?

      Other studies have suggested that T4 cannot cross the placenta unless the type 3 deiodinase is blocked which differs from this study. This paper should be referenced and discussed.

      How specific is the action of silychristin? Does it have effects on other thyroid hormone transporters that may facilitate TRIAC transport?

      Although TRIAC is able to cross the placenta, it is likely that it still would not be able to cross into the fetal brain making its use somewhat limited. Additionally, it has been suggested that TRIAC exposure may also be a neurodevelopmental risk ((Barez-Lopez et al., 2016) and (Yamauchi et al. 2022 TRIAC disrupts cerebral thyroid hormone action via a negative feedback loop and heterogenous distribution among organs. BioRXiv). This should be discussed. The data for the control group was used in a previous publication - were the data collected at the same time? Ideally, the control vs silichrystin treated placental cotyledons should be from the same placental samples.

      Minor comments

      T4 and TRIAC were prepared in 0.1N NaOH and silychristin in DMSO. Do NaOH or DMSO affect membrane transporters? Were vehicle controls used in the perfusion experiment?

      Were the silichrystin vs control samples matched from the same placentas?

      In Figure 1C, why is there an increase in TRIAC on the maternal side between the first and second time points? In figure 1C, TRIAC movement from maternal to fetal side in silichrystin treated placenta is shown but there is no data from untreated placenta? TRIAC transport may be reduced but we cannot tell without a control to compare it to. This should be included.

      Significance

      This study is interesting and adds to the current gaps in the knowledge of transplacental thyroid hormone transport. There is still very limited information around how thyroid hormones cross the placenta despite many groups working on this over the years. However, the manuscript would be more interesting if the placental transporter for TRIAC was identified. There are several clinical trials already underway looking at how TRIAC therapy may be useful in this condition and it may even be detrimental. If TRIAC can cross the placenta its use may still be problematic since others have shown that it cannot cross the into the MCT8 deficient brain from the circulation and must be delivered directly into the brain (Barez-Lopez et al., 2019). This is a small study and is fairly limited however it would be interesting to those, like me, with an interest in endocrinology, placental biology and pregnancy.

      Barez-Lopez, S., Grijota-Martinez, C., Liao, X.H., Refetoff, S. and Guadano-Ferraz, A., 2019. Intracerebroventricular administration of the thyroid hormone analog TRIAC increases its brain content in the absence of MCT8, PLoS One. 14, e0226017.

      Barez-Lopez, S., Obregon, M.J., Martinez-de-Mena, R., Bernal, J., Guadano-Ferraz, A. and Morte, B., 2016. Effect of Triiodothyroacetic Acid Treatment in Mct8 Deficiency: A Word of Caution, Thyroid. 26, 618-26.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors show expression of the thyroid hormone transporter MCT8 in the human placenta. The MCT8-inhibiting compound sylchristine reduces the transfer of T4 from the maternal to the fetal side and accordingly reduces T4 degradation by DIO3. Since maternal thyroid hormones are relevant for neurodevelopment before the onset of fetal thyroid gland function, disruption of MCT8 transport, as occurs in the Allan-Herdon-Dudley syndrome, may contribute to the neurodevelopment failure present in these patients. The results of the transfer experiments are clear and support the authors' conclusions.

      Minor comments:

      1. Line 71: please provide some references on the immaturity of the blood.brain barrier before 18 months. The endothelial cells may have tight junctions when the vessels sprout in the CNS. "Maturity" implies the full complement of the neurovascular unit, i.e., pericytes and astrocytes. So, please clarify this point, even if it does not contradict the experimental results showing the role of placental MCT8.
      2. It is unclear if the partial effect of sylchristine on T4 transport means that MCT8 contribution is also partial and other transporters contribute.
      3. Lines 113-114. I missed the controls measuring TRIAC transport without sylchristine, or do the authors have strong reasons to assume that sylchristine does not affect TRIAC transport? If so, it should be stated.

      Significance

      This study confirms the presence of MCT8 in the human placenta and adds additional data to demonstrate its functionality with experiments using placental perfusion.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We appreciate the positive feedback from both reviewers and their critical comments, which will help us to improve the manuscript. Below, we provide a point-by-point response and how we propose to address their queries and comments.

      As the laboratory is currently undergoing a major transition, we propose essential experiments that are realistic to perform under these circumstances. We are positive that we can address all the most critical points identified by the reviewers.

      Suggestions for minor changes to the figures are already included.

      We also include responses to questions the reviewers raise.

      Reviewer #1 (Evidence, reproducibility and clarity):

      Major comments:

      1 - The authors assessed acridine orange incorporation in BECs upon LiverZap and concluded that LiverZap triggers hepatocyte-specific cell death without a bystander effect in adjacent cells (Figure 1 D-E). What happened to endothelial cells, which could also be affected either directly by ROS production in hepatocytes or indirectly by gross morphological changes in tissue organization?

      Response:

      The reviewer raises two excellent points:

      (i) Bystander effect of hepatocyte-produced ROS on endothelial cells: the cell death analysis included in the manuscript, shows that Acridine Orange staining overlaps with hepatocyte _tg(fabp10a:dsRed) expression, but not with biliary tg(tp1:H2B-mCherry) indicating cell death is specific to hepatocytes. Moreover, singlet oxygen species as produced by LiverZap have been shown to have a very short half-life and short range of action, suggesting that neighbouring cells are unlikely affected (Liang et al., 2020).

      PLAN: Investigate potential bystander effect in endothelial cells, by activating the LiverZap tool in livers expressing transgenic tg(kdrl:mcherry) marking the vascular network followed by live staining with Acridine Orange at 8 hours post illumination.

      (ii) indirect effects of morphological tissue changes on endothelial cells: studying the tissue response of the vascular network to hepatocyte ablation would be very interesting. A separate and detailed study would be required to generate meaningful data and insights into said process. This could encompass for instance the use of the transgenic endothelial _tg(kdrl:mcherry) line for LiverZap experiments and parallel those in Figure 4A-S. Thus, it seems beyond the scope of this work.

      NO CHANGES PROPOSED.

      2 - The evaluation criteria for distinguishing mCherry and cells in imaging experiments should be clearly described in the methods section. The authors should also provide some quantitative data regarding the level of correlation between the mCherry hepatocytes and the BEC-derived hepatocytes strictly defined based on the TP1-H2B-EGFP lineage tracing, as the former was used as a surrogate marker for the latter in some experiments.

      Response:

      Here, we believe the reviewer refers to the Tp1:H2B-mCherry-based lineage tracing, since the tg(tp1:egfp) line has not been used for this purpose. Similar to previous studies in the regeneration field (e.g. Choi et al., 2014; He at al., 2014), we have used histone inheritance of Tp1:H2B-mCherry for short-term lineage tracing. Tp1:H2B-mCherry-based lineage tracing was assessed on the whole organ level, for which we will describe the quantification pipeline. Tp1:H2B-mCherrylow cells were identified as BEC-derived hepatocytes after severe hepatocyte ablation, as shown in Fig. 2A,C, correlating with hepatocyte marker tg(fabp10a:GFP) expression. Tp1high and Tp1low cell numbers were quantified for 12, 24, 48 and 72 hpi and can be added as supplementary information.

      PLAN: Update the material and methods section and produce a more detailed description. This would include the following information: Whole-mounted livers of tg(tp1:H2B-mCherry) fish were stained for mCherry and imaged using an Leica SP8 confocal microscope. Image processing was carried out using the Imaris software. All mCherry-expressing cells in the liver were masked using the “spots” function, which allows quantification of signal intensity of all cells, represented by a sphere. Tp1high and Tp1low cells were identified using an automatically generated intensity threshold. Due to intensity differences with increasing imaging depth/z-position, segmented Tp1high cells were manually curated.

      To showcase the analysis strategy, we propose to include an example showing original image data, semi-automated quantification at the surface and deep tissue levels, as well as the overall Tp1:H2B-mCherry intensities for all positive cells and specifically Tp1high cells for all z-positions of an entire liver (see data figure below). This example could be included as supplementary data. Likewise, cell number quantification for Tp1high and Tp1low across regeneration can be added to Fig. S2.

      Fig. Quantification of Tp1:H2B-mCherryhigh cells. (A,B) 10 µm maximum intensity projections from whole mount stained tg(LiverZap);tg(tp1:H2B-mCherry) livers at 48 hpi: at the surface (A-A’) and deep in the liver (B-B’). Tp1high cells are identified by fluorescence intensity of segmented nuclei, outlined in yellow (A’ and B’). Graphs showing distribution of all Tp1:H2B-mCherry nuclei (C) and Tp1high nuclei (D) by fluorescence intensity and z-position (C). The intensity of all mCherry+ nuclei decreases with increasing z-position (C-D). The dotted line outlines the liver in A-B’.

      3 - OPTIONAL: In the locally restricted ablation model, do hepatocytes located adjacent to the ROI proliferate and/or contribute to the regeneration of the injured region?

      Response:

      An important consideration, as highlighted by the reviewer, is whether neighbouring hepatocytes also contribute to regeneration following ROI ablation.

      PLAN: To address this point, LiverZap ROI ablation will be followed by cell proliferation analysis using an EdU incorporation assay at 24 and 72 hpi. These time points are selected based on the proliferation results following global LiverZap ablation; see Fig. 2D-F. The experiment will be performed in a tg(tp1:H2B-mcherry); _tg(fabp10a:gfp)_background to distinguish proliferating GFP-positive hepatocytes, which are H2B-mCherry-negative, from LPC-derived hepatocytes that have inherited H2B-mCherry (Tp1low). The resulting insights may help to refine hypotheses regarding the process(es) stimulating the formation of new hepatocytes adjacent to the ablated region.

      4 - OPTIONAL: Figure 4, A-S. It should be of significant interest if the authors could also analyze the BEC dynamics using the locally restricted hepatocyte ablation model, comparing those in the injured region (ROI) and the outside of the ROI.

      Response:

      We agree with the reviewer that this is the exciting next question, as it likely would provide insights into the cellular mechanism by which the biliary network is de- and re-constructed, as well as the mechanism by which BECs outside the ROI may initiate the LPC response to give rise to hepatocytes in a semi-systemic response. For this, the experimental set-up introduced in Fig.4J-P, in which BECs in the ROI are distinguished from adjacent ones by photoconversion, would be followed by extended live light-sheet microscopy of the regenerating liver. Due to the complexity, extent of the experiments and current unavailability of a light-sheet microscope, we would address this optional comment in future investigations.

      NO CHANGES PROPOSED.

      5A- Figure 4, T-V'. The data shown here for the changes in E-cadherin distribution is difficult to understand and interpret. The authors should provide magnified images and better description on how to distinguish the membranous (spotted signals?) and intracellular localization. Quantitative assessment should certainly be a plus, if possible.

      Response:

      We appreciate that it may be difficult to recognize the changes in E-Cadherin localisation, in particular at BEC membranes, given that there are intracellular puncta, and that E-Cadherin is expressed both in BECs and hepatocytes. We are convinced of the related data described in Figures 4 and S4, because the first experiment allowed quantification of the staining using both Tp1:H2B-mCherry to identify BECs and intestinal E-Cadherin for normalisation, which revealed a 51% E-Cadherin reduction at BEC cell membranes following injury. Unfortunately, the signal-to-noise ratio declined in consecutive experiments precluding further quantification although we could still observe a change in localisation. We tested alternative antibodies against E-Cadherin as well as optimized staining protocols, yet without success.

      5B - OPTIONAL: In relation to the above point, it is this reviewer's candid impression that the very last part regarding the possible role of E-cadherin dynamics in regulating the biliary network remodeling is still preliminary compared to the remaining parts, thereby rather depreciating the value of the entire manuscript. Perhaps this part could be published separately, together with more functional evidence regarding the causal relationship between them (e.g., showing the effect of Ecadherin knockdown in hepatocytes on the biliary remodeling and the induction of the BECdependent regeneration program)

      Response:

      PLAN: Following this reviewer’s and reviewer 2’s comments and suggestions, we agree to remove the data on E-Cadherin. Loss of adhesion as a mechanism for adopting an LPC-state remains very exciting, future investigations with novel tools to monitor and modulate E‑Cadherin expression in BECs would thus be needed.

      6 - Do zebrafish livers possess lobular structures with the portal-to-central vein axis and the metabolic zonation as typically observed in mammalian livers? As has been described in the manuscript, the "localized" injury patters in the mammalian livers usually occur at the sub-lobular structure levels (i.e., peri-portal region-restricted vs. peri-central region-restricted). Although the "localized" injury model described in this study using the zebrafish livers was indeed localized from the viewpoint of the entire organ (or the lobe), it still seemed much more "global" when considering those situations in the mammalian livers, so that the authors' claim that the former recapitulating the latter might be too exaggerated and somehow misleading. The authors should clarify and discuss this point in the manuscript.

      Response:

      The reviewer raises an important point, and it seems that our wording might not have been clear. In mammals, boundaries between injured and healthy tissue arise, because liver injuries frequently occur at the sub-lobular level. Although zebrafish livers are composed of metabolically diverse hepatocytes, a spatial arrangement comparable to mammalian zonation has so far not been identified (Morrison et al. 2022; Oderberg and Goessling, 2023). Yet, the liver lobes in the adult zebrafish have a central vein and periportal veins at the periphery of the organ, similar to the mammalian lobular organisation (Ota et al. 2022). Therefore, the scale of injury in the mammalian setting and the ROI-ablation model introduced in the current work differs. It, nevertheless, creates boundaries of healthy and injured liver tissue relevant for uncovering dynamic cellular processes mediating tissue repair in chronic liver disease. Importantly, with its suitability for advanced live imaging and optogenetic methods (e.g. photoconversion), LiverZap, complements mammalian models, in which this is still challenging. This offers therefore the powerful opportunity to employ LiverZap to screen for dynamic repair behaviours, which subsequently can be validated in a target approach in mammalian injury models.

      PLAN: To describe the relevance of our ROI ablation paradigm for elucidating repair processes at the interface of injured and healthy tissue more precisely. We will further edit and clarify text to place the ROI ablation into the context of hepatic injuries at the sub-lobular level throughout the mammalian liver.

      Minor comments:

      7 - Figure 4. Panels D and G should correspond to the same one image and the way of labeling be changed (as in Figure 1G). Likewise, in panel J, the bars shown separately as "M" and "S" at 12 dpi should correspond to the same data, so that they should be unified as one bar.

      Response:

      Thank you for pointing this out, this is changed in the updated figures; panels Fig. 4D and I.

      8 - Figure S3L. How was the ROI border defined? Perhaps the shape of the ROI should change significantly during regeneration due to dynamic tissue remodeling processes, thereby moving the position of the border as well.

      Response:

      The ROI border was defined as the interface between photoconverted and non-converted BECs. We concur with the reviewer’s notion that cell movement and rearrangement may occur during the regeneration process (see Fig. 4A-J), and the initially straight ROI border could consequently change during the regeneration process. Nevertheless, the border between photoconverted and non-converted BECs persists, serving as a landmark for the measurements shown in Figure S3L.

      Fig.: Quantification strategy for determining the region exhibiting an LPC-response outside the ROI ablation region. The dashed line of the ROI indicates morphogenetic changes of the interface between photoconverted and nonconverted cells over time due to repair-related cell rearrangement.

      PLAN: In the revised manuscript, we propose to include the below schematic as panel J to Figure S3. Moreover, we also suggest to change the solid line of the squares indicating the ROI area in figure panels 3C,G,O,P and S3D,H,K into a dashed line at the interface between photoconverted and non-converted tissue (see below figure as an example).

      9 - The authors should comment in the manuscript as to whether the system can be applicable for induction of more restricted areas (e.g., at a single hepatocyte level; in particular metabolic zones, if existing), as well as for ablation of other hepatic cell types such as BECs and endothelial cells.

      Response:

      Indeed, the optogenetic nature of the LiverZap system allows to induce hepatocyte death at the single cell level, as well as any defined region of interest that can be generated by the light source (e.g. confocal microscope software).

      Likewise, the FAP-TAP system can be easily applied to BECs or endothelial cells, or any cell type for which a specific promoter has been identified to drive the genetic FAP component fluorogen-activating protein dL5**.

      Response:

      PLAN: Both points will be included in the discussion section of the manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity):

      MAJOR COMMENTS:

      1 - The LiverZap is an elegant new tool to induce localized ablation of hepatocytes. It is not as claimed by the authors a real breakthrough: (1) While localized ablation is nice compared to NTR-MTZ model in zebrafish, mice model such as CCl4 chronic injury can also study the interaction between healthy and injured tissue. (2) Although not using MTZ, the system still requires injection or exposure to malachite green derivate dye MG-2I. A few searches suggest that this compound could induce toxicity. Can the authors study and compare the toxicity of malachite green derivate dye MG-2I to the toxicity of MTZ? This is important as this would be indeed a strong argument in favor of the presented tool.

      Response:

      Point 1 – studying interactions between healthy and injured liver tissue: The reviewer is of course correct that interactions between healthy and injured tissue can also be studied in the mouse. However, ROI ablation with the LiverZap system can be combined with live imaging, thereby enabling the observation of cellular responses of the same sample over time, at a resolution currently difficult to achieve in mammals. Moreover, the possibility to induce cell death in a defined ROI, also allows to simultaneously employ other genetic tools, including cell-type specific lineage tracing by photoconversion, which is difficult to achieve in mammalian systems. The finding that BECs beyond the ROI of hepatocyte ablation produce new hepatocytes by a LPC response, illustrates the power of this approach. The optogenetic LiverZap ablation system would therefore complement existing mammalian and zebrafish liver regeneration models.

      PLAN: to include a more detailed discussion of this point and the complementary knowledge that can be gained in the discussion section.

      Point 2 – MG-2I toxicity__: Indeed, as described in the manuscript, the FAP-TAP system, underlying LiverZap hepatocyte ablation, requires MG-2I incubation for the formation of the photosensitiser. Compared to the NTR/MTZ system, incubation with MG2I is short, requiring <3 hours in contrast to more than 24hours MTZ incubation. The system, including MG-2I has also been employed in cells, as well as in the zebrafish heart and nervous system without reported adverse effects (He et al., 2016; Xie et al., 2020). Consistently, we have not observed any apparent adverse effects between 0-72 hpi following 3-18 hour MG-2I incubation (unpublished). Nevertheless, toxicity studies evaluating survival upon MG-2I incubation have not yet been carried out and may be required for comparison with MTZ.

      PLAN: To perform toxicity studies for MG-2I, similar to those previously performed for MTZ (e.g. Mathias et al. 2014), in which larval survival after 3, 24 and 48 hour MG-2I exposure starting at 4 dpf will be assessed daily until 8 days post fertilisation.

      2 -The term ablation is choose because it is anticipated that it induces heaptospecific death. However, the consequences of cell death is not shown. In particular, the inflammatory immune response is not shown nor discussed.

      Response:

      The reviewer raises an interesting point, namely the inflammatory immune response, which is not the focus of this manuscript. Acridine Orange- and TUNEL-positive cells during the ablation process indicate that the reactive oxygen species produced by the FAP-TAP system cause hepatocyte apoptosis. We predict that this would recruit and be cleared by macrophages with little or no inflammatory response, like findings for the NTR-MTZ system (Stoddard et al., 2019). However, the role of neutrophils is unclear due to a possible direct effect of MTZ on this cell type.

      PLAN: We will include this point in the discussion.

      Future in-depth live imaging of transgenic reporters will be required for detailed studies of macrophage and neutrophil recruitment and their role in efferocytosis, including transcriptome analysis of specific gene signatures to detect an inflammatory response.

      3 - The difference between mild and severe ablation is hard to grasp. Can the authors explain more clearly the differences between mild and severe: what are the criteria as there is no difference in liver volume between mild and severe ablation? How do you achieve mild or severe ablation? It appears that the severity of the ablation is judged a posteriori and not decided per the experiment.

      Response:

      Concerning the first point, there must be a misunderstanding. Mild and severe hepatocyte ablation result in clearly different liver sizes, for instance at 30 hpi, the end of ablation, liver volumes are reduced by 23 % for mild or 64 % for severe cases (Fig. 1Q). This is supported by representative image data in Figs. 1F-P and S1A-C. Nevertheless, for consistency, we had represented the 12 hpi volume data as the same two data bars, although we cannot distinguish them yet at that timepoint of the experiment, as shown by images in Fig. 1F-G.

      PLAN: Adjust Fig.1Q and represent the 12 hpi liver volume data as a shared graph for mild/severe ablation, see included figure 1Q. We propose to similarly represent all 12 hpi quantifications, as represented in Figs. S1F, 2D-F and S2A.

      For the second point, the reviewer is correct that ablation severity is evaluated and determined between 24-30 hpi, at the end of hepatocyte ablation, given there is some variability in the response. Nonetheless, both length of 660nm illumination and oxygen availability can be used to shift the proportion of mild and severe ablation, depending on the desired outcome (Figs. 1Q, S1G-H).

      NO CHANGES PLANNED.

      4 - The work supports that biliary-driven regeneration also occurs when hepatocyte ablation happens in a small area of interest. Our knowledge is that you need a large defect in hepatocyte or a chronic liver injury ro activate the BDC-driven auxiliary process for regeneration. Could this be a specificity of the fish model?

      Response:

      Like the reviewer, our understanding is that severe hepatocyte loss, senescence or chronic liver injury activate BEC-derived regeneration in mammals and in zebrafish. All these cases are characterised by substantial reduction of local hepatocyte density or loss of function (in senescence). Given the overall hepatocyte loss is only 10-20% in the ROI model, the induction of the local LPC response was very surprising, on the other hand it corresponds to a near complete local hepatocyte depletion. The hepatobiliary architecture in zebrafish is similar to that of the mammalian ductular reaction, an adaptation of the biliary network to severe hepatocyte loss. In both cases, the majority of hepatocytes connect directly via their apical canaliculi to biliary ductules to ensure physiologic transport of hepatocyte products, often preceding the LPC response (Sato et al., 2019; Caviglia et al., 2022). Therefore, we propose that the LPC response following ROI hepatocyte ablation is not specific to the zebrafish model, but a common mechanism elicited across species and related to the severity of the injury and the configuration of the hepatobiliary network at the time of injury, such as the ductular reaction.

      PLAN: To edit the text and discuss this point clearly.

      5 - Pathways revealed to control liver regeneration or BEC-driven regeneration in fish have not be found to have a similar drastic predominance in rodents. This mitigate perhaps the use of fish for this type of research?

      Response:

      On the contrary, zebrafish has been established and validated as a model to investigate and elucidate developmental hepatic programs as well as regeneration (Goessling and Sadler, 2015; Wang et al., 2017). However, we acknowledge that more comparative studies are needed to understand the molecular pathways driving regeneration both in zebrafish and mammals and their similarity.

      Specifically, zebrafish and mammals display high conservation in the parenchymal and non-parenchymal cell types of the liver as well as their developmental programs (Goessling and Sadler, 2015; Wang et al., 2017). Using different injury paradigms in zebrafish, including ethanol, acetaminophen toxicity and the pharmacogenetic NTR-MTZ model, it has been shown that cellular responses to liver injury are also remarkably conserved with mammals where hepatocyte proliferation governs repair after mild injury while severe injury repair is driven by conversion of BECs into LPCs (So, et al., 2020; Forbes and Newsome, 2019). Major pathways, such as Wnt, FGF and BMP signaling show conserved functions in restorative hepatocyte proliferation (Goessling et al., 2008; Kan et al. 2009, Böhm et al 2010). At present, only very little is known about the molecular mechanisms controlling the BEC/LPC to hepatocyte conversion particularly in rodent models (Kim et al., 2023), while a number of zebrafish studies have started to elucidate the signals governing the different steps of this process (Kim et al., 2023), due to the relative ease of using the larval zebrafish model for this work. Notably, the Notch pathway plays multiple roles in both mouse and zebrafish LPC-mediated repair (Minnis-Lyons et al., 2021; Huang et al 2014; Russel et al.,2019), however further work will be necessary to determine the detailed corresponding functions. Therefore, future work in both rodents and zebrafish will be essential to uncover the molecular mechanisms of this repair process relevant for chronic injury. Given the large conservation of developmental and repair mechanism between mammals and zebrafish observed so far, it is highly likely that this will also apply to LPC-mediated repair. Studies promise to uncover even greater similarity between zebrafish and human (e.g. Fang et al 2011), underscoring the power of using complementary vertebrate models.

      PLAN: To edit the text in the introduction and discussion to clarify and highlight the similarities, differences, and opportunities the zebrafish model offers for understanding the mechanisms of vertebrate liver regeneration in general and in particular by using the LiverZap system.

      6 - The authors show that in the case of mild ablation, hepatocytes are responsible for replenishment of the parenchyma, but in the context of severe ablation, LPC-mediated regeneration takes control. However, when the authors perform localized and controlled ablation, which is small (around 10-20%) and, to my understanding, a mild / local ablation, however the authors show that LPC mediates the regeneration. Can the authors explain the discrepancy between their results?

      Response:

      We agree with the reviewer that the LPC response in the smaller, local ROI ablation was unexpected. However, it could be explained by the following: while such ROI hepatocyte ablation represents only a 10-20% ablation of the total hepatocyte population, by sheer numbers comparable to a mild global ablation, the near-complete local hepatocyte loss however makes it more similar to a severe or chronic global injury. Notably, the zebrafish hepatobiliary architecture in zebrafish is similar to that of the mammalian ductular reaction, an adaptation of the biliary network to severe hepatocyte loss. In both cases, the majority of hepatocytes connect directly via their apical canaliculi to biliary ductules to ensure physiologic transport of hepatocyte products, often preceding the LPC response (Sato et al., 2019; Caviglia et al., 2022). We hypothesize that if a similar local, near complete hepatocyte loss would be induced in a mammalian liver exhibiting a ductular reaction, it would similarly induce local LPC-mediated repair. Since this is, to our knowledge not possible, the LiverZap model represents a unique opportunity to induce the LPC-response in a controlled manner and in addition investigate the underlying cellular and molecular processes of injured and adjacent healthy tissues at high resolution in an in vivo context.

      PLAN: We will edit the discussion to clarify this important point.

      7 - The last part of the paper about E-Cadherin expression is not convincing. I am not sure about the quality of the IF stainings of E-Cadherin, and it is not helping proving the point of the authors. Can the authors provide better stainings for this figure?

      Response:

      (Same response as to point 5A+B of reviewer 1). We appreciate that it may be difficult to recognize the changes in E-Cadherin localisation, in particular at BEC membranes, given that there are intracellular puncta and that E-Cadherin is expressed both in BECs and hepatocytes. We are convinced of the related data described in Figures 4 and S4, because the first experiment allowed quantification of the staining using both Tp1:H2B-mCherry to identify BECs and intestinal E-Cadherin for normalisation, which revealed a 51 % E-Cadherin reduction at BEC cell membranes following injury. Unfortunately, the signal to noise ratio declined in consecutive experiments, while we could still observe a change in localisation, it challenged a meaningful quantification. We tested alternative antibodies against E-Cadherin, yet without success.

      PLAN: Following both reviewers’ comments and suggestions, we agree to remove the data on E-Cadherin.

      8 - Could the authors provide a bit more information on the live imaging. Exactly how do they achieve imaging for such a long time?

      Response:

      Thank you for pointing this out, the information was not very detailed. We used relatively standard mounting conditions (low-melting point agarose and Tricaine anaesthesia, see below for details), combined with light-sheet microscopy, which was the key to achieving the long imaging. We believe that in addition to the known gentle imaging condition, the mounting set-up is critical as the fish is completely suspended in a very low-percentage, low melting point agarose within a large volume of embryo medium.

      PLAN: Update the material and methods section with the following details: Long-term live imaging was performed using a LS1 Live light sheet microscopy system (Viventis Microscopy Sàrl). Larvae were with anesthetized with 0.4% Tricaine and mounted ventrally in 0.8% low melting point agarose in E3/PTU media supplemented with 0.16% tricaine. Once the agarose solidified, the chamber was filled with E3/PTU with 0.16% Tricaine to maintain anaesthesia. A 25X objective was used and acquisition was performed every 20 minutes.

      MINOR COMMENTS:

      9 - It is hard to imagine the full-size liver in Figure 1, bad contrast. Can the authors manually delineate it?

      Response:

      The livers in this figure are now outlined in the updated figures, see new Figure 1.

      10 - "This finding is very surprising, since current understanding in the field links the generation of new hepatocytes from BECs/LPCs with global hepatocyte death." This statement lacks references.

      Response:

      PLAN: To add the following primary references to the above sentence: (Choi et al., 2014; He et al., 2014; Manco et al., 2019; Raven et al., 2017) and recent review (Kim et al_._, 2023).

      REFERENCES

      Böhm F, Köhler UA, Speicher T, Werner S. Regulation of liver regeneration by growth factors and cytokines. EMBO Mol Med. 2010 doi:10.1002/emmm.201000085.

      Caviglia S, Unterweger IA, Gasiūnaitė A, Vanoosthuyse AE, Cutrale F, Trinh LA, Fraser SE, Neuhauss SCF, Ober EA. Fraeppli: a multispectral imaging toolbox for cell tracing and dense tissue analysis in zebrafish. Development. 2022 doi:10.1242/dev.199615.

      Choi TY, Ninov N, Stainier DY, Shin D. Extensive conversion of hepatic biliary epithelial cells to hepatocytes after near total loss of hepatocytes in zebrafish. Gastroenterology. 2014 doi:10.1053/j.gastro.2013.10.019.

      Fang L, Green SR, Baek JS, Lee SH, Ellett F, Deer E, Lieschke GJ, Witztum JL, Tsimikas S, Miller YI. In vivo visualization and attenuation of oxidized lipid accumulation in hypercholesterolemic zebrafish. J Clin Invest. 2011 doi: 10.1172/JCI57755.

      Forbes SJ, Newsome PN. Liver regeneration - mechanisms and models to clinical application. Nat Rev Gastroenterol Hepatol. 2016 doi:10.1038/nrgastro.2016.97

      Goessling W, North TE, Lord AM, Ceol C, Lee S, Weidinger G, Bourque C, Strijbosch R, Haramis AP, Puder M, Clevers H, Moon RT, Zon LI. APC mutant zebrafish uncover a changing temporal requirement for wnt signaling in liver development. Dev Biol. 2008 doi:10.1016/j.ydbio.2008.05.526.

      Goessling W, Sadler KC. Zebrafish: an important tool for liver disease research. Gastroenterology. 2015 doi:10.1053/j.gastro.2015.08.034.

      He J, Lu H, Zou Q, Luo L. Regeneration of liver after extreme hepatocyte loss occurs mainly via biliary transdifferentiation in zebrafish. Gastroenterology. 2014 doi:10.1053/j.gastro.2013.11.045.

      He J, Wang Y, Missinato MA, Onuoha E, Perkins LA, Watkins SC, St Croix CM, Tsang M, Bruchez MP. A genetically targetable near-infrared photosensitizer. NatMethods. 2016 doi:10.1038/nmeth.3735.

      Huang M, Chang A, Choi M, Zhou D, Anania FA, Shin CH. Antagonistic interaction between Wnt and Notch activity modulates the regenerative capacity of a zebrafish fibrotic liver model. Hepatology. 2014 doi:10.1002/hep.27285.

      Kan NG, Junghans D, Izpisua Belmonte JC. Compensatory growth mechanisms regulated by BMP and FGF signaling mediate liver regeneration in zebrafish after partial hepatectomy. FASEB J. 2009 doi:10.1096/fj.09-131730.

      Kim M, Rizvi F, Shin D, Gouon-Evans V. Update on Hepatobiliary Plasticity. Semin Liver Dis. 2023 doi: 10.1055/s-0042-1760306.

      Liang P, Kolodieznyi D, Creeger Y, Ballou B, Bruchez MP. Subcellular Singlet Oxygen and Cell Death: Location Matters. Front Chem. 2020. doi:10.3389/fchem.2020.592941.

      Manco R, Clerbaux LA, Verhulst S, Bou Nader M, Sempoux C, Ambroise J, Bearzatto B, Gala JL, Horsmans Y, van Grunsven L, Desdouets C, Leclercq I. Reactive cholangiocytes differentiate into proliferative hepatocytes with efficient DNA repair in mice with chronic liver injury. J Hepatol. 2019

      doi: 10.1016/j.jhep.2019.02.003.

      Mathias JR, Zhang Z, Saxena MT, Mumm JS. Enhanced cell-specific ablation in zebrafish using a triple mutant of Escherichia coli nitroreductase. Zebrafish. 2014 doi: 10.1089/zeb.2013.0937.

      Minnis-Lyons SE, Ferreira-González S, Aleksieva N, Man TY, Gadd VL, Williams MJ, Guest RV, Lu WY, Dwyer BJ, Jamieson T, Nixon C, Van Hul N, Lemaigre FP, McCafferty J, Leclercq IA, Sansom OJ, Boulter L, Forbes SJ. Notch-IGF1 signaling during liver regeneration drives biliary epithelial cell expansion and inhibits hepatocyte differentiation. Sci Signal. 2021 doi:10.1126/scisignal.aay9185.

      Morrison JK, DeRossi C, Alter IL, Nayar S, Giri M, Zhang C, Cho JH, Chu J. (2022) Single-cell transcriptomics reveals conserved cell identities and fibrogenic phenotypes in zebrafish and human liver. Hepatol Commun. doi: 10.1002/hep4.1930.

      Oderberg IM, Goessling W. (2023) Biliary epithelial cells are facultative liver stem cells during liver regeneration in adult zebrafish. JCI Insight. doi: 10.1172/jci.insight.163929.

      Ota N, Shiojiri N. Comparative study on a novel lobule structure of the zebrafish liver and that of the mammalian liver. Cell Tissue Res. 2022 doi:10.1007/s00441-022-03607-y.

      Raven A, Lu WY, Man TY, Ferreira-Gonzalez S, O'Duibhir E, Dwyer BJ, Thomson JP, Meehan RR, Bogorad R, Koteliansky V, Kotelevtsev Y, Ffrench-Constant C, Boulter L, Forbes SJ. Cholangiocytes act as facultative liver stem cells during impaired hepatocyte regeneration. Nature. 2017 doi:10.1038/nature23015.

      Russell JO, Ko S, Monga SP, Shin D. Notch Inhibition Promotes Differentiation of Liver Progenitor Cells into Hepatocytes via _sox9b_Repression in Zebrafish. Stem Cells Int. 2019 doi:10.1155/2019/8451282.

      Sato K, Marzioni M, Meng F, Francis H, Glaser S, Alpini G. Ductular Reaction in Liver Diseases: Pathological Mechanisms and Translational Significances. Hepatology. 2019 doi: 10.1002/hep.30150.

      So J, Kim A, Lee SH, Shin D. Liver progenitor cell-driven liver regeneration. Exp Mol Med. 2020 doi: 10.1038/s12276-020-0483-0.

      Stoddard M, Huang C, Enyedi B, Niethammer P. Live imaging of leukocyte recruitment in a zebrafish model of chemical liver injury. Sci Rep. 2019 doi: 10.1038/s41598-018-36771-9.

      Wang S, Miller SR, Ober EA, Sadler KC. Making It New Again: Insight Into Liver Development, Regeneration, and Disease From Zebrafish Research. Curr Top Dev Biol. 2017 doi: 10.1016/bs.ctdb.2016.11.012.

      Xie W, Jiao B, Bai Q, Ilin VA, Sun M, Burton CE, Kolodieznyi D, Calderon MJ, Stolz DB, Opresko PL, St Croix CM, Watkins S, Van Houten B, Bruchez MP, Burton EA. Chemoptogenetic ablation of neuronal mitochondria in vivo with spatiotemporal precision and controllable severity. Elife. 2020 doi: 10.7554/eLife.51845.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, the authors present a new chemoptogenetic tool, LiverZap, to study regeneration in zebrafish. In combination with LiverZap, the authors use live imaging to study longitudinally LPC-mediated regeneration and the implication of the healthy surrounding tissue.

      Major comments

      The LiverZap is an elegant new tool to induce localized ablation of hepatocytes. It is not as claimed by the authors a real breakthrough: (1) While localized ablation is nice compared to NTR-MTZ model in zebrafish, mice model such as CCl4 chronic injury can also study the interaction between healthy and injured tissue. (2) Although not using MTZ, the system still requires injection or exposure to malachite green derivate dye MG-2I. A few searches suggest that this compound could induce toxicity. Can the authors study and compare the toxicity of malachite green derivate dye MG-2I to the toxicity of MTZ? This is important as this would be indeed a strong argument in favor of the presented tool.<br /> The term ablation is choose because it is anticipated that it induces heaptospecific death. However, the consequences of cell death is not shown. In particular, the inflammatory immune response is not shown nor discussed.<br /> - The difference between mild and severe ablation is hard to grasp. Can the authors explain more clearly the differences between mild and severe: what are the criteria as there is no difference in liver volume between mild and severe ablation? How do you achieve mild or severe ablation? It appears that the severity of the ablation is judged a posteriori and not decided per the experiment.<br /> - The work supports that biliary-driven regeneration also occurs when hepatocyte ablation happens in a small area of interest. Our knowledge is that you need a large defect in hepatocyte or a chronic liver injury ro activate the BDC-driven auxiliary process for regeneration. Could this be a specificity of the fish model?<br /> - Pathways revealed to control liver regeneration or BEC-driven regeneration in fish have not be found to have a similar drastic predominance in rodents. This mitigate perhaps the use of fish for this type of research?

      The authors show that in the case of mild ablation, hepatocytes are responsible for replenishment of the parenchyma, but in the context of severe ablation, LPC-mediated regeneration takes control.<br /> However, when the authors perform localized and controlled ablation, which is small (around 10-20%) and, to my understanding, a mild / local ablation, however the authors show that LPC mediates the regeneration. Can the authors explain the discrepancy between their results?<br /> The last part of the paper about E-Cadherin expression is not convincing. I am not sure about the quality of the IF stainings of E-Cadherin, and it is not helping proving the point of the authors. Can the authors provide better stainings for this figure?<br /> Could the authors provide a bit more information on the live imaging. Exactly how do they achieve imaging for such a long time?

      Minor comments

      It is hard to imagine the full-size liver in Figure 1, bad contrast. Can the authors manually delineate it?<br /> "This finding is very surprising, since current understanding in the field links the generation of new hepatocytes from BECs/LPCs with global hepatocyte death." This statement lacks references.

      Significance

      General assessment:

      Elegant model to ablate hepatocyte in a clean fashion and study regeneration when coupled to imaging technique.<br /> The work supports that biliary-driven regeneration also occurs when hepatocyte ablation happens in a small area of interest. This seems a new concept, the operation of the process needs to be ascertain in other models including humans.<br /> Immune/inflammatory response to the ablation as well as the way it may influence/drive or dictate a regenerative response is not investigated

      Advance:

      The advance pertains to the model because rodents offer ample possibilities to study interaction between 'intact' and 'diseased' cells. Of course the model is attractive as it is rapid, allows for 'real time' in vivo imaging, ..;

      The audience will be a specialised audience (basic research in liver regeneration, in zebrafish technologies, ...)

      Expertise

      I'm a hepatologist, devoted for the last 15 years to the experimental study of the pathophysiology of liver diseases using animal models, cell cultures models and organoids. I 'm not an expert in zebra fish. I have a large interest in regeneration and in particular I produced pioneer work in BEC-driven regeneration that is studied here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Ambrosio et al. describes a novel experimental system named "LiverZap", where spatiotemporally-regulated hepatocyte ablation can be achieved in the liver of living zebrafish for the study of the tissue/organ regeneration processes and mechanisms, particularly by in vivo live imaging. Specifically, the authors made use of a binary FAP-TAP system to achieve reactive oxygen species (ROS)-induced hepatocyte cell death upon treatment with the drug MG-2I followed by near-infrared (NIR) illumination in transgenic fish. They found that the system resulted in two categories of hepatocyte ablation, i.e., mild ablation and severe ablation. In the livers where the severe ablation was induced throughout the organ, biliary epithelial cell (BEC)-derived hepatocyte regeneration program was provoked as demonstrated by short-term lineage tracing experiments based on histone inheritance, which is quite consistent with previous studies using different methods of hepatocyte ablation in zebrafish and mice. Taking advantage of the spatial controllability of the LiverZap system, the authors further demonstrated that spatially-restricted severe hepatocyte ablation was sufficient to induce the BEC-dependent regeneration program therein. Interestingly, BECs outside the targeted region also contributed to the local hepatocyte regeneration, as revealed by using a sophisticated photoconvertible BEC labeling system. Finally, a dynamic nature of BEC aggregation and re-distribution upon liver injury was demonstrated to occur in advance of hepatocyte regeneration, reminiscent of the so-called ductular reaction in the mammalian liver. Overall, the authors' claims and the conclusions are well supported by the data presented in the manuscript, except for a few points as listed below.

      Major comments:

      • The authors assessed acridine orange incorporation in BECs upon LiverZap and concluded that LiverZap triggers hepatocyte-specific cell death without a bystander effect in adjacent cells (Figure 1 D-E). What happened to endothelial cells, which could also be affected either directly by ROS production in hepatocytes or indirectly by gross morphological changes in tissue organization?
      • The evaluation criteria for distinguishing mCherry <high> and <low> cells in imaging experiments should be clearly described in the methods section. The authors should also provide some quantitative data regarding the level of correlation between the mCherry<low> hepatocytes and the BEC-derived hepatocytes strictly defined based on the TP1-H2B-EGFP lineage tracing, as the former was used as a surrogate marker for the latter in some experiments.
      • OPTIONAL: In the locally restricted ablation model, do hepatocytes located adjacent to the ROI proliferate and/or contribute to the regeneration of the injured region?
      • OPTIONAL: Figure 4, A-S. It should be of significant interest if the authors could also analyze the BEC dynamics using the locally restricted hepatocyte ablation model, comparing those in the injured region (ROI) and the outside of the ROI.
      • Figure 4, T-V'. The data shown here for the changes in E-cadherin distribution is difficult to understand and interpret. The authors should provide magnified images and better description on how to distinguish the membranous (spotted signals?) and intracellular localization. Quantitative assessment should certainly be a plus, if possible.
      • OPTIONAL: In relation to the above point, it is this reviewer's candid impression that the very last part regarding the possible role of E-cadherin dynamics in regulating the biliary network remodeling is still preliminary compared to the remaining parts, thereby rather depreciating the value of the entire manuscript. Perhaps this part could be published separately, together with more functional evidence regarding the causal relationship between them (e.g., showing the effect of E-cadherin knockdown in hepatocytes on the biliary remodeling and the induction of the BEC-dependent regeneration program)
      • Do zebrafish livers possess lobular structures with the portal-to-central vein axis and the metabolic zonation as typically observed in mammalian livers? As has been described in the manuscript, the "localized" injury patters in the mammalian livers usually occur at the sub-lobular structure levels (i.e., peri-portal region-restricted vs. peri-central region-restricted). Although the "localized" injury model described in this study using the zebrafish livers was indeed localized from the viewpoint of the entire organ (or the lobe), it still seemed much more "global" when considering those situations in the mammalian livers, so that the authors' claim that the former recapitulating the latter might be too exaggerated and somehow misleading. The authors should clarify and discuss this point in the manuscript.

      Minor comments:

      • Figure 4. Panels D and G should correspond to the same one image and the way of labeling be changed (as in Figure 1G). Likewise, in panel J, the bars shown separately as "M" and "S" at 12 dpi should correspond to the same data, so that they should be unified as one bar.
      • Figure S3L. How was the ROI border defined? Perhaps the shape of the ROI should change significantly during regeneration due to dynamic tissue remodeling processes, thereby moving the position of the border as well.
      • The authors should comment in the manuscript as to whether the system can be applicable for induction of more restricted areas (e.g., at a single hepatocyte level; in particular metabolic zones, if existing), as well as for ablation of other hepatic cell types such as BECs and endothelial cells.

      Significance

      The newly developed LiverZap system described in the present study was well designed and has multifaceted advantages compared to other "global ablation systems" that have so far been used in this research field. Indeed, the authors' original finding that the localized hepatocyte ablation provokes activation of BECs outside the injured region and their contribution to hepatocyte renewal, could have never been obtained using previous models. This finding is of considerable novelty and interest in that the localized injury model should better reflect the pathophysiological conditions in various human liver diseases. Thus, the study should make significant contribution to the field in both technological and conceptual ways, providing useful and relevant platforms for the future studies on the mechanisms of liver injury, repair and regeneration.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is an interesting manuscript that explores how epithelial cells respond to genetically induced disruption of occluding junction formation. To ask how epithelial integrity is maintained under these conditions, the authors investigated the developing pupal epidermis in Drosophila, where they used genetic mosaic techniques to induce patches of mutant tissue lacking selected components of bicellular or tricellular septate junctions (SJs), respectively. They show that occluding junction defects result in elevated levels of E-Cadherin, F-actin, and activated myosin II at adherens junctions (AJs) in the mutant tissue, suggesting that epithelial cells sense breaches in barrier integrity and respond by reinforcing adhesion and actomyosin contractility. Consistent with this idea, the authors find mechanosensitive proteins (Ajuba, Vinculin) enriched at AJs in the mutant cells, and show that new cell-cell interfaces after cytokinesis are shortened in cells lacking the tricellular SJ (tSJ) component Aka. Moreover, aka mutant cells accumulate beta-Integrin, F-actin and vinculin on their basal side, suggesting that upon disruption of tSJs cells increase matrix adhesion by forming focal adhesions (although the authors did not address whether these structures are bona fide focal adhesions that connect to ECM). The authors go on to ask how disruption of SJs is sensed and translated into enhanced adhesion and contractility. Previous work (Pannen et al. eLife 2020) established that the ESCRT complex is required for retromer-dependent delivery of SJ components to their correct membrane destination, and that loss of ESCRT function leads to disruption of SJs. Building on this and their own earlier work, the authors show that SJ defects are accompanied by enlarged ESCRT III (Shrub:GFP)-positive structures, elevated numbers of HRS-positive vesicles, and accumulation of polyubiquitinated proteins. The latter effect upon SJ disruption was reminiscent of Shrub/ESCRTIII loss of function, leading authors to propose that modulation of ESCRT activity prevents SJ protein degradation in favor of SJ protein recycling. Such a scenario could be expected to result in elevated SJ protein levels at the plasma membrane, but whether this is the case is not addressed in the paper. Instead, the authors switch here to analyzing effects of shrub RNAi on the apical determinant Crumbs, which accumulates at or near AJs in cells lacking bSJ (nrv2) or tSJ (aka) components, consistent with reduced degradation and/or increased recycling of Crumbs protein. Finally, they show that clusters of beta-integrin (Mys), associated with vinculin and F-actin, appear on the basal side of aka-depleted cells, leading the authors to conclude that SJ-defective cells reinforce their adhesion to the ECM, perhaps to prevent extrusion from the epithelium. While the appearance of Mys clusters on the basal side is convincingly demonstrated, I don´t see evidence for apical focal adhesions, as depicted in the cartoon in Fig. 7. If focal adhesion-like structures exist on the apical side, to what kind of ECM molecules should they attach there?

      Overall, the manuscript describes interesting new findings that are well documented and should be of interest to a broad audience of cell and developmental biologists. However, the following questions and technical issues remain to be addressed before the manuscript will be ready for publication.

      Major comments:

      The title refers to a "mechanism sensing paracellular diffusion barrier alteration", and in the discussion (line 325) authors state that "loss of bSJs and tSJs by altering the paracellular diffusion barrier triggers an ESCRT-dependent response...". However, no experiments to assess paracellular barrier function (epithelial permeability) are shown in the paper, and it is not clear that the ESCRT-dependent responses described here are triggered by altered barrier function per se, as stated by the authors, or by changes in other SJ-dependent parameters, such as cell adhesion or intra-membrane mobility of lipids and proteins. Statements about paracellular barrier alteration should be rephrased accordingly.

      Altered epithelial barrier function will likely influence osmoregulation via changes in organismal hormonal status and gene expression, which may contribute to the phenotypes described here. How much time passed between induction of mutant clones and phenotypic analysis? The authors should discuss these aspects, and consider that effects of altered barrier function will depend on the distribution and size of clones with defective SJs.

      In the discussion the authors speculate about a "sensing" mechanism based on (hypothetical) altered membrane lipid composition upon loss of SJs. However, such effects would not explain how altered barrier function per se (epithelial permeability) would be sensed by cells, as stated in the title and throughout the text. Please explain.

      How Shrb/ESCRTIII activity could be "redirected" or "modulated" by disruption of SJs remains unclear. Can the authors briefly outline possible mechanisms for modulation of ESCRT activity?

      The presentation of fluorescence intensity data in a rescaled ("standardized") format is uncommon and non-intuitive, as it obscures the true scale (fold-changes) and variation of the data. Also, if data were plotted as a range from 0 to 10, as stated in Materials & Methods, it is not clear why in all graphs (except for a single datapoint in Fig. 5C'?) values start at 1, not at 0. Highest values appear to cluster at 10 and lowest values at 1, suggesting these represent saturated or clipped signals, respectively. Were these datapoints taken into consideration for calculating mean values? Authors need to explain exactly how the analysis was done. Why was this type of representation chosen, and why should it be more appropriate than showing regular normalized data?

      Authors should explain why they jump between different mutant (aka, nrv2) and RNAi (aka, cora nrv2, nrxIV) conditions and different Gal4 driver lines (pnr-Gal4, sca-Gal4) to disrupt SJ integrity. The basis for choosing these different conditions is not always clear and makes results difficult to compare.

      The TEM images shown in Fig. 1A are difficult to interpret, because plasma membrane is barely visible. The images do not seem to contribute much and can be removed from the paper.

      The position of mutant clones is marked by absence of nuclear RFP (Fig. 1B and elsewhere), but drawings of clone boundaries (Fig. 1B) do not match with the pattern of RFP-positive/ -negative nuclei (Fig. 1B'), presumably because different optical sections are shown in Fig. 1B and and B'. This is confusing and needs to be explained.

      Minor comments:

      Line 102: "We recently reported that defects at tricellular Septate Junctions (tSJs) are always accompanied by bicellular Septate Junctions (bSJs) defects". Authors may want to mention that in embryonic and larval epithelia lacking tricellular SJs, bicellular SJs assemble initially, but appear to degenerate during later development (Hildebrand et al. 2015, Byri et al. 2015).

      Line 192 remove "another".

      Line 194: % enrichment and fold enrichment are used; stick to one way.

      Line 259 and elsewhere: Crb "activation" vs. accumulation or mislocalization. What do the authors mean by Crb "activation"?

      Line 346: "FK2 protein": the FK2 antibody does not detect a particular protein, but the polyubiquitin modification, presumably on many different proteins.

      Line 444: "Also, the observed changes at apical level might be mostly due to direct effects." I don´t see experimental evidence to support that the observed changes are mostly due to direct effects. Rephrase or remove.

      Information on how mutant clones were induced needs to be included in Materials and Methods.

      Results referred to as "not shown" should be shown, or corresponding statements be removed from the paper.

      The text needs to be carefully checked for grammatical and typographical errors.

      Significance

      How epithelial cells cope with disruptions of occluding junctions without losing tissue integrity is an important question with far-reaching implications for understanding epithelial biology and disease. This work makes a significant contribution here by carefully describing the interrelationship between disruption of occluding junctions and possible compensatory mechanisms at the level of adherens junctions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Septate junctions provide the barrier function in insect tissues, serving as analogs to the vertebrate tight junctions. Here the authors explore an interesting question-how do epithelial tissues respond to loss of barrier function in vivo. They use a powerful and well-studied system, the Drosophila pupal notum, which allows them to bring powerful genetic tools to bear and use state of the art imaging. Their data are lovely and carefully quantified. Together, they reveal some significant surprises. 1. Disrupting septate junctions leads to elevated accumulation of adherens junction proteins and myosin, and reduced apical area. 2. Disrupting septate junctions led to accumulation of many ESCRT-0-positive vesicles and of enlarged ESCRTIII vesicles. 3. Disrupting septate junctions led to elevated accumulation of Crumbs apically and of integrin-based focal adhesions basally. These observations are well supported by the data and in the results section conclusions are carefully drawn. I had some relatively minor comments outlined below about the results. My only significant suggestion concerns the Abstract and Discussion. The Abstract includes a statement that goes well beyond the data shown, and the Discussion is sometimes hard to follow. With these issues corrected, this will provide important new insights for cell and developmental biologists.

      1. The Abstract states: "We report that the weakening of SJ integrity, caused by the depletion of bi- or tricellular SJ components, reduces ESCRT-III/Vps32/Shrub-dependent degradation and promotes instead Retromer-dependent recycling of SJ components." This is too strong, as the role of the retromer, while plausible, is not directly tested. It's fine to speculate about this in the Discussion but drawing a conclusion like this in the Abstract is unwarranted.
      2. Similarly, the title suggests that "ESCRT-III-dependent adhesive and mechanical changes are triggered by a mechanism sensing paracellular diffusion barrier alteration". They show that knocking down septate junctions alters localization of vesicle trafficking machinery, and that it leads to alterations in apparent recycling of cargo, but do they ever really assess whether these changes are ESCRT-III-dependent? Wouldn't this require knocking down ESCRT-III in cells with defects in septate junctions? There was a lot of data in this paper and perhaps I missed it but was this experiment done? I am not suggesting they do it, but that they temper this conclusion if not.
      3. The authors assessed "poly-ubiquitinylated proteins aggregates appearance, marked using anti-FK2" . They need to define FK2-what does it detect.
      4. Fig 4-is this a clone, and are we far from the boundary? Make this clearer
      5. The authors state: "Despite these apparent similarities, we noticed that, in contrast to Shrub depletion, NrxIV did not accumulate in enlarged intracellular compartments upon Cora depletion" Could the authors reference a Figure here?
      6. The authors state: "Hence, if both Shrub and bSJ/tSJ defects lead to Crumb enhanced signals" It might be better to say "altered" as they then point out the differences.
      7. I found the Discussion challenging to follow. Rather than focusing on the core observations, it addresses many, not very well-connected speculative possibilities, and in my opinion, will be challenging for most readers to follow. I would encourage the authors to revisit it from top-to-bottom.

      Referees cross-commenting

      I think we largely agree that the authors present important data, but that certain points need to be better explained or more clearly documented. While Reviewer 1 is correct that adding context about the basolateral polarity proteins would be helpful, I do not feel as strongly about this as a deficit. The authors did not manipulate Scrib, Dlg or Lgl, and i think their polarity functions may be distinct from those of the more "structural" septate junction proteins analyzed here.

      Significance

      Septate junctions provide the barrier function in insect tissues, serving as analogs to the vertebrate tight junctions. Here the authors explore an interesting question-how do epithelial tissues respond to loss of barrier function in vivo. They use a powerful and well-studied system, the Drosophila pupal notum, which allows them to bring powerful genetic tools to bear and use state of the art imaging. Their data are lovely and carefully quantified. Together, they reveal some significant surprises. 1. Disrupting septate junctions leads to elevated accumulation of adherens junction proteins and myosin, and reduced apical area. 2. Disrupting septate junctions led to accumulation of many ESCRT-0-positive vesicles and of enlarged ESCRTIII vesicles. 3. Disrupting septate junctions led to elevated accumulation of Crumbs apically and of integrin-based focal adhesions basally. These observations are well supported by the data and in the results section conclusions are carefully drawn. I had some relatively minor comments outlined below about the results. My only significant suggestion concerns the Abstract and Discussion. The Abstract includes a statement that goes well beyond the data shown, and the Discussion is sometimes hard to follow. With these issues corrected, this will provide important new insights for cell and developmental biologists.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This paper investigates the cellular response of Drosophila epithelia cells in the notum to damage to septate junctions. They find that disruption of tri- and bi-cellular septate junctions (SJ) integrity alters the distribution of adherens junction (AJ) components including E-cadherin, Myosin-II and others. Loss of SJ increases levels of AJ proteins. They then show that loss of the tri-cellular junction protein Anakonda alters the adhesive and the mechanical properties of the epithelia. They showed Myo-II activation was increased, however laser ablation/recoil studies did not reveal a change in local membrane tensions. Changes in membrane tension were observed by quantifying cell divisions in bicellular septate junction (bSJ) mutants. Building on previous work, they show that defects in SJs lead to ESCRT complex defects, and that loss of tricellular or bicellular septate junction components increase apical-medial Crumbs localization and triggers assembly of focal adhesion contacts. Together, these results show that alterations in SJ structures result in apparently compensatory increases in Crbs and focal adhesion-based intercellular adhesions that is mediated by the ESCRT complex.

      Major comments:

      • Title, abstract and paper body: e.g. title "ESCRT-III-dependent adhesive and mechanical changes are triggered by a mechanism sensing paracellular diffusion barrier alteration in Drosophila epithelial cells"
      • The paper is completely focused on the septate junctions as a paracellular diffusion barrier. However, many of the septate junction components, including Scribble, Dlg, and Lgl, have well documented (if poorly understood) basolateral polarity functions, and considering that septate junctions contain 15 or more cell-cell adhesion proteins, they are also likely to have a adhesive/structural function in addition to paracellular barrier and polarity functions. There is no attempt in the paper to consider or disentangle these multiple roles. Indeed, the introduction and discussion consider the vertebrate tight junction as the analogue of the insect septate junctions when a better view would be that the septate junction is a combination of the claudin-based barrier function of the vertebrate tight junction and the vertebrate basolateral polarity proteins Scribble, Dlg and Lgl that localize similarly and presumably have a function similar to the Drosophila basolateral polarity/SJ proteins for which they are named. Moreover, there are no experiments in the paper to address whether the relevant parameter being sensed in SJ defects is loss of the paracellular barrier, loss of cell adhesion/contact/structure or disruption of the polarity function of the SJ complex. Notably, there aren't any experiments in the paper that test paracellular barrier function. This criticism does not in any way reduce the importance of the paper or the results, but to avoid presenting an overly simplistic and probably misleading view of the cellular processes in play, a more comprehensive discussion of SJs is in order.
      • line 245: "We propose that it is the Shrub activity that is being modified upon SJ alteration, preventing SJ component degradation in favour of SJ component recycling."<br /> line 288, "Thus, as proposed above for Nrx-IV, these data further suggest a hijacking of Shrub activity toward recycling components upon alteration of SJ integrity."<br /> Model in Fig. 7 Arrows showing increased SJ protein delivery in right bottom panel, but decreased bicellular SJ complex formation in the left bottom panel.<br /> The authors demonstrate that in bicellular SJ mutants, there is increased accumulation of Crb, adherens junction components, focal adhesion components, and in the text and in the model in Fig 7 focus on the upregulation of recycling activity. However, as indicated by the reduced bSJs in the left bottom panel in Fig 7, and in the reduced Nrx levels in 3C' and in the text in lines 351-53, the levels of most septate junction proteins drop in the absence of any of 15+ bicellular septate junction mutants. Previously the authors should that reduction of tricellular septate junction proteins increased levels of septate junction proteins in bicellular junctions which the authors translate to increased delivery of "SJ components" to the membrane in SJ mutants as shown in Fig 7 bottom right panel and stated in lines 245 and 288. But the data in the paper, which is consistent with statements on lines 351-353 saying that bicellular SJ mutations cause a general reduction of SJ protein levels, suggests either a more nuanced role of recycling such that Crbs and other proteins show increased recycling in bicellular SJ mutants, but biclellular SJ proteins show decreased recycling, or an alternative scenario in that the SJ proteins are recycled more in a SJ mutant, like Crb is, but SJ proteins don't form stable complexes which leads to their modification that targets them for destruction despite being recycled more. Regardless of the actual explanation, I think readers will be confused by the statements in the current version of the paper about upregulation of recycling activity but apparent reduction of SJ proteins. The authors should address this issue with appropriate changes to text and the model figure.

      Minor comments:

      • The assumption in the paper is that the changes in protein levels result from changes in recycling of the proteins. However, it would be nice to rule out transcriptional regulation. Has anyone established smFISH in the notum that would allow quantification of Crb or other marker RNA to show that there is not increased accumulation of the Crb RNA in the SJ mutant backgrounds?
      • line 58. SJ are only the functional equivalent of tight junctions for paracellular barrier function. SJ have basolateral polarity function that correspond to basolateral polarity proteins in vertebrates, whereas vertebrate TJs are associated with apical complexes. In addition, the mechanical properties of SJ and TJ are probably wildly different since the SJ is a much more elaborate structure with many more cell-cell adhesion proteins than TJs. I feel the presented over-simplification do not adequately inform the reader about alternative functions and therefore hypotheses about the data in the paper.
      • lines 120-121 , Figure 1A-A'. Please quantify the relative frequency of holes observed in the EM sections. Is it every tricellular junction or 1 in 100? Is WT statistically different than mutant?
      • line 126-127 (data not shown). Does EMBO allow data not shown? Just checking current rules.
      • lines 134-135. "We observed similar results upon loss of Gli and M6". Is this data not shown? I couldn't find it. Please either reference a figure or note as "data not shown" if that is allowed.
      • line 319 "We propose that the disruption of SJ barrier in the ...", also line 326 . I suggest the use of "SJ complex" instead of SJ barrier or paracellular diffusion barrier, otherwise the authors need to provide some evidence or rationale that it is the barrier function of the SJ that is triggering the recycling changes rather than the disruption of the polarity or adhesive/structural functions of the SJs.
      • line 341 "Our work shows that a part of the sensing mechanism involves the ESCRT machinery."<br /> I think that the ESCRT machinery is better described as part of a response mechanism to SJ defects than as a "sensor". I don't think the paper presents any evidence that the ESCRT machinery is part of the sensing mechanism for SJ defects. There is lots of evidence that the ESCRT machinery is modified by SJ defects, but that supports a role as part of the response machinery, not as the sensor that directly detects SJ defects.

      Significance

      The topic and results of the paper will be of interest to a wide range of the cell biology community including those studying epithelial integrity, junctions, polarity and endocytic trafficking. The results break new ground in looking at the dynamic relationships between junctional complexes. This paper is generally well written, with the exception of the major comments below which, and the experiments well done. Overall a very interesting paper that is appropriate for a top tier journal.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study, the authors used metabolic labeling of newly-replicated or nascent chromatin followed by quantitative Mass spectrometry (iPOND-MS) to characterize protein composition of nascent chromatin at time points after DNA replication: immediately after a short pulse of EdU labeling (nascent), and after 1 and 2 hours of Thymidine chase (maturing chromatin). The iPOND method was established before but in the current manuscript the authors combined this with inhibiting RNA Pol II transcription at distinct stages to determine the effects on transcription and RNA Pol II cycle on chromatin protein dynamics at the wake of DNA replication. The inhibitors they used are Triptolide, which blocks transcription initiation and induces a proteasomal degradation of all chromatin bound RNA PolII, and DRB, which blocks transcription elongation causing an enrichment of paused RNA PolII. The authors compared the relative enrichment of ~1200 proteins on nascent and maturing chromatin and the effects on transcription inhibition on these proteins.

      The authors found that RNA PolII does not affect the loading or retention of most histones on nascent chromatin except for the histone variant H2A.Z, which requires RNA PolII loading. However, DRB treatment (no elongation) resulted in stabilization of all histones (which the authors do not seem to catch on). Interesting, unlike the histone, both replication-coupled and -independent histone chaperons seem to be enriched immediately behind the fork and are affected by RNA PolII to different extents. They next look at ATP-dependent remodelers and find that most remodeler families are facilitated by RNA PolII loading, while elongation affects some remodeler families and not others. They see the same trend looking at a wide variety of transcription factors. Interestingly, while RNA PolII loading is required for the establishment of some histone post translational modifications (H3K36me3), some others such as H3K9me3 and H4K20me2 are negatively affected. Finally, the authors find that RNA PolII elongation promotes binding of several DNA repair proteins, and speculate that this is because of DNA damage from replication-transcription conflict.

      My main concern about this manuscript is that the relative enrichment of most factors show variability across the time points, which make the interpretation of the data difficult. This becomes more concerning when we look at protein complexes such as the ATP-dependent remodelers. Subunits of the same complex which are expected to bind together show different patterns of enrichment. This raises the concern as to how data was normalized. Furthermore, how do the replicates compare to each other? The others selected ~1200 proteins which were enriched in all three replicates, but how does their relative enrichment compare in the replicates? The authors need to show some kind of comparison across replicates to confirm that the differential relative enrichments are real and biologically meaningful.

      Also, the TF data is very descriptive. Insightful analysis of similarities/differences between types of TFs would be interesting.

      Minor comment: The formaldehyde cross linking used in iPOND makes it difficult to interpret/distinguish what is actually chromatin bound versus what is enriched due to protein-protein cross linking. The authors should highlight that in the limitations section.

      Referees cross-commenting

      I agree with most of Reviewer 1's comments about the lack of proper controls and normalization, which make the interpretations difficult. Particularly all of the controls mentioned under point 1 should not be difficult to perform, and if included, would strengthen the study and the manuscript.<br /> Reviewer 1 makes an important point about normalization, which I totally agree with. Ideally, a spike-in approach would help obtain a much more quantitative and reliable understanding of differential protein enrichment. However, repeating all iPOND experiments with spike-in might be a big ask. What the authors could do at minimum is show how replicates compare with each other. It looks like they pooled all three replicates for analysis, but comparing relative enrichment of all 1257 proteins across replicates would help. The point about delayed histone occupancy is a critical one and difficult to rationalize. To note, histone chaperons are enriched on nascent, but histones are not. Besides, in the current way that the data is analyzed and presented, there are a lot of fluctuations in protein enrichment across the 1-2 hour timepoints of chromatin maturation, which would be very interesting if real. For e.g., Fig. 1I, Triptolide treatment, most of the cluster I and cluster II proteins show medium-high enrichment on nascent, depleted in 1h, but recover in 2h. If the binding/recruitment of these proteins on newly-replicated chromatin is RNA Pol II dependent, why would they come back after 1h? If this real, this would be very interesting. There are several additional examples of problems with quantification/normalization. As for SWI/SNF subunits, both SMARCA4 and SMARCC1 are core subunits and based on several thorough biochemical studies, cannot be expected to bind separately. However, they show different kinetics in DMSO as well as TPL and DRB.

      Another problem of the assay is that it shows genome-wide average. As Reviewer 1 rightly pointed out, transcription inhibition could disproportionally affect chromatin maturation kinetics in different genomic regions. Perhaps it would be interesting to analyze sets of genomic regions separately, such as highly transcribed and lowly transcribed genes. This might be achieved by adding a purification step using pools of DNA sequence probes before or after the streptavidin enrichment.

      Additional comment: The formaldehyde cross linking used in iPOND makes it difficult to interpret/distinguish what is actually chromatin bound versus what is enriched due to protein-protein cross linking. The authors should highlight that in the limitations section.

      On a positive note, it is a very important and timely study and the manuscript has a lot to consider. Addition of proper controls and normalization/analysis of replicates will make it stronger

      Significance

      Overall, it is a very important and timely study, and the manuscript has a lot to consider. There are several recent papers on the kinetics of chromatin maturation behind the replication fork, and this study adds a very important dimension to this ongoing investigation, and will be of interest to a broad readership in the chromatin and transcription field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, the authors characterized the re-establishment of chromatin after DNA replication in fibroblasts using iPOND-MS. By using a short pulse of EdU, followed by different length of thymidine-chase, the authors compare the proteome at nascent DNA (just after the EdU pulse) with the proteome on re-established chromatin (1h and 2h post EdU pulse). Moreover, by using two different transcription inhibitors, they investigate the implication of active transcription elongation and of RNAPII binding itself on the reestablishment of chromatin. They show that different transcription factors bind to newly replicated DNA with different kinetics and are affected differentially by transcription inhibition. They also show that upon transcription inhibition by DRB, certain DNA damage repair proteins are depleted, implicating transcription in the recruitment of these factors at nascent DNA. Chromatin remodelers were shown to be enriched on nascent DNA, but triptolide-transcription inhibition reduced their enrichment, implicating RNAPII in the reestablishment of chromatin structure and of steady-state chromatin accessibility. Lastly, the authors show that histone incorporation and histone modification restoration on nascent DNA is mostly uncoupled from transcription with the exceptions of H3.3K36me2 (transcription inhibition by triptolide or DRB drastically reduces restoration) and H4K16ac (DRB treatment increases its incorporation in nascent DNA).

      Overall, the results and the analysis of the datasets appear robust and well executed. Nevertheless, the work provided by the authors feels mainly descriptive and does not provide further mechanistic insights beyond the current state of the art. Some follow-up experiments to study the functional impact on the different enrichment patterns on nascent DNA or the function of the dependency on RNAPII for the reestablishment of steady-state enrichment on chromatin of some factors would have greatly increased the scientific impact of the manuscript. Nevertheless, the proteome of nascent DNA, its kinetic, and the effect of transcription inhibitors will provide interesting information and a useful resource for research groups in the DNA replication, chromatin, epigenetic, and DNA damage repair fields. Thus, in conclusion, I would recommend this manuscript to be published in its current state in a lower tier journal such as MBoC or PLOS ONE journals. If the authors can provide additional mechanistic insights by addressing at least a few of the specific points listed below, I think it would become a stronger candidate for a journal with higher impact.

      Major comments:

      1. At p.7, the authors state: "Altogether, this analysis further confirms that RNAPII's binding and elongation on newly replicated chromatin are a source of genotoxic stress, and identifies dedicated repair factors handling transcription replication conflicts.". I don't think that depleted DNA repair proteins from nascent chromatin upon DRB treatment is enough to claim that the analysis confirms that transcription on nascent DNA is a source of stress. Another possibility could be that transcription helps handling prior DNA damage on nascent DNA without causing the damage. A useful experiment to clarify this point would be the direct quantification of DNA damage markers on nascent chromatin (e.g yH2AX-EdU colocalization quantification by immunofluorescence). Has the yH2AX variant been detected in the iPOND MS dataset? Another possible follow-up experiment could be to detect direct physical DNA damage on nascent DNA for example by using a TUNEL assay or similar DSB mapping method. Can the DNA damage be prevented by DRB or TRP addition?
      2. Figure 1B-E: Can the authors also show quantifications of EU, RNAPII and EdU at the 1h and 2h timepoints after the chase?
      3. The authors state in p.7 that "The other proteins are DNA repair proteins involved in fork quality control and HR as well as transcription replication conflicts (Berti et al., 2020).". I think this gives rise to the question if the effect of DRB treatments on the enrichment of certain proteins at nascent DNA is due to the inhibition of transcription elongation inhibition on nascent DNA or in front of replication forks, affecting the enrichment of proteins implicated in handling transcription-replication conflicts in front of the fork and not on nascent DNA itself. The authors should address the possibility that some of the proteins enriched in the iPOND-MS datasets could be there because they are enriched in front of the replisome instead of on the nascent DNA.
      4. On this topic, transcription inhibition is performed for two hours prior to the EdU pulse and iPOND-MS procedures. For the DRB treatment, I would expect RNAPII to be paused/stalled prior to the passage of the replication fork that will replicate the analyzed EdU-labelled nascent DNA. This would mean that replication forks during the EdU pulse will encounter paused/stalled RNAPII, generating potential problems. Such interference would most probably lead to chromatin removal of RNAPII from the chromatin. Surprisingly, the authors show enrichment of RNAPII at nascent DNA. How can the author differentiate from accumulation of RNAPII in front of the fork, leading to purification by iPOND, and RNAPII on nascent DNA. Also, if the accumulation of RNAPII is on the nascent DNA, do the authors suggests that RNAPII gets loaded more on nascent DNA while under DRB inhibition or that stalled RNAPII are mainly by-passed by replication forks, leading to their enrichment on nascent DNA?
      5. At p.14, the authors state: "Because they share the same DNA template, transcription is known to challenge replisome progression at high frequency, from RNAPII constituting a roadblock to progressing replisomes, to generate RNA:DNA hybrids (Berti et al., 2020). It is therefore remarkable that behind replisomes, only a handful of DNA repair factors appear to be involved in response to RNAPII binding and elongation.". How does the fact that transcription represents a roadblock in front of the forks makes it remarkable that only a handful of DNA damage repair pathways are involved behind the fork (where they are not a roadblock to any replisome)?
      6. At p.11, the authors states: "As di and tri-methylations require several hours to be re-established following DNA replication (Alabert et al., 2015; Reveron-Gomez et al., 2018), 11 minutes after the passage of the fork, such increase most probably reflects an increase of H4K20me2 and H3K9me3 on recycled parental histones.". Can the authors extend their interpretation of this result? Do the authors think that DRB treatments increase methylation of histones in G1, prior to replication, or specifically in front of the fork (due to conflicts? DNA damage?), and that those methylated histones get recycled on nascent DNA?
      7. Figure 4: The authors perform the histone PTM analysis under 0h (nascent chromatin) versus 2h (re-established chromatin) timepoints. It would have been insightful to also include a 1h timepoint in this experiment. There appear to be some trends/changes but they do not show statistical significance (e.g. H4K5K12ac or H3K14ac). It might be useful to increase the number of biological replicates (including the 1h timepoint) here, which could improve the confidence in the results and/or discover additional transcription-dependent changes of histone PTM restoration.

      Minor comments:

      1. Fig3I: It would be nice to show a TF from the "Restored within 11 min" category as a comparison point.
      2. In page 14, th authors state: "However, we did not detect significant signs of DNA damage in DRB treated cells (Fig. 2A, 2B).". Which signs the authors looked at?
      3. In the iPOND experiment, which size of DNA fragments is achieved?
      4. At p.14, the authors state: "Because they share the same DNA template, transcription is known to challenge replisome progression at high frequency, from RNAPII constituting a roadblock to progressing replisomes, to generate RNA:DNA hybrids (Berti et al., 2020)." The paper has not addressed the role of RNA:DNA hybrids in these processes.
      5. Fig3D: Is there enough datapoints to state a conclusion?
      6. S1A: mistakes in the x axis labels ("no EU" in a EdU quantification graph, "no EdU" in a EU quantification graph).
      7. S1F is not sufficiently described in the legend. It took me some time and additional efforts to understand what the right panel of S1F was showing.
      8. S2E-F: are the axis wrong? Is it supposed to be Nascent when its comparing total extracts?
      9. A lot of graphs have non-precise axis labels that needs reading of the manuscript and/or legends to understand. For example: 1K-L (distribution, %), 2L (% of the max), 3B-C-D log2(Nascent/2h), 3G IP/Input, 4C (Inhibitor treatment/DMSO (%)), S2E-F (TPL/DRB Nascent/ DMSO Nascent), S3A (IP/Input), S4A (No Y axis label).
      10. FigS4: Assignment of colors in bar graphs of C-J to treatments is not shown. Heatmaps in H and K do not show if these are 0 or 2h. The heatmap in H shows H3 modification and the heatmap in K shows modification in H3.3 but the labels of the modification in K (except the first one) are the names of the modifications of H3, not H3.3. In the legend, GAPDH is written GABDH.

      Significance

      In this manuscript, the authors characterized the re-establishment of chromatin after DNA replication in fibroblasts using iPOND-MS. As mentioned above, the work provided by the authors feels mainly descriptive and incremental and therefore does not provide further mechanistic insights beyond the current state of the art. Some follow-up experiments to study the functional impact on the different enrichment patterns on nascent DNA or the function of the dependency on RNAPII for the reestablishment of steady-state enrichment on chromatin of some factors would have greatly increased the scientific impact of the manuscript. Nevertheless, the proteome of nascent DNA, its kinetic, and the effect of transcription inhibitors will provide interesting information and a useful resource for research groups in the DNA replication, chromatin, epigenetic, and DNA damage repair fields. Thus, in conclusion, I would recommend this manuscript to be published in its current state in a lower tier journal. If the authors can provide additional mechanistic insights by addressing at least a few of the specific points, I think it would become a stronger candidate for a journal with higher impact.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The goal was to characterize the changes in the composition of the proteome associated with replicated DNA in conditions of genome-wide inhibition of transcription initiation or transcription elongation. They use iPOND, a MS based technique that identifies proteins specifically associated with replicated DNA labeled with EdU. They use non-synchronized foetal human lung fibroblasts and examine time points immediately after replication (after the 11 min EdU pulse) and 1 and 2 hrs after the Thymidine "chase" later when chromatin has "matured", to assess how the inhibition of transcription influences chromatin maturation and the binding of chromatin associated proteins to replicated DNA.

      The question is pertinent and is in line with the long-standing interest of the group in chromatin replication dynamics. They conclude that 1. RNAPII loading is necessary for the binding of some TFs, chromatin remodelers and DNA repair factors and 2. RNAPII elongation is needed for H2A.Z incorporation , H3K36me2 restoration and DNA repair factor binding. Transcription is on the other hand not needed for nucleosome assembly, histone acetylation and H3K9me3, H3K27me3 and H4K20me2 incorporation or restoration.

      There are two main issues that make the interpretation of their results very difficult and make me question their conclusions:

      1. They don't provide sufficient evidence that the treatments with TPL and DRB do not interfere with replication. The distributions of EdU intensity per EDU+ cell after treatment in Figures 1D-E and S1A are not sufficient. It is not clear why EdU incorporation is so heterogeneous in the cell population (the range of intensities goes from near 0 to 50000!), which makes me wonder if the DMSO treatment also has an effect on replication. I don't think this heterogeneity can simply be explained by the fact that the the cell population is asynchronous. They need to show a -DMSO control as well. Besides since they are only using a positive EdU signal as their criteria for replicating cells, they cannot rule out that some of the EdU signal is coming from DNA repair after replication and depending on how deleterious DMSO/TPL/DRB are to replication the fraction of cells that undergo DNA repair might be significant. More importantly, they need to show that the various treatments don't interfere with the replication program, especially since replication is coupled with new nucleosome assembly and the transcription of replication dependent histone variants is induced during S-phase. Transcription inhibition could disproportionally affect the replication of some parts of the genome more than others and since there is no evidence to the contrary the differences that they observe between the TPL/DRM treated and DMSO treated proteomes bound to replicated DNA could just be because they were isolated from different genomic loci. I am also not convinced that they are able to stop EdU incorporation after 11min with the addition of only equimolar amounts of Thymidine (20µM EDU and 20µM Thymidine). Equimolar amounts of Thymidine are not sufficient to stop EdU incorporation rapidly. They need to show the kinetics of EdU incorporation in synchronized cells +/- Thymidine.<br /> Without these controls it is impossible to draw any meaningful conclusions from the iPOND data.
      2. The normalization of iPOND and total protein MS data is problematic. It seems that each time point from each treatment was first normalized internally to the median of all protein levels in each dataset and then the relative abundances of each protein were normalized to 100% over all treatments and time points. Internal normalization makes it impossible to directly compare time points and treatments between each other. If the enrichment of a protein goes down from one time point to the next it doesn't mean that there is less of that protein on replicated DNA in absolute terms, it just means that there is less of that protein relative to the median of the whole set of proteins at that time point. Their claim that they are comparing iPOND enrichments to total protein abundance is misleading since the data from total protein extracts was also internally normalized so they are comparing relative enrichments in iPOND data to relative enrichments in total cell extracts, which unsurprisingly do not correlate. It is impossible to make any meaningful conclusions about proteome dynamics using this kind of analysis. They should have used external normalization with a "spiked in" protein to be able to directly compare time points and treatments.<br /> Such as it is right now, their analysis produces some puzzling conclusions that I suspect will turn out to be artefacts of their normalization procedure. It is not clear for example why the appearance of histones on replicated DNA would be delayed as they claim: in yeast nucleosomes (new and old recycled ones) are assembled on replicated DNA within minutes of the passage of the replication fork, I don't see why this would not be the case in human cells since the replication machinery is essentially the same in humans and yeast. It is also puzzling why RNAP2 is enriched in the nascent and 1hr time points but then becomes depleted in the 2hr time point in the DRB treatment since global RNAPII levels don't change in the DRB treatment compared to DMSO (Figure 1C). All the conclusions for PTM restoration/incorporation are essentially meaningless: internal normalization makes it impossible to detect whether PTM levels double at the 2hr time point compared to the Nascent time point in the DMSO treatment, as would be expected for all examined PTMs except for H4K5K12Kac which are marks of new histones. Right now, relative PTM levels are all over the place: only histone acetylations seem to increase, while H3K9me3 and H3K27me3 don't change even though they should also double since heterochromatin should also be restored on both sister chromatids. They will only be able to accurately assess the impact of transcription inhibition on PTM restoration when they are able to reliably measure the rate of increase of PTM levels during chromatin maturation.

      Referees cross-commenting

      On reviewer's 2 comment on significance:<br /> I think a thorough descriptive analysis of a biological process is extremely valuable and unlike my colleague, I think these types of studies need to be published in high impact journals with a broad readership. Biological processes need to be described first as completely as possible before we can propose meaningful models on how they function and identify the molecular mechanisms that execute and regulate them. As my colleague is surely aware, thorough descriptive studies of any poorly characterized biological process take years (i.e. at least one grant cycle) and comprehensive follow up mechanistic studies can take even longer than the initial descriptive study and can only be done during the following grant cycle, if the authors were lucky enough to obtain funding. Funding agencies however are more likely to award grants to perform these follow up mechanistic studies if the authors (especially if they are junior PIs) have published in higher impact journals in their previous grant cycle. The kind of thinking exhibited by reviewer 2 disproportionately disadvantages junior PIs that work on understudied biological processes. It is a disservice to scientific progress to dismiss excellent descriptive studies and "downgrade" them to lower impact journals where they will be unfairly labeled as a "work of lesser importance". This kind of thinking is also a disservice to the lower impact journals that often publish works whose quality is comparable to articles published in high impact journals. I value more any comprehensive description of a biological process over what most of the time passes for mechanistic insight that is deemed worthy of publication in a high impact journal i.e. a hastily analyzed phenotype of, more often than not, one single mutant tacked on at the end of a descriptive study. This one mutant phenotype then forms the basis of a somewhat "slapdash" model that is often proven wrong by subsequent publications and that the authors would have probably dismissed themselves had they been given more time to develop and test their model in a follow up publication.<br /> I do not think the main issue with the present study is its descriptive nature. As I said in my review, the main issues are technical: the lack of external normalization of MS data and insufficient evidence of the impact of transcription inhibitors on replication dynamics. The study should not be published in any journal (high or low impact) before those issues are resolved.

      on reviewer 2's remarque 4. in major comments:<br /> iPOND identifies proteins bound to 100-300bp fragments labeled with EdU (i.e. after replication or DNA repair). It is by definition identifying proteins bound to chromatin behind the fork, so I don't think that the isolation of RNAPolII bound in from of the fork is a major issue

      Significance

      I am not convinced by their conclusions and I cannot recommend that the the study be published at this stage due to normalization issues and insufficient evidence that transcription inhibition does not perturb the replication program (see above). They would need to redo all the iPOND experiments using external "spike in" normalization and monitor replication genome-wide before they can make any meaningful conclusions about the transcription dependent composition of the proteome associated with replicated DNA.

      Expertise keywords: Chromatin, Genomics (assay development and bioinformatics analysis) , Replication, Transcription

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript under evaluation investigates the mutation rate between generations and during regeneration in the planarian species S. polychroa. Abundant adult pluripotent stem cells, the potential somatic stem cell contributions to the germ line, and the regeneration of entire animals from tiny tissue fragments provide interesting conceptual background to these questions. Specifically, the authors assemble a draft genome of a purportedly triploid S. polychroa strain with an accompanying de novo transcriptome. Via shallow whole genome shot gut sequencing, the authors subsequently attempt to estimate the germline mutation rate and the effect of regeneration on the number and the spectrum of detectable mutations. As stated in the abstract, the study "...provides, for the first time, the draft genome assembly of S. polychroa, the germline mutation rate for a planarian species and the mutational spectrum of the regeneration process of a living organism". As detailed under "major points" below, the draft genome assembly is not great and significant concerns remain regarding both regeneration rate statements.

      Major points

      Genome and transcriptome assembly:

      1. As the authors state, ploidy is known to be highly variable across S. polychroa strains and ploidy is an important variable in the estimation of mutation rates. The authors should therefore provide additional experimental evidence that their strain is indeed triploid (e.g., through karyology or FACS genome size estimation).
      2. Assembly quality assessments: As stated, the authors intend to use a de novo assembled short-read transcriptome as an independent assessment of assembly quality. However, no information on the quality of the transcriptome reference is provided. Salient quality control metrics on the size, completeness, and assembly contiguity of their transcriptome need to be provided. The published and publically available S. polychroa transcriptome on PlanMine [cited by the authors] might provide a further useful reference.
      3. The assembly is highly fragmented ( 8700 contigs) and the authors detect a significant number of scaffolding errors within contigs (Fig. 3d) . Moreover, the large number of multi mappers might indicate that the draft assembly has been insufficiently purged of haplotigs. This is especially a concern in a potentially triploid species and should be assessed via a kmer-based approach such as Merqury. Ideally, an independent Illumina dataset from the mutation screening and the reads used for polishing should be used. The analysis of synteny and structural variation relative to S. mediterranea in Figure three is therefore of questionable relevance and should be dropped or significantly condensed.

      Mutation analysis:

      1. The SNP analysis is based on a pipeline that the authors published previously. Beyond this, the manuscript provides insufficient information in terms of technical details (e.g., the salient IsoMut settings, how duplicate reads were treated, and how the SNP calling approach guards against the ever-present possibility of sequencing artifacts). In addition, the authors should discuss whether the pipeline adequately accounts for the i) triploid genome and ii) the fragmented and potentially insufficiently purged assembly. In absence of such information, the validity of the results remains difficult to ascertain.
      2. The statistical support for the germline mutation rate is weak (Fig. 5a). The sample size of the experiment is rather low for such studies, with only four Filia in each group. Furthermore, in both the control and regenerate group, two of the filia are siblings. This creates a data dependence that is not adequately discussed or taken into account for the analysis. For example, the authors conduct a t-test to determine if the number of detected mutations differs between the groups. A critical assumption of the t-test is that the samples are independent, an assumption that is violated when a relatedness structure is present. Similarly, this dependence could further influence the analysis of the mutational profile. Hence the authors either have to increase the sample size or temper the interpretation of their results, including the claimed estimation of germline mutation rate.
      3. The possibility of somatic mosaicism that the authors discuss extensively in the context of regeneration also complicates the interpretation of the clonal mutations between parents and filia. First, somatic mosaicism has been already demonstrated in a different planarian species and discussed in multiple reviews (e.g., PMID: 31221097, PMID: 35862435). This literature needs to be cited adequately.<br /> Second, the plausible contribution of individual somatic stem cells to the germ line leaves open the possibility that the observed parent/offspring differences in the control group also reflect rare pre-existing allele heterogeneities within the parental population of pluripotent stem cells. Therefore, clonal differences between parents and offspring cannot simply be attributed to germline mutations. Third, low, but measurable rates of sex are known to occur even in predominantly parthenogenetic S. polychroa populations (e.g., PMID: 16721392; PMID: 15293852]. These studies need to be cited and the possibility of parental genome contributions needs to be explicitly examined, as it would violate the requirement for an isogenic background stated on page 4. Overall, this means that the author's claim of providing the first quantification of the germ cell mutation rate in planarians is therefore insufficiently justified.<br /> The possibility of somatic mosaicism also impacts the interpretation of the apparent increase in genetic variation during regeneration. Given the limited depth of the sequencing assays, it remains difficult to refute the null hypothesis that the apparent increase in the mutation load of regenerates represents a subsampling of the standing genetic variation in the parent animals (and without the single-cell populational bottleneck of parthenogenetic reproduction). Also, the claims regarding the mutagenic nature of the regeneration process should therefore need to be dropped or significantly toned down.

      Minor comments:

      Fig.2a: The S. polychroa genome size estimates from genomesize.com Table S1 and Figure 2 a: The entries from the animal genome Size Database need to be removed from the figure, as this is published background information and not an analysis result.

      Page 6: The text description of the transcriptome backmapping results (Fig. 3A) is confusing: "...17.4 % were not mapped by GMAP,... the remaining transcripts were mapped as duplicates, at multiple positions or in two chimeric fragments... The authors need to insert the fraction of single mappers, as otherwise, they imply that they only obtain multi-mappers.

      Page 7: PlanMine needs to be cited as the source of the orthology information between S.med and S. pol.

      Page 10: What is the COSMIC database? Please explain/reference.

      Fig. 4 and 5: The experimental set-up cartoon in Fig. 4a is confusing and should more clearly illustrate which of the experimental groups involved regeneration, how many individuals were sequenced, and the meaning of the A/B terminology in subsequent graphs. Moreover, the authors need to ensure consistent symbol use, e.g., Fr1, 2, 3, 4 instead of the current F1, F2, ... in Fig. 5a.

      The authors discuss the potential contribution of methylation to the observed mutation spectrum and conclude that it might not be present in S. mediterranea and S. polychroa. Indeed, the lack of measurable mC in the planarian genome has already been demonstrated (PMID: 24063805). Please cite and shorten the respective text section.

      Fig. 6e: The authors estimate the number of stem cells in tail segments using H3P staining. However, H3P marks only a short segment of the cell cycle and therefore underestimates the number of resident stem cells. This caveat needs to be discussed.

      Typo in Figure S2: The splice site is marked wirth a black

      Significance

      Planarian flatworms harbor abundant pluripotent adult stem cells. These cells are the only division-competent cells in the animal and of pivotal importance to planarian biology. For example, they enable the regeneration of entire animals from tiny tissue pieces or the re-formation of the germ line in sexually reproducing strains. How planarians maintain their genetic identity in the face of abundant pluripotent adult stem cells and a strict soma/germline divide raises many intriguing questions.

      The manuscript provides a preliminary and highly fragmented draft genome assembly of the planarian species S. polychroa, which adds to the available planarian genome information. Based on the genome assembly, the manuscript attempts to measure generational mutation rates during parthenogenetic reproduction and regeneration. The quality of the SNP detection is somewhat difficult to evaluate in the current manuscript and the possibility of somatic heterogeneity in the parents raises concerns regarding the interpretation of the supposed germline mutation rate. The data provide further evidence for somatic mosaicism, which has already been demonstrated in a different planarian species. The extent by which regeneration is mutagenic per se or uncovers standing genetic variation due to the inherent population sub-sampling also remains unclear. Overall, the manuscript stands out as one of the first intra-organismal population genomics studies in the field. But I think not all its claims are sufficiently supported by data.

      I am a planarian biologist with experience in planarian genomics. I am not an explicit population genetics/genomics expert.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This paper presents the genome of the planarian Schmidtea polychroa, a sister species to the widely used regenerative model S. mediterranea. The preliminary analysis of the genome is sound and the authors establish that they have produced a good draft assembly. The authors then leverage this assembly to ask novel questions about mutation rates in regenerative planarians, a question that has not been well addressed previously. They use a clear and logical approach to measure mutation rates in relation to both development and regeneration and establish that different clones of stem cells exist in planarians and contribute to regeneration and production of germ cells.

      Overall the work is of a high quality, and logically explained.

      Major comment.

      I could find no statement about raw data availability or metadata availability, or availability of intermediate data analysis files. This makes proper review and consideration of the authors analysis essentially impossible, so all of this must be taken on trust and easy to do but helpful further analyses in the context of the existing data structure can't really be suggested. This is a great shame. Furthermore, in this reviewers' opinion this goes against the principle of open science. A revision must address this issue unless the reviewers' are planning to publish this paper in the 1990s print format only (forgive the sarcasm, but I hope the authors can concede this isn't a good way forward). Without data access the full impact of their very exciting work cannot land. If I have missed reference to access to the data or a GitHub link etc in the paper then I apologise, but I have looked extensively. Sequence Data submission can take time so they should do this in advance and perhaps share the link to the unreleased link too reviewers, and intermediate files and metadata can be shared on GitHub or the like.

      Minor comments.

      I enjoyed reading the paper immensely and I think it touches on many important and interesting theoretical ideas in the field.<br /> With regards to their comments on methylation they should note that previous work on S. mediterranea has rigorously shown that methylation is absent or very low, probably any residual comes from base scavenging from the calf liver food source. Additionally, DMNT 1 and 3 are absent from S. mediterranea, so canonical enzymes for methyl-cytosine addition and maintenance are not available. Citing this would be useful (Jaber-Hijazi et al, 2013, Developmental Biology https://doi.org/10.1016/j.ydbio.2013.09.020). Other work suggests there might be methylation in this group using retriction enzyme based approaches.

      I have some questions that relate to evolution of a parthenogen.<br /> Did the authors ever detect homozygous changes between "parental" generations and offspring or changes from a heterozygous state to a homozygous state?<br /> Given the parthogenic and triploid nature of Polychroa did the authors detect high levels of/accumulation of heterozygous alleles generally in the genome?

      Significance

      This paper will eventually be very significant to the regenerative biology community as it will give us comparative genomic capabilities for the well-established model S. mediterranea. Although not commented on much in the manuscript this is very important. Furthermore, the paper begins the important work of characterising mutation rates in this group of animals that avoid the ageing process entirely, this work will be another important foundation stone in understanding the phenomena that allow for this in this group of animals.

      I have expertise in genome assembly, analysis and annotation, as well in assessing variation in genomes. I also have expertise in the model system used here and its general biology, including regeneration.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper, using the triploid biotype of planarian Schmidea polychroa, the first half of the paper presents the results of the analysis of genome structure and the second half shows that (de novo) mutations in individuals that undergo regeneration are passed on by the next generation.

      While I think this paper contains interesting biological findings, I am skeptical about its novelty. I was convinced by the results and discussion of the analysis of genome structure, but the results and that of the analysis of (de novo) mutation were very confusing. This may be due to my lack of knowledge in this field. But even so, the author needs to improve this manuscript so that the general reader will better understand it.

      Major comments:

      1. The author mentions that it is important to note that this study was conducted using a parthenogenetic triploid biotype. However, I think that the parthenogenesis undergoing by a triploid biotype of S. polychroa is very unusual. It is not typical apomictic parthenogenesis. Triploid oocytes arise by meiosis from hexaploid oocytes derived from triploid adult somatic stem cells called neoblasts. On the other hand, haploid sperm arise by meiosis from diploid spermatogonia derived from neoblasts. Embryogenesis of triploid eggs then occurs by pseudogamy. Occasional sex is also known to occur even if the offspring's chromosome number remains triploid. I think this background is important information to give the reader. Also, don't the authors need to treat the results in this paper with this complex phenomenon also taken into account?
      2. Fig.4B-C: Analysis by lineage-specific mutations of parental controls.<br /> The authors do not specifically mention or discuss this result. What about the accumulation of mutations within such populations in typical parthenogenesis (daphnia and aphids)? In other words, are the results in Fig. 4B-C due to the special mechanism for parthenogenesis in the triploid S.polychroa as described above?
      3. Throughout this paper, the authors show that regeneration increases de novo mutations in the progeny. The authors conclude that many of the mutations occurred in neoblasts during regeneration. However, I would like you to explain the biological significance of this results in S. polychroa, which naturally does not reproduce by fission and regeneration. There are already reports of mutations accumulating in neoblasts in Dugesia japonica, which reproduce aexually by fission. For these reasons, I do not think this paper presents extremely novel results.
      4. p15, Discussion:<br /> "Tissue regeneration is best seen in the liver of mammals, and the regrowth of relapsed tumours following surgery can also be considered an example of a regenerative process. Mutagenesis accompanying these processes is relevant to subsequent tumorigenesis or the development of resistance, and the planarian system can provide a useful model for the mutagenic effect of tissue regeneration."

      Isn't it an overstatement to associate the regenerative system of planaria with the liver regeneration of mammals?<br /> 5. p10, Results:<br /> "We compared the two de novo spectra to the spectrum of germline heterozygous SNPs, present in all animals, and found that the pattern of germline substitutions resembled more closely the de novo spectrum of the control group (Fig 5D, Fig S3), implying that regeneration has a minor contribution to germline mutations in S. polychroa populations."<br /> p14, Discussion:<br /> "The high similarity of the spectrum of heterozygous SNPs and de novo mutations of control animals suggests that the species primarily reproduces in a non-regenerative manner. The increased mutation rate and the altered mutation spectrum upon regeneration confirmed our hypothesis that regeneration is a mutagenic process."

      I was very confused by these sentences and it took me some time to understand them. Triploid S. polychroa naturally does not reproduce by fission and regeneration, namely a non-regenerative manner. I do not understand why the author insists on this. Please explain the results for the regenerated case in Fig. 5D (0.88) in a way that is also easy to understand. Also, what is the biological significance of asserting here that de novo mutation by regeneration increases in a species that does not increase by regeneration and division in the first place?

      Minor comments:

      1. The author should add a schematic diagram showing the distribution of reproductive organs in Fig.1 to help the reader understand that the ovaries are not included in the regenerative fragment.
      2. P12, line12: Fig 6D-E, it's F, not E, right?
      3. P9, line 8:<br /> "these mutations were missing from the original egg but were present in the egg laid by the parent and thus represent the total mutation load of a generation."

      The author mentions that the de novo mutation found in offspring derived from parents that do not undergo regeneration was already present in the eggs, but I can find no evidence of this. Can you rule out the possibility that these mutations occurred between hatching and adulthood?<br /> 4. p10, Results:<br /> "Interestingly, the majority of mutations were shared in the siblings F4A and F4B. This suggests that the germ cells of these animals were descendants of the same stem cell, which underwent a high number of cell divisions early during the regeneration process prior to oocyte differentiation. The same finding also confirms that the detected clonal filial mutations were present in the respective oocyte and were not generated by embryonic cell divisions."

      The shared de novo mutations detected in the siblings (F4A and F4B) derived from the parent that underwent regeneration in Fig. 5A suggest that the germ cells of these siblings are descended from the same stem cell. The authors say that these mutations occurred in a large number of cell divisions early in the regenerative process prior to oocyte differentiation.

      So why is there no shared de novo mutation in the siblings (Fc4A and Fc4B) derived from the non-regenerating parent in Fig. 5A? As mentioned in Minor comment 3, the author states that the de novo mutations were already present in the parent-laid eggs, but when did these mutations, which are not shared, arise?<br /> 5. p11, Results:<br /> "Interestingly, in the case of FR4A-FR4B sibling pair, shared de novo mutations present in both were subclonal in R4 in a proportion comparable to the other samples (7/15 by WGS, 46.7%), while the three unique mutations could not be detected in R4 by the PCR approach, indicating again that the unique mutations, which amounted to approximately 10% of total clonal filial mutations in these two animals, arose late during germ cell regeneration."

      "during germ cell regeneration." the expression is too vague to know which stage you are referring to. In relation to minor comment 4, why not create a new chart to clearly show when the expected mutations occurred?<br /> 6. p12, Results:<br /> "Altogether 7/30 regenerant mutations were detected in PR animals, and these included those with the highest AF in the regenerants (Fig. 6C). This suggests that parental animals, even before regeneration, contained a diverse set of stem cells, and some of the detected de novo mutations in the filial generation resulted from the expansion of mutation-containing stem cell clones contributing also to germ cells in the regenerant animals."

      If the mutation in the offspring is derived from the parent (PR) prior to the time of tail amputation, wouldn't it be wouldn't it be strange to assume that it is a de novo mutation?<br /> 7. p12, Results:<br /> "The remaining 23/30 R- subclonal mutations may have arisen during regeneration. On average, ~250 dividing neoblasts were detected in cut tails of animals from the same population as the sequenced individuals, as determined by immunofluorescence of phosphorylated H3 histone (Fig 6D-E). However, the high proportions of body cells carrying regenerant-specific mutations suggest that certain stem cells contribute to disproportionately large parts of the regenerated body, including the germline."

      I did not quite understand the relevance of this discussion to the photos shown here of the M period (Fig. 6e).

      Significance

      General assessment: This paper contains important biological information. The finding that mutations in planarian stem cells cause diversity in the next generation of parthenogenesis is very interesting. However, I think that the author needs to carefully explain and change his argument, for example, that the mutations were caused by regeneration, which does not naturally occur in the species used.

      Advance: The finding that accumulation of mutations is occurring in planarian stem cells has already been reported in Dugesia japonica. Please cite the papers and clarify what is the key finding in this paper.

      Audience: Basic Research_Evolutionary Ecology, Developmental Biology (Stem Cells), Reproductive Biology

      Please define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      My field of study is reproductive biology. I am familiar with the transcriptome but unfamiliar with genome analysis.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the three reviewers for carefully reading our manuscript and for all considerations, ideas, suggestions, and comments. These were all very helpful for us to strengthen the scientific statements of our manuscript. Please, note that all changes are marked in red in the manuscript and supplement. Below you will find, point by point, our responses to all questions and comments.

      Reviewer #1 (Evidence, reproducibility and clarity):

      Overall, this is an exciting work. There are, however, several open questions that the authors could address to facilitate understanding of their work. These points are:

      1.) On page 5, lines 113ff, the authors mention the membrane bulges that they analyse in figure 1. They show these deformations by light (confocal) and electron microscopy. However, the bulges seen by confocal microscopy seem to be bigger that those seen by electron microscopy. The authors could quantify the sizes of the bulges for clarification.

      We quantified the size of the membrane bulges. At the confocal we measured in average 750nm as mean value of identified bulges (n=12) with 650nm as minimal and 890nm as maximal sizes. At the TEM we measure ~243nm as mean value (n=61), with a range between 62nm as minimum and 442 as maximum value. These measurements are shown as Figure 1E.

      Please note that measurements of TEM images do not always capture the three-dimensionality of bulges and may show only parts of them. In addition, ultrastructure is more sensitive and can easily detect small membrane changes that we cannot observe with confocal and airsycan microscopy. In contrast, even with our high-quality objective (63x Zeiss Plan Neofluar, Glycerin, 1.3 NA), standard confocal analysis is limited at ~200nm on the XY axis (airyscan ~110nm) and ~450nm on Z-axis. Therefore, TEM analysis detects smaller bulges than confocal analysis, and consequently, this method detected a large range of bulge lengths between 63nm and 441nm. In contrast, the airyscan method detected a range of bulge length between 0.65 and 0.83 µm. However, confocal and TEM analyses provide evidence of membrane bulges in pio mutant embryos. Please note that we extended our studies and now show membrane bulges in two different pio mutant alleles (17C and 5M) with airsycan microscopy.

      2.) The subject of the manuscript is rather complicated; presentation of data from Figure 1C and D on lines 113ff and 169ff is confusing.

      We apologize and thank the reviewer for careful reading. We revised both paragraphs (lines 108 – 123 and lines 166 - 174) and are confident that the descriptions are now much more understandable. All changes are marked in red.

      3.) The quality of the sub-images of Figure 2E differs. Especially, the phenotype of the wurst, pio transheterozygous embryo is not well visible.

      We apologize for it. We repeated the experiment with wurst;pio transheterozygotes, and generated wurst;pio double mutant embryos to improve the quality. The gas filling assay is shown in Fig. 3. With brightfield microscopy in overview images (10x air objective) and close-ups of the dorsal trunks (25x Glycerin objectives). Both show the gas-filling defects of dorsal trunk tubes. In a subsequent confocal analysis of chitin stainings in late-stage 17 embryos, we found that tracheal tube lumens are collapsed in the transheterozygotes and double mutant embryos.

      4.) Lines 246ff: the protein size are given for the mCherry:chimeric proteins; an estimate of the native Pio portions should be given.

      The endogenous Pio protein has a calculated mass of about 50.82 kDa. We state it now in the according legend of Fig. 6.

      5.) In Figure 6A, the appearance of chitin in the wildtype tube is different compared to the Np mutant situation, more filamentous. Can the authors comment on that?

      The author is correct. The chitin cable formation in Np mutant embryos is normal but lacks the condensation process, and, therefore, fiber structure of the chitin matrix differs from control embryos in late stage 16 and stage 17 embryos (see Drees et al., PLOS Genetics, 2019).

      6.) In the discussion section, I would appreciate if the timing of events was discussed or even shown in a model. The central question is: how are the functions of Pio and Np coordinated in time? As I understand, Np should not cleave Pio before morphogenesis is completed. Is there any example in the literature for how such an interaction could be controlled? The overexpression of Np shows that either the ratio between Np and Pio is important, or the btl promoter expresses Np at the "wrong" time point.

      We thank the reviewer for this interesting comment.

      Of course, we did not measure forces, but it has been published that axial forces appear at the apical cell membrane during stage 16 tube expansion. Our data show that Np cleaves Pio ZP domain and subsequent release increase during stage 16. The cleaved and released Pio enriches in the lumen during stage 16, from where cleaved Pio is internalized during stage 17 with the help of Wurst-mediated endocytosis. This is supported by several in vivo studies, video microscopy, antibody stainings and biochemical data, such as the interaction of Pio and Dumpy as well as the identification of different Pio products with and without Np cleavage. Moreover, we found membrane bulges that increase in size during stage 16 and identified a subsequent tear-off of the chitin matrix in Np mutant embryos. Thus, we propose that Np is required to cleave Pio-Dumpy linkages at the membrane-matrix when tubes elongate and postulated forces appear at the cell membrane during tube elongation in stage 16 embryos.

      We stated this in the discussion as follows:

      “The membrane defects observed in both Pio and Np mutants indicate errors in the coupling of the membrane matrix due to the involvement of Pio (Figs. 1,7). ..., the large membrane bulges in Np mutants affect the membrane and the apical matrix (Fig. 7). Since apical Pio is not cleaved in Np mutants (Fig. 7D), the matrix is not uncoupled from the membrane as in pio mutant embryos but is likely more intensely coupled, which leads to tearing of the matrix axially along the membrane bulges (Figs. 7, 9), when the tube expands in length.”

      How could Np be regulated at the membrane? Np is a zymogen that very likely undergoes ectodomain shedding for activation, similar to what has been described for matriptases. Additionally, human matriptase requires transient interaction of the stem region with its cognate inhibitor HAI-2, which Drosophila lacks (see Drees et al., PLOS Gen, 2019). Thus, the regulation of Np activation is not known.

      Further, we observed that Dumpy is not degraded in Np mutant embryos during stage 17. Nevertheless, in a previous publication, we showed that btl-G4 driven Np expression rescues Np mutant phenotypes in a time-specific manner. We used the btl-G4 driver line for these rescue experiments to express Np in tracheal cells. This restored tracheal Dumpy degradation in Np mutant embryos. Thus, btlG-G4 driven Np overexpression is able to rescue Np mutant tracheal phenotypes in a time-specific manner, although Gal4 is expressed from early tracheal development onwards. Further, btl-Gal4 driven Np expression mimics the endogenous Np, which is expressed from stage 11 onwards in all tracheal cells throughout embryogenesis (see Drees et al., PLOS Gen, 2019).

      Based on these experiments, we conclude that the btl-G4-driven Np overexpression can cleave Pio ZP domain in stage 16 embryos at the correct time.

      However, the ratio of Np expression and Pio is essential in the way that btl-Gal4 driven Gal4 Np overexpression may cause cleavage of a higher number of Pio proteins and the release of critical Pio-Dumpy linkages at the cell membrane and matrix. Thus, increased Pio shedding into the lumen reduces Pio linkages at the membrane, resulting in a pio mutant like tracheal overexpansion in btl-Gal4 driven Gal4 Np overexpression.

      Finally, we were able to prove the reviewer’s question in a new experiment. We used btl-Gal4 driven UAS-Np embryos for Pio antibody staining. This revealed Pio enrichment at the tracheal chitin cable in stage 14 and 15 embryos. In contrast, stage 16 embryos showed numerous Pio puncta appearing across the entire tube lumen, indicating that Np mediates Pio shedding specifically in stage 16 embryos and not before. This Np-controlled Pio releases modifies tube length control.

      Therefore, we stated this in the manuscript as follows:

      Results:

      “Our data assumes that Np overexpression may enhance Pio shedding in stage 16 embryos, affecting the Pio-mediated ZP matrix function. Upon breathless (btl)-Gal4-mediated expression of UAS-Np in tracheal cells, we observed a high amount of Pio puncta across the entire tracheal tube lumen, specifically in stage 16 embryos but not in earlier stages (Fig. S13). Consistently tracheal Np overexpression led to tube overexpansion in stage 16 embryos resembling the pio mutant phenotype (Fig. 8A,B). Thus, Np-mediated Pio shedding controls Pio function.”

      Discussion:

      “The btl-Gal4-driven Np expression mimics the endogenous Np from stage 11 onwards in all tracheal cells throughout embryogenesis (Drees et al., 2019), suggesting that Np is not expressed at a wrong time point. However, the ratio between Np and Pio is essential. We assume that Np overexpression increases Pio shedding, resulting in a pio loss-of-function phenotype. Thus, the tube length overexpansion upon Np overexpression indicates that Pio cleavage is required for tube length control.

      Our observation that the membrane deformations are maintained in Np mutant embryos supports our postulated Np function to redistribute and deregulate membrane-matrix associations in stage 16 embryos when tracheal tube length expands. In contrast, Np overexpression potentially uncouples the Pio-Dpy ZP matrix membrane linkages resulting very likely in unbalanced forces causing sinusoidal tubes.”

      7.) Also for the discussion: We have two situations where Pio amounts/density are enhanced at the apical plasma membrane. The wurst experiments on lines 136ff show that Pio amount and density depends on endocytosis; is the wurst phenotype (Figure 2), at least partially, due to over-presentation of Pio? Likewise, in Figure 2C, there is more Pio in Cht2 overexpressing tracheae (but there is overall more Pio in these tracheae) - is actually endocytosis reduced in chitin-less luminal matrices? First: does the Pio signal at the apical plasma membrane correspond to membrane-Pio or free-Pio? Second, as in the case of wurst: would more Pio on the membrane (density) affect tracheal dimensions in Cht2 over expressing tracheae? Or are the consequences of Pio accumulation in the apical plasma membrane different in Cht2 and wurst backgrounds? Maybe cleavage of Pio and its endocytosis are dependent on its interaction with the chitin matrix. These questions connect to the question immediately above: how are the functions of the different players coordinated in space and time? We need a discussion on this issue.

      We thank the reviewer for this very important idea to discuss the functions of the different players in a coordinated space and time and apologize that we haven’t done before.

      As this is an important point, we tried to figure out all questions raised by the reviewer and discussed it in several new paragraphs in the discussion:

      "Indeed, the anti-Pio antibody, which can detect all different Pio variants, showed a punctuate Pio pattern overlapping with the apical cell membrane marker Uif at the dorsal trunk cells of stage 16 embryos. Additionally, Pio antibody also revealed early tracheal expression from embryonic stage 11 onwards, and due to Pio function in narrow dorsal and ventral branches, strong luminal Pio staining is detectable from early stage 14 until stage 17, when airway protein clearance removes luminal contents.

      We generated mCherry::Pio as a tool for in vivo Pio expression and localization pattern analysis during tube lumen length expansion. The mCherry::Pio resembled the Pio antibody expression pattern from early tracheal development onwards. However, luminal mCherry::Pio enrichment occurs specifically during stage 16, when tubes expand. The stage 16 embryos showed mCherry::Pio puncta accumulating apically in dorsal trunk cells. Moreover, mCherry::Pio puncta partially overlapped with Dpy::YFP and chitin at the taenidial folds, forming at apical cell membranes. Supported by several observations, such as antibody staining, Video monitoring, FRAP experiments, and Western Blot studies (Figs. 4,5), these findings indicate that Pio may play a significant role at the apical cell membrane and matrix in dorsal trunk cells of stage 16 embryos.

      Furthermore, we show that Np-mediates Pio ZP domain cleavage for luminal release of the short Pio variant during ongoing tube length expansion. The luminal cleaved mCherry::Pio is enriched at the end of stage 16 and finally internalized by the subsequent airway clearance process during stage 17 after tube length expansion. Such rapid luminal Pio internalization is consistent with a sharp pulse of endocytosis rapidly internalizing the luminal contents during stage 17 (Tsarouhas et al., 2007). Wurst is required to mediate the internalization of proteins in the airways (Behr et al., 2007; Stümpges and Behr, 2011). In consistence, during stage 17, luminal Pio antibody staining fades in control embryos but not in Wurst deficient embryos.

      Nevertheless, Pio and its endocytosis depend on its interaction with the chitin matrix and the Np-mediated cleavage. In stage 16 wurst and mega mutant embryos, we detect Pio antibody staining at the chitin cable, suggesting that Pio is cleaved and released into the dorsal trunk tube lumen. Also, the Cht2 overexpression did not prevent the luminal release of Pio. However, reduced wurst, mega function, and Cht2 overexpression caused an enrichment of punctuate Pio staining at the apical cell membrane and matrix (Figs. 1,2). Although the three proteins are involved in different subcellular requirements, they all contribute to the determination of tube size by affecting either the apical cell membrane or the formation of a well-structured apical extracellular chitin matrix, indicating that changes at the apical cell membrane and matrix in stage 16 embryos affect the Pio pattern at the membrane. It also shows that local Pio linkages at the cell membrane and matrix are still cleaved by the Np function for luminal Pio release, which explains why those mutant embryos do not show pio mutant-like membrane deformations and Np-mutant-like bulges. This is in line with our observations that tracheal Pio overexpression cannot cause tube size defects as the Np function is sufficient to organize local Pio linkages at the membrane and matrix. Therefore, it is unlikely that tracheal tube length defects in wurst and mega mutants as well as in Cht2 misexpression embryos are caused the apical Pio density enrichment.

      Nevertheless, oversized tube length due to the misregulation of the apical cell membrane and adjacent chitin matrix may cause changes to local Pio set linkages and the need for Np-mediated cleavage. Strikingly, we observe a lack of Pio release in Np mutants. This shows that Pio density at the membrane versus lumen depends predominantly on Np function. The molecular mechanisms that coordinate the Np-mediated Pio cleavage are unknown and will be necessary for understanding how tubes resist forces that impact cell membranes and matrices. On the other hand, Pio is required for the extracellular secretion of its interaction partner Dpy. At the same time, Dpy is needed for Pio localization at the cell membrane and its distribution into the tube lumen. Consistently, in vivo, mCherry::Pio and Dpy::eYFP localization patterns overlap at the apical cell surface and within the tube lumen. These observations support our model that Pio and Dpy interact at the cell surface where Np-mediates Pio cleavage to support luminal Pio release by the large and stretchable matrix protein Dpy (Fig. 9).

      Taenidial organization prevents the collapse of the tracheal tube. Therefore, cortical (apical) actin organizes into parallel-running bundles that proceed to the onset of cuticle secretion and correspond precisely to the cuticle's taenidial folds (Matusek et al., 2006; Öztürk-Çolak et al., 2016). Mutant larvae of the F-actin nucleator formin DAAM show mosaic taenidial fold patterns, indicating a failure of alignment with each other and along the tracheal tubes (Matusek et al., 2006). In contrast, pio mutant dorsal tracheal trunks contained increased ring spacing (Fig. 3A). Fusion cells are narrow doughnut-shaped cells where actin accumulates into a spotted pattern. Formins, such as Diaphanous, are essential in organizing the actin cytoskeleton. However, we do not observe dorsal trunk tube fusion defects as found in the presence of the activated diaphanous.

      On the other hand, ectopic expression of DAAM in fusion cells induces changes in apical actin organization but does not cause any phenotypic effects (Matusek et al., 2006). DAAM is associated with the tyrosine kinase Src42A (Nelson et al., 2012), which orients membrane growth in the axial tube dimension (Förster and Luschnig, 2012). The Src42 overexpression elongates tracheal tubes due to flattened axially elongated dorsal trunk cells and AJ remodeling. Although flattened cells and tube overexpansion are similar in pio mutant embryos, we did not observe a mislocalization of AJ components, as found upon constitutive Src42 activation (Förster and Luschnig, 2012). Instead, we detected an unusual stretched appearance of AJs at the fusion cells of pio mutant dorsal trunks, which to our knowledge, has not been observed before and may play a role in regulating axial taenidial fold spacing and tube elongation.

      Self-organizing physical principles govern the regular spacing pattern of the tracheal taenidial folds (Hannezo et al., 2015). The actomyosin cortex and increased actin activity before and turnover at stage 16 drive the regular pattern formation. However, the cell cortex and actomyosin are in frictional contact with a rigid apical ECM. The Src42A mutant embryos contain shortened tube length but increased taenidial fold period pattern due to decreased friction. In contrast, the chitinase synthase mutant kkv1 has tube dilation defects and no regular but an aberrant pearling pattern caused by zero fiction (Hannezo et al., 2015).

      In contrast, pio mutant embryos do not contain tube dilation defects or shortened tubes but increased tube length (Figs. 1; 8; S1). Furthermore, our cbp and antibody stainings reveal the presence of a luminal chitin cable and a solid aECM structure in pio mutant stage 16 embryos (Figs. 8, S1; S6). In addition, apical actin enrichment in tracheal cells of pio mutant embryos appeared wt-like. Nonetheless, pio mutant embryos show an increased taenidial fold period compared with wt, indicating a decreased friction. Thus, we propose that the lack of Pio reduces friction. Reasons might be subtle defects of actomyosin constriction or chitin matrix, which we have not detected in the pio mutant tracheal cells. Further reasons for lower friction might be the loss of Pio set local linkages between apical cortex and aECM in stage 16 embryos, which are modified by Np, as proposed in our model (Fig. 9).

      Heterozygous and homozygous pio mutant embryos generally do not show tubal collapse. However, the loss of Pio and accompanying lack of Dpy secretion in stage 17 pio mutant embryos led to the loss of a Pio/Dpy matrix, impacting the late embryonic maturation and differentiation of a normal chitin matrix at the apical cell surface. TEM images reveal reduced dense chitin matrix material at taenidial folds and misarranged taenidial fold pattern (Figs. 1; S2), suggesting impaired taenidial function prevents tube lumen from collapsing after tube protein clearance. Wurst knockdown and mutant embryos do not show general tube collapse, but luminal chitin fiber organization is disturbed in stage 17 embryos (Behr et al., 2007). Therefore, transheterozygous wurst;pio mutant embryos may combine both defects and suffer from maturation deficits of the chitin/ZP matrix at the apical cell surface and within the tube lumen, which finally causes a high number of embryos with incomplete gas filling due to tube collapse. These maturation deficits are even more dramatic in the wurst;pio double mutants, which show no gas filling.”

      8.) The sentence on line 242ff should be rephrased: "dynamic" and "elastic" are not opposites.

      We thank the reviewer for careful reading. We revised the sentence as follow:

      “Our FRAP data suggest that Pio is the dynamic part of the tracheal ZP-matrix, while the static Dpy modulates mechanical tension within the matrix”

      9.) A central question to me is the amounts and the density of factors in different genetic backgrounds as mentioned above. Is there any mechanism adjusting the amounts or the density of the players according to the size of the apical plasma membrane or the tracheal lumen? Pio seemingly responds to these changes.

      We would like to know the molecular mechanisms that control the density of players at the apical membrane. This question is important and could be the starting point for novel scientific investigations. Mechanisms of protein trafficking, such as exocytosis, recycling and endocytosis regulate delivery and internalization of proteins at the apical cell membrane. Furthermore, protein junctions at the lateral membrane may recognize and therefore may respond to low and high mechanical stresses between cells that appear during tube length expansion. However, we did not observe any hint for misregulation of Pio expression levels in the different mutants which affect endocytosis, SJs and luminal ECM. But we observed a shift of Pio levels between apical cell membrane/matrix and lumen in wurst, mega mutants and Cht2 overexpression. This shift is analyzed with diverse ZEN tools and quantified (Fig. 2D-F; Fig. S4B). As discussed in the new paragraph, this shift is very likely caused by changes at the apical cell membrane and chitin matrix which impact Pio shedding. Moreover, we observe the lack of Pio release in Np mutants. This shows that Pio density at the membrane versus lumen depends predominantly on Np-mediated cleavage. As discussed above, how Np is activated at the apical cell membrane to cleave Pio is not known.

      10.) The connection of Pio and taenidia is mentioned in the results section (page 7) but not discussed.

      We appreciate the careful reading and comments of the reviewer very much. We included the connection of Pio and taenidial in the discussion section as follows:

      “Taenidial organization prevents the collapse of the tracheal tube. Therefore, cortical (apical) actin organizes into parallel-running bundles that proceed to the onset of cuticle secretion and correspond precisely to the cuticle's taenidial folds (Matusek et al., 2006; Öztürk-Çolak et al., 2016). Mutant larvae of the F-actin nucleator formin DAAM show mosaic taenidial fold patterns, indicating a failure of alignment with each other and along the tracheal tubes (Matusek et al., 2006). In contrast, pio mutant dorsal tracheal trunks contained increased ring spacing (Fig. 3A). Fusion cells are narrow doughnut-shaped cells where actin accumulates into a spotted pattern. Formins, such as Diaphanous, are essential in organizing the actin cytoskeleton. However, we do not observe dorsal trunk tube fusion defects as found in the presence of the activated diaphanous.

      On the other hand, ectopic expression of DAAM in fusion cells induces changes in apical actin organization but does not cause any phenotypic effects (Matusek et al., 2006). DAAM is associated with the tyrosine kinase Src42A (Nelson et al., 2012), which orients membrane growth in the axial tube dimension (Förster and Luschnig, 2012). The Src42 overexpression elongates tracheal tubes due to flattened axially elongated dorsal trunk cells and AJ remodeling. Although flattened cells and tube overexpansion are similar in pio mutant embryos, we did not observe a mislocalization of AJ components, as found upon constitutive Src42 activation (Förster and Luschnig, 2012). Instead, we detected an unusual stretched appearance of AJs at the fusion cells of pio mutant dorsal trunks, which to our knowledge, has not been observed before and may play a role in regulating axial taenidial fold spacing and tube elongation.

      Self-organizing physical principles govern the regular spacing pattern of the tracheal taenidial folds (Hannezo et al., 2015). The actomyosin cortex and increased actin activity before and turnover at stage 16 drive the regular pattern formation. However, the cell cortex and actomyosin are in frictional contact with a rigid apical ECM. The Src42A mutant embryos contain shortened tube length but increased taenidial fold period pattern due to decreased friction. In contrast, the chitinase synthase mutant kkv1 has tube dilation defects and no regular but an aberrant pearling pattern caused by zero fiction (Hannezo et al., 2015).

      In contrast, pio mutant embryos do not contain tube dilation defects or shortened tubes but increased tube length (Figs. 1; 8; S1). Furthermore, our cbp and antibody stainings reveal the presence of a luminal chitin cable and a solid aECM structure in pio mutant stage 16 embryos (Figs. 8, S1; S6). In addition, apical actin enrichment in tracheal cells of pio mutant embryos appeared wt-like. Nonetheless, pio mutant embryos show an increased taenidial fold period compared with wt, indicating a decreased friction. Thus, we propose that the lack of Pio reduces friction. Reasons might be subtle defects of actomyosin constriction or chitin matrix, which we have not detected in the pio mutant tracheal cells. Further reasons for lower friction might also be the loss of Pio set local linkages between apical cortex and aECM in stage 16 embryos, which are modified by Np, as proposed in our model (Fig. 9).

      Heterozygous and homozygous pio mutant embryos generally do not show tubal collapse. However, the loss of Pio and accompanying lack of Dpy secretion in stage 17 pio mutant embryos led to the loss of a Pio/Dpy matrix, impacting the late embryonic maturation and differentiation of a normal chitin matrix at the apical cell surface. TEM images reveal reduced dense chitin matrix material at taenidial folds and misarranged taenidial fold pattern (Figs. 1; S2), suggesting impaired taenidial function prevents tube lumen from collapsing after tube protein clearance. Wurst knockdown and mutant embryos do not show general tube collapse, but luminal chitin fiber organization is disturbed in stage 17 embryos (Behr et al., 2007). Therefore, transheterozygous wurst;pio mutant embryos may combine both defects and suffer from maturation deficits of the chitin/ZP matrix at the apical cell surface and within the tube lumen, which finally causes a high number of embryos with incomplete gas filling due to tube collapse. These maturation deficits are even more dramatic in the wurst;pio double mutants, which show no gas filling.”

      11.) Dp remains cytoplasmic in pio mutant background - is the pio mutant phenotype due to defects by lack of Pio AND Dp function? What is the tracheal phenotype of dp mutants?

      It has been discussed that dumpyolvr and pio mutants show similar phenotypes in early tracheal development (Jazwinska, 2003) and it has been discussed that dumpyolvr mutant embryos compromise tube size in combination with shrub mutants. The additional quantifications of the dumpyolvr mutant showed significantly increased tube length (Dong 2014). We used dumpyolvr mutant [In(2L)dpyolvr], an X-ray induced mutation of the dumpy gene locus (Wilkin 2000). dumpyolvr mutant resemble pio null mutant tracheal phenotypes including detached dorsal and ventral branches and oversized tracheal dorsal trunk with curly appearance in late embryos. We included chitin and Uif staining’s of stage 16 dumpy mutant embryos (Fig. S10).

      This data suggest that Pio mutant phenotype is due to a lack of Pio and Dumpy, which would support our model, of Pio and Dumpy protein interaction in the extracellular space of the tube lumen.

      In wt embryos Pio is predominantly in the luminal chitin cable, in contrast in dumpy mutant embryos most Pio is predominantly not at the luminal chitin cable. Less luminal Pio staining in dumpy mutant embryos but Pio accumulation apically shows that Dumpy is required for luminal Pio release in stage 16 embryos. This supports our model that Pio and Dumpy interaction may link membrane and matrix and that this link reacts on mechanical stress during tube expansion by Np-mediated cleavage of Pio and its accompanied luminal release due to linked Dumpy.

      12.) Lines 374ff: the reduced dorsal trunk in Np mutants is not significant; the respective statement should be formulated carefully. If we believe the statistics (no significance), this would mean that attachment of the apical plasma membrane to the luminal chitin via Pio is needed to restrict axial extension; release of Pio is needed for differentiation (taenidia formation, luminal clearance) beyond morphogenesis.

      We agree with the reviewer that the reduction of the dorsal trunks in Np mutant is statistically not significant. However, the mean value is clearly below that of WT. Therefore, we revised our statement as follow: “In Np mutant embryos, tracheal dorsal trunk length shows the tends to be reduced compared to wt embryos.” Further, the btlG4-driven UAS-Np overexpression of Np suggests strong Pio release from the apical membrane and therefore resembles the pio mutant tube length overexpansion (Fig. 8A,B; Fig S13). Thus, our current observations indicate that Np-mediated Pio release at the cell membrane enables precise tube length elongation.

      We thank the referee for discussing that Pio is needed for taenidial fold formation which would fit to our findings in pio null mutant embryos. Pio mutant embryos show the appearance of taenidial folds in stage 16 embryos (airyscan) and stage 17 embryos (TEM images). However, TEM images also show chitin matrix reduction in pio mutant stage 17 embryos. Further, co-stainings of Pio with Crb and Uif, as well as co-stainings of mCherry::Pio with Dpy-GFP and cbp confirms that the Pio localize at the apical cell membrane where taenidial folds form in late stage 16 embryos. Thus, our observations suggest that Pio and Dumpy are required at the apical membrane and matrix to stabilize taenidial folds and tube lumen during 17. This also includes the Np-mediated Pio release at the apical cell membrane. As requested by the referee we summarized Pio function during late tracheal development in our simplified model (see Fig. 9).

      However, it is of note that Np-mediated Pio release increases at late stage 16 (Fig. 5A, 6D; Fig. S13) but is strongly reduced in stage 17 embryos. In contrast, thin taenidial fold are formed at late stage 16 and becomes thicker and form at fusion points during stage 17 and reach their most mature form when the intraluminal chitin cable is cleared (Öztürk-Colak et al., elife, 2016). Thus, the pattern of Pio release and taenidial fold differentiation do not fully match. Moreover, in preliminary experiments we observe Pio antibody staining in stage 17 embryos at the apical cell membrane of dorsal trunks (data not shown). Furthermore, lumen clearance of Obst-A, Knk, Sepr and Verm are not affected in pio mutant embryos, but unknown luminal ECM contents remained (Fig. 1D). Therefore, we will follow this very interesting idea in future experiments.

      Nonetheless, we state in the results that Pio shedding is essential:

      “Our data assumes that Np overexpression may enhance Pio shedding in stage 16 embryos, affecting the Pio-mediated ZP matrix function. Upon breathless (btl)-Gal4-mediated expression of UAS-Np in tracheal cells, we observed a high amount of Pio puncta across the entire tracheal tube lumen, specifically in stage 16 embryos but not in earlier stages (Fig. S13). Consistently tracheal Np overexpression led to tube overexpansion in stage 16 embryos resembling the pio mutant phenotype (Fig. 8A,B). Thus, Np-mediated Pio shedding controls Pio function.”

      13.) Why don't we see the apical Pio signal in Figure 4B?<br />

      The red arrowhead points to apical mCherry::Pio punctuate staining in the Fig. 5B (before 4B) in the close up of the “bleached area” before bleaching and 56min post bleaching. However, in vivo bleaching experiments do not allow additional antibody stainings to detect precisely the apical cell membrane. Further, the Dpy::eYFP marks the tube lumen and the apical cell surface. The latter showed adjacent mCherry::Pio punctuate staining. However, due to bleaching Dpy signal was not detectable in the area.

      14.) The Strep signals in the merges in Figure 7C are not well visible.

      We are not sure which Strep signal the reviewer is referring to in Fig. 7C, which is now Fig. 8C. The top panel shows the Strep signal (right panel) overlapping with GFP in cells that do not express Np or human matriptase. Thus, the TGFB3 ZP domain is not cleaved, and the intracellular GFP and also the extracellular Strep signals are maintained and overlap.

      In contrast, when Np or human matriptase is added, the TGFB3 ZP domain is cleaved and only the intercellular GFP signal is retained, whereas the extracellular Strep signal is released from the cell surface. This explains why the Strep signal is barely detectable in the middle and lower panels of Fig. 8C.

      Reviewer #1 (Significance):

      This work brings together several factors (Pio, Dp, Np, Wst etc) already known to be needed for tracheal morphogenesis and differentiation in the embryo of D. melanogaster. Having worked myself with some of these factors, however, I recognize that the interaction between these factors is novel and very exciting. The experiments strongly indicate a new mechanism of cell-ECM connection that seems to be conserved to some extent (as they provide preliminary data on an example from humans). By integrating the functions of different factors, the work provides ample opportunity for future projects to elucidate this mechanism in detail. Therefore, I expect that it will have a significant impact not only on the field of developmental cell biology but also, due to the conserved proteins involved (ZP proteins, Matriptase), on the field of cell biology of human diseases.

      Reviewer #2 (Evidence, reproducibility and clarity):

      _The figures are clear, and the questions well addressed. However, I find that some of the claims are not completely backed by the data presented and have some suggestions that will hopefully make some points clearer.

      Major comments

      1.) In the abstract and at the end of the introduction the authors claim that they show that Pio, Dpy and Np support the balancing of mechanical stresses during tracheal tube elongation. However, this is not shown in this manuscript, where tension or mechanical stress were not measured and it is therefore speculative._

      As requested by the reviewer, we deleted “support balancing of” at the final sentence of the Introduction. Please, note that we did not use the term balancing of mechanical stresses at the abstract.

      However, we revised the abstract.

      It has been shown previously that forces and mechanical tension rise when apical membrane expands and elastic extracellular matrix, which is anchored to the membrane balances theses forces (Dong et al., 2014). Furthermore, its has been shown that the gigantic and elastic Dumpy protein modulates mechanical tension (Wilkin et al., 2000). Thus, these previous publications state that mechanical tension rise at the apical cell membrane and matrix when tubes expand during stage 16 and that Dpy is part of that molecular process, which we included in the abstract as essential background information.

      “The apical membrane is anchored to the apical extracellular matrix (aECM) and causes expansion forces that elongate the tracheal tubes. The aECM provides a mechanical tension that balances the resulting expansion forces, with Dumpy being an elastic molecule that modulates the mechanical stress on the matrix during tracheal tube expansion.”

      Nonetheless, our results show that Np-mediated Pio cleavage increases during stage 16 as response to tube length expansion which is accompanied by forces as postulated by others (see above). We further observe that the membrane bulges and chitin matrix tear off, when Pio cleavage does not occur in Np mutant embryos. Our data further show that Pio and Dumpy interact and that Pio release is prevented in Dpy mutant embryos. Altogether this suggests that the Np-mediated Pio cleavage responds to tube expansion and requires Dpy for luminal Pio release.

      We therefore claim in the final sentence of the introduction that “…ZP domain proteins Pio and, Dumpy, as well as the protease Np respond to mechanical stresses when tracheal tubes elongate”. The according changes are marked in red.

      2.) The authors state that all pio CRISPR/Cas9 generated mutants display identical tracheal phenotypes, however these data are not shown. Tracheal phenotypes, in particular DT phenotypes, of all mutants generated should be shown in supplementary materials.

      As requested by the reviewer, we included the data in the supplement. The pio5M and pio11R alleles showed embryonic lethality and a 100% gas filling defect resembling the pio17C allele. Additionally, we extended the tracheal analysis with the pio5M allele and identified tube size defects, irregular pattern of taenidial folds and apical membrane deformation, altogether resembling the pio17C allele. These new data are shown in the supplement Fig. S1.

      We clarify this in the results section as follows:

      “The tracheal phenotypes of pio5m are shown in the supplement (Fig. S1B-F). In all other Figures, we show images of the pio17c allele. “

      3.) At stage 16, pio null mutants display DT overelongation phenotypes (Fig. 1). The authors should quantify this phenotype.

      As requested by the reviewer, we quantified the DT overelongation phenotypes for pio5M (Fig. S1). The quantification of pio17C was shown already in Fig. 6B, now Fig 8B.

      4.) The authors analyse Pio distribution under tubular stress, using mega mutants and Chitinase overexpression. Pio localization changes in these genetic backgrounds and this is shown in Figure 2 only in a qualitative manner. The authors should measure Pio localization at the lumen and at the membrane and provide quantitative data.

      As requested by the referee, we measured Pio localization recognized by the anti-Pio antibody at the lumen and at the membrane to provide quantitative data. These are shown in Fig. 2E.

      All images were taken with a Zeiss Airyscan. For statistical analysis we used the the profile tool of the Zeiss ZEN 2.3 black software. This tool allows the measurement and comparison of fluorescence pixel intensities of individual channels. We determined the fluorescent intensities profile across the tube to identify values at apical membrane and tube lumen at minimum 10 different position of DTs (metameres 5 to 6) of two distinct embryos for each genetic background. The maximum values of membranes versus tube lumen were set into ratio and compared between control, mega mutant and Cht2 overexpression. The control embryos showed a ration below 0.4, the Cht2 overexpression a ratio of 1.2 and mega mutants a ratio of about ~0.9. These quantitative data confirm the statement that Pio localization increases at and near the apical cell membrane with respect to the lumen in mega mutants and in Cht2 overexpression embryos.

      5.) Surprisingly and interestingly, wurst;pio transheterozygotes display very strong tracheal defects. The authors say they observe gas filling defects; however it is not clear from figure 2E if this indeed the case. From the panel in the figure, it looks like these embryos suffer from strong tracheal morphogenetic defects. It would be necessary to have a better analysis of these embryos. What is the penetrance of this phenotype. If this is 100% penetrant, one would expect it to be lethal. Therefore, double mutant balanced stocks are not viable? Having analyzed the phenotypes and confirmed which morphogenetic defects the transheterozygote embryos present, how does this genetic interaction fit with the model presented?

      We are thankful to the reviewer for this interesting point of view suggesting that the wurst;pio embryos display tracheal morphogenetic defects. First, our data show that only 11.6% of the wurst;pio transheterozygous embryos completed gas filling and survived until adulthood. In contrast, 88.4% of transheterozygous wurst;pio mutant embryos did not complete gas filling which is now presented in Fig. 3B. The corresponding quantifications is presented in Fig. 3D. Importantly, the 88.4% wurst;pio transheterozygous embryos which show gas filling defects do not hatch as larvae and die.

      As requested, we performed a better morphogenetic analysis, which is presented in Fig. 3C. Analysis of the gas filling defects with light microscopy were repeated with a better objective (Zeiss Apochromat 25x Gly; 0.8 NA). Indeed, this analysis revealed a strongly compromised tube lumen morphology with irregular tube lumen pattern as if tubes twist and bend. This tube lumen deformation was further confirmed with the confocal analysis of chitin staining (cbp). The tube lumen of stage 17 transheterozygous wurst;pio mutant embryos showed irregular lumen pattern with unusual twists and even partially collapsed tubes.

      Furthermore, as asked by the referee, we generated the wurst,pio double mutation. All wurst,pio double mutant embryos lacked gas filling. In a more in-depth analysis of the tube lumen with a high-performance objective we could not identify any normal tube lumen in stage 17 embryos. Instead the double mutant embryos revealed completely collapsed tracheal tubes. This was confirmed by the chitin staining and confocal analysis. All new data are presented in the supplement.

      As shown in our manuscript and in previous publications, neither pio nor wurst mutant embryos affect cell polarity or gross organization of the actin and tubulin cytoskeleton. However, we found that wurst mutant embryos showed irregular apical membrane expansion at tube lumen (Behr et al., 2007; legend Fig. 4), irregular chitin fiber organization and to some extend collapsed tube lumen. In pio mutant embryos we found deformed apical membrane of DTs, irregular pattern of taenidial folds and to some extend collapsed tube lumen. Thus, the apical membrane is their common target of both proteins in late embryonic development, suggesting that pio functions provide stability and wurst functions the internalization of proteins at the apical membrane.

      We discussed it as follows:

      “Nevertheless, Pio and its endocytosis depend on its interaction with the chitin matrix and the Np-mediated cleavage. In stage 16 wurst and mega mutant embryos, we detect Pio antibody staining at the chitin cable, suggesting that Pio is cleaved and released into the dorsal trunk tube lumen. Also, the Cht2 overexpression did not prevent the luminal release of Pio. However, reduced wurst, mega function, and Cht2 overexpression caused an enrichment of punctuate Pio staining at the apical cell membrane and matrix (Figs. 1,2). Although the three proteins are involved in different subcellular requirements, they all contribute to the determination of tube size by affecting either the apical cell membrane or the formation of a well-structured apical extracellular chitin matrix, indicating that changes at the apical cell membrane and matrix in stage 16 embryos affect the Pio pattern at the membrane. It also shows that local Pio linkages at the cell membrane and matrix are still cleaved by the Np function for luminal Pio release, which explains why those mutant embryos do not show pio mutant-like membrane deformations and Np-mutant-like bulges. This is in line with our observations that tracheal Pio overexpression cannot cause tube size defects as the Np function is sufficient to organize local Pio linkages at the membrane and matrix. Therefore, it is unlikely that tracheal tube length defects in wurst and mega mutants as well as in Cht2 misexpression embryos are caused by the apical Pio density enrichment.”

      “Heterozygous and homozygous pio mutant embryos generally do not show tubal collapse. However, the loss of Pio and accompanying lack of Dpy secretion in stage 17 pio mutant embryos led to the loss of a Pio/Dpy matrix, impacting the late embryonic maturation and differentiation of a normal chitin matrix at the apical cell surface. TEM images reveal reduced dense chitin matrix material at taenidial folds and misarranged taenidial fold pattern (Figs. 1; S2), suggesting impaired taenidial function prevents tube lumen from collapsing after tube protein clearance. Wurst knockdown and mutant embryos do not show general tube collapse, but luminal chitin fiber organization is disturbed in stage 17 embryos (Behr et al., 2007). Therefore, transheterozygous wurst;pio mutant embryos may combine both defects and suffer from maturation deficits of the chitin/ZP matrix at the apical cell surface and within the tube lumen, which finally causes a high number of embryos with incomplete gas filling due to tube collapse. These maturation deficits are even more dramatic in the wurst;pio double mutants, which show no gas filling.”

      6.) mCherry::Pio Dpy::eYFP time lapse analysis and FRAP experiments is very interesting. However, it is not clear to which degree bleaching occurs in the tracheal lumen. The authors claim that recovery is very fast and can be seen from minute 2, however, frame-by-frame analysis of Movie S2 does not show a clear different between luminal Pio from minute 0 to minute 2. Rough comparison with the luminal area surrounding the bleached area, does not show a clear difference in luminal Pio before and after photobleaching. To claim fast recovery of luminal Pio after photobleaching, the authors should quantify luminal Pio, before and after bleaching.

      We agree with the reviewer and deleted “fast”. The Video2 shows intracellular mCherry::Pio recovery within 2min after photobleaching. The Video 2 shows extracellular (luminal) recovery within 6min after photobleaching, when first large mCherry::Pio puncta appear at the apical surface of the bleached area. Nonetheless, mCherry::Pio puncta appear in the lumen indicating recovery, whereas Dpy::eYFP did not.

      We state this in the Results section as follows:

      “In stage 16 embryos mCherry::Pio puncta reappeared in tracheal cells within 2 minutes of bleaching and in the tubular lumen within 6 minutes.”

      In addition, in figure 4D, the normalized mCherry::Pio fluorescence in the graph what does it refer to? Intracellular Pio?

      Figure 4D, now 5D, shows Western Blot signals. We guess that you refer to Fig 4B which is Fig. 5B.

      We are sorry for confusion and named it now Fig. 5B’.

      We stated in the Material section:

      “The bleaching was performed with 405nm full laser power (50mW) at the ROI for 20 seconds. A Z-stack covering the whole depth of the tracheal tubes in the ROI were taken at each imaging step. “Fluorescence intensity in the bleached ROIs was measured after correction for embryonic movements using Fiji.”

      Thus, to clarify this point, we added to the legends:

      “Fluorescence intensities refer to the bleached ROIs as indicated with the frame in corresponding Movie S2 and was measured after correction for embryonic movements.”

      7.) When mCherry::Pio Dpy::eYFP time lapse analysis and FRAP experiments was done in an Np mutant background, the authors describe lack of Pio recovery within the lumen (Movie S3). However, when comparing control and Np mutant background embryos, Pio is not properly released into the lumen of Np mutants (as stated by the authors and seen by comparing movies S1 and S4). Furthermore, on minute 0 of the FRAP experiment in Np embryos, there is no detectable Pio in the DT lumen. Therefore, recovery was not expected in Np mutants and should not be claimed as a conclusion for this experiment.

      We thank the reviewer for careful reading and apologize our wrong description. We changed it accordingly as follows:

      “In contrast to the control, extracellular mCherry::Pio is not released into the tube lumen within 56 min after bleaching in Np mutant embryos (Fig. 6C, Video S3).”

      8.) Brodu et al (Dev Cell 2010) have shown that Pio is important for cytoskeletal modulation during tracheal maturation. Pio is important for non-centrosomal microtubule (MT) arrays anchored at the tracheal cell apical membranes. In addition, MT disruption in tracheal cells leads to lumen formation defects (Brodu et al, Dev Cell 2010). In the absence of Pio, the tracheal cytoskeleton is altered, and this could explain some of the results observed. Ideally, the work should be complemented with a basic cytoskeletal analysis, but if this is not possible, the authors should discuss some of the phenotypes in light of this Pio function.

      Dear reviewer, this is a great idea. Therefore, we analyzed F-actin with Phalloidin and beta tubulin (E7 antibody, DSHB) in the dorsal trunk cells of stage 16 control and pio mutant embryos. However, tracheal cells are tiny and only gross irregularities can be realized. So, confocal Z-stack analysis of the stainings did not show gross differences between control and pio mutant embryos. We observe the expected apical subcortical accumulation for the actin and tubulin cytoskeleton in dorsal trunk cells of pio stage 16 mutant embryos which also has been shown for wt embryos elsewhere. These new data are presented in the supplement Fig. S7.

      Minor comments<br /> The model should not be in supplementary materials and should be moved to the main manuscript.

      We thank the reviewer for this suggestion and moved the model to the main part – now Fig.9. As requested by the reviewer 1, we extended the model, showing the timing events of Pio function.

      Throughout the manuscript embryonic stages are described using different nomenclature (stage X, stX and st X). Either way is correct, but the same nomenclature should be used throughout.

      We apologize for the different nomenclature and use "stage X" in the manuscript and "stX" in the figures for space reasons. Legend 1 clarifies the abbreviation.

      In Fig. S1 B and C the authors should specify which pio allele is being analysed (as in Fig. 7). The same should be done in the text.

      That's a fairly good point. To be clear from the beginning, we now state the following in the first paragraph of the results:

      “The tracheal phenotypes of pio5m are shown in the supplement (Fig. S1B-F). In the all other Figures, we show phenotypes of the pio17c allele.”

      Line 131, it is not correct to say that WGA visualizes cell membranes. WGA marks/stains cell membranes.

      Thanks for finding this mistake, it’s now corrected.

      Line 165 "leads to excessive tube dilation and length expansion due to strongly reduced luminal chitin" is not correct. Chitin reduction leads to excessive tube dilation but not to length expansion, as reported in the papers cited at the end of the sentence.

      Thanks very much for careful reading, we deleted “and length expansion” from the sentence.

      Line 220-221, what do authors refer to as "stage 16 wt-like control embryos"?

      Thanks for finding these mistakes. We corrected as follows:

      “In stage 16 embryos mCherry::Pio puncta….”

      Line 221, "some minutes" should be replaced by a specific number of minutes. According to Movie S2 reappearance of tracheal cell Pio happens from minute 16.

      We agree with the reviewer to state the time when mCerry::Pio puncta reappear. We observe first large puncta within two minutes after bleaching in tracheal cells at the ROI (Video S2, lower cell row at the movie). We further observe the reappearance of first large puncta at the ROI within 6 minutes in the tracheal tube lumen.

      We corrected it as follows: “In stage 16 embryos mCherry::Pio puncta reappeared in tracheal cells within 2 minutes of bleaching and in the tubular lumen within 6 minutes.”

      Line 291 "time laps" should be lapse.

      Thanks for finding the typo, it is corrected now.

      Line 302, "Pio was not shedded into the lumen but remained at the cell" should be "Pio was not shed into the lumen but remained in the cell".

      Thanks for finding the typo, it is corrected now.

      _Referees cross-commenting

      I agree. Taken together, all the comments will improve the quality of the work and of a future manuscript. Also, everything seems quite doable and will not present any problems._

      Reviewer #2 (Significance):

      _The findings shown in this manuscript shed light on the regulation of tubulogenesis by ZP proteins and how their interaction with the ECM can be regulated by proteolysis. It was known that Pio is involved in tracheal development, is secreted into the lumen, regulating tube elongation (Jaźwińska et al., Nat.Cell Biol., 2003) and anchoring MTs to the apical membrane during tubulogenesis (Brodu et al, Dev. Cell 2010). This work provides additional molecular insights into Pio dynamics and regulation during tube maturation.<br /> This work will be of interest to a broad cell and developmental biology community as they provide a mechanistic advance in ZP proteins involved in morphogenesis. It is of specific interest to the specialized field of tubulogenesis and tracheal morphogenesis.

      Field of expertise:<br /> Drosophila, morphogenesis, tracheal tubulogenesis, cytoskeleton_

      Reviewer #3 (Evidence, reproducibility and clarity):

      _Summary<br /> In this manuscript, Drees and colleagues analysed, during the formation and growth of tubular systems, how cells combine forces at the cell membranes while maintaining tubular network integrity. A fundamental question is to understand how cells manage to integrate the axial forces to stabilise the cell membrane and the apical extracellular matrix (aECM).<br /> To address this question, the authors study the formation of the tracheal system in Drosophila embryos, a well-established and detailed model system to investigate formation of tubular networks. In particular, they focused on the formation of the larger tube of the tracheal network, the dorsal trunk. The formation of this tube depends in part of axial extension along the antero-posterior axis.<br /> They concentrated their work on the function of Piopio (Pio), a Zona-Pellucida (ZP)-domain protein. They showed that Pio together with the protease Notopleural (Np) contribute the sense and support mechanical stresses when tracheal tubes elongate, thus ensuring normal membrane -aECM morphology.

      Major Comments

      In a previous work, Drees et al. (PLOS Genetics 2019), showed the matriptase-prostasin proteolytic cascade (MPPC), is conserved and essential for both Drosophila ECM morphogenesis and physiology.<br /> The functionally conserved components of the MPPC mediate cleavage of zona pellucida-domain (ZP-domain) proteins, which play crucial roles in organizing apical structures of the ECM in both vertebrates and invertebrates. They showed that ZP-proteins are molecular targets of the conserved MPPC and that cleavage within the ZP-domains is a conserved mechanism of ECM development and differentiation.<br /> Here, Drees et al. investigate further how the coupling between membrane and matrix takes place to ensure proper tube growth.<br /> Pio distribution and phenotypes<br /> They first focused on the tracheal phenotypes observed in a pio null mutant context. So far, the only pio mutant characterised was a point mutation in the ZP domain. Using CRISPR/Cas9, they generated new alleles of pio which are lack of function alleles. In the context, Drees and colleagues observed over-elongated dorsal trunk tubes, with bulges appearing at stage 16 between the apical domain of tracheal cells and adjacent extra-luminal matrix.<br /> Additionally, pio mutant embryos showed impaired tube lumen clearance of the some of the aECM components, which prevent gas-filling of the airways.<br /> To detect Pio distribution, the authors used either anti-Pio antibody directed toward a short stretch with the Pio ZP domain or generated a CRISPR/Cas9 piomCherry::pio line.

      _

      1.) The Pio antibody shows a strong luminal staining as already published. But the authors reported an apical membrane signal in tracheal cells. I find this apical membrane signal really difficult to observe in panel Fig. 2B. The overlap between the Pio dots and the apical membrane labelled with Uif showed in Fig 2C can be due to the 3D projection. It is only when endocytosis is unpaired (Suppl Fig. 2), that data are more convincing.

      We thank the reviewer for this important point, we are sorry for the unconvincing presentation and for having the chance to improve it.

      We show the 3D image of Pio puncta as voxels overlapping with Uif at the apical cell membrane. The amount of Pio voxels overlapping with the Uif marked apical cell membrane increased in mega mutant and due to tracheal Cht2 overexpression. This result was indicated by a representative region (frame) and white arrows and is shown now in Fig. 2C.

      We further used orthogonal projections across the tracheal tube of the airyscan Z-stacks. Random usage confirmed that puncta of Pio antibody staining overlap with Uif at the tube lumen. We observed overlap in controls, but increasing overlap in mega mutant and Cht2 overexpressing embryos. This result is shown now in Fig. 2E.

      However, to overcome any misinterpretations of projections, we used single images of the original airyscan Z-stacks for co-localization analysis with the Zeiss ZEN software (black, 2.3, sp1). We used two available and independent standard methods to compare fluorescence pixel intensities of different channels namely the ZEN co-localization and the ZEN profile tool. Both are described in the Materials section.

      a.) With the co-localization tool we compared directly fluorescence pixel intensities of Pio and Uif. Highest overlap of the intensities, shown in the ZEN tool as third quadrant, were set to white for better visualization in the images. These new images are included as Fig. 2D and show recurrent overlap of Pio and Uif antibody stainings (punctuate pattern) along the apical cell membrane at the dorsal trunk of stage 16 control embryos. This overlap pattern increased in mega mutant and Cht2 overexpression embryos.

      b.) A second approach for comparing fluorescence intensities is the ZEN “profile” tool. Drawing a line across the tube allowed us to compare peak fluorescence pixel intensities of the different channels at distinct regions, such as the apical cell membrane and the tube lumen including the cbp marked chitin cable. This tool detected overlap of peak fluorescence intensities of UIF and Pio antibody staining’s, confirming that Pio is located together with UIF at the apical membrane of dorsal trunk tracheal cells. These new intensity profiles and the corresponding images are presented in the supplement as Fig. S4B-D. Quantifications of this method comparing the ration of Pio peak intensities between the apical cell membrane and the tube lumen are presented as Fig. 2F (as requested by Reviewer 2).

      2.) When the author used their CRISPR/Cas9 piomCherry::pio line to characterise Pio distribution (Fig.4), Pio is localised at the apical plasma membrane before stage 16. Only at stage 16, Pio is detected within the lumen. This timing of Pio release in the lumen is critical for the model proposed by Drees at al. This is an important point to assess the difference between the use of the antibody (which mostly label the lumen) while piomCherry::pio line is mostly at the membrane.

      We agree with the reviewer that the Pio antibody shows a different pattern within the tube lumen of earlier stages. The Pio antibody shows intense extracellular staining from early stage 12 onwards, presumably due to its early function at dorsal and ventral branches, as shown by Anna Jazwinska (Jazwinska et al., 2003). The intense luminal Pio antibody staining, predominantly at the chitin cable, persist until its disappearance due to airway protein clearance during stage 17. Unfortunately, this strong luminal Pio staining made it impossible to examine the Pio distribution pattern in more detail during stage 16. Nevertheless, Np overexpression experiments indicate that luminal Pio release occurs specifically in stage 16 embryos (Fig. S13), which was tested with the Pio antibody, see results, second last paragraph:

      “Our data assumes that Np overexpression may enhance Pio shedding in stage 16 embryos, affecting the Pio-mediated ZP matrix function. Upon breathless (btl)-Gal4-mediated expression of UAS-Np in tracheal cells, we observed a high amount of Pio puncta across the entire tracheal tube lumen, specifically in stage 16 embryos but not in earlier stages (Fig. S13).”

      We further agree with the reviewer that mCherry::Pio was used to characterize in vivo Pio distribution within the dorsal trunk cells and tube lumen during stage 16. The Fig. 5A shows apical mCherry::Pio distribution pattern in early and late stage 16 embryos. Importantly, the appearance of luminal mCherry::Pio increased during stage 16 and mainly enriched at late stage 16. See Figure 5A, red arrowheads point to apical Pio and red arrows to luminal Pio staining.

      Furthermore, as discussed above and shown by different ZEN tools, such as co-localization and fluorescence intensity profile tools, Pio antibody stainings revealed a punctuate pattern at the apical cell membrane of dorsal trunk cells in stage 16 embryos, which is reflected also by the appearance of apical mCherry::Pio puncta at the membrane surface. Additionally, we observed mCherry::Pio puncta also within the tube lumen (see the new Figures S4B & S8). Thus, subcellular Pio distribution at the apical cell membrane and lumen were observed for both, Pio antibody staining and mCherry::pio pattern.

      Nonetheless, there is different luminal appearance between the Pio antibody staining and mCherry::Pio. Pio antibody detects a short stretch at the ZP domain and thus detects all possible Pio variants, uncleaved and cleaved. Due to early tracheal Pio function, Pio enriches within the tube lumen in an intense core-like structure, which is recognized by the Pio antibody and is comparable with the Dpy::eYFP pattern. Also mCherry::Pio labels all Pio variants, uncleaved and cleaved. The spatial temporal mCherry::Pio expression pattern (Fig. S5) is comparable with the Pio antibody pattern and the staining at the membrane in stage 16 embryos. However, mCherry::Pio did not enrich in the lumen in a core-like structure, nonetheless, shows overlap with luminal Dpy::eYFP.

      Jaswinska showed that Pio antibody staining is intracellular in the trachea of stage 11 pio2R-16 point mutation embryos (Jaswinska et al., 2003; Fig 2d). To understand more about the specificity of the antibody, we performed stainings in the null mutant embryos. In contrast, to the high number of intracellular Pio puncta in pio2R-16 point mutation embryos, Pio stainings were much more reduced in pio5m and pio17c mutants, but a low number of Pio puncta were still detectable in the embryos (Fig. S1G,H). It is of note that also dpy mutants showed strongly reduced Pio antibody staining (Fig. S10E). Thus, discussing underlying causes of enriched (Pio antibody) versus non-enriched (mCherry::Pio) luminal staining are speculative. However, observations by Jaswinska et al. (2003) and our new observations, investigating the Pio antibody stainings in pio null mutants, dpy mutants, eYFP::Dpy embryos and NP overexpression may hint to the possibility of cross-reactivity of the Pio antibody to other ZP domains which may intensify the appearance of luminal Pio antibody staining in control embryos.

      Anyway, we clarify the difference in luminal Pio pattern in the discussion as follows:

      “Indeed, the anti-Pio antibody, which detects all different Pio variants, showed a punctuate Pio pattern overlapping with the apical cell membrane markers Crb and Uif at the dorsal trunk cells of stage 16 embryos (Fig. 2; Fig. S3,S4). Additionally, Pio antibody also revealed early tracheal expression from embryonic stage 11 onwards, and due to Pio function in narrow dorsal and ventral branches, strong luminal Pio antibody staining is detectable from early stage 14 until stage 17, when airway protein clearance removes luminal contents. In the pio5m and pio17c mutants Pio stainings were strongly reduced although some puncta were still detectable in the trachea (Fig. S1G,H). Similarly, Pio antibody staining is intracellular in the trachea of stage 11 pio2R-16 point mutation embryos (Jaźwińska et al., 2003). Interestingly, also dpy mutants showed strongly reduced and intracellular Pio antibody staining (Fig. S10E).

      We generated mCherry::Pio as a tool for in vivo Pio expression and localization pattern analysis during tube lumen length expansion. The mCherry::Pio resembled the Pio antibody expression pattern from early tracheal development onwards. However, luminal mCherry::Pio enrichment occurs specifically during stage 16, when tubes expand. The stage 16 embryos showed mCherry::Pio puncta accumulating apically in dorsal trunk cells. Moreover, mCherry::Pio puncta partially overlapped with Dpy::YFP and chitin at the taenidial folds, forming at apical cell membranes. Supported by several observations, such as antibody staining, Video monitoring, FRAP experiments, and Western Blot studies (Figs. 4,5), these findings indicate that Pio may play a significant role at the apical cell membrane and matrix in dorsal trunk cells of stage 16 embryos.”

      3.) Another important point is to explain the discrepancy between the pio mutant alleles. The allele containing a point mutation in the ZP domain shows no over-elongated tubes (Dong et al 2014, Jazwinska et al. 2003) while the lack of function alleles does.

      The reviewer is correct that the pio2R-16 mutation shows only a disintegration phenotype whereas our pio null mutations show in addition tube length defects. However, Dong et al. showed significantly increased dorsal trunk length in shrub; pio2R-16 double mutant embryos when compared with shrub mutant embryos (Supplemental Fig. S4A). Also, the shrub;dpyolvR double mutant embryos revealed increased tube length expansion when compared with shrub mutant embryos. Moreover, their quantifications show that the also dpyolvR mutant embryos revealed significantly increased tube expansion when compared with wt. Altogether these previous findings suggests that Pio and Dpy are involved in controlling tube length control during stage 16.

      Furthermore, we generated three independent pio null mutation alleles, which lost all the essential Pio protein domains, and caused all embryonic lethality, gas-filling defects, branch disintegration phenotype and tube length defects (quantifications are shown in Figs. 9 and S1). In addition, pio null mutations prevent Dpy::eYFP secretion. Thus, we are confident that the observed tube length defects as well as the air-filling defects are due to the loss of Pio, and in particular since these defects could be rescued by Pio Expression in the pio null mutation background, as shown in Fig. 3B.

      So, what could make the difference?

      The described pio2R-16 mutation allele contains a X-ray induced single point mutation that led to an amino acid replacement (V159D) in the ZP domain. It is not clear how the amino acid exchange affects the protein and the ZP domain. It may hamper pio function and maybe this amino acid replacement is problematic for the early tracheal function but not during stage 16. As stated by Jazwinska et al. 2003 (Fig. 2 legend), Pio antibody staining is intracellular in the mutants and extracellular in the trachea of wt at stage 13.

      They further speculate that the mutant Pio protein may retain in the secretory pathway, but this is not confirmed with co-markers. As luminal Pio function is required to provide a barrier for autocellular AJ formation, this fails in pio2R-16 mutation. In contrast, it is still possible that Pio interacts and supports Dpy secretion in pio2R-16 mutation and additionally it is thinkable that intracellular Pio may reach to some extend the apical cell membrane in pio2R-16 mutation stage 16 and thus can support tube size control. But these assumptions are speculations.

      Nevertheless, to clarify this point we explain the discrepancy between the pio2R-16 mutation and pio null mutations alleles as follows:

      “Using CRISPR/Cas9, we generated three pio lack of function alleles (Fig. S1A), all exhibiting embryonic lethality and identical tracheal mutant phenotypes. The tracheal phenotypes of pio5m are shown in the supplement (Fig. S1B-F). In all other Figures, we show images of the pio17c allele. The pio17c and pio5m null mutant embryos revealed the dorsal and ventral branch disintegration phenotype known from a previously described pio2R-16 mutation allele which contains a X-ray induced single point mutation that led to an amino acid replacement (V159D) in the ZP domain (Jaźwińska et al., 2003). Additionally, the late stage 16 pio17c and pio5m null mutant embryos showed over-elongated tracheal dorsal trunk tubes (see below).”

      4.) A minor point, the author should provide hypothesis to explain why only the clearance of CBP, Obstructor-A and Knickkopf are affected in a pio mutant background and not Serpentine and Vermiform.

      We thank the reviewer for careful reading and the comment on this point. We would be happy to see such a scenario which could give us a hind of Pio interaction partners at the chitinous matrix. However, we stated that luminal material, such as Obst-A and Knk are removed from the lumen (see Fig. S5A). We further describe that in pio mutant embryos, luminal Serp and Verm staining appeared reduced but showed wt-like distribution (see Fig. S6) in stage 16 embryos. We do not show Serp and Verm in stage 17 embryos, but they are removed from the tube lumen (not shown). These data are received from immune-staining’s and confocal analysis.

      Nevertheless, we also state that pio mutant embryos revealed lumen clearance defects in TEM analysis, of undefined material in the tube lumen (see Fig. 1D and Fig. S2B).

      To clarify this point we state in the results as follows:

      “Fourth, ultrastructure TEM images revealed aECM remnants in the airway lumen of pio mutant stage 17 embryos, while control embryos cleared their airways (Fig. S2B). Consistently, the in vivo analysis of airways in stage 17 pio mutant embryos revealed lack of tracheal air-filling (Fig 3B). The pan-tracheal expression of Pio in pio mutant embryos rescued the lack of gas filling (Fig 3B). Thus, TEM images suggest that pio mutant embryos showed impaired tube lumen clearance of aECM, which prevented subsequent airway gas-filling. “

      And

      “Also, the pio mutant embryos showed tracheal lumen clearance defects of chitin fibers in ultrastructure (TEM) analysis (Figs. 1D, S2B). In contrast, confocal analysis revealed that well-known chitin matrix proteins, such as Obstructor-A (Obst-A) and Knickkopf (Knk), are removed from the lumen of pio mutants (Fig. S5A). These results suggest that the Pio function did not affect airway clearance of Obst-A and Knk and therefore did not play a central role in airway clearance like Wurst. Nevertheless, airway clearance defects observed in TEM images in pio null mutant embryos and, in addition, defective tube lumen morphology in wurst;pio transheterozygous mutant embryos explain the occurrence of airway gas filling defects.”

      5.) Pio and Dumpy. Dumpy (Dpy) is another ZP domain protein secreted by the tracheal cells and detected in the lumen. To follow Dpy distribution, Drees and colleagues used a Dpy::eYFP protein trap line, the same used in Dong et al. However, in this latter paper, Dong et al. stated, using a Crb staining, that Dpy is not at the apical cell surface but only in the lumen. However, Drees and colleagues reported (line 227 and Fig. 4C) that Dpy appears both at the apical cell surface and in the lumen of the tracheal system. But they did not show a co-localisation with an apical marker. Furthermore, in their previous work, (Drees et al. 2019) they called the apical staining a "peripheral shell" layer. In addition, in S2R+ cell culture, it is only when Pio and Dpy co-express that Dpy is detected at the cell membrane. The in vivo localisation of Dpy is an important point that needs to be clarified as it is of importance for the final model proposed Supp Fig. 9.<br /> Drees at al. also performed FRAP experiments on Dpy::eYFP protein trap embryos. As excepted as already shown by Dong et al.

      The referee is correct, we state “In stage 16 embryos Dpy::eYFP (Lye et al., 2014) appears at the tracheal apical cell surface and predominantly within the lumen (Fig. 4C).” The corresponding Fig. 4C reveals Dumpy::eYFP staining overlapping with chitin at two subcellular regions: Dpy is enriched as a core-like structure within the lumen overlapping with the chitin cable of the control embryos. Additionally, Dpy::eYFP overlaps with the chitin part that might be part of the apical cell surface. But this observation is hard to see in images in Fig. 4C and we apologize it. We therefore repeated the Dpy::eYFP localization analysis and analyzed in more detail with the ZEN profile tools, which shows peak fluorescence pixel intensities of different channels and provides the possibility to prove, if they overlap in XY axis.

      We asked first, if cbp (chitin) appears at the apical surface of dorsal trunk cells, when Pio becomes cleaved and released. In mid stage 16 embryos cbp staining appeared in the luminal chitin cable and additionally in a distinctive pattern, which fits to the pattern of taenidial folds that start to form. We therefore used the apical cell membrane marker Crumbs to co-stain cbp. Airycsan microscopy fluorescence intensity profile analysis and corresponding close ups images confirmed the overlap of Crb and cbp stainings at this distinctive pattern indicating this shows the chitin matrix at the apical cell surface (Fig. S8A). But there was no overlap of cbp and Crb at the chitin cable structure. Thus, knowing the localization of the apical cell surface chitin matrix, we performed co-stainings of cbp with mCherry::Pio (RFP antibody). This revealed, as expected, overlap of cbp and RFP antibody staining at the apical cell surface chitin matrix (distinct pattern) and with the luminal chitin-cable (Fig. S8B,C). Finally we repeated the stainings and analysis with cbp, mCherry::Pio (RFP antibody) and Dpy::eYFP (GFP antibody). First, these results revealed overlap of Dpy::eYFP and cbp at the apical cell surface and in the tube lumen (Fig. S8D) and second, overlap of punctuate staining of Dpy::eYFP, cbp and mCherry::Pio at the apical cell surface chitin matrix and also at the luminal chitin cable (Fig. S8E).

      Very obvious from images and Z-projection in Fig. 4C is the lack of extracellular Dpy::eYFP staining in pio mutant embryos. Dpy::eYFP enriched intracellularly, and thus, the pio mutant caused Dpy::eYFP mis-expression fits well to our results from S2R+ cell culture. As the reviewer notes, it is only when Pio and Dpy co-express that Dpy is detected at the cell membrane.

      Altogether, Fig. 4C, cell culture experiments and our new stainings support our model, that Pio and Dumpy interact and are co-secreted at the apical cell membrane/surface, where Np mediates Pio cleavage. As requested by reviewer 2, we moved the model to Fig. 9. As requested by reviewer 1, we extended the model for timing events.

      A minor point, the Dpy::eYFP protein trap line used in this study is not listed in the Materials and Methods section of the supplementary data.

      Thanks, we included it into the List of sources (Supplement). This YFP-trap line (called CPTI lines) was published by Claire M. Lye et al., Development, 141, 2014. We cite it in our manuscript.

      6.) The serine protease NP and Pio release. Drees and colleagues have pervious shown, preforming in vitro studies, that protease Notopleural (Np) cleaves the Pio ZP domain (Drees at al. 2019). Here the authors went a step further in demonstrating that it is also true in vivo at stage 17. In addition, they showed that, in Np mutant embryos, mCherry::Pio is mostly detected within tracheal cells and the luminal staining is strongly reduced. In this mutant context, the authors conducted FRAP experiment on the mCherry::Pio signal even very weak in the lumen. They showed hardly no recovery after photobleaching.<br /> In Drosophila S2 cells, Drees and colleagues showed that co-expression of the catalytically inactive NpS990A with mCherry::Pio in showed as a prominent signal the 90kDa mCherry::Pio variant in the cell lysate (Fig. 5B), and live imaging revealed mCherry::Pio localisation at the cell surface (Fig. S6B). However, in this inactive form context, a strong signal is also detected at 60kDA corresponding to a cleaved form of the Pio ZP domain (Fig. 5B), and Pio localisation at the cell surface appears weaker than in controls. They authors did not consider that another protease could be at play.<br /> On the other hand, in their previous work, Drees et al. identified a mutant form of Pio (PioR196A) which is resistant to NP cleavage in vitro. It will be a step forward to establish by CRISPR/cas9, as the authors seems to be successful with this technique, a mutant line carrying this point mutation. It will be important to determine whether the observed phenotype resembles that of a mutant Np phenotype.<br /> In their previous work (PLOS Genetics 2019), in Np mutant embryos, Drees et al. did not report "budge-like" deformations from stage 16 onwards leading to the detachment of the tracheal cell from their adjacent aECM. Either the alleles or the allelic combination is different between the two studies which could explain this difference, or it is a new phenotype that has not been previously described. In the latter case, it becomes important to quantify the proportion of segments showing these bubbles. Is this a rare phenotype to observe?

      We thank the reviewer for the very interesting comments and the careful reading of our manuscripts and the very useful suggestions. We agree, the we cannot exclude the possibility that another protease is involved in the cleavage of Pio. Therefore, we included this important point in the discussion section as follows:

      “Unknown proteases may likely be involved in Pio processing since cleaved mCherry::Pio is also detectable in inactive NpS990A cells.”

      We think the generation of the pioR196A mutant to address Pio localization and tracheal phenotypes is a great idea, which we would like to address in future experiments. Unfortunately, the production of this fly line with such a specific point mutation at this position will take several months, not included the subsequent evaluation and phenotypic analysis of this fly line and mutants. Therefore, we apologize that we cannot pursue this question experimentally. Nevertheless, mentioning the possibility and the requirement of such an experiment is important and we discuss it as follows:

      “Previously we identified a mutation at the Pio ZP domain (R196A) resistant to NP cleavage in cell culture experiments (Drees et al., 2019). Establishing a corresponding mutant fly line would be essential in determining whether the observed phenotype resembles the phenotype of the Np mutant embryos.”

      However, knowing that we are not able to provide a new mutant fly line to evaluate the formation of the dorsal tube when an NP non-cleavable form of Pio is expressed, we sought to use an alternative approach by overexpressing Np in the trachea with btl-Gal4. This shows a clear pairing of Np overexpression and Pio release specifically at stage 16 dorsal trunk and associated tube overexpansion.

      Finally, the reviewer is correct, we did not mention the appearance of bulges in Np mutant tracheal dorsal trunk cells in our previous publication. We used that same Np alleles in 2019 and a closer look at the publication of 2019 likewise shows the appearance of bulges in Np mutant embryos, e.g. Fig. 1B (red-dextran, left part of the tracheal lumen shows bulges) and even the Dpy::YFP matrix tear off at the site of bulges (Fig. 4F’’, above the arrowhead). But we did not know at the time the link with Pio and Dumpy

      However, we agree, it is important to know more about the appearance of the phenotype by means of quantifications. The quantifications of bulges per dorsal trunk (n=16) is shown in Fig. 7B.

      7.) Minor point: I don't understand what the authors are trying to show in supplementary Figure 8. Tracheal cells detach and are found in the lumen?

      We are sorry for the unclear description in the legend. We corrected it as follows in the legend of Fig. S12:

      “This indicates disintegration of apical cell membrane at bulges and subsequent leaking of cellular content into the lumen.”

      8.) Np function conserved matriptase.<br /> In this work, Drees and colleagues showed that Np controls in vivo the cleavage of the Pio ZP domain.<br /> Dumpy and Piopio are not conserved in vertebrates but they both contain a ZP domain which is conserved. The authors tested if other ZP proteins can be cleaved by Np or the human homolog Matriptase. The authors tested in cell culture the ability of the type III Transforming growth factor-β receptor which contains a ZP domain to be cleaved either by Np or Matriptase.<br /> This could be a general mechanism that needs to be extended to other ZP domain proteins and that could be at play to structure the matrix and give it its physical properties.<br /> However, as it is all speculative, I find the discussion section related to these data, for too long and that does not help to understand better the work done in the formation of the tracheal tubes of the drosophila embryo.

      We show that Np mediates cleavage of the Pio ZP domain in vitro and in vivo in Drosophila embryos. We further showed that also the human matriptase was able to cleave the Pio ZP domain. To understand if this is a more general mechanism, we extended our studies with the human TβIII and its ZP domain. These data show that both Drosophila and human matriptases are able to cleave ZP domains of different proteins from different species. These data suggest that Matriptase-mediated ZP domain cleavage is not a Drosophila specific mechanism. We cannot follow the argumentation of the referee to state it all speculative. Nevertheless, we agree that it will need follow up studies to show that the mechanism is more general than two different species and ZP domain proteins. Anyway, as requested by the referee, we deleted the following sentences of the paragraph, since they are speculative in the context of our manuscript and do not directly describe a potential matriptase and ZP domain function:

      “Matriptase degrades receptors and ECM in pulmonary fibrinogenesis in squamous cell carcinoma (Bardou et al., 2016; Martin and List, 2019). TβRIII is a membrane-bound proteoglycan that generates a soluble form upon shedding (López-Casillas et al., 1991), a potent neutralizing agent of TGF-β. Expression of the soluble TβRIII inhibits tumor growth due to the inhibition of angiogenesis (Bandyopadhyay et al., 2002). Idiopathic pulmonary fibrosis (IPF) is associated with a progressive loss of lung function due to fibroblast accumulation and relentless ECM deposition (King et al., 2011; Loomis-King et al., 2013). “

      However, the comparisons of the tubular organ and the phenotypic expressions of the bulging membrane and the aortic aneurysm appear to us as an important element of the article. In both cases, cell membrane loses its integrity and can break in tubular networks. Thus, with our findings on the modification of extracellular ZP proteins, we offer a potential new molecular approach even for clinical investigation.

      9.) Minor points: Pio and cytoskeleton organisation.<br /> Line 78-79, the authors wrongly quoted a work from Brodu et al (2010). Pio does not anchor the microtubule severing enzyme Spastin. Instead, Spastin releases the microtubule-organising centre from its centrosomal location, then Pio contributes to its apical membrane anchoring. It can therefore be assumed that the organisation of the microtubule network is affected in a pio null mutant. In addition, ZP proteins have been shown to link the aECM to the actin cytoskeleton. Therefore, it would be interesting to look at the organisation of the actin and microtubule cytoskeletons in a pio mutant context in which enlarged apical cell surface area are observed.

      We are very thankful for finding this mistake in the introduction. We corrected it as follows:

      “Further, Pio is involved in relocating microtubule organizing center components γ-TuRC (γ-tubulin and Grips; gamma-tubulin ring proteins). This requires Spastin-mediated release from the centrosome and Pio-mediated γ-TuRC anchoring in the apical membrane.”

      Studying cytoskeleton in pio mutant embryos is a helpful idea. Therefore, we analyzed F-actin with Phalloidin and beta tubulin (E7 antibody, DSHB) in the dorsal trunk cells of stage 16 control and pio mutant embryos. However, tracheal cells are tiny and only gross changes can be realized. The confocal Z-stack analysis of the stainings did not show gross differences between control and pio mutant embryos. We observe the expected apical subcortical accumulation for the actin and tubulin cytoskeleton in dorsal trunk cells of pio stage 16 mutant embryos which also has been shown for wt embryos elsewhere. These new data are presented in the supplement Fig. S7.

      _Referees cross-commenting

      I have just read the comments of the other two reviewers, who like me are specialists in the formation of the tracheal system in the drosophila embryo.<br /> I find the comments very fair and balanced. They are in the same spirit as my comments and are very complementary. I hope that all our comments will be constructive for the authors and will improve the quality of their work._

      Reviewer #3 (Significance):

      _Overall, the methodology is sound, the quality of the data is good and the paper is very well written. Authors combine in vivo, in vitro studies as well a cell culture approach. Using CRISPR/Cas9, they generated a large number of new tools allowing in vivo studies.<br /> Drees and colleagues generated new alleles of pio which are lack of function alleles. They described a new phenotype for pio mutant embryos, namely over-elongated tubes. But they authors do not comment on why these new alleles reveal a new phenotype. Furthermore, using their piomCherry::pio line, the authors state that Pio is localised to the plasma membrane. This location is very difficult to assess. Both new results require clarification.<br /> The authors had already demonstrated that Np cleaves the ZP domain of Pio in vitro. Here they demonstrate this in vivo. It appears important to evaluate the formation of the dorsal tube when an NP non-cleavable form of Pio is expressed.<br /> Finally, the model proposing a coupling between the extracellular matrix and the membrane of tracheal cells is very interesting. The demonstration that cleavage of Pio by Np could participate in this coupling is very interesting for those interested in the integration of mechanical stress and cellular deformation. However, such a model has already been discussed in Dong et al (2014). In this article, Dong et al. proposed that a "coupling of the apical membrane and Dpy matrix core is essential for tube length regulation".

      The audience for this article should be specialised and oriented towards basic research. It may be of interest to people working on tubular systems or working on ZP proteins.

      My field of expertise is cell biology and developmental biology in drosophila and formation of tubular networks._

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary

      In this manuscript, Drees and colleagues analysed, during the formation and growth of tubular systems, how cells combine forces at the cell membranes while maintaining tubular network integrity. A fundamental question is to understand how cells manage to integrate the axial forces to stabilise the cell membrane and the apical extracellular matrix (aECM).<br /> To address this question, the authors study the formation of the tracheal system in Drosophila embryos, a well-established and detailed model system to investigate formation of tubular networks. In particular, they focused on the formation of the larger tube of the tracheal network, the dorsal trunk. The formation of this tube depends in part of axial extension along the antero-posterior axis.<br /> They concentrated their work on the function of Piopio (Pio), a Zona-Pellucida (ZP)-domain protein. They showed that Pio together with the protease Notopleural (Np) contribute the sense and support mechanical stresses when tracheal tubes elongate, thus ensuring normal membrane -aECM morphology.

      Major Comments

      In a previous work, Drees et al. (PLOS Genetics 2019), showed the matriptase-prostasin proteolytic cascade (MPPC), is conserved and essential for both Drosophila ECM morphogenesis and physiology.<br /> The functionally conserved components of the MPPC mediate cleavage of zona pellucida-domain (ZP-domain) proteins, which play crucial roles in organizing apical structures of the ECM in both vertebrates and invertebrates. They showed that ZP-proteins are molecular targets of the conserved MPPC and that cleavage within the ZP-domains is a conserved mechanism of ECM development and differentiation.<br /> Here, Drees et al. investigate further how the coupling between membrane and matrix takes place to ensure proper tube growth.<br /> Pio distribution and phenotypes<br /> They first focused on the tracheal phenotypes observed in a pio null mutant context. So far, the only pio mutant characterised was a point mutation in the ZP domain. Using CRISPR/Cas9, they generated new alleles of pio which are lack of function alleles. In the context, Drees and colleagues observed over-elongated dorsal trunk tubes, with bulges appearing at stage 16 between the apical domain of tracheal cells and adjacent extra-luminal matrix.<br /> Additionally, pio mutant embryos showed impaired tube lumen clearance of the some of the aECM components, which prevent gas-filling of the airways.<br /> To detect Pio distribution, the authors used either anti-Pio antibody directed toward a short stretch with the Pio ZP domain or generated a CRISPR/Cas9 piomCherry::pio line.<br /> The Pio antibody shows a strong luminal staining as already published. But the authors reported an apical membrane signal in tracheal cells. I find this apical membrane signal really difficult to observe in panel Fig. 2B. The overlap between the Pio dots and the apical membrane labelled with Uif showed in Fig 2C can be due to the 3D projection. It is only when endocytosis is unpaired (Suppl Fig. 2), that data are more convincing.<br /> When the author used their CRISPR/Cas9 piomCherry::pio line to characterise Pio distribution (Fig.4), Pio is localised at the apical plasma membrane before stage 16. Only at stage 16, Pio is detected within the lumen.<br /> This timing of Pio release in the lumen is critical for the model proposed by Drees at al. This is an important point to assess the difference between the use of the antibody (which mostly label the lumen) while piomCherry::pio line is mostly at the membrane.<br /> Another important point is to explain the discrepancy between the pio mutant alleles. The allele containing a point mutation in the ZP domain shows no over-elongated tubes (Dong et al 2014, Jazwinska et al. 2003) while the lack of function alleles does.<br /> A minor point, the author should provide hypothesis to explain why only the clearance of CBP, Obstructor-A and Knickkopf are affected in a pio mutant background and not Serpentine and Vermiform.

      Pio and Dumpy<br /> Dumpy (Dpy) is another ZP domain protein secreted by the tracheal cells and detected in the lumen. To follow Dpy distribution, Drees and colleagues used a Dpy::eYFP protein trap line, the same used in Dong et al. However, in this latter paper, Dong et al. stated, using a Crb staining, that Dpy is not at the apical cell surface but only in the lumen. However, Drees and colleagues reported (line 227 and Fig. 4C) that Dpy appears both at the apical cell surface and in the lumen of the tracheal system. But they did not show a co-localisation with an apical marker. Furthermore, in their previous work, (Drees et al. 2019) they called the apical staining a "peripheral shell" layer. In addition, in S2R+ cell culture, it is only when Pio and Dpy co-express that Dpy is detected at the cell membrane. The in vivo localisation of Dpy is an important point that needs to be clarified as it is of importance for the final model proposed Supp Fig. 9.<br /> Drees at al. also performed FRAP experiments on Dpy::eYFP protein trap embryos. As excepted as already shown by Dong et al.<br /> A minor point, the Dpy::eYFP protein trap line used in this study is not listed in the Materials and Methods section of the supplementary data.

      The serine protease NP and Pio release.<br /> Drees and colleagues have pervious shown, preforming in vitro studies, that protease Notopleural (Np) cleaves the Pio ZP domain (Drees at al. 2019). Here the authors went a step further in demonstrating that it is also true in vivo at stage 17. In addition, they showed that, in Np mutant embryos, mCherry::Pio is mostly detected within tracheal cells and the luminal staining is strongly reduced. In this mutant context, the authors conducted FRAP experiment on the mCherry::Pio signal even very weak in the lumen. They showed hardly no recovery after photobleaching.<br /> In Drosophila S2 cells, Drees and colleagues showed that co-expression of the catalytically inactive NpS990A with mCherry::Pio in showed as a prominent signal the 90kDa mCherry::Pio variant in the cell lysate (Fig. 5B), and live imaging revealed mCherry::Pio localisation at the cell surface (Fig. S6B). However, in this inactive form context, a strong signal is also detected at 60kDA corresponding to a cleaved form of the Pio ZP domain (Fig. 5B), and Pio localisation at the cell surface appears weaker than in controls. They authors did not consider that another protease could be at play.<br /> On the other hand, in their previous work, Drees et al. identified a mutant form of Pio (PioR196A) which is resistant to NP cleavage in vitro. It will be a step forward to establish by CRISPR/cas9, as the authors seems to be successful with this technique, a mutant line carrying this point mutation. It will be important to determine whether the observed phenotype resembles that of a mutant Np phenotype.<br /> In their previous work (PLOS Genetics 2019), in Np mutant embryos, Drees et al. did not report "budge-like" deformations from stage 16 onwards leading to the detachment of the tracheal cell from their adjacent aECM. Either the alleles or the allelic combination is different between the two studies which could explain this difference, or it is a new phenotype that has not been previously described. In the latter case, it becomes important to quantify the proportion of segments showing these bubbles. Is this a rare phenotype to observe?<br /> Minor point: I don't understand what the authors are trying to show in supplementary Figure 8. Tracheal cells detach and are found in the lumen?

      Np function conserved matriptase.<br /> In this work, Drees and colleagues showed that Np controls in vivo the cleavage of the Pio ZP domain.<br /> Dumpy and Piopio are not conserved in vertebrates but they both contain a ZP domain which is conserved. The authors tested if other ZP proteins can be cleaved by Np or the human homolog Matriptase. The authors tested in cell culture the ability of the type III Transforming growth factor-β receptor which contains a ZP domain to be cleaved either by Np or Matriptase.<br /> This could be a general mechanism that needs to be extended to other ZP domain proteins and that could be at play to structure the matrix and give it its physical properties.<br /> However, as it is all speculative, I find the discussion section related to these data, for too long and that does not help to understand better the work done in the formation of the tracheal tubes of the drosophila embryo.

      Minor points: Pio and cytoskeleton organisation.<br /> Line 78-79, the authors wrongly quoted a work from Brodu et al (2010). Pio does not anchor the microtubule severing enzyme Spastin. Instead, Spastin releases the microtubule-organising centre from its centrosomal location, then Pio contributes to its apical membrane anchoring. It can therefore be assumed that the organisation of the microtubule network is affected in a pio null mutant. In addition, ZP proteins have been shown to link the aECM to the actin cytoskeleton. Therefore, it would be interesting to look at the organisation of the actin and microtubule cytoskeletons in a pio mutant context in which enlarged apical cell surface area are observed.

      Referees cross-commenting

      I have just read the comments of the other two reviewers, who like me are specialists in the formation of the tracheal system in the drosophila embryo.<br /> I find the comments very fair and balanced. They are in the same spirit as my comments and are very complementary. I hope that all our comments will be constructive for the authors and will improve the quality of their work.

      Significance

      Overall, the methodology is sound, the quality of the data is good and the paper is very well written. Authors combine in vivo, in vitro studies as well a cell culture approach. Using CRISPR/Cas9, they generated a large number of new tools allowing in vivo studies.<br /> Drees and colleagues generated new alleles of pio which are lack of function alleles. They described a new phenotype for pio mutant embryos, namely over-elongated tubes. But they authors do not comment on why these new alleles reveal a new phenotype. Furthermore, using their piomCherry::pio line, the authors state that Pio is localised to the plasma membrane. This location is very difficult to assess. Both new results require clarification.

      The authors had already demonstrated that Np cleaves the ZP domain of Pio in vitro. Here they demonstrate this in vivo. It appears important to evaluate the formation of the dorsal tube when an NP non-cleavable form of Pio is expressed.

      Finally, the model proposing a coupling between the extracellular matrix and the membrane of tracheal cells is very interesting. The demonstration that cleavage of Pio by Np could participate in this coupling is very interesting for those interested in the integration of mechanical stress and cellular deformation. However, such a model has already been discussed in Dong et al (2014). In this article, Dong et al. proposed that a "coupling of the apical membrane and Dpy matrix core is essential for tube length regulation".

      The audience for this article should be specialised and oriented towards basic research. It may be of interest to people working on tubular systems or working on ZP proteins.

      My field of expertise is cell biology and developmental biology in drosophila and formation of tubular networks.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      This study provides valuable evidence showing a role for ZP proteins Piopio (Pio) and Dumpy (Dpy) during tracheal tube maturation. Following up on previous studies from the same group (Drees, et al, PLOS Genetics, 2019), the authors show that Piopio localization and function is regulated by the protease Notopleural (Np). Failure in this process leads to defects in trachea tubular defects.<br /> Overall, this work reinforces the previously known importance of the protein Pio in tracheal morphogenesis, specifically in tube expansion, and provides new mechanistic insight on how Pio is regulated by proteolysis during this process.<br /> The figures are clear, and the questions well addressed. However, I find that some of the claims are not completely backed by the data presented and have some suggestions that will hopefully make some points clearer.

      Major comments

      In the abstract and at the end of the introduction the authors claim that they show that Pio, Dpy and Np support the balancing of mechanical stresses during tracheal tube elongation. However, this is not shown in this manuscript, where tension or mechanical stress were not measured and it is therefore speculative.

      The authors state that all pio CRISPR/Cas9 generated mutants display identical tracheal phenotypes, however these data are not shown. Tracheal phenotypes, in particular DT phenotypes, of all mutants generated should be shown in supplementary materials.

      At stage 16, pio null mutants display DT overelongation phenotypes (Fig. 1). The authors should quantify this phenotype.

      The authors analyse Pio distribution under tubular stress, using mega mutants and Chitinase overexpression. Pio localization changes in these genetic backgrounds and this is shown in Figure 2 only in a qualitative manner. The authors should measure Pio localization at the lumen and at the membrane and provide quantitative data.

      Surprisingly and interestingly, wurst;pio transheterozygotes display very strong tracheal defects. The authors say they observe gas filling defects; however it is not clear from figure 2E if this indeed the case. From the panel in the figure, it looks like these embryos suffer from strong tracheal morphogenetic defects. It would be necessary to have a better analysis of these embryos. What is the penetrance of this phenotype. If this is 100% penetrant, one would expect it to be lethal. Therefore, double mutant balanced stocks are not viable? Having analyzed the phenotypes and confirmed which morphogenetic defects the transheterozygote embryos present, how does this genetic interaction fit with the model presented?

      mCherry::Pio Dpy::eYFP time lapse analysis and FRAP experiments is very interesting. However, it is not clear to which degree bleaching occurs in the tracheal lumen. The authors claim that recovery is very fast and can be seen from minute 2, however, frame-by-frame analysis of Movie S2 does not show a clear different between luminal Pio from minute 0 to minute 2. Rough comparison with the luminal area surrounding the bleached area, does not show a clear difference in luminal Pio before and after photobleaching. To claim fast recovery of luminal Pio after photobleaching, the authors should quantify luminal Pio, before and after bleaching. In addition, in figure 4D, the normalized mCherry::Pio fluorescence in the graph what does it refer to? Intracellular Pio?

      When mCherry::Pio Dpy::eYFP time lapse analysis and FRAP experiments was done in an Np mutant background, the authors describe lack of Pio recovery within the lumen (Movie S3). However, when comparing control and Np mutant background embryos, Pio is not properly released into the lumen of Np mutants (as stated by the authors and seen by comparing movies S1 and S4). Furthermore, on minute 0 of the FRAP experiment in Np embryos, there is no detectable Pio in the DT lumen. Therefore, recovery was not expected in Np mutants and should not be claimed as a conclusion for this experiment.

      Brodu et al (Dev Cell 2010) have shown that Pio is important for cytoskeletal modulation during tracheal maturation. Pio is important for non-centrosomal microtubule (MT) arrays anchored at the tracheal cell apical membranes. In addition, MT disruption in tracheal cells leads to lumen formation defects (Brodu et al, Dev Cell 2010). In the absence of Pio, the tracheal cytoskeleton is altered, and this could explain some of the results observed. Ideally, the work should be complemented with a basic cytoskeletal analysis, but if this is not possible, the authors should discuss some of the phenotypes in light of this Pio function.

      Minor comments

      The model should not be in supplementary materials and should be moved to the main manuscript.

      Throughout the manuscript embryonic stages are described using different nomenclature (stage X, stX and st X). Either way is correct, but the same nomenclature should be used throughout.

      In Fig. S1 B and C the authors should specify which pio allele is being analysed (as in Fig. 7). The same should be done in the text.

      Line 131, it is not correct to say that WGA visualizes cell membranes. WGA marks/stains cell membranes.

      Line 165 "leads to excessive tube dilation and length expansion due to strongly reduced luminal chitin" is not correct. Chitin reduction leads to excessive tube dilation but not to length expansion, as reported in the papers cited at the end of the sentence.

      Line 220-221, what do authors refer to as "stage 16 wt-like control embryos"?

      Line 221, "some minutes" should be replaced by a specific number of minutes. According to Movie S2 reappearance of tracheal cell Pio happens from minute 16.

      Line 291 "time laps" should be lapse.

      Line 302, "Pio was not shedded into the lumen but remained at the cell" should be "Pio was not shed into the lumen but remained in the cell".

      Referees cross-commenting

      I agree. Taken together, all the comments will improve the quality of the work and of a future manuscript. Also, everything seems quite doable and will not present any problems.

      Significance

      The findings shown in this manuscript shed light on the regulation of tubulogenesis by ZP proteins and how their interaction with the ECM can be regulated by proteolysis. It was known that Pio is involved in tracheal development, is secreted into the lumen, regulating tube elongation (Jaźwińska et al., Nat.Cell Biol., 2003) and anchoring MTs to the apical membrane during tubulogenesis (Brodu et al, Dev. Cell 2010). This work provides additional molecular insights into Pio dynamics and regulation during tube maturation.<br /> This work will be of interest to a broad cell and developmental biology community as they provide a mechanistic advance in ZP proteins involved in morphogenesis. It is of specific interest to the specialized field of tubulogenesis and tracheal morphogenesis.

      Field of expertise:

      Drosophila, morphogenesis, tracheal tubulogenesis, cytoskeleton

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript entitled "The proteolysis of ZP proteins is essential to control cell membrane structure and integrity of developing tubes" the authors Leonard Drees, Dietmar Riedel, Reinhard Schuh and Matthias Behr report on their study on the molecular and cellular mechanisms of tracheal tube size determination in the embryo of the fruity fly Drosophila melanogaster. The tracheal tubes of D. melanogaster are a model tissue to understand the molecular and cellular mechanisms of tube formation in animals. In brief, morphogenesis and terminal differentiation of this epithelial tissue runs through several overlapping events starting with the invagination of tracheal precursor cells, their fusion to tubes, the formation of a luminal matrix consisting of chitin and associated proteins for diameter and length determination, endocytosis of luminal material and gas filling at the end. L. Dress and colleagues show that the ZP protein Piopio (Pio), its partner Dumpy (Dp, also a ZP protein) and the Matriptase homolog Notopleural (Np) are needed for tube length determination. Pio with Dp constitutes a molecular anchor between the apical cell membrane and the luminal matrix thereby coordinating growth of the epithelial cells and the luminal matrix. After termination of morphogenesis, Pio is cleaved by the protease Np to initiate tracheal differentiation.

      Overall, this is an exciting work. There are, however, several open questions that the authors could address to facilitate understanding of their work. These points are:

      • On page 5, lines 113ff, the authors mention the membrane bulges that they analyse in figure 1. They show these deformations by light (confocal) and electron microscopy. However, the bulges seen by confocal microscopy seem to be bigger that those seen by electron microscopy. The authors could quantify the sizes of the bulges for clarification.
      • The subject of the manuscript is rather complicated; presentation of data from Figure 1C and D on lines 113ff and 169ff is confusing.
      • The quality of the sub-images of Figure 2E differs. Especially, the phenotype of the wurst, pio transheterozygous embryo is not well visible.
      • Lines 246ff: the protein size are given for the mCherry:chimeric proteins; an estimate of the native Pio portions should be given.
      • In Figure 6A, the appearance of chitin in the wildtype tube is different compared to the Np mutant situation, more filamentous. Can the authors comment on that?
      • In the discussion section, I would appreciate if the timing of events was discussed or even shown in a model. The central question is: how are the functions of Pio and Np coordinated in time? As I understand, Np should not cleave Pio before morphogenesis is completed. Is there any example in the literature for how such an interaction could be controlled? The overexpression of Np shows that either the ratio between Np and Pio is important, or the btl promoter expresses Np at the "wrong" time point.
      • Also for the discussion: We have two situations where Pio amounts/density are enhanced at the apical plasma membrane. The wurst experiments on lines 136ff show that Pio amount and density depends on endocytosis; is the wurst phenotype (Figure 2), at least partially, due to over-presentation of Pio? Likewise, in Figure 2C, there is more Pio in Cht2 overexpressing tracheae (but there is overall more Pio in these tracheae) - is actually endocytosis reduced in chitin-less luminal matrices? First: does the Pio signal at the apical plasma membrane correspond to membrane-Pio or free-Pio? Second, as in the case of wurst: would more Pio on the membrane (density) affect tracheal dimensions in Cht2 over expressing tracheae? Or are the consequences of Pio accumulation in the apical plasma membrane different in Cht2 and wurst backgrounds? Maybe cleavage of Pio and its endocytosis are dependent on its interaction with the chitin matrix. These questions connect to the question immediately above: how are the functions of the different players coordinated in space and time? We need a discussion on this issue.
      • The sentence on line 242ff should be rephrased: "dynamic" and "elastic" are not opposites.
      • A central question to me is the amounts and the density of factors in different genetic backgrounds as mentioned above. Is there any mechanism adjusting the amounts or the density of the players according to the size of the apical plasma membrane or the tracheal lumen? Pio seemingly responds to these changes.
      • The connection of Pio and taenidia is mentioned in the results section (page 7) but not discussed.
      • Dp remains cytoplasmic in pio mutant background - is the pio mutant phenotype due to defects by lack of Pio AND Dp function? What is the tracheal phenotype of dp mutants?
      • Lines 374ff: the reduced dorsal trunk in Np mutants is not significant; the respective statement should be formulated carefully. If we believe the statistics (no significance), this would mean that attachment of the apical plasma membrane to the luminal chitin via Pio is needed to restrict axial extension; release of Pio is needed for differentiation (taenidia formation, luminal clearance) beyond morphogenesis.
      • Why don't we see the apical Pio signal in Figure 4B?
      • The Strep signals in the merges in Figure 7C are not well visible.

      Significance

      This work brings together several factors (Pio, Dp, Np, Wst etc) already known to be needed for tracheal morphogenesis and differentiation in the embryo of D. melanogaster. Having worked myself with some of these factors, however, I recognize that the interaction between these factors is novel and very exciting. The experiments strongly indicate a new mechanism of cell-ECM connection that seems to be conserved to some extent (as they provide preliminary data on an example from humans). By integrating the functions of different factors, the work provides ample opportunity for future projects to elucidate this mechanism in detail. Therefore, I expect that it will have a significant impact not only on the field of developmental cell biology but also, due to the conserved proteins involved (ZP proteins, Matriptase), on the field of cell biology of human diseases.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript describes the strategy to efficiently synthesize a natural truncated version of the chemokine CXCL10 that lacks the last 4 amminoacids. In addition, it describes the biological activities of the CXCL10 truncated version (1-73) compared to the full length chemokine (1-77). By performing in vitro and in vivo experiments, authors have found that CXCl10 1-73 is not able to induce signalling and chemotaxis of CXCR3 expressing cells such as T lymphocytes. In addition, this C terminal truncated version does not bind GAGs while retains angiostatic activity, blocking migration and proliferation of endothelial cells.<br /> The paper is written very well, results are presented in a very logical sequence.

      Major comment

      The in vivo experiments shown in supplementary figures 7 and 8 are not significant and I suggest removing them from the manuscript.

      Minor comment

      In figure 9D authors showed the in vivo migration of CXCR3 positive T lymphocytes in the peritoneal cavity. However, the gating strategy showed in supplementary figure 6 is showing all the leukocytes CXCR3 positive. Please clarify.

      Significance

      The manuscript describes the biological activity of a truncated version of CXCL10 a very important chemokine that recruit Th1 lymphocytes and NK cells. The C terminal truncated version of CXCL10 is naturally occurring, but its functions were never described until now.

      The strength of the manuscript is the precise description of the synthesis and of the in vitro biological functions of the truncated CXCL10.

      For this reason, these results are of interest not only for a specialized audience working in the chemokine field, but also for a more broad audience for the development of an inhibitor of CXCR3 or for an angiostatic molecule.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      It has been reported that CXCL10 has several truncated forms (proteoforms) with different C-terminal truncation states, each with different functions. The authors have discovered and reported an efficient peptide synthesis method for CXCL10(a.a. 1-73) proteoform. The authors indicated that they could synthesize CXCL10(a.a. 1-73) proteoform consistent with the known functions of natural CXCL10(a.a. 1-73). The synthesized CXCL10(a.a. 1-73) successfully indicated the reduced effects on lymphocyte migration and similar effects on angiogenesis. These findings open the way for detailed functional analysis of CXCL10 (a.a. 1-73), which has been difficult to study in vivo and has potential for therapeutic use. However, as discussed below, the authors have made several statements that confuse their findings with the discoveries made by previous studies. The Introduction is a typical example of this. Also, there are several major issues noted in the next section.

      Major comments:

      1. Figure 4: Possibly an overinterpretation of results in CXCR3A overexpressing model cells<br /> The authors build their logic for the entire paper by drawing conclusions based on their assumption that CXCR3A overexpression model is equivalent to physiological lymphocytes and endothelial cells. However, CXCR3 has isoforms, including CXCR3A/B/alternative. They have different effects on cell proliferation and migration. Expression levels of CXCR3A/3B may vary among cell types and microenvironments. In addition, the downstream signals pAKT and pERK of CXCR3A/B are regulated by various regulatory factors. Therefore, it is important to perform the experiments shown in Figure 4 with primary lymphocytes and vascular endothelial cells, which are the subject of this paper. Based on the data presented by the authors, experiments with Primary lymphocytes and Endothelial cells would not be difficult.
      2. Figure 5: "In line with the observation of the signaling assays, COOH-terminal processing of CXCL10 also significantly diminishes its chemotactic properties on primary CXCR3+ T lymphocytes."<br /> The authors draw the conclusions described above from the results in Figure 4 and Figure 5. In other words, authors excluded other possibilities without data.<br /> In Figure 5, the Chemotaxis assay was performed in Transwells with 5 miro-meter pores pre-coated with fibronectin. CXCL10 is also known to interact with fibronectin. This suggests that, potentially, the interaction with fibronectin may be important for CXCL10 gradient formation on the transwell. However, interaction data between CXCL10(a.a. 1-73) proteoform and fibronectin is not shown. This information is essential in the interpretation of Figure 5 results.<br /> The authors should consider the possibility that readers unfamiliar with this experimental system may be given a false understanding that Chemotaxis shown here is determined solely via CXCR3A.<br /> Also, please indicate whether the conclusions here are supported in different Pre-coating (e.g. type I collagen, type 4 collagen, human fibronectin). How the activation changes with each Coating here is important information when considering how CXCL10(a.a. 1-73) behaves in the extracellular matrix in vivo. These add to the value of this study and provide important insights for readers to further work with CXCL10(a.a. 1-73).<br /> Furthermore, the Migration chamber here is pre-coated with bovine serum fibronectin. Please provide Lot and purity information for this Serum derived fibronectin. This is considered important both for the reader to reproduce the data and to interpret the results. Since Bovine serum fibronectin is a different species than human CXCL10 (a.a. 1-73), in order to correctly interpret its contribution to the Chemotaxis assay, it is interactions, respectively, should be evaluated.
      3. Figure 6: Over-interpretation of the results<br /> From Figure 6, the authors conclude that CXCL10 (a.a. 1-73) has no change in antiangiogenic action based on data on vascular endothelial cell migration and viability. Cell migration and endothelial cell viability are only one aspect of angiogenesis. It is problematic to conclude from these results that there is no change in "antiangiogenic action".<br /> Also, in Figure 6A, the authors cultured cells in the presence of FGF2 and in the presence of CXCL10 (a.a. 1-73) and CXCL10 (a.a. 1-77) for as long as 49 hours to evaluate Migration. Therefore, the results here include not only pure migration but also its effect on proliferation. However, Figures 6C/6D only show data on the viability of cells, not on the effects on cell proliferation. Therefore, in order to correctly interpret the results, the proliferation of vascular endothelial cells needs to be examined and presented.
      4. Figure 9 and supplemental Figure S6: Gating for T cells (gated as CD3+ NK1.1-) and activated CXCR3+ T cells (gated as CD3+ NK1.1- CXCR3+)<br /> Supplemental Fig. S6 raises a question as to whether the location of the Gating of CD3 and NK1.1 is correct. Please verify if this gating is proper by presenting Isotype control data as the basis.<br /> Gating for CXCR3 also seems to be gated in an unnatural position. Please present Isotype controls data and positive control data and explain the basis for this gating.
      5. Figure 9: Over-interpretation of the results<br /> It would be an oversimplified interpretation of the results here to explain them solely in terms of lymphocyte Migration. The authors should not rule out the possibility that the results obtained here could be due to effects quite different from those shown so far in vitro.<br /> Conclusions should be drawn after examining the following items

      6. Expression of lymphocyte adhesion-related molecules on the surface of vascular endothelial cells

      7. Effects on Tight junction of blood vessels
      8. Effect on vascular permeability

      If the above data are not presented, the authors should clearly describe that the author's conclusion is just one of the possibilities. The readers should be informed of the above possibilities, and the different potential mechanisms involved so that the readers do not misunderstand that the authors' conclusions are definitive conclusions.

      Minor comments:

      Figure 7C: Please provide higher-resolution images

      The quality and resolution of the images are low and very difficult to see. The image is of such low quality that it is barely possible to determine the presence or absence of cells. Here, providing higher-resolution images is important to give the reader a deeper understanding. The desired resolution is a resolution that allows determining what the Filopodia and Lamelipodia morphology of the cell looks like at the Edge of the Scratch, and how it differs or does not differ between CXCL10 (1-73) and 1-77, etc., desirable. Such an image could underpin the other data in this paper. Furthermore, such detailed forms can give the reader insights into more precise molecular mechanisms. In this sense, it is essential to provide high-quality images.

      Line 360-362, page 12 (Results)

      "Various naturally-occurring COOH-terminally truncated CXCL10 proteoforms were detected in human cell-culture supernatant of IFN-γ-stimulated human diploid skin/muscle-derived fibroblasts and primary human keratinocytes and potential processing enzymes were identified (Suppl. fig. 1). "<br /> This statement could be interpreted to mean that what is described in Supplemental figure 1 is identified in this paper. Although it is unlikely that most readers would make such a mistake, unnecessary misleading statements should be avoided.

      Line 508-509, page 16 (Discussion)

      "In the present study, we characterized the effects of a natural COOH-terminal truncation of CXCL10, which involves the shedding of the four endmost COOH-terminal amino-acids, on hallmark chemokine properties of CXCL10."

      The authors state that "we characterized the effects of a natural COOH-terminal truncation of CXCL10," which gives the reader the wrong impression.

      "Natural truncation of CXCL10" means physiological CXCL10, which is truncated form that normally occurs in vivo. These findings have been done in prior papers and were not first characterized in this paper. This should be described as a characterization of the synthesized peptide. This sounds like the authors have taken credit for prior studies.

      Figure 6A&6B: What is the "HRMVEs" on the Y-axis? Nowhere in the paper is there a description of this term.

      Figure 8A&8B: Some error bars are only on one side.

      Significance

      It has been reported that the functions of CXCL10 change dynamically in tissues depending on the C-terminal truncation state. However, this dynamic nature created a mixture of each Proteoforms (CXCL10 with different terminal truncation states), making the analysis of their functions difficult. CXCL10(a.a.1-73) is not commercially available like CXCL10(a.a.1-77) due to its difficult peptide synthesis; pure functional analysis of CXCL10(a.a.1-73) could not be performed in vivo. Therefore, the functions of CXCL10(a.a. 1-73) has been mainly reported as circumstantial evidence or in vitro studies using trace amounts of purified product purified using HPLC.

      In this study, the authors clarify the challenges of peptide synthesis and enable the synthesis of more CXCL10(a.a. 1-73). Thereby paving the way for implementing the function of pure CXCL10(a. 1-73) proteoform not only in vitro but also in vivo. It also potentially opens the door for the application of CXCL10(a.a. 1-73) in therapeutic interventions such as tissue repair.

      However, the paper has the problems mentioned above, and it would be desirable to verify and reinforce the reliability and logical development of the conclusions. Reinforcing additional experimental data such as that and validating the derivation of the conclusions would be a study of significance to basic medical researchers in vascular biology, immunology, and tissue repair, as well as to the clinical research community.

      General assessment:

      Strength:

      The authors have discovered and reported a stable method for synthesizing CXCL10 (a.a. 1-73), which has been difficult to synthesize in the past. This may provide researchers a way to solve the problem that it has been difficult to analyze clear molecular mechanisms due to the mixture of diverse CXCL10 proteoforms. The progress reported here may be expected to facilitate other researchers to investigate more detailed molecular mechanisms and explore unknown functions of CXCL10 (a.a. 1-73).

      It is also expected to solve the problem of in vivo analysis of CXCL10(a.a. 1-73) function, which has been impossible due to yield issues. In the future, this synthetic peptide may open the door to a variety of useful applications, such as therapeutic intervention for severe wound healing.

      Advance:

      The authors wrote the "Introduction" and "Abstract" focusing on the functional "discovery" of CXCL10 (a.a. 1-73). This may prevent the readers from understanding the true value of this study. The most significant finding of this study is the technological advance of increasing the yield of CXCL10 (a.a. 1-73), which has been difficult to synthesize to a level that allows in vivo experiments. Although there are many improvements to be made, I believe this is a significant study for the community described above if this synthesized peptide is widely available in the community.

      Audience:

      The current manuscript is suitable for a Specialized audience. If the issues raised here were solved, it might be suitable for broader audiences, including translational/clinical researchers.

      My field of expertise:

      Molecular biology, biochemistry, vascular biology, hematology and cancer

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Dillemans et al reports that synthesis, purification and functional characterisation of the truncated CXCL10 proteoform CXCL10(1-73). This lacks the four endmost COOH-terminal amino acids . The authors report that compared to the full length CXCL10(1- 77), CXCL10(1-73) had (i) diminished affinity for glycosaminoglycans, (ii) exhibited reduced capacity to induce signaling events (e.g. calcium mobilization as well as ERK and Akt phosphorylation) and (ii) reduced chemotactic T lymphocyte responses in vitro and in vivo. However, CXCL10(1-73) retained its anti-angiogenic properties, as assessed by inhibition of spontaneous and FGF-2-induced migration, wound healing and sprouting of human microvascular endothelial cells.

      The work is well performed though the pharmacological analysis is a little superficial and under-developed with incomplete/inconsistent concentration-dependent responses. The manuscript is rather verbose in places.

      Specific points:

      1. Fig 4: how do the authors know that the reduced calcium responses to full length CXCL10 following pr-treatment with the C-terminal truncated CXCL(1-73) is due to desensitisation rather than say partial agonism? They should compare internalisation of CXCR3 and/or loss of surface expression of CXCR3 following treatment with CXCL10 (1-73) versus CXCL13(1-77) to validate this.
      2. The choice of concentration ranges used for CXCL10(1.77) and CXCL10(1-73) across figs 4, 5 and 6 is inconsistent with no explanation given as to why.
      3. Figs 4: The dose response curves are rather limited narrow e.g.1, 3, 10 nM for CXCL10(1-77). The choice of concentrations for CXCL10(1-73) in fig 4 is a little unusual in Fig 4 (9, 45, 270nM). Has the maximum response to CXCL10(1-73) in figs 4-6 been achieved? It would be useful to know the EC50 values for both full length and truncated forms of CXCL10 in figs 4 and 5
      4. Fig 5: in contrast, to Fig 4, this figure has comparable concentration ranges at 5 points across (1-100 nM). What is the rationale for the inconsistent concentration ranges used across different assays?
      5. The bar graphs for pERK, pAkt responses would look better as line graphs and more complete concentration ranges (perhaps use 5 concentrations e.g. over 1-100 nM for CXCL10(1-77).
      6. Fig 6: the inhibitory effects of CXCL10(1-77) and CXCL10( 1-73) seem to occur at a single concentration (120 nM), Can the spontaneous HMVEC migration be further inhibited at higher doses of truncated and full length CXCL10? Both appear to have just reached 50% inhibition at 120 nM.
      7. Fig 7. What is the impact of both proteoforms on FGF-stimulated wound healing?
      8. Why is it necessary to provide Kd values in the main results text when these are already provided in Table 1. This is just one example of verbosity that that is often present in the manuscript
      9. The methods section is also very long.
      10. What phosphorylation sites are detected in the ERK1/2 and Akt ELISA assays? The authors should provide more details on this point. The ELISA assays alone does not really provide convincing analysis of phosphorylation and should be backed up with more robust assays to assess ERK and Akt phosphorylation e.g western blots and/or flow cytometry with phospho-specific Abs.

      Significance

      The study reveals that the COOH-terminal residues of CXCL10 Lys74-Pro77 are important for GAG binding, CXCR3A signaling, T lymphocyte chemotaxis, but dispensable for angiostasis .

      Study is of interest to basic researchers in areas of pharmacology, immunology and structural biology with relevance to drug discovery, inflammation and cancer biology.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their constructive criticism that helped us to improve the paper. We modified Fig.6I and Fig.7, replaced Fig.8, and added supplementary Figs. 3-5 and supplementary Tables S1-2. The manuscript was extensively re-written. A new paragraph was added in the Discussion section where relative adhesiveness was related to absolute adhesion strength and the cadherin knockdown result to earlier findings.

      Reviewer #1 (Evidence, reproducibility and clarity):

      Summary: This work examines the relationship between cell-cell contacts and pericellular matrix in Xenopus chordamesoderm, which is a tissue actively involved in convergent extension during gastrulation. By lanthanum staining of pericellular materials, the authors found that different types of pericellular matrix are present in cell-cell contacts in the chordamesoderm, which may mediate cell-cell adhesion. Knockdown of C-cadherin, Syndecan-4, fibronectin, and hyaluronic acid leads to the reduced abundance of cell contacts and cell packing density, but this does not seem to affect convergent extension. Based on these observations, the authors propose a model in which cell-cell contacts involve the interdigitation of distinct pericellular matrix units.<br /> Major points:

      1. Knockdown of adhesion molecules separates cells and leads to wide contacts with large interstitial spaces. Data in figure 1 show loosely packed morphant chordamesoderm cells. Intuitively, these should reduce cell-cell adhesion. However, a main conclusion from this manuscript is that reduced abundance of narrower contacts does not decrease adhesiveness. Although depletion of adhesion molecules modifies but not abolishes a contact, non-attached free surfaces increase significantly in morphant cells. It is therefore not easy to understand that how reduced cell contacts have no effect on cell adhesion.

      We added a section to the Discussion to address this issue (p.11ff). We show in the Results section (modified Fig.7) that relative adhesiveness is indeed significantly reduced in the morphants (Syn-4 always being the exception) when compared in the contact width range of normal chordamesoderm. However, contact width is strongly increased in the morphants, and adhesiveness increases linearly with width. We argue that these effects compensate for the initial lowering of adhesiveness. In other words, adhesive contacts become shorter (more gap surface) but wider (see Fig.6I), and become the more adhesive the wider they become. As in the original version of this paper, we then propose a model that explains the empirically observed increase of adhesiveness with width. How the abundance of cell-cell contact is reduced is less clear yet. Pericellular matrix deployment and structure is strongly affected by adhesion factor knockdown, and contact types are altered. Some contact types seem to widen but remain adhesive, others become non-adhesive, and still others may disappear without being replaced (see last paragraph of Discussion). To add detail to these notions and clarify this important issue to satisfaction will require future research.

      Importantly, the adhesiveness was not experimentally tested.

      Due to external circumstances, we were unable to perform additional experiments. However, we used our previously published quantitative data on adhesion in gastrula tissues including the chordamesoderm to interpret our present results for normal and C-cad-depleted chordamesoderm, and to relate relative adhesiveness to absolute adhesion strength, in a new section of the Discussion (p.11ff).

      1. It is surprising that reduced cell contacts, at least narrower cell contacts, do not affect convergent extension. Does this mean that active cell behavior changes in the chordamesoderm, which are required for convergent extension, are independent of cell contact types?

      We actually claimed that all treatments inhibited convergent extension, except for Syn-4 (Barua et al. 2021, and this manuscript, p.3, Fig.1B,C). Syn-4 knockdown had a dramatic effect on cell contacts, cell density and cell shape but none on convergent extension, at least up to the middle gastrula stage. This is surprising and does not fit easily to current views of cell intercalation during convergent extension, but analysing the underlying cell behaviors is beyond the scope of this article.

      1. Although the formation and localization of pericellular materials are differentially affected after knockdown of adhesion molecules, there is no clear evidence showing that different types of pericellular matrix mediate cell-cell adhesion in the chordamesoderm. It is possible that the disrupted distribution of pericellular materials in morphants only represents a secondary consequence of changed cell contacts. This may be supported by the fact that knockdown of adhesion molecules reduces narrow contacts and increases LSM-free gaps.
      2. The relationship between contact width spectra and LSM is also very elusive. Again, changes in contact width or abundance and distribution of LSM may be indirectly caused by loss of adhesion molecules. Therefore, although knockdown of adhesion molecules leads to changes of LSM localization, it cannot be concluded that cell-cell contacts in chordamesoderm are mediated different types of pericellular matrix.

      We find it difficult to interpret for example Fig.5A-F other than assuming an adhesive role for the pericellular matrix, in this case LSM, in normal and morphant tissue. What else would here hold two cells between two gaps together? The contacts are often much too wide for cadherin-cadherin binding. We indeed believe that changes in contact width or abundance are caused by the loss of adhesion molecules, directly or indirectly. Our LSM images show that remarkably, modified contacts (e.g. Fig.3D,F; Fig.5B,C) are still able to keep cells together over some distance, between interstitial gaps, and our quantitative data indicate similarly that e.g. contact widening is consistent with continued adhesion. However, some of the contacts may become non-adhesive, or be lost without being replaced, increasing non-adhesive gap surface. This is discussed now on p.11, middle paragraph.

      1. In contrast to the present observations, works by others using the same morpholinos have shown that Cadherin-dependent cell adhesion, fibronectin-rich extracellular matrix, and Syndecan-4-regulated non-canonical Wnt signaling are required for convergent extension. These discrepancies need to be appropriately addressed.

      As mentioned above, we found that all treatments affected convergent extension, as expected from the work of others and our own, except for Syn-4 depletion. We noticed that in the paper by Munoz et al. on Syn-4 overexpression and knockdown, only late gastrula/early neurula stages were evaluated. Syn-4 knockdown produced moderately strong axis defects, perhaps in part related to impaired neural plate closure. Unfortunately, we did not follow our morphants to these later stages to see whether defects developed then. But our main interest here is cell-cell contacts.

      1. If LSM and LSM-free contacts are similarly adhesive, what will be role of LSM in cell adhesion and how cell adhesion is established in these LSM-free contacts?

      We discuss now more explicitly the notion that gastrula non-epithelial cell adhesion is mediated by a mosaic of pericellular matrix patches of different composition, some containing LSM in different configurations, others not, but each similarly adhesive.

      Minor points:<br /> 1. It may be helpful to clearly define the pericellular matrix in this particular context and its relationship with LSM. It is also necessary to clarify whether the adhesion molecules examined in this work are considered as components of the pericellular matrix.

      We explain the use of these terms at the end of the first paragraph of the Introduction. The most general term is pericellular matrix; part of it is La3+ labeled – LSM; and some of the LSM can be compared to structures which in other systems are termed glycocalyx. We consider the adhesion molecules examined to be part of the pericellular matrix but are aware of other putative functions, like in cell signaling, which may indirectly affect contacts and thus contribute nevertheless to the phenomena studied here.

      1. In figure 1B, it appears that the Cadherin morphant has defects in chordamesoderm elongation and archenteron formation, suggesting impaired convergent extension.

      We find, in agreement with the work of others, that C-cad knockdown impairs convergent extension, and mention this when we describe Fig.1B.

      1. In figure 1C, the Syndecan-4 morphant gastrula clearly shows enhanced anteroposterior elongation of chordamesoderm and archenteron in comparison with the wild-type embryo. This seems to suggest that loss of Syndecan-4 promotes the movements of convergent extension. However, previous studies indicate that both gain and loss of Syndecan-4 impairs convergent extension.

      As mentioned above, late gastrula/early neurula stages were evaluated in the Munoz et al. paper, mid-gastrula stages in our work. One possible explanation would be that mild axis defects develop later, partly in connection with neural tube elongation and closure.

      1. Ideally, in knockdown experiments, control embryos should be injected with corresponding mismatch morpholinos.

      We explain in the Methods section that we only used morpholinos that were extensively characterized in previous publications.

      1. In figure 1E, it is unclear what type of cell contacts the light green arrowheads indicate.

      This is explained now in the figure legend.

      1. Figure 1 legend, "(wt) is from Barua et al. 2021". I am not sure it is appropriate to use previously published data.

      The present data were derived by further evaluations of the same samples and TEM sections as used in Barua et al. 2021. We show the previously published data (acknowledged in the legends) here for easy comparison (instead of citing the previous paper).

      1. There is no light blue arrowhead in figure 2, and in figure 3B and 3I, it seems that the same colored arrows are used to indicate different structures.

      This has been corrected.

      1. Triple-layered contacts are not clearly defined.

      We define this term now repeatedly, as consisting of two LSM layers enclosing a non-labeled layer between them.

      1. Page 2, "based on driven by" should be either "based on" or "driven by".

      Has been corrected.

      1. Page 8, "selectin" should be "selecting".

      Has been corrected.

      Reviewer #1 (Significance):

      Strengths:<br /> Demonstrated the effects of several adhesion molecules on the formation of cell contacts and pericellular matrix in Xenopus chordamesoderm.<br /> Limitations:<br /> The significance of chordamesoderm cell contact changes in convergent extension or gastrulation is not clear;

      Effects on gastrulation of PCM or membrane adhesion molecule depletion have very often been described as mediated by effects on cell signaling. Without excluding such possibilities, we liked to redirect attention here to other putative mechanisms by describing basic effects of treatments on cell-cell contacts including PCM deployment and structure. Future work must relate the specific, often dramatic, contact changes upon depletion of a specific factor to cell behavior during convergent extension and other tissue movements.

      there is no direct evidence showing the functional link between pericellular matrix, cell contacts and cell adhesion;

      Please see our response to main points 3 and 4 above.

      the absence of effects on convergent extension after depletion of several adhesion molecules is not fully consistent with previous reports.

      Please see our response to main points 2 and 5 and minor point 3 above.

      Advance: This work likely provides some fundamental and methodological advances for studying cell-cell adhesion. It shows promise for elucidating mechanisms underlying the regulation of cell contact changes in tissues involved in morphogenetic movements.<br /> Audience:<br /> This work likely interests readership studying embryonic cell adhesion in the field of developmental biology and cell biology. It may be also potentially interesting for people working on glycocalyx pericellular matrix in adult tissues.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary: During gastrulation, cells within vertebrate embryos require the ability to both adhere to one another and rearrange with their neighbors to shape the emerging body plan. These authors posit that such flexible adhesive contacts are mediated in part by the pericellular matrix (PCM), including multiple types of glycocalyces containing molecules such as fibronectin, hyaluronic acid, and syndecans, which they previously characterized in multiple embryonic tissues (Barua et al, PNAS, 2021). Here, in a follow-up to their 2021 study, the authors use electron microscopy to characterize the pericellular matrix within the chordamesoderm of Xenopus gastrulae. They identify several types of adhesive contacts within the chordamesoderm and assess how they are altered in the absence of key PCM molecules via morpholino knock-down. They conclude that syndecan-4 and hyaluronic acid comprise and promote assembly of PCM plaques whereas fibronectin and C-cadherin anchor them to cell surfaces. Cell packing density is decreased upon loss of all 4 of these molecules, which the authors attribute to a decrease in the number of cell contacts without affecting the strength of the remaining contacts. They further conclude that adhesiveness increases linearly with contact width, and that this relationship is unaffected by loss of any aforementioned adhesive/ PCM molecules.

      Major comments:<br /> Many conclusions in this manuscript are based on measurements of cell contact angles, which indicate the reduction of tension at cell contacts vs. free cell surfaces and thus relative adhesive strength. While this lab previously applied the same approach to live tissues (David et al, 2014), it is not clear to what extent such measurements accurately reflect adhesive strength in fixed tissues and/or electron micrographs. Especially given the issue of random sectioning planes, which cause distortion of contact angles. Although a correction was applied, the authors note this is not theoretically derived because the heterogeneity of gap sizes made such calculations too difficult. Indeed, it appears that the large gaps between cells within morphant embryos affect contact angle measurements, but if this is corrected for in any way, it is not mentioned.

      Geometrically determined contact angle distortion should affect angle or relative adhesiveness distributions in all conditions or treatments similarly and thus should not or only little affect comparisons of distribution peaks, averages, etc. Beyond this effect of random sectioning planes, we don’t see how large contact width should by itself affect measurements of angles.

      Because this is the sole measure of cell adhesion provided in the study, this reviewer is not convinced of the conclusion that loss of PCM components does not affect adhesive strength.

      In response to this criticism, we re-evaluated our adhesiveness-width data (Fig.7A-E). We noticed that there is indeed a reduction of relative adhesiveness when morphants are compared to normal chordamesoderm within the width range of the latter. But the addition of increased widths in the morphants and the linear increase of adhesiveness with width compensated or overcompensated the initial reduction of adhesiveness.

      Could such measurements not be made from live cells/tissues after manipulating PCM components, as the lab has done previously? Because the lab already has the necessary reagents and expertise for such experiments, the time and resources needed for such measurements shouldn't be prohibitive.

      Due to circumstances, we were unable to perform additional experiments. However, we used our previously published quantitative data on adhesion in gastrula tissues including the chordamesoderm to analyze our present results for normal and C-cad-depleted chordamesoderm, and to relate relative adhesiveness to absolute adhesion strength, in a section added to the Discussion (p.11ff).

      • As mentioned above, these authors previously measured adhesive strength in live Xenopus cells and tissues (David et al, 2014). In that study, they found that C-cadherin MO reduced relative adhesiveness whereas the current study found that relative adhesiveness actually increases in this condition. What explains this discrepancy?

      We explain now in the new Discussion section (p.11ff) and with the help of supplementary Figure S5 how adhesion strength and relative adhesiveness are related overall (tissue surface vs. cell contacts) and at gaps within a tissue (gap free cell surface vs. cell contacts). In the previous study (David et al, 2014), we discussed relative adhesiveness in relation to overall adhesion strength, and both are decreased upon C-cad knockdown. Here we examined these parameters at interstitial gaps, where we find a small increase of relative adhesiveness, due to overcompensation caused by a strong increase of adhesiveness with contact width. Using our David et al, 2014 data we quantitated the effects. We previously found a similar increase of relative adhesiveness at gaps in C-cad morphant ectoderm (Barua et al. 2017) which we could not explain at the time, but explain now by analogy to our chordamesoderm results.

      • No control morpholinos are used, and for the morpholinos that are used, the doses are very large. An equally high dose of control MO should be used to ensure that all observed phenotypes are specific.

      We detail in the Methods section that we used here and in previous publications only previously characterized morpholinos.

      • It appears that all the images analyzed were collected in the sagittal plane, and the analyses don't seem to consider the intrinsic polarity of the chordamesoderm. For example: cells in different positions within the tissue (basal vs. apical), or that WT chordamesoderm cells are mediolaterally polarized and actively intercalating whereas disruption of PCM components like fibronectin disrupts cell intercalation and randomizes cell polarity. It is possible that 1) cell-matrix (in basal cells) and 2) cell-cell (during intercalation) interactions may affect the measurements made in this study. In other words, that cell contacts could differ by position within the embryo and intercalation/polarity status... have such effects been accounted for in the current analysis?

      Here we only analyzed cell contacts deep in the chordamesoderm. Basal contacts were examined to some extent in Barua and Winklbauer, 2022, apical contacts not yet. Our present analysis is based on sagittal sections. The cells in the chordamesoderm are elongated and aligned mediolaterally but not in register, i.e. they are randomly wedged between each other. Thus, all mediolateral positions in cells should be present in our samples. Nevertheless, trends in the occurrence of contacts related to medial-to-lateral positions on cells (e.g. recognizable in spindle-shaped cells as wide vs narrow cell cross-sections) may have escaped our attention, and in particular, the protrusion-bearing medial and lateral ends of cells may develop special contacts. However, our goal in this study was to analyse basic properties of cell-cell contacts in this tissue, as a foundation for further detailed studies.

      • In this study, the authors state that chordamesoderm movements are preserved in syndecan-4 morphants, and in their 2021 article (Barua et al) they state that convergent extension movements are accelerated. But another study describing this MO found that it causes severe convergent extension defects (Munoz et al, NCB, 2006). What explains this discrepancy?

      In their knockdown experiments, Munoz et al. find relatively mild axis defects in late gastrula/early neurula stage embryos while we studied the mid-gastrula. Perhaps defects develop during later stages in Syn-4 morphant embryos.

      Also, the syn-4 morphant showed in Fig. 1 appears more developmentally advanced than the other embryo... if the embryos are not stage matched it could affect the measurements and conclusions drawn from them.

      Stage matching was not possible since C-cad and FN morphants did not involute or engage in convergent extension (i.e. were arrested at the initial gastrula stage), Syn-4 morphants appeared to gastrulate faster than normally. Therefore, embryos were strictly time matched. A limitation remains, that the time course of cell contact development over gastrulation was considered low priority in this initial study and was thus not determined.

      • In figure 7, the authors plot relative adhesion (measured from contact angles) vs. contact width, then fit regression lines to the lower boundaries of these scatter plots. It is not clear why this analysis is focused only on the lower boundaries rather than considering the full spread of the data. Particularly for syn-4 morphants, whose values do not appear to be concentrated along the lower boundary. This analysis is further confused by the introduction of alpha*, which represents relative adhesiveness relative to the regression.

      The lower boundary line is most convenient to extract (Fig.7A’-E’). But we agree that the “interior” of the scatter plot distribution should also be analyzed. Using average adhesiveness gives rise to artifacts since the density of data points decreases strongly with contact width but also with distance from the lower boundary, leading to the preferential disappearance of large adhesiveness values for higher widths. Instead, we constructed a line tracing the highest density in the scatter plot near the lower boundary (Fig.7B’’-E’’), by determining the positions of adhesiveness distribution peaks in consecutive width brackets (new Fig.8, Fig.S3). We abstained from introducing alpha*.

      • Based on these regression lines alone, the authors conclude that all 4 conditions are similar enough to pool the data for further analysis. If these contacts have different properties, which the data in Figures 1-6 suggest they do, it seems inappropriate to pool them together.

      We no longer pooled the data, except in supplementary Fig.S4 where we consider angle distortion. Instead, we show in Fig.8 relative-adhesiveness frequency distributions for different treatments and width brackets. This emphasizes differences between the different adhesion factor depletions and shows that adhesiveness is not simply normal or log-normal distributed, in agreement with different contact types contributing differently though similarly to overall adhesion. It also allows to follow main peaks as they shift position with width, roughly in proportion to the lower surface boundary.

      Based on this pooling, the authors then conclude that relative adhesiveness increases linearly with contact width over the entire width range, regardless of adhesion factor depletion. This again assumes that all contacts (morphant and WT) are functionally equivalent, and that what is observed in morphant embryos in very wide contacts would also hold true in WT contacts. But because WT contacts occupy only a small portion of the width range, we cannot know how they would behave if scaled to be wider, and I am not convinced that very wide morphant contacts are representative of or functionally equivalent to WT. In other words, we cannot know that contact width is the only factor increasing their relative adhesion, given the experimental manipulations that structurally alter these contacts.

      Although differences between contact types are apparent, we think that the contacts function very similarly. We still hold that relative adhesiveness increases with contact width, as seen in each of the separate plots for wt and adhesion factor depletions. But re-evaluating the alpha-width scatter plots now we show that in the narrow width range of normal chordamesoderm, C-cad, FN and Has depletions show similar, significantly decreased relative adhesiveness (Fig.7A-E). With alpha proportional to width, and width strongly increased in morphants, this initial decrease is compensated in total adhesiveness averages. The relative independence of adhesiveness from contact type could hint at non-specific PCM-PCM adhesion (Winklbauer, 2019). We think that although adhesion factor depletion leads to the loss of some contact types or renders others non-adhesive (thus lowering contact abundances), it modifies some contact types (e.g. by widening them) while only moderately lowering their adhesiveness per unit interaction surface.

      Minor comments<br /> - In their descriptions of PCM in different experimental conditions, the authors overstate some conclusions drawn from EM data. For example, that type I glycocalyces are absent in chordamesoderm (although this signal is only reduced),

      We qualified the statement.

      or that because the Has2 morphant phenotype is intermediate between C-cad and fibronectin morphants this indicates an adhesive role for hyaluronic acid.

      Overall, Has2MO increases the abundance of gaps, i.e. HA normally reduces gaps between cells, strongly suggesting an adhesive role of HA. HA is also required for the formation of 10-20 nm gaps, again proposing a direct or at least indirect adhesion-promoting role.

      • The authors state of the data in figure 1 that "All treatments significantly increase the size of non-adhesive gaps", but they don't show a quantification of the gaps size (they show the abundance).

      Has been corrected.

      • The authors state that LSM contacts exist as 10-20 and 20-50 nm subtypes. It is not clear what about the data suggest this division.

      In the LSM width difference spectra, CadMO and SynMO both increase the abundances of ≤ 20 nm contacts and decrease those of 20-50 nm contacts (Fig.4). The different response suggests at least two differently reacting subtypes.

      • In the same paragraph, the authors state that "C-cad and Syn-4... favor LSM width between 20-50 nm." What is meant by "favor"? Given that the number of 20 nm contacts is increased and 50 nm contacts is decreased in both conditions, this statement is unclear.

      The whole paragraph has been reworded.

      • On page 7, the authors say that the size of LSM structures is "consistent with larger plaques being assembled from small units", but if that were the case, wouldn't the plaque sizes be multiples of the size of a single unit? I.e. 100, 200, and 300 nm peaks? Because this is not the case, the data seem more consistent with a continuous range of LSM plaque sizes than with discrete units.

      The size of the units has a peak at 100 nm but a long tail (Fig.6F-H). Moreover, we discuss lateral compression (piling up of PCM material) or active stretching of plaques (to separate units for interdigitation), all factors that would blur plaque length patterns, i.e. we did not expect plaque sizes to be multiples of 100 nm.

      • On page 8, the authors refer repeatedly to LSM volume. Given that these measurements are made from TEM sections, how is volume being measured?

      This is explained now (p.7).

      • The authors present a model in which PCM interdigitates within cell contacts, but this is based on measurements from static tissues alone. Could the measurements of contact width instead be explained by compression of the PCM or some other mechanism? The data as presented don't rule out such possibilities.

      The model is in agreement with the linear increase of relative adhesiveness with contact width, with LSM height at gap surfaces not adding up to adjacent contact width, with visible interdigitation of glycocalyx units (“bushes”) described previously for prechordal mesoderm (Barua et al. 2021), and with the good agreement of calculated unit size with the size of measured LSM units. In addition, it agrees with literature data on endothelial glycocalyx plaques being composed of 100 nm units and of complete interpenetration of glycocalyces during blood cell adhesion.

      Some terms used are not clear, for example: "partial LSM", "triple layer contact", "random removal [of LSM plaques]".

      We point out the meaning of the terms now more clearly. That “partial LSM” is identical with “triple layer contact” (but shorter, for use in figure) is explained in the legend to fig.6.

      • In figure 5, the graphs depict negative "abundance". Recommend "difference in abundance" instead.

      Done. For shortness, Δ Abundance.

      • Statistics: In figure 1I, it is not clear what the asterisk in this graph means or if statistical differences between these groups was determined. And in figure 6, some groups are marked as n.s., but P values for groups that are statistically different are not presented.

      The asterisk in fig.1I was meant to indicate that this column is from Debanjan et al. 2021, but this is indicated by different shading and mentioned in the legend. The non-used n.s. marks were removed.

      Reviewer #2 (Significance):

      This detailed electron microscopy study advances our understanding of pericellular matrix within vertebrate embryos and how loss of its constituent molecules affects cell interactions. It further addresses the relationship between structurally distinct pericellular matrices and their adhesive properties, although this analysis is less convincing. This study adds to a body of literature in which cell-cell and cell-matrix adhesion are known to regulate morphogenetic cell movements, but how such contacts are remodeled as cells rearrange is poorly understood. Previous work has also used measurements from live cells, embryos, and tissues to infer physical forces within embryos such as adhesive strength, cortical tension, and viscosity. This work follows up directly on a previous study from this group that characterized glycocalyces within various tissues within Xenopus gastrulae by electron microscopy. The hypothesis that pericellular matrix enables flexible/fluid adhesion within highly dynamic embryonic tissues is exciting, and is likely to be of interest to developmental biologists - particularly those who apply mechanical concepts to embryos. However, additional evidence, preferably from live tissues and embryos, is needed to support this hypothesis. This assessment is based on over 15 years' experience studying gastrulation morphogenesis in multiple vertebrate species.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      During gastrulation, cells within vertebrate embryos require the ability to both adhere to one another and rearrange with their neighbors to shape the emerging body plan. These authors posit that such flexible adhesive contacts are mediated in part by the pericellular matrix (PCM), including multiple types of glycocalyces containing molecules such as fibronectin, hyaluronic acid, and syndecans, which they previously characterized in multiple embryonic tissues (Barua et al, PNAS, 2021). Here, in a follow-up to their 2021 study, the authors use electron microscopy to characterize the pericellular matrix within the chordamesoderm of Xenopus gastrulae. They identify several types of adhesive contacts within the chordamesoderm and assess how they are altered in the absence of key PCM molecules via morpholino knock-down. They conclude that syndecan-4 and hyaluronic acid comprise and promote assembly of PCM plaques whereas fibronectin and C-cadherin anchor them to cell surfaces. Cell packing density is decreased upon loss of all 4 of these molecules, which the authors attribute to a decrease in the number of cell contacts without affecting the strength of the remaining contacts. They further conclude that adhesiveness increases linearly with contact width, and that this relationship is unaffected by loss of any aforementioned adhesive/ PCM molecules.

      Major comments:

      • Many conclusions in this manuscript are based on measurements of cell contact angles, which indicate the reduction of tension at cell contacts vs. free cell surfaces and thus relative adhesive strength. While this lab previously applied the same approach to live tissues (David et al, 2014), it is not clear to what extent such measurements accurately reflect adhesive strength in fixed tissues and/or electron micrographs. Especially given the issue of random sectioning planes, which cause distortion of contact angles. Although a correction was applied, the authors note this is not theoretically derived because the heterogeneity of gap sizes made such calculations too difficult. Indeed, it appears that the large gaps between cells within morphant embryos affect contact angle measurements, but if this is corrected for in any way, it is not mentioned. Because this is the sole measure of cell adhesion provided in the study, this reviewer is not convinced of the conclusion that loss of PCM components does not affect adhesive strength. Could such measurements not be made from live cells/tissues after manipulating PCM components, as the lab has done previously? Because the lab already has the necessary reagents and expertise for such experiments, the time and resources needed for such measurements shouldn't be prohibitive.
      • As mentioned above, these authors previously measured adhesive strength in live Xenopus cells and tissues (David et al, 2014). In that study, they found that C-cadherin MO reduced relative adhesiveness whereas the current study found that relative adhesiveness actually increases in this condition. What explains this discrepancy?
      • No control morpholinos are used, and for the morpholinos that are used, the doses are very large. An equally high dose of control MO should be used to ensure that all observed phenotypes are specific.
      • It appears that all the images analyzed were collected in the sagittal plane, and the analyses don't seem to consider the intrinsic polarity of the chordamesoderm. For example: cells in different positions within the tissue (basal vs. apical), or that WT chordamesoderm cells are mediolaterally polarized and actively intercalating whereas disruption of PCM components like fibronectin disrupts cell intercalation and randomizes cell polarity. It is possible that 1) cell-matrix (in basal cells) and 2) cell-cell (during intercalation) interactions may affect the measurements made in this study. In other words, that cell contacts could differ by position within the embryo and intercalation/polarity status... have such effects been accounted for in the current analysis?
      • In this study, the authors state that chordamesoderm movements are preserved in syndecan-4 morphants, and in their 2021 article (Barua et al) they state that convergent extension movements are accelerated. But another study describing this MO found that it causes severe convergent extension defects (Munoz et al, NCB, 2006). What explains this discrepancy? Also, the syn-4 morphant showed in Fig. 1 appears more developmentally advanced than the other embryo... if the embryos are not stage matched it could affect the measurements and conclusions drawn from them.
      • In figure 7, the authors plot relative adhesion (measured from contact angles) vs. contact width, then fit regression lines to the lower boundaries of these scatter plots. It is not clear why this analysis is focused only on the lower boundaries rather than considering the full spread of the data. Particularly for syn-4 morphants, whose values do not appear to be concentrated along the lower boundary. This analysis is further confused by the introduction of alpha*, which represents relative adhesiveness relative to the regression.
      • Based on these regression lines alone, the authors conclude that all 4 conditions are similar enough to pool the data for further analysis. If these contacts have different properties, which the data in Figures 1-6 suggest they do, it seems inappropriate to pool them together. Based on this pooling, the authors then conclude that relative adhesiveness increases linearly with contact width over the entire width range, regardless of adhesion factor depletion. This again assumes that all contacts (morphant and WT) are functionally equivalent, and that what is observed in morphant embryos in very wide contacts would also hold true in WT contacts. But because WT contacts occupy only a small portion of the width range, we cannot know how they would behave if scaled to be wider, and I am not convinced that very wide morphant contacts are representative of or functionally equivalent to WT. In other words, we cannot know that contact width is the only factor increasing their relative adhesion, given the experimental manipulations that structurally alter these contacts.

      Minor comments

      • In their descriptions of PCM in different experimental conditions, the authors overstate some conclusions drawn from EM data. For example, that type I glycocalyces are absent in chordamesoderm (although this signal is only reduced), or that because the Has2 morphant phenotype is intermediate between C-cad and fibronectin morphants this indicates an adhesive role for hyaluronic acid.
      • The authors state of the data in figure 1 that "All treatments significantly increase the size of non-adhesive gaps", but they don't show a quantification of the gaps size (they show the abundance).
      • The authors state that LSM contacts exist as 10-20 and 20-50 nm subtypes. It is not clear what about the data suggest this division.
      • In the same paragraph, the authors state that "C-cad and Syn-4... favor LSM width between 20-50 nm." What is meant by "favor"? Given that the number of 20 nm contacts is increased and 50 nm contacts is decreased in both conditions, this statement is unclear.
      • On page 7, the authors say that the size of LSM structures is "consistent with larger plaques being assembled from small units", but if that were the case, wouldn't the plaque sizes be multiples of the size of a single unit? I.e. 100, 200, and 300 nm peaks? Because this is not the case, the data seem more consistent with a continuous range of LSM plaque sizes than with discrete units.
      • On page 8, the authors refer repeatedly to LSM volume. Given that these measurements are made from TEM sections, how is volume being measured?
      • The authors present a model in which PCM interdigitates within cell contacts, but this is based on measurements from static tissues alone. Could the measurements of contact width instead be explained by compression of the PCM or some other mechanism? The data as presented don't rule out such possibilities.
      • Some terms used are not clear, for example: "partial LSM", "triple layer contact", "random removal [of LSM plaques]".
      • In figure 5, the graphs depict negative "abundance". Recommend "difference in abundance" instead.
      • Statistics: In figure 1I, it is not clear what the asterisk in this graph means or if statistical differences between these groups was determined. And in figure 6, some groups are marked as n.s., but P values for groups that are statistically different are not presented.

      Significance

      This detailed electron microscopy study advances our understanding of pericellular matrix within vertebrate embryos and how loss of its constituent molecules affects cell interactions. It further addresses the relationship between structurally distinct pericellular matrices and their adhesive properties, although this analysis is less convincing. This study adds to a body of literature in which cell-cell and cell-matrix adhesion are known to regulate morphogenetic cell movements, but how such contacts are remodeled as cells rearrange is poorly understood. Previous work has also used measurements from live cells, embryos, and tissues to infer physical forces within embryos such as adhesive strength, cortical tension, and viscosity. This work follows up directly on a previous study from this group that characterized glycocalyces within various tissues within Xenopus gastrulae by electron microscopy. The hypothesis that pericellular matrix enables flexible/fluid adhesion within highly dynamic embryonic tissues is exciting, and is likely to be of interest to developmental biologists - particularly those who apply mechanical concepts to embryos. However, additional evidence, preferably from live tissues and embryos, is needed to support this hypothesis. This assessment is based on over 15 years' experience studying gastrulation morphogenesis in multiple vertebrate species.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This work examines the relationship between cell-cell contacts and pericellular matrix in Xenopus chordamesoderm, which is a tissue actively involved in convergent extension during gastrulation. By lanthanum staining of pericellular materials, the authors found that different types of pericellular matrix are present in cell-cell contacts in the chordamesoderm, which may mediate cell-cell adhesion. Knockdown of C-cadherin, Syndecan-4, fibronectin, and hyaluronic acid leads to the reduced abundance of cell contacts and cell packing density, but this does not seem to affect convergent extension. Based on these observations, the authors propose a model in which cell-cell contacts involve the interdigitation of distinct pericellular matrix units.

      Major points:

      1. Knockdown of adhesion molecules separates cells and leads to wide contacts with large interstitial spaces. Data in figure 1 show loosely packed morphant chordamesoderm cells. Intuitively, these should reduce cell-cell adhesion. However, a main conclusion from this manuscript is that reduced abundance of narrower contacts does not decrease adhesiveness. Although depletion of adhesion molecules modifies but not abolishes a contact, non-attached free surfaces increase significantly in morphant cells. It is therefore not easy to understand that how reduced cell contacts have no effect on cell adhesion. Importantly, the adhesiveness was not experimentally tested.
      2. It is surprising that reduced cell contacts, at least narrower cell contacts, do not affect convergent extension. Does this mean that active cell behavior changes in the chordamesoderm, which are required for convergent extension, are independent of cell contact types?
      3. Although the formation and localization of pericellular materials are differentially affected after knockdown of adhesion molecules, there is no clear evidence showing that different types of pericellular matrix mediate cell-cell adhesion in the chordamesoderm. It is possible that the disrupted distribution of pericellular materials in morphants only represents a secondary consequence of changed cell contacts. This may be supported by the fact that knockdown of adhesion molecules reduces narrow contacts and increases LSM-free gaps.
      4. The relationship between contact width spectra and LSM is also very elusive. Again, changes in contact width or abundance and distribution of LSM may be indirectly caused by loss of adhesion molecules. Therefore, although knockdown of adhesion molecules leads to changes of LSM localization, it cannot be concluded that cell-cell contacts in chordamesoderm are mediated different types of pericellular matrix.
      5. In contrast to the present observations, works by others using the same morpholinos have shown that Cadherin-dependent cell adhesion, fibronectin-rich extracellular matrix, and Syndecan-4-regulated non-canonical Wnt signaling are required for convergent extension. These discrepancies need to be appropriately addressed.
      6. If LSM and LSM-free contacts are similarly adhesive, what will be role of LSM in cell adhesion and how cell adhesion is established in these LSM-free contacts?

      Minor points:

      1. It may be helpful to clearly define the pericellular matrix in this particular context and its relationship with LSM. It is also necessary to clarify whether the adhesion molecules examined in this work are considered as components of the pericellular matrix.
      2. In figure 1B, it appears that the Cadherin morphant has defects in chordamesoderm elongation and archenteron formation, suggesting impaired convergent extension.
      3. In figure 1C, the Syndecan-4 morphant gastrula clearly shows enhanced anteroposterior elongation of chordamesoderm and archenteron in comparison with the wild-type embryo. This seems to suggest that loss of Syndecan-4 promotes the movements of convergent extension. However, previous studies indicate that both gain and loss of Syndecan-4 impairs convergent extension.
      4. Ideally, in knockdown experiments, control embryos should be injected with corresponding mismatch morpholinos.
      5. In figure 1E, it is unclear what type of cell contacts the light green arrowheads indicate.
      6. Figure 1 legend, "(wt) is from Barua et al. 2021". I am not sure it is appropriate to use previously published data.
      7. There is no light blue arrowhead in figure 2, and in figure 3B and 3I, it seems that the same colored arrows are used to indicate different structures.
      8. Triple-layered contacts are not clearly defined.
      9. Page 2, "based on driven by" should be either "based on" or "driven by".
      10. Page 8, "selectin" should be "selecting".

      Significance

      Strengths:

      Demonstrated the effects of several adhesion molecules on the formation of cell contacts and pericellular matrix in Xenopus chordamesoderm.

      Limitations:

      The significance of chordamesoderm cell contact changes in convergent extension or gastrulation is not clear; there is no direct evidence showing the functional link between pericellular matrix, cell contacts and cell adhesion; the absence of effects on convergent extension after depletion of several adhesion molecules is not fully consistent with previous reports.

      Advance:

      This work likely provides some fundamental and methodological advances for studying cell-cell adhesion. It shows promise for elucidating mechanisms underlying the regulation of cell contact changes in tissues involved in morphogenetic movements.

      Audience:

      This work likely interests readership studying embryonic cell adhesion in the field of developmental biology and cell biology. It may be also potentially interesting for people working on glycocalyx pericellular matrix in adult tissues.

  3. Jul 2023
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Please find our point-to-point response to the reviewer’s comments below, where we marked all changes implemented in the manuscript in italics.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      With the emergence and spread of resistance to Artemisinin (ART), a key component of current frontline malaria combination therapies, there is a growing effort to understand the mechanisms that lead to ART resistance. Previous work has shown that ART resistant parasites harbour mutations in the Kelch13 protein, which in turn leads to reduced endocytosis of host haemoglobin. The digestion of haemoglobin is thought to be critical for the activation of the artemisinin endoperoxide bridge, leading to the production of free radicals and parasite death. However, the mechanisms by which the parasites endocytose host cell haemoglobin remain poorly understood.

      Previous work by the authors identified several proteins in the proximity of K13 using proximity-based labelling (BioID) (Birnbaum et al. 2020). The authors then went on to characterise several of these proteins, showing that when proteins including EPS15, AP2mu, UBP1 and KIC7 are disrupted, this leads to ART resistance and defects in endocytosis leading to the hypothesis that these two processes are inextricably linked.

      In this manuscript, Schmidt et al. set themselves the task of characterising more K13 component candidates identified in their previous work (Birnbaum et al. 2020) that were not previously validated or characterised. They chose 10 candidates and investigated their localisations, and colocalisation with K13, and their involvement in endocytosis and in vitro ART resistance, 2 processes mediated by K13 and some members of the K13 compartments

      The authors show that of their 10 candidates, only 4 can be co-localised with K13. Then, using a combination of targeted gene disruption (TGD) as well as knock sideways (KS), they characterised these 4 proteins found in the K13 compartment. They show that MyoF and KIC12 are involved in endocytosis and are important for parasite growth, however their disruption does not lead to a change in ART sensitivity. The authors also confirm the findings of their previous publication (Birnbaum et al. 2020), using a slightly different TGD

      (note from the authors: we apologise if this has not properly transpired from the manuscript but the difference between the TGDs is substantial and relevant: one has less than 3% of the protein left and hence can be considered to fully inactivate MCA2 and has a growth defect whereas the other contains about two thirds of the protein (1344 amino acids/~66% are left), has no growth defect, although it lacks the MCA2 domain (hence that domain can not be critical for the growth defect)),

      that MCA2 is involved in ART resistance, however they did not check whether its disruption impacts haemoglobin uptake. They also show that KIC11 is not involved in mediating haemoglobin uptake or ART resistance. To finish, the authors used AlphaFold to identify new domains in the proteins of the K13 compartment. This led them to the conclusion that vesicle trafficking domains are enriched in proteins of the K13 compartment involved in endocytosis and in vitro ART resistance.

      The majority of the experiments conducted by the authors are performed to a good standard in biological and technical replicates, with the correct controls. Their findings provide confirmation that their 4 candidate genes seem to be important for parasite growth, and show that some of their candidates are involved in endocytosis. While the KD and KS approaches employed by the authors to study their candidate genes each have their own advantages and can be excellent tools for studying a large sets or genes, this manuscript highlights the many limitations of these approaches. For example, the large tag used for the KS approach can mislocalise proteins or disrupt their function (as is the case for MyoF), resulting in spurious results, or indeed the inability to generate the tagged line (as is the case for MCA2). The KS approach also makes the results of a protein with a dual localisation, like KIC12, extremely difficult to interpret.

      We thank the reviewer for this thorough and insightful review.

      The limitations mentioned above were addressed in the response to the main points and a general detailed response in regards to the systems used for this research are added at the end of this rebuttal. Briefly summarised here: while we agree that there are limitations of the system used, we are convinced that

      • the advantages of using a large tag in most cases outweighs the drawbacks as it permits to track the inactivation of the target, if need be on the individual cell level

      • while not optimal for MyoF, the partial inactivation actually helps in its functional study as detailed in major point 23&28 or reviewer#3 major point 11: it shows a consistent correlation of the phenotype with different causes and degrees of inactivation (this is now better illustrated in Figure 1L1M). Further, regarding the concern of the large tag: the effect of the tag based on localisation was overestimated in the review by what seems to have been a mix up comparing numbers from MyoF with a number from MCA2 (there is a difference, but it is only small) (see reviewer#1 major point #23).

      • KS is the optimal method for most of the assays in this work (e.g. bloated food vacuole assays and RSAs); these assays would be impossible or difficult to use with other inactivation systems currently used in P. falciparum research (see details in the response to the specific points and after the rebuttal)

      In regards to the difficulty to interpret KIC12 data: this is only true for measuring absolute essentiality, everything else we believe we actually have the optimal method. If not KS, which method targets a specific pool of a protein with a dual localisastion? Again, our assays targeting the K13 pool and revealing the specific function would have been difficult or impossible with any other system.

      Ultimately the question is whether any other system would have resulted in a different conclusion on the function of the proteins studied. At present we are confident this would not be the case and other systems probably would not have delivered the specific functional data shown in this work. Clearly, more in depth work will provide more nuanced and detailed insights into the proteins analysed in this work and this likely will also include the use of other systems for specific aspects they are most suitable for. However, this (e.g. different complementations in a diCre cKO) is complex and therefore beyond what fits into this work which had the goal to assess which proteins are true positives for the K13 compartment and to place them into functional groups in regards to endocytosis.

      Moreover, the manuscript is disjointed at times, with the authors choosing to conduct certain experiments for only a subset of genes, but not for others. For example, considering that the aim of this paper was to identify more proteins involved in ART resistance and endocytosis, it is confusing why the authors do not perform the endocytosis assays for all their selected proteins, and why they do not do this for the proteins they identify in their domain search. There is significant room for improvement for this manuscript, and a generally interesting question.

      The reviewer remarks that not every experiment was done for every target. Based on the rebuttal we tried to amend this but also note that there was some sentiment by the reviewers to better stick to the point and not make the manuscript more disjointed. We attempted to balance that as much as possible and hope we were able to honour both aspects (amendments were done as detailed in the point by point response below).

      In regards to endocytosis and choice of targets: We did do endocytosis assays for all proteins that showed a growth phenotype upon inactivation in this work. We therefore assume the reviewer here refers to major point #40 asking for endocytosis assays with KIC4 and KIC5 (which were not studied in this manuscript) as well as MCA2 (point 17). We fully agree with the reviewer that this would fill a gap in the work on K13 compartment proteins but such assays are difficult with TGDs (there are issues with non-comparable samples and compensatory effects) and proteins that are not essential (and hence likely have a smaller impact on endocytosis when truncated). We nevertheless now carried them out, but due to the limitations to do this with these lines would be hesitant to draw definite conclusions (see major point 17 and 40 for details and outcomes).

      But in it's current format, other than confirming that MCA2 is involved in ART resistance (which was already known from the Birnbaum paper), the authors do not further expand our understanding of the link between ART resistance and endocytosis in this manuscript.

      We would like to point out that the importance of the K13 compartment and endocytosis goes beyond ART resistance (see e.g. also newly published papers on the K13 compartment in Toxoplasma, (Wan et al., 2023; Koreny et al., 2023)). Endocytosis is an essential and prominent process in blood stages. However, in contrast to processes such as invasion, our understanding about endocytosis is only rudimentary. Hence, this manuscript provides important insights on an emerging topic that in our opinion deserves more attention:

      • it identifies novel proteins at the K13 compartment and provides 2 new proteins in endocytosis (MyoF and KIC12); getting an as complete as possible list of proteins involved in the process will be critical to study and understand it

      • it leads to the realisation that not all growth-relevant proteins detected at the K13 compartment are needed for endocytosis

      • it provides domains and stage specificity of function for several K13 compartment proteins, overall bolstering the model of endocytosis in ART resistance and providing a framework critical to direct future studies on endocytosis and their detailed mechanistic function at the cytostome

      • the identified vesicle trafficking domains (for instance now also found in UBP1) are expected to strengthen the support for the role of endocytosis of the K13 compartment; this and also the above points are important as (based on the current literature) there still seems to be prominent sentiment in the field that (in part due to the involvement of UBP1 and K13) the cause of ART resistance is due to various unclearly defined stress response pathways

      • with MyoF it also shows the first protein in connection with the K13 compartment that acts downstream of the generation of hemoglobin-filled containers in the parasite and provides the first protein that explains the suspected involvement of actin in endocytosis (so far this was only based on CytD studies)

      Overall we therefore believe this manuscript contains critical information and a framework for future studies on endocytosis and the K13 compartment. We hope the relevance of endocytosis as one of the most prominent and essential processes in the parasites and the connection to various aspects linked with many commercial drugs (in addition to the role of endocytosis in ART resistance), is adequately explained in the introduction. We also would like to mention that the main focus of the work is reflected in the title of the manuscript which does not mention ART susceptibility.

      Major Comments

      1) line 31: please change defined to characterised - defined suggests that novel proteins were identified in this study, which is not the case.

      We apologise, but we do not fully understand this comment. We did identify novel proteins not before known to be at the K13 compartment (MCA2 (admittedly this one was likely but had not previously been verified), MyoF, KIC11 and KIC12). In our view "further defining the composition of the K13 compartment" therefore is an accurate statement. Additionally, the identification of previously not-discovered domains, the stage-specificity and function of these proteins helped to further define the K13 compartment.

      If the reviewer is referring to the fact that the proteins analysed in this study were taken from a previously generated list of hits, we would like to stress that the presence in such a list (obtained from a BioID, but also if from an IP etc) can not be equalled for them to be true positives, they are merely candidates that still need to be experimentally validated. This is what we did in this work to find out which further proteins from the list can be classified as K13 compartment proteins (for hits with lower FDRs this is even more relevant as illustrated by the fact that 6 of the here analysed hits were not at the K13 compartment). In an attempt to address this comment in the manuscript, we changed the wording of this sentence to (line 31): "Here we further defined the composition of the K13 compartment by analysing more hits from a previous BioID, showing that MyoF and MCA2 as well as Kelch13 interaction candidate (KIC) 11 and 12 are found at this site."

      2) line 37: please change 'second' to "another". As explained further below, the authors identified 3 classes of proteins (confer ART resistance + involved in HCCU, involved in HCCU only, or involved in neither).

      We realized that the groups description wasn’t clear in the abstract. Please see response to major comment #41 for a detailed answer to this (endocytosis is an overarching criterion, ART resistance is a subgroup and applies only to those proteins with a function in endocytosis in ring stages). To clarify this (see also major point #8) we added an explanation on the influence of stage-specificity of endocytosis on ART susceptibility to the introduction (line 76): In contrast to K13 which is only needed for endocytosis in ring stages (the stage relevant for in vitro ART resistance), some of these proteins (AP2µ and UBP1) are also needed for endocytosis in later stage parasites (Birnbaum et al., 2020). At least in the case of UBP1, this is associated with a higher fitness cost but lower resistance compared to K13 mutations (Behrens et al., 2021; Behrens et al., 2023). Hence, the stage-specificity of endocytosis functions is relevant for in vitro ART resistance: proteins influencing endocytosis in trophozoites are expected to have a high fitness cost whereas proteins not needed for endocytosis in rings would not be expected to influence resistance.” The abstract was changed in response to this and other comments and hope it is now clearer in regards to the groups.

      3) Line 40: You define KIC11 as essential but according to your data some parasites are still alive and replicating 2 cycles after induction of the knock sideways. Please consider changing "essential" to "important for asexual parasite growth".

      We fully agree with the reviewer, we reworded the sentence as suggested.

      4) Line 40: please change 'second group' to 'this group'

      We reworded this part of the abstract and it know reads: (line 38): “While this strengthened the link of the K13 compartment to endocytosis, many proteins of this group showed unusual domain combinations and large parasite-specific regions, indicating a high level of taxon-specific adaptation of this process.”

      5) line 41: state here that despite it being essential, it is unknown what it is involved in.

      With the newly added data we show that this protein either has a function in invasion or very early ring development although we did not see any evidence for the latter. We therefore changed the sentence to (line 43): “We here identified the first protein of this group that is important for asexual blood stage development and showed that it likely is involved in invasion*..” *

      6) Line 50: the authors should state here that there is actually a reversal in this trend over the last few years.

      Done as suggested.

      7) Line 54: please separate out the references for each of the two statements made in this line (a: that ART resistance is widespread in SEA, and b: that ART resistance is now in Africa) Reference 14 also seems to reference ART resistance in Amazonia - which is not covered by the statement made by the authors (in which case the authors should state ART is now present in Africa and South America). The authors should also reference PMID: 34279219 for their statement that ART resistance is now found in Africa (albeit a different mutation to the one found in SEA).

      Done as suggested.

      8) Line 65: it is also worth mentioning here that there are other mutations in proteins other than K13, such as AP2mu and UBP1 (PMID: 24994911;24270944) that can lead to ART resistance.

      As suggested by the reviewer, we included a sentence about non-K13 mutations linked with reduced ART susceptibility in the introduction (line 74): Beside K13 mutations in other genes, such as Coronin (Demas et al., 2018) UBP1 (Borrmann et al., 2013; Henrici et al., 2020b; Birnbaum et al., 2020; Simwela et al., 2020) or AP2µ (Henriques et al., 2014; Henrici et al., 2020b)* have also been linked with reduced ART susceptibility." *

      We here also added data on fitness cost that is related to this and is also relevant for the issue of proteins with a stage-specific function in endocytosis, making a transition for this statement which might help clarifying the grouping of K13 compartment proteins (see also major point #2).

      9) Line 80, 86: ref 43 is misused. Reference 43 refers to Maurer's clefts trafficking which takes place in the erythrocyte cytosol and is not involved in haemoglobin uptake as far as I know. Please replace ref 43 with one showing the role of actin in haemoglobin uptake.

      We thank the reviewer for pointing this out, Ref 43 was removed from the manuscript.

      10) Line 98: the authors state here that they 'identified' further candidates from the K13 proxiome. This suggests that they identified new proteins in this paper, when in fact the list was already generated in ref 26. All they did was characterise proteins from that list that were not previously characterised. The authors should therefore remove identified from this statement.

      We agree with the reviewer that we did not identify further candidates, we identified new K13 compartment proteins from the list of potential K13 compartment proteins. We therefore changed “identified further candidates” into “identified further K13 compartment proteins” (line 116). Please see also response to major comment #1.

      11) Line 107-108: it is not clear from this sentence why these proteins were left out of the initial analysis in Ref 26. A sentence here explaining this would be valuable for the reader.

      This is a good point. One reason why we did not analyse more in our previous publication was that we had to stop somewhere and adding more would have been very difficult to fit into what was already a packed paper. However, as shown in this work, the list does contain further interesting candidates (e.g. K13 compartment proteins that are involved in endocytosis).

      We altered the relevant part of the introduction to highlight that we previously analysed the top hits, clarifying that the 'remaining' hits analysed in this work were further down in the list. This now reads: (line 113)“We reasoned that due to the high number of proteins that turned out to belong to the K13 compartment when validating the top hits of the K13 BioID (Birnbaum et al., 2020), the remaining hits of these experiments might contain further proteins belonging to the K13 compartment.” We hope this clarifies that we simply moved further down in the candidate list.

      12) Line 117-123: The authors say that PF3D7_0204300, PF3D7_1117900 and PF3D7_1016200 were not studied because they were not in the top 10 hits. However, the current organisation of Supplementary Table 1 shows all 3 proteins among the top 10 hits (MyoF, KIC12, UIS14 and 0907200 being after them). I think the authors should reorganise their table. It is also unclear according to what the proteins in the table are ranked. Could the authors indicate the metric used for the ranking?

      We thank the reviewer for alerting us to this. The issue here is that the 3 non-analysed proteins belong to a 'lower stringency' group comprising hits significant with FDRThe information about ranking is now also included as “Table legend” in the revised manuscript and the Table heading has been changed to: List of putative K13 compartment proteins, proteins selected for further characterization in this manuscript are highlighted.”

      13) Line 129-141: Can the authors be clearer with their explanations of the identification of mutation Y1344Stop? One dataset (ref 61) shows that 52% of African parasites have a mutation in MCA2 in position 1344 leading to a STOP codon. But another dataset (ref 62) shows that the next base is also mutated, reverting the stop codon. That should have been seen in the first dataset as well. Could the authors please clarify.

      This mutation was first spotted in the MalariaGEN database (https://www.malariagen.net) (MalariaGEN et al., 2021), which allows online accessing of the data by using the “variant catalogue” tool, which is in a table format of frequency rather than in a sequence context. Hence, only after further research later on it became evident to us, that this mutation does not occur alone when looking at individual MCA2 sequences from patient samples in (Wichers et al., 2021b). We hope this is accurately reflected in our results section.

      14) Line 147: the authors say that MCA2 is expressed throughout the intraerythrocytic cycle as shown by live cell imaging. In Birnbaum et al 2020 fig 4I, the authors show that MCA2 is mainly expressed between 4 and 16hpi. But in Figure 1B of this manuscript there is a clear multiplication of MCA2 signal between trophozoite and schizont. How do the authors explain this discrepancy? Could expression of the truncated MCA2 be different than the full length? This cannot be assessed as expression and localisation of the full-length HA tag MCA2 is not shown in Schizonts.

      The key difference lies in transcription vs protein expression (usually protein levels peak after mRNA levels peak and - depending on turnover - protein levels can stay high even after mRNA levels have declined). Figure 4 of the Birnbaum et al paper presents transcriptomic data, but with a peak in trophozoites (The axis label in Fig. 4l of that publication is a bit confusing, as hour 0 is at the top, 48 h at the bottom; it is clearer in Fig. S13 of that paper) which would fit very well with the multiplication of the signal between trophozoites and schizonts mentioned by the reviewer. So, overall, the temporal peaks of transcripts and protein of that protein fit well.

      For the signal in rings: Likely the protein has a turnover rate that is sufficiently low for some protein to be taken into the new cycle after re-invasion. Also different transcriptomic datasets e.g. (Otto et al., 2010; Wichers et al., 2019; Subudhi et al., 2020) available on plasmoDB show some mRNA present across the complete asexual development cycle, with each dataset showing maximum peak at a slightly different stage.

      Even when located in foci and hence aiding detection of small amounts of protein (as is the case for MCA2-Y1344-GFP), the MCA2 signal in rings is not strong. For MCA2-TGD, the GFP signal is dispersed and therefore likely below our detection limit, while the same amount of protein concentrated at the K13 compartment is visible as foci in the MCA2-Y1344 cell line. Please note that MCA2-TGD has only 2.8% of the protein left whereas MCA2-Y1344 has 66.5% left and based on our manuscript is almost fully functional, hence fitting the different locations between the two versions.

      Overall we believe this shows that there are actually no significant discrepancies of the expression of the different MCA2 versions.

      15) Line 158: would it not have been more useful for the authors to have episomally expressed MCA2-3xHA in their MCA2Y1344STOP-GFPENDO line to make sure that the truncated protein is indeed going to the correct compartment? The experiments done by the authors suggests that the MCA2Y1344STOP goes to the right location but does not really confirm it.

      We appreciate the reviewers caution here. However, considering that MCA2Y1344STOP-GFPendo co-locates with mCherryK13 and endogenously HA-tagged full length MCA2 does the same to a similar extent, there is in our opinion little doubt that MCA2 is found at the K13 compartment and that this is similar with both constructs. If there are minor differences, these might as well occur if MCA2 is episomally (as suggested in the comment) instead of endogenously expressed. Given the limited insight, we therefore decided against the episomal overexpression (which due to its size of > 6000bp may also be somewhat less straight forward than it may sound).

      16) Line 191: it is stated that MCA2 confers resistance independently of the MCA domain, however in both the MCA2-TGD and MCA2Y1344STOP-GFPENDO parasites, the MCA domain is deleted, and for both parasites, there is resistance (albeit to a lower level in the MCA2Y1344STOP-GFPENDO line). Therefore, how can the authors state that the ART resistance is independent of the MCA domain? This statement should be that resistance is dependent on the loss of the MCA domain.

      We agree that this can’t be categorically excluded. However, a ~5 fold difference in ART sensitivity was observed between the parasites with MCA2 truncated at amino acid 57 compared to those with MCA at amino acid 1344 even though both do not contain the MCA2 domain. Hence, at least this difference is not dependent on the MCA2 domain. The larger construct missing the MCA domain shows only a very moderate reduction in RSA survival, again suggesting the MCA domain is not the main factor. We amended our statement in an attempt to more accurately reflect the data (line 487): This considerable reduction in ART susceptibility in the parasites with the truncation at MCA2 position 57 compared to the parasites still expressing 1344 amino acids of MCA2, despite both versions of the protein lacking the MCA domain, indicates that the influence on ART resistance is not, or only partially due to the MCA domain.” We would be hesitant to state the reviewer's conclusion that “resistance is dependent on the loss of the MCA domain”, as the larger construct missing the MCA2 domain has a milder RSA effect compared to MCA2-TGD, which suggests the reduction in ART susceptibility is independent of the MCA domain. These considerations also agree with the fact that the parasites with the longer MCA2 version (in contrast to the MCA2-TGD) do not have any detectable growth defect which indicates that the protein can fulfil its function without the MCA2 domain.

      17) Line 192: Why did the authors not check if MCA2 is involved in endocytosis? They state later on in the manuscript that they did not do endocytosis assays with TGD lines, however if the authors include the correct controls, this could be easily done. It would also be really interesting to see whether endocytosis gets progressively worse going from WT to MCA2Y1344STOP to MAC2TGD. This experiment (as well as doing endocytosis assays for KIC4 and KIC5 TGD lines) would drastically increase the impact of this study. These experiments would not take more than 3 weeks to perform, and would not require the generation of new lines.

      So far were very hesitant to do bloated FV assays with TGDs (even though TGDs were available for the genes encoding MCA2 and KIC4 and KIC5). The reason for this was:

      1. the fact that these proteins could be disrupted indicated either redundancy or only a partial effect on endocytosis which might lead to only small effects that likely are difficult to pick up in an assay scoring for the rather absolute phenotype of bloated vs non-bloated. Using the refined assay measuring FV size could partly amend this but we note that also FV without hemoglobin have a certain size, reducing the relative effect if there are smaller differences.
      2. a TGD line does not permit tightly controlled inactivation of the target which makes comparing the outcome of bloated food vacuole assays difficult if there are smaller growth and stage differences to the 3D7 control.
      3. in contrast to conditional inactivation parasites, the TGD lines had ample times to adapt to loss of the target protein (compensatory mechanisms are well known for endocytosis, for instance in clathrin mediated endocytosis loss of individual components can be compensated (Chen and Schmid, 2020)). We nevertheless see the reviewer's point that this should at least be attempted and now conducted these assays (see also major point 40). For MCA2 (as requested in this point), the data is shown in Figure S5C-E. This assay showed that in MCA2-TGD, MCA2Y1344STOP-GFPendo (similar to the 3D7 control) >95% of parasites developed bloated food vacuoles. Additionally, we also measured the parasite and food vacuole size of individual cells in an attempt to solve some of the problems with TGDs with such assays. In order to specifically solve problem 2 mentioned above, we analysed the food vacuoles of similarly sized parasites, however, they were non-distinguishable between the three lines. Of note, in agreement with the reduced parasite proliferation rate (Birnbaum et al., 2020) a general effect on parasite and food vacuole size was observed for MCA2-TGD parasites, indicating reduced development speed in these parasites. Hence, it is possible that a potential endocytosis reduction was accompanied by a slowed growth, and the comparison of similarly sized parasites may have obscured the effect. It is therefore not sure if there indeed is no endocytosis phenotype, although we can exclude a strong effect in trophozoites.

      Based on the RSA results at least rings can be expected to have a reduced endocytosis in the MCA2-TGD. Apart from options 1-3 mentioned above, it is therefore possible there is an effect restricted to rings, although in that case the reduced growth in trophozoites would be due to other functions of MCA2. Overall, we can conclude that the MCA2-TGD parasites do not have a strongly reduced endocytosis, but given the fact that the parasites are viable, this is not surprising. Whether the MCA2-TGD has no effect at all on endocytosis we would be very hesitant to postulate based on these results.

      18) The authors should consider re-organising the MCA2 section, first showing that the 3xHA tagged line colocalises with K13, then performing the new truncation.

      We attempted to re-organise as suggested but because we now included additional fluorescence microscopy images of schizont and merozoites (in response to reviewer 2 major comment 3) the main figure would become even larger. To prevent this, we kept the 3xHA data in the supplement.

      19) Line 197: Once again ref 43 is not correct to illustrate that actin/myosin is involved in endocytosis

      We thank the reviewer for pointing this out – we removed Ref 43.

      20) Line 202: the authors state that MyoF localises near the food vacuole from ring stage/trophs onwards. However, how can this statement be made in schizonts based on these images (Fig. 2A), where it doesn't look like MyoF is anywhere near the FV? This statement can only be made for schizonts if co-localised with a FV marker (which is done in Fig. 2B), however, based on the number of MyoF foci, it appears that this was not done for schizonts. Please either remove the statement that MyoF is near the food vacuole from trophs onwards (because it is only seen near the FV up until trophs) or show the data in Fig. 2B of schizonts to substantiate these claims.

      This is a valid point. We originally did not focus on schizonts because most markers end up in some focal area in the forming merozoite but other proteins (such as e.g. K13) also have one or more additional foci at the FV, making interpretation unclear, particularly if the schizont is still organizing to become fully segmented. This is why we generally focused the K13 co-localisations on the trophozoite stage to obtain the clearest information on endocytosis. However, given the fact that this manuscript gives the first localization of MyoF in P. falciparum parasites, we now provide a comprehensive time course (Figure 1C, S1A) including schizonts, which show quite a complex pattern: while the MyoF-GFP localization in trophozoites appeared as multiple foci close to K13 and also the FV, the MyoF-GFP pattern changes in late schizonts (fully segmented) and merozoites, appearing as elongated foci no longer close to K13 or the FV. Of note, this pattern has been previously reported for MyoE in P. berghei (Wall et al., 2019).

      We therefore revised the statement about MyoF localization in schizont to better reflect the observed localization: (line 175): In late schizonts and merozoite the MyoF-GFP signal was not associated with K13, but showed elongated GFP foci (Figure 1C, S2A) reminiscent of the MyoE signal previously reported in P. berghei schizonts (Wall et al., 2019).”

      21) Line 204-206: what does this statement bring to the paper? Is it to show that it is the real localisation of MyoF because 2 tag cell line show the same localisation? I don't think this is needed, especially as later in the manuscript an HA-tag MyoF line is used and show similar localisation.

      We see the reviewers point, but prefer to keep this data included in the supplement, particularly because potential differences in the location of tagged MyoF were a major concern.

      Related to the tag issue: in order to get a better understanding of the effect of C-terminally tagging with different sized tags we now performed a more detailed analysis of the MyoF-3xHA cell line (Figure S2F-G), showing that this cell line shows a growth rate similar to the 3D7 wild type parasites, and has less vesicles than the 2x-FKBP-GFP-2xFKBP cell line, but still slightly, but significantly more than 3D7 parasites. Overall, this indicates that the smaller 3xHA tag has less effect on the parasite, than the larger 2x-FKBP-GFP-2xFKBP tag (see also new Figure 1L, showing a correlation of level of inactivation and the endocytosis phenotype for MyoF).

      22) Line 212: The overlap of K13 with MyoF in Figure 2C 3rd panel (1st trophozoite panel) is not obvious, especially as the MyoF signal seems inexistant. I would advise the authors to replace with a better image. Also, why are there no images of schizonts shown in Figure 2C?

      As suggested we exchanged the trophozoite image of panel Figure 2 C (now Figure 1C) and expanded this panel with images covering the complete asexual development cycle including schizonts in response to this and the previous points. As indicated above (point 20), schizont stages are complex to interpret. While late schizonts likely are not very relevant for endocytosis this is the first description of the location of the protein in this parasite and we therefore now provide a more thorough representation of the MyoF location across asexual stages in Figure1C and S2A.

      23) Line 217: the spatial association of MyoF with K13 is very different when it is tagged with GFP and when it is tagged with 3xHA. The way the authors word it here, it seems that there is agreement with the two datasets, when this is not in fact the case (59% overlap for MyoF-GFP and only 16% overlap with MyoF-3xHA). These data suggest that the GFP and the multiple FKBP tags are doing something to the protein and therefore maybe the ensuing results using this line should not be trusted or be taken with a pinch of salt.

      We agree with the reviewer that the location of this MyoF-GFP in the cell might differ due to the partial inactivation but in contrast to this comment, the data does not indicate any large differences. It seems the reviewer mixed something up (the 59% mentioned might come from the MCA2 figure?). The data with the two lines with differently tagged MyoF co-localised with K13 are actually quite comparable: GFP-tagged vs HA-tagged MyoF overlapping with K13 was 8% vs 16% full overlap, 12% vs 19% partially overlapping foci, 36% vs 63% foci that were touching but not overlapping (compare what now is Figure 1D and Figure S2C). Only in the 'no overlap' there is a much smaller proportion in the HA-tagged line. However, given that these are IFAs which on the one hand are more sensitive to see small protein pools but on the other hand also have pitfalls due to fixing of the cells (e.g. tiny increase in focus size due to fixing could increase the number of touching foci that in live cells might be close but did not touch), some variation can be expected to the live cells. We agree though that the partly reduced functionality of MyoF might be the reason for the consistent tendency of a lower overlap even though the difference is much less than indicated in the comment. We added "with a tendency for higher overlap with K13 which might be due to the partial inactivation of the GFP-tagged MyoF" to the sentence "IFA confirmed the focal localisation of MyoF and its spatial association with mCherry-K13 foci"

      While we expect the fact that the difference between these parasites is only small somewhat reduces the "pinch of salt" with the MyoF line, we do agree that the partial functional inactivation of the GFP-tagged MyoF line may have some impact. However, we do not think that this means the results with the MyoF-GFP line are untrustworthy. On the contrary, it provides insights into its function that in some ways is equivalent to a knock down or TGD. Overall all the MyoF lines show: few vesicles occur in the MyoF-HA-line, more in the MyoF-GFP line and even more after knock sideways of MyoF-GFP. Importantly the severity of this phenotype correlates with the growth rates in these lines. Hence, together with the bloated food vacuole assays, this provides consistent data indicating that MyoF has a role in the transport of HCC to the FV and its level of activity correlates with the number of vesicles and growth. To better highlight this, it is now summarised in Figure 1M.

      24) Line 219: the authors state here that they could not detect MyoF-GFP in rings, when in Figure 2C they show MyoF-GFP in rings, and also show that they could detect MyoF in Sup Fig. 3B with the 3xHA tagged line. Is this a labelling mistake in Figure 2C? If the authors could indeed not see MoyF-GFP in rings, this statement should have been made when Figure 2A was presented, and not so late in the manuscript, which causes confusion.

      We thank the reviewer for pointing this out. We now provide a detailed time course (see also previous points) which shows that there is no detectable MyoF-GFP signal during ring stage development until the stage where the parasites starts the transition to trophozoites (i.e. MyoF-GFP signal could only be observed in parasites already containing hemozoin). In addition to the extended time course in Figure 1C (previously 2C) we included a panel of example ring stage images below to further highlight this. We also changed the labelling of the parasite with MyoF-GFP signal the reviewer mentions in Figure 1C to “late ring stage” (it already contains hemozoin) to clarify this.

      The description of Figure 1A is now changed to: (line 153) *“The tagged MyoF was detectable as foci close to the food vacuole from the stage parasites turned from late rings to young trophozoite stage onwards, while in schizonts multiple MyoF foci were visible (Figure 1A, S2A).” *

      Please see our answer to major comment #45 where we provide an explanation for the difference between MyoF-3xHA and MyoF-GFP signal in ring stage parasites.

      [Figure MyoF]

      25) Line 237: Showing a DNA marker (DAPI, Hoecht) for Figure 2E, and subsequent figures using mislocalisation to the nucleus, would help the reader assess efficiency of the mislocalisation.

      Please see response to major comment #64 for a detailed answer on why we did not include DNA staining in the imaging used to assess mislocalization upon knock-sideways.

      26) Line 254-256: authors should show the results of the bloating assay for parental 3D7 parasites (+ and - rapalog) to see whether the MyoF line - rapalog has increased baseline bloating. This applies to all subsequent FV bloating assays.

      We did do several controls for bloated assays (including +/- rapalog of an irrelevant knock sideways line as well as using a chemical insult for which the control was 3D7 without treatment) in previous work (Birnbaum et al., 2020), which indicated that there is no effect of rapalog to reduce bloating. Although these controls are more stringent, we nevertheless did a 3D7 +/- rapalog control and added this to the manuscript (Figure S2I). As it is not possible to do this side by side with the assays that are already in the manuscript and the +/- rapalog 3D7 cells consistently showed no or very low numbers of cells without bloating (and stringent controls in the past equally did not show an effect), we believe adding this control once suffices.

      27) Line 254-257: The authors say that because fewer parasites show a bloated food vacuole upon inactivation of MyoF it means that less hemoglobin reached the food vacuole. I understand the authors statement, however, shouldn't they look at the size of the food vacuole, instead of the number of parasites with bloated FV, to make such a statement? This has been done for KIC12 so why not doing it for MyoF?

      This was now done and is provided as Figure 1J-K, S2J. The results confirm the assessment scoring bloated vs non-boated food vacuoles.

      28) Line 259-261: these results would be difficult to interpret namely because the authors have dying parasites, which is exacerbated with the protein being knocked sideways. The authors should mention the pitfalls their knock sideways and tagging design here. Line 260-261: RSA is an assay relying on measuring parasite growth 1 cycle after a challenge with ART for 6 hours.

      Fortunately, this concern is unfounded, as the survival (measured by parasitemia after one cycle) of the same sample + and - DHA is assessed, isolating the DHA effect independent of potential growth defects which are cancelled out. Hence, if there were parasites dying in the MyoF line (please note that they might not actually die, but simply grow more slowly), this factor applies for both the + and - ART condition. As we are testing for a decreased susceptibility to ART which would manifest as an increased survival in RSA surfacing above 1%, antagonistic effects of reduced MyoF function and ART treatment would not result in detectable differences as without effect, the RSA survival is always close to zero.

      The same applies for the knock sideways where we assess the survival of +rapalog between +ART and -ART. If the reduced MyoF activity of the knock sideways leads to a decreased survival, this applies to both +ART and -ART. Please also note that rapalog was lifted after the DHA pulse (see e.g. Figure S2K).

      That effects on growth are cancelled out is nicely illustrated for proteins where there is a stronger and more rapid effect on growth upon their conditional inactivation. For instance when KIC7 is knocked aside, there is a considerable increased of RSA survival, even though continued inactivation of KIC7 would have a severe growth defect (Birnbaum et al., 2020). Vice versa, a growth defect alone does not result in reduced RSA susceptibility as evident from knock sideways of an unrelated protein or using a chemical insult (Figure 4H in (Birnbaum et al., 2020) or simply slowing the ring stage by e.g. reducing EXP1 levels (Mesén-Ramírez et al., 2019). Hence, a growth reduction is not expected to alter the RSA outcome. And even if it did, it would only lead to an underestimation of the readout if growth is too severely affected (which would be obvious in the + rapalog without DHA sample, which was not the case).

      In that respect it is valuable to have the rapid kinetics of knock sideways which permit inactivation of a protein before severe growth defects occur (although the only partial responsiveness of MyoF clearly is not the most optimal). In contrast, the absolute loss of a gene (as is the case if diCre is used) prevents (or at least makes it extremely difficult as the timing would need to exactly hit sufficient protein reduction without killing the parasite until the end of the RSA) using this system in these experiments (again see (Mesén-Ramírez et al., 2021) where in a EXP1 diCre based knock out RSA was only possible because we complemented with a lowly, episomally expressed EXP1 copy to have parasites with only a partial phenotype to do this assay).

      29) Line 261-263: the authors sate that MyoF has a function in endocytosis but at a different step compared to K13 compartment proteins. I am not sure what they mean here. Can this be clarified?

      The different steps in endocytosis are explained in the introduction and we now tried to further clarify this (line 98). So far VPS45 (Jonscher et al., 2019), Rbsn5 (Sabitzki et al., 2023), Rab5b (Sabitzki et al., 2023), the phosphoinositide-binding protein PX1 (Mukherjee et al., 2022), the host enzyme peroxiredoxin 6 (Wagner et al., 2022) and K13 and some of its compartment proteins (Eps15, AP2µ, KIC7, UBP1) (Birnbaum et al., 2020) have been reported to act at different steps in the endocytic uptake pathway of hemoglobin. While inactivation of VPS45, Rbsn5, Rab5b, PX1 or actin resulted in an accumulation of hemoglobin filled vesicles (Lazarus et al., 2008; Jonscher et al., 2019; Mukherjee et al., 2022; Sabitzki et al., 2023), indicative of a block during endosomal transport (late steps in endocytosis), no such vesicles were observed upon inactivation of K13 and its compartment proteins (Birnbaum et al., 2020), suggesting a role of these proteins during initiation of endocytosis (early steps in endocytosis).

      VPS45 has not apparent spatial connection to the K13 compartment but the fact that MyoF does - and its inactivation also results in vesicle accumulation - indicates that it is downstream of vesicle initiation, providing the first connection from the initiation phase to the transport phase. More evidence for these different steps of endocytosis has been published in a recent preprint from our lab, where we simultaneously inactivated a protein of both “endocytosis steps” (Sabitzki et al., 2023).

      To clarify this in the results as requested, we changed the statement to: (line 256) Overall, our results indicate a close association of MyoF foci with the K13 compartment and a role of MyoF in endocytosis albeit not in rings and at a step in the endocytosis pathway when hemoglobin-filled vesicles had already formed and hence is subsequent to the function of the other so far known K13 compartment proteins.”

      30) Do the authors mean that it is involved in endocytosis but not in ART resistance? If so, this is a very difficult statement to make since the parasites are dying. Is there any evidence of point mutations in MyoF in the field?

      We split this point to address all issues raised here. Please see response to point 29 which clarifies that this was meant in a different way and our response to point 28 which explains why the dying parasite issue is not expected to affect the RSA (please also note that we do not have evidence of actually dying parasites in the MyoF-2xFKBP-GFP-2xFKBP line, most likely the growth is slowed).

      The mutation issue is interesting. In fact evidence exists that MyoF mutations may be associated with resistance (Cerqueira et al., 2017) (please note that there it is still called MyoC) but in a recent preprint from our lab we did not find any evidence for a significantly changed RSA survival in 12 tested mutations in the corresponding gene (Behrens et al., 2023).

      To clarify this we added the following statement to the discussion (line 709): "Of note, mutations in myoF have previously been found to be associated with reduced ART susceptibility (Cerqueira et al., 2017), but 12 mutations tested in the laboratory strain 3D7 did not result in increased RSA survival (Behrens et al., 2023)*. *

      31) Line 298: the authors state that there is no growth defect in the first cycle when rapalog is added to the KIC11 line, however based on Figure 3D, there is evidently a 25% reduction in growth compared to - rapalog at day 1 post treatment, and a 60% reduction by day 2, which is still within the 1st growth cycle. The authors should either revise their statement or provide an explanation for these findings. The authors should also explain why their Giemsa data in Fig. 3E is not in accordance with their FACS data.

      We think there is a misunderstanding here, as our figure legend was not detailed enough and we apologise if this had been misleading. The growth effect is restricted to invasion or possibly the first hours of ring stage development (see point 4&5, reviewer 2), which in asynchronous cultures more rapidly takes effect as the culture also contains schizonts that immediately generate cells that re-invade but can't due to inactivation of KIC11 (due to the rapid action of the knock sideways, KIC11 is already inactivated). In contrast, in highly synchronous cultures, this effect can only be evident once the parasites reached the schizont stage (starting with rings this takes close to 2 days). We now clarify that Figure 2E (previously Figure 3D) shows growth data obtained with an asynchronous parasite culture, while in Figure 2F the growth assay is performed with tightly synchronized (4h window) parasites as stated in the Figure legend.

      We now explicitly state in each Figure legend and for each growth experiment throughout the manuscript whether we used asynchronous or synchronized parasites for growth assays.

      Related to this, the incorrect y-axis label of what is now Figure 2E mentioned in major comment #58 is now corrected.

      32) Line 301: KIC11 could also be important very early for establishment of the ring stage for example for establishment of the PV. Also, was mislocalisation assessed in rapalog-treated parasites at 72 hours or in cycle 3?

      This is a valid point and this has now been addressed. We performed an invasion/egress assay revealing similar schizont rupture rates, but significantly reduced numbers of newly formed ring stage parasites (Figure 2H, S3G), indicating an effect of KIC11 inactivation either on invasion or possibly the first hours of ring stage development. A very similar point was raised by Reviewer 2, please see reviewer 2; major comment #4. This is now also reflected in line 302, which now reads: ”… indicating an invasion defect or an effect on parasite viability in merozoites or early rings but no effect on other parasite stages (Figure 2F-H, Figure S3F-G).”

      We further included an assessment of mislocalization 80 hours after the induction of knock-sideways by addition of rapalog in Figure S3E which showed mislocalization of KIC11 to the nucleus.

      33) Line 311: the authors should change the sentence from 'not related to endocytosis' to 'not related to endocytosis or ART resistance'.

      Done as suggested.

      34) Line 323-325: Authors say that a nuclear GFP signal can be observed in early schizonts for KIC12. According to the pictures provided in Figure 4A and Figure S5A it is not very obvious. Also faint cytoplasmic GFP signal could only be background as we can see that exposure is higher for schizont pictures

      We changed the sentence (line 339) to: “…nuclear signal and a faint uniform cytoplasmic GFP signal was detected in late trophozoites and early schizonts and these signals were absent in later schizonts and merozoites (Figure 3A, Figure S4A,B).” in order to emphasize that the nuclear signal disappears early during schizont development.

      35) Line 326-328: The authors say that kic12 transcriptional profile indicate mRNA levels peak (no s at peak) in merozoites. Should they show live cell imaging of merozoites then? Because from the Figure 4A schizont pictures where schizonts are almost fully segmented no signal can be observed.

      The observation that mRNA levels of early ring stage expressed proteins tend to increase already in mature schizonts and merozoites is well established (e.g. (Bozdech et al., 2003)). A very good example for this are exported proteins of which most show a transcription peak in schizonts but the proteins are only detected in rings see e.g. (Marti et al., 2004). Hence, our observation for KIC12 is quite typical.

      We originally did not include merozoites, as in the last row of Figure 3B fully developed merozoites within a schizont with already ruptured PVM are shown and no GFP signal can be detected in these parasites. We now provide images of free merozoites in Figure S4A-B showing again no detectable GFP signal.

      We thank the reviewer for pointing out the typo, "peak" has been corrected.

      36) Line 347: The authors state that using the Lyn mislocaliser the nuclear pool of KIC12 is inactivated by mislocalisation to the PPM. This tends to suggest that only the nuclear pool of KIC12 is mislocalised. How is it possible that only the nuclear pool is mislocalised?

      The Lyn mislocaliser is at the PPM which is continuous with the cytostomal neck where the K13 compartment likely is found. The effect of the Lyn mislocalizer on the KIC12 protein pool localizing at the K13 compartment is therefore somewhat unclear. For this reason we already had the following statement in the original submission (line 400): “Foci were still detected in the parasite periphery and it is unclear whether these remained with the K13 compartment or were also in some way affected by the Lyn-mislocaliser.” We would like to stress here that the same does not apply to the nuclear mislocaliser, which is only a trafficking signal delivering KIC12 to the nucleus and hence likely does not affect the nuclear pool of KIC12, only the K13 compartment pool (the main interest of this manuscript).

      We realised that the statement towards the end of this paragraph was unnecessarily ambiguous in regards to the K13 compartment pool of KIC12 which might have caused some confusion about the function of this pool of KIC12 and therefore modified it to (line 374): "Due to the possible influence on the K13 compartment located foci of KIC12 with the Lyn mislocaliser, a clear interpretation in regard to the functional importance of the nuclear pool of KIC12 other than that it confirms the importance of this protein for asexual blood stages is not possible. In contrast, the results with the nuclear mislocaliser indicate that the K13 located pool of KIC12 is important for efficient parasite growth.". It is also important to note that this limitation does not apply to the NLS knock sideways in regard to the K13 compartment and that the endocytosis function of this pool of KIC12 seems solid which with this statement is enforced.

      37) Line 368-369: Effect was also only partial for MyoF. Why didn't you measure the same metrics for MyoF?

      This was now done and is provided as Figure 1J-K, S2J, confirming our previous interpretation, see also point #27 which raises the same point.

      38) Line 379: you don't know if all proteins acting later in endocytosis will have an increased number of vesicles as a phenotype

      This is based on our current definition as stated in the introduction. It assumes a directional vesicular transport of hemoglobin to the food vacuole where inhibition of early stages will prevent transport before HCC-filled autonomous vesicular containers have formed and entered the cell. In contrast later inhibition stops such containers from further transport, leading to their accumulation. Such an accumulation is visible after VPS45-inactivation and other proteins (Jonscher et al., 2019; Mukherjee et al., 2022; Sabitzki et al., 2023) or treatment with cytochalasin D (Lazarus et al., 2008). While it is possible that there may be smaller intermediates formed at the K13 compartment that later on unite or fuse with the compartment evident after VPS45 inactivation and these might be missed due to small size (i.e. inhibition of a step between K13 compartment and an early endosome or equivalent), this would still be upstream of the VPS45 induced containers and hence would be earlier. We therefore believe that based on the framework given in the introduction (see also (Spielmann et al., 2020)) to assume that a phenotype manifesting as reduced food vacuole bloating without formation of detectable vesicles likely signifies inhibition of the process early whereas reduced bloating but with vesicles signifies inhibition later in the process.

      39) Line 413-414: The authors state that no growth defect was observed upon KS of 1365800. Is growth alone enough to say that there is no impact on endocytosis?

      This is an interesting point. The endocytosis proteins we studied so far indicate that efficient impairment of endocytosis manifests as a severe growth defect. Hence, lack of a growth defect can be assumed to be an indicator for absence of an important role for endocytosis (or any other growth relevant process). Clearly there is a gradual response, such as seen in the different MyoF versions resulting in proportional growth and vesicle appearance phenotypes. Hence, a protein with a minor role might have slipped our attention but then it probably is also not a very important protein in endocytosis.

      To further strengthen our assessment of PF3D7_1365800 importance for asexual blood stage development, we now also generated a cell line expressing the PPM Mislocalizer, enabling knock sideways to the PPM. This was done because this protein consistently has a focus at the nucleus that may be within the nucleus. Again this revealed no growth defect upon inactivation (Figure S7D).

      40) Line 432: in this section, the authors state that KIC4 and KIC5 seem to have domains that may suggest these proteins are involved in endocytosis, based on the alpha fold data that is publicly available. Considering the authors have TGD-SLI versions of these lines (Birnbaum et al. 2020) and have already confirmed in this previous publication that they confer resistance to ART; it would make sense to look at endocytosis for these genes. This would be a relatively simple and straightforward experiment, taking no longer than two to three weeks, and would require no additional reagents or line generation. Doing these experiments would add a lot more weight to this final section. The authors later state that KIC4 and 5 are TGD lines, so not the best for endocytosis assays. It is unclear why this would be difficult to do if an adequate control is contained in the experiment (such as parental 3D7). It explains why they did not perform the MCA2 endocytosis assays further up, but in my opinion, an attempt at doing these assays is important and would significantly increase the impact of this paper. Identical as major comment #17.

      As stated in the manuscript and above, we were originally hesitant to do these assays due to the fact that we can't induce inactivation which is less ideal than comparing the identical parasite population split into plus and minus and is further complicated by the likely smaller effect as the TGDs still permitted growth. However, we see the point of the reviewer and now performed these assays using 3D7 as controls and taking extra care to account for stage differences between the TGD lines and 3D7. However, there was no significant difference in the bloated food vacuole assays with these cell lines. Due to the reasons mentioned in major point 17, we are not sure this indeed means these proteins have no role in endocytosis. One possible reason why we were able to obtain these TGDs may have been because the effect on endocytosis is less than in the essential proteins (or is ring stage specific) and in a TGD an endocytosis defect may therefore not be detectable with our assays (see details and further possible explanations in response to point 17).

      In an attempt to address the TGD issue, we generated knock sideways cell lines for KIC4 and KIC5. Unfortunately, the mislocalization of KIC5 to the nucleus was inefficient (see figure below). As this did not result in a growth defect (in contrast to the clear KIC5-TGD growth defect (Birnbaum et al., 2020)), this line is not suitable to study a potential role of this protein in endocytosis. Therefore, we performed the bloated food vacuole assay only with KIC4-2xFKBP-GFP-2xFKBPendo+1xNLSmislocaliser parasites. However, this revealed no effect on HHC uptake, which is in line with the normal growth of KIC4-TGD parasites (Birnbaum et al., 2020) and suggests that this protein could only have a minor or redundant role in endocytosis (it is the line that shows the smallest effect in RSA). As the KIC4 and KIC5 knock sideway lines did not permit any conclusions, we did not include them into the revised manuscript but they can be found here:

      [Figure KIC4 knock sideways & KIC5 knocksideways]

      Figure legend: (A) Live-cell microscopy of knock sideways (+ rapalog) and control (without rapalog) KIC4-2xFKBP-GFP-2xFKBPendo+ 1xNLS mislocaliser parasites 4 and 20 hours after the induction of knock-sideways by addition of rapalog. Scale bar, 5 µm. Relative growth of asynchronous KIC4-2xFKBP-GFP-2xFKBPendo+1xNLSmislocaliser plus rapalog compared with control parasites over five days. Three independent experiments were performed. Growth of knock sideways (+ rapalog) compared to control (without rapalog) KIC4-2xFKBP-GFP-2xFKBPendo+1xNLSmislocaliser (blue) or KIC5-2xFKBP-GFP-2xFKBPendo+1xNLSmislocaliser (red) parasites over five days. Mean relative parasitemia ± SD is shown. (B) Live-cell microscopy of knock sideways (+ rapalog) and control (without rapalog) KIC5-2xFKBP-GFP-2xFKBPendo+1xNLSmislocaliser parasites 4 and 20 hours after the induction of knock-sideways by addition of rapalog. Scale bar, 5 µm. Growth of asynchronous KIC5-2xFKBP-GFP-2xFKBPendo+ 1xNLSmislocaliser plus rapalog compared with control parasites over five days. Four independent experiments were performed. __(C) __Bloated food vacuole assay with KIC4-2xFKBP-GFP-2xFKBPendo+1xNLSmislocaliser parasites 8 hours after inactivation of KIC4 (+rapalog). Cells were categorized as with ‘bloated FV’ or ‘non-bloated FV’ and percentage of cells with bloated FV is displayed; n = 3 independent experiments with each n=19-30 (mean 21.4) parasites analysed per condition. Representative DIC are displayed. Area of the FV, area of the parasite and area of FV divided by area of the corresponding parasites were determined. Mean of each independent experiment indicated by coloured symbols, individual datapoints by grey dots. Data presented according to SuperPlot guidelines (Lord et al., 2020); Error bars represent mean ± SD. P-value determined by paired t-test. Area of FV of individual cells plotted versus the area of the corresponding parasite. Line represents linear regression with error indicated by dashed line.

      41) Line 490-493: the authors state that the K13 compartment proteins fall in two groups, some that are involved in ART resistance AND endocytosis, and some that have different functions. However, in this manuscript the authors have demonstrated 3 flavours that K13 compartment proteins can come in: • Some that confer ART resistance and are involved in HCCU (MCA2) • Some that are involved in HCCU but not ART resistance (MyoF & KIC12) • Some that are involved in neither (KIC11) The authors should therefore revise this statement.

      We agree that this was not well phrased. To account for the fact that not all endocytosis proteins confer increased RSA survival to the parasites when inactivated we changed this statement (line 604): "This analysis suggests that proteins detected at the K13 compartment can be classified into at least two groups of which one comprises proteins involved in endocytosis or in vitro ART resistance whereas the other group might have different functions yet to be discovered.

      Generally, we believe that endocytosis is the overarching criterion and we therefore would like to keep the definitions of the main groups (endocytosis or not). As indicated by the title, the focus of the manuscript is on the K13 compartment for which so far endocytosis is the only experimentally associated function. That this group contains proteins that do not confer reduced ART susceptibility when conditionally inactivated (KIC12 and MyoF) is explained by their stage-specificity, making this a subgroup of the overarching endocytosis group.

      We realise that with the endocytosis data on the KIC4, KIC5 and MCA2 TGD there is now also a subgroup we were unable to demonstrate an endocytosis effect in trophozoites although they show changes in RSA survival. However, as indicated above, we would be hesitant to fully exclude some role of these proteins in endocytosis in rings. Particularly as a comparably small reduction in endocytosis protein activity or abundance is sufficient to increase RSA survival (Behrens et al., 2023). A principal classification of "endocytosis or ART resistance" or "neither endocytosis nor ART resistance" still accounts for this and therefore seems to us to be the most useful, particularly also in light of our domain identification that then can be linked with one or the other group.

      42) Line 508: the authors state that they expanded the repertoire of K13 compartments, when in fact they functionally analysed them - they did not do another BioID to identify more candidates.

      We respectfully disagree with the reviewer in this point, we did expand the repertoire of known K13 compartment proteins. Only independently experimentally validated proteins from proximity biotinylation experiments can be considered part of the K13 compartment (or any other cellular site or complex). Without validation of the location, the identified proteins can only be considered candidates. This is highlighted in this manuscript by the finding that several proteins of the list did not localize at the K13 compartment.

      43) Line 570-572: has anyone ever tested whether CytoD or JAS treatment in rings, is sufficient to mediate ART resistance? Something similar to what was done in PMID 21709259 with protease inhibitors. If not this would be a pretty interesting experiment for the authors to do that could shed more light on the MyoF data. It would take maybe 2 weeks to do and not require the generation of any new lines. This would clarify whether other Myosins other than MyoF are involved in endocytosis, as is suggested by previous publications (PMID: 17944961).

      We now included this experiment. In agreement with a lacking need of MyoF in rings and no effect on RSA survival, there was no increased survival of the parasites in RSA (neither on 3D7 nor on K13 C580Y parasites) after cytD treatment (new part in Figure 1M). We thank the reviewer for pointing out that this experiment might also inform on whether other myosins influence endocytosis in ring stages. We added (line 250): Similarly, also incubation with the actin destabilising agent Cytochalasin D (Casella et al., 1981), had no effect on RSA survival in 3D7 or K13C580Y (Birnbaum et al., 2020) parasites, indicating an actin/myosin independent endocytosis pathway in ring stage parasites (Figure 1M) and speaking against other myosins taking over the MyoF endocytosis function in rings.”

      44) Line 608: inhibitors targeting the metacaspase domain of MCA2 may inadvertently inactivate other essential parts of the protein. They authors should acknowledge this possibility in the text.

      The inhibitors used in the cited studies (Kumari et al., 2018) are validated metacaspase inhibitors, such as Z-FA-FMK (Lopez-Hernandez et al., 2003). Activity against the other parts of PfMCA2 - which apart from the MCA domain shows no homology to other proteins - is therefore unlikely.

      45) Line 624-625: the authors state that MyoF is 'lowly expressed in rings' - indeed this is the case in their MyoF-2xFKBP-GFP-2xFKBP line which the authors established has defects due to the tag, but it appears from their MyoF-3xHA tagged line that it is expressed in rings. The authors should therefore revise their statement, and be careful of making claims based on their defective line and using fluorescence imaging as their only metric. If they do want to make the statement that it is not there in rings, they should also do a western blot, which is much more sensitive since it amplifies the signal compared to an image of one parasite.

      This comment is related to major point #24. We also would like to stress that while the MyoF-GFP line already shows a phenotype, the impression of defectiveness based on its location is due to a mix up (see major point #23).

      We now provide a comprehensive time course of the MyoF-GFP signal (Figure 1C, S2A) showing that there is no detectable MyoF-GFP signal until the transition from ring to trophozoite stage. As this is all under the endogenous promoter, we do not think the partial functional inactivation of the tagging is the reason for the absence of the signal. If anything, we would have expected adding a stably folded structure such as GFP to increase the stability of the protein. The main reason for the discrepancy of MyoF signal in rings between the GFP-tagged line (of note there is also no detectable MyoF-GFP signal in MyoF-2xFKBP-GFP ring stage parasites (Figure S2B)) and the HA-tagged line likely is that IFA is much more sensitive than live GFP detection (similar to the high sensitivity the reviewer mentions in regards to WB). This discrepancy therefore is likely due to the fact that the lowly expressed MyoF only become apparent with the HA-tagged line due to the IFA. We therefore believe that MyoF is 'lowly expressed in rings' is an appropriate description of our results obtained with three different cell lines (MyoF-2xFKBP-GFP-2xFKBP, MyoF-2xFKBP-GFP and MyoF-3xHA). We hope this is sufficiently well reflected in the manuscript where we write ‘a low level of expression of MyoF in ring stage parasites.’ not that it is ‘not there in rings’ (line 174).

      46) Line 635: arguably this is the 3rd variety and not the 2nd (the authors already mentioned 2 types - ones that are involved in HCCU AND ART and those involved in HCCU only). See comment for line 490-493 above.

      See response for major comment #41, we now consistently used "or" instead of "and". See line 490-493 how this was resolved for what previously was line 635.

      47) Line 785: Bloated food vacuole assay/E64 hemoglobin uptake assay method specify that a concentration of 33mM E64protease inhibitor was used. However, in reference 44, cited in the manuscript, a concentration of 33µM E64 was used. Please confirmed if this is just a typo or if 1000x E64 concentration was used which renders the experiment invalid.

      We thank the reviewer for pointing this out, we corrected this typo and will look out for symbol font conversion errors for the resubmission.

      48) Line 788: it is unclear from this section what is considered a bloated food vacuole - is there an area above which the FV is considered bloated? Do the authors do these measurements manually or use an addon in FIJI/ImageJ? What is the cutoff for if a FV is bloated? Please clarify. Additionally, for the representative images + rapalog for Figures 2H and 4H, it would be useful to see where the authors delineate the FV (add a white circle showing what is actually measured).

      The bloated FV assay is well established (Jonscher et al., 2019; Birnbaum et al., 2020; Sabitzki et al., 2023). Although the bloating of the FV is a human judgment call, it is actually quite obvious: bloating appears as an easily spotted bulging of the FV in DIC. As also minor bloating is scored as 'bloated', it is a very conservative assay. Using an-add on to measure this is not straight forward. It is unclear how this bulging effect of the FV in DIC could be spotted by a software and due to the obviousness to human operators, potentially lengthy and complicated efforts to design appropriate machine learning options were not undertaken. The situation faced by the scorer of the assay is evident from Figure S4F-G which contains close to 50 "on rapalog" cells and close to 50 control cells, giving representative cells from all replicas of bloated FV assays with KIC12. Please note that these images shows the most complicated situation as far as bloated assays go, because the phenotype is not 100% (see Figure 3F) compared to e.g. KIC7 inactivation which leads to lack of bloating in almost all cells (see (Birnbaum et al., 2020) Figure 3E) but nevertheless the difference is still obvious. We are aware that in such situations (less than absolute inhibition) this assay scoring of "yes" or "no" is a surrogate for the actual level of inhibition and may be more subjective. This is why in this case we also did the FV size measurements (which are less dependent on human judgment) to further support this and give a better quantifiable measure. Of note, the bloated food vacuole judgments are done "blinded", i.e. the examiner does not know which sample they are looking at.

      In response to this reviewer's point we now also added the FV size refinement of the assay for MyoF inactivation which is one of the cases where inhibition of bloating is not in 100% of the cells (see major comment #27). Please also note here the advantage of the rapidly acting knock sideways technique for these assays which shows the sum of effect 8 h after initiating inactivation and for which we carefully control size of the cells which shows that there is no significant growth reduction over the assay time, excluding secondary effects due to a generally reduced viability. Compared to slower acting systems suggested to have been used instead (see introductory part and significance of this review), the rapid speed of knock sideways reduces the risk of potential pleiotropic or compensatory effects due to the time needed for proteins to be depleted if the gene or mRNA is targeted instead.

      The suggestion to include a ‘white circle’ (raised also as minor comment#27) is useful as an aid to see the food vacuole. However, in contrast to the Figures in (Birnbaum et al., 2020) (where we did add such a circle), we here included the DHE staining images in the figure, labelling the parasite cytosol which readily shows the FV (the FV corresponds to the region where there is no DHE staining). As this shows the position of the FV we would prefer to not obscure the DIC images with additional features to permit the reader to see the difference between bloated or non-bloated food vacuoles and keeping the image as natural as possible.

      49) Line 863-864: this sentence seems to be out of place.

      We thank the reviewer for pointing this out, the details of nucleus staining were moved to the correct part.

      50) Line 875: the authors state that there is a light blue wedge, when the circle consists of grey and black wedges. Please revise this.

      This has been corrected.

      51) Line 1059-1061: it is unclear whether the individual growth curves are different clones or whether they are just the same experiment repeated? If it is the latter, then why are they not combined, as is traditionally done?

      These are the individual replicates of the growth curves shown in Figure 1G of the same cell lines done on a different occasion. We always try to show as much of the primary data as possible and believe that showing individual data points from the different experiments is better than only the combined values which obscure the actual course of each experiment.

      52) Line 919-924: the authors mention a blue and red line, but there is only a black line in figure 3D. Moreover, the experiment of using the LYN mislocaliser was only done for KIC12 according to the manuscript. Additionally, the y axis of the figure states relative growth day 4[%] compared to rapalog, but then on the x axis there are several days. In the text it says there is no growth defect until the second cycle, but from this graph it appears the growth defect is evident as early as 1 day post rapalog treatment. Can the authors please clarify and correct the issues pointed out.

      We thank the reviewer for pointing this out, this was due to a copy & paste error in the figure legend that was now amended. We also fixed the incorrect axis label. For the last part (growth defect) please see detailed answer to Major comment#31 raising the same concern for KIC11 (in synchronous parasites the defect only takes effect once the cells reached the relevant stage whereas in asynchronous cultures there are always cells in the relevant stage that due to the rapid effect of the knock sideways already have a growth phenotype).

      53) Figure 1 panel B & C: the label of the figure where the signal from MCA2Y1344STOP-GFP is shown with the DAPI signal overlayed is deceptive since it suggests that this is the signal of full length MCA2. Please change the label of this panel from MAC2/DAPI to MCA2Y1344STOP/DAPI. The same is true for Panel C for the image labeled MCA2/K13 - please change this to MCA2Y1344STOP/K13.

      Done as requested.

      54) Figure 2B: what stages are these parasites? Please state this in the figure. Based on the MyoF pattern, it looks like rings in the upper panel and trophs in the bottom pannel. Why were schizonts not shown?

      Both are trophozoites (early trophozoite in top panel and late trophozoite in bottom panel). This is now labelled in what now is figure 1B. As stated above, schizont stages are less relevant for the topic of this manuscript and in order to prevent the manuscript from getting more disjointed and keeping it more focussed on the main topic, we decided to not include a schizont in the manuscript. Nevertheless, we included an example image below.

      [Figure MyoF_p40px schizont]

      55) Figure 2D&F: it is not very meaningful when growth assays are shown as a final bar after 4 days of growth. It is much more useful and informative to see a growth curve instead (as is shown in the supplementary), since it shows if the defect is apparent in the first growth cycle or later. With the way the data is currently shown, this is not apparent. I would advise the authors to switch the graph in 2F out of a combined graph of all the biological replicates growth curves for S3D - showing error bars.

      While we in principle fully agree with the reviewer in showing the course of the full experiment (which is available in Figure S2E), the key here is to show the overall difference. Hence, we would like to keep this comparison of the overall effect on growth in what now is Figure 1E and G. It is part of the argument to the doubts this reviewer raises to the function of MyoF (mainly in the overall assessment and the significance statement) to show that the phenotype is actually very consistent (partial inactivation through tagging or further inactivation using knock sideways increases endocytosis phenotypes, correlating with parasite viability).

      Please also note, that the growth curves upon knock sideways shown in Figure 1G, S2E are performed with asynchronous parasite cultures, which doesn’t allow us to draw direct conclusions about growth cycle effects.

      Nevertheless, we now also included the suggested combined data representation in Figure S2E.

      56) Figure 3: why were the calculation of FV area, parasite area and FV/parasite area only done for KIC12 and not done for MyoF? It would be interesting to see if any of these values are different for MyoF - whether the parasites are smaller in area and therefore FV smaller. Please present them Figure 2. Images should be already available and would not require further experiments to be done, only the analysis.

      This now has been done (confirming our results) and is included as Figure 1J-K, S2J. This point was also raised as major comment #37, please also see detailed answer there.

      57) Figure 3B: why is there no spatial association assessment for KIC11 and K13 as was done for the MCA2 and MyoF? The authors should show a pie chart showing the degree of association here as was done for the other proteins.

      This is now included in Figure 2C.

      58) Figure 3D: The y axis of the figure states relative growth day 4[%] compared to rapalog, but then on the x axis the experiment takes place over several days. Is this a typo in the y axis? Additionally, the authors state in line 287-290 that the growth defect upon addition of rapalog is only seen in the second cycle, but from this graph it appears the growth defect is already evident 1 day post rapalog addition. The figure legend also does not make sense for this figure since it mentions a blue and a red line, when there is only a black line present. The legend also mentions the LYN mislocaliser which was used for KIC12 not KIC 11 (see above).

      We apologise for the inadequate legend and colour issues, this was amended. This point was also raised in major comment #31 and #52, please find detailed answer there.

      59) Figure 3E: the colour for Control and Rapalog 4 hpi are very similar and very hard to discern. Please choose an alternative colour or add a pattern to one of the samples. The y axis is also missing a label. Is this supposed to be parasitemia (%)?

      We thank the reviewer for pointing this out, the missing label is now included and the colour has been adapted to make them better distinguishable.

      60) Figure 4A: the ring shown in this figure does not appear to be a ring (it is far too large and appears to have multiple nuclei?). Do the authors have any other representative images to show instead?

      This is in fact a ring, but we realize that we accidentally included an incorrect size bar in the ring image of Figure 4A (now Figure 3A) (size bar for 63x objective instead of the correct one for the 100x objective), we apologise for this oversight. We don’t think this parasite has multiple nuclei, instead the Hoechst signal shows the often elongated nucleus seen in rings that can appear as two foci in Giemsa stained smears which leads to the typical diagnostic feature of P. falciparum rings in diagnostics. In order to exclude any doubts about the nuclear localization of KIC12 in rings, we here attached a panel with more examples of KIC12-2xFKBP-GFP-2xFKBP ring stage parasites.

      [Figure KIC12]

      61) Figure 4B: why is there no spatial association assessment for KIC12 and K13 as was done for the MCA2 and MyoF? The authors should show a pie chart showing the degree of association here as was done for the other proteins. This should be done for the different life cycle stages considering the changing localisation of KIC12.

      This is now provided in Figure S4A. As suggested by the reviewer, we independently quantified the association for ring stage, early trophozoite and late trophozoites stage. As there is no KI12 signal in schizonts, we did not include a quantification for this stage.

      62) Figures 4C&E: it is extremely important to show the DNA stain in both these samples considering that a portion of KIC12 is in the nucleus! Please add the DAPI signal for these figures (as for all other figures!).

      Please see major comment #64 for a detailed answer why we did not include DNA staining in the imaging used to assess mislocalization upon knock-sideways.

      63) Figure 4E: this figure should be presented before 4D (considering the line being presented in 4E is used in an experiment in 4D). The authors should switch the order of these two.

      We see the point the reviewer is raising here, Figure 4D (now Figure 3D) also contains the data with the Lyn mislocaliser while we first talk about the NLS mislocaliser. This permits a better comparison between the two mislocaliser lines. However, first explaining the Lyn-mislocaliser and then going back to the NLS would make it rather complicated for the reader to follow the storyline and therefore we would like to keep the order as it is. We realise that this means the reader has to go back one figure part for seeing the Lyn growth data, but believe this is worth the benefit that the data is there compared to the NLS result.

      64) It is unclear why in many of the fluorescence images the authors do not show the DAPI signal - particularly when colocalising with K13 and when doing the knock sideways experiments. Please add these images to the figures - I would assume they have already been taken, so would simply involved adding the images to the panel.

      We did not include DNA staining (DAPI or Hoechst) for any of the images used to assess the efficacy of mislocalization, as we would prefer to keep the parasites as representative of a viable parasites in culture as possible. Hence they were imaged without DNA stain (these stains are toxic). We would like to point out that a DNA stain is not necessary, as the mislocaliser already marks the nucleus (in the case of the NLS mislocaliser), actually even somewhat more accurately, as it fills the entire nuclear space rather than only the DNA which is marked by DAPI or Hoechst.

      For LYN this admittedly is not the case, there the mislocaliser marks the plasma membrane. However, we think the proper control for efficient mislocalisation is the comparison between the GFP-tagged protein of interest and the mCherry mislocaliser to show mislocalisation, as previously done in our lab (e.g. (Birnbaum et al., 2017; Jonscher et al., 2019; Birnbaum et al., 2020)).

      Due to their toxicity, we also avoided nuclear staining in some other parts of the manuscript when we were of the opinion that a nucleus signal was not necessary.

      65) Throughout the manuscript, there is no western blot confirming the correct size of their modified proteins. This should be provided.

      We did perform Western blot analysis for both MCA2 cell lines. MCA2 is the only gene-product for which we generated a disruption for this work, and together with the severe truncation from previous work, we provided a Western blot-based confirmation of the correct size.

      The MCA2 disruptions are at least partially dispensable for in vitro parasite growth, hence if degradation occurred, this might not have been noticed. In that case we considered it relevant to show that the truncations were of the expected size. The other proteins in the main figures are essential for growth. Hence, if the tagging approach would lead to unexpected changes in protein integrity (which we assume is what was intended by this concern to be assessed with a Western blot), the parasites expressing the tagged MyoF, KIC11 and KIC12 would - due to their importance for asexual blood stage development - not have been obtained. Hence, we can assume the integrity of the tagged protein is very unlikely to have been affected in a functionally relevant way.

      66) None of the figures are appropriate for individuals with colour blindness, limiting their accessibility to the paper. Please change the colour schemes for all fluorescent images using magenta/green or an alternative colour combination appropriate for colourblind individuals.

      We thank the reviewer for this comment. This has now been amended, individual channels of fluorescence microscopy images are now shown in greyscale, while the overlay was changed to green/magenta.

      Minor Comments

      1) line 29: remove 'are'.

      Done.

      2) Line 29: the text says "HCCU is critical for parasite survival but is poorly understood, with the K13 compartment proteins are among the few proteins so far functionally linked to this process." The sentence should be: 'HCCU is critical for parasite survival but is poorly understood, with the K13 compartment proteins among the few proteins so far functionally linked to this process."

      Done.

      3) line 44: remove 'the'

      Done.

      4) Line 48: consider mentioning here that malaria is caused by the parasite Plasmodium - otherwise the first mention of parasite in line 52 is confusing for the non-specialist reader.

      Done.

      5) Line 49: estimated malaria-related death and case numbers are from the 2021 WHO World malaria report. You cite the 2020 WHO World malaria report.

      We now cite the newest WHO report.

      6) Line 53: please insert the word 'have' between now and also.

      Done.

      7) Line 54: please change 'was linked' to is linked

      Done

      8) Line 72: I would specify that free heme is toxic to the parasite. Especially as you mention that hemozoin is nontoxic.

      Sentence would be "where digestion results in the generation of free heme, toxic to the parasite, which is further converted into nontoxic hemozoin"

      Done.

      9) Line 90: authors should either say "in previous works" or "in a previous work"

      The text has been altered to say: “ in a previous work”.

      10) Line 91: "We designated these proteins as K13 interaction candidates (KICs)"

      Done.

      11) Line 95: please change 'rate' to number

      Done.

      12) Line 109: Please include a coma before (ii).

      Done.

      13) Line 112: as shown by Rudlaff et al in the paper you are citing, PPP8 is actually associated with the basal complex. You can say that "(ii) were either linked or had been shown to localise to the inner membrane complex (IMC) or the basal complex (PF3D7...).

      Done.

      14) Line 114: Protein PF3D7_1141300 is called APR1 in the manuscript but ARP1 in Supplementary Table 1. Please correct.

      Done.

      15) Line 131: please define SNP - this is the first use of the acronym.

      Done.

      16) Line 133-134: South-East Asia instead of "South Asia"

      Done.

      17) Line 135: please explain what TGD is - it is referred to over and over again in the manuscript without ever being explained.

      We apologise for this oversight. We now explain what is meant with TGD at the suggested point of the manuscript.

      18) Line 145: change 'Western blot' to western blot - only Southern blot is capitalised since it is named after an individual, while the other techniques are not.

      To the best of our knowledge this issue has not been resolved, some Journals capitalize the “W” (e.g. Science), while others don’t (e.g. Nature). We would prefer to continue to capitalize the “W”, as this is consistent with the original publication from (Burnette, 1981), but if there are strong objections, we would be happy to change this____.

      19) Line 152: add "the" between 'and spatial'

      Done.

      20) Line 158: please define SLI as selected linked integration, since it is the first use of the acronym.

      Done.

      21) Line 178: introduce a coma after protein. Sentence should be "Proliferation assays with the MCAY1344STOP-GFPendo parasites which express a larger portion of this protein, yet still lacking the MCA domain (Figure 1), indicated no growth ...

      Done.

      22) Line 195: the authors could mention that MyoF was previously called MyoC in the Birnbaum 2020 paper. I wanted to check back in the Birnbaum 2020 paper and could not find MyoF

      Good point, this was done.

      23) Line 200: "Expression and localisation of the fusion protein was analysed by fluorescent microscopy". Why expression was not analysed also by western Blot same as for MCA2?

      Please see major comment #64 for a detailed answer.

      24) Line 204: I could not find any mention of MyoF (Pf3D7_1329100) in reference 65. Please remove reference 65 if not correct. Also reference 66 looks at Plasmodium chabaudii transcriptomes so I would specify that "This expression pattern is in agreement with the transcriptional profile of its Plasmodium chabaudii orthologue"

      Reference 65 (Wichers et al., 2019) provides an RNAseq transcriptome dataset for asexual blood stage development of 3D7 (originating from the same source as the 3D7 used in this study). While Ref 66 (Subudhi et al., 2020) indeed contain transcriptomic data from P. chabaudi, the authors also provide a nice 2h window RNAseq transcriptome dataset for asexual blood stage development of Plasmodium falciparum. Both datasets are therefore suitable as reference for the statement about myoF transcription pattern. Both datasets are also easily accessible and show the pattern in a graph in PlasmoDB.

      25) Line 208: Please indicate a reference for P40 being a marker of the food vacuole

      Done.

      26) Line 220-224: The authors should consider changing to " Taken together these results show that MyoF is in foci that are mainly close to K13 and, at times, overlapping, indicating that MyoF is found in a regular close spatial association with the K13 compartment."

      The suggested wording introduces "mainly" for "frequently" and likely was in part motivated by the discrepancy in location between cell lines that we hope we now could clarify to be only minor (see major point #23). We therefore think the original wording appropriately summarises the findings (line 178): “*Taken together these results show that MyoF is in foci that are frequently close or overlapping with K13, indicating that MyoF is found in a regular close spatial association with the K13 compartment and at times overlaps with that compartment.” *

      27) Line 255: In Figure 2H, and subsequent figures showing bloated FV assay, I would delineate the food vacuole with dashed line as in Birnbaum et al. 2020 to help the reader understanding where the food vacuole is.

      In contrast to the Figures in Birnbaum et al. 2020, we here included the DHE staining (parasite cytosol) in images of bloated FV assays which visualizes the FV. We therefore decided to avoid any further marking, to keep the image as unprocessed as possible (see also major point 48).

      28) Line 265-266: Here the title says that KIC11 is a K13 compartment associated protein, but the title of Figure 3 says KIC11 is a K13 compartment protein. I noticed that you make the difference between K13 compartment protein et K13 compartment associated protein for MyoF for example which is not clearly associated with the K13 compartment. Which one is it for KIC11?

      The interpretation of the reviewer is correct, we indeed graded this subconsciously based on level of overlap. Based on the newly added quantification shown in Figure 2C, we describe KIC11 now as K13 compartment protein.

      29) Line 309-310: indicate a reference for your statement "which is in contrast to previously characterised essential K13 compartment proteins".

      Done, we now included Birnbaum et al. 2020 as reference for this.

      30) Line 377: Figure 4I, please correct 1st panel Y axis legend

      Done.

      31) Line 404: replace "dispensability" with dispensable

      Done.

      32) Line 416: can the authors provide any speculation as to why they observed these proteins as hits in the BioID experiments?

      As some of these proteins were less well or less consistently enriched, they could be background of the experiment. Alternatively, some could be proteins that only transiently interact with the K13 compartment.

      33) Line 451: Where the "97% of proteins containing these domains also contain an Adaptin_N domain and function in vesicle adaptor complexes as subunit a" come from. Do you have a reference?

      The statement now includes references and reads (with small changes to original submission): "More than 97% of proteins containing these domains also contain an Adaptin_N (IPR002553) domain (Blum et al., 2021) and in this combination typically function in vesicle adaptor complexes as subunit α (Hirst and Robinson, 1998; Traub et al., 1999) (Figure 5D) but no such domain was detectable in KIC5."

      34) Line 465-467: the same could be said for KIC4 as it also has a VHS domain.

      The critical issue is the combination of domains and their position within the protein. While KIC4 also contains a VHS domain, the VHS domain in KIC4 is N-terminal, not in a central position and it is also not the first structural domain to be identified in KIC4. The similarity to adaptin domains was already described ((Birnbaum et al., 2020) and annotated in PlasmoDB) and these domains are also involved in vesicle formation and trafficking. These aspects of the statement can therefore not be extended to KIC4. With regards to VHS domains being involved in vesicle trafficking, this is already stated in line 538: «KIC4 contained an N-terminal VHS domain (IPR002014), followed by a GAT domain (IPR004152) and an Ig-like clathrin adaptor α/β/γ adaptin appendage domain (IPR008152) (Figure 5A-C, Figure S8). This is an arrangement typical for GGAs (Golgi-localised gamma ear-containing Arf-binding proteins) which are vesicle adaptors first found to function at the trans-Golgi (Dell’Angelica et al., 2000; Hirst et al., 2000)

      35) Line 477-479: Can be rephrased to "However, we found this protein as being likely dispensable for intra-erythrocytic parasite development and no colocalisation with K13 could be demonstrated, suggesting a limited role for PF3D7_1365800 in endocytosis. Or something like that. Makes it clearer.

      We rephrased this sentence and it now reads (line 592): However, we found this protein as being likely dispensable for intra-erythrocytic parasite development and no colocalisation with K13 was observed, suggesting PF3D7_1365800 is not needed for endocytosis“.

      36) Line 535: Have AP-2a or AP-2b been shown to be at the K13 compartment?

      AP2m is at the K13 compartment (Birnbaum et al., 2020). Adaptor complexes are heterotetramers and their subunits do not typically function on their own and this is conserved across evolutionarily distant organisms. In agreement that this is also the case in P. falciparum, Henrici et al. (Henrici et al., 2020a) showed that both, AP-2a and AP-2b, were present in an AP2µ Co-IP, indicating that the AP2 complex consist of the ‘classical’ subunits in P. falciparum. Therefore, the presence of all subunits at the K13 compartment is very likely, although this has only been experimentally confirmed for AP2µ. Of note, for Toxoplasma gondii the presence of AP-2a and AP-2b at the micropore has been experimentally confirmed (Wan et al., 2023; Koreny et al., 2023) and interaction suggested by presence in the same IP as DRPC (Heredero-Bermejo et al., 2019).

      37) Line 569: reference 43 is wrong

      We thanks the reviewer for pointing this out – we removed Ref 43.

      38) Line 746: typo "ot" instead of or.

      Changed.

      39) Line 801: method for Domain Identification using AlphaFold specify that RMSDs of under 5Å over more than 60 amino acids are listed in the results. However, there is a typo in Figure 5B for KIC5 where it says "RMSD 4.0 Å over 8 aa". Please correct.

      Done. In addition, we have now applied a more stringent cut-off of 4Å over more than 60 amino acids to ensure a higher reliability of our hits. This decision was based on results from our preprint (Behrens and Spielmann, 2023). Because of this the phosphatase domain in KIC12 is no longer included in this manuscript and accordingly the following sentence has been deleted. In KIC12 we identified a potential purple acid phosphatase (PAP) domain. However, with the high RMSD of 4.9 Å, the domain might also be a divergent similar fold, such as a C2 domain, which targets proteins to membranes.”

      40) Line 856: In Figure 1E, please use the same Y axis legend as in Figure 2D "relative growth at day 4 [%] compared with 3D7"

      Done.

      41) Figure S1: Some PCR gels check for integration are presented as 5', 3' and ori whereas other gels are presented as ori, 5' and 3'. This is confusing.

      We agree that ideally the order of sample loading should be consistent and we apologise for this. The explanation for this is that these gels were run by different people at different times before we were able to better standardize the loading scheme. However, in the interest of not unnecessarily using resources for something that has a similar meaning, we would prefer not to repeat these PCRs and re-run them only for consistency reasons (as the conclusion is not affected by the different loading schemes).

      42) Figure S1: Why was the expression of only MCA2 was verified by Western blot? What about the other proteins?

      See response to major comment 56.

      43) Line 493: Considering KIC11 was not involved in HCCU or ART resistance it might be worth mentioning in this section that it is of note that there are no domains detected that would be involved in endocytosis.

      We agree that this is the case, however it is also the case for all other proteins that either are not involved in endocytosis and/or lowered susceptibility to ART. We therefore now added a summary statement addressing this in line 602: In contrast, the K13 compartment proteins where no role in ART resistance (based on RSA) or endocytosis was detected, KIC1, KIC2, KIC6, KIC8, KIC9 and KIC11, do not contain such domains (Figure 5E).” We did not add this at the suggested part of the manuscript as at that point the domain search results are not yet introduced and doing this each time for all the individual proteins would disconnect the flow of the manuscript.

      44) Line 503-506: is it wise to generate more drugs that target a pathway that is already highly susceptible to mutations? The authors should add a statement explaining how this might be avoided.

      The only protein for which mutations do not have a large fitness cost is K13 (see also our preprint on fitness cost of ubp1 mutation (Behrens et al., 2023) and even with K13 the level of resistance seems to be limited by amino acid deprivation when endocytosis is reduced (Mesén-Ramírez et al., 2021). We therefore do not think that this pathway is particularly prone for mutations. Further, the number of commercial drugs targeting the "endproduct" of endocytosis (hemoglobin digestion and detoxification of heme) highlight it as the most prominent vulnerability for drug-based intervention if we go by number of commercially available drugs acting on things associated with a single process.

      45) Throughout, scale bars are stated in the figure legends at the end of the legend. This is a slightly confusing format. The authors should consider stating the scale bar for each sub-legend where a fluorescence image is taken.

      Done.

      ** Referees cross-commenting**

      After reading reviewer 2 and 3's comments, I think there are significant overlaps in the key points raised in terms of questions about fusion proteins and their potential partial mis-localisation, better descripton of results and target selection. Overall I think we agree that the work has potential, but in its current form does not represent a major advance. It would be immensely helpful if the manuscript would be carefully edited for a better flow and linear description of results.

      We now rearranged the manuscript for better flow but would like to highlight that the many requests for smaller experimental issues (and "better description of results") worked somewhat in the opposite way of a more linear description. We hope the rearranged version acceptably balances these two issues. The issues raised in regards to target selection and potential partial mis-localisation are addressed in our responses mainly to this reviewer. Please also see comments on systems used at the end of the rebuttal.

      Reviewer #1 (Significance (Required)):

      The authors set out to test whether other proteins that are in the vicinity of K13 are involved in mediating ART resistance and endocytosis. This is an interesting question. However, other than MCA2 which was already known to be involved in mediating ART resistance (and was not tested for its involvement in endocytosis), none of their candidate proteins seem to be involved in mediating both these functions. The authors show that the other proteins tested appear important for parasite growth, with KIC12 and MyoF involved in mediating endocytosis. While these findings are novel, the KS approach used by the authors casts some doubt over the findings, and would mean that these findings would have to be re-tested with a more reliable approach, such as the GlmS system or generating a conditional knockout using the DiCre system. Despite not advancing our understanding of ART resistance, or identifying further players involved in this process, this manuscripts provides two candidates that are involved in mediating endocytosis and a further candidate that appears to be important for parasite growth. Further work on these proteins will be required to understand their exact roles. As stated above, there is currently limited interest for these results (limited to researchers working on endocytosis in apicomplexan parasites and possibly the wider endocytosis field from an evolutionary perspective), however with further work, this could increase the impact and interest of this work substantially.

      The authors do not describe any novel methods/approaches within this work.

      In the significance statement the reviewer indicates that other systems would have been more reliable for the work here. This is addressed in our response above and in a detailed considerations on the properties of conditional inactivation systems at the end of the rebuttal. The systems used in this work were not only chosen because they permit rapid targeting of many different proteins, but because they have merits that are beneficial for our assays. In fact many of the functional assays in this manuscript are difficult or impossible to carry with the suggested conditional inactivation systems (please note that we have extensive experience with the systems considered preferable:

      • DiCre (Birnbaum et al., 2017; Mesén-Ramírez et al., 2019; Mesén-Ramírez et al., 2021; Wichers et al., 2022; Kimmel et al., 2023)

      • glmS (Wichers et al., 2021c; Wichers et al., 2021a; Wichers et al., 2022; Wichers-Misterek et al., 2023)).

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In a previous publication the Spielmann lab identified the molecular mechanism of ART resistance in P. falciparum by connecting reduced levels of the protein K13 to decreased endocytosis (uptake of hemoglobin from the RBC cytosol), which results in reduced ART susceptibility. Using quantitative BioID the authors further identified proteins belonging to a K13 compartment, highlighting an unusual endocytosis mechanism.

      In the present manuscript the authors follow up on this work and closely examine ten more proteins of the K13/Eps15-related "proxiome". They successfully link MCA2 to ART resistance in vitro, while the proteins MyoF and KIC12 are involved in endocytosis but do not confer in vitro ART resistance when impaired. They further characterize one candidate (KIC11) that partially colocalizes with K13 in trophozoites but to a lesser degree in schizonts. Growth assays suggest an important function for KIC11 in late stages of the intraerythrocytic developmental cycle. Five analyzed proteins however do not colocalize with the K13 compartment, while a sixth was refractory to endogenous tagging.

      Using AlphaFold predictions of the KIC protein structures the author identify domains in most constituents of the K13 compartment, highlighting vesicle trafficking-related features that were not identified on primary sequence level before.

      The combination of functional data together with structure predictions leads them to propose a refinement of the K13 compartment as being divided into proteins participating in endocytosis and proteins that have an unknown function.

      We thank the reviewer for the assessment of the manuscript and the constructive comments.

      Major comments:

      1) -Table 1 is missing

      We apologise for this mistake; Table 1 is now included.

      2) -Lines 117-123: Given the total list of uncharacterized candidates encompasses 13 proteins, can the author gives the reason why only the top 10 and not all 13 were characterized in this study?

      A similar point has been raised by Reviewer 1 in major comment #12, please see our response there for an explanation why we chose which targets.

      3) -Line 174: 20% of observed MCA2 foci show no overlap with K13 and 21% only partially overlap, can the author confirm that the observed MCA2 foci in schizonts are the ones that co-localize with K13. (Addition of a schizont stage image in Fig 1C would be sufficient).

      We now extended Figure 4C with images of MCA2-Y1344STOP-GFP+mCherryK13 parasites covering the schizont and merozoite stage, showing that the majority of the MCA2 foci in schizonts are also mCherry-K13 positive.

      4) -The localization and observed phenotype of KIC11 is interesting but unfortunately the authors do not explore it further. Does KIC11 localize with markers of e.g. the secretory organelles (micronemes or rhoptries) in schizonts and could therefore be involved in RBC invasion?

      While we intended to focus mainly on the endocytosis aspect of these proteins, we see the reviewer's point and now generated new cell lines enabling assessment of spatial association of KIC11 with markers for rhoptry (ARO), micronemes (AMA1), and inner membrane complex (IMC1c). This revealed that the KIC11-GFP signal in schizonts does not overlap with apical organelle markers and the signal does not resemble a typical apical localization. In addition, we assessed all three organelle markers after inactivating KIC11 by knock sideways which showed that KIC11 inactivation has no apparent effect on the appearance of these markers, suggesting no major alterations in schizont morphology in respect to apical markers. These results are now presented as Figure S3A and in line 304 of the results.

      5) Can the author distinguish if KIC11 is involved in RBC invasion or in establishment of the ring-stage parasite?

      In order to look into this, we performed egress/invasion assays, quantifying schizont and ring stage parasites in tightly synchronized parasites at two different time points (pre-egress: 38-42 hpi & post-egress: 46-50 hpi). This revealed a significant decrease in newly formed ring stage parasite per ruptured schizont in parasites with inactivated KIC11, while the egress efficacy remained unaffected. This indicated an invasion or very early ring stage development defect (new Figure 2H, Figure S3G). To further determine at which point exactly the phenotype occurs (ie during invasion or early after invasion) would require extensive experimentation that goes beyond the scope of this study (e.g. invasion assays using video microscopy with a representative number of parasites or sophisticated flow based quantification assays). We hope by excluding egress and gross changes of apical organelles as well as no indication for similar number of early rings (indicating it is invasion or a very early ring-establishment phenotype) will sufficiently narrow down the phenotype for labs interested in invasion to more definitely answer this question.

      Minor comments:

      1) Table S1: Please add the criterion for the order of proteins (abundance in "proxiome"?) in the table as a separate column. I would also suggest adding a new column that highlights the 10 proteins investigated in this study as I found the color-coding slightly confusing.

      Done as suggested: we now include the “average log2 Ratio normalized Kelch13” values from the four DiQ-BioID experiments performed with K13 in (Birnbaum et al., 2020), as well as the suggested column to highlight the investigated proteins. Please also see reviewer 1 major point # 12 for additional information on the selection criteria and how this was added to the manuscript.

      2) -154-155: There is a discrepancy between the text and Fig1C regarding the % of partial overlapping and non-overlapping foci.

      We thank the reviewer for pointing this out, this was corrected.

      3) -The y-axis label is missing in Fig 3E

      Done.

      4) -Fig 4I left graph, the superscript 2 is missing in μm2

      We thank the reviewer for pointing this out, this is now changed.

      5) -Did the author colocalize KIC11 in schizonts with other proteins found in the K13 compartment group of proteins not involved in endocytosis/ART resistance? This may help to further subgroup these proteins.

      This is an interesting point but would actually be technically challenging to do. For this we would need to generate a KIC11endo parasite line for each of these KICs and then do co-localisation in schizonts. However, the outcome of this likely would not be very clear. The reason for this is as follows. There are foci of KIC11 that do overlap with K13 in schizonts. One can expect that these foci show KIC11 at the K13 compartment and that the other KICs would overlap with KIC11 in these K13 foci in schizonts. Hence, we would also need to see K13 to find the non-K13 compartment KIC11 foci and see if these contained the KIC of interest. This is technically challenging because it would mean we would need a third fluorescent protein which is not that trivial to do. Due to the difficulty to do this and the large amount of work involved and the already considerable amount of data in this manuscript, we believe this will be better suited for a different study.

      6) -As a general comment: to make the beautiful IFAs more accessible to a broader readership, I would encourage the authors to switch the color-coding to green/magenta/blue or an equivalent color system or add grayscale images.

      This was done as suggested, all fluorescence images are now provided as greyscale images and the overlays are shown in magenta/green.

      Reviewer #2 (Significance (Required)):

      Characterizing the molecular components involved in Plasmodium endocytosis will not only reveal interesting biology in these highly adapted parasites, but will more importantly lead to a better understanding and potentially open new avenues for intervention of ART resistance. The here presented manuscript is a carefully executed follow-up on previous work done in Dr. Spielmann's lab focusing on the K13 compartment. The authors use established assays to characterize novel components and reveal three new players in endocytosis with one mediating ART resistance in vitro. The proposition that parts of the K13 compartment have a function other than endocytosis is interesting, but will have to await more data from future studies. Taken together this manuscript adds significantly to our understanding of endocytosis in P. falciparum.

      This work is of interest for cell and molecular biologists working on Apicomplexa, but especially for the Plasmodium community.

      We thank the reviewer for this positive assessment.

      I am a cell and molecular biologist working on Toxoplasma gondii

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary: The authors characterized 4 proteins from P. falciparum via cellular (co-)localization, endocytosis, parasite growth, and artemisinin resistance assays. These proteins have been identified as candidates for Kelch13 compartment and a possible role in endocytosis in their previously work with quantitative BioID for potential proximity to K13 and Eps15 (Birnbaum et al. 2020). In the current work, additional 6 proteins were not confirmed as being associated to the K13 compartment. This experimental work was complemented by an in-silico analysis of protein domains based on AlphaFold algorithm. For this protein structure evaluation all proteins were chosen, which were experimentally confirmed to be linked to the K13 compartment in the current publication and previous work. With the work 3 novel proteins linked to artemisinin resistance or endocytosis could be functionally described (KIC12, MCA2, and MyoF) and a number of hypotheses were generated.

      We thank the reviewer for the assessment of the manuscript and the constructive comments.

      Major comments:

      The quality of the presented work is solid, the experimental design is adequate, and methods are presented clearly. The publication contains a lot of results both presented in text and in the figures and it is not always straight forward for the reader to follow the descriptions due to many details presented and a lack of context for some of these experiments.

      We thank the reviewer for this overall positive assessment.

      We now reordered the results section in an attempt to increase the flow of the manuscript. We also made changes to improve the context for the results. Given the further (very valid) requests for data on schizonts and invasion, there was an increased danger for a less linear manuscript that we hope to have acceptably managed with the re-arrange.

      Specific suggestions for consideration by the authors to improve the manuscript. Abstract: 1) R 31: Mention how the 4 proteins were identified as candidates, you need to refer to previous work to clarify this

      To clarify this the sentence was changed to (line 31): "Here we further defined the composition of the K13 compartment by analysing more hits from a previous BioID, showing that MyoF and MCA2 as well as Kelch13 interaction candidate (KIC) 11 and 12 are found at this site."

      2) R38: "Second group of proteins" is confusing - different from the 4 mentioned above? Significance to endocytosis unclear. Please unify terminology in the manuscript, see also comment below on proxiome.

      We changed the wording to clarify the group issue in the abstract as follows line 34: "Functional analyses, tests for ART susceptibility as well as comparisons of structural similarities using AlphaFold2 predictions of these and previously identified proteins showed that canonical vesicle trafficking and endocytosis domains were frequent in proteins involved in resistance or endocytosis (or both), comprising one group of K13 compartment proteins, While this strengthened the link of the K13 compartment to endocytosis, many proteins of this group showed unusual domain combinations and large parasite-specific regions, indicating a high level of taxon-specific adaptation of this process. Another group of K13 compartment proteins did not influence endocytosis or ART susceptibility and lacked detectable vesicle trafficking domains. We here identified the first protein of this group that is important for asexual blood stage development and showed that it likely is involved in invasion.”

      3) Abstract can only be understood after reading the full publication

      We attempted to amend this by expanding the abstract, particularly the changes highlighted in the previous two points.

      Results: 4) Table 1 is missing from the submitted materials

      We apologise for this mistake. Table 1 is now included.

      5) Consider to shorten and stratify the result section to focus on the significant data

      We rearranged the results in an attempt to streamline this section and are now starting with MyoF in the revised manuscript. However, as highlighted by the requests from reviewer 1, many details need to be available to support our conclusions. For instance the fact that GFP-tagging partially inactivated MyoF asked for further data to support our conclusion (HA-tagged version, showing that the location of the GFP-tagged version was consistent with the HA-tagged version, showing to what extent the different constructs affected growth and correlated with number of vesicles and bloating, see new figure 1M) or that KIC12 has two locations. Overall, we are therefore hesitant to remove data or description from the result part.

      6) Unclear how the localization and functionalization assays might be impaired by the fusion proteins Significance of ART resistance assay is not clear, in presence of strong growth effects due to inactivation or truncation of genes/proteins

      As indicated also in the example given in the previous point (this reviewer #5), the use of different cell lines (GFP-tagged live cells and small epitope tag in IFA) for targets with an indication for an effect of the tagging confirm that the location we assigned is reasonable. In the case of MyoF, the HA-tagged line, the partial inactivation due to GFP and the further inactivation in the GFP-tagged line by knock sideways show plausible increase of phenotypes (vesicle accumulation and bloated FV assays). Thereby the GFP-tagged line can be seen as a partial inactivation line that further supports our conclusions and overall this paints a consistent picture of the function of this protein in endocytosis (see new Figure 1M better illustrating this). Please note that the difference in location shown by this line compared to the HA-tagged proteins is only small (see also reviewer 1 major point 23ff). See also general discussion on tags at the end of this rebuttal.

      Significance of ART resistance assay: The ‘ART resistance assay’ is done comparing +/- ART (DHA) in identical parasites (originating from the same culture and the same condition). Hence, any growth effects are cancelled out and effects in reducing ART susceptibility would - if at all - be underestimated (see more detailed response to point 28, reviewer 1 and controls in Birnbaum et al., 2020 where we tested an unrelated essential protein, unrelated chemical insult and rapalog on 3D7 and did not detect any effect on RSA survival).

      MCA 7) Stratify results, order by significance of findings, it appears to be described in chronological order, improve readability/flow, eg ART resistance if mentioned in r138, but only reported in r183ff

      We attempted to stratify, but then the reason for generating the partial MCA2 disruption parasite line becomes very arbitrary and would leave the reader wondering why we at all truncated the protein at two thirds of the protein. Hence, we do not see a way around this chronological reporting. However, this part is now not at the start of the experimental results section anymore, possibly making it overall a bit more palatable.

      MyoF 8) R195 to 197 - consider moving to discussion as it is distracting here

      This was shortened and additional information (asked for by reviewer 1, major point 22) to clarify that MyoF was previously called MyoC, was added (line 147): “The presence of MyosinF (MyoF; PF3D7_1329100 previously also MyoC), in the K13 proxiome could indicate an involvement of actin/myosin in endocytosis in malaria parasites. "

      9) Term proxiome is introduced above, but not used in result section - suggest to unify language, eg r195 uses "K13 compartment DiQ-BioIDs" instead, which is not very convenient for the reader

      We carefully reviewed this and made this more consistent.

      10) What is the enrichment factor? Please provide for this and the following proteins, eg in Table 1

      The enrichment factor is log2 enrichment over control and this is now provided in table S1 (see also detailed answer for Reviewer 1 major point 12).

      11) R225 to 243 - overall significance of the growth experiments with mislocaliser is not clear, consider removing from manuscript or explain relevance more clearly

      See also point 28, reviewer 1: This experiment is actually quite important. It shows that if we conditionally inactivate the GFP-tagged MyoF, the growth is further reduced, as stated in line 208. It might have been confusing that the mislocalisation is only partial, but this is equivalent to a partial knock down and hence is useful. This becomes even more relevant with the specific assays following in the next paragraph: while the tagging of MyoF already resulted in vesicles, conditional inactivation with KS generated even more vesicles, showing that the same phenotype was rapidly increased when MyoF was further inactivated by a different means and this also correlated with growth. Hence, this is actually a very consistent phenotype that despite some shortcomings of the tools available to analyse this protein (due to the partial inactivation by the GFP tag) in our eyes looks very convincing. We now added a graph showing the correlation of growth and phenotypes to illustrate this (Figure 1L).

      We also tried to make this clearer by changing line 200 to: Hence, conditional inactivation of MyoF further reduced growth despite the fact that the tag on MyoF already led to a substantial growth defect, indicating an important role for MyoF during asexual blood stage development.” And line 208 to:“ This was even more pronounced upon conditional inactivation of MyoF by KS (Figure 1H), suggesting this is due to a reduced function of MyoF.”

      12) KIC11/KIC12 Enrichment factor?

      The enrichment (’average log2 Ratio normalized Kelch13 from Birnbaum et al. 2020’) is 1.65 for KIC11 and 1.32 for KIC12, which is now also explicitly shown in column D of Table S1.

      ** Referees cross-commenting**

      I would like to applaud reviewer #1 for a great, very thorough review and lots of detailed suggestions. I agree with the conclusions mentioned in the significance evaluation from reviewer #1 and #2: the work presented does not contain novel methods and the scope is rather narrow with the current results. (I am working on clinical studies with novel antimalarial agents)

      Reviewer #3 (Significance (Required)):

      On the one hand side, the authors have wrapped up some of the remaining protein candidates of the K13 compartment and could verify 4 of 10 proteins. The work is of interest for the scientific community working on endocytosis and malaria drug resistance mechanisms. Overall, the conclusions and findings from the previous work, Birnbaum et al. 2020, could be confirmed and extended mainly using the methods previously described. On the other hand, the authors made use of progress in protein structure predictions and identified domains linking the K13 compartment proteins to putative functions. The overlaid protein folds of the newly identified domains in figure 5 look convincing, but I can't comment on the technical details or cut-off used for this in-silico analysis.

      Extended general remarks on the systems used for this work:

      Mainly reviewer 1 suggest (in the general comments and the significance statement) that other systems would have been better suited to use for this work, namely glmS and diCre and also has concerns about the large tag which is seconded by a comment of reviewer 3. In light of this we here provide some extended considerations on the properties for conditional systems and tagging in regards to the goals of this work.

      We would like to point out that we do have experience with the systems considered better-suited by the reviewer (one of the first authors has extensively used glmS (Wichers et al., 2021c; Wichers et al., 2021a; Wichers et al., 2022; Wichers-Misterek et al., 2023) and our lab was one of the first to adopt the diCre system in P. falciparum parasites and we regularly us it (Birnbaum et al., 2017; Mesén-Ramírez et al., 2019; Kimmel et al., 2023)). Clearly, these methods have a lot of strengths but there are a number of issues to be considered for the assays we use in this work (see the next section on conditional inactivation systems). In a nutshell, we believe diCre would give a more reliable readout of the absolute level of "essentiality" (i.e. importance for growth) but is unsuitable or at least difficult to use for the assays that reveal the function of our interest in this work. GlmS basically combines the drawbacks of diCre and knock sideways and hence for most targets is not expected to give a better readout of level of "essentiality" but is similarly difficult to use for our specific assays. The fact that both of these systems are possible to use without adding a tag to the target may be an advantage but without tag one loses some very important features that can be critical to understand the outcome with a given system (see considerations on the tag further below).

      Conditional inactivation systems:

      1. __ speed of inactivation:__ glms acts on mRNA and diCre on the gene level, which makes them slower than techniques acting directly on the protein such as DD or KS. With diCre, mRNA and protein is still left, even if the gene is very rapidly excised. For instance for Kelch13 it takes 3-4 days after excising the gene until protein levels have waned enough that this manifests in a reduced growth (Birnbaum et al., 2017). While in some instances diCre permits same cycle analyses if the protein has a very rapid turn-over (e.g. Rab5a, (Birnbaum et al., 2017)), control in a few hours is still difficult. For vesicle accumulation and bloated food vacuole assays, which are done over comparably short time frames and with specific stages, it is rather challenging to hit the correct time of induction to have all the cells at the correct stage with suitably (and uniformly, ie all cells) sufficiently reduced target protein levels during the assay time. Slow acting systems are also more prone to secondary effects. The more immediate the inactivation, the closer it is to the core of the affected function. With vesicle trafficking processes this is particularly relevant as all vesicle trafficking in a cell is interconnected and there are always recycling pathways that maintain the membrane and protein homeostasis of individual compartments. Particularly for endocytosis there seem to be compensatory capacities at least in other organisms (see e.g. (Chen and Schmid, 2020)). One reason why knock sideways was developed is that it permitted to avoid compensatory changes when vesicle adaptors are inactivated (Robinson et al., 2010).

      The comparably short time frame for malaria parasites to go through different stages during blood stage development also is an issue relevant for inactivation speed. The advantage of speed and the danger of obscured phenotypes is highlighted by our work on VPS45 which showed that in trophozoites this protein is involved in the transport of hemoglobin to the FV whereas in late stages it also has a role in secretory processes. Both of these functions we were able to specifically assess in the same growth cycle using KS to rapidly inactivate the protein (Bisio et al., 2020) but with a slower system would have been more complicated to dissect.

      Speed of effect with glmS: unless the KS does not work well, glmS is slower acting than KS (it does not target the already synthesised protein which can remain in the cell) and also often suffers from only partial inactivation, hence the benefit of using it here is unclear. The option to have an untagged protein is a plus, however it also is a minus, as assessing efficiency (particularly in live cells e.g. for bloated assays etc a fluorescent tag is the only direct option to assess inactivation of target) is critical to ensure the phenotype manifests at the stage of interest.

      lethality/absolute phenotypic effects are detrimental to some assays to study the functions we are interested in for this work: no RSA can be conducted, if the gene is lost and the parasites die. Again, with diCre, one could attempt to hit the point when the parasites have lost sufficient amounts of the target protein when they are placed under ART but then the parasites need to continue growing for ~3 days, which is not possible if the cKO is lethal except for very slowly turning over proteins. However, in that latter case, the parasites likely still had full functionality of the target protein at the beginning of the RSA, when the drug pulse happens and there would be no effect. Knock sideways solves these problems by permitting knock sideways inactivation only under ART (or with a few hours pre-incubation depending on the inactivation speed) to not yet affect growth in a severe manner but inhibiting the process the protein is involved in. It may be possible to use glmS for RSAs, but the slow speed would complicate it (it would not permit control of target protein levels in a matter of a few hours to inactivate the target protein and then re-install it).

      None-absolute inactivation is also a strength for some functional assays. While we really like using diCre, in the case of EXP1 it made it necessary to complement the exp1 cKO parasites with low levels of EXP1 to be able to do functional assays without killing the parasites (Mesén-Ramírez et al., 2019; Mesén-Ramírez et al., 2021). While the lethality issue does not apply to glmS (like knock sideways, it also can be tuned), it is unclear what would be gained over knock sideways. Knockdown levels with glmS vary from gene to gene and cannot be predicted, it is in most cases considerably slower than KS, it requires glucosamine which becomes toxic at higher concentrations and might introduce off target effects and tracking protein levels during the assay would equally need GFP tagging.

      Integration of properties of conditional systems

      Given the above discussed properties, several factors have to be considered to be able to use a system for a given assay. Stage-specific transcription is one example. For diCre a protein not expressed in e.g. rings permits to remove the gene and the protein is never made in that parasite development cycle. We exploited this for instance for two proteins only expressed from the trophozoite stage onwards (Kimmel et al., 2023). However, if lethal (absolute effect problem), this also means one can also only see the phenotype on onset of expression of the target (e.g. if in mitosis, the first nuclear division in case the protein is absolutely essential for the process). This is just one example of such issues. Expression timing, turnover of the protein and homogeneity of stage-specific loss of protein will all influence how clearly the phenotype can be determined. All this will decide the exact time of loss/inactivation of the target protein to levels generating a phenotype and ideally therefore can be monitored during an assay (see considerations on tagging).

      For these reasons vesicle accumulation or bloated food vacuole assays are difficult with slow systems as ideally the target should rapidly be inactivated at the trophozoite stage and the result monitored before the cells have moved to the schizont stage. For this a well responding knock sideways is ideal as the protein can be rapidly taken away (sometimes within seconds) to visualise the immediate, direct effect in the cell.

      As shown for KIC11, there is also no disadvantage of using KS for proteins with other assays or proteins that result in different phenotypes. It permits stage-specific same cycle inactivation without having to worry about the turnover of mRNA and protein (Fig. 2F,G). Thus, besides the advantages of knock sideways for endocytosis related assays and RSAs, we also see no disadvantage of using knock sideways for the functional study of KIC11 which has a role other than endocytosis. KS also permits to specifically target the K13 pool of KIC12, something impossible or very difficult to do with other systems. Hence, we are of the opinion that the system for inactivation was adequate for most of the proteins analysed in this manuscript.

      Large tag: we agree that GFP-tagging can be a disadvantage but in our opinion its benefits often outweigh the drawbacks because it permits easy and immediate (on individual cell level, if need be) monitoring of the presence/location of the target protein (e.g. after KS, but given the discrepancy of the timing between gene excision and protein loss, it might be even more important for techniques such as diCre). No fixing/permeabilisation (prone to artifacts, prevents immediate view of cells) to detect a target with specific antibodies or via a small tag is needed with GFP. Similarly, the use of Western blots to do this is time consuming and impractical if monitoring of left-over protein in the course of an assay such as a bloated food vacuole assay is needed.

      In many cases, adding GFP has no negative effect. In addition, if the bulky folded structure of GFP is tolerated, it usually also tolerates the 2 to 4 12kDa FKBP domains in our standard tag. We also typically add a linker. This approach has worked for a large number of different proteins, including many essential ones for which we would not otherwise have obtained the integration cell lines (Birnbaum et al., 2017; Jonscher et al., 2019; Hoeijmakers et al., 2019; Birnbaum et al., 2020; Kimmel et al., 2023; Sabitzki et al., 2023). Hence, whenever a cell line is obtained with it, this tag in most cases is not a disadvantage. Admittedly an exception in this is MyoF and to some extent maybe MCA2 (we would like to stress that in the case of MCA2 the reason for not being able to obtain the full length tagged cell line is unclear: the protein can be severely truncated to less than 3% of its amino acid sequence and a GFP-tag is tolerated on the version with 2/3s of the protein left, which gives no good reason why the full length was not obtained; a potential reason could be a dominant negative effect). However, we obtained the full length with a small tag detected by IFA for both, MyoF and MCA2 and the location of these agreed well with the GFP tagged versions, indicating that the GFP-tagged versions are useful to show the location of these proteins in live cells.

      There are also tricks to attempt monitoring the effect of e.g. diCre without tagging the target. For instance, if a fluorescent protein is connected to excision without actually being fused to the target (ie excision of the gene leads to its expression of e.g. GFP), which would avoid adding a tag to the target itself. However, the problem with this is that expression of GFP does only show excision, but mRNA producing the target protein and left over target protein may still be there in the cell. All in all, the GFP-tag on the target, while with some drawbacks, is still our preferred method to control to monitor the target protein in the cell (in principle permitting quantification of ablation efficiency on the individual cell level).

      Conclusion on these considerations for this manuscript

      Based on these considerations we do not see the immediate benefit of changing the system for the conclusions drawn from this study and are unsure if they are indeed better suited for this work as suggested. While a more exact readout of "essentiality" might be possible with the diCre system we are of the opinion this is less important than learning the function of a protein which - as outlined above - we believe to be considerably more difficult with diCre and even more so with glmS considering our target functions. The same applies to target specific cellular pools of a protein as done here for KIC12. Clearly MyoF is one example where the employed systems shows limitations, but with the new Figure part showing consistency in phenotype with degree of inactivation (importantly with two different forms of inactivation) and the clarification that the location of the GFP-tagged and HA-tagged versions are actually quite similar in location, we do not think employing an extra system is warranted for the conclusions of this work. Admittedly, the apparent lack of need in ring stags might give an opening to attack MyoF using diCre (by excision before its major expression peak), but depending on lethality this might preclude extended analyses (possibly vesicle assays, for sure not RSAs).

      In the end the question is, if our approach provides the function of target analysed in this work and based on the data in our manuscript and the arguments in the rebuttal, we are reasonably confident that this is the case. It is not very likely the other mentioned techniques would result in a different conclusion on the function of the here studied proteins. In fact, we expect other commonly used techniques to be less suitable for the key assays in this work.

      References used in our responses to the reviewers’ comments:

      Behrens, H.M., Schmidt, S., Peigney, D., Sabitzki, R., Henshall, I., May, J., et al. (2023) Impact of different mutations on Kelch13 protein levels, ART resistance and fitness cost in Plasmodium falciparum parasites. bioRxiv 2022.05.13.491767.

      Behrens, H.M., Schmidt, S., and Spielmann, T. (2021) The newly discovered role of endocytosis in artemisinin resistance. Med Res Rev med.21848.

      Behrens, H.M., and Spielmann, T. (2023) Identification of domains in Plasmodium falciparum proteins of unknown function using DALI search on Alphafold predictions. bioRxiv 2023.06.05.543710.

      Birnbaum, J., Flemming, S., Reichard, N., Soares, A.B., Mesén-Ramírez, P., Jonscher, E., et al. (2017) A genetic system to study Plasmodium falciparum protein function. Nat Methods 14: 450–456.

      Birnbaum, J., Scharf, S., Schmidt, S., Jonscher, E., Hoeijmakers, W.A.M., Flemming, S., et al. (2020) A Kelch13-defined endocytosis pathway mediates artemisinin resistance in malaria parasites. Science (80- ) 367: 51–59.

      Bisio, H., Chaabene, R. Ben, Sabitzki, R., Maco, B., Baptiste Marq, J., Gilberger, T.W., et al. (2020) The zip code of vesicle trafficking in apicomplexa: Sec1/munc18 and snare proteins. MBio 11: 1–21.

      Blum, M., Chang, H.Y., Chuguransky, S., Grego, T., Kandasaamy, S., Mitchell, A., et al. (2021) The InterPro protein families and domains database: 20 years on. Nucleic Acids Res 49: D344–D354.

      Borrmann, S., Straimer, J., Mwai, L., Abdi, A., Rippert, A., Okombo, J., et al. (2013) Genome-wide screen identifies new candidate genes associated with artemisinin susceptibility in Plasmodium falciparum in Kenya. Sci Rep 3.

      Bozdech, Z., Llinás, M., Pulliam, B.L., Wong, E.D., Zhu, J., and DeRisi, J.L. (2003) The transcriptome of the intraerythrocytic developmental cycle of Plasmodium falciparum. PLoS Biol 1: e5.

      Burnette, W.N. (1981) “Western Blotting”: Electrophoretic transfer of proteins from sodium dodecyl sulfate-polyacrylamide gels to unmodified nitrocellulose and radiographic detection with antibody and radioiodinated protein A. Anal Biochem 112: 195–203.

      Casella, J.F., Flanagan, M.D., and Lin, S. (1981) Cytochalasin D inhibits actin polymerization and induces depolymerization of actin filaments formed during platelet shape change. Nature 293: 302–305.

      Cerqueira, G.C., Cheeseman, I.H., Schaffner, S.F., Nair, S., McDew-White, M., Phyo, A.P., et al. (2017) Longitudinal genomic surveillance of Plasmodium falciparum malaria parasites reveals complex genomic architecture of emerging artemisinin resistance. Genome Biol 18: 78.

      Chen, Z., and Schmid, S.L. (2020) Evolving models for assembling and shaping clathrin-coated pits. J Cell Biol 219.

      Dell’Angelica, E.C., Puertollano, R., Mullins, C., Aguilar, R.C., Vargas, J.D., Hartnell, L.M., and Bonifacino, J.S. (2000) GGAs: A family of ADP ribosylation factor-binding proteins related to adaptors and associated with the Golgi complex. J Cell Biol 149: 81–93.

      Demas, A.R., Sharma, A.I., Wong, W., Early, A.M., Redmond, S., Bopp, S., et al. (2018) Mutations in Plasmodium falciparum actin-binding protein coronin confer reduced artemisinin susceptibility. Proc Natl Acad Sci 201812317.

      Henrici, R.C., Edwards, R.L., Zoltner, M., Schalkwyk, D.A. van, Hart, M.N., Mohring, F., et al. (2020a) The plasmodium falciparum artemisinin susceptibility-associated ap-2 adaptin μ subunit is clathrin independent and essential for schizont maturation. MBio 11.

      Henrici, R.C., Schalkwyk, D.A. van, and Sutherland, C.J. (2020b) Modification of pfap2μ and pfubp1 Markedly Reduces Ring-Stage Susceptibility of Plasmodium falciparum to Artemisinin in Vitro. Antimicrob Agents Chemother 64.

      Henriques, G., Hallett, R.L., Beshir, K.B., Gadalla, N.B., Johnson, R.E., Burrow, R., et al. (2014) Directional selection at the pfmdr1, pfcrt, pfubp1, and pfap2mu loci of Plasmodium falciparum in Kenyan children treated with ACT. J Infect Dis 210: 2001–2008.

      Heredero-Bermejo, I., Varberg, J.M., Charvat, R., Jacobs, K., Garbuz, T., Sullivan, W.J., and Arrizabalaga, G. (2019) TgDrpC, an atypical dynamin-related protein in Toxoplasma gondii, is associated with vesicular transport factors and parasite division. Mol Microbiol 111: 46–64.

      Hirst, J., Lui, W.W.Y., Bright, N.A., Totty, N., Seaman, M.N.J., and Robinson, M.S. (2000) A family of proteins with γ-adaptin and VHS domains that facilitate trafficking between the trans-golgi network and the vacuole/lysosome. J Cell Biol 149: 67–79.

      Hirst, J., and Robinson, M.S. (1998) Clathrin and adaptors. Biochim Biophys Acta - Mol Cell Res 1404: 173–193.

      Hoeijmakers, W.A.M., Miao, J., Schmidt, S., Toenhake, C.G., Shrestha, S., Venhuizen, J., et al. (2019) Epigenetic reader complexes of the human malaria parasite, Plasmodium falciparum. Nucleic Acids Res 47: 11574–11588.

      Jonscher, E., Flemming, S., Schmitt, M., Sabitzki, R., Reichard, N., Birnbaum, J., et al. (2019) PfVPS45 Is Required for Host Cell Cytosol Uptake by Malaria Blood Stage Parasites. Cell Host Microbe 25: 166-173.e5.

      Kimmel, J., Schmitt, M., Sinner, A., Jansen, P.W.T.C., Mainye, S., Ramón-Zamorano, G., et al. (2023) Gene-by-gene screen of the unknown proteins encoded on Plasmodium falciparum chromosome 3. Cell Syst 14: 9-23.e7.

      Koreny, L., Mercado-Saavedra, B.N., Klinger, C.M., Barylyuk, K., Butterworth, S., Hirst, J., et al. (2023) Stable endocytic structures navigate the complex pellicle of apicomplexan parasites. Nat Commun 14: 2167.

      Kumari, V., Singh, A.P., Singh, J., Sharma, R., Akhter, M., Mishra, P.K., et al. (2018) Biochemical characterization of unusual cysteine protease of P. falciparum, metacaspase-2 (MCA-2). Mol Biochem Parasitol 220: 28–41.

      Lazarus, M.D., Schneider, T.G., and Taraschi, T.F. (2008) A new model for hemoglobin ingestion and transport by the human malaria parasite Plasmodium falciparum. J Cell Sci 121: 1937–1949.

      Lopez-Hernandez, F.J., Ortiz, M.A., Bayon, Y., and Piedrafita, F.J. (2003) Z-FA-fmk inhibits effector caspases but not initiator caspases 8 and 10, and demonstrates that novel anticancer retinoid-related molecules induce apoptosis via the intrinsic pathway. Mol Cancer Ther 2: 255–263.

      Lord, S.J., Velle, K.B., Mullins, R.D., and Fritz-Laylin, L.K. (2020) SuperPlots: Communicating reproducibility and variability in cell biology. J Cell Biol 219.

      MalariaGEN, Ahouidi, A., Ali, M., Almagro-Garcia, J., Amambua-Ngwa, A., Amaratunga, C., et al. (2021) An open dataset of Plasmodium falciparum genome variation in 7,000 worldwide samples. Wellcome open Res 6: 42.

      Marti, M., Good, R.T., Rug, M., Knuepfer, E., and Cowman, A.F. (2004) Targeting malaria virulence and remodeling proteins to the host erythrocyte. Science 306: 1930–3.

      Mesén-Ramírez, P., Bergmann, B., Elhabiri, M., Zhu, L., Thien, H. von, Castro-Peña, C., et al. (2021) The parasitophorous vacuole nutrient pore is critical for drug access in malaria parasites and modulates the fitness cost of artemisinin resistance. Cell Host Microbe 0: 283.

      Mesén-Ramírez, P., Bergmann, B., Tran, T.T., Garten, M., Stäcker, J., Naranjo-Prado, I., et al. (2019) EXP1 is critical for nutrient uptake across the parasitophorous vacuole membrane of malaria parasites. PLoS Biol 17: e3000473.

      Mukherjee, A., Crochetière, M.-È., Sergerie, A., Amiar, S., Thompson, L.A., Ebrahimzadeh, Z., et al. (2022) A Phosphoinositide-Binding Protein Acts in the Trafficking Pathway of Hemoglobin in the Malaria Parasite Plasmodium falciparum. MBio 13.

      Otto, T.D., Wilinski, D., Assefa, S., Keane, T.M., Sarry, L.R., Böhme, U., et al. (2010) New insights into the blood-stage transcriptome of Plasmodium falciparum using RNA-Seq. Mol Microbiol 76: 12–24.

      Robinson, M.S., Sahlender, D.A., and Foster, S.D. (2010) Rapid Inactivation of Proteins by Rapamycin-Induced Rerouting to Mitochondria. Dev Cell 18: 324–331.

      Sabitzki, R., Schmitt, M., Flemming, S., Jonscher, E., Hoehn, K., Froehlke, U., and Spielmann, T. (2023) Identification of a Rabenosyn-5 like protein and Rab5b in host cell cytosol uptake reveals conservation of endosomal transport in malaria parasites. bioRxiv 2023.04.05.535711.

      Simwela, N. V., Hughes, K.R., Roberts, A.B., Rennie, M.T., Barrett, M.P., and Waters, A.P. (2020) Experimentally engineered mutations in a ubiquitin hydrolase, UBP-1, modulate in vivo susceptibility to artemisinin and chloroquine in plasmodium berghei. Antimicrob Agents Chemother 64.

      Spielmann, T., Gras, S., Sabitzki, R., and Meissner, M. (2020) Endocytosis in Plasmodium and Toxoplasma Parasites. Trends Parasitol 36: 520–532.

      Subudhi, A.K., O’Donnell, A.J., Ramaprasad, A., Abkallo, H.M., Kaushik, A., Ansari, H.R., et al. (2020) Malaria parasites regulate intra-erythrocytic development duration via serpentine receptor 10 to coordinate with host rhythms. Nat Commun 11.

      Traub, L.M., Downs, M.A., Westrich, J.L., and Fremont, D.H. (1999) Crystal structure of the α appendage of AP-2 reveals a recruitment platform for clathrin-coat assembly. Proc Natl Acad Sci U S A 96: 8907–8912.

      Wagner, M.P., Formaglio, P., Gorgette, O., Dziekan, J.M., Huon, C., Berneburg, I., et al. (2022) Human peroxiredoxin 6 is essential for malaria parasites and provides a host-based drug target. Cell Rep 39: 110923.

      Wall, R.J., Zeeshan, M., Katris, N.J., Limenitakis, R., Rea, E., Stock, J., et al. (2019) Systematic analysis of Plasmodium myosins reveals differential expression, localisation, and function in invasive and proliferative parasite stages. Cell Microbiol 21.

      Wan, W., Dong, H., Lai, D.-H., Yang, J., He, K., Tang, X., et al. (2023) The Toxoplasma micropore mediates endocytosis for selective nutrient salvage from host cell compartments. Nat Commun 14: 977.

      Wichers-Misterek, J.S., Binder, A.M., Mesén-Ramírez, P., Dorner, L.P., Safavi, S., Fuchs, G., et al. (2023) A Microtubule-Associated Protein Is Essential for Malaria Parasite Transmission. MBio .

      Wichers, J.S., Gelder, C. van, Fuchs, G., Ruge, J.M., Pietsch, E., Ferreira, J.L., et al. (2021a) Characterization of Apicomplexan Amino Acid Transporters (ApiATs) in the Malaria Parasite Plasmodium falciparum. mSphere 6.

      Wichers, J.S., Mesén-Ramírez, P., Fuchs, G., Yu-Strzelczyk, J., Stäcker, J., Thien, H. von, et al. (2022) PMRT1, a Plasmodium -Specific Parasite Plasma Membrane Transporter, Is Essential for Asexual and Sexual Blood Stage Development. MBio 13.

      Wichers, J.S., Scholz, J.A.M., Strauss, J., Witt, S., Lill, A., Ehnold, L.-I., et al. (2019) Dissecting the Gene Expression, Localization, Membrane Topology, and Function of the Plasmodium falciparum STEVOR Protein Family. MBio 10: e01500-19.

      Wichers, J.S., Tonkin-Hill, G., Thye, T., Krumkamp, R., Kreuels, B., Strauss, J., et al. (2021b) Common virulence gene expression in adult first-time infected malaria patients and severe cases. Elife 10.

      Wichers, J.S., Wunderlich, J., Heincke, D., Pazicky, S., Strauss, J., Schmitt, M., et al. (2021c) Identification of novel inner membrane complex and apical annuli proteins of the malaria parasite Plasmodium falciparum. Cell Microbiol 23: e13341.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank all four reviewers for their helpful and constructive comments. We have gone through each and every comment and proposed how we would address each point raised by the reviewers. We are confident our proposed revisions are feasible within a reasonable and expected time frame. Some of the comments regarding minor typo/aesthetics and extra references have already been addressed in the transferred manuscript. The changes are highlighted in yellow in the transferred manuscript.

      2. Description of the planned revisions

      Reviewer #1

      Major points:

      1. The presented work itself (Figures 1-4) does not need significant adjustments prior to publication, in my view, with only a few points to address. However, the work in Figure 5- doesn't really support the claims the authors make on its own, and would require some additional experiments or at the very least discussion of the caveats to its current form.

      We thank the reviewer for these comments and will follow the reviewer’s suggestion by discussing the caveats regarding the interpretation of Figure 5. We will also add to the discussion to suggest future research approaches beyond the scope of this manuscript that would address the functional importance of localised mRNA translation. We will briefly mention in the discussion methods such as the quantification of the mRNA foci and the disruption of the mRNA localisation signals to disrupt localised translation and the use of techniques such as Sun-Tag (Tanenbaum et al, 2014) and FLARIM (Richer et al, 2021) to visualise local translation directly.

      Tanenbaum et al, 2014 DOI: 10.1016/j.cell.2014.09.039

      Richer et al, 2021 DOI: 10.1101/2021.08.13.456301

      1. Localized glia transcripts, are they "glial/CNS/PNS" significant or are they similar to other known datasets of protrusion transcriptomes? The authors compared their 4801 "total" localized to a local transcriptome dataset from the Chekulaeva lab finding that a significant fraction are localized in both. As the authors note, this is in good agreement with a recent paper from the Talifarro lab showing conservation of localization of mRNAs across different cell types. What the authors haven't done here, is further test this by looking at other non-neuronal projection transcriptomic datasets (for example Mardakheh Developmental Cell 2015, among others). If the predicted glia-localized processes are similar to non-neuronal processes transcriptomes, this would further strengthen this claim and rule out some level of CNS/PNS derived linage driving the similarities between glia and neuronal localized transcripts.

      This is a good point and we thank the review for pointing out this interesting cancer data set. We will do as the reviewer suggests and intersect our data with Mardakheh Dev Cell 2015 to test the further generality of localisation in neurons and glia, in other cell types. Specifically, we plan to intersect both glial (this study) and neuronal (von Kuegelgen & Chekulaeva, 2020) dataset with protrusive breast cancer cells (Mardakeh et al, 2015).

      von Kuegelgen & Chekulaeva, 2020 DOI: 10.1002/wrna.1590

      Mardakeh et al, 2015 DOI: 10.1016/j.devcel.2015.10.005

      1. The presentation/discussion around Figure 3 is a bit weaker than other parts of the manuscript, and it doesn't really contribute to the story in its current form. Notably there is no discussion about the significance of glia in neurological disorders until the very end of the manuscript (page 21), meaning when its first brought up.. it just sits there as a one off side point. The authors might consider strengthening/tightening up the discussion here, if they really want to keep it as a solo main figure rather than integrating it somewhere else/putting it into supplemental. In my view, Figures 2 & 3 should be merged into something a bit more streamlined.

      This is a good point. We plan to strengthen the presentation of Figure 3 and discussion of the significance of glia in neurological disorders by adding a description of the Figure in the Results section and highlighting the significance of glia in nervous system disorders in the Discussion section.

      1. Why aren't there more examples of different mRNAs in Figure 4? Seems a waste to kick them all to supplemental.

      We agree that it could be helpful to show different expression patterns in the main figure. To address this point we will add Pdi (Fig. S4D), which shows mRNA expression in both the glia and the surrounding muscle cell. This pattern is in contrast to Gs2, which is highly specific to glial cells. We will also note that although pdi mRNA is present in both the glia and muscle, Pdi protein is only abundant in the glia, suggesting that translation of pdi mRNA to protein is regulated in a cell-specific manner.

      1. The plasticity experiments, while creative, I think need to be approached far more cautiously in their interpretation. Given that the siRNAs will completely deplete these mRNAs- it really needs to be stressed any/all of the effects seen could just be the result of "defective" or "altered" states in this glial population- which has spill over effects on plasticity in at the NMJ. Without directly visualizing if these mRNAs are locally translated in these processes and assessing if their translation is modulated by their plasticity paradigm, all these experiments can say is that these RNAs are needed in glia to modulate ghost bouton formation in axons. This represents the weakest part of this manuscript, and the part that I feel does not actually backup the claims currently being made. Without any experiments to A. quantify how much of these transcripts are localized vs in the cell body of these glia, B. visualize/quantify the translation of these mRNAs during baseline and during plasticity; the authors cannot use these data to claim that localized mRNAs are required for synaptic plasticity.

      We are grateful to the reviewer for pointing out that we were not precise enough in defining our interpretation of the structural plasticity assay. We did not intend to claim that our results show that local translation of these transcripts is necessary for plasticity, only that these transcripts are localized and are required in the glia for plasticity in the adjacent neuron (in which the transcript levels are not disrupted in the experiment). Definitively proving that these transcripts are required locally and translated in response to synaptic activity would require genetic/chemical perturbations and imaging assays that would require a year or more to complete, so are beyond the scope of this manuscript. To address this point, we will clarify that the results do not show that localized transcripts are required, only that the transcripts are required somewhere specifically in the glial cell (without affecting the neuron level), and we can indeed show in an independent experiment that there are localized transcripts.

      Reviewer #2

      Major points:

      1. The authors analyse the 1700 shortlisted genes for Gene Ontology and associations with austism spectrum disorder, leading to interesting results. However, it is not clear to what extent the enrichments they observe are driven by their presumptive localization or if the associations are driven to a significant extent by the presence of these genes in the selected cell types in the Fly Cell Atlas. One way to address this would be to perform the GO and SFARI analysis on genes that are expressed in the same cells in the Fly Cell Atlas but were not shortlisted from the mammalian cell datasets - the results could then be compared to those obtained with the 1700 localized transcripts.

      This is a fair point raised by the reviewer as genes involved in neurological disease such as Autism Spectrum Disorder may be enriched in CNS/PNS cell types. We will follow the reviewer’s suggestion to perform GO and SFARI gene enrichment analysis in genes that were not shortlisted for presumptive glial localisation.

      1. Although the authors attempt to justify its inclusion, I'm not convinced why it was important to use the whole cell transcriptome of perisynaptic Schwann cells as part of the selection process for localizing transcripts. Including this dataset may reduce the power of the pipeline by including mRNAs that are not localized to protrusions. How many of the shortlisted 1700 genes, and how many of the 11 glial localized mRNAs in Table 5, would be lost if the whole cell transcriptome were excluded. More generally, what is the distribution of the 11 validated localizing transcripts in each dataset in Table 4? This information might be valuable for determining which dataset(s), if any, has the best predictive power in this context.

      We thank the reviewer for raising this point, which we will address with further analysis and adding to the discussion. We propose to address the criticism by running our analysis pipeline without the inclusion of the dataset using Perisynaptic Schwann Cells (PSCs) and then intersect with the PSCs-expressed genes, since their functional similarity with polarised Drosophila glial cells is highly relevant. We also agree with the reviewer that it would be a useful control for us to assess the ‘predictive power’ of each glial dataset by calculating their contribution to the shortlisted 1,700 glial localised transcripts and to the 11 experimentally validated transcripts via in situ hybridisation. To address this point, we plan to add this information in the revised manuscript.

      1. Did the authors check if any of the RNAi constructs are reducing levels of the target mRNA or protein? Doing so would strengthen the confidence in these important results significantly. In any case, the authors should also mention the caveat of potential off-target effects of RNAi.

      We thank the reviewer for their useful comment and agree that the extent to which the RNAi expression reduces the levels of mRNA is not specifically known. We will add a FISH experiment on lac, pdi and gs2 RNAi showing very strong reduction in mRNA levels. We will also add an explanation of the caveats of the use of the RNAi system to the discussion.

      1. Methods: what is the justification for assuming that if the RNAi cross caused embryonic or larval lethality then the 'next most suitable' RNAi line is reporting on a phenotype specific to the gene. If the authors want to claim the effect is associated with different degrees of knockdown they should show this experimentally. An alternative explanation is that the line used for phenotypic analysis in glia is associated with an off-target effect.

      We thank the reviewer for this comment. We agree that off target effects cannot in principle be completely ruled out without considerable additional experimental analysis beyond the scope of this manuscript. To address the criticism we will remove the expression data of the lines that cause lethality and revise the discussion to explain that the level of knockdown in each line is unknown, and would require further experimental exploration.

      Minor points:

      1. It would be helpful to have in the Introduction (rather than the Results, as is currently the case) an operational definition of mRNA localization in the context of the study. And is it known whether or not localization in protrusions is the norm in mammalian glia or the Drosophila larval glia? I ask because it may be that almost all mRNAs diffuse into the protrusion, so this is not a selective process. One interesting approach to test this idea might be to test if the 1700 shortlisted transcripts have a significant underrepresentation of 'housekeeping' functions.

      We thank the reviewer for this excellent suggestion. To address the comment, we will move our explanation of the operational definition of mRNA localization to the Introduction. We will also perform enrichment analysis of housekeeping genes within 1,700 shortlisted transcripts compared to the transcriptome background, as the reviewer suggested.

      Reviewer #3

      Major points:

      1. The authors have pooled data from different studies across different type of glial cells performed from in vitro to in vivo. While pooling datasets may reveal common transcripts enriched in processes, this may not be the best approach considering these are completely different types of glial cells with distinct function in neuronal physiology.

      We thank the reviewer for highlighting the need for us to further justify why we pooled datasets. We will revise the manuscript to better emphasise that the overarching goal of our study was to try to discern a common set of localised transcripts shared between the cells. The problem with analysing and comparing individual data sets is that much of the variation may be due to differences in the methods used and amount of material, rather than differences in the type of cells used. We will revise the discussion to make this point and plan to explain that our approach corresponds well with a previous publication pooling localised mRNA datasets in neurons (von Kugelgen & Chekulaeva 2021).

      von Kuegelgen & Chekulaeva, 2020 DOI: 10.1002/wrna.1590

      1. It is important to note the limitations of the study. For example, DeSeq2 is biased for highly expressed transcripts. How robust was the prediction for low abundance transcripts?

      The presented 1,700 transcripts were shortlisted based on their presence and expression level (TPM) in glial protrusions rather than their relative enrichment. Nevertheless, the reviewer makes a valid criticism of our use of DESeq2, where we compared enriched transcripts in glial and neuronal protrusions in Figure 1D. To address this point we will discuss this caveat in the relevant section.

      The issue raised regarding low abundance transcript prediction raises an important question: does the likelihood of localisation to cell extremities correlate with mRNA abundance? We have already partially addressed this point, since our analysis of the fraction of localised transcripts per expression level quantiles shows only limited correlation. To address this comment, we will add these results in the revised manuscript as a supplementary figure.

      1. The authors identify 1,700 transcripts that they classify as "predicted to be present" in the projections of the Drosophila PNS glia. This was based on the comparison to all the mammalian glial transcripts. Since the authors have access to a transcriptomic study from Perisynaptic Schwann cells (PSCs), the nonmyelinating glia associated with the NMJ isolated from mice; it would be more convincing to then validate the extent of overlap between Drosophila peripheral glial with the mammalian PSCs. This may reveal conserved features of localized transcripts in the PNS, particularly associated with the NMJ function.

      Thank you for the valuable suggestion. A similar point was also raised by [Reviewer #2 - Major point 2] to re-run our pipeline excluding the PSCs dataset and intersect with the PSC transcriptome post-hoc. Please see the above section for our detailed response.

      1. Fig 2: What is the extent of overlap between the translating fractions versus the localized fraction? It will be informative to perform the functional annotation of the translating glial transcripts as identified from Fig 1D.

      This is an interesting question. To address this point, we plan to: (i) compare transcripts that are translated vs. localised in glial protrusions, and (ii) perform functional annotation enrichment analysis on the translated fraction of genes.

      1. "We conclude predicted group of 1,700 are highly likely to be peripherally localized in Drosophila cytoplasmic glial projections". To validate their predictions, the authors test some of these candidates in only one glial cell type. It might be worthy to extend this for other differentially expressed genes localized in another glial type as well.

      The presented in vivo analyses made use of the repo-GAL4 driver, which is active in all glial subtypes, including subperineurial, perineurial and wrapping glia that make distal projection to the larval neuromuscular junction. We agree that subtype-specific analysis would be highly informative, but we believe this is outside the scope of the current work where we aimed to identify conserved localised transcriptomes across all glial subtypes. Nevertheless, to address the comment, we plan to further clarify our use of pan-glial repo-GAL4 driver in the Results and Method section of the revised manuscript.

      1. Figure 5: The authors perform KD of candidate transcripts to test the effect on synapse formation. However, these are KD with RNAi that spans across the entire cell. To make the claim about the importance of "target" RNA localization in glia stronger, ideally, they should disrupt the enrichment specifically in the glial protusions and test the impact on bouton formation. Do these three RNAs have any putative localization elements?

      We agree with the review, that we would ideally test the effect of disruption of mRNA localization (and therefore localised translation). However, we feel these experiments are beyond the scope of this current study, as they will require a long road of defining localisation signals that are small enough to disrupt without affecting other functions. To address this comment we will revise the Discussion section to mention those difficulties explicitly, and clarify the limitations of the approach used in our study for greater transparency.

      Reviewer #4

      Major points:

      1. The authors use FISH to validate the glial expression of their target genes, though these experiments are not quantified, and no controls are shown. The authors should provide a supplemental figure with "no probe" controls, and/or validate the specificity of the probe via glial knockdown of the target gene (see point 2). Furthermore, these data should be quantified (e.g. number of puncta colocalized with NMJ glia membrans).

      Thank you for requesting further information regarding the YFP smFISH probes. We have validated the specificity and sensitivity of the YFP probe in our recent publication (Titlow et al, 2023, Figure 1 and S1). Specifically, we demonstrated the lack of YFP probe signal from wild-type untagged biosamples and showed colocalization of YFP spots with additional probes targeting the endogenous exon of the transcript. Nevertheless, we will address this comment by adding control image panels of smFISH in wild-type (OrR) neuromuscular junction preparations.

      Titlow et al, 2023 DOI: 10.1083/jcb.202205129

      1. For the most part, the authors only use one RNAi line for their functional studies, and they only show data for one line, even if multiple were used. To rule out potential false negatives, the authors should leverage their FISH probes to show the efficacy of their knockdowns in glia. This would serve the dual purpose of validating the new probes (see point 1).

      Thank you for the suggestion. This point was also raised by [Reviewer #2 - Major point 3]. Please see above for our detailed response.

      1. In Figure 5 E, given the severe reduction in size in the stimulated Pdi KD animals, the authors should show images of the unstimulated nerve as well. Do the nerve terminals actually shrink in size in these animals following stimulation, rather than expand? The NMJ looks substantially smaller than a normal L3 NMJ, though their quantification of neurite size in F suggests they're normal until stimulation.

      We share the same interpretation of the data with the reviewer that the neurite area is reduced post-potassium stimulation in pdi knockdown animals. We will follow the reviewer’s suggestion and add an image showing unstimulated neuromuscular junctions.

      Minor points:

      1. The authors claim that there is an enrichment of ASD-related genes in their final list of ~1400 genes that are enriched in glial processes. It is well-appreciated that synaptically-localized mRNAs are generally linked to ASDs. Can the authors comment on whether the transcripts localized to glial processes are even more linked to ASDs and neurological disorders than transcripts known to be localized to neuronal processes?

      This is an interesting point. To address the comment, we will add a comparison of the degree of enrichment of ASD-related genes in neurite vs. glial protrusions in the revised manuscript.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1

      1. The use of blue/green or blue/green/magenta is difficult to resolve in some places. Swapping blue for cyan would greatly aid in visualizing their data.

      This comment is much appreciated. We have swapped blue for cyan in Figures 4 and S4. We have also changed Figure S1 to increase contrast and visibility as per reviewer’s comment.

      1. Make the colouring/formatting of the tables more consistent, its distracting when its constantly changing (also there is no need for a blue background.. just use a basic white table).

      This comment is much appreciated. We have applied a consistent colour palette to the Tables without background colourings and made the formatting uniform.

      Reviewer #2

      1. Introduction: 'Asymmetric mRNA localization is likely to be as important in glia, as it is in neurons,...'. Remove commas

      Thank you for pointing this mistake out. We have made the corresponding edits.

      Reviewer #3

      1. RNA localization in oligodendrocytes has been well studied and characterized. The authors should cite and discuss those papers (PMID: 18442491; PMID: 9281585).

      We thank the reviewer for this useful suggestion. We have added these references to the paper.

      Reviewer #4

      1. In Figure 5D, the authors should include a label to indicate that these images are from an unstimulated condition.

      We thank the reviewer for pointing this out. We have added the label as requested.

      1. The authors are missing a number of key citations for studies that have explored the functional significance of mRNA trafficking in glia, and those that have validated activity-dependent translation:

      - https://pubmed.ncbi.nlm.nih.gov/18490510/

      -https://pubmed.ncbi.nlm.nih.gov/7691830/

      -https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3001053

      -https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7450274/

      -https://pubmed.ncbi.nlm.nih.gov/36261025**_/

      _**

      We thank the reviewer for the comment. We have added these references to the text.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      Peripheral localization of mRNAs to cellular protrusions (e.g. axons, astrocyte peri-synaptic processes, myelin sheaths) is important for activity-dependent regulation of nervous system function. In Gala et al., the authors aim to identify conserved, peripherally located mRNAs in through a combination of unbiased transcriptomics and in vivo validation in Drosophila NMJ glia. To that end, they mine several published datasets enriched for peripherally located mRNAs in glia, identify which of these transcripts are conserved in fly, filtered for mRNAs that are enriched in processes over soma, and filtered for mRNAs that were enriched in three glial subtypes that ensheath their model system: Drosophila NMJ. Finally, they go on to validate 11 (of 15) predicted genes as located in NMJ glia, demonstrating that loss of these transcripts, in some cases, impacts synaptic plasticity.<br /> This is an interesting study that complements a growing interest in the community: how do glia locally support the neurons that interact with. I have a number of suggestions to support the conclusions made in this manuscript:

      Major:

      1. The authors use FISH to validate the glial expression of their target genes, though these experiments are not quantified, and no controls are shown. The authors should provide a supplemental figure with "no probe" controls, and/or validate the specificity of the probe via glial knockdown of the target gene (see point 2). Furthermore, these data should be quantified (e.g. number of puncta colocalized with NMJ glia membrans).
      2. For the most part, the authors only use one RNAi line for their functional studies, and they only show data for one line, even if multiple were used. To rule out potential false negatives, the authors should leverage their FISH probes to show the efficacy of their knockdowns in glia. This would serve the dual purpose of validating the new probes (see point 1).
      3. In Figure 5 E, given the severe reduction in size in the stimulated Pdi KD animals, the authors should show images of the unstimulated nerve as well. Do the nerve terminals actually shrink in size in these animals following stimulation, rather than expand? The NMJ looks substantially smaller than a normal L3 NMJ, though their quantification of neurite size in F suggests they're normal until stimulation.

      Minor:

      1. In Figure 5 D, the authors should include a label to indicate that these images are from an unstimulated condition.
      2. The authors claim that there is an enrichment of ASD-related genes in their final list of ~1400 genes that are enriched in glial processes. It is well-appreciated that synaptically-localized mRNAs are generally linked to ASDs. Can the authors comment on whether the transcripts localized to glial processes are even more linked to ASDs and neurological disorders than transcripts known to be localized to neuronal processes?
      3. The authors are missing a number of key citations for studies that have explored the functional significance of mRNA trafficking in glia, and those that have validated activity-dependent translation:
      4. https://pubmed.ncbi.nlm.nih.gov/18490510/
      5. https://pubmed.ncbi.nlm.nih.gov/7691830/
      6. https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3001053
      7. https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7450274/
      8. https://pubmed.ncbi.nlm.nih.gov/36261025/

      Significance

      Glia are morphologically complex cells that extend tens (microglia/oligodendrocytes) to thousands<br /> (astrocytes) of processes to interact with and support neurons. Given this complex morphology, and the ability of glia to dynamically respond to changes in neuronal activity, there has been a push in recent years to characterize local mechanisms of glia-neuron support. These mechanisms include mRNA trafficking and activity-dependent translation in distal processes. The authors of this study did a nice job computationally identifying a set of putative genes that are enriched in glial processes, which should be of broad interest to the glial community.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript by Gala et al, the authors perform meta-analysis of transcriptomic datasets from different studies focused on synaptically-associated mammalian glial cells. Based on a detailed analysis, the authors predict 1700 localized transcripts and attempted to identify the local transcriptome in glial cells conserved in Drosophila and mammals. This kind of data mining is informative, and can predict the top candidates relatively well, however, the study suffers from lack of depth and a careful assessment comparing across the different datasets. The main concerns have been listed below and suggestions for a more detailed an careful assessment of the data.<br /> 1. The authors have pooled data from different studies across different type of glial cells performed from in vitro to in vivo. While pooling datasets may reveal common transcripts enriched in processes, this may not be the best approach considering these are completely different types of glial cells with distinct function in neuronal physiology.<br /> 2. It is important to note the limitations of the study. For example, DeSeq2 is biased for highly expressed transcripts. How robust was the prediction for low abundance transcripts?<br /> 3. The authors identify 1,700 transcripts that they classify as "predicted to be present" in the projections of the Drosophila PNS glia. This was based on the comparison to all the mammalian glial transcripts. Since the authors have access to a transcriptomic study from Perisynaptic Schwann cells (PSCs), the nonmyelinating glia associated with the NMJ isolated from mice; it would be more convincing to then validate the extent of overlap between Drosophila peripheral glial with the mammalian PSCs. This may reveal conserved features of localized transcripts in the PNS, particularly associated with the NMJ function.<br /> 4. Fig 2: What is the extent of overlap between the translating fractions versus the localized fraction? It will be informative to perform the functional annotation of the translating glial transcripts as identified from Fig 1D.<br /> 5. "We conclude predicted group of 1,700 are highly likely to be peripherally localized in Drosophila cytoplasmic glial projections". To validate their predictions, the authors test some of these candidates in only one glial cell type. It might be worthy to extend this for other differentially expressed genes localized in another glial type as well.<br /> 6. Figure 5: The authors perform KD of candidate transcripts to test the effect on synapse formation. However, these are KD with RNAi that spans across the entire cell. To make the claim about the importance of "target" RNA localization in glia stronger, ideally, they should disrupt the enrichment specifically in the glial protusions and test the impact on bouton formation. Do these three RNAs have any putative localization elements?<br /> 7. RNA localization in oligodendrocytes has been well studied and characterized. The authors should cite and discuss those papers (PMID: 18442491; PMID: 9281585).

      Significance

      In the manuscript by Gala et al, the authors perform meta-analysis of transcriptomic datasets from different studies focused on synaptically-associated mammalian glial cells. Based on a detailed analysis, the authors predict 1700 localized transcripts and attempted to identify the local transcriptome in glial cells conserved in Drosophila and mammals. This kind of data mining is informative, and can predict the top candidates relatively well, however, the study suffers from lack of depth and a careful assessment comparing across the different datasets and the biological significance.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript by Gala et al. describes efforts to systematically identify candidate localizing mRNAs within glial cells of the Drosophila larval nervous system. They address this issue with an interesting and novel approach that involves compiling candidate localized transcripts from available mammalian cell datasets and cross-referencing a list of mRNAs from homologous Drosophila genes with a small number of candidates for mRNA localization in Drosophila glia cells that this team identified in a previous study (which they have now validated further). The results provide compelling evidence that their pipeline has predictive power. The authors then go on to perform functional analysis of a small number of validated localizing transcripts and provide evidence that several of these genes are functionally important in glia, including in processes related to synaptic plasticity. These observations raise the possibility that the localization process is functionally important but this was not tested directly (doing so would require a separate long-term study).

      The manuscript is well written and the figures are of high quality. Methodological details are provided, as is quantification where appropriate. However, some points are not yet sufficiently strongly supported by the data yet and the justification for some aspects of the workflow needs clarifying.

      Major points:

      1. The authors analyse the 1700 shortlisted genes for Gene Ontology and associations with austism spectrum disorder, leading to interesting results. However, it is not clear to what extent the enrichments they observe are driven by their presumptive localization or if the associations are driven to a significant extent by the presence of these genes in the selected cell types in the Fly Cell Atlas. One way to address this would be to perform the GO and SFARI analysis on genes that are expressed in the same cells in the Fly Cell Atlas but were not shortlisted from the mammalian cell datasets - the results could then be compared to those obtained with the 1700 localized transcripts.
      2. Although the authors attempt to justify its inclusion, I'm not convinced why it was important to use the whole cell transcriptome of perisynaptic Schwann cells as part of the selection process for localizing transcripts. Including this dataset may reduce the power of the pipeline by including mRNAs that are not localized to protrusions. How many of the shortlisted 1700 genes, and how many of the 11 glial localized mRNAs in Table 5, would be lost if the whole cell transcriptome were excluded. More generally, what is the distribution of the 11 validated localizing transcripts in each dataset in Table 4? This information might be valuable for determining which dataset(s), if any, has the best predictive power in this context.
      3. Did the authors check if any of the RNAi constructs are reducing levels of the target mRNA or protein? Doing so would strengthen the confidence in these important results significantly. In any case, the authors should also mention the caveat of potential off-target effects of RNAi.
      4. Methods: what is the justification for assuming that if the RNAi cross caused embryonic or larval lethality then the 'next most suitable' RNAi line is reporting on a phenotype specific to the gene. If the authors want to claim the effect is associated with different degrees of knockdown they should show this experimentally. An alternative explanation is that the line used for phenotypic analysis in glia is associated with an off-target effect.

      Minor points:

      1. It would be helpful to have in the Introduction (rather than the Results, as is currently the case) an operational definition of mRNA localization in the context of the study. And is it known whether or not localization in protrusions is the norm in mammalian glia or the Drosophila larval glia? I ask because it may be that almost all mRNAs diffuse into the protrusion, so this is not a selective process. One interesting approach to test this idea might be to test if the 1700 shortlisted transcripts have a significant underrepresentation of 'housekeeping' functions.
      2. Introduction: 'Asymmetric mRNA localization is likely to be as important in glia, as it is in neurons,...'. Remove commas

      Significance

      Whilst glial mRNA localization has been reported in other systems, this study is significant as it paves the way for functional and mechanistic dissection of this process in a genetically tractable model system that involves physiologically relevant neuron-glia interactions. The successful results of the data mining approach may also encourage others to address other problems in this manner. The study would be more impactful if additional hits from the 1700 transcripts were validated for glial localization but the findings in the current manuscript are still important. The advance is more methodological than conceptual. The work will appeal to cell biology and systems biology audiences.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study of mRNA localization in glia has been largely overlooked in comparison to neurons. However, glia are also polarized cells with long cytoplasmic processes that play important roles in neural function. In this manuscript, Gala and Lee et al attempt to fill in this missing void in the literature. They first use a meta-analysis of existing, cross-species, transcriptomic data to identify a set of 1,700 transcripts that are likely to be localized to the periphery of glia in Drosophila. They then used this list of transcripts to predict which mammalian glial transcripts are also likely to be localized. They're analysis suggests that a large proportion of the mammalian glial transcripts are predicted to be localized to the periphery of glia, and that these transcripts are enriched for functions involved in membrane trafficking, cytoskeleton regulation, local translation, and cell-cell communication. A connection with cross-cellular communication (ie glia-glia, glia-neuron, glia-muscle) at the neuromuscular junction, prompting the authors to assess if some of these transcripts play a role in plasticity of at the NMJ. Using siRNA driver lines, that loss of some of their assessed transcripts prevent new synapse formation- suggestive of a role for mRNA localization and local protein synthesis in driving synaptic plasticity. These findings suggest that mRNA localization is a widespread phenomenon in glia, and that it plays an equally important role as the more widely studied neuronal mRNA localization.

      Key Points

      • 1,700 transcripts are predicted to be localized to the periphery of PNS glia in Drosophila at the NMJ.
      • A large proportion of the mammalian glial transcripts are predicted to be localized to the periphery of glia.
      • Localized transcripts are enriched for functions involved in membrane trafficking, cytoskeleton regulation, local translation, and cell-cell communication.
      • Localized glial transcripts may function in synaptic plasticity.

      Major comments

      Overall, the work presented within manuscript is well reasoned and supports the points the authors generally are trying to make. The work as is, does fall a little on the light side, resting on almost exclusive bioinformatic analysis with some limited validation. The presented work itself (Figures 1-4) does not need significant adjustments prior to publication, in my view, with only a few points to address. However, the work in Figure 5- doesn't really support the claims the authors make on its own, and would require some additional experiments or at the very least discussion of the caveats to its current form.

      1. Localized glia transcripts, are they "glial/CNS/PNS" significant or are they similar to other known datasets of protrusion transcriptomes? The authors compared their 4801 "total" localized to a local transcriptome dataset from the Chekulaeva lab finding that a significant fraction are localized in both. As the authors note, this is in good agreement with a recent paper from the Talifarro lab showing conservation of localization of mRNAs across different cell types. What the authors haven't done here, is further test this by looking at other non-neuronal projection transcriptomic datasets (for example Mardakheh Developmental Cell 2015, among others). If the predicted glia-localized processes are similar to non-neuronal processes transcriptomes, this would further strengthen this claim and rule out some level of CNS/PNS derived linage driving the similarities between glia and neuronal localized transcripts.
      2. The presentation/discussion around Figure 3 is a bit weaker than other parts of the manuscript, and it doesn't really contribute to the story in its current form. Notably there is no discussion about the significance of glia in neurological disorders until the very end of the manuscript (page 21), meaning when its first brought up.. it just sits there as a one off side point. The authors might consider strengthening/tightening up the discussion here, if they really want to keep it as a solo main figure rather than integrating it somewhere else/putting it into supplemental. In my view, Figures 2 & 3 should be merged into something a bit more streamlined.
      3. Why aren't there more examples of different mRNAs in Figure 4? Seems a waste to kick them all to supplemental.
      4. The plasticity experiments, while creative, I think need to be approached far more cautiously in their interpretation. Given that the siRNAs will completely deplete these mRNAs- it really needs to be stressed any/all of the effects seen could just be the result of "defective" or "altered" states in this glial population- which has spill over effects on plasticity in at the NMJ. Without directly visualizing if these mRNAs are locally translated in these processes and assessing if their translation is modulated by their plasticity paradigm, all these experiments can say is that these RNAs are needed in glia to modulate ghost bouton formation in axons. This represents the weakest part of this manuscript, and the part that I feel does not actually backup the claims currently being made. Without any experiments to A. quantify how much of these transcripts are localized vs in the cell body of these glia, B. visualize/quantify the translation of these mRNAs during baseline and during plasticity; the authors cannot use these data to claim that localized mRNAs are required for synaptic plasticity.

      Minor points:

      1. The use of blue/green or blue/green/magenta is difficult to resolve in some places. Swapping blue for cyan would greatly aid in visualizing their data.
      2. Make the colouring/formatting of the tables more consistent, its distracting when its constantly changing (also there is no need for a blue background.. just use a basic white table).

      Significance

      This work would be well received within the field, being at a time where increasing focus on mRNA localization and local protein synthesis are major players in glia along with the more widely studied neurons. Additionally, that mRNAs localized in glia are relevant to neurological disorders also comes at a time where increasingly, especially in neurodegenerative disorders, glia are believed to be drivers of the diseases as well- suggesting that dysregulation of local protein synthesis in glia may play a role. The major weakness of this work is that being largely bioinformatic (from existing datasets), a lot of this is speculative and no conclusive data is shown to demonstrate how/that this glia population is utilizing mRNA localization in protrusions to fuel local protein synthesis for a specific purpose (ie synaptic plasticity).

      This work would be of interested to those broadly interested in glial biology and mRNA localization.

      The reviewers background is in RNA localization and local protein synthesis in neurons.