5,256 Matching Annotations
  1. Apr 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the referees for their valuable suggestions. We have revised the text accordingly and already conducted most of the requested experiments.

      Reviewer #1

        1. The authors state that addition of mannan increases length of Birbeck granules however, no data are presented. It would make this more convincing when the length is compared between conditions with and without mannan (as shown in Fig 4, where the condition without mannan is lacking).

      Reply: Thank you for pointing out the missing data. We added an EM image of Birbeck granules and quantification of Birbeck granules formation in the absence of mannan (Figure 4A-D).

      • Supp, fig 1B perhaps as a panel in main figure as this is an important control to show that Birbeck granules are isolated.

      Reply: We moved the supplemental figure 1B to main figure 1D.

        1. Only the(total) length of Birbeck granules is taken into account, but not the number of Birbeck granules. Is it possible to quantify the number of Birbeck granules.

      Reply: We added Figure 4D to show the number of Birbeck granules. Note that the difference in the number of Birbeck granules was less significant than that of total length because there were numerous short fragments in the mutant specimen.

      • Fig 5. Only the condition (ARGK) where there is virtually no Birbeck granules formation is included, however, is virus still internalized in the other conditions (MRGD or MRGK) as Birbeck granule formation was less effective but still present? It would be interesting to include those mutants. A more specific quantification would be by p24 ELISA. Is there a reason why immunoblotting has been chosen? In the supernatant condition, explain why the virus p24 seems less in the control condition whereas one would expect max concentration in that condition.

      Reply: Thank you for suggesting the use of ELISA. We chose immunoblotting because of its higher sensitivity and lower cost. But ELISA is advantageous when it comes to comparing large number of samples. We performed p24 ELISA and quantified the virus internalization in all the mutants available (Figure 5C). As you pointed out, the transfer efficiency of the immunoblot in Figure 5A was not uniform across the membrane; Pr55 bands became denser toward the right, while p24 bands had a gradient in the opposite direction. The immunoblots and ELISA showed that about ~1% of the viruses were attached or internalized and ~99% did not interact with the cells. Thus, the attached/internalized viruses did not affect the amount of viruses in the supernatant. Results of ELISA also showed the amount of viruses in the supernatant were nearly equal among the samples (Figure S3B).

      • Abstract First sentence: not mucosal tissue but mucosal epithelium Last sentence: Virual should be viral

      Reply: We corrected the typo. Thank you.

      • Discussion The last section comparing DC-SIGN and langerin is not clear and some overstatements are made. "Considering that DC-SIGN serves as an attachment receptor for viruses but not as an entry receptor, the possible structural coupling of lateral ligand binding and internalization implies that langerin functions as a more efficient entry receptor for viruses than DC-SIGN or other C-type lectins." It is not correct that langerin but not DC-SIGN can function as an entry receptor. DC-SIGN has been shown to facilitate infection of different viruses such DENV and ZIKV. In contrast, langerin can restrict viruses such as HIV-1 but also facilitate infection for example Influenza A and DENV. So attachment or entry is more likely a consequence of the internalization and dependence on pH changes for fusion as some viruses such as DENV fuse in acidic vesicles. This needs to be discussed more clearly.

      Reply: Thank you for pointing out our wrong statement. We replaced the statement with weakened one as below:

      Page 13, line 213: “The difference in the ligand-binding manner between langerin and DC-SIGN may contribute to their different carbohydrate recognition preferences (Valverde et al., 2020; Takahara et al., 2004).“

      Reviewer #2 1) Langerin can exist on the cell surface and in Birbeck granules. They should examine langerin cell surface expression in the 3 states, wildtype, mutated and lectin - . Do the mutations change cell surface expression?

      Reply: We performed surface labeling experiments and showed that those mutations did not affect surface expression of langerin (Figure S3A).

      2) Birbeck granules are present in the absence of mannan and pathogens (see Pena-Cruz JCI 2018, PMID: 29723162). Thus, this suggests that Birbeck granules are present even without langerin clathrin coated pit internalization from the cell surface. How does their model account for this observation?

      Reply: We think there are two possibilities:

      1. Birbeck granules were shown to stem from the endoplasmic reticulum (Valladeau et al Immunity 2000; Lenormand et al PlosONE 2013). Since the rER is the site of glycosylation, langerin is likely to capture the oligo-mannose-glycosylated proteins within the rER and form Birbeck granules.
      2. Blood plasma proteins such as immunoglobulin D, immunoglobulin E, and apolipoprotein B-100 are reported to carry high-mannose glycans (Clerc et al Glycoconj J. 2016). Those glycoproteins in the cell culture media can induce Birbeck granule formation.

        3) Different cell types can have varied Langerin levels (see Pena-Cruz JCI 2018, PMID: 29723162). Is Birbeck granule formation depend on certain level of langerin expression? Do Birbeck granules form when Langerin is present at low as compared to high levels?

      Reply: In the course of the experiments, we isolated a cell line stably expressing langerin. However, langerin expressing cells were extremely slow in proliferation and the expression levels were low. To answer this question, we recovered this “failed” stable cell line and found that the low langerin-expressing cells can form Birbeck granules, but with lower efficiency (Figure S3C-E).

      4) Authors use immunoblots to show that HIV is present in intra-cellular Langerin structures. It would be ideal to visualize HIV with presumably internal Birbeck granules using imaging techniques such as cryo-electron micrography or another form of high resolution imaging.

      Reply: We are currently working on ultra-thin section electron microscopy of HIV-infected langerin-expressing cells. Visualization of HIV-containing Birbeck granules using cryo-electron microscopy is highly challenging because the current precision of cryo-FIB-SEM milling technique is too low to target a specific intracellular structure. We believe conventional electron microscopy will provide sufficiently convincing evidence that HIV is present within Birbeck granules.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, investigators used cryo-electron tomography to reconstruct the structure of langerin and langerin composed organelle, termed Birbeck granule. They find that langerin trimers interact via carbohydrate binding cleft, mediated via 258 - 262 residues. Mutations in the residue prevent Birbeck granule structures. They propose a molecular structure for HIV binding and internalization.

      Significance

      This is highly interesting work with significance for understanding pathogen, such as HIV, recognition and clearance in mucosal antigen presenting cells.

      I am not an expert in structural studies but the cryo-electron tomography is impressive and convincing. I have concerns with some of the HIV - Birbeck granule aspects. Cell transfected with langerin and mutated langerin were exposed to HIV pseudotypes. They show that HIV binding occurs in the absence of mannan with both wildtype and mutated langerin. On the other hand, a langerin that lacks calcium binding does not bind virus (lectin -). They show that the mutated langerin has limited internalization, presumably because of lack of Birbeck granule formation.

      1. Langerin can exist on the cell surface and in Birbeck granules. They should examine langerin cell surface expression in the 3 states, wildtype, mutated and lectin - . Do the mutations change cell surface expression?
      2. Birbeck granules are present in the absence of mannan and pathogens (see Pena-Cruz JCI 2018, PMID: 29723162). Thus, this suggests that Birbeck granules are present even without langerin clathrin coated pit internalization from the cell surface. How does their model account for this observation?
      3. Different cell types can have varied Langerin levels (see Pena-Cruz JCI 2018, PMID: 29723162). Is Birbeck granule formation depend on certain level of langerin expression? Do Birbeck granules form when Langerin is present at low as compared to high levels?
      4. Authors use immunoblots to show that HIV is present in intra-cellular Langerin structures. It would be ideal to visualize HIV with presumably internal Birbeck granules using imaging techniques such as cryo-electron micrography or another form of high resolution imaging.

      All microbiologists, immunologists, and investigators interested in infectious disease will be interested in this work.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors here have used cryo-electron tomography, 3D reconstruction and modelling on isolated Birbeck granules to provide a molecular mechanism for langerin-induced Birbeck granule formation. Their data revealed a structure of the repeating unit of the honeycomb lattice of langerin in Birbeck granules. Their model suggests that the interaction between the two langerin trimers is mediated by docking the flexible loop at residues 258-262 into the secondary carbohydrate-binding cleft. Mutational analysis within the loop suggests that these interactions are important for Birbeck granule formation and virus internalization.

      The results presented in the manuscript are very interesting and propose an novel mechanism how langerin induces Birbeck granule formation and how two langerin trimers are able to interact with virus and induce Birbeck granule formation.

      Comments.

      Fig. 1. The authors state that addition of mannan increases length of Birbeck granules however, no data are presented. It would make this more convincing when the length is compared between conditions with and without mannan (as shown in Fig 4, where the condition without mannan is lacking).

      Supp, fig 1B perhaps as a panel in main figure as this is an important control to show that Birbeck granules are isolated.

      Fig. 4. Only the(total) length of Birbeck granules is taken into account, but not the number of Birbeck granules. Is it possible to quantify the number of Birbeck granules.

      Fig 5. Only the condition (ARGK) where there is virtually no Birbeck granules formation is included, however, is virus still internalized in the other conditions (MRGD or MRGK) as Birbeck granule formation was less effective but still present? It would be interesting to include those mutants. A more specific quantification would be by p24 ELISA. Is there a reason why immunoblotting has been chosen? In the supernatant condition, explain why the virus p24 seems less in the control condition whereas one would expect max concentration in that condition.

      Minor comments

      Abstract First sentence: not mucosal tissue but mucosal epithelium Last sentence: Virual should be viral

      Discussion The last section comparing DC-SIGN and langerin is not clear and some overstatements are made. "Considering that DC-SIGN serves as an attachment receptor for viruses but not as an entry receptor, the possible structural coupling of lateral ligand binding and internalization implies that langerin functions as a more efficient entry receptor for viruses than DC-SIGN or other C-type lectins." It is not correct that langerin but not DC-SIGN can function as an entry receptor. DC-SIGN has been shown to facilitate infection of different viruses such DENV and ZIKV. In contrast, langerin can restrict viruses such as HIV-1 but also facilitate infection for example Influenza A and DENV. So attachment or entry is more likely a consequence of the internalization and dependence on pH changes for fusion as some viruses such as DENV fuse in acidic vesicles. This needs to be discussed more clearly.

      Significance

      There is little known about the molecular mechanism of birbeck granule formation and the role of langerin as well as its ligand (HIV or mannan). here the authors convincingly reveal a mechanism which is corroborated by mutational analyses. This is important in the field. the major drawback which is that a cell-line has been used, 293T, and overexpression of langerin. I understand the reason (manipulation in other cells more difficult, no good LC cell-lines, primary cells probably impossible) but it makes the significance a bit less. overall this is a significant contribution to the field.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are grateful for the referees' rigorous review of our manuscript and for their overall positive reception of our work. We have pasted below the entirety of the reviewers’ comments, interleaved with our responses.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, Gama et al. use a biophysical assay DAmFRET, structural analysis, and optogenetic tools to uncover the nucleation mechanism of CBM signalosome. They performed experiments first in yeast cells that lack death folds or related signaling networks, then confirmed their discoveries in human cells. The results presented here are clear and convincing. The paper is very well presented and clearly written.

      They found it is the CARD domain of BCL10 that acts as a molecular switch that drives all-or-none activation of NF-kB. Monomeric BCL10 possesses an unfavorable conformation and serves as a nucleation barrier, keeping BCL10 in a supersaturated inactive state that allows for binary activation upon stimulation.

      They also characterized CARD9 CARD domain and a coiled-coil region. They reasoned that CARD9CARD functions as a polymer seed to nucleate BCL10, and that the coiled-coil region has multimerization ability to facilitate nucleation. Furthermore, they characterized that MALT1 activation doesn't depend on BCL10 polymers but its own proximity. And MALT1 induces graded NF-kB activation, thus further demonstrating the binary activation is conferred by BCL10.

      Major comments:

      1. Fig S1D and E, the authors used TNF-a to activate NF-kB independent of CBM signalosome and found the activation in each cell increased with dose. In contrast, CBM activation led to bimodal cell activation. The authors claim that this is evidence that positive feedback upstream of NF-kB. We do not believe this claim can be made from this comparative experiment alone. We agree that positive feedback is important for activating an NF-kB response, but the comparison between CBM and TNFa is inaccurate and glosses over published data. Specifically, there is published data that TNF-a does activate a 'switch-like' or digital response, as defined by the translocation of p65 (see (Tay et al. 2010) among other studies that have examined p65 translocation at the single-cell level). The difference in T-sapphire expression between CBM and TNF activation is most likely due to TNFa induced oscillations of p65 translocation (although this is speculation on our part). Therefore we suggest to the authors that the TNF-a data (Fig S1D and E) should be omitted, as the claim of switch or not-switch as pertains to TNF signaling is more complex and nuanced than presented here. We believe omitting this data will strengthen the manuscript and avoid confusion in the field. The bimodal expression of the T-sapphire NF-kB reporter driven by the CBM signalosome activation is sufficient to claim an all-or-none response.

      We thank the reviewer for this suggestion. We acknowledge that the activation of NF-κB by TNF-ɑ is more complex than we had presented, and agree that the differences in T-Sapphire reporter output could be attributed to p65 oscillations. We had not previously considered this interesting possibility -- which is not addressed by the present data -- believe it is worth future investigation. As suggested by the reviewer, we have now omitted the TNF-a data, and agree that this change does not impact the overall claims of the paper.

      Fig 3B, the authors introduced CARD9CARD-µNS as a stable condensed seed for BLC10. However, considering CARD9CARD can form polymers at high concentration (Fig 3B and S3D), are these high expression levels of CARD9CARD able to induce BCL10-mEos3.1 assembly (as measured by DamFRET in yeast cells)? Can the authors examine BCL10 FRET at these high expression level of CARD9CARD? We assume that BCL10 will be assembled in these cells. This would provide a valuable control experiment and support the author's conclusions.

      Indeed, this question is amenable to DAmFRET. Accordingly, we have now performed DAmFRET of yeast cells expressing Bc10-mEos3.1 in the presence of either CARD9CARD-mCardinal or mCardinal itself (see new Fig S6A and B, and associated results section). We confirmed that cells with high CARD9CARD-mCardinal expression had higher FRET on average than cells with low expression. Importantly, cells expressing high or low levels of mCardinal itself had the same FRET level (Fig S6).

      Fig 3C, the text said "Whereas WT CARD9CARD assembled into polymers at high concentration, the pathogenic mutants R18W, R35Q, R57H, and G72S failed to do so (Fig 3C and S7B,C), explaining why they cannot nucleate BCL10". This claim that these mutants can not nucleate BCL10 does not have a figure call out or a reference. The authors then show the results in Fig 3E which supports this claim. Even though they were done in the context of full-length CARD, all proteins contain the I107E mutation that releases autoinhibition. For clarity, the authors should consider rearranging the text to avoid explaining a phenomenon and making conclusions before showing the results.

      We have now rearranged this section to match the figures and claims.

      Fig 4D, E and Video 1, the authors showed the nucleation of BCL10 into puncta within live cells is followed by p65 translocation to the nucleus. The authors claim that 'this result suggests that BCL10 is indeed supersaturated prior to stimulation' (paragraph 2 section titled BCL10 is endogenously supersaturated'). We fail to understand how this live-cell experiment leads to the conclusion BCL10 is supersaturated before stimulation. We think this text should be deleted from the text, or put into context with the DAmFRET data that lead the authors to make this claim. It would be interesting for the authors to define in discussion what are the golden criteria to claim a protein exists in a supersaturated state with live cells (by microscopy or other methods)? Adaptor protein assembly into puncta and the subsequent nuclear translocation of transcription factors is a common phenomenon across signalling pathways. Not all these pathways rely on signaling adaptors existing in a supersaturated state. The field of cell signaling (and cell biology in general) would benefit from a detailed definition of how these physical-chemical definitions of proteins are supported by experimental data. We believe that this paper will become a seminal paper in the field, and future work will benefit from a clear definition of how a claim of supersaturation is derived from the data.

      We appreciate that the concept of supersaturation will be foreign to many biologists, and welcome this opportunity to elaborate. We have now rephrased the corresponding results section for figure 4D, E, and have added new evidence to support our claim that BCL10 is supersaturated, as had been requested by reviewer 2 (see below in response to point 1). Supersaturation, as we (correctly) use the term, occurs when the concentration of a protein in solution exceeds its equilibrium solubility for the given conditions. The term is also sometimes used to describe __global __protein “concentrations” in excess of the solubility limit, even if a dense phase has already formed and potentially depleted the effective concentration (in solution) to the solubility limit. This is a key distinction, as only the former implies a high-energy out-of-equilibrium scenario that predetermines a future change -- release of the excess energy via phase separation.

      How does one experimentally determine if a protein is supersaturated? In theory, one may conclude that a protein is supersaturated if its assembly causes a net loss of energy from the system (i.e. exothermic). Unfortunately, it is likely not yet possible to perform such measurements with sufficient sensitivity inside a living cell. However, it is possible to infer that a protein is supersaturated if assembly can be shown to occur without a net input of energy to the system, i.e. without any change in thermodynamic control parameters such as temperature, pH, post-translational modifications, concentration of the protein, or concentration of any interacting factor. To do this, one introduces a substoichiometric amount of pre-assembled protein to the system. This manipulation will trigger assembly if the protein is supersaturated. If the protein is instead subsaturated, assembly will not occur and the exogenously added assemblies will simply dissolve. This phenomenon, known as “seeding” in the prion field, is considered a golden criterion sufficient to conclude that a protein has prion behavior. However, because bona fide prions additionally require a means for dissemination between cells, seeding analyzed at the cellular rather than population level is more appropriately considered a sufficient criterion for supersaturation (which is a prerequisite for classical prion behavior (Khan et al. 2018)). Our CARD9CARD-Cry2 experiment was designed to test this criterion. Specifically, it allowed us to introduce a seed independently of receptor activation, thereby precluding any orthogonal cellular response that might lower Bcl10 solubility through e.g. a post-translational change. That the seeds were substoichiometric is evidenced by the fact that Bcl10 polymerized homotypically following stimulation (i.e. it didn’t just bind to the CARD9CARD puncta, but went on to deposit onto itself).

      How does assembly under this scenario differ in principle from the many examples of puncta formed by other signaling proteins that occur upon stimulation of their respective pathways? Puncta formation that is induced by a thermodynamic change in the cell cannot be said to have resulted from pre-existing supersaturation. Rather, the stimulus may have caused some change that either increases the effective concentration of the protein (e.g. upregulates its expression, induces a post-translational change that activates it, or releases an inhibitory factor) or reduces solvent activity (e.g. change in pH).

      An additional requirement (necessary but not sufficient) is that the assembly must be regular with respect to some order parameter. That is to say, it must be a bona fide “phase”. At a minimum, this implies a uniform density. Additionally, for supersaturation to persist over biological timescales under physiological conditions and confinement volumes, the assembly (once formed) must also have structural repetition in at least two dimensions, i.e. crystallinity (Rodríguez Gama et al. 2021; Zhang and Schmit 2016). We know this to be true for Bcl10.

      Rodríguez Gama A, Miller T, Halfmann R. 2021. Mechanics of a molecular mousetrap-nucleation-limited innate immune signaling. Biophys J 120:1150–1160. doi:10.1016/j.bpj.2021.01.007

      Khan, T., Kandola, T.S., Wu, J., Venkatesan, S., Ketter, E., Lange, J.J., Rodríguez Gama, A., Box, A., Unruh, J.R., Cook, M., et al. (2018). Quantifying nucleation in vivo reveals the physical basis of prion-like phase behavior. Mol. Cell 71, 155-168.e7.

      Zhang L, Schmit JD. 2016. Pseudo-one-dimensional nucleation in dilute polymer solutions. Phys Rev E 93:060401. doi:10.1103/PhysRevE.93.060401

      Regarding the supersaturated state of BCL10, the authors convincingly use optogenetics to show how transient assemblies of CARD-Cry2 can template BCL10 assembly. This is a convincing experiment that shows templated nucleation of BCL10. To strengthen the claim that BCL10 is supersaturated endogenously we suggest the author quantify the expression of BCL10-mScarlet and CARD-Cry2 and ideally show that this phenomenon can be observed at expression levels equivalent to endogenous.

      As stated above, that BCL10-mScarlet formed polymers that we observed to elongate homotypically off of the CARD9CARD seeds indicates that the protein was supersaturated under the conditions of the experiment. The concentration of CARD9 is not a relevant parameter in this case. We had already compared the expression of BCL10-mScarlet to endogenous BCL10 in 293T, THP-1, and human fibroblast cells by quantitative immunodetection (Fig. S10D), revealing that the expression level of our BCL10-mScarlet constructs matched that of endogenous BCL10, which was approximately the same in all cell lines. We also compared the distribution of expression levels of BCL10-mScarlet versus that of endogenous BCL10 using antibody staining followed by flow cytometry, which confirmed that the range of expression levels of BCL10-mScarlet falls within that of endogenous BCL10 in 293T cells (Fig. S10F). Hence, we believe our data suffice to conclude that Bcl10 is supersaturated at endogenous levels of expression.

      Minor comments:

      1. Special character "delta" is not displayed in the text (instead only a space).

      This error occurred upon exporting the manuscript from our text editor to a PDF. We now have made sure all special characters are present in the PDF version.

      Several cell lines including mouse, human, and yeast lines were used across this manuscript. It would be clearer and more helpful if the exact cell type of the line could be indicated. Such as, "BCL10-mEos3.1 yeast cells" instead of "BCL10-mEos3.1 cells", "BCL10-mScarlet HEK293T cells" instead of "BCL10-mScarlet cells".

      We have now modified all instances to indicate the origin of the cell lines tested.

      Fig 5B, the authors indicated that BCL10 colocalized with CARD9CARD, then please show the merged image as well.

      We have now included the merged image to indicate colocalization in the inset images.

      Fig 6E, authors claimed that cells were stimulated with blue light for the indicated durations. The longest duration is 12 hours. Please specify if it was continuous exposure or several rounds of exposure in the indicated durations.

      We have now specified in the figure legends, text, and methods section, that this specific experiment used a continuous exposure of blue light.

      Reviewer #1 (Significance (Required)):

      This work used a combination of FRET and optogenetic tools to engineer CBM signaling and visualize the effects. They incorporated knowledge from structure biology, together with their results from mutations and truncations, dissected the significance of each protein in CBM signalosome, and demonstrated in detail how higher-order assemblies make all-or-none cellular decisions. We believe this paper will be a seminal paper in the field of cell signalling and cytoplasmic organization. It defines a new paradigm of macromolecules assembly of signalling complexes as being dependent on protein existing in a supersaturated state. Importantly this paper opens up new questions regarding macromolecular signaling complexes (found in many innate immune signaling pathways): How is protein supersaturation maintained and used throughout evolution to construct biochemical signalling switches?

      This paper will be of particular interest to scientists working on immunity and cell signalling, especially in the field of higher-order assemblies. However, we feel the impact of this paper goes beyond these fields, and we believe this manuscript will be of broad interest to the cell biology and biophysics communities. For reference, our expertise is in innate immunity and cell biology.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In their manuscript entitled "A nucleation barrier springloads..." Rodriguez-Gama et al. dissect the assembly mechanism of the signalosome, composed of the proteins CARD9, BCL10 and MALT1, using a novel in-cell biophysical approach (DAmFRET). They first overexpressed fluorescently tagged versions of the proteins to promote their assembly in yeast and mammalian cells, finding that CARD9 forms higher order assemblies across a wide range of concentrations with no discontinuity in the DAmFRET profile. In contrast, the DAmFRET profile of BCL10 showed a clear separation between monomers and higher order assemblies, which started to form spontaneously only at higher BCL10 concentrations. Furthermore, at the two states of the proteins co-exist at all concentrations. These observations imply that there is a nucleation barrier to forming BCL10 assemblies. MALT1 showed no change in FRET regardless of its expression level. These observations, alongside fluorescence microscopy of the assemblies, and previous structural studies, suggest that BCL10 forms self-templating polymers that act as a switch for an all-or-nothing immune response, assayed in this case by monitoring the nuclear translocation of the NF-kB subunit p65. The authors also assessed the effects of known disease-causing mutations on the nucleation barrier, showing that changes in the strength of the nucleation barrier can have major effects on signalosome function. Finally, they used optogenetic methods to trigger assembly of individual signalosome components, providing insight into the minimal components/conditions required for signalosomes to work.

      Major comments

      Overall, the experiments by Rodriguez-Gama et al. offer convincing evidence that there is a nucleation barrier to BCL10 polymerisation, and that a CARD9 template is sufficient to overcome the barrier. Although the existence of a nucleation barrier had already been postulated, based on structural and other studies (referenced by the authors), it had lacked a rigorous demonstration. This work provides that demonstration, which is important for the signalosome field and more broadly applicable to researchers studying cellular decision making. The study further demonstrates that DaMFRET is an excellent to study protein assembly processes in their native environment, allowing the authors to tackle a question that would have been technically very difficult to address otherwise. The optogenetic experiments are a nice sufficiency test for their ideas.

      We feel there are a few key points to address before publication.

      1) One of the main conclusions is that spring-loading the nucleation barrier with high super-saturating BCL10 concentrations allows a decisive response. Although much of the data strongly imply this conclusion, the dependence of the immune response on BCL10 concentration was not tested directly. A key prediction of the nucleation barrier is that at concentrations below saturation, BCL10 should not be able to induce an all-or-nothing response when stimulated. At saturated/super-saturated concentrations BCL10 should be able to induce a response. At deeply super-saturated concentrations the response should start to be activated spontaneously in the absence of an external stimulus. These predictions could be tested using the doxycycline-inducible BCL10 system (Figure S2D), without establishing major new experimental avenues. We feel that such an experiment would strengthen the main conclusion. It might also help to shed light on whether being highly supersaturated enables a more decisive response than being just saturated.

      This is a great idea. As the reviewer suggested, our Doxycycline-inducible BCL10 system enables us to induce and track the state of BCL10 over time. We have now performed the requested experiments (Fig. S9D, E) and incorporated the results into the relevant section of the text. In short, our new analyses show that BCL10 indeed has a concentration threshold for activation by stimulation, and that it can also nucleate spontaneously when overexpressed. Note that our original analyses in Fig. 4B and C also demonstrate spontaneous BCL10 activation at high concentrations. With this new evidence and the orthogonal approaches used in Fig. 5, we believe our data definitively support our conclusion that BCL10 is supersaturated.

      2) Intuitively, readers might expect that if BCL10 is supersaturated then, once nucleated, it would rapidly assemble at the nucleation sites. In Figure 5B, CARD9CARD-miRFP670nano-Cry2 assemblies are optically induced throughout the cell. However, BCL10 appears to nucleate at just a few sites with a few minutes delay. More widespread nucleation and growth of BCL10 polymers seems to take longer (20-40 minutes, Figures 5B and 5C), after CARD9CARD-miRFP670nano-Cry2 has disassembled. Furthermore, in Figures 4D and 4E, very few BCL10 assemblies are visible/quantifiable after 70 minutes PMA exposure, but p65 has clearly entered the nucleus. It looks like BCL10 assembly slightly lags behind p65 nuclear entry. Can the authors provide a more detailed explanation of these kinetics?

      We do note that the number of CARD9CARD clusters formed upon opto-stimulation exceeds the apparent number of BCL10 nucleation sites. We believe this is consistent with nucleation-limited kinetics, where the clustering of CARD9-CARD increases the local probability of nucleation. As nuclei form and grow, they lower the probability of subsequent nucleation elsewhere in the cell. Additionally, it is possible that our artificial seeds do not perfectly mimic the native CARD9 seeds that form upon natural stimulation (e.g. due to potential steric interference from the fluorophore and Cry2). We also acknowledge that there is a slight delay in the visible appearance of BCL10 polymers relative to p65 nuclear translocation. We expect that MALT1 activates already when the polymers are still too small to see (sub-resolution), whereas the polymers only become microscopically visible once they’ve grown quite a bit more.

      3) Related to point 2 above, in Figure 5D, the leftmost cell in the field of view clearly contains CARD9CARD assemblies but there are no BCL10 assemblies and p65 is not imported into the nucleus (in contrast to the central cell in the field of view). How often does CARD9CARD optogenetic assembly lead to BCL10 assembly? In other words, can the authors quantify the cell-to-cell variability in this experiment?

      Throughout our experiments, whether analyzing BCL10 puncta formation, NF-kB transcriptional activity, or p65 translocation, we observed a persistent nonresponsive fraction of cells even at saturating levels of stimulation. Specifically, approximately 30% of THP-1 cells failed to acquire T-Sapphire fluorescence or form BCL10-mEos3.2 puncta when stimulated with high levels of β-glucan (Fig 1B and E, respectively), and approximately 25% of 293T cells failed to acquire T-Sapphire fluorescence or exhibit p65 nuclear translocation when stimulated with high levels of PMA (Fig 1C and Fig 4E, respectively). Because these numbers did not depend on whether BCL10 was endogenously or exogenously expressed, we know that the underlying cell-to-cell heterogeneity involves factors upstream of BCL10. Indeed, the fraction of recalcitrant cells drops to 10% in our optogenetic experiments that bypass upstream factors (Fig S11E). Possible sources of heterogeneity include different physiological states of the cells or fluctuations in the expression levels of any upstream factor in the signaling pathway. We believe that this phenomenon is not unique to the CBM signalosome, as we (unpublished) and others (Fernandes-Alnemri T et al, 2009, Dick M et al, 2016) have similarly observed a fraction of non-responding cells upon activation of the inflammasome, which involves nucleation-limited polymerization of the adaptor protein ASC. While this phenomenon is interesting and may be important to our understanding of the full complexity of signalosomes in vivo, we believe that identifying the source of heterogeneity would be outside the scope of the present manuscript. We now describe this phenomenon in the final paragraph of the “Endogenous BCL10 is constitutively supersaturated” section.

      Fernandes-Alnemri, T., Yu, JW., Datta, P. et al. AIM2 activates the inflammasome and cell death in response to cytoplasmic DNA. Nature 458, 509–513 (2009). https://doi.org/10.1038/nature07710

      Dick, M., Sborgi, L., Rühl, S. et al. ASC filament formation serves as a signal amplification mechanism for inflammasomes. Nat Commun 7, 11929 (2016). https://doi.org/10.1038/ncomms11929

      Minor comments

      While the work is scientifically well done, the text reads as though it is meant for experts rather than a broad audience. This is a pity because it risks alienating readers. We suggest that some adjustments to the text (mainly additional explanations and not ruling out alternative interpretations of the data) would widen the audience and increase the impact of this important study. Below are some suggestions that might help.

      1. In the first results section, the authors write: 'This suggests that Bcl10 but not CARD9 assembly occurs in a highly cooperative fashion that could, in principle (Koch, 2020), underlie the feed forward mechanism.' It isn't obvious how Figure 1 leads to this statement. Could the authors give a more detailed explanation?

      We have now revised the text to elaborate on this interpretation.

      One limitation of DAmFRET is that it can only detect a nucleation barrier where there is a difference in FRET between the monomer and the assembled form of the protein. However, it can't necessarily detect when there is not a nucleation barrier i.e. if there's no difference in FRET. The text seems to suggest that CARD9 and MALT1 don't have nucleation barriers to their assembly. While this might not be intentional, it would be helpful to explicitly state that CARD9 and MALT1 could also possess such barriers that are not detectable by this method. This wouldn't detract from the finding that BCL10 has a barrier that plays an important function.

      The reviewer is correct that DAmFRET would not be able to detect a nucleation barrier if the assembled phase does not condense the fluorophore to a sufficiently high density for FRET to occur. In our experience, this is only a concern for very large proteins whose bulk “dilutes” the fluorophores within the assembly. Death domains, on the other hand, are only ~ 3 nm in diameter, and FRET occurs within a range of ~10 nm; hence we think it very unlikely that the death domains could be forming cryptic polymers that escape our detection. In any case, when assembly does produce a change in FRET, we can with confidence determine how strongly that form of assembly is governed by concentration. Hence, for CARD9, which does produce a FRET signal upon assembly, we can say that assembly has a smaller intrinsic nucleation barrier than that of BCL10. We further eliminated the possibility of multi-step nucleation (which would reduce the apparent nucleation barrier relative to the one-step ideal case) for CARD9 by showing that artificial condensates of the protein expressed in trans do not influence the concentration-dependence of FRET (Fig. 4 B). Finally, under all conditions where CARD9 lacked FRET, it also lacked signaling activity, suggesting there is not a cryptic functional assembly that evades our assay. Likewise MALT1, which lacked FRET at all concentrations, was entirely unable to activate NF-kB upon overexpression (Fig. S8 A and B), suggesting that it too is not forming a cryptic functional assembly that evades our assay. We therefore feel confident in our conclusion that CARD9 and MALT1 lack nucleation barriers of a magnitude comparable to that of BCL10. Note that our claim is not that they entirely lack a nucleation barrier (CARD9 after all does form a multi-dimensionally ordered polymer), but rather that we fail to observe a nucleation barrier and hence any barrier that may exist is insufficient to manifest at the cellular level.

      In the final results section, the idea that MALT1 activation doesn't depend on BCL10 polymer structure doesn't necessarily follow from the data. An alternative interpretation is that optogenetic clustering of MALT1 causes it to recruit BCL10 and form BCL10-MALT1 filaments (structure solved by Schlauderer et al., 2018). Also, the optogenetic clustering of MALT1 may mimic some structure found in the BCL10 cluster. Therefore, we are neither convinced that the data unambiguously show that MALT1 activation strictly depends on multi-valency rather than an ordered structure of BCL10 polymers nor that this conclusion is truly necessary for the paper.

      We agree that the reviewer’s alternative interpretation of this experiment is possible. However, we consider it unlikely because we performed the experiment with MALT1 lacking its Death Domain (residues 126-824), which mediates its interaction with BCL10 (Schlauderer et al., 2018). Our experiments then suggest that MALT1 clustering is sufficient for activation independent of any structuring mediated by BCL10. Nevertheless, we have now performed an additional control in which we treated these cells with PMA to induce BCL10 polymerization. As expected, the NF-kB transcriptional reporter utterly failed to activate in this condition, indicating that MALT1 does not interact with BCL10 polymers when it lacks its death domain. This aspect has been further elaborated in our response to reviewer 3 point 5.

      What optical density do the yeast cells reach during the 16h induction in galactose? If they are in stationary phase, this could affect the assembly status of the proteins being expressed, as the cytoplasm becomes glassy when cells are starved, and this coincides with widespread protein aggregation/assembly (Joyner et al., 2016; Munder et al., 2016).

      In our DAmFRET strategy, we first dilute an overnight culture and regrow the cells to log phase prior to resuspending them in galactose media. Our strain is engineered to undergo cell cycle arrest upon protein induction, hence exponential growth is prevented and the cells do not deplete galactose during the 16 hr induction. We have also performed many time courses of DAmFRET following induction and generally find no qualitative difference between early and late times (unpublished). Early time points simply have lower expression and correspondingly fewer cells in the high FRET state. Importantly, all comparisons between proteins are made with the same 16 hr induction.

      Although these experiments show that thermodynamically lowering the BCL10 nucleation barrier (e.g. by post-translational modifications or protein expression levels) isn't required for a response, they don't rule it out. It would be good to state this in the discussion, as cells may have multiple mechanisms of switching on the signalosome.

      We thank the reviewer for this suggestion and have now explicitly stated in the discussion that our experiments do not argue against possible thermodynamic tuning of the nucleation barrier.

      The discussion compares signalosomes with condensates formed by liquid-liquid phase separation. This is an interesting comparison but it suggests that disordered assemblies would not be capable of performing signalosome-like functions. This needs to be explained more clearly. For example, non-amyloid prions seem to form gel-like assemblies with a high nucleation barrier that are capable of driving heritable traits, likely through self-templating (Chakravarty et al., 2020). Such examples could represent disordered assemblies with signalosome switch-like behaviour. Furthermore, there are examples of condensates that are induced by environmental changes e.g. Pab1 and Ded1 condensates (Riback et al., 2017; Iserman et al., 2020). This potentially allows the proteins to reach high concentrations and remain un-condensed until a change in heat or pH overcomes a nucleation barrier required for condensate formation. Although the condensates aren't self-templating, they seem to require energy for their disassembly. Combined, this also allows switch-like behaviour, where the switch is flipped back to the uncondensed off state once conditions return to normal. In general, crossing a phase boundary can represent a switch-like response. Finally, recent electron-tomography experiments show that ASC puncta comprise clusters of filaments (Liu et al., 2021, biorxiv). CARD9/BCL10 assemblies may have similar ultrastructures and liquid-liquid phase separation may well play a role in their assembly.

      Indeed, we explicitly maintain that liquid phases cannot themselves perform signalosome-like functions. Chakravarty et al. 2020 did not observe amyloids associated with their phenomena, but the relevant experiments were not designed to exhaustively exclude an underlying ordered phase. To the extent that gelation is involved, their observations are fully consistent with ours. IUPAC defines a “gel” as a colloidal network involving a solid phase and a dispersed phase. The existence of a solid phase necessarily implies an underlying disorder-to-order transition, even if limited to small length scales. In the case of gelation associated with liquid-liquid phase separation, nucleation of the ordered phase simply occurs in two steps (first condensation, then ordering). Note also that a liquid phase could in principle give rise to a heritable phenotype if it activates a positive feedback in a molecular biological process involving the protein of interest (e.g. upregulation of its expression or a change in interacting factors). Chakravarty et al. did not exclude such phenomena (it would be very difficult to do so); hence it cannot be concluded that phase separation is responsible for the sustained phenotypic changes.

      We do not fully follow the reviewer’s logic concerning the relevance of Pab1 and Ded1 condensates. These proteins only condense when their respective phase boundaries fall below the endogenous protein concentration, as upon thermal stress. The proteins are not supersaturated in the absence of such conditions (for example, they cannot be seeded), and it is incorrect to characterize the change in heat or pH as overcoming a pre-existing nucleation barrier. The concept of a nucleation barrier only applies under conditions where a phase is thermodynamically favored. It is also misleading to state that the Ded1 and Pab1 condensates require energy for disassembly. Rather, they require energy to disassemble rapidly. Unless the assemblies have accessed a more ordered phase as described above (two step nucleation), involving a lower phase boundary, they will inevitably dissolve after the conditions return to normal.

      We have much prior experience with ASC. Although it has not been explicitly shown, that it forms ordered polymers and can behave as a prionoid in vivo suggests that it very likely operates the same way as BCL10 (i.e. is physiologically supersaturated). That full-length ASC forms clusters of filaments is not relevant (in our view) to the mechanism shown here, which only requires that filaments are indeed formed. Formally, the size of the relevant nucleus determines the minimum length scale at which ordering must manifest in our mechanism. Based on the structure of death domain filaments, this could be as small as tetramers or hexamers (a minimal but structurally complete “polymer”).

      As stated above, and now elaborated in the discussion, our data do not exclude a role of thermodynamic regulation, as could lead to liquid-liquid phase separation, in tuning the nucleation barrier of Bcl10. What they do exclude is that such changes are required for Bcl10 to activate in the first place.

      Can the authors comment on the loss of BCL10 in Echinodermata, Anthropoda, Nematoda? Is there another protein that plays a similar role? Could a CARD or PCASP protein possess self-templating properties? Could other methods of control be at play e.g. protein expression?

      This is a very interesting question! We think the reviewer’s suggested explanations for the loss of BCL10 in those lineages are valid and worthy of future exploration. Nematodes such as C. elegans have lost multiple components of innate immunity. They have very few pathogen recognition receptors and also lack NF-kB! They do, however, have other adaptor proteins that the literature and our unpublished data suggest may have self-templating ability, such as TIR-1. Drosophila also encodes multiple TIR-containing proteins that are essential for innate immunity. In short, it is possible that other proteins have acquired the hypothetically essential role of supersaturation and nucleation-limited signaling in these organisms.

      Figures 1B/1C: Can the authors comment on why the active cells plateau at about 70-75%? This is a striking feature of the plots, but the explanation may not be obvious to readers.

      See our response to major point 3, above.

      Figures 1D/1E: What was the concentration of B-glucan used in this experiment? This could be included in the figure legend. If greater than 1ug/ml this means that the % of active cells in Figure 1B matches the % of cells with BCL10 assemblies in Figures 1D/1E, which is potentially an important point.

      We thank the reviewer for bringing this point to our attention. We have now indicated in the figure legend the concentration of B-glucan used in this experiment (10 μg/ml). That the percentage of active cells in Fig. 1B matches that of cells containing BCL10 polymers in Fig. 1D and E indeed strengthens the stated relationship between BCL10 assembly and NF-kB activation in THP-1 cells subjected to a relatively physiological stimulus. Additionally, we have performed experiments to measure the levels of p65 translocation in THP-1 cells treated with B-glucan that express BCL10-mEos3.2. This data is shown in Figs. S1D and E in response to reviewer 3.

      Use of both 'BCL10' and 'Bcl10' when referring to the protein.

      We have now replaced all instances where Bcl10 was used to follow guidelines for gene and protein name conventions.

      Bruford EA, Braschi B, Denny P, Jones TEM, Seal RL, Tweedie S. Guidelines for human gene nomenclature. Nat Genet. 2020;52(8):754-758. doi:10.1038/s41588-020-0669-3

      In the supplementary figures there are some formatting problems/missing words in the figure legends. In Figure S11 there is a black box covering the lower part of the figure.

      We have now fixed these instances.

      References used in this review

      Chakravarty, A.K. et al. (2020) "A Non-amyloid Prion Particle that Activates a Heritable Gene Expression Program," Molecular Cell, 77(2), pp. 251-265.e9. doi:10.1016/j.molcel.2019.10.028.

      Iserman, C. et al. (2020) "Condensation of Ded1p Promotes a Translational Switch from Housekeeping to Stress Protein Production," Cell, 181, pp. 818-831.e19. doi:10.1016/j.cell.2020.04.009.

      Joyner, R.P. et al. (2016) "A glucose-starvation response regulates the diffusion of macromolecules," eLife, 5. doi:10.7554/eLife.09376.

      Munder, M.C. et al. (2016) "A pH-driven transition of the cytoplasm from a fluid- to a solid-like state promotes entry into dormancy," eLife, 5(MARCH2016). doi:10.7554/ELIFE.09347.

      Riback, J.A. et al. (2017) "Stress-Triggered Phase Separation Is an Adaptive, Evolutionarily Tuned Response," Cell, 168(6), pp. 1028-1040.e19. doi:10.1016/j.cell.2017.02.027.

      Schlauderer, F. et al. (2018) "Molecular architecture and regulation of BCL10-MALT1 filaments," Nature Communications 2018 9:1, 9(1), pp. 1-12. doi:10.1038/s41467-018-06573-8.

      Reviewer #2 (Significance (Required)):

      The existence of a nucleation barrier had already been postulated, based on structural and other studies (referenced by the authors), it had lacked a rigorous demonstration. This work provides that demonstration, which is important for the signalosome field and more broadly applicable to researchers studying cellular decision making. The study further demonstrates that DaMFRET is an excellent to study protein assembly processes in their native environment, allowing the authors to tackle a question that would have been technically very difficult to address otherwise.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The study by Rodriguez Gama et al. addresses the molecular function of CBM complex-forming proteins CARD9, BCL10 and MALT1 in the activation of myeloid cells, using optogenetic tools, transcriptional reporters and biochemical approaches. It is known from previous studies that Bcl10 oligomerizes into filamentous oligomeric structures incorporating Malt1, and that these structures are nucleated by receptor-induced activation of CARD proteins such as CARD11 (in lymphocytes) or CARD9 (in myeloid cells), but the mechanism underlying the assembly of the resulting CBM complexes remain incompletely understood.

      The authors develop beautiful optogenetic tools to address this question, and convincingly demonstrate that CARD9-mediated nucleation of BCL10 triggers a binary cellular NF-kB response in a spring-load-like fashion, and identify mutants of BCL10 and CARD9 that impact this capacity. Unfortunately, however, the authors do not do a good job to simplify this complex problem so it can be easily understood. In particular, the choices of mutants, models and experiments are not consistent between figures, and some data seem to be arbitrarily added or omitted. Complex hybrid constructs are also used, without assessing whether these are indeed functional in the corresponding ko cells. The paper would therefore benefit from a major overhaul. We also noticed that the literature is often not cited adequately and have included a (non-exhaustive) list of examples of wrong, incomplete, or erroneous citations below.

      1. The initial observations of binary signaling are derived from a reporter system. Although there are controls to show that the reporter used does not function intrinsically cooperatively, it would be nice to see additional data to show that cooperativity occurs also at the level of endogenous response systems, for instance by qPCR-based assessment of a natural NF-kB target gene (induced for example by TNFa versus B-glucan in THP-1 cells, and by TNFa versus PMA in 293T cells).

      As detailed in the introduction, NF-kB has been shown by multiple labs to activate in a binary fashion. Our manuscript shows that NF-kB activation occurs in a binary fashion both at the level of transcription and at the level of nuclear translocation (upstream of any transcriptional output). While we do agree that additional data could further illustrate the biological significance of our findings, we do not feel it is necessary for our conclusions. Note also that because NF-kB activation occurs in a binary fashion per cell, a simple qPCR experiment would not suffice to extend our findings to the broader Nf-kB regulon. Instead, one would have to use e.g. RNA-FISH or single cell RNA-seq, nontrivial experiments that would take months to complete.

      The cell lines in Figures 1D-E (and also some of the BCL10 mutants used later on) would have been better run in the assays in the early parts of Figure 1. The final conclusion prior to the section The adaptor protein BCL10 is a nucleation-mediated switch is otherwise not justified. This is a central tenet of the paper, that is referred to again, with some other ancillary data to support it. These mutants reappear later in the paper, but it would have been better, and easier to make rescue lines of BCL10 KO in Figure 1, otherwise the logic is lost, and the models seem chosen arbitrarily.

      The choice of experiments in different panels of Fig. 1 resulted from a chronological progression of reagent construction as the project evolved. We do appreciate that switching between the assays may lead readers to doubt one or the other. Therefore, we have now immunostained for endogenous p65 in the same experiment as for Fig. 1D and confirmed that p65 translocated to the nucleus only in THP-1 BCL10-KO cells that have been reconstituted with WT BCL10-mEos3.2, but not E53R. We think this additional evidence along with our orthogonal measurements in other reporter systems confirms our findings that BCL10 nucleation determines NF-kB activity.

      Expression with microNS is not well controlled and gives little real evidence for what is occurring. It is unclear what the concentration of the protein expressed was, but certainly the relative expression of the CARD9(CARD) and the microNS version should be assessed.

      We believe these concerns result from a misunderstanding. We assume the reviewer is referring to the experiment in Fig. 3B. Expression of muNS on its own has no effect on the DAmFRET of other proteins, and we have previously used it in exactly the same way as here (Holliday M et al. 2019 and Kandola T et al. 2021). Please note that muNS fusion proteins in our experiment have an orthogonal fluorescent protein whose spectra do not significantly overlap with those of mEos3.1. The experiment evaluates a protein’s ability, when condensed via its fusion to muNS, to nucleate an mEos3.1-fused protein that is expressed in trans. Fusion of proteins to muNS does not affect their expression levels, as we now show for CARD9CARD-muNS-mCardinal versus CARD9CARD-mCardinal (Fig. S6D).

      Also, the AmFRET profile of CARD9CARD looks very weird, it cannot be compared to BCL10.

      We are unsure in what way the AmFRET profile of CARD9CARD is “weird”. It is fully consistent with expectations and has been thoroughly explained in the text. We suspect the reviewer was bothered by the sharp acquisition of FRET at approximately 100 uM. As explained in the text, this represents the phase boundary, also known as the solubility line, for CARD9CARD polymers, which we previously showed in vitro (Holliday M et al. 2019). Above this concentration, the protein self-assembles without a nucleation barrier, hence the sharp but continuous change in FRET. BCL10 plots, in contrast, show a discontinuous acquisition of FRET, which indicates a nucleation barrier. In order to highlight that the CARD9CARD transition is understood and expected, we have also now added a line to the plot to demarcate the phase boundary.

      We are not convinced of the usefulness of the introduction of a slew of disease-causing CARD9 mutations that may or may not be relevant to the authors' point. The fact that they do or do not function in a specific sub portion of an assay that may or may not be relevant to biological activity seems to be of interest but without biochemical understanding, little is clear.

      While several reports have shown the clinical importance of these CARD9 mutations on susceptibility to fungal infections, little was known about the molecular mechanism underlying their effects. The inclusion of the disease-causing mutants to this paper is justified for the following reasons. First, they demonstrate the relevance of our work to disease. Second, they build off our findings to provide an otherwise unknown molecular mechanism of these mutants. We showed using independent methods that CARD9CARD mutations disrupt the ability to nucleate BCL10, via two different mechanisms. Finally, validating the disease-causing mutations allowed us to use them as controls for subsequent experiments demonstrating that BCL10 is supersaturated.

      The Optogenetic experiments are interesting, but difficult to interpret without evidence that these MALT1 constructs are indeed still functional when expressed in MALT1-deficient THP-1 cells. We do not therefore think that this experiment shows a necessity for clustering to signal, just a sufficiency, and in a highly artificial construct.

      We welcome the opportunity to elaborate on the optogenetic experiments. Since BCL10 and MALT1 are expressed ubiquitously across cell types, the validity of our findings should not depend on the cell type used. Indeed, much of what we already know about innate immunity signalosomes comes from work in HEK293T cells. Our optogenetic experiments using MALT1 were performed in 293T MALT1-KO cells in Figures 6E and F, and employed two distinct functional assays (p65 nuclear translocation and a transcriptional reporter). While our approach employs light to control clustering, similar approaches using (no less-artificial) chemically induced dimerization domains have been used to study caspase activation (Oberst A et al, 2010, Boucher D et al, 2018). Our use of light affords higher specificity, reversibility, and spatial and temporal control over MALT1 assembly than does chemically induced dimerization.

      To demonstrate the necessity of clustering, we have now performed an experiment with MALT1(126-824)-miRFP670-Cry2 expressed in 293T MALT1 KO cells that contain a transcriptional reporter of NF-kB ,as in figures 6E and F. We added PMA to the cells and found that it failed to activate NF-kB (Fig. 6), confirming that the interaction of MALT1 (via its death domain) with polymerized BCL10 is required for activation. Note that MALT1 and BCL10 exist as a soluble heterodimer prior to BCL10 polymerization; hence it is polymerization, rather than the interaction itself, that activates MALT1. That artificial clustering rescues this defect strongly suggests that the effect of polymerization can be attributed to increased proximity rather than some allosteric effect communicated from BCL10 polymers through the MALT1 DD to its caspase-like domain.

      Oberst, A., Pop, C., Tremblay, A.G., Blais, V., Denault, J.-B., Salvesen, G.S., and Green, D.R. (2010). Inducible dimerization and inducible cleavage reveal a requirement for both processes in caspase-8 activation. J. Biol. Chem. 285, 16632–16642.

      Boucher, D., Monteleone, M., Coll, R.C., Chen, K.W., Ross, C.M., Teo, J.L., Gomez, G.A., Holley, C.L., Bierschenk, D., Stacey, K.J., et al. (2018). Caspase-1 self-cleavage is an intrinsic mechanism to terminate inflammasome activity. J. Exp. Med. 215, 827–840.

      In the introduction and other parts of the paper, there are numerous instances where the previous literature in the field is not adequately cited. Examples include:

      • In the introduction, it is weird to cite one original paper (a MALT1 ko study by Ruland et al., 2001; there are several other studies of ko papers for CBM components that would merit being citated along with this study) together with two reviews on that topic (Ruland and Hartjes 2019 and Gehring et al. 2018)

      • In the introduction, the original study by Wang et al., 2002 should be cited together with Rebeaud et al., 2002; the two studies on the same topic were published back-to-back

      • In the introduction, the statement "CARD10 and CARD14 are expressed in nonhematopoietic cells including intestinal and skin epithelia, respectively" should be supported by citations.

      • Still in the introduction, the 2 references for the statement "... CARD14 gain of function mutations cause psoriasis (Howes et al., 2016; Jordan et al., 2012)" are not appropriate. There are several reports of patients with CARD14 mutations (the study by Jordan et al is only one of them) and several CARD14 mouse models that provoke a psoriasis-like phenotype, which would merit being cited.

      • In the following sentence: "Point mutations and translocations involving BCL10 and MALT1 cause immunodeficiencies (Ruland and Hartjes, 2019), testicular cancer (Kuper-Hommel et al., 2013), and lymphomas (Zhang et al., 1999).", the citation style also seems completely random, combining the citation of a single original paper for lymphomas (Zhang et al. 1999) (there are several other important original studies on that topic or recent reviews that could be cited instead), together with a review on immunodeficiencies (Ruland and Hartjes, 2019) and then another single example for a role of BCL10 and MALT1 in carcinoma (the study by Kuper-Hommel et al. is one, but several other original publications exist on the latter topic, showing for example a role in breast carcinoma or glioblastoma).

      • In the first section of the results, the reference cited for endogenous CARD10 expression in 293T cells (Ruland et al., 2001) is wrong, no endogenous CARD10 expression was assessed in that study

      We have now revised the citations mentioned above and other instances to ensure adequate citations in each case.

      Reviewer #3 (Significance (Required)):

      The paper deals with a complex question, namely how the CBM signalosome assembles and functions to stimulate NF-kB signaling. This question is important to the understanding of pro-inflammatory immune responses and basic life sciences in general. As the focal point of the paper is complex, and tools to study such phenomena are at the limit of technical capabilities, this further increases the potential impact of the work.

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      The characterization of open-ended signalosomes in a number of innate-immunity and cell-death pathways, in particular formed by domains from the death-fold family, has led to the suggestions that these complexes allow a switch-like signalling response suitable for these pathways. It appears that this has been widely accepted. However, these suggestions are based largely on indirect observations and speculation.

      Rodriguez-Gama and coworkers have decided to test these suggestions more directly. Their results confirm the suggestions. Based on my own experience, papers that validate widely adopted suggestions are often not considered seriously by top journals, who are looking for hot topics/paradigm-changing/surprising type results. I would urge the editors to consider seriously work such as in this paper, which directly tests important suggestions and does so at a technically high standard. The authors use a range of ingenious approaches, both with recombinant proteins and in cells, and including proteins from organisms from different parts of the evolutionary tree, to support their interpretations, so it is an extensive and high-quality study. I am impressed that so many different fusion proteins with fluorescent tags continued to function as expected, but I guess the authors controlled for this as much as they could.

      Having said all this, I do get the feeling the authors are "over-selling" the nucleation barrier aspect of these signalling mechanisms. It is clearly an important and critical aspect of signalling in many systems, but then it is not the only important aspect; a number of other regulatory inputs play a role in different systems. So the statement "Our findings introduce a novel structure-function paradigm" in my view is overstretching things somewhat. Further in the Discussion section, the authors state "Existing explanations for the preponderance of ordered polymers in immune cell signalosomes have centered on the functions of multivalency at steady state, such as scaffolding and sensitivity enhancement resulting from the cooperativity of homo-oligomerization". They cite a small (and non-exhaustive) number of papers discussing this topic; all these include "seeding" or "nucleation" as an important part of the proposed mechanism. So I suggest the authors provide a more balanced discussion of this aspect. Different pathways appear to display a different level of switch-like behaviour, and one thing that the current version of the manuscript is missing is more discussion of other death fold-based systems and how the results on the CBM signalosome apply to these, and also other systems such as TIR domain-based ones, which currently get no mention whatsoever. In the CBM system, there seems to be one main nucleation barrier; can there be more than one in others?

      We appreciate the reviewer’s perspective and have now acknowledged in the introduction and discussion additional prior literature that has paved the way for our study. Nevertheless, we maintain -- as now stated in the abstract -- that “our results defy the usual protein structure/function paradigm, and demonstrate that protein structure can evolve via selection for energetic maxima in addition to minima”. We have elaborated in the introduction and discussion how immune signaling provides the functional context in which such a paradigm can evolve, and how our findings uniquely support the paradigm.

      One other aspect I need to express some criticism about is attention to detail - especially with a paper focusing on the physics behind biological processes, I would expect a higher standard of getting the terminology and units correct - see specific examples below. This can obviously be fixed easily.

      Specific points are listed below. No page or line numbers are provided so I have done my best to make it clear what the comments refer to.

      1. Abstract line 6 and throughout: in "NF-kB", the "k" is supposed to be "kappa" (Greek letter) - it stands for "nuclear factor kappa-light-chain-enhancer of activated B cells", not fully defined in the manuscript as far as I can see. Occasionally, small k is also used instead of the small cap K or whatever the authors used most of the time, but I don't think any of them use the Greek letter.

      We had indeed used a version of the small “kappa” κ. We have now fixed the cases where we mistakenly used k instead of κ.

      Page 2 (Introduction) paragraph 2 line 9: period missing at the end of sentence. Same Page 4 (Results: Assembly) paragraph 4 line 3.

      This is now fixed.

      Page 2 (Introduction) paragraph 2 line 15 and throughout: in long sentences, more commas can help help readability, for example before "leading" here. Similar page 15 paragraph 2 line 3 after "Additionally", paragraph 4 line 2 before "which".

      We have now included more commas and tried to improve readability throughout.

      Page 4 (Results: Assembly) paragraph 2 line 2: is "positive feedback" different from "cooperativity"? Is it a broader term that includes cooperativity, nucleation and other mechanisms? It may be useful to introduce some of these terms to avoid confusion by the readers.

      “Positive feedback” is the broadest term as it is agnostic to mechanism. “Nucleation” refers to the initiation of a first order phase transition, which is one mechanism of positive feedback. Nucleation involves “cooperativity”, in that a higher order species is more stable than smaller species. However, cooperativity can occur for oligomers of finite size, whereas nucleation is reserved for phase transitions to species of infinite size. We appreciate that the use of so many related terms may have created more confusion than necessary. Hence, we have now revised the text to omit the more general terms -- “positive feedback” and “cooperativity” where possible.

      Page 4 (Results: Assembly) paragraph 2 line 3: please define "TNF".

      We have now fixed this and other acronyms.

      Page 4 (Results: Assembly) paragraph 3 line 2: the use of size-exclusion chromatography to follow the size of complexes would assume that they are irreversible or very stable. It appears this may be the case here, but some discussion may be warranted.

      We have now explained that SEC is appropriate for this experiment because large nucleation barriers generally imply stable assemblies.

      Page 4 (Results: Assembly) paragraph 3 line 4 and throughout: the symbol for "kilodalton" is "kDa".

      We have now fixed this mistake.

      Page 4 (Results: Assembly) paragraph 3: I am not sure how the results discussed in this paragraph demonstrate that assembly occurs in cooperative fashion - just that there is a change in oligomeric states upon stimulation.

      Cooperativity is implied by the absence of oligomer sizes between monomer and the large assembly. Nevertheless, we realized this can only be concluded in the case of homotypic assembly, which we cannot yet assume at this point in the paper. Therefore, we have revised this paragraph to say that the distribution is “consistent with” an underlying phase transition (which we then go on to prove).

      Page 4 (Results: Assembly) paragraph 4 line 2: "WT" is not defined. Wild-type what? I presume "protein"?

      We refer here to the wild-type protein. We have now fixed this mistake.

      Page 4 (Results: Assembly) paragraph 4: it may be worth pointing out here the wild-type and mutant proteins expressed at similar levels; clearly the outcomes will depend on protein concentration in the cell. I believe the supplementary figure shows this to a large extent.

      Indeed, our supplementary figure shows that the WT and mutant protein express to comparable levels. We have now pointed this out in the text.

      Page 4 (Results: The adaptor) paragraph 1 line 4: "CARD domain" would stand for "caspase activation and recruitment domain domain". Please check throughout (including Supplementary Material).

      We have fixed this mistake.

      Page 4 (Results: The adaptor) paragraph 1 line 9: "expressed over a range of concentrations in cells" - this would imply the authors controlled expression - please rephrase to explain what exactly was done.

      We have now rephrased this sentence to indicate that the range of expression results from the use of a genetic construct with cell-to-cell variation in copy number.

      Page 5 (Results: The adaptor) paragraph 2 line 3 and throughout (including Supplementary Material): please use the Greek letter rather that "u" for micro.

      We have now fixed this mistake.

      Page 5 (Results: The adaptor) paragraph 3: this analysis is rather simplistic, it is not just the RMSD value, it is the nature of conformational change that is important? Please elaborate, I would think the papers presenting structural work have already discussed this to some extent?

      The reviewer is correct; it is the nature of the conformational change that is most important. We are unsure how to accurately estimate the energy barrier separating the two conformations for each protein. However, we have now undertaken a collaboration to attempt to do so via FAST molecular simulations (Zimmerman and Bowman 2015). In lieu of the results of these ongoing studies, we have modified the text to acknowledge that RMSD does not necessarily relate to nucleation barriers.

      Maxwell I. Zimmerman and Gregory R. Bowman. Journal of Chemical Theory and Computation, 2015, 11 (12), 5747-5757 DOI: 10.1021/acs.jctc.5b00737

      Page 5 (Results: The adaptor) paragraph 4 line 5 and further in this section: some symbol(s) do not show in the pdf - before "(delta)", next page line 3-5 after "higher" and "both".

      We have fixed this issue that resulted from exporting to a PDF file from our text editor.

      Page 6 (Results: The adaptor) paragraph 4: interface IIa and IIIb are not introduced, and there is not even any reference provided here.

      We have now added a reference for these mutations and elaborated on the interfaces IIa and IIIb.

      Page 6 (Results: Pathogenic) paragraph 1 line 12: "FL" is not introduced.

      We have now fixed this mistake.

      Page 8 (Results: Pathogenic) paragraph 7: the text "absent the pathogenic mutations" is missing something.

      We have now reworded this section.

      Page 10 (Results: BCL10) paragraph 3: why does CARD9 CARD clustering peak and then disassemble (I guess "clustering" doesn't disassemble, please rewrite as well).

      We have now fixed this mistake.

      Page 11 (Results: MALT1) paragraph 1: I presume dimerization doesn't achieve the same level of proximity as higher-order multimerization?

      Our interpretation here is that for MALT1, activation requires close proximity of more than two molecules. Although our dimerization module did not activate the caspase-like domain of MALT1, we know that it achieves close enough proximity to activate the caspase domain of CASP8. Hence we believe the MALT1 mechanism has a stoichiometry requirement in addition to a proximity requirement. This is, of course, consistent with the fact that activation normally occurs in the context of polymers rather than dimers.

      Page 11 (Results: Ancient) paragraph 1 line 4: is this AlphaFold2?

      That is correct, we used AlphaFold2. We have added that detail.

      Page 12 (Discussion) paragraph 4: not sure if "molecular examples of evolutionary spandrels" will be clear to most readers.

      We have now explained what evolutionary spandrels are, and elaborated on the relationship to our findings.

      Page 14 (Materials: Plasmid) line 2 and throughout: "Golden Gate" is usually capitalized. Similar for "Gibson" further in the paragraph. The English in this paragraph is not up to standard in general; for example "Then placing..." is not a complete sentence, and a number of sentences ending with "via gibson" need to be rewritten.

      We have now rewritten this paragraph.

      Page 16 (Materials: Cell) line 4 and throughout: "2" in "CO2" should be subscripted.

      This is now fixed.

      Page 16 (Materials: Transient) line 6 and throughout (including Supplementary Material): please use a space between number and unit ("35 mm").

      This is now fixed.

      Page 16 (Materials: Generation) line 4 and throughout: to distinguish from "gram", please italicize "g" and/or use "x g".

      We have now fixed this.

      Page 17 (Materials: Yeast) line 3: please specify which table is "table X".

      We have now fixed this mistake.

      Page 17 (Materials: Mammalian) line 1: please provide full reference. Same next paragraph line 2.

      We have now fixed this.

      Page 17 (Materials: DAmFRET) line 3: "SSC" and "FSC" are not defined.

      We have now fixed this.

      Page 18 (Materials: Fluorescence) line 10: "Coefficient" does not have to be capitalized. It does not have to be defined again in the next paragraph.

      We have now fixed this.

      Page 19 (Materials: Optogenetic) line 1: "performed" rather than "made"?

      We have now fixed this.

      Page 19 (Materials: Protein) line 12: the Compass software doesn't have a reference?

      We have now added the reference to the software.

      References: please make format consistent: articles titles in sentence or title case.

      We have now formatted all references to be consistent.

      Legend to Fig. 1: I suggest "Schematic diagram"; and "h" rather than "hrs"; please check throughout (including Supplementary Material).

      We agree with this suggestion.

      Legend to Fig. S1: is "TNF-a" supposed to be "TNF-alpha"?

      We have fixed this.

      Legend to Fig. S7: please capitalize "Figure 2H".

      We have fixed this.

      Legend to Fig. S10F: please move "Dox" behind the concentration.

      We have fixed this.

      Fig. S14B: the colours in the superposition make it difficult to see the differences.

      We have used a different color now.

      Legend to Fig. S14: I suggest "structure...predicted by AlphaFold" (2?) and include the reference.

      We agree with this suggestion.

      Reviewer #4 (Significance (Required)):

      As argued above, the significance of this paper is that it tests directly important hypotheses proposed or assumed previously, and does so at a technically high standard. No published report has done so to a similar extent.

      The paper should be of interest to a broad audience from cell biologists and immunologists to biochemists, biophysicists and structural biologists.

      My expertise is in structural biology or systems similar to the one studied here.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The characterization of open-ended signalosomes in a number of innate-immunity and cell-death pathways, in particular formed by domains from the death-fold family, has led to the suggestions that these complexes allow a switch-like signalling response suitable for these pathways. It appears that this has been widely accepted. However, these suggestions are based largely on indirect observations and speculation.

      Rodriguez-Gama and coworkers have decided to test these suggestions more directly. Their results confirm the suggestions. Based on my own experience, papers that validate widely adopted suggestions are often not considered seriously by top journals, who are looking for hot topics/paradigm-changing/surprising type results. I would urge the editors to consider seriously work such as in this paper, which directly tests important suggestions and does so at a technically high standard. The authors use a range of ingenious approaches, both with recombinant proteins and in cells, and including proteins from organisms from different parts of the evolutionary tree, to support their interpretations, so it is an extensive and high-quality study. I am impressed that so many different fusion proteins with fluorescent tags continued to function as expected, but I guess the authors controlled for this as much as they could.

      Having said all this, I do get the feeling the authors are "over-selling" the nucleation barrier aspect of these signalling mechanisms. It is clearly an important and critical aspect of signalling in many systems, but then it is not the only important aspect; a number of other regulatory inputs play a role in different systems. So the statement "Our findings introduce a novel structure-function paradigm" in my view is overstretching things somewhat. Further in the Discussion section, the authors state "Existing explanations for the preponderance of ordered polymers in immune cell signalosomes have centered on the functions of multivalency at steady state, such as scaffolding and sensitivity enhancement resulting from the cooperativity of homo-oligomerization". They cite a small (and non-exhaustive) number of papers discussing this topic; all these include "seeding" or "nucleation" as an important part of the proposed mechanism. So I suggest the authors provide a more balanced discussion of this aspect. Different pathways appear to display a different level of switch-like behaviour, and one thing that the current version of the manuscript is missing is more discussion of other death fold-based systems and how the results on the CBM signalosome apply to these, and also other systems such as TIR domain-based ones, which currently get no mention whatsoever. In the CBM system, there seems to be one main nucleation barrier; can there be more than one in others?

      One other aspect I need to express some criticism about is attention to detail - especially with a paper focusing on the physics behind biological processes, I would expect a higher standard of getting the terminology and units correct - see specific examples below. This can obviously be fixed easily.

      Specific points are listed below. No page or line numbers are provided so I have done my best to make it clear what the comments refer to.

      1. Abstract line 6 and throughout: in "NF-kB", the "k" is supposed to be "kappa" (Greek letter) - it stands for "nuclear factor kappa-light-chain-enhancer of activated B cells", not fully defined in the manuscript as far as I can see. Occasionally, small k is also used instead of the small cap K or whatever the authors used most of the time, but I don't think any of them use the Greek letter.
      2. Page 2 (Introduction) paragraph 2 line 9: period missing at the end of sentence. Same Page 4 (Results: Assembly) paragraph 4 line 3.
      3. Page 2 (Introduction) paragraph 2 line 15 and throughout: in long sentences, more commas can help help readability, for example before "leading" here. Similar page 15 paragraph 2 line 3 after "Additionally", paragraph 4 line 2 before "which".
      4. Page 4 (Results: Assembly) paragraph 2 line 2: is "positive feedback" different from "cooperativity"? Is it a broader term that includes cooperativity, nucleation and other mechanisms? It may be useful to introduce some of these terms to avoid confusion by the readers.
      5. Page 4 (Results: Assembly) paragraph 2 line 3: please define "TNF".
      6. Page 4 (Results: Assembly) paragraph 3 line 2: the use of size-exclusion chromatography to follow the size of complexes would assume that they are irreversible or very stable. It appears this may be the case here, but some discussion may be warranted.
      7. Page 4 (Results: Assembly) paragraph 3 line 4 and throughout: the symbol for "kilodalton" is "kDa".
      8. Page 4 (Results: Assembly) paragraph 3: I am not sure how the results discussed in this paragraph demonstrate that assembly occurs in cooperative fashion - just that there is a change in oligomeric states upon stimulation.
      9. Page 4 (Results: Assembly) paragraph 4 line 2: "WT" is not defined. Wild-type what? I presume "protein"?
      10. Page 4 (Results: Assembly) paragraph 4: it may be worth pointing out here the wild-type and mutant proteins expressed at similar levels; clearly the outcomes will depend on protein concentration in the cell. I believe the supplementary figure shows this to a large extent.
      11. Page 4 (Results: The adaptor) paragraph 1 line 4: "CARD domain" would stand for "caspase activation and recruitment domain domain". Please check throughout (including Supplementary Material).
      12. Page 4 (Results: The adaptor) paragraph 1 line 9: "expressed over a range of concentrations in cells" - this would imply the authors controlled expression - please rephrase to explain what exactly was done.
      13. Page 5 (Results: The adaptor) paragraph 2 line 3 and throughout (including Supplementary Material): please use the Greek letter rather that "u" for micro.
      14. Page 5 (Results: The adaptor) paragraph 3: this analysis is rather simplistic, it is not just the RMSD value, it is the nature of conformational change that is important? Please elaborate, I would think the papers presenting structural work have already discussed this to some extent?
      15. Page 5 (Results: The adaptor) paragraph 4 line 5 and further in this section: some symbol(s) do not show in the pdf - before "(delta)", next page line 3-5 after "higher" and "both".
      16. Page 6 (Results: The adaptor) paragraph 4: interface IIa and IIIb are not introduced, and there is not even any reference provided here.
      17. Page 6 (Results: Pathogenic) paragraph 1 line 12: "FL" is not introduced.
      18. Page 8 (Results: Pathogenic) paragraph 7: the text "absent the pathogenic mutations" is missing something.
      19. Page 10 (Results: BCL10) paragraph 3: why does CARD9 CARD clustering peak and then disassemble (I guess "clustering" doesn't disassemble, please rewrite as well).
      20. Page 11 (Results: MALT1) paragraph 1: I presume dimerization doesn't achieve the same level of proximity as higher-order multimerization?
      21. Page 11 (Results: Ancient) paragraph 1 line 4: is this AlphaFold2?
      22. Page 12 (Discussion) paragraph 4: not sure if "molecular examples of evolutionary spandrels" will be clear to most readers.
      23. Page 14 (Materials: Plasmid) line 2 and throughout: "Golden Gate" is usually capitalized. Similar for "Gibson" further in the paragraph. The English in this paragraph is not up to standard in general; for example "Then placing..." is not a complete sentence, and a number of sentences ending with "via gibson" need to be rewritten.
      24. Page 16 (Materials: Cell) line 4 and throughout: "2" in "CO2" should be subscripted.
      25. Page 16 (Materials: Transient) line 6 and throughout (including Supplementary Material): please use a space between number and unit ("35 mm").
      26. Page 16 (Materials: Generation) line 4 and throughout: to distinguish from "gram", please italicize "g" and/or use "x g".
      27. Page 17 (Materials: Yeast) line 3: please specify which table is "table X".
      28. Page 17 (Materials: Mammalian) line 1: please provide full reference. Same next paragraph line 2.
      29. Page 17 (Materials: DAmFRET) line 3: "SSC" and "FSC" are not defined.
      30. Page 18 (Materials: Fluorescence) line 10: "Coefficient" does not have to be capitalized. It does not have to be defined again in the next paragraph.
      31. Page 19 (Materials: Optogenetic) line 1: "performed" rather than "made"?
      32. Page 19 (Materials: Protein) line 12: the Compass software doesn't have a reference?
      33. References: please make format consistent: articles titles in sentence or title case.
      34. Legend to Fig. 1: I suggest "Schematic diagram"; and "h" rather than "hrs"; please check throughout (including Supplementary Material).
      35. Legend to Fig. S1: is "TNF-a" supposed to be "TNF-alpha"?
      36. Legend to Fig. S7: please capitalize "Figure 2H".
      37. Legend to Fig. S10F: please move "Dox" behind the concentration.
      38. Fig. S14B: the colours in the superposition make it difficult to see the differences.
      39. Legend to Fig. S14: I suggest "structure...predicted by AlphaFold" (2?) and include the reference.

      Significance

      As argued above, the significance of this paper is that it tests directly important hypotheses proposed or assumed previously, and does so at a technically high standard. No published report has done so to a similar extent.

      The paper should be of interest to a broad audience from cell biologists and immunologists to biochemists, biophysicists and structural biologists.

      My expertise is in structural biology or systems similar to the one studied here.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The study by Rodriguez Gama et al. addresses the molecular function of CBM complex-forming proteins CARD9, BCL10 and MALT1 in the activation of myeloid cells, using optogenetic tools, transcriptional reporters and biochemical approaches. It is known from previous studies that Bcl10 oligomerizes into filamentous oligomeric structures incorporating Malt1, and that these structures are nucleated by receptor-induced activation of CARD proteins such as CARD11 (in lymphocytes) or CARD9 (in myeloid cells), but the mechanism underlying the assembly of the resulting CBM complexes remain incompletely understood.

      The authors develop beautiful optogenetic tools to address this question, and convincingly demonstrate that CARD9-mediated nucleation of BCL10 triggers a binary cellular NF-kB response in a spring-load-like fashion, and identify mutants of BCL10 and CARD9 that impact this capacity. Unfortunately, however, the authors do not do a good job to simplify this complex problem so it can be easily understood. In particular, the choices of mutants, models and experiments are not consistent between figures, and some data seem to be arbitrarily added or omitted. Complex hybrid constructs are also used, without assessing whether these are indeed functional in the corresponding ko cells. The paper would therefore benefit from a major overhaul. We also noticed that the literature is often not cited adequately and have included a (non-exhaustive) list of examples of wrong, incomplete, or erroneous citations below.

      1) The initial observations of binary signaling are derived from a reporter system. Although there are controls to show that the reporter used does not function intrinsically cooperatively, it would be nice to see additional data to show that cooperativity occurs also at the level of endogenous response systems, for instance by qPCR-based assessment of a natural NF-kB target gene (induced for example by TNFa versus B-glucan in THP-1 cells, and by TNFa versus PMA in 293T cells).

      2) The cell lines in Figures 1D-E (and also some of the BCL10 mutants used later on) would have been better run in the assays in the early parts of Figure 1. The final conclusion prior to the section The adaptor protein BCL10 is a nucleation-mediated switch is otherwise not justified. This is a central tenet of the paper, that is referred to again, with some other ancillary data to support it. These mutants reappear later in the paper, but it would have been better, and easier to make rescue lines of BCL10 KO in Figure 1, otherwise the logic is lost, and the models seem chosen arbitrarily.

      3) Expression with microNS is not well controlled and gives little real evidence for what is occurring. It is unclear what the concentration of the protein expressed was, but certainly the relative expression of the CARD9(CARD) and the microNS version should be assessed. Also, the AmFRET profile of CARD9CARD looks very weird, it cannot be compared to BCL10.

      4) We are not convinced of the usefulness of the introduction of a slew of disease-causing CARD9 mutations that may or may not be relevant to the authors' point. The fact that they do or do not function in a specific sub portion of an assay that may or may not be relevant to biological activity seems to be of interest but without biochemical understanding, little is clear.

      5) The Optogenetic experiments are interesting, but difficult to interpret without evidence that these MALT1 constructs are indeed still functional when expressed in MALT1-deficient THP-1 cells. We do not therefore think that this experiment shows a necessity for clustering to signal, just a sufficiency, and in a highly artificial construct.

      6) In the introduction and other parts of the paper, there are numerous instances where the previous literature in the field is not adequately cited. Examples include:

      • In the introduction, it is weird to cite one original paper (a MALT1 ko study by Ruland et al., 2001; there are several other studies of ko papers for CBM components that would merit being citated along with this study) together with two reviews on that topic (Ruland and Hartjes 2019 and Gehring et al. 2018)
      • In the introduction, the original study by Wang et al., 2002 should be cited together with Rebeaud et al., 2002; the two studies on the same topic were published back-to-back
      • In the introduction, the statement "CARD10 and CARD14 are expressed in nonhematopoietic cells including intestinal and skin epithelia, respectively" should be supported by citations.
      • Still in the introduction, the 2 references for the statement "... CARD14 gain of function mutations cause psoriasis (Howes et al., 2016; Jordan et al., 2012)" are not appropriate. There are several reports of patients with CARD14 mutations (the study by Jordan et al is only one of them) and several CARD14 mouse models that provoke a psoriasis-like phenotype, which would merit being cited.
      • In the following sentence: "Point mutations and translocations involving BCL10 and MALT1 cause immunodeficiencies (Ruland and Hartjes, 2019), testicular cancer (Kuper-Hommel et al., 2013), and lymphomas (Zhang et al., 1999).", the citation style also seems completely random, combining the citation of a single original paper for lymphomas (Zhang et al. 1999) (there are several other important original studies on that topic or recent reviews that could be cited instead), together with a review on immunodeficiencies (Ruland and Hartjes, 2019) and then another single example for a role of BCL10 and MALT1 in carcinoma (the study by Kuper-Hommel et al. is one, but several other original publications exist on the latter topic, showing for example a role in breast carcinoma or glioblastoma).
      • In the first section of the results, the reference cited for endogenous CARD10 expression in 293T cells (Ruland et al., 2001) is wrong, no endogenous CARD10 expression was assessed in that study

      Significance

      The paper deals with a complex question, namely how the CBM signalosome assembles and functions to stimulate NF-kB signaling. This question is important to the understanding of pro-inflammatory immune responses and basic life sciences in general. As the focal point of the paper is complex, and tools to study such phenomena are at the limit of technical capabilities, this further increases the potential impact of the work.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In their manuscript entitled "A nucleation barrier springloads..." Rodriguez-Gama et al. dissect the assembly mechanism of the signalosome, composed of the proteins CARD9, BCL10 and MALT1, using a novel in-cell biophysical approach (DAmFRET). They first overexpressed fluorescently tagged versions of the proteins to promote their assembly in yeast and mammalian cells, finding that CARD9 forms higher order assemblies across a wide range of concentrations with no discontinuity in the DAmFRET profile. In contrast, the DAmFRET profile of BCL10 showed a clear separation between monomers and higher order assemblies, which started to form spontaneously only at higher BCL10 concentrations. Furthermore, at the two states of the proteins co-exist at all concentrations. These observations imply that there is a nucleation barrier to forming BCL10 assemblies. MALT1 showed no change in FRET regardless of its expression level. These observations, alongside fluorescence microscopy of the assemblies, and previous structural studies, suggest that BCL10 forms self-templating polymers that act as a switch for an all-or-nothing immune response, assayed in this case by monitoring the nuclear translocation of the NF-kB subunit p65. The authors also assessed the effects of known disease-causing mutations on the nucleation barrier, showing that changes in the strength of the nucleation barrier can have major effects on signalosome function. Finally, they used optogenetic methods to trigger assembly of individual signalosome components, providing insight into the minimal components/conditions required for signalosomes to work.

      Major comments:

      Overall, the experiments by Rodriguez-Gama et al. offer convincing evidence that there is a nucleation barrier to BCL10 polymerisation, and that a CARD9 template is sufficient to overcome the barrier. Although the existence of a nucleation barrier had already been postulated, based on structural and other studies (referenced by the authors), it had lacked a rigorous demonstration. This work provides that demonstration, which is important for the signalosome field and more broadly applicable to researchers studying cellular decision making. The study further demonstrates that DaMFRET is an excellent to study protein assembly processes in their native environment, allowing the authors to tackle a question that would have been technically very difficult to address otherwise. The optogenetic experiments are a nice sufficiency test for their ideas.

      We feel there are a few key points to address before publication.

      1) One of the main conclusions is that spring-loading the nucleation barrier with high super-saturating BCL10 concentrations allows a decisive response. Although much of the data strongly imply this conclusion, the dependence of the immune response on BCL10 concentration was not tested directly. A key prediction of the nucleation barrier is that at concentrations below saturation, BCL10 should not be able to induce an all-or-nothing response when stimulated. At saturated/super-saturated concentrations BCL10 should be able to induce a response. At deeply super-saturated concentrations the response should start to be activated spontaneously in the absence of an external stimulus. These predictions could be tested using the doxycycline-inducible BCL10 system (Figure S2D), without establishing major new experimental avenues. We feel that such an experiment would strengthen the main conclusion. It might also help to shed light on whether being highly supersaturated enables a more decisive response than being just saturated.

      2) Intuitively, readers might expect that if BCL10 is supersaturated then, once nucleated, it would rapidly assemble at the nucleation sites. In Figure 5B, CARD9CARD-miRFP670nano-Cry2 assemblies are optically induced throughout the cell. However, BCL10 appears to nucleate at just a few sites with a few minutes delay. More widespread nucleation and growth of BCL10 polymers seems to take longer (20-40 minutes, Figures 5B and 5C), after CARD9CARD-miRFP670nano-Cry2 has disassembled. Furthermore, in Figures 4D and 4E, very few BCL10 assemblies are visible/quantifiable after 70 minutes PMA exposure, but p65 has clearly entered the nucleus. It looks like BCL10 assembly slightly lags behind p65 nuclear entry. Can the authors provide a more detailed explanation of these kinetics?

      3) Related to point 2 above, in Figure 5D, the leftmost cell in the field of view clearly contains CARD9CARD assemblies but there are no BCL10 assemblies and p65 is not imported into the nucleus (in contrast to the central cell in the field of view). How often does CARD9CARD optogenetic assembly lead to BCL10 assembly? In other words, can the authors quantify the cell-to-cell variability in this experiment?

      Minor comments:

      While the work is scientifically well done, the text reads as though it is meant for experts rather than a broad audience. This is a pity because it risks alienating readers. We suggest that some adjustments to the text (mainly additional explanations and not ruling out alternative interpretations of the data) would widen the audience and increase the impact of this important study. Below are some suggestions that might help.

      1) In the first results section, the authors write: 'This suggests that Bcl10 but not CARD9 assembly occurs in a highly cooperative fashion that could, in principle (Koch, 2020), underlie the feed forward mechanism.' It isn't obvious how Figure 1 leads to this statement. Could the authors give a more detailed explanation?

      2) One limitation of DAmFRET is that it can only detect a nucleation barrier where there is a difference in FRET between the monomer and the assembled form of the protein. However, it can't necessarily detect when there is not a nucleation barrier i.e. if there's no difference in FRET. The text seems to suggest that CARD9 and MALT1 don't have nucleation barriers to their assembly. While this might not be intentional, it would be helpful to explicitly state that CARD9 and MALT1 could also possess such barriers that are not detectable by this method. This wouldn't detract from the finding that BCL10 has a barrier that plays an important function.

      3) In the final results section, the idea that MALT1 activation doesn't depend on BCL10 polymer structure doesn't necessarily follow from the data. An alternative interpretation is that optogenetic clustering of MALT1 causes it to recruit BCL10 and form BCL10-MALT1 filaments (structure solved by Schlauderer et al., 2018). Also, the optogenetic clustering of MALT1 may mimic some structure found in the BCL10 cluster. Therefore, we are neither convinced that the data unambiguously show that MALT1 activation strictly depends on multi-valency rather than an ordered structure of BCL10 polymers nor that this conclusion is truly necessary for the paper.

      4) What optical density do the yeast cells reach during the 16h induction in galactose? If they are in stationary phase, this could affect the assembly status of the proteins being expressed, as the cytoplasm becomes glassy when cells are starved, and this coincides with widespread protein aggregation/assembly (Joyner et al., 2016; Munder et al., 2016).

      5) Although these experiments show that thermodynamically lowering the BCL10 nucleation barrier (e.g. by post-translational modifications or protein expression levels) isn't required for a response, they don't rule it out. It would be good to state this in the discussion, as cells may have multiple mechanisms of switching on the signalosome.

      6) The discussion compares signalosomes with condensates formed by liquid-liquid phase separation. This is an interesting comparison but it suggests that disordered assemblies would not be capable of performing signalosome-like functions. This needs to be explained more clearly. For example, non-amyloid prions seem to form gel-like assemblies with a high nucleation barrier that are capable of driving heritable traits, likely through self-templating (Chakravarty et al., 2020). Such examples could represent disordered assemblies with signalosome switch-like behaviour. Furthermore, there are examples of condensates that are induced by environmental changes e.g. Pab1 and Ded1 condensates (Riback et al., 2017; Iserman et al., 2020). This potentially allows the proteins to reach high concentrations and remain un-condensed until a change in heat or pH overcomes a nucleation barrier required for condensate formation. Although the condensates aren't self-templating, they seem to require energy for their disassembly. Combined, this also allows switch-like behaviour, where the switch is flipped back to the uncondensed off state once conditions return to normal. In general, crossing a phase boundary can represent a switch-like response. Finally, recent electron-tomography experiments show that ASC puncta comprise clusters of filaments (Liu et al., 2021, biorxiv). CARD9/BCL10 assemblies may have similar ultrastructures and liquid-liquid phase separation may well play a role in their assembly.

      7) Can the authors comment on the loss of BCL10 in Echinodermata, Anthropoda, Nematoda? Is there another protein that plays a similar role? Could a CARD or PCASP protein possess self-templating properties? Could other methods of control be at play e.g. protein expression?

      8) Figures 1B/1C: Can the authors comment on why the active cells plateau at about 70-75%? This is a striking feature of the plots, but the explanation may not be obvious to readers.

      9) Figures 1D/1E: What was the concentration of B-glucan used in this experiment? This could be included in the figure legend. If greater than 1ug/ml this means that the % of active cells in Figure 1B matches the % of cells with BCL10 assemblies in Figures 1D/1E, which is potentially an important point.

      10) Use of both 'BCL10' and 'Bcl10' when referring to the protein.

      11) In the supplementary figures there are some formatting problems/missing words in the figure legends. In Figure S11 there is a black box covering the lower part of the figure.

      References used in this review

      Chakravarty, A.K. et al. (2020) "A Non-amyloid Prion Particle that Activates a Heritable Gene Expression Program," Molecular Cell, 77(2), pp. 251-265.e9. doi:10.1016/j.molcel.2019.10.028.

      Iserman, C. et al. (2020) "Condensation of Ded1p Promotes a Translational Switch from Housekeeping to Stress Protein Production," Cell, 181, pp. 818-831.e19. doi:10.1016/j.cell.2020.04.009.

      Joyner, R.P. et al. (2016) "A glucose-starvation response regulates the diffusion of macromolecules," eLife, 5. doi:10.7554/eLife.09376.

      Munder, M.C. et al. (2016) "A pH-driven transition of the cytoplasm from a fluid- to a solid-like state promotes entry into dormancy," eLife, 5(MARCH2016). doi:10.7554/ELIFE.09347.

      Riback, J.A. et al. (2017) "Stress-Triggered Phase Separation Is an Adaptive, Evolutionarily Tuned Response," Cell, 168(6), pp. 1028-1040.e19. doi:10.1016/j.cell.2017.02.027.

      Schlauderer, F. et al. (2018) "Molecular architecture and regulation of BCL10-MALT1 filaments," Nature Communications 2018 9:1, 9(1), pp. 1-12. doi:10.1038/s41467-018-06573-8.

      Significance

      The existence of a nucleation barrier had already been postulated, based on structural and other studies (referenced by the authors), it had lacked a rigorous demonstration. This work provides that demonstration, which is important for the signalosome field and more broadly applicable to researchers studying cellular decision making. The study further demonstrates that DaMFRET is an excellent to study protein assembly processes in their native environment, allowing the authors to tackle a question that would have been technically very difficult to address otherwise.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Gama et al. use a biophysical assay DAmFRET, structural analysis, and optogenetic tools to uncover the nucleation mechanism of CBM signalosome. They performed experiments first in yeast cells that lack death folds or related signaling networks, then confirmed their discoveries in human cells. The results presented here are clear and convincing. The paper is very well presented and clearly written.

      They found it is the CARD domain of BCL10 that acts as a molecular switch that drives all-or-none activation of NF-kB. Monomeric BCL10 possesses an unfavorable conformation and serves as a nucleation barrier, keeping BCL10 in a supersaturated inactive state that allows for binary activation upon stimulation.

      They also characterized CARD9 CARD domain and a coiled-coil region. They reasoned that CARD9CARD functions as a polymer seed to nucleate BCL10, and that the coiled-coil region has multimerization ability to facilitate nucleation. Furthermore, they characterized that MALT1 activation doesn't depend on BCL10 polymers but its own proximity. And MALT1 induces graded NF-kB activation, thus further demonstrating the binary activation is conferred by BCL10.

      Major comments:

      1) Fig S1D and E, the authors used TNF-a to activate NF-kB independent of CBM signalosome and found the activation in each cell increased with dose. In contrast, CBM activation led to bimodal cell activation. The authors claim that this is evidence that positive feedback upstream of NF-kB. We do not believe this claim can be made from this comparative experiment alone. We agree that positive feedback is important for activating an NF-kB response, but the comparison between CBM and TNFa is inaccurate and glosses over published data. Specifically, there is published data that TNF-a does activate a 'switch-like' or digital response, as defined by the translocation of p65 (see (Tay et al. 2010) among other studies that have examined p65 translocation at the single-cell level). The difference in T-sapphire expression between CBM and TNF activation is most likely due to TNFa induced oscillations of p65 translocation (although this is speculation on our part). Therefore we suggest to the authors that the TNF-a data (Fig S1D and E) should be omitted, as the claim of switch or not-switch as pertains to TNF signaling is more complex and nuanced than presented here. We believe omitting this data will strengthen the manuscript and avoid confusion in the field. The bimodal expression of the T-sapphire NF-kB reporter driven by the CBM signalosome activation is sufficient to claim an all-or-none response.

      2) Fig 3B, the authors introduced CARD9CARD-µNS as a stable condensed seed for BLC10. However, considering CARD9CARD can form polymers at high concentration (Fig 3B and S3D), are these high expression levels of CARD9CARD able to induce BCL10-mEos3.1 assembly (as measured by DamFRET in yeast cells)? Can the authors examine BCL10 FRET at these high expression level of CARD9CARD? We assume that BCL10 will be assembled in these cells. This would provide a valuable control experiment and support the author's conclusions.

      3) Fig 3C, the text said "Whereas WT CARD9CARD assembled into polymers at high concentration, the pathogenic mutants R18W, R35Q, R57H, and G72S failed to do so (Fig 3C and S7B,C), explaining why they cannot nucleate BCL10". This claim that these mutants can not nucleate BCL10 does not have a figure call out or a reference. The authors then show the results in Fig 3E which supports this claim. Even though they were done in the context of full length CARD, all proteins contain the I107E mutation that releases autoinhibition. For clarity, the authors should consider rearranging the text to avoid explaining a phenomenon and making conclusions before showing the results.

      4) Fig 4D, E and Video 1, the authors showed the nucleation of BCL10 into puncta within live cells is followed by p65 translocation to the nucleus. The authors claim that 'this result suggests that BCL10 is indeed supersaturated prior to stimulation' (paragraph 2 section titled BCL10 is endogenously supersaturated'). We fail to understand how this live-cell experiment leads to the conclusion BCL10 is supersaturated before stimulation. We think this text should be deleted from the text, or put into context with the DAmFRET data that lead the authors to make this claim. It would be interesting for the authors to define in discussion what are the golden criteria to claim a protein exists in a supersaturated state with live cells (by microscopy or other methods)? Adaptor protein assembly into puncta and the subsequent nuclear translocation of transcription factors is a common phenomenon across signalling pathways. Not all these pathways rely on signalling adaptors existing in a supersaturated state. The field of cell signaling (and cell biology in general) would benefit from a detailed definition of how these physical-chemical definitions of proteins are supported by experimental data. We believe that this paper will become a seminal paper in the field, and future work will benefit from a clear definition of how a claim of supersaturation is derived from the data.

      5) Regarding the supersaturated state of BCL10, the authors convincingly use optogenetics to show how transient assemblies of CARD-Cry2 can template BCL10 assembly. This is a convincing experiment that shows templated nucleation of BCL10. To strengthen the claim that BCL10 is supersaturated endogenously we suggest the author quantify the expression of BCL10-mScarlet and CARD-Cry2 and ideally show that this phenomenon can be observed at expression levels equivalent to endogenous.

      Minor comments:

      1) Special character "delta" is not displayed in the text (instead only a space).

      2) Several cell lines including mouse, human, and yeast lines were used across this manuscript. It would be clearer and more helpful if the exact cell type of the line could be indicated. Such as, "BCL10-mEos3.1 yeast cells" instead of "BCL10-mEos3.1 cells", "BCL10-mScarlet HEK293T cells" instead of "BCL10-mScarlet cells".

      3) Fig 5B, the authors indicated that BCL10 colocalized with CARD9CARD, then please show the merged image as well.

      4) Fig 6E, authors claimed that cells were stimulated with blue light for the indicated durations. The longest duration is 12 hours. Please specify if it was continuous exposure or several rounds of exposure in the indicated durations.

      Significance

      This work used a combination of FRET and optogenetic tools to engineer CBM signaling and visualize the effects. They incorporated knowledge from structure biology, together with their results from mutations and truncations, dissected the significance of each protein in CBM signalosome, and demonstrated in detail how higher-order assemblies make all-or-none cellular decisions. We believe this paper will be a seminal paper in the field of cell signalling and cytoplasmic organization. It defines a new paradigm of macromolecules assembly of signalling complexes as being dependent on protein existing in a supersaturated state. Importantly this paper opens up new questions regarding macromolecular signaling complexes (found in many innate immune signaling pathways): How is protein supersaturation maintained and used throughout evolution to construct biochemical signalling switches?

      This paper will be of particular interest to scientists working on immunity and cell signalling, especially in the field of higher-order assemblies. However, we feel the impact of this paper goes beyond these fields, and we believe this manuscript will be of broad interest to the cell biology and biophysics communities. For reference, our expertise is in innate immunity and cell biology.

      Referees cross-commenting

      In general, I agree with reviewer 4. However, I'm afraid I have to disagree with reviewer 3 that the paper requires 'a major overhaul'. I also believe that reviewer 3 suggestion #1 to use qPCR to assess NF-kB target genes is not a 'constructive and realistic suggestion'. Or, to put it another way, not within the guidelines of the RC for reviewers. This type of suggestion is too open-ended to be of use to the authors. Which should the authors analyze of the tens to hundreds of genes activated by NF-kB? A rigorous and robust editor should ignore this comment.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Two reviewers commented on the smeared appearance of Tae1 bands in our Western blot analyses (Figure 4F and 5B) and asked us to improve their technical quality.

      -We agree and will repeat these experiments with more careful attention to lysate preparation, using a higher percentage SDS gel for better separation of low molecular weight proteins as suggested.

      Reviewer 2 requested that we assess how Tae1 variants impact interbacterial competition outcomes.

      -We agree that this would be interesting to take a look at. While this will not be feasible for every variant we examine in the paper, we can conduct comparative interbacterial assays between P. aeruginosa and E. coli using P. aeruginosa strains with a tae1 point mutation for c110s. Given that our biochemical experiments show that this hyperactive variant evades inhibition by the cognate immunity protein, we expect that this may decrease P. aeruginosa fitness, even in the context of competition.

      More generally, we think that examining Tae1 variants in the context of interbacterial competitions would be a critical orthogonal approach in order to validate that the DMS results have any bearing on competition outcomes. However, we feel that major focus of this paper is on the more molecular and biophysical insights that our approach can offer. Our study tests our assumptions about the kinds of features and surfaces that are important for proteins that engage with non-canonical complex substrates. It is, of course, interesting to think about the implications of this for physiological phenotypes and the drivers of toxin evolution. It is also exciting to imagine how this kind of information could be used to one day engineer certain interbacterial outcomes. We hope that others in the field will push our efforts into these directions, but we do not feel that these directions are essential for our conclusions. However, our conclusions on the molecular and biophysical aspects have helped generate interesting hypotheses in microbial ecology that could be largely followed up on by others.

      In order to conduct well-controlled P. aeruginosa:E. coli competition assays for more Tae1 variants, we would need to generate a significant number of new P. aeruginosa strains encoding point mutations for each of our variants across several genetic backgrounds. The competitions themselves also require a considerable amount of work to optimize and quantify. We are able to do this for one of the variants as previously mentioned (C110S). It’s important to note that the first author of this paper, who was the primary driver of this work, is no longer in my lab or in academia. As for myself, I am also in the middle of a transition out of academia and am actively ramping down my lab at UCSF. I no longer have the space or appropriate set-up to support this longer-term effort.

      Reviewer 2 asked that we examine Tae1 (WT and C110S) expression levels in vivo to more precisely examine whether increased self-intoxication by Tae1C110S in P. aeruginosa was due to differences in toxin activity or toxin levels.

      We agree with this suggestion and will look at toxin protein levels by Western blot analysis in the context of P. aeruginosa cells grown 1) alone on solid media and 2) together with E. coli on solid media during interbacterial competition using conditions that match our other competition assays.

      All 3 reviewers asked us to provide more experimental evidence addressing the hypothesis that differential peptidoglycan (PG) affinity across Tae1 variants could explain variation in toxic activity.

      -We agree that this is an interesting point to follow up on further. To be clear, we also do not know whether this hypothesis is true at this stage, and the answer is not necessarily critical for our central advance, but we would like to give it a try! We have devised an approach to ask the question experimentally across a subset of our deep mutational scanning (DMS) variants.

      Reviewer 1 suggested that we quantify in vitro binding affinities for PG using isothermal titration calorimetry (ITC). However, given that ITC requires high concentrations of well-defined homogeneous substrates, which we are not able to generate for more complex higher order structures of cell wall PG, we propose a pull-down based approach.

      Briefly, we plan to conduct pull-downs using insoluble, purified cell wall sacculi from our two E. coli grown under the two conditions as bait for recombinant Tae1 proteins. Given that intact sacculi or inherently insoluble, we can simply collect bound Tae1 through centrifugation of sacculi pellets and examine the amount of Tae1 associated by Western blot analysis. These analyses will need to be conducted across a titration of Tae1 concentrations and also with catalytic activity inhibited to avoid solubilization of sacculi. We will block Tae1 hydrolysis by carrying out pull-downs in the presence of a general commercially-available cysteine hydrolase inhibitor, E64. If there is indeed differential affinity for PG underlying lytic differences across Tae1 variants, we would expect to see greater relative association of Tae1 variants with the type of cell wall sacculi that they more effectively lyse in our DMS screen. We would expect the reverse trend to also be true (lower affinity for less active variants).

      Reviewer 1 would like to know if we have done lysis experiments with any E. coli mutants that only impact PG density but not PG polymer structure? If they haven’t tested any E. coli mutants, have we done lysis experiments using drugs that have a similar impact on PG? Even if we don’t include these data in the paper, the reviewer would like us to comment on the trends we have observed.

      We have not done experiments in any mutants or chemical backgrounds known to only impact PG density but not polymer structure. We think this would be a very interesting angle! But unfortunately this is outside the scope of this study. It would require that we first experimentally confirm that the restrictive effect on only density is clearly demonstrated using a variety of techniques, including microscopy, chemical analyses, and biophysical probing of sacculi.

      Reviewer 1 asked for additional DMS screens in more conditions

      We love this idea! In fact, we hope that others are motivated to adopt our workflow to run many more DMS screens for T6S toxins, as we believe these screens provide a lot of useful and sometimes surprising insights that could be of great interest to others. However, we believe that the primary goal of this paper is to establish this methodology as a compelling approach for studying toxins and, more generally, proteins with complex cellular substrates. It does not necessarily fall within the scope of this paper to fully assess the mechanistic implications of cell wall diversity across a wide range of conditions.

      In our experience, rigorously conducting DMS screens requires a significant amount of effort and resources to establish consistent experimental conditions. Also, a non-trivial number of costly sequencing-based experiments are required across control and variables for the results to be statistically sound and meaningful. Furthermore, experimental validation of results are ultimately important for our ability to confidently generate hypotheses stemming from these datasets. As stated above, the first author of this paper, who was the primary driver of this work, is no longer in my lab or in academia. As for myself, I am in the middle of a transition out of academia and am actively ramping down my lab at UCSF. I no longer have the space or appropriate set-up to support this longer-term effort.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper by Radkov et al. represents an extensive structure-function-evolution treatment of the Type VI secretion effector protein Tae1. Using mutational scanning, the authors identify multiple residues that either enhance or reduce Tae1 function in an E. coli model, and validate these residues through direct functional assays. The main conclusion is that Tae1 contains a surprising number of non-intuitive residues important for its activity, particularly several surface-exposed residues far from the active site. The authors then suggest that these residues mediate binding to specific PG architectures and supply some evidence that the functional mutation landscape changes when the DMS assays is repeated in E. coli with altered cell wall architecture. Lastly, natural variants of Tsae1 are identified and discussed in the context of the trade-off between optimal toxicity and maintenance of self-immunity.

      I have no major comments. The study is beautifully-done, with all controls in place. It might be worth following up on their putative PG binding residue mutants with an additional binding assay (MST, or just a crude cell wall pulldown assay), but that is not critical to support the main conclusions.

      Minor comments

      • The Western Blot of the vector control in Fig. 4F has the same impurities as the one in Fig. 5 B. Was the control blot re-used? If so, please indicate in the figure legend. Also, please show full Western Blots in supplemental material.
      • Small typo in Fig. 5 legend ("does is not")
      • The citation in line 108 seems a little off - that does not seem to support a physiologically relevant context for Tae.
      • Line 122/123 - something seems to be missing in this sentence.
      • Line 148 - this is not clear to me. Did they sequence plasmid barcodes (are those in the plasmid backbone?), or the mutated orfs?

      Significance

      This paper makes an impactful contribution to the open question of substrate- and species-specificity of PG hydrolases, particularly those weaponized by Type VI secretion systems. The major advance here is that PG binding by the hydrolase, and PG architecture of the substrate, are important determinants of Tae function and that this has important evolutionary consequences. The study will be of interest to the Type VI secretion community, but also to the PG turnover field.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript aims to investigate the molecular basis underlying the differential toxicity of the bacterial T6SS amidase effector (Tae) by using Pseudomonas aeruginosa Tae1 as a model. The rationale is that while Tae is a conserved T6SS toxin degrading bacterial peptidoglycan (PG) by specific cleavage activity, different Tae toxins of the same family exhibit distinct lysis/antibacterial activity. Thus, the authors used combination of a deep mutagenesis scanning (DMS) coupled with fitness assay, NMR, and PG-binding/amidase activity to address this question by expressing Tae1 variants in E. coli. Besides finding the residues at/near the catalytic site critical for amidase activity, the authors further discovered many surface-exposed resides distant from the catalytic cleft also contributed to Tae activity likely by affecting binding or hydrolysis of PG. The authors further explored whether the residues contributing to loss or gain of Tae1 activity could be different against different PG structure by performing the same suite of DMS analyses from E. coli grown in the presence of D-Met, which resulted in reduced PG density and crosslinks. They discovered the fitness landscape of Tae1 variants shift dramatically, suggesting that Tae1 toxicity is highly context-dependent and optimizable for specific PG forms. A hyperactive Tae1 C110S variant is also naturally encoded in a subset of Proteobacteria outside of P. aeruginosa. This further led to a prediction that Tae1 C110S variant may evade binding and inhibition by cognate immunity, which was confirmed by the higher binding affinity of WT Tae1 than C110S Tae1 determined by ITC analysis. Together, the authors concluded that substrate-specificity and toxin-immunity interactions are the two distinct selective pressures for shaping diversity across the Tae1 toxin superfamily.

      Major comments:

      This is a well thought, carefully designed and executed research article reporting important and interesting findings. The conclusions made are mostly supported by the provided data. However, the toxicity assay for Tae1 variants except C110S was only validated by ectopic expression in E. coli or in vitro activity assay. Considering Tae1 is a bacterial toxin involved in interbacterial antagonism, the mutants with newly discovered key residues contributing to loss or gain of function shall be also evaluated for their role in the context of interbacterial competition, not simply by the cell lysis assay of expressed Tae1 variants in E. coli. Below are the specific comments that shall be addressed in order to claim the findings of this work .

      1. It is an exciting finding that several surface residues distal from catalytic core mediate PG hydrolysis or binding. While the validation of their cell lysis activity by expressing each Tae1 variants fused with LepB signal peptide is informative, the role of these surface residues in toxicity shall be also tested by interbacterial competition assay either using E. coli or susceptible Pseudomonas aeruginosa strain as a prey. Tai may be expressed in E. coli prey to determine its neutralization activity during interbacterial competition context.
      2. Based on the results that fitness landscape of Tae1 variants grown in the presence or absence of D-Met, the authors stated in line 334 "Condition-specific phenotypes suggests that Tae1 toxicity in vivo is highly context-dependent and optimizable for specific PG forms." However, there could be other physiological changes due to D-methionine. To claim this, the authors may test the surface residues with altered impacts on fitness between two growth conditions for their PG-binding activity using PG isolated from culture in the presence or absence of D-Met .
      3. Quality of western blotting for Tae1 variants in Fig. 4F, 5B should be improved as the signals from WT is not clearly detected for comparison. The authors may use higher percentage of SDS-PAGE for better resolution of small Tae1 proteins. Relevant protein marker should be indicated. In addition, why there is no western blot analysis of C30A variant?
      4. It is exciting that a hyperactive Tae1 C110S variant is also naturally encoded in a subset of Proteobacteria outside of P. aeruginosa. The authors showed higher binding affinity of WT Tae1 than C110S Tae1, which correlated with lower fitness of C110S variant in a competition setup (Fig. 6C, 6E). The authors suggest that "Tae1 of C110S variant lyses kin cells at a faster rate than Tae1 WT can bind and inhibit killing, leading to a fitness cost for this strain" (Line 460-463). To claim this, expression levels of endogenous Tae1 of both WT and C110S should be shown as well as their secretion levels to rule out the effect of protein abundance and secretion levels may affect the fitness. It would be also recommended to set up a real interbacterial competition assay by selecting the survival cfu of prey cells.

      Minor comments:

      1. Is Tae1 previously named as Tse1? Please clarify and indicate the previous name and accession number. As stated in Line 61" Although many T6S bacteria deploy similar toxins, interbacterial outcomes can vary considerably depending on the bacterial species engaged in T6S-mediated competition", the authors should also cite other relevant references showing differential Tae toxicity from different organisms (such as Serratia marcescens. Ssp1 and Ssp2 from English et al., 2012, Enterobacter cloacae Tae4 from Zhang et al., 2012, and Agrobacterium tumefaciens Tae from Yu et al., 2021). The manuscript shall gain more insights by discussing the biological significance of their conservation yet distinct toxicity and potential condition-specific activity of Tae toxins studied in different bacterial lineage besides those in P. aeruginosa.
      2. Line 111-113 "the Tae1 protein from P. aeruginosa, which is injected into E. coli and leads to cell lysis": citations are needed here.
      3. The heatmap in Fig. 4E also include those with mixed phenotypes. Are the averaged fitness score meaningful since some residues are likely derived from the mixed phenotypes, which make the data less reliable. I suggest the authors to only include those true GOF or indicate which one is true GOF and which one is from mixed phenotypes.

      Significance

      This manuscript used innovative approaches to investigate the mechanism and biological significance underlying the differential toxicity of the bacterial T6SS amidase effector (Tae). Tae superfamily can be classified into four families (Tae1-4), which are universally encoded in T6SS of diverse Proteobacteria. It is intriguing that Tae toxins classified in the same family produced by different bacterial lineage/strains exhibit distinct lysis/antibacterial activity but the underlying mechanism is unknown. This manuscript provided evidence suggesting the existing natural diversity of Tae1 in substrate-specificity and toxin-immunity interactions, which are the key selective pressures for shaping diversity across the Tae1 toxin superfamily. The findings provide an explanation how bacterial toxin effectors evolve in the context of interbacterial antagonism, which have not been answered from previous literatures (Russell et al., 2012; English et al., 2012; Chou et al., 2012; Zhang et al., 2013; Yu et al., 2021). The methods combining deep mutagenesis scan, biochemical, and structural analysis provide a comprehensive and unbiased view to understand the diversity of Tae1 family and their corresponding phenotypes and biochemical features. As a molecular microbiologist working on bacterial secretion systems and their effectors not familiar with structural studies, I am better qualified in evaluating the biological and biochemical data but not the structural studies in NMR and structural modeling. However, I highly appreciate the authors who nicely presented the story by explaining the concept of each method which allows the reviewers/readers to understand the data even though not within their expertise.

      Russell et al., Cell Host Microbe 11:538-549. https://doi.org/10.1016/j.chom.2012.04.007 English et al., Mol Microbiol 86:921-936. https://doi.org/10.1111/mmi.12028. Chou, S. et al. Cell Reports 1, 656-664. DOI: 10.1016/j.celrep.2012.05.016 Zhang et al., J Biol Chem 288:5928-5939. https://doi.org/10.1074/jbc.M112.434357 Yu et al., J Bacteriol 203:e00490-20. https://doi.org/10.1128/JB.00490-20.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this paper the authors characterize a member of the bacterial T6SS amidase effector (Tae) superfamily of toxins that are delivered by the Type 6 Secretion system of Pseudomonas aeruginosa into target prey bacterial cells. The authors focused on why this toxic effector because it shows different potency when delivered to different target species despite the fact that all target species have peptidoglycan, the substrate that Tae attacks. The authors use powerful approaches such as deep mutational scanning (DMS), to define critical residues near the Tae active site and other sites that affect its enzymatic activity and interaction with its cognate immunity protein. The discovery of the C110S mutation which increases Tae activity is a fine example of the power of this approach. When combined with structural biological analysis, the results of the study and discussion in the manuscript is of broad interest to the community of scientists interested in toxic bacterial effectors that digest the cell wall and also others that are interested in the remodeling of peptidoglycan during cell growth and shape determination. I would recommend acceptance of this paper for publication after the authors address a few minor comments:

      1. The authors observe PG changes caused by D-met (Fig 4 and relevant text). I'm curious as to whether changes in lysis are caused by differences in PG crosslinking or PG density. They point out that the sugar binding surface of WT could localize the PG digestion (paragraph at line 521) which would no longer be required at lower density PG. However, my concern is that they propose that variation in Tae activity in different target organisms could be explained by differences in PG affinity without testing this DMS screen in any other strain, let alone species.
      2. They also don't screen any Tae1 homologs, though they address one residue in their phylogeny. I'm not sure if there are species with such a low-density PG layer, so their repeated connection to Tae's variable lytic capacity between species in the text and discussion seems tenuous until they do a DMS screen with their plasmid library in another species (or at least another strain).
      3. If WT Tae1 has some checking mechanism to ensure it's in the PG layer, I can imagine it might be slower to fire in less dense PG. That would also make sense given the chemical perturbations in Fig 3 where residues on the opposite face from the catalytic site are involved in binding PG. I would be interested to see if WT Tae1 can bind multiple PG chains or binds at higher affinity. A calorimetry approach like the one they use later may answer those questions, but that might be outside the range of this paper.
      4. It's also worth looking around for E. coli PG synthesis mutants that don't change the PG polymer structure, only the density. That might also happen if the bugs are grown under osmotic stress, which should at the very least stretch out the sacculus. That may help differentiate differences in PG composition from differences in chain density. Perhaps subinhibitory amounts of drugs that affect PG synthesis my have the same of effect of increasing or even decreasing Tae potency by modulating PG density. Of course, this may have to be done under protective osmotic conditions. Have the authors tried these sorts of experiments and if so, please comment on the trends even if the data will not be presented in this report.

      Significance

      When combined with structural biological analysis, the results of the study and discussion in the manuscript is of broad interest to the community of scientists interested in toxic bacterial effectors that digest the cell wall and also others that are interested in the remodeling of peptidoglycan during cell growth and shape determination.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      We would like to thank the reviewers for their prompt and thoughtful input on our manuscript, and their willingness to participate in more portable review through ReviewCommons.

      2. Description of the planned revisions

      Reviewer #1, major comments:

      • A major concern is that the data are reported and analyzed on a per tomogram basis when many tomograms contain multiple mitochondria. Given that the mitochondria appear mostly well separated in Sup. Fig 1 with only a few connections visible, and the high degree of pleomorphism noted by the authors, I would strongly suggest that the authors use each mitochondrion as the basis for reporting their metrics rather than the FOV/tomogram as this would avoid mixing metrics from different mitochondria that may be in different states (e.g., fusion/fission). This would apply to data shown in Figures 3, 4, 5, and 6.

      We appreciate the reviewer suggestion to separate on a per mitochondrion vs per tomogram basis for our analysis. While we do not anticipate that this will significantly change the overall findings, we agree that splitting per mitochondrion will account for any possible variability between mitochondria within the given field of view. Furthermore, we anticipate that this will actually improve our analysis and statistical power by effectively increasing the total sample size per experimental group. For our next revision, we will divide surfaces on a per mitochondrion basis within a given tomogram, and re-run the full analysis pipeline. Additionally, per reviewer request, we will include an output histogram for each measurement per mitochondrion surface in a supplemental figure.

      • In Figure 3C the authors show the combined distribution of OMM-IMM distances within each condition. This may obscure some variability within populations. Individual histograms for all mitochondria should be included as supplementary material. Currently, it is difficult to judge if the peak of the combined distribution is appropriate and impossible to judge the variability between tomograms (preferably mitochondria, see above comment). Additionally, the shape of the distributions appears significantly different between conditions, suggesting that selecting a single peak value as representative and the basis for the statistical tests (Fig 3D) might not be appropriate. Please comment.

      We will include individual histograms for each measurement per mitochondrion surface in a supplemental figure.

      We agree that peak-based statistical tests limit our ability to quantify more complex differences, and this is why we chose to output histograms in addition to violin plots, so that shape differences can be observed qualitatively. A major challenge of shape-based statistical quantification is the assessment of independent samples. By using peak-based quantification, we could assume that each tomogram (and in the planned revision, each mitochondrion) is an independent sample, but for shape distribution this is inappropriate since there is more than one value represented per tomogram. Running a KS test with N equal to the number of tomograms yields no significance even in the visible cases where the shape appears very different.

      However, the number of triangles also poorly represents the number of independent samples, since 1) the number of triangles used to represent a surface is somewhat arbitrary and remeshing can change it dramatically and 2) Our chosen triangle size is considerably smaller than the visually observed feature size in order to allow effective vector voting in the pycurv AVV algorithm. The result of this is that when we use a KS test on the distribution of values per triangle, even visually identical distributions yield p-values below 10^-200.

      We do estimate the approximate smallest feature size during our calculations, since that is used to generate the radius used by pycurv in vector voting, to be 12 nm (the radius hit parameter in pycurv). During a public presentation of this work an audience member suggested that we might use the area implied by this feature size (~450 nm^2) as the size of an independent sample. This would yield around 1000 independent samples per tomogram. Because the choice of feature size is heuristic and manual, this is not as statistically sound as the peak-based metric, which is why we believe that the more conservative peak-based statistical testing is the gold standard for proving differences, but we believe this will be the most reliable way to quantify differences in shape of distributions. We plan to implement this quantification in our revision, and will evaluate whether it gives “expected” statistical results by a bootstrapping approach using subsampling of triangles from the same vs different mitochondria.

      We would welcome reviewer suggestions for additional shape-based metrics and will explore other potential metrics to capture shape as part of our revision. While our peak-based metrics demonstrate our ability to statistically capture small changes in ultrastructure with this method, shape-based quantification will significantly enhance the capability to capture finer changes in structure that may be critical to understand physiologically.

      Once this additional testing is complete, we will add a section to the results section describing choice of statistical framework. We also plan to generate a supplementary table showing the results of the peak-based quantification alongside all shape-based quantifications.

      • In Figure 4C-F, again combined distributions are shown. Authors should include individual histograms for all mitochondria as supplementary material. The diversity of distributions in the metrics are more pronounced than the distances in reported in Fig 3, again making assessment of variability difficult and raising doubt about using the single peak value.

      We will include individual histograms for each measurement per mitochondrion surface in a supplemental figure.

      As we describe above, we will make test several options for distribution-based statistical quantifications and incorporate the results in the manuscript. We expect them to be useful for every measurement we make.

      • It would be helpful to include the curvature or curvedness of the OMM for each mitochondrion in the supplementary material. The data to correlate OMM curvature with elongated/fragmented mitochondria should be available and might be of interest to some readers.

      We will calculate curvedness of the OMM for each mitochondrion and include these data in the supplemental material. The inverse of the curvedness of the OMM gives a reasonable approximation of the radius of the mitochondrial “tube”, a feature which can be challenging to quantify fully automatically, and we agree that this may be of particular interest to some of our readers – particularly if morphology changes or stress-driven changes alter that radius in a statistically significant way!

      Reviewer #1, minor comments:

      • For all data, exact n per condition should be given (in text and captions as appropriate), not a range for the whole set.

      We will report the exact n per condition in text and in captions after we separate our data on a per mitochondrion basis and update the analysis.

      • Fig 5E middle, legend obscures some of the data.

      We will reformat the graph such that the legend does not obscure the data after we separate our data on a per mitochondrion basis and update the analysis.

      Reviewer #2, major comments:

      Barad, Medina et al. presents a new toolkit for the analysis of membrane ultrastructure in cryo-tomograms. More specifically, the toolkit is designed to compare curvature, angles and spacing between different membrane types in mitochondria. These analyses allow for the quantitative comparison of membrane features e.g. for different growth conditions. To demonstrate the utility of the toolkit tomogram datasets of mitochondria in the presence and absence of ER stress were analyzed. The authors conclude that ER stress affects mitochondria morphology through remodeling of the membrane structure. The presented biological results and statistics are convincing and show active mitochondrial membrane remodeling in the cell when exposed to ER stress. It is also clear that there is a need for more quantitative evaluation based on the wealth of tomographic image features and mitochondrial membranes are certainly a well-chosen application. For this purpose, the authors developed a new workflow even though most of the discussed analyses are very specific to mitochondrial structures. Therefore, broader applications of these tools to other organelles are not easily envisaged without significant adaption. In that context, the title and abstract overpromise a much more powerful utility that can be applied to any other membrane analysis. Rather it seems that the proposed workflow is more of a specific tool or a pipeline for mitochondrial inner and outer membrane analysis instead of a toolkit for general morphological analysis. Hence, the manuscript cannot be accepted in its current form. In particular, the structure needs a significant rework of editing to become more comprehensible.

      We appreciate the criticism that our workflow as implemented at the time of preprint is seemingly too focused on mitochondrial membranes and is not general. We’ve overhauled our workflow into a configurable (through a project YML file) scripted workflow that can take a folder with arbitrary segmentations and convert them into high quality meshes, followed by per-triangle quantification of the four primary metrics we describe in the manuscript: inter-membrane distance, intra-membrane through-space distance, curvature, and orientation. Generating fully automated visualization tools is more challenging, because which quantities are measured and how they are sub-classified (e.g., as we did for cristae, junctions, and IBM) is very project-specific; however, we did convert our visualization script into a library of utilities to combine tomograms into experiment objects, with methods to serialize for rapid access and functions for generating statistics and plots. Our converted visualizations script has been reorganized to act as an example of how similar questions could be asked for arbitrary membranes.

      We propose to further demonstrate the generality of this updated approach by segmenting several examples of another organelle, the autophagosome, found in our dataset and applying the workflow to them in a supplementary figure.

      The focussing to a method paper will also require more in-depth descriptions of the methodology in the main text. Although the code is deposited at github, there is no script-based workflow and description presented in the manuscript. Although Figure 1 puts the work into context of tomography, it remains very superficial on the image analysis. What are the input and output formats required for each step to follow the sequence of the workflow and at which steps critical interactive input is needed? What are the hardware requirements (CPU, GPU) or performance characteristics (CPU hours for certain operations)?

      In addition to the changes mentioned above, we also added a “Supplemental Table 1” detailing computational requirements and time for each step.

      We expanded on the description of this approach in the first paragraph of the results section:

      “With this strategy, we were able to segment 32 tomograms containing mitochondria, divided between the elongated and fragmented bulk morphology populations and the two treatment groups (Figure 2, Supplementary Figure 1). The segmentation output was fed into the fully automated surface morphometrics pipeline (Figure 2B, Supplementary Figure 2, Supplementary Table 1). The voxel segmentation was converted to high quality membrane surfaces using the screened poisson algorithm32. Next, these surfaces were converted into triangle graphs and curvedness was estimated using pycurv15, and the distances within and between surfaces as well as the relative orientations of different surfaces were estimated using the resulting graph. Finally, the quantifications for each tomogram were combined into experiments to allow aggregate statistics and visualizations. This 3D surface morphometrics pipeline is configurable for any segmented membrane and is available at https://github.com/grotjahnlab/surface_morphometrics.”

      We added a description of the up to date workflow in the methods section:

      “Software workflow

      The surface morphometrics pipeline is a python 3 scripted workflow with requirements that can be installed as a conda environment contained in an environment.yml file. The workflow is fully scripted and configurable with a config.yml file, and is run in 3 steps, with statistical analysis and visualization as an optional fourth step. First, a segmentation MRC file is converted automatically to a series of surface meshes formatted in the VTP file format. Second, for each mesh, the surface is converted to a graph (tg format) and curvature is estimated using pycurv. Third, orientations and distances between and within surfaces are calculated using the resulting graphs, and a CSV with quantifications as well as a final VTP surface file is output with all quantifications built in. Fourth, the outputs from multiple tomograms are combined for visualization and statistical analysis. Times and computational requirements are shown in supplementary table 1.”

      Figures 3-7 contain colorful 3D renderings of the measured quantities. In addition, they are filled with histograms of every possible quantitative parameter, which often are not very significant or different between. The authors should focus the main results and the figures to show the most relevant and significant findings and put the remaining panels and results into the supplement.

      Figures 3-7 were organized around the different methodologies (inter and intra-membrane spacing, curvature, orientation) but we agree that focusing to the main results of each methodology is sufficient to show the value of these results. We propose to address this criticism by moving figure 4D,F (inter-crista and junction spacing), figure 6 E,G (the junction measurements) and Figure 7 to supplemental figures. These supplemental figures will also be joined by the previously requested OMM curvature analysis and our proposed analysis of autophagosomes.

      One of the key steps is the generation of a smooth surface from a segmented membrane, there is a question whether true membrane disruptions will be smoothed and may be overlooked in this approach. When these disruptions present true membrane ruptures, they may be of particular biological importance. The authors should support the choice and selection of the smoothing parameters in order to illustrate this potential pitfall.

      The smoothing and hole-filling parameters are now configurable using the point_weight and extrapolation_voxels parameters in the config.yml file. Notably, the surfaces used for quantification used minimal smoothing, and any triangles more than a single voxel away from the point cloud were deleted, in order to ensure that the quantifications were minimally impacted by “hallucinated” surfaces. Additionally, the following text was added to the methods section discussion surface reconstruction:

      “A surface mesh was calculated from the oriented point cloud using the screened Poisson algorithm32, with a reconstruction depth of 9, an interpolation weight of 0.7, and a minimum number of samples of 1.5. These settings were chosen to maximize correspondence to the data, rather than smoothness. The resulting surface extended beyond the segmented region, so triangles more than 1 voxel away from the point cloud were deleted. Interpolation weight (point_weight) and the mask distance (extrapolation_voxel) are both configurable in the surface morphometrics pipeline if more aggressive smoothing and hole filling are desirable.”

      Throughout the manuscript, the authors mention statistical significance several times and one of the main aims of the study is perform statistical hypothesis testing. It is important to specify the significance test (not only in the methods) and the p-value in order to support this claim. In the manuscript, the authors use exclusively the Mann-Whitney test. What is the rationale for choosing this test? Have the authors considered comparing the total distributions and not just the peaks with e.g. a Kolmogorov-Smirnov test? For a statistical methods paper, there are also no discussion on error analysis.

      This was a common concern raised by both reviewers, and we agree that a test based on total distribution would be more powerful than only looking at peaks. We address the use of the Kolmogorov-Smirnov test and the limitations we have run into thus far in our response to reviewer 1 in detail. In brief, KS tests tend to vastly overestimate statistical significance because the number of samples (the number of triangles) is vastly larger than the true number of independent features sampled in the data, so that even very similar looking distributions such as those in figure 5C yield p values in the range of 10^-200. We propose several approaches to better estimate the number of independent variables. We will also use a random subsampling approach within individual mitochondria to ensure sampling from the same distribution does not yield statistically significant results.

      In addition to testing additional approaches to incorporate KS testing (based on estimation of number of independent features in each tomogram), we propose to improve our peak-based statistics by estimating a standard error for the peak of each tomogram using a bootstrap approach, getting the peaks from different random subsamples of triangles.

      Reviewer #2, minor comments:

      1. https://github.com/grotjahnlab/surface_morphometricsshould include an example data set or tutorial for dissemination.

      We are in the process of uploading all frame-averaged tilt series, tomograms, segmentations, and reconstructed surfaces to EMPIAR. Additionally, we propose to implement a complete tutorial including a single tomogram for readier workflow testing, separate from the complete data upload.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1, major comments:

      • As the work reported here is heavily computational, additional details about the computer hardware used and the time it took for the calculations to complete would be helpful for readers considering applying the code to their own data.

      We appreciate the suggestion and included Supplementary Table 1 in the supplemental material outlining the computation time per step in our analysis pipeline:

      “Supplemental Table 1. Approximate time and for each step of the surface morphometrics workflow.

      Representative times and computational resources used for each step of the surface morphometrics workflow for each tomogram (unless otherwise noted) by the authors. Most time-intensive calculations were run in parallel on a compute cluster for each tomogram.

      Step

      Human Time (HH:MM)

      Computational Wall Clock time (HH:MM)

      CPU Cores Used

      RAM Used

      Automated initial segmentation (TomoSegMemTV)

      00:10*

      00:10*

      8

      64GB

      Manual segmentation cleanup and classification

      03:00

      N/A

      8

      64GB

      Point cloud conversion and mesh generation

      00:01

      00:03

      4

      16GB

      Graph generation and curvature estimation (pycurv)

      00:01

      01:40

      16

      128GB

      Distance and orientation measurement

      00:01

      00:10

      16

      128GB

      Assembly of outputs from multiple tomograms into dataframes and serialization

      00:01

      00:10

      1

      16GB

      Visualizations and statistical tests

      00:01

      00:10

      1

      16GB

      * Tomosegmemtv is sometimes run iteratively with different settings to improve output. 10 minutes is approximately the time taken for a run without iteration, in the case of good output.”

      Reviewer #1, minor comments:

      • Pink and purple very close, consider alternative pair of colors or different shades to distinguish OMM and IMM

      We kept OMM as purple but changed IMM to orange for Figure 3-7, and will make the associated changes to Figure 2 and Supplementary Movie 1 on final submission.

      • Orientation of scaleboxes/scalebars should be consistent per figure panel. If knowledge of the axes is important to the reader, these should be included as well.

      We followed the reviewer’s suggestion and updated the scale cubes to be standardized per panel.

      • In the last sentence of the introduction, the term "organellar architectures" is used, instead of the previously defined "membrane ultrastructure." Consider changing for clarity.

      We changed “organellar architectures” to “membrane ultrastructure” in the last sentence of the abstract.

      • Inconsistent use of the phrase "cryo-electron tomography" after defining and using "cryo-ET"

      We changed all instances of “cryo-electron tomography” to “cryo-ET” after defining in the first instance in the introduction.

      • Authors argue that the distinction between curvedness and curvature is important and that curvature is less appropriate in this context, but then use curvature in the abstract, throughout introduction and in the results section. Usage can be improved for readability.

      We changed all instances of “curvature” to “curvedness” throughout the text and figure legends.

      • In section "Development of a framework to automate quantification of ultrastructural features of cellular membranes" the second last sentence should read "... higher quality membrane surfaces as compared..."

      We changed “surface” to “surfaces” in text.

      • In section "IMM curvedness is differentially sensitive to Tg treatment in elongated and fragmented mitochondrial networks" the fourth sentence should perhaps read "... despite apparent visual differences, no significant..."

      We changed “difference” to “differences” in text.

      • The term "cell's growth plane" is not clear from the text nor from Fig 6A. Do the authors mean surface of the substrate the cell is growing on?

      We clarified and further defined the “cell’s growth plane” in the text by adding the following phrase:

      “… the cell’s growth plane (i.e. the plane of electron microscopy grid substrate to which the cell is adhered) (Figure 6A).”

      • In Materials and Methods:

      • The authors report that manual back-blotting was used in a Vitrobot. This is non-standard usage and more details should be provided.

      We added the following description to clarify our manual back-blotting procedure on the Vitrobot:

      “After 8 hours of incubation, samples were plunge-frozen in a liquid ethane/propane mixture using a Vitrobot Mark 4 (Thermo Fisher Scientific). The Vitrobot was set to 37° C and 100% relative humidity and blotting was performed manually from the back side of grids using Whatman #1 filter paper strips through the Vitrobot humidity/temperature chamber side port. The Vitrobot settings used to disable automated blotting apparatus were as follows: Blot total: 0, 2; Blot force: 0, 3; Blot time: 0 seconds.”

      • In section "Fluorescence Guided Milling" in the third sentence, the word "based" is repeated, second can be removed.

      We deleted the second instance of “based” in this sentence.

      • Symbol for degree (or the word degree) should be added to angular increment and tilt range for clarity.

      Added degree symbols to the following sentence in the “Tilt Series Data Collection” portion of the materials and methods:

      “Tilt series were acquired using SerialEM software (Mastronarde, 2005) with 2° steps between -60° and +60°.”

      • Capitalization of TomoSegMemTV is inconsistent.

      We changed all mentions to TomoSegMemTV.

      • Fig 3 title - consider replacing "Inter-mitochondrial membrane..." with "Intra-mitochondrial membrane..." for clarity.

      We clarified this point by changing “Inter-mitochondrial membrane distance” to “Distance between inner and outer mitochondrial membranes” in the figure legend:

      “Figure 3. Distance between inner and outer mitochondrial membranes is dependent on mitochondrial network morphology and presence or absence of ER stress.”

      • Fig 3C caption - should explicitly state it is a combined histogram and that the dashed lines correspond to the peak of the pooled data.

      We changed “Quantification of” to “Combined histogram of” and added the sentence ” to each of the relevant figure captions (Fig. 3c, 4c-f, 5b-e, 6d-g, 7c):

      “Dashed vertical lines correspond to peak histogram values of pooled data”

      • Fig 6B and 6C caption - upper and lower parts not explicitly described.

      We modified Fig 6B&C caption to more clearly describe the figure panel:

      “(B) Two representative membrane surface reconstructions of lamellar Tg-treated elongated mitochondria, colored by angle of IMM relative to OMM.

      (C) Two representative membrane surface reconstructions of a less rigidly oriented Tg-treated elongated mitochondria, colored by angle of IMM relative to the growth plane of the cell.”

      Reviewer #2, major comments:

      1. Title and abstract need to be toned down not to overpromise a very general toolkit. The presented method may be a tool or a collection of scripts - a toolkit can be used to address other types of (membrane) analysis problems. In the end, the analysis builds to a large extent on the previous developments and implementation of PyCurve. Perhaps, the most interesting contribution here is the application of the mesh generation by the Poisson reconstruction method to the segmented membranes, which is, however, well implemented in the used pymeshlab framework. The computation of distances and angles is straightforward.

      We appreciate this critique and do not want to overpromise with our work, although we believe the overhaul to a fully configurable workflow addresses the primary concern. We are quite clear in the text that we build on top of pycurv, and recommend citation of the original tool as well as our pipeline in the github repository as a result. With that said,

      We have changed the title as follows:

      “Quantifying mitochondrial ultrastructure in cryo-electron tomography using a surface morphometrics pipeline”

      We have also renamed our method to the surface morphometrics pipeline to reduce over-implication of generality, and made other small changes to increase degree of detail about what our method is resolving.

      When reading the manuscript, the reader is left in the open whether this is a method paper or a biological results paper. The title/abstract suggests that this is a method paper and the manuscript is more of a mitochondrial membrane report in ER stress. Therefore, the title/abstract does not reflect the manuscript very well.

      We aim to use this manuscript to describe the development of a workflow that enabled novel and interesting biological results. We adjusted the title to better match the combined development of a new pipeline and application to an interesting biological system as proof of concept:

      “Quantifying mitochondrial ultrastructure in cryo-electron tomography using a surface morphometrics pipeline”

      The manuscript also requires substantial structural editing. Several references to Figures are not appearing in the text in the order that the Figure panels are built. Excessive cross-referencing of figures also make the manuscript hard to read.

      We simplified our referencing of figures and made sure the text matched the order of the figure panels.

      The exact morphological discrimination between fragmented and elongated mitochondria is not easily understood from the results section. What is really meant by blinded manual classification? It only became clear when reading the methods. The results section should stand on its own. How is the overall population between fragmented and elongated cells is affected after Tg application?

      To clarify our methodology for blinded classification of mitochondrial network morphologies we included the following text:

      “We categorized cells for mitochondrial network morphology by blinded manual classification in which five researchers were given fluorescence microscopy images of exemplar network morphologies (elongated and fragmented) as references to assign morphologies to the experimental fluorescence micrographs.”

      We targeted similar ratios of elongated and fragmented cells in both vehicle and Tg treated conditions for tomography, but qualitatively saw the expected increase in the elongated population to what has been previously described during Tg treatment. Because of our single cell targeting approach we did not quantify the population shift.”

      Similarly, what is meant by manual classification of IMM, OMM and ER? Is there any clustering involved?

      Our automated segmentation approach labels all membranes, and the separation of the IMM, OMM, and ER membranes is done by an expert user selecting and relabeling each membrane based on cellular context (e.g. IMM is inside of OMM and contains cristae). We have added the following text to clarify our methodology for manual classification of IMM, OMM, and ER:

      “This was followed by manual labeling of membranes into mitochondrial IMM and OMM and ER membrane based on cellular context, as well as manual cleanup of individual membrane segmentations using AMIRA software (Thermo Fisher Scientific).”

      Reviewer #2, minor comments:

      What is meant by growth plane? This term is not defined in the manuscript.

      We clarified and further defined the “cell’s growth plane” in the text by adding the following phrase:

      “… the cell’s growth plane (the plane of electron microscopy grid substrate on which the cell is grown) (Figure 6A).”

      What is meant by vehicle treatment? There is no explanation in the main text of the manuscript.

      We clarified and further defined vehicle treatment in the main text by adding the following:

      “We applied our correlative approach to identify and target specific Tg-treated and vehicle (media with DMSO) treated MEFmtGFP cells with either elongated or fragmented mitochondrial network morphologies for cryo-FIB milling and cryo-ET data acquisition and reconstruction.”

      Have the authors noticed/calculated any differences in the width of the cristae?

      We measure this difference in figure 4C (Intra-crista distance). We found significant changes in width/intra-crista distance in response to Tg treatment in both elongated and fragmented morphologies.

      Methods: Automated surface reconstruction: "In cases where the resulting surface was very complex, the surface was simplified..." How was the complexity determined?

      With the updated state of the software, we simplify all surfaces to generate a maximum of 150,000 triangles. This has minimal effect on very small surfaces, but greatly speeds computation on very large surfaces. We corrected the language to match this:

      “The resulting mesh was simplified with quadric edge collapse decimation to produce a surface that represented the membrane with 150,000 triangles or fewer.”

      Methods: Calculation of distances between individual surfaces: "For surfaces with small numbers of triangles, this was accomplished using a distance matrix...". What is the threshold for a small number of triangles?

      As part of our software overhaul we have changed to always using a more memory-efficient KD tree based quantification, since the additional speed for the distance matrix approach is minimal when there are few enough triangles for it to be appropriate, and the hardwired cutoff was not as flexible for different hardware configurations. The updated text is below, but to satisfy any potential reviewer curiosity, the decision was made when the required distance matrix would use more than 128GB of memory. In the case of two identically sized surfaces, this crossover happens when there are approximately 45,000 triangles in each surface.

      “For calculations of distances between respective surface meshes, the minimum distance from each triangle on one surface to the nearest triangle on the other surface was calculated using a KD-tree.”

      Reviewer #2 (Significance (Required)):

      The aim of the paper is well motivated. Cryo-ET is a growth field and there is a need for quantitative parameterization of cryo-ET data. Recently a toolkit for the analysis of filaments from cryo-ET has been published (Dimchev et al. 2021 DOI: 10.1016/j.jsb.2021.107808). Given the specific nature of the implementation, i.e. the membrane structures of mitochondria, I cannot easily see that this implementation will be useful beyond the analysis of mitochondrial membrane structure.

      We hope that we have addressed this concern with generality has been addressed by our previously described updates to the software implementation.

      4. Description of analyses that authors prefer not to carry out.

      Review 2, minor comments:

      Angle between OMM and cristae: Maybe use the average angle of each cristae for comparison or fit a plane for each cristae because you are interested in the angle between the cristae and the OMM and the membrane of the cristae has a lot of uneven surfaces

      We believe that the advantage of our approach is the ability to incorporate more complex geometric information from uneven surfaces such as those seen in cristae. With that said, the ability to quantify metrics for individual cristae in an automated manner would be very appealing, since in many ways cristae are functionally independent compartments. Accomplishing this would require either subdividing the larger surface into individual cristae, which will require development of additional sub-graph processing strategies. Additionally, pairing surfaces to represent opposite sides of a crista will require additional development. While we agree that this will be an excellent extension of the surface morphometrics approach, we feel that the additional development required is out of the scope of this initial manuscript focused on the general workflow. New methods leveraging sub-graph analysis will be explored in future manuscripts.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Barad, Medina et al. presents a new toolkit for the analysis of membrane ultrastructure in cryo-tomograms. More specifically, the toolkit is designed to compare curvature, angles and spacing between different membrane types in mitochondria. These analyses allow for the quantitative comparison of membrane features e.g. for different growth conditions. To demonstrate the utility of the toolkit tomogram datasets of mitochondria in the presence and absence of ER stress were analyzed. The authors conclude that ER stress affects mitochondria morphology through remodeling of the membrane structure. The presented biological results and statistics are convincing and show active mitochondrial membrane remodeling in the cell when exposed to ER stress. It is also clear that there is a need for more quantitative evaluation based on the wealth of tomographic image features and mitochondrial membranes are certainly a well-chosen application. For this purpose, the authors developed a new workflow even though most of the discussed analyses are very specific to mitochondrial structures. Therefore, broader applications of these tools to other organelles are not easily envisaged without significant adaption. In that context, the title and abstract overpromise a much more powerful utility that can be applied to any other membrane analysis. Rather it seems that the proposed workflow is more of a specific tool or a pipeline for mitochondrial inner and outer membrane analysis instead of a toolkit for general morphological analysis. Hence, the manuscript cannot be accepted in its current form. In particular, the structure needs a significant rework of editing to become more comprehensible.

      Major comments:

      1. Title and abstract need to be toned down not to overpromise a very general toolkit. The presented method may be a tool or a collection of scripts - a toolkit can be used to address other types of (membrane) analysis problems. In the end, the analysis builds to a large extent on the previous developments and implementation of PyCurve. Perhaps, the most interesting contribution here is the application of the mesh generation by the Poisson reconstruction method to the segmented membranes, which is, however, well implemented in the used pymeshlab framework. The computation of distances and angles is straightforward.
      2. When reading the manuscript, the reader is left in the open whether this is a method paper or a biological results paper. The title/abstract suggests that this is a method paper and the manuscript is more of a mitochondrial membrane report in ER stress. Therefore, the title/abstract does not reflect the manuscript very well.
      3. The manuscript also requires substantial structural editing. Several references to Figures are not appearing in the text in the order that the Figure panels are built. Excessive cross-referencing of figures also make the manuscript hard to read.
      4. The focussing to a method paper will also require more in-depth descriptions of the methodology in the main text. Although the code is deposited at github, there is no script-based workflow and description presented in the manuscript. Although Figure 1 puts the work into context of tomography, it remains very superficial on the image analysis. What are the input and output formats required for each step to follow the sequence of the workflow and at which steps critical interactive input is needed? What are the hardware requirements (CPU, GPU) or performance characteristics (CPU hours for certain operations)?
      5. Figures 3-7 contain colorful 3D renderings of the measured quantities. In addition, they are filled with histograms of every possible quantitative parameter, which often are not very significant or different between. The authors should focus the main results and the figures to show the most relevant and significant findings and put the remaining panels and results into the supplement.
      6. The exact morphological discrimination between fragmented and elongated mitochondria is not easily understood from the results section. What is really meant by blinded manual classification? It only became clear when reading the methods. The results section should stand on its own. How is the overall population between fragmented and elongated cells is affected after Tg application?
      7. Similarly, what is meant by manual classification of IMM, OMM and ER? Is there any clustering involved?
      8. One of the key steps is the generation of a smooth surface from a segmented membrane, there is a question whether true membrane disruptions will be smoothed and may be overlooked in this approach. When these disruptions present true membrane ruptures, they may be of particular biological importance. The authors should support the choice and selection of the smoothing parameters in order to illustrate this potential pitfall.
      9. Throughout the manuscript, the authors mention statistical significance several times and one of the main aims of the study is perform statistical hypothesis testing. It is important to specify the significance test (not only in the methods) and the p-value in order to support this claim. In the manuscript, the authors use exclusively the Mann-Whitney test. What is the rationale for choosing this test? Have the authors considered comparing the total distributions and not just the peaks with e.g. a Kolmogorov-Smirnov test? For a statistical methods paper, there are also no discussion on error analysis.

      Minor comments:

      1. https://github.com/grotjahnlab/surface_morphometrics should include an example data set or tutorial for dissemination.
      2. What is meant by growth plane? This term is not defined in the manuscript.
      3. What is meant by vehicle treatment? There is no explanation in the main text of the manuscript.
      4. Angle between OMM and cristae: Maybe use the average angle of each cristae for comparison or fit a plane for each cristae because you are interested in the angle between the cristae and the OMM and the membrane of the cristae has a lot of uneven surfaces
      5. Have the authors noticed/calculated any differences in the width of the cristae?
      6. Methods: Automated surface reconstruction: "In cases where the resulting surface was very complex, the surface was simplified..." How was the complexity determined?
      7. Methods: Calculation of distances between individual surfaces: "For surfaces with small numbers of triangles, this was accomplished using a distance matrix...". What is the threshold for a small number of triangles?

      Significance

      The aim of the paper is well motivated. Cryo-ET is a growth field and there is a need for quantitative parameterization of cryo-ET data. Recently a toolkit for the analysis of filaments from cryo-ET has been published (Dimchev et al. 2021 DOI: 10.1016/j.jsb.2021.107808). Given the specific nature of the implementation, i.e. the membrane structures of mitochondria, I cannot easily see that this implementation will be useful beyond the analysis of mitochondrial membrane structure.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Barad and Medina, along with their co-authors, report on the development of a new software toolkit to quantitatively assess membrane structures that are observed in cryo-ET. This new toolkit builds upon existing methodologies by successfully incorporating additional methods and applying this to cryo-ET data to allow for more automated and reliable segmentations. This work addresses a long-standing difficulty in generating membrane segmentations, which are either done manually with huge labor investments or with automated methods that are known to be error prone. The authors demonstrate that their toolkit can generate high quality segmentations across multiple tomograms with limited manual intervention. They use correlative light and electron microscopy in combination with these segmentations to gain insight into the ultrastructural morphology of mitochondria within embryonic fibroblasts, both under control conditions and under endoplasmic reticulum stress induced by treatment with the drug Thapsigarin. Unlike changes to the ER which are more dramatic under stressed conditions, the changes to the mitochondria are more subtle and impossible to quantify without high quality segmentations. The authors show that inner and outer membrane distances change under stress, and that the distances between cristae, their junctions, and the angle of the cristae with respect to the margin of the mitochondria change. While they characterize the curvedness under the same set of conditions, they report no significant differences.

      Major comments:

      • A major concern is that the data are reported and analyzed on a per tomogram basis when many tomograms contain multiple mitochondria. Given that the mitochondria appear mostly well separated in Sup. Fig 1 with only a few connections visible, and the high degree of pleomorphism noted by the authors, I would strongly suggest that the authors use each mitochondrion as the basis for reporting their metrics rather than the FOV/tomogram as this would avoid mixing metrics from different mitochondria that may be in different states (e.g., fusion/fission). This would apply to data shown in Figures 3, 4, 5, and 6.
      • In Figure 3C the authors show the combined distribution of OMM-IMM distances within each condition. This may obscure some variability within populations. Individual histograms for all mitochondria should be included as supplementary material. Currently, it is difficult to judge if the peak of the combined distribution is appropriate and impossible to judge the variability between tomograms (preferably mitochondria, see above comment). Additionally, the shape of the distributions appears significantly different between conditions, suggesting that selecting a single peak value as representative and the basis for the statistical tests (Fig 3D) might not be appropriate. Please comment.
      • In Figure 4C-F, again combined distributions are shown. Authors should include individual histograms for all mitochondria as supplementary material. The diversity of distributions in the metrics are more pronounced than the distances in reported in Fig 3, again making assessment of variability difficult and raising doubt about using the single peak value.
      • It would be helpful to include the curvature or curvedness of the OMM for each mitochondrion in the supplementary material. The data to correlate OMM curvature with elongated/fragmented mitochondria should be available and might be of interest to some readers.
      • As the work reported here is heavily computational, additional details about the computer hardware used and the time it took for the calculations to complete would be helpful for readers considering applying the code to their own data.
      • Discussion should be expanded to include a comparison of semi-automated segmentations generated here versus manual results from Navarro (Ref 35) & Burt (Ref 54 / doi: 10.1371/journal.pbio.3001319) and how one might estimate the error.

      Minor comments:

      • In the fourth sentence of the third paragraph of the introduction, Hoppe 1992 is cited as evidence of the limitations of work published in 2020, which is confusing. Perhaps the sentence can be re-phrased?
      • Pink and purple very close, consider alternative pair of colors or different shades to distinguish OMM and IMM
      • For all data, exact n per condition should be given (in text and captions as appropriate), not a range for the whole set.
      • Orientation of scaleboxes/scalebars should be consistent per figure panel. If knowledge of the axes is important to the reader, these should be included as well.
      • In the last sentence of the introduction, the term "organellar architectures" is used, instead of the previously defined "membrane ultrastructure." Consider changing for clarity.
      • Inconsistent use of the phrase "cryo-electron tomography" after defining and using "cryo-ET"
      • Authors argue that the distinction between curvedness and curvature is important and that curvature is less appropriate in this context, but then use curvature in the abstract, throughout introduction and in the results section. Usage can be improved for readability.
      • In section "Development of a framework to automate quantification of ultrastructural features of cellular membranes" the second last sentence should read "... higher quality membrane surfaces as compared..."
      • In section "IMM curvedness is differentially sensitive to Tg treatment in elongated and fragmented mitochondrial networks" the fourth sentence should perhaps read "... despite apparent visual differences, no significant..."
      • The term "cell's growth plane" is not clear from the text nor from Fig 6A. Do the authors mean surface of the substrate the cell is growing on?
      • In Materials and Methods:
        • The authors report that manual back-blotting was used in a Vitrobot. This is non-standard usage and more details should be provided.
        • The description of the Leica microscope is insufficient. The objective lens and camera used should be included.
        • In section "Fluorescence Guided Milling" in the third sentence, the word "based" is repeated, second can be removed. A second Pt coat on top of the GIS would also be unusual, please check writing for accuracy.
        • Symbol for degree (or the word degree) should be added to angular increment and tilt range for clarity.
        • Capitalization of TomoSegMemTV is inconsistent.
      • Fig 1B: showing computational steps twice does not provide additional information. Consider just one example. Also, labels for elongated and fragmented would be more useful than the duplicated labels for each computational step.
      • Fig 2A caption - should report actual thickness range measured (as given in Materials and Methods section) instead of estimated range.
      • Fig 3 title - consider replacing "Inter-mitochondrial membrane..." with "Intra-mitochondrial membrane..." for clarity.
      • Fig 3C caption - should explicitly state it is a combined histogram and that the dashed lines correspond to the peak of the pooled data.
      • Fig 5E middle, legend obscures some of the data.
      • Fig 6B and 6C caption - upper and lower parts not explicitly described.

      Significance

      This work primarily describes a technical advancement in methods to analyze cryo-ET data. The novelty arises from the combination of methods and their application rather than completely new ideas or approaches. Demonstrations of the utility of this toolkit based on the authors' analyses are convincing and will likely help a number of researchers in the field who are engaged in explorations of cellular ultrastructure and organelle responses to stimuli. Importantly, this work will help the field move past qualitative descriptions, historically accepted only because quantitative measurements at this level have not been feasible. Overall, in this reviewer's opinion, while the biological findings are modest, the utility of the toolkit for the field is indisputable and the work is of sufficient quality for publication.

      My expertise is in cryo-EM, both single particle analysis and tomography, as well as CLEM workflows, applied mostly to cytoskeletal research and some ER stress. I do not have or strong background in mitochondrial biology nor sufficient computer science expertise to evaluate the numerical methods employed, but based on inspection of the github contents, the screened Poisson reconstruction algorithm is not reimplemented here.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Responses to reviewers’ comments are in blue text, original reviewers’ comments in black text.

      Response to Reviewer 1.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): In this manuscript Neiro et al. aim to expand our knowledge on the regulation of gene expression in stem cells of the planarian model organism. As a first step the authors used published available data to expand the repertoire of the planaria transcriptome. By combining 183 RNAseq datasets the authors were able to identify thousands of new coding and non-coding transcripts. They then screened for TF motifs in the new annotations, identifying 551 putative TFs, of which 248 were already described in the planarian literature. The most substantial contribution of this work to the field of stem cells and planaria biology is the characterization of new putative enhancers that were identified by performing H3K27ac ChIP-seq and ATAC-seq and combining these data with previously published H3K4me1 ChIPseq dataset.

      We thank the reviewer for their careful assessment of our work, we agree that the identification of likely enhancers genome wide is a substantial contribution. Equally the improved annotation of all genes, including transcription factors we choose to focus on here, is a substantial step forward for the planarian research community.

      By overlapping H3K27ac and H3K4me1the authors find 5,529 new enhancers, for which they report a higher chromatin accessibility than random points in the genome as assessed by ATAC-seq. By using ATAC-footprints Neiro et al. refined the subset of TFs that have binding motifs in the predicted enhancer-like regions and present a list of 22,489 such factors. The manuscript is well written and organized and overall, the reported data will provide an important resource to study gene expression regulation in planaria's stem cells. However, this manuscript would greatly benefit from some functional validation to support the predicted gene regulatory networks. One option would be to use a CRISPR-dCas9-KRAB system to silence the putative enhancers identified in the manuscript and check by qPCR the expression of nearby genes.

      Currently mis-expression technologies, in order too directly test enhancer elements in driving expression, are still not available in planarians. This also preempts us using the suggested silencing system used in mammals and other animals with robust mis-expression tools.

      If this type of experiment is not feasible in planaria (I am not an expert in this model organism) another simple but key experiment would be to perform a knockdown of one (or more) putative enhancer-bound TFs identified in this study followed by RNA-seq. This would allow the authors to verify what are the target genes of the putative enhancer-bound TFs and if they correspond to the predicted gene networks they identified. Simultaneously, this experiment would allow the authors to verify if there are any changes in the expression of differentiation/pluripotency markers as a result of the knockdown of the putative enhancer-bound TF.

      These experiments are possible, but this would be the work of many labs in the future expert in studying those TFs and their roles in planarian stem cells and regeneration. However, what we can do is analyze existing RNA-seq data further. There are a number of studies where TF have been studied and RNA-seq performed after RNAi. Although these studies are performed in specific experimental regenerative contexts, and not specifically in stem cells, it will be possible to look at expression changes of genes with predicted enhancers bound by these TFs. We propose to execute this analysis and add it to the manuscript, rather than perform further TF RNAi experiments. This analysis is feasible within a 3-month revision time. We would add that currently their no genes are implicated in controlling pluripotency in the same way we might consider, for example, OSKM in mammals. Our identification of the TFs enriched in stem cell expression and implicated in binding predicted enhancers suggests future candidates.

      Minor revision: • The authors have mostly focused on the identification of enhancer-bound TFs. However, it would be interesting to look at differential enrichment of TFs in promoters versus enhancers and identify if there are specific factors that are enriched specifically at the planarian newly identified enhancer regions.

      We have not looked at potential TF binding sites near promoters/transcriptional start sites. We will try to add an analysis that considers this in our revision.

      • All tornado plots are missing a colorbar (Fig3 and FigS2)

      We will fix this error

      • There is a typo in the discussion: "the combined use of chip-seq data, RNAi of a histone methyltransferase combines with chip-seq" should be changed to "combined".

      We will fix this and other typographical errors.

      Reviewer #1 (Significance (Required)):

      The manuscript is well written and organized and overall the reported data will provide an important resource to study gene expression regulation in planaria's stem cells.

      We thank the reviewer for their appreciation of our work

      **Referees cross-commenting**

      I agree with the other reviewers that additional functional data should be added to support the author's claims (such as knock down of potential TFs that are identified by computational analyses and assessing the impact on gene expression).

      See response above, with regard to adding further analysis for testing this possibility.

      In addition, as noticed by the third reviewer, all data should be made publicly available to the scientific community.

      We have made all data publicly available and will submit all relevant data to public database repositories in advance of final publication after final peer review.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      This manuscript aims at identifying enhancers in the planarian Schmidtea mediterranea. The authors start with the integration of transcriptome with genome sequencing data to more precisely annotate the genome of the planarian Schmidtea mediterranea. The second part of the manuscript actually then deals with the identification of potentially active enhancer elements in adult stem cells of this regenerating organism using genomic techniques like ATAC-seq and ChIP-seq of histone marks combined with motif searches and in silico footprint analysis. Using these data, the authors predict regulatory interactions potentially critical for pluripotency and regeneration in planarian adult stem cells.

      MAJOR COMMENTS:

      • Are the key conclusions convincing? 1) The authors claim (already in the abstract) that their study identifies enhancers regulating adult stem cells and regenerative mechanisms. This is an over-statement found throughout the manuscript, as none of these enhancers are functionally tested nor is it shown that target gene expression changes when transcription factors predicted to interact with such enhancers are knocked down.

      We agree and it was not our intention to overstate our results, this is why we have tried to refer to putative enhancers, enhancer-like elements etc in manuscript from the title onwards. Only once we have demonstrated a set of elements with key conserved and widely supported characteristics do we suggest we have a set of higher confidence enhancers to study. However, we will adjust the manuscript to reflect that our claims await direct testing as is the case for all enhancers implicated with the approaches used here.

      Another example is at the end of paragraph 1 of section 2.4. Here the authors claim that identifying many fate-specific transcription factor genes in the vicinity of potential enhancers is a further proof that the identified regions represent "real enhancers". It strongly supports this hypothesis, but no evidence for real enhancer activity.

      We agree the total body of evidence strongly supports that we have identified enhancer elements, but as above will adjust the language to suggest further directed functional work will follow from many groups.

      Thus, although the authors state that the regulatory interactions and networks they predict from their data can be studied now in future, they should be more careful with their wording and correct these over-statements. Therefore, the key conclusion is that they identified by various techniques potential enhancers, which are close to genes controlling adult stem cells and potentially controlling these genes, which has to be shown by further analyses.

      We agree

      Thus, also the title needs to be changed.

      We propose changing ‘enhancer-like’ to “predicted enhancers” in the title, and "defines" to "predicts" as well as broadly adjusting the text to caveat that further work will clarify their functions and roles.

      The authors have no proof that the networks are active in planarian adult stem cells, as they do not show that the predicted networks are active in the presented way.

      We agree, see comments above. It was not our attention to claim we are showing pathways that were definitely active, rather predicted by our experiments and analyses of the data from these experiments.

      2) Similarly, the identification of TF motifs within these potential motifs strongly suggests but not shows that these factors are binding, even when these sites were found to be bound by a protein using the ATAC-seq footprinting analysis. Thus, the authors need to be careful with their wording. One example is in the second paragraph of section 2.5, where the authors write that "We found that numerous FSTFs were binding to putative intronic enhancers ... ". The motif suggests that these factors bind, however, they have no experimental confirmation that these sequences are indeed bound by the planarian TFs.

      We agree. We will clarify that ATAC foot printing is the only data suggestive of these motifs being bound and that further experiments will be required for more evidence. We will state this in the section of results and add this explicitly to the discussion

      In sum, this manuscript uses existing genomic tools to define potential enhancer regions in the planarian Schmidtea mediterranea. The manuscript is informative yet descriptive, as tit presents no functional evidence for any of the predictions. If further toned down, the key conclusions are valid.

      Future functional experiments to test the roles of all TFs and enhancers is now possible due to our work.The combination of data and analyses provides strong support of enhancer elements activity in stem cells across the genome.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether? The experiments performed are well designed and in line with what is known in the field about enhancer architecture. However, as this model system is not very well characterized on that level and the authors do not provide real experimental evidence that any of the identified regions has really enhancer activity and that any of the identified motifs binds indeed the predicted TF, the authors need to be very careful with their statements. The authors should maybe emphasize even stronger that all the GRNs predicted under section 2.6 are really preliminary and need to be validated.

      Yes, we are happy to be even clearer about this as the reviewer suggests

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. One experiment that could provide more evidence for their predicted regulatory interactions is to knock-down one of the FSTFs for which motifs have been identified in potential enhancer regions and to study expression of associated genes (to confirm that the enhancers potentilla bound by these TFs control the expression of associated genes) or by analyzing the chromatin status of selected chromatin regions (by Q-PCR). These experiments would strongly support the claims of the authors. However, it also depends strongly on the journal whether I would consider these experiments essential or "nice to have".

      This suggestion of possible extra experiments is very similar to that of Reviewer 1. We are copying our earlier comment as this also addresses this point.

      “These experiments are possible, but this would be the work of many labs in the future expert in studying those TFs and their roles in planarian stem cells and regeneration. However, what we can do is analyze existing RNA-seq data further. There are a number of studies where TF have been studied and RNA-seq performed after RNAi. Although these studies are performed in specific experimental regenerative contexts, and not specifically stem cells, it will be possible to look at expression changes of genes with predicted enhancers bound by these TFs. We propose to execute this analysis and add it to the manuscript, rather than perform further TF RNAi experiments. This analysis is feasible within a 3-month revision time. We would add that currently their no genes implicated in controlling pluripotency in the same way we might consider OSKM in mammals. Our identification of the TFs enriched in stem cell expression and implicated in binding predicted enhancers suggests future candidates.”

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments. This reviewer is not an expert in Schmidtea mediterranea, thus it is hard to judge how time consuming these experiments would be. Cost-wise they should be feasible, as it would include primarily Q-PCR experiments. And some functional back-up of their claims would be very helpful.

      See previous comment regarding additional analysis.

      • Are the data and the methods presented in such a way that they can be reproduced? For the parts I can judge, yes.

      • Are the experiments adequately replicated and statistical analysis adequate? It is not clear from the manuscript how many replicates of the ChIP-seq experiments were done.

      Chip-Seq replicate data description will be explicitly added to the methods

      MINOR COMMENTS:

      • Specific experimental issues that are easily addressable.

      • Are prior studies referenced appropriately? For the literature I can judge, yes.

      • Are the text and figures clear and accurate? The figures are clear, the text (besides over-statements) is clear. However, the writing can be improved. A few examples: section 2.2 paragraph 1: "... we found 248 to be described in the planarian literature in some way." In which way described?; same paragraph: "... but significantly we could identify new homologs of ..." what does significantly mean? Which test etc? section 2.2, last paragraph: "Most TFs assigned to the X1 and Xins compartments and the least to the X2 compartment", "Very few TFs had expression in X1s and Xins to the exclusion of X2 expression as would be expected by overall lineage relationships"; what do these sentences mean?

      We thank the reviewer for paying careful attention to the language in our manuscript throughout. We will provide clearer explanation of the sentences indicated. We will better explain terms specific to the planarian model system that are obviously not intuitive

      . - Do you have suggestions that would help the authors improve the presentation of their data and conclusions? No over-statements.

      See previous comments agreeing with the need to carefully adjust our language to avoid this

      Reviewer #2 (Significance (Required)):

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field. This manuscript identifies genome-wide potential enhancers in adult planarian stem cells, and thus represents a very valuable resource for the community to study these enhancers and the gene regulatory networks they control in the future.

      • Place the work in the context of the existing literature (provide references, where appropriate). As I am not a planarian scientist, it is hard to judge this part.

      • State what audience might be interested in and influenced by the reported findings. In my opinion, this work will be primarily interesting for people working with planarian. When functional data exist, this might be also interesting for researchers working generally on regeneration.

      Given the nature of our data we also think all groups working on animal stem cells would be interested in our data and analyses

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate. My field of expertise is transcriptional regulation using genomic techniques, however I am not familiar with the model Schmidtea mediterranea.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Neiro et al. capitalize on existing genomic data for the planarian Schmidtea mediterranea and new ChIP-seq and ATAC-seq data to use computational approaches to identify putative enhancers in the planarian genome. They integrate analysis of enhancers with transcription factor binding sites to generate testable hypotheses for the regulatory function of transcription factors active in stem cells or control of cell lineage trajectories. Their work creates an excellent resource for future work to resolve the regulatory logic underpinning stem cell biology and tissue regeneration in planarians.

      We are glad the reviewer likes our research.

      Major: Overall, the work in this manuscript and methodology are well executed and presented. However, the authors should consider the following comments to improve the clarity and accessibility of the data and interpretations.

      1) The new transcriptome does not appear to be publically accessible. The links to Github resources are broken, and there is nothing on Neiro's Github page. Will the new transcriptome be integrated with Planmine?

      The new annotation has been available for over a year as we wished the community to have access to it ASAP (see Garcia Castro, 2021, Genome Biology https://doi.org/10.1186/s13059-021-02302-5). We tested the links in the paper before depositing our preprint and after review and they seemed to work for us both within and outside our institutional network. We can only apologize if they were broken or have not worked for the reviewer. We are unclear if this new annotation will be included in Planmine, but we will ask the colleagues maintaining this database to consider including it.

      2) Figure 1: Ternary plot in 1F. The legend is not clear or could be explained better. What is the metric? It could be my misunderstanding, but I didn't consider the ternary plots as insightful or unnecessary. Perhaps the authors can expand on what they are showing.

      These plots are important in demonstrating the distribution of mRNA expression of all genes across cell sorted compartments. Given the broad lineage relationship between sorted cell compartments This analysis allows us to identify genes expressed predominantly in one cell compartment or another, or across a specific transition. For example, genes enriched in X2 cells and Xins, but not X1 are likely to be enriched in post-mitotic differentiating progeny and differentiated cells. In contrast to single cell data where expression data can be sparse this analysis with bulk data allows identification and assignation of low expressed genes, like transcription factors. We will provide further explanation of this in the revised text.

      1I is a map of exons, not alternative splicing. So, it isn't clear what the authors intend t show. Are the specific exons that are more likely to be spliced? Is the figure necessary?

      We wish to demonstrate the power of annotation approach and the richness of the annotation for looking at alternate splicing. We propose to a more informative figure that indicates the variety of splice forms. We apologize for this oversight.

      3) Figure 2: 2A labels Xins as irradiation responsive. Is this the case (just making sure)?

      The reviewer is correct, this is wrong! This should read “irresponsive” or “irradiation resistant” In Figure 1A. We thank the reviewer for spotting this error. We will fix this.

      2F-G: Ternary plot in F seems redundant with G, but that could be my lack of understanding. In 2G, what is represented on the plots on the right of the hierarchical clusters?

      The ternary plot (2F) and heatmap of hierarchical clustering (2G) are complementary ways to visualize the proportional expression values of transcription factors. The ternary plot (2F) allows an overview of all the proportional expression values, while the heatmap (2G) shows how the proportional values may be grouped into clusters of similar expression profiles and displays the relative size of these clusters. For example, the heatmap shows that the clusters of X1 and Xins are more prominent than X2, suggesting that there are realtivey a few X2-specific transcription factors. We will add text to better to explain this difference.

      4) Figure 3: The heat maps need a legend (i.e., please define the colors). In addition, labeling the figures could help the reader. For example, in G-J, a header about the different experiments above each map, such as "enhancers" and "random," etc., would make the figure more accessible.

      We agree we label the figures to be more easily interpretable and provide an independent scale and legend for the heatmaps.

      5) Figure 5: Although it is in the figure legend, the authors could label the 6th track as "RNA-seq in X1."

      We will add this to the figure.

      6) Section 2.6 second page last sentence of the first paragraph "GRN of asexual reproduction is not active in neoblasts" data in the supplement? Is it not shown?

      We apologize for this poorly written sentence. In line with Reviewer 2s comments this statement needs to be toned down and clarified. The raw information is included in the general table of enhancers (Supplementary Table 2), but the genomic tracks visually highlighting the motifs at the promoters of lox5b and post2b were not included. We will add these to the Supplementary information and clarify Supplementary Table 2.

      7) Discussion: The discussion about pluripotency factors in planarians could be expanded. The authors could contrast the study's findings with Önal et al. 2012.

      We agree we will expand our discussion to compare with previous studies and also summarize what is available from other animals with pluripotent adult stem cells

      Minor: The manuscript has no page numbers or line numbers, so I'll provide a general location of the potential issues.

      1) Section 2 - newly identified isoforms are shorter (1656 vs. 1618). Is the order of the median length reversed?

      Yes, we will correct this.

      2) No mention of Figure S1B in the text.

      It is mentioned in the paragraph regarding splicing, but perhaps not in a useful context. We will add a correct reference to this figure in the presentation of transcript diversity.

      3) Figure 1H should be 1I in the text?

      Yes, we will correct this

      4) The discussion contains some minor typos and grammatical errors.

      We will address with careful rereading.

      We thank the reviewer for spotting these errors and we will fix them in revision.

      Reviewer #3 (Significance (Required)):

      Neiro et al. provide an excellent resource for the planarian community. The paper is generally very well written and easy to read. The new transcriptome described, which improves the annotation of the planarian genome, should be made readily available. It would be excellent if the transcriptome could be incorporated in Planmine.

      We will ask Planmine and the Rink lab to consider this. The annotation (without broad analysis) has been available since the pre-print for Garcia Castro, 2021, Genome Biology was deposited in BioRxiv.

      Furthermore, the authors provide a comprehensive list of transcription factors in the planarian Schmidtea mediterranea. Their work provides insight into which factors are highly expressed in the stem cell compartment. Their computational identification of transcription factors and putative enhancers will be helpful to the growing community of researchers studying stem cell and regenerative biology using planarians. In addition, the large dataset generated in this study could inform studies in the evolution of regulatory sequences and transcription factor function.

      **Referees cross-commenting**

      The data presented are well supported by previous studies. As noted by the authors, it is not possible to make transgenic planarians, and thus the field needs to rely on indirect methods. The authors focus on using the stem cell population, which can be isolated from the animals. Overall, I don't think additional experiments are necessary. Additional RNAi experiments combined with RNA-seq (using the stem cells) could take 6-12 months to complete. I believe this is a solid contribution that should be framed as a resource paper. The authors should pay close attention to Reviewer #2's suggestions and edit the paper accordingly.

      I have 20 years of experience in the field. It would be unreasonable to ask the authors to do more experiments, especially in this post-pandemic environment. I hope this helps.

      We thank the reviewer for the comments.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Neiro et al. capitalize on existing genomic data for the planarian Schmidtea mediterranea and new ChIP-seq and ATAC-seq data to use computational approaches to identify putative enhancers in the planarian genome. They integrate analysis of enhancers with transcription factor binding sites to generate testable hypotheses for the regulatory function of transcription factors active in stem cells or control of cell lineage trajectories. Their work creates an excellent resource for future work to resolve the regulatory logic underpinning stem cell biology and tissue regeneration in planarians.

      Major:

      Overall, the work in this manuscript and methodology are well executed and presented. However, the authors should consider the following comments to improve the clarity and accessibility of the data and interpretations.

      1. The new transcriptome does not appear to be publically accessible. The links to Github resources are broken, and there is nothing on Neiro's Github page. Will the new transcriptome be integrated with Planmine?
      2. Figure 1: Ternary plot in 1F. The legend is not clear or could be explained better. What is the metric? It could be my misunderstanding, but I didn't consider the ternary plots as insightful or unnecessary. Perhaps the authors can expand on what they are showing.

      1I is a map of exons, not alternative splicing. So, it isn't clear what the authors intend t show. Are the specific exons that are more likely to be spliced? Is the figure necessary? 3. Figure 2: 2A labels Xins as irradiation responsive. Is this the case (just making sure)?

      2F-G: Ternary plot in F seems redundant with G, but that could be my lack of understanding. In 2G, what is represented on the plots on the right of the hierarchical clusters? 4. Figure 3: The heat maps need a legend (i.e., please define the colors). In addition, labeling the figures could help the reader. For example, in G-J, a header about the different experiments above each map, such as "enhancers" and "random," etc., would make the figure more accessible. 5. Figure 5: Although it is in the figure legend, the authors could label the 6th track as "RNA-seq in X1." 6. Section 2.6 second page last sentence of the first paragraph "GRN of asexual reproduction is not active in neoblasts" data in the supplement? Is it not shown? 7. Discussion: The discussion about pluripotency factors in planarians could be expanded. The authors could contrast the study's findings with Önal et al. 2012.

      Minor:

      The manuscript has no page numbers or line numbers, so I'll provide a general location of the potential issues.

      1. Section 2 - newly identified isoforms are shorter (1656 vs. 1618). Is the order of the median length reversed?
      2. No mention of Figure S1B in the text.
      3. Figure 1H should be 1I in the text?
      4. The discussion contains some minor typos and grammatical errors.

      Significance

      Neiro et al. provide an excellent resource for the planarian community. The paper is generally very well written and easy to read. The new transcriptome described, which improves the annotation of the planarian genome, should be made readily available. It would be excellent if the transcriptome could be incorporated in Planmine.

      Furthermore, the authors provide a comprehensive list of transcription factors in the planarian Schmidtea mediterranea. Their work provides insight into which factors are highly expressed in the stem cell compartment. Their computational identification of transcription factors and putative enhancers will be helpful to the growing community of researchers studying stem cell and regenerative biology using planarians. In addition, the large dataset generated in this study could inform studies in the evolution of regulatory sequences and transcription factor function.

      Referees cross-commenting

      The data presented are well supported by previous studies. As noted by the authors, it is not possible to make transgenic planarians, and thus the field needs to rely on indirect methods. The authors focus on using the stem cell population, which can be isolated from the animals. Overall, I don't think additional experiments are necessary. Additional RNAi experiments combined with RNA-seq (using the stem cells) could take 6-12 months to complete. I believe this is a solid contribution that should be framed as a resource paper. The authors should pay close attention to Reviewer #2's suggestions and edit the paper accordingly.

      I have 20 years of experience in the field. It would be unreasonable to ask the authors to do more experiments, especially in this post-pandemic environment. I hope this helps.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This manuscript aims at identifying enhancers in the planarian Schmidtea mediterranea. The authors start with the integration of transcriptome with genome sequencing data to more precisely annotate the genome of the planarian Schmidtea mediterranea. The second part of the manuscript actually then deals with the identification of potentially active enhancer elements in adult stem cells of this regenerating organism using genomic techniques like ATAC-seq and ChIP-seq of histone marks combined with motif searches and in silico footprint analysis. Using these data, the authors predict regulatory interactions potentially critical for pluripotency and regeneration in planarian adult stem cells.

      Major comments:

      • Are the key conclusions convincing?
        1. The authors claim (already in the abstract) that their study identifies enhancers regulating adult stem cells and regenerative mechanisms. This is an over-statement found throughout the manuscript, as none of these enhancers are functionally tested nor is it shown that target gene expression changes when transcription factors predicted to interact with such enhancers are knocked down. Another example is at the end of paragraph 1 of section 2.4. Here the authors claim that identifying many fate-specific transcription factor genes in the vicinity of potential enhancers is a further proof that the identified regions represent "real enhancers". It strongly supports this hypothesis, but no evidence for real enhancer activity. Thus, although the authors state that the regulatory interactions and networks they predict from their data can be studied now in future, they should be more careful with their wording and correct these over-statements. Therefore, the key conclusion is that they identified by various techniques potential enhancers, which are close to genes controlling adult stem cells and potentially controlling these genes, which has to be shown by further analyses. Thus, also the title needs to be changed. The authors have no proof that the networks are active in planarian adult stem cells, as they do not show that the predicted networks are active in the presented way.
        2. Similarly, the identification of TF motifs within these potential motifs strongly suggests but not shows that these factors are binding, even when these sites were found to be bound by a protein using the ATAC-seq footprinting analysis. Thus, the authors need to be careful with their wording. One example is in the second paragraph of section 2.5, where the authors write that "We found that numerous FSTFs were binding to putative intronic enhancers ... ". The motif suggests that these factors bind, however, they have no experimental confirmation that these sequences are indeed bound by the planarian TFs.

      In sum, this manuscript uses existing genomic tools to define potential enhancer regions in the planarian Schmidtea mediterranea. The manuscript is informative yet descriptive, as tit presents no functional evidence for any of the predictions. If further toned down, the key conclusions are valid. - Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      The experiments performed are well designed and in line with what is known in the field about enhancer architecture. However, as this model system is not very well characterized on that level and the authors do not provide real experimental evidence that any of the identified regions has really enhancer activity and that any of the identified motifs binds indeed the predicted TF, the authors need to be very careful with their statements. The authors should maybe emphasize even stronger that all the GRNs predicted under section 2.6 are really preliminary and need to be validated. - Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      One experiment that could provide more evidence for their predicted regulatory interactions is to knock-down one of the FSTFs for which motifs have been identified in potential enhancer regions and to study expression of associated genes (to confirm that the enhancers potentilla bound by these TFs control the expression of associated genes) or by analyzing the chromatin status of selected chromatin regions (by Q-PCR). These experiments would strongly support the claims of the authors. However, it also depends strongly on the journal whether I would consider these experiments essential or "nice to have". - Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      This reviewer is not an expert in Schmidtea mediterranea, thus it is hard to judge how time consuming these experiments would be. Cost-wise they should be feasible, as it would include primarily Q-PCR experiments. And some functional back-up of their claims would be very helpful. - Are the data and the methods presented in such a way that they can be reproduced?

      For the parts I can judge, yes. - Are the experiments adequately replicated and statistical analysis adequate?

      It is not clear from the manuscript how many replicates of the ChIP-seq experiments were done.

      Minor comments:

      • Specific experimental issues that are easily addressable.
      • Are prior studies referenced appropriately?

      For the literature I can judge, yes. - Are the text and figures clear and accurate?

      The figures are clear, the text (besides over-statements) is clear. However, the writing can be improved. A few examples: section 2.2 paragraph 1: "... we found 248 to be described in the planarian literature in some way." In which way described?; same paragraph: "... but significantly we could identify new homologs of ..." what does significantly mean? Which test etc? section 2.2, last paragraph: "Most TFs assigned to the X1 and Xins compartments and the least to the X2 compartment", "Very few TFs had expression in X1s and Xins to the exclusion of X2 expression as would be expected by overall lineage relationships"; what do these sentences mean? - Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      No over-statements.

      Significance

      • Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      This manuscript identifies genome-wide potential enhancers in adult planarian stem cells, and thus represents a very valuable resource for the community to study these enhancers and the gene regulatory networks they control in the future. - Place the work in the context of the existing literature (provide references, where appropriate).

      As I am not a planarian scientist, it is hard to judge this part. - State what audience might be interested in and influenced by the reported findings.

      In my opinion, this work will be primarily interesting for people working with planarian. When functional data exist, this might be also interesting for researchers working generally on regeneration. - Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      My field of expertise is transcriptional regulation using genomic techniques, however I am not familiar with the model Schmidtea mediterranea.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript Neiro et al. aim to expand our knowledge on the regulation of gene expression in stem cells of the planarian model organism.

      As a first step the authors used published available data to expand the repertoire of the planaria transcriptome. By combining 183 RNAseq datasets the authors were able to identify thousands of new coding and non-coding transcripts. They then screened for TF motifs in the new annotations, identifying 551 putative TFs, of which 248 were already described in the planarian literature. The most substantial contribution of this work to the field of stem cells and planaria biology is the characterization of new putative enhancers that were identified by performing H3K27ac ChIP-seq and ATAC-seq and combining these data with previously published H3K4me1 ChIPseq dataset. By overlapping H3K27ac and H3K4me1the authors find 5,529 new enhancers, for which they report a higher chromatin accessibility than random points in the genome as assessed by ATAC-seq. By using ATAC-footprints Neiro et al. refined the subset of TFs that have binding motifs in the predicted enhancer-like regions and present a list of 22,489 such factors.

      The manuscript is well written and organized and overall, the reported data will provide an important resource to study gene expression regulation in planaria's stem cells. However, this manuscript would greatly benefit from some functional validation to support the predicted gene regulatory networks. One option would be to use a CRISPR-dCas9-KRAB system to silence the putative enhancers identified in the manuscript and check by qPCR the expression of nearby genes.

      If this type of experiment is not feasible in planaria (I am not an expert in this model organism) another simple but key experiment would be to perform a knockdown of one (or more) putative enhancer-bound TFs identified in this study followed by RNA-seq. This would allow the authors to verify what are the target genes of the putative enhancer-bound TFs and if they correspond to the predicted gene networks they identified. Simultaneously, this experiment would allow the authors to verify if there are any changes in the expression of differentiation/pluripotency markers as a result of the knockdown of the putative enhancer-bound TF.

      Minor revision:

      • The authors have mostly focused on the identification of enhancer-bound TFs. However, it would be interesting to look at differential enrichment of TFs in promoters versus enhancers and identify if there are specific factors that are enriched specifically at the planarian newly identified enhancer regions.
      • All tornado plots are missing a colorbar (Fig3 and FigS2)
      • There is a typo in the discussion: "the combined use of chip-seq data, RNAi of a histone methyltransferase combines with chip-seq" should be changed to "combined".

      Significance

      The manuscript is well written and organized and overall the reported data will provide an important resource to study gene expression regulation in planaria's stem cells.

      Referees cross-commenting

      I agree with the other reviewers that additional functional data should be added to support the author's claims (such as knock down of potential TFs that are identified by computational analyses and assessing the impact on gene expression). In addition, as noticed by the third reviewer, all data should be made publicly available to the scientific community.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study the authors present the discovery of two new splicing factors NRDE2 and CCDC174 that interact with the U1 snRNA and with 5' splice sites and modulate usage of 5' splice sites with generally weaker pairing potential to the U1 snRNA. They develop a new cross-linking technique BCLIP for monitoring RNAs interacting with a particular protein by deep sequencing, which modifies the classical CLIP protocol and appears to allow them to detect interactions with proteins of low abundance such as NRDE2. They propose that NRDE2 and CCDC174 may form alternative U1 snRNP complexes distinct from the canonical U1 snRNP, which may be partly responsible for selecting alternative, weaker 5'SS.

      The study provides a plethora of experiments that provide strong experimental support for a model in which NRDE2 interacts with the U1 snRNA, recruits CCDC174, and together they tend to promote correct usage of weak 5' splice sites that are often flanked by several weak, cryptic 5' splice sites. The RNA-Seq supports a genome-wide role for NRDE2 in promoting splicing of weaker 5'-splice sites, while the in vivo reporter assays are elegant experiments showing a role for NRDE2 in enforcing correct usage of the most upstream weak 5'splice site.

      While the authors provide strong evidence in support of the main model proposed in their discussion, there are a few significant matters that are not addressed. Firstly, the fact that only a small proportion of 5'SS bound by NRDE2 appears to be sensitive to NRDE2 KO, even when translation-linked surveillance is blocked by cycloheximide, raises the possibility that the RNA-Seq technique may miss a significant proportion of transcripts that are unspliced and are rapidly degraded, potentially co-transcriptionally, by the nuclear exosome in a manner that may not necessarily depend on MTREX. Given that a significant proportion of unspliced transcripts may follow such a pathway (reviewed in Gordon et al., Curr. Op. Gen. and Dev. 2021), the authors should at least consider this possibility in their presentation of results and discussion. Ideally one could try to combine rapid depletion of NRDE2 with depletion or partial inactivation of one of the nuclear exosome components RRP6 or RRP44, although this reviewer recognizes that this may be technically challenging and lead to indirect effects on cell growth that might confound the analysis. Sequencing of specifically nascent RNAs associated with Pol II from a chromatin fraction, might offer a way to uncover additional NRDE2-sensitive transcripts.

      Secondly, the fact that NRDE2D200 shows a massive increase in U1 snRNP reads by the BCLIP procedure potentially suggests that NRDE2 may actually be part of a surveillance pathway to enforce usage of specific 5'SS and minimise cryptic 5'SS use. In this model, NRDE2 might bind all 5'SS but needs to be dissociated from the U1 snRNP either before or during 5'SS transfer by a helicase (e.g. MTREX or DDX5) within a certain time frame to prevent transfer of cryptic 5'SS. This model would be reminiscent of the initial binding and subsequent dissociation of Mud2 by Sub2 during E complex formation in yeast or of U2AF by DEK during proofreading of initial 3'SS recognition in humans. The fact that targets sensitive to NRDE2, as judged by RNA-Seq and expression profiling, mostly do not overlap with those MTREX, does not exclude the possibility that NRDE2 may act in cooperation with MTREX to prevent usage of cryptic 5'SS, which might result in production of rapidly degraded unspliced transcripts that are not detectable by the RNA-Seq methodology used here. Minimally, this reviewer thinks this possibility needs to be considered and briefly discussed by the authors.

      Finally, although provocative, the idea that NRDE2 binds an alternative U1 snRNP is not necessarily implied by the data. The fact that U1A, U1C, and U1-70k are not detected by MS in a tandem IP set-up that uses disrupts RNA structure and potentially protein-RNA interactions cannot be considered clear evidence for an alternative snRNP. Structures of the U1 snRNP suggest that association of such auxiliary proteins may depend on the structure of the U1 snRNA. The authors need to either modulate and clarify their claim, or provide stronger evidence, e.g. from more gentle IPs with U1A and U1-70K that U1 snRNPs that associate with these factors are not also associated with NRDE2. Related to this, this reviewer thinks it is tenuous to claim that the harsher xTAP-MS analysis involving formaldehyde is more indicative of "native" interactions because it uncovers binding of one core spliceosomal component and of TFIP11 and DHX15.In fact, the opposite seems more likely, that such reported interactions are not indicative of direct proximity, but rather result from perturbations to the native RNP structure induced by benzonase. In this sense, the claim that these observations suggest an "alternative" spliceosome assembly pathway seems particularly problematic, especially in view of the fact that in vitro studies suggest that DHX15 can associate with the U2 snRNP and can disassemble complexes at all stages of spliceosome assembly and catalysis, including during the pre-catalytic stage. The authors should be more careful with their wording and interpretation here.

      Depending on how the authors address the issues raised above, and how they modify their claims in the text, additional experiments may be deemed beyond the scope of the present study and should not be strictly necessary for publication, with the likely exception of the issue of alternative U1 snRNPs, where additional IPs might clarify, and potentially strengthen, the authors' claims.

      Significance

      This reviewer is an expert in the biochemical and structural study of pre-mRNA splicing and considers the present study an important contribution to understanding 5' splice site usage in higher eukaryotes. While the observation that spliceosomes from higher eukaryotes use additional protein factors to modulate 5' splice site selection is not new, the discovery of specific factors bound to U1 snRNA that may directly affect its binding to stronger or weaker 5'SS is certainly novel and of potentially broader significance. Although the author's claim that this may reflect alternative U1 snRNPs is not fully supported by the evidence presented, the proposal itself is an important potential advance, if it holds up to more stringent testing. The potential for NRDE2 to be part of a more complex surveillance mechanism to enforce use of specific 5'SS, which the data may also point to, would be an equally important advance. Finally, the observed interactions with U4 and U6 snRNAs, on which the authors do not comment much, provide further support for the idea that transfer of the 5'SS from U1 to U6 may be a particularly crucial step in 5'SS selection that is modulated in higher eukaryotes by non-canonical factors. Indeed, this latter point is also a significant contribution of the present study.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Flemr and colleagues describe novel functions of the splicing-associated proteins NRDE2 and CCDC174. These two proteins have previously been implicated in splicing (Jiao et al., RNA, 2018) and interact with the helicase Mtr4 and the exon-junction complex (Richard et al., RNA, 2018), which targets RNAs for degradation by the exosome. Here, the authors use a combination of genomics, including a modified CLIP protocol, genetics, and mass spectrometry to establish two key findings: (1) NRDE2 and CCDC174 act in concert to promote pre-mRNA splicing from non-consensus 5' splice sites (5'SSs) and (2) together with U1 snRNA they form a novel non-canonical U1 snRNP. The data themselves are clearly presented and of very high quality, but I do not agree with the authors interpretations and the two key claims.

      On claim (1), instead of NRDE2/CCDC174 specifically and actively promoting the usage of correct weak 5'SSs, they could alternatively promote correct 5'SS choice indirectly, by suppressing nearby cryptic 5'SSs. This is consistent with known functions of the EJC (e.g. Boehm et al., Mol Cell, 2018), with which NRDE2 and CCDC174 interact (Jiao et al., RNA, 2018) and their known association with Mtr4, which is also re-produced in the manuscript. This is further supported by the authors data, which shows that NRDE2 and CCDC174 CLIP signal peaks at the same sites as EJCs, upstream of 5'SSs. Prior to publication the authors should experimentally distinguish between active (direct) and passive (indirect) 5'SS selection mechanisms by NRDE2 and CCDC174.

      On claim (2), a new U1 snRNP would be a major discovery, yet, given the presented data, this conclusion should either be removed completely from the manuscript or needs to be rigorously tested. See comments below.

      Major comments

      1. The authors show an enrichment by MS-IP of NRDE2 in Fig 1A, 1C (the improvied xTAP-MS protocol) for late-stage spliceosome components, such as TFIP11 that is required for spliceosome disassembly (ILS complex), consistent with earlier data in C. elegans (Jiao et al., RNA, 2018). Given the consistency of the late stage-spliceosome interaction and the EJC with published results, how do the authors reconcile the proposed functions in 5'SS selection with known interactions of NRDE2 and CCDC174 with the EJC and disassembling spliceosomes? If NRDE2 and CCDC174-U1 formed, they would dissociate from the spliceosome with U1 snRNA during the Prp28-dependent pre-mRNA handover from U1 to U6 snRNA. How would NRDE2 and CCDC174 re-associate after the subsequent Pre-B to B to Bact to C transitions in C, when the EJC binds the spliceosome, or after the subsequent C* to P to ILS transitions in the ILS, when e.g. TFIP11 binds. In a more likely model, early and late splicing factors co-IP in the authors MS experiments because splicing factors are enriched generally with eachother and in e.g. nuclear speckles. Perhaps more stringent washes in the xTAP-MS experiment could home in on more direct interactions of NRDE2 or CCDC174 to the spliceosome?
      2. The following comments relate to the claim of a non-canonical U1 snRNP.
        • Fig. 6B: to assess the predictive power of the CLIP signal to reveal protein-snRNA interactions, can the authors comment on the expected crosslink efficiency and specificity of a bona fide U1 snRNP protein to U1 or a U2/U4/U5/U6 snRNP protein to its respective snRNA as well as all other snRNAs? How would these efficiencies and specificities compare to NRDE2-U1?
        • Fig. 6C: Can the relative differences in snRNA abundance, U1 being the most abundant, explain the CLIP crosslink efficiencies without the requirement of a bona fide NRDE2-U1 complex?
        • In Fig. 6C, have the authors looked at other spliceosomal snRNAs and their enrichments in the northern?
        • Fig. 6G, did the authors measure cellular snRNA levels after SmE dTAG depletion? The prediction would be that all snRNAs are reduced in steady-state abundance, due to improper biogenesis, which could explain why the U1 snRNA CLIP-seq signal is reduced. This would be independent of an NRDE2-U1 interaction.
        • As it would be surprising and exciting, if U1A, U1C, and U1-70k were absent from a functional U1 snRNP, this requires additional proof. Can they authors use an anti-U1 snRNA oligo in tandem with the NRDE2 IP or CCDC174 IP to show that the Sm-ring proteins and U1 snRNA are highly enriched but not U1A, U1C, and U1-70k proteins or any other snRNA?
        • U1C provides a ZnF domain that stabilizes the pre-mRNA 5'SS in its binding to U1 snRNA (Kondo et al., Elife, 2015). U1-70k stabilizes the U1 snRNP (Kondo, Elife, 2015) and can couple to RNA polymerase II (Zhang et al., Science, 2021), and is important for U1 snRNP biogenesis (Byung Ran So et al., NSMB, 2016). How would NRDE2 or CCDC174 compensate for these essential activities? Given the various crucial functions known U1 proteins perform, the claim that NRDE2 or CCDC174 can substitute them, should be supported by proof of their functional substitution.
        • Since NRDE2 or CCDC174 and U1 snRNA would be conserved and presumably for a high-affinity complex, ideally the authors would provide biochemical proof of their interactions, though this is may be beyond the scope of the current manuscript.

      Minor comments:

      • The conditions of the dTAG experiment are insufficiently described, what was the efficiency of depletion (Western blots or mass spec?) and over which time-scale was this applied?
      • Introduction, in the sentence '[...] encode much shorter U1 snRNA [...]' the authors imply that longer U1 snRNAs are correlated with a lack of splice site degeneracy. Yet, structural and mechanistic data show that the expanded U1 snRNA segments in e.g. S. cerevisiae (or U2 snRNA, which contains a 1000 nt) insertion are distant from the U1 snRNA 5'-end that recognizes the 5'SS or the U2 snRNA BSL, which binds the branch site, and thus are unlikely to influence splice site selection. Please rephrase.

      Significance

      NRDE2 and CCDC174 are enigmatic proteins that are likely to function during mRNA biogenesis and both have been linked to splicing and RNA decay. It is thus interesting to understand their precise modes of action. While the authors provide excellent data, the conclusions are not substantiated. I have expertise in the mechanistic study of pre-mRNA splicing, based on which several of the authors claims such as a new U1 snNRP complex, are challenging to reconcile with the past decades of splicing research. Given the sizable impact a new U1 snRNP would have on the field, these data must be unimpeachable.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript, Flemr et al. characterize the roles that proteins Nrde2 and CCDC174 play in mammalian splicing regulation. The authors perform native (nTAP-MS) and cross-linking (xTAP-MS) based IP-MS methods to identify the NRDE2 and NRDE2 mutant interactomes, and demonstrate NRDE2's enriched interactions with splicing factors. The authors further develop a RNA footprinting method (BCLIPseq) in order to capture NRDE2 and CCDC174 binding patterns on RNA, revealing a preference for binding unspliced introns close to the 5' SS. Furthermore, upon NRDE2 knockdown, the authors note a significant increase in alternative 5'SS splicing events, making NRDE2 a putative regulator of cryptic 5'SS. Following up on this observation through luciferase reporter assays, the authors demonstrate how NRDE2 and CCDC174 work together to inhibit cryptic splicing at weak 5'SSs. Finally, the authors demonstrate that NRDE2 and CCDC174 interact with U1 snRNA (but not core U1 snRNP protein components), providing a basis for their interactions with 5'SS. Overall, the authors thoroughly characterize protein and RNA interactions with NRDE2, demonstrating its role in mammalian pre-mRNA splicing, and its concerted role with CCDC174 in regulating splicing at some weak 5' splice sites.

      The study would be greatly improved through additional controls and more careful analyses. For instance, many controls are missing for the cell lines used throughout the study, making interpretation of the data more difficult. Other issues include that techniques developed within the study are presented without validation experiments, and key analyses, including microscopy experiments and rMATs analysis of RNAseq data, are performed without proper quantification, weakening the authors' conclusions from these experiments. Finally, major conclusions of the paper, such as the potential role of NRDE2 in a non-canonical U1 snRNP complex, would be greatly strengthened by additional experiments. However, overall, it appears that many of the major concerns should be readily addressable.

      Major comments

      1. Data are not present to demonstrate that cell lines were validated and compared properly:
        • a. The authors say "We assessed potential consequences of the tagging approach on protein function by comparing RNA-seq gene expression profiles with that of untagged cells, which remained unchanged for the proteins we report hereinafter" (p.6). This analysis should be made available in the supplement. They should additionally show western blots of tagged/untagged protein, in order to demonstrate similar expression levels for endogenous and engineered proteins.
        • b. Knockdown (KD) levels of all proteins from engineered lines should be shown over the KD timeline used in the study. For instance, no westerns in the paper show the degradation efficiency of the dTAG KD lines.
        • c. The authors should address why knockdown lines were made in different ways for different proteins (ex. only one cell line is a dTAG degradation line) and why knockdowns were performed for different amounts of time for every protein.
        • d. The authors should consider that, since knockdowns are performed for different amounts of time, results between protein knockdowns may not be directly comparable. For instance, in figure 3D, Ccdc174 dTAG lines have less misspliced target introns than the other knockdown lines. However, this may simply be because the knockdown period is shorter for the dTAG line than the other knockdown lines, and the length of treatment affects the overall number of introns affected.
      2. Nuclear localization experiments would benefit from further controls and quantification:
        • a. The authors conclude that "NRDE2 localisation to nuclear speckles depends on active pre-mRNA splicing" (p.7), which seems to contradict their result that "Chemical inhibition of splicing with Thailanstatin A (Liu et al., 2013) resulted in...wild-type NRDE2 remaining concentrated in enlarged NSs (Figure 1H and S1J)" (p.8).
        • b. Since the splicing inhibitor Thailanstatin A also changes the localization patterns of U2AF2 (Fig. 1G-H), it is unclear if U2AF2 is still a reliable nuclear speckle marker in the presence of the drug. Additional controls (such as staining for other nuclear speckle markers) are necessary to make this assertion.
        • c. To make the conclusion that "NRDE2-D174R accumulated in nucleoli" (p. 8), the authors should also include a nucleoli marker in their microscopy experiments.
        • d. Signal quantification of NRDE2 distribution/overlap with U2AF2 signal would strengthen the conclusions in Fig. 1G-H.
        • e. Quantification would again be helpful in Fig. 5C to demonstrate changes in NS localization. In addition, it looks like Nrde2-KO does not just lead to lack of CCDC174 accumulation, but to a decrease in its overall expression. The authors should comment on this observation, or quantify CCDC174 signal in both images to demonstrate that the overall levels remain the same.
      3. Since BCLIPseq is a technique developed by the authors, a more in-depth discussion of the technique development and quality control of the resulting data is warranted.
        • a. The authors mention that BCLIPseq offers a "streamlined and sensitive alternative to existing CLIP techniques" (p.9), but they don't provide any specifics into the ways they improve existing CLIP techniques in the main text. In what ways is it more streamlined and sensitive? This should be discussed in the main text rather than just the discussion, in order for the assertions made to be backed up with (supplementary) figures. A comparison of the coverage provided by a BCLIPseq library for NRDE2 to a CLIP library, for instance, would help to support these assertions.
        • b. The authors should address or provide evidence for why on-bead polyadenylation is preferable/more efficient than adapter ligation, especially as polyadenylation may be variable across transcripts. For instance, the authors could show more controls demonstrating the efficiency of on-bead polyadenylation or cite papers that have already extensively tested on-bead polyadenylation.
        • c. Many other RNA footprinting techniques (eCLIP, RIPseq) have noted significant nonspecific background in the resulting libraries, and usually use input controls to filter for this nonspecific background. The authors should clearly state if their BCLIPseq libraries also suffer from the same nonspecific background, and if so, what quality control steps exist in their analysis pipeline to minimize this background.
        • d. Related to the previous point, there is a high amount of rRNA reads in all the BCLIP libraries except EIF4A3. The authors suggest it is likely background, but if they are using a FLAG antibody for all of these, I'm not sure why there would be so much more background for some and not the others. If it's because EIF4A3 pulls down much more RNA with it because it binds most exon-exon junctions, whereas binding of the others is more rare, then isn't it possible that the mRNA reads are also partially background? This could explain why there is a very small overlap between the BCLIP bound loci and the affected 5'SS. An input control would help to determine what is indeed background.
      4. The conclusion that "This [U1 snRNA binding] leads to the provocative idea that NRDE2 could potentially mediate the formation of a non-canonical U1 snRNP" (p.20) is a very intriguing conclusion that would largely benefit from additional experiments to strengthen the claim.
        • a. Depletion of a U1-specific protein (U1-70k, U1C) and analysis of the effect (or lack thereof) on Nrde-U1 snRNA interactions would strengthen the assertion that Nrde-U1 snRNA interactions are independent of core U1 snRNP components.
        • b. Depletion of a U1-specific protein (U1-70k, U1C) and analysis of the effect (or lack therefore) on Nrde2-KO sensitive introns would also strengthen the assertion that Nrde2 regulates introns as part of a non-canonical U1 snRNP.
        • c. Overall, a schematic in the last figure that depicts the splicing model presented in the discussion would be helpful for describing the Nrde2/Ccdc-174 model proposed.
        • d. The authors show that the majority of ms-snRNA reads map to U1 snRNA. However, U1 snRNA is generally more abundant than other snRNAs (Dvinge et al. 2019), so the authors should show how the distribution shown in Fig. 6B compares to the input distribution of snRNA levels in the cell line used. Also, relative levels of U1 snRNA detected by IP-Northern Blot (Fig. 6C) don't seem to match the results shown in Fig. 6A and B, as U1 snRNA seems most abundant in the NRDE2 IP by Northern Blot and most abundant in NRDE-Δ200 by BCLIP-seq.
        • e. In Figs. 6D, E and F, the authors suggest that NRDE2 and CCDC174 contact U1 snRNA at multiple positions based on observing highest enrichment over SL2 and SL3. However, snRNAs are highly structured and modified, which may interfere with reverse transcription during library preparation and lead to uneven signal throughout the gene body. To show that the proteins of interest are really enriched at these positions, the authors could perform the same experiment for a protein that is known to bind at a different location on U1 snRNA.
      5. The rMATs analysis performed is very lenient; notably, there is no reported filtering for splicing events with some minimum coverage across replicates, and the inclusion level difference threshold of >0 (rather than >0.1 etc) is extremely low. As the rMATs analysis is key to the authors conclusion that there is "frequent cryptic 5'SS upon Nrde2 knockout" (p. 13), it seems important that this analysis is performed with more stringency in order to capture robust and meaningful splicing changes.

      Minor comments

      1. Some parts of the paper are organized in a confusing manner:
        • a. It is unclear why development of the xTAP-MS protocol is under the section "NRDE2 Localization to Nuclear Speckles Depends on Active pre-mRNA Splicing"
        • b. The section "Nrde2 and Mtrex Knockouts Induce a 2C-like State", while interesting, seems to be outside the scope of the paper
      2. It would be interesting for the authors to look into the BCLIPseq data to see if there are any enriched RBP binding motifs for the proteins studied.
      3. Western blots for IPs (ex. Fig 1F) should show the input for both the IP bait and prey proteins, not just the prey. In addition, input and IP'ed protein should be displayed in the same western blot image (without cropping in-between).
      4. Previous studies (Boehm et al 2018) have found that other EJC-associated proteins also are important for regulation of 5' cryptic splice site usage. It would be interesting for the authors to compare the 5' cryptic splice sites identified in these earlier studies to look for overlap between the 5' cryptic splice sites regulated by these proteins vs. NRDE2.
      5. The luciferase reporter assays are an especially strong portion of the paper, and are a nice orthogonal validation of the link between Nrde2, splicing regulation, and SS strength.
      6. It would be interesting for the authors to investigate the effect of the mutations in the luciferase reporter constructs on the binding patterns of Nrde2 on construct transcripts. This may help provide a mechanistic basis for the chosen cryptic 5' SSs.
      7. "Thus, NRDE2 promotes splicing from the most upstream of a series of 5'SSs" (p. 16) is an interesting conclusion, but this statement is far too general given the low number of genes surveyed using the luciferase assay. The statement should be rewritten to reflect that this statement has only been shown to be true for the few genes tested.
      8. The statement "NRDE2 and CCDC174 promote splicing at many of the same weak 5'SSs" (p. 16) would be stronger if it was not just based on the genes studied through the luciferase assay, but based on splicing changes analyzed genome-wide through rMATs analysis. Do NRDE2 and CCDC174 promote splicing of the same weak splice sites globally?
      9. R squared values should be added to the correlations presented in Fig. S2B to support the claim that replicates "correlated strongly", since that is the basis for merging replicates in subsequent analyses.
      10. In Fig. 3A, the four clusters should be annotated on the figure to increase clarity.

      Significance

      Overall, the study is thorough in its approaches to studying NRDE2 biology and makes a strong case for the exciting role of NRDE2 and CCDC174 in 5'SS selection. Combined IP-MS and RNA footprinting approaches compellingly demonstrate NRDE2's associations with splicing factors and splice sites in vivo. In addition, the combination of genome-wide approaches (RNAseq) with targeted analyses (luciferase reporter assays) allow for detailed analyses of cryptic splice site choice in the absence of NRDE2, NRDE2 mutants, MTREX, or CCDC174. These experiments support the novel role of NRDE2 and its associated proteins in splice site choice.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      This paper seeks to identify genomic safe harbor loci in the human genome for the integration of transgenes. The authors use computational analysis to identify a set of potentially useful sites for transgene integration; they subsequently test a small subset of these identified locations in human embryonic stem cells to determine the impact of transgene integration on the transcriptome and the ability of these cells to differentiate into numerous cell types. They determine that the subset of sites they identify and test all seem promising as no major changes in transcription or differentiation were observed following integration.

      Major Comments:

      Overall, the conclusions of this paper are reasonably convincing, and the authors do a nice job of laying out the criteria for designation of genomic safe harbor loci and characterizing three of these loci. However, there are several places where the data and rationale for the experimentation could be clarified to make the conclusions more convincing. One major question is whether or not these safe genomic loci identified are actually better than traditionally used loci such as Rosa26 or CCR5. While the authors note in the discussion section that these traditional loci do not meet their criteria for a safe genomic integration site, they do no direct comparison of their new loci vs these more traditional ones. A side-by-side comparison would make the data more convincing that these loci are better suited for genomic integration (such as noting fewer changes in the transcriptome etc).

      The second major issue is that it is unclear how the authors picked seven loci for more extensive targeting out of the 25 initially identified. Without knowing the criteria used for these selections, it is challenging to know if there was a bias in selection of sites for further analysis that could alter results, or if the other identified sites are truly acceptable targets. In addition, only three of those GSH sites were successfully targeted. As a result, it is hard to determine the validity of the authors claim that they identified 25 unique GSH loci when they only fully characterize three of them. While it is not necessary to test all 25, it might be beneficial to test more than three before making these conclusions.

      The data and methods in the paper are generally presented in an understandable fashion, and the use of three biological replicates for characterization of the hESC lines seems reasonable.

      Minor Comments:

      There are several minor issues that addressing could help strengthen the claims in this manuscript. First, it is unclear how the authors used BLAT to narrow down their initial list of 49 safe loci down to 25. A more detailed explanation in the text (vs methods) would aid in reader understanding of methodology. In addition, a deeper explanation of how differentially expressed genes were identified would be helpful. The authors state that many of the DE genes in their GSH targeted loci were identical to those found in both control and untargeted cells. It is unclear what the comparator was in these experiments that was used to identify those DE genes; clarification of this in the text would be helpful for the reader. In figure 2, the labeling of the panels is quite confusing, as panel F appears between panels E and D. Finally, in figure 3, while the three new cell lines are shown to be differentiated into various cell types, no control images are shown for comparison. This would be helpful to add in.

      Significance

      Overall, the major advancement of this paper is the identification of numerous putative genomic safe harbor loci (GSH) for the integration of transgenes in the human genome. With the rapid development of novel gene therapy techniques, the characterization of locations in the genome that are acceptable for transgene integration with the lowest likelihood of unintended off-target or downstream consequences is important. As a result, these sites have the potential to be quite valuable for the gene therapy field and of great interest to many scientists. Thus far, few widely accepted safe locations for genomic integration have been identified, making these sites of interest to numerous labs. As someone who has generated numerous transgenic mouse lines using random integration, the ability to selectively target transgenes and know there will be minimal issues with silencing or off-target impacts is appealing. However, my knowledge of genomic and bioinformatics techniques is minimal, making it challenging for me to adequately assess that piece of this manuscript, though I believe it contains valuable information for the gene editing and gene therapy community.

      Referees cross-commenting

      I agree with comments from the other reviewers and think they are all very reasonable suggestions.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Autio MI and colleagues report their study aiming to identify novel genomic safe harbor (GSH) loci in the human genome. First, they conducted a computational analysis of publicly available data with a list of criteria previously suggested for GSH loci. Since expression units placed at GSH loci should stably active, they also examined candidate loci using GTex data and against chromatin regions in the active (A) compartment reported by Schmitt et al. They found 25 candidate loci after these analyses. Then, they successfully placed landing pad constructs on three loci in hESCs by use of the CRISPR technology. They have demonstrated that expression of Clover by the CAG promoter is homogeneous and stable even after differentiation to neuronal, live and cardiac cell lineages.

      Major points:

      1. They only examined expression from the CAG promoter unit. However, this does not guarantee stable expressions from other promoters. Since the CAG promoter is very strong, it may be resistant against cellular silencing activity. For research purpose, tetracycline-regulatable promoters are often used, and it has been reported that although CAG promoter is not silenced, the TRE promoter is silenced when an expression unit is placed at AAVS1 locus (Ordovas L et al. Stem Cell Rep, 5: 918-931, 2015). Therefore, before concluding that these loci are GSH, expression from the TRE promoter should be tested.
      2. They examined off-target integration by PCR and Sanger sequencing of the top 5 predicted off target sites. However, Southern blot analyses are needed to rule out off-target integrations. (This reviewer cannot evaluate data of copy number analysis using Digital PCR).

      Significance

      Identification of GSH loci will advance basic research and clinical applications.

      This reviewer is not good at bioinformatics and cannot evaluate the first half of this study.

      Referees cross-commenting

      I have found that comments by other reviewers are important. As suggested, functionality of differentiated cells should be tested and demonstrated. Again, examine other promoters beside CAG should be tested in those differentiated cells.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The Authors have taken a bioinformatic approach to identification of safe harbour loci in human genome, and then validated three of these in the H1 hES cell line. Overall, the rationale and data presented in clear and and the experiments appear to be reproducible.

      Minor concerns:

      1. please expand on the rationale for selection of the seven sites that were selected for initial targeting (i.e. what differentiated these from the other 18 sites as being suitable), and on the results for why no successful edits were identified for 4 of these loci.
      2. please add data that quantifies the number of cells expressing Clover, in the differentiated cell types. Ideally, multiple markers for each lineage should be used.
      3. Functional studies of the differentiated cell types would add substantial value to this paper. in the absence of such data, additional marker proteins that reflect functional properties or the maturity of the derived cell types could be added.

      Significance

      Identification and characterization of new safe harbour sites offers potential for generation of research tools and potentially for clinical applications. Those working in the fields of iPSC-based disease modelling and pre-clinical gene therapy are likely to be interested in this work and the cell line resources developed.

      Reviewer expertise: iPSC, CRISPR/Cas, neuronal differentiation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-xx-xx

      Corresponding author(s): Yang Hong

      [The “revision plan” should delineate the revisions that authors intend to carry out in response to the points raised by the referees. It also provides the authors with the opportunity to explain their view of the paper and of the referee reports.

      The document is important for the editors of affiliate journals when they make a first decision on the transferred manuscript. It will also be useful to readers of the reprint and help them to obtain a balanced view of the paper.

      If you wish to submit a full revision, please use our "Full Revision" template. It is important to use the appropriate template to clearly inform the editors of your intentions.]

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We would like to thank all three reviewers for their comprehensive and constructive comments. We are in particular grateful to_ reivewer#3_ for suggestions on improving the manuscript text and figures. We have already incorporated most of these suggestions into the revised manuscript. Based on reviewers’ comments, it appears no additional significant experiments are required for the revision. Nonetheless, for the final revision we will do the following:

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.


      REVIEWER #1

      (Evidence, reproducibility and clarity (Required)): Summary: This manuscript takes a closer look at how hypoxia affects the accumulation of PI4P and PI4,5P2 (PIP2) in the plasma membrane of Drosophila ovarian follicular epithelial cells and how ATP depletion similarly affects the localization of the same phospholipids in HEK293 cells. They demonstrate that hypoxia results in the reversible loss of plasma membrane (PM) association of both lipids, with PIP2 disappearing ahead of PI4P, and recovering more slowly than PI4P when oocytes are returned to normoxia. They also show that the intracellular vesicular pools of PI4P are depleted ahead of the PM pools and the PI4P recovery occurs first in PM, then in the vesicles. They show that the disappearance and recovery of the polarity protein Lethal giant larvae (Lgl) parallels that of PIP2 during hypoxia and subsequent normoxia, with a very slight delay. The authors then go on to show the RNAi knockdown of the PM enzyme (PI4KIIIa) that phosphorylates PIP delays the recovery of PI4P at the membrane, with recovery first occurring in the vesicular pools. This knockdown also delays the recovery of PIP2 and, as with recovery of PI4P, the recovery of PIP2 now occurs first in vesicular pools. Lgl recovery follows that of PI4P and PIP2 with RNAi knockdown of PI4KIIIa. The knockdown of all three of the enzymes that phosphorylate PIP to generate PI4P delays recovery of PI4P, PIP2 and Lgl at the membrane even more. The authors show that proteins required for the PM localization of PI4KIIIa have similar effects on the recovery of PM PI4P, PIP2 and Lgl (with delays and recovery of vesicular pools before PM pools). Independently, the authors show that ATP depletion in HEK293 cells result in similar reversible depletion of PI4P, PIP2 and Lgl from the PM. From these studies and their previous findings, the authors conclude that pools of PI4P and PIP2 are likely rapidly turned over in the membrane even during normoxia and that this rapid recovery is dependent on the PM localized enzyme that phosphorylates phosphoinositol.

      Major comments: Overall, the data are beautifully presented; it is quite helpful to have a video of each experimental treatment showing the corresponding response of all three molecules that are being monitored. Signal quantification over time is carefully documented. With the exception that a link between hypoxia and depletion of ATP has not been demonstrated here, the key conclusions are convincing. However, as pointed out below (in the significance section), some of the major points have already been published by this group. Their conclusion that hypoxia induces acute and reversible reduction of cellular ATP levels (which are then proposed to affect the activities of the enzymes required for PI4P and, consequently, PIP2 production) was not shown. They did demonstrate that acute depletion of ATP had the same consequences on PM phospholipids as acute hypoxia (in HEK293 cells). And, indeed, it makes sense that hypoxia could affect enzymes required for ATP synthesis, but the authors would have to show that acute hypoxia results in acute reduction in cellular ATP pools to make the links they suggest. This is something they should be able to do in the HEK293 cells now that they have their ATP sensor. Just to note, this group did show that hypoxia can reduce levels of ATP in Drosophila oocytes in their previous paper (Dong et al., 2015, Figure S3), but it is unclear if this is reversible and happens in the time frame of the experiments presented in this current manuscript.

      In Dong et al 2015 we biochemically measured ATP levels in embryos treated by hypoxia, but due to lack of ATP biosensors it was not possible then to show real time ATP level changes in cells undergoing hypoxia and reoxygenation. Instead, we showed that direct ATP inhibition by antimycin treatment mimics the effect of hypoxia, to support the hypothesis that hypoxia acts through ATP inhibition.

      In the current manuscript, we demonstrated for the first time that hypoxia triggered acute and reversible ATP level reduction in Drosophila follicular epithelial cells (Figure S3). In the finalized manuscript we plan to add data to show ATP level changes in HEK293 cells under hypoxia, as suggested by the reviewer.

      My suggestions are the following: (1) The authors need to make it absolutely clear what was already known, including the following: (A) hypoxia reversibly affects PM pools of PI4P, PIP2, and Lgl (and other membrane associated proteins), (B) that hypoxia can affect ATP levels in Drosophila oocytes (although these previous studies do not show anything about the dynamics) and (C) that reducing ATP levels affects PM pools of PI4P, PIP2 and Lgl.

      We agree with reviewer and have revised the introduction (p3 line 24-34) to make clearer what we previously published on the hypoxia/ATP and PIP2 turnover. It should be noted though that our previous publications did not contain any data regarding PI4P under hypoxia or ATP inhibition, as the current manuscript is the first time we reported the making and use of PI4P sensor such as P4Mx2::GFP.

      (2) They should demonstrate that acute hypoxia and return to normoxia has acute and reversible effects on cellular ATP levels - they now have the tools to do this, at least in HEK293 cells.

      We agree with reviewer and will add this data to the final revision. Such experiments require significantly modified setup for imaging live HEK293 cells with controlled hypoxia/reoxygenation, but we are reasonably confident that such experiments are feasible.

      Minor comments:

      The manuscript is too long and the discussion unnecessarily repeats everything already presented in the results. The authors should find a way to streamline the discussion.

      We will revise the final manuscript to make the discussion more concise and streamlined.

      N values should be given for all figures and experiments, and the N=23/24 versus N=24/24 needs to be explained the first time it is used.

      We have revised manuscript so all N values are clearly provided and easier to understand.

      There are a few mismatches in terms of plural nouns and singular verbs and vice versa sprinkled into the manuscript, so some careful editing would be useful.

      We have revised the manuscript to eliminate such errors/typos, especially with the help of the generous and comprehensive list of by reviewer#3.

      Significance: I was initially quite excited about the novelty of their findings and the potential insight into the dynamics of PM pools of the two phospholipids that are critical to cell polarity and that play important signaling roles. However, at least a subset of their conclusions were either published in their earlier work or do not necessarily follow from what they have done in this manuscript. Their statement that hypoxia in Drosophila induces acute and reversible depletion of PM PI4P and PIP2 was presented in a previous publication (See Figure 8 of Dong et al., 2015).

      We greatly appreciate reviewer’s comments on the significance of our discovery. Again all data regarding PI4P are new in this manuscript and have not been published before. We only published very preliminary data suggesting the reversible depletion of PIP2 and PIP3 (but not PI4P) under hypoxia (Dong et al, 2015). The current manuscript provides a comprehensive set of quantitative live imaging data with high spatial and temporal resolution that demonstrate for the first time the dynamic turnover of PM PI4P under hypoxia and ATP inhibition, the correlation between such turnovers of PM PI4P and PIP2, and the direct correlations between PI4P/PIP2 turnover and Lgl electrostatic PM targeting and intracellular ATP levels. In addition, studies on the role of PI4KIIIa complex in such process have not been done before.

      This manuscript would appeal to an audience interested in the mechanisms of cell polarity and phosphoinositide signaling.

      I am a Drosophila developmental geneticist quite familiar with the topics that this paper addresses.

      REVIEWER #2

      (Evidence, reproducibility and clarity (Required)): Summary: This manuscript describes the effect of hypoxia on the levels of PI4P and PI45P2 , two key PPIs that are enriched on the inner leaflet of the plasma membrane. These PPIs are synthesized by the sequential phosphorylation of PI by a PI-4 kinase and subsequently a PI4P 5 kinases, both of which use ATP. The relevant PI-4 kinase at the plasma membrane, PI4KIIIa has been conclusively identified previously in mammalian cells by the DeCamilli lab (Nakatsu et.al JCB 2012) and its role in regulating the synthesis of PI4P and PI(4,5)P2 in two Drosophila cell types in vivo shown by two previous studies. Balakrishanan (photoreceptors during PLC signalling) and Basu et.al Dev.Biol 2020 ( in multiple larval cell types ). PI4KIIIa has been shown to exist as a complex of the enzymatic polypeptide, EFR3 and TTC7. The studies by Nakatsu, Balakrishnan and Basu have shown the importance of the complex subunits is regulating PI4P and PI45P2 levels in cultured mammalian cells and Drosophila cell types in vivo.

      We thank reviewer for pointing out the work of Balakrishanan et.al J.Cell Sci 2018 which showed similar results to our manuscript and Basu et al 2020. We have added a brief summary this reference to the Discussion in revised manuscript (p13, line 17)

      In the present study, Lu et. al build on their previous work showing that the polarity protein Lgl undergoes hypoxia induced translocation. They show that hypoxia also induces loss of PI4P and PI45P2 at the plasma membrane in these cells correlated with loss of Lgl localization to the PM. The manuscript then goes on to establish the requirement of the PI4KIIIa complex in regulating Lgl localization as well as PI4P and PI45P2 levels at the plasma membrane during hypoxia and the subsequent recovery of these at the plasma membrane.

      The strength of the manuscript is twofold. (i) The work is done to a high technical standard and the investigators have carried out the measurements of LGL localization, PI4P and PI45P2 levels along with simultaneous measurements of ATP levels in vivo. The work would be strengthened further if the authors could show the level of depletion of PI4K isoforms or PI4KIIIa complex subunits units induced in ovarian tissue under their experimental conditions by the GAL4 drivers used in this study. This is not a persnickety detail as RNAi lines can have very different effectiveness in Drosophila ovarian tissue compared to other fly cell types. This point is, in particular, important in cases where an RNAi line is being used and the conclusion is a lack of impact on a phenotype being studied.

      We are fully aware of the potential caveata of RNAi. In our previous publications we were able to validate RNAi knock-down efficiency against endogenously or ectopically expressed GFP-tagged target proteins (Dong et al., 2020; Dong et al., 2015; Lu et al., 2021) or endogenous proteins with available antibodies (Dong et al., 2015). It is regrettable that presently such reagents are not available for directly examining the level of RNAi knock-down for PI4KIIIa and PI4KIIa etc. We did show that rbo-RNAi efficiently knocked down the expression levels of Rbo::GFP (Figure S1C). In current manuscript, we have been very careful to draw conclusions based on negative RNAi results.

      (ii) A second strength is that the authors now illuminate a further in vivo cell type where the function of the PI4KIIIa complex in regulating PI4P and PI45P2 levels. This adds to the earlier work of Nakatsu, Balakrishnan and Basu.

      A key difficulty with the current story is the lack of specificity of the phenotype they demonstrate under hypoxia. Of course, hypoxia is expected to deplete cellular ATP levels but PI4KIIIa is not the only enzyme that this lack of ATP will impact. There will be dozens or more other kinases, both protein and lipid kinases whose function will be impacted by the drop in ATP levels. Therefore, it is hard to attribute a specific/particular role to the PI4KIIIa complex under these conditions. The mislocalization of LGL::mCherry while correlated with PI4P and PI45P2 levels at the plasma membrane may be just that- a correlation. It is quite possible, indeed likely, that the mislocalization of LGL-mCherry under hypoxia conditions is due to the reduction of the activity of another lipid or protein kinase due to the drop in ATP levels due to hypoxia (PKC is a possibility too).

      We agree with reviewer that PI4KIIIa almost certainly is only one of the enzymes that are involved in regulating the PI4P/PIP2 turnover triggered by hypoxia. This manuscript is our first effort to investigate the potential regulatory network underlying the hypoxia-triggered turnover of PM PI4P and PIP2, and it is our long term goal to identify more components in the regulatory network.

      As to underlying mechanisms of the loss of PM Lgl under hypoxia, we previously showed before that PM targeting of Lgl dependents on both PI4P and PIP2 and acute depletion of PI4P and PIP2 in cultured cells completely blocks the PM targeting of Lgl (Dong et al, 2015). Thus, although we cannot exclude the contribution from other lipids, it is highly plausible that loss of PM PI4P and PIP2 triggered by hypoxia is the main driving force disrupting the electrostatic PM targeting of Lgl.

      Lgl is phosphorylated by aPKC and such phosphorylation inhibits Lgl PM targeting by neutralizing the positive charges on Lgl’s polybasic motif (Dong et al, 2015, Bailey et al, 2015). Thus, potential inhibition of aPKC activity by hypoxia should not inhibit the PM targeting of Lgl. Consistently, we previously showed that aPKC-/- mutant cells showed same acute and reversible loss of PM Lgl under hypoxia (Dong et al, 2015).

      Minor comments:

      The authors must reference all published work on the PI4KIIIa complex in the literature. Some of it is excluded in the present version

      We apologize for the missing references and in the revised manuscript we have already added several additional references based on the suggestions of reviewer#1 and #3. In the finalized manuscript we will make our best effort to cover all the relevant studies.

      The Drosophila work, particularly cell types used, etc are not accessible to people who are not fly experts. This should be done.

      We added a sentence to the first paragraph of Results to specifically make it clear that all Drosophila studies used follicular epithelial cells from female ovary (p4, line 26).

      Significance: Adds to knowledge on the PI4KIIIa complex. Builds on existing knowledge in the PI4KIIIa field and maybe also cell polarity field.

      REVIEWER #3

      (Evidence, reproducibility and clarity (Required)):

      Summary: Phosphatidylinositol phosphates (PIPs) are key determinants of membrane identity and regulate crucial cellular processes such as polarization, lipid transfer and membrane trafficking. Despite decades of study, surprisingly little is known about how levels of PIPs are regulated in response to cellular stress. Here, using Drosophila ovarian follicular epithelial cells and human HEK293 cells, the authors show that levels of plasma membrane (PM) PI4P and PIP2 decrease rapidly in response to hypoxia, resulting in loss of polybasic proteins from the PM. These effects are reversed in response to reoxygenation. Similarly, hypoxia leads to acute depletion of ATP levels, which also regenerate following reoxygenation. Using a combination of quantitative image analysis and genetic analysis, they show that PI4KIIIalpha and its binding partners Rbo/ EFR3 and TTC7 are needed to maintain PI4P and PIP2 at the PM under normal and hypoxic conditions, whereas the other two Drosophila PI 4-kinases, Fwd/PI4KIIIbeta and PI4KII, play a less important role in PM PIP homeostasis. Their results suggest that manipulations with indirect effects on PIPs (hypoxia, ATP depletion, ischemia) can have a profound impact on electrostatic charge at the PM, as well as downstream processes that require PM PI4P and PIP2.

      Major Comments: 1. In general, the authors' conclusions are convincing. However, some of the results are less evident from the still images and graphs provided in the figures than from the movies that accompany the figures. Some suggested improvements are below.

      No additional experiments are essential to support the claims of the paper, although some additional quantitation would be helpful to the reader, as detailed below. Data and methods are generally presented in such a way that they can be reproduced, although some additional details would be helpful, as listed below. Experiments were adequately replicated, and statistical analysis appears adequate.

      We are extremely grateful to the generous efforts of the reviewer providing such a detailed list of suggested improvements. We have incorporated all the text revisions into the revised manuscript and will revise the figures accordingly the final revision too.

      Minor comments: 1. Although the data are generally quantified quite well, there are two instances in the first full paragraph on p. 5 where this is not the case. First, PM PI4P is described as "oftentimes" as showing a transient increase in the early phase of hypoxia. However, this is not quantified. How often did this occur among the samples examined? How large is the transient increase when it occurs (Fig. 1A' error bars are not obvious on the colored background)? Second, the authors state that the P4Mx2-GFP puncta "often" became brighter after recovery. How often did this occur? No quantitation is provided.

      Upon close inspection of the data, we conclude that during the early phase of hypoxia PM P4Mx2::GFP always showed an initial drop followed a transient increase. Thus we revised the sentence to delete “oftentimes”.

      We did not specifically quantify the transient increase of the PM P4Mx2::GFP during the early phase of hypoxia, as we consider it likely an artifact as discussed in the manuscript, making its quantification less meaningful.

      As to the P4Mx2::GFP puncta, regretfully we do not have imaging tools that can accurately and automatically recognize and measure such puncta in our live recordings. We are actively developing such software using trainable Weka segmentation tool (https://imagej.net/plugins/tws/) and hopefully such puncta quantifications will be possible in our future experiments.

      The authors conclude that "PI4KIIalpha and Fwd contribute significantly to the maintenance of PM PI4P" (bottom of p. 7), yet they did not validate their RNAi knockdowns of these two genes, so they do not know whether it is one or both of these PI4Ks that contribute.

      We agree with reviewer that our RNAi knockdowns on PI4KIIa and Fwd are not sufficient to tell whether one or both contribute to the PM PI4P maintenance. We revised the sentence to “Our data support that PI4KIIα and/or FWD contribute significantly to the maintenance of PM PI4P...”

      In Fig. 4B, a subset of the cells "show failed recovery of PM Lgl::GFP". However, some cells did recover. This average percentage of cells that recovered should be quantified, if possible.

      Added numbers of PI4K-3KD cells that show normal or failed hypoxia response of Lgl::GFP and revised the sentences accordingly (p8, line18)

      In Fig. 7A, B, the bottom cell in each example lags behind the top cell in recovery of the MaLionR sensor. The frequency of observed cells in each class for 7A, B should be quantified.

      Added n numbers of each cell classs to Figure 7A, B legend.

      In most cases, prior studies were referenced appropriately. However, two previous studies in Drosophila showing the effects of Sktl/PIP2 reduction on localization of polybasic proteins Lgl, Baz/Par-3 and Par-1 were not cited (relevant to the first paragraph of the Introduction, p. 3): Gervais et al., Development (2008), Claret et al. Curr Biol (2014). In addition, two studies showing the importance of Drosophila PI4KIIIalpha in synthesizing PM PI4P and PIP2 were not cited (relevant to the description of this enzyme, top of p. 6): Yan et al., Development (2011), Tan et al., J Cell Sci (2014). Data showing fwd null mutants are not lethal (relevant to top of p. 7) were published in Brill et al., Development (2000).

      We thank reviewer for suggesting the additional references. We added Yan et al and Tan et al for referencing PI4PIIIa, and Brill et al for referencing the original characterization of fwd. We discussed work from Gervais et al and Claret et al in the revised discussion (p14, line 9).

      For the most part, text and figures are clear and accurate. However, there are quite a few typos and grammatical mistakes, as well as instances of lack of clarity in the writing that should be addressed. In addition, there are a number of improvements to presentation of data that would make the figures easier to understand. These are listed below. Suggestions to improve presentation of data and conclusions are below.

      Again we greatly appreciate such generous efforts from the reviewer and have incorporated all the text revisions into the revised manuscript. We will revise the figures accordingly the final revision too.

      Significance: Overall, the authors do a nice job of showing that hypoxia leads to previously unappreciated effects on levels of PM PI4P and PIP2, resulting in loss of PM association of proteins important for normal cellular physiology. This finding is quite novel. Moreover, the authors provide insight into the identity of the PI4Ks that are responsible for regenerating PM PIP2 following return to normoxia. Their analysis of the dynamics of these changes provides multiple interesting insights, including the potential roles of intracellular pools of PI4P in replenishing PM PIP2 and the observation that intracellular accumulation of PIP2 is occasionally observed in association with the appearance of intracellular PI4P puncta, suggesting a novel route for PIP2 replenishment in response to hypoxic stress. Their results will provide the basis for future studies examining the cellular mechanisms involved. This study will be of interest to those studying phosphoinositide biology as well as cellular responses to hypoxic stress and recovery, such as occur during ischemia and reperfusion. Reviewer expertise: Drosophila molecular genetics, cell biology, developmental biology, phosphoinositides, PIP pathway enzymes, PIP effectors


      **REFEREES CROSS-COMMENTING** This session includes the comments of all reviewers.

      __Reviewer 3: __I agree with reviewer #1 that the authors did not do a good job of clarifying what they and others had previously shown, and I must confess I didn't carefully examine their previous papers carefully enough before preparing my review. In fact, they previously showed that hypoxia affects localization of Dlg at the plasma membrane and that its recovery depends on PI4KIIIalpha and PIP2 (Lu et al., Development 2021). This is in addition to their previous data showing effects of hypoxia on Lgl (Dong et al., J Cell Biol 2015). Thus, less of the information in the current manuscript is novel than I thought when I initially read it.

      I also agree with reviewer #2 that they need to do a better job of citing the relevant literature and considering the possibility that hypoxia and reduced levels of ATP might affect many different enzymes. In addition, as suggested by reviewer #1, it seems important

      __Reviewer 1: __I agree with what Reviewer 3 is suggesting and with reviewer 2 that the authors should do a better job of citing all of the relevant literature. I also appreciate the detailed edits provided by Reviewer 3 - it was very generous of them to do this.

      __Reviewer 2: __The points raised by reviewer 1 and 3 with regard to the citing or prior work (from the authors or other labs) also applies to their citing of literature on PI and PI4K signalling. Here too citing or prior work has been less than satisfactory making it difficult to do this.

      We want to thank all three reviewers for their thoughtful and constructive comments. We have revised the introduction to make it clear what we had observed in our in our previous studies. On the other hand, in this manuscript we presents a systematic study on the hypoxia-triggered turnover of PM PI4P and PIP2, the correlation between PI4P/PIP2 turnover and electrostatic PM targeting of Lgl, as well as a potential role of PI4KIIIa and its PM targeting mechanism in regulating the turnover of the PM PI4P and PIP2 under hypoxia. Although the latter by no means indicates that PI4KIIIa is the only enzyme in regulating such process, its characterization is the beginning for us to further identify additional enzymes and regulators in this hypoxia triggered phenomenon.

      We have already added additional references as suggested by the reviewers in the revised manuscript, and once additional experiments are completed we will finalize the manuscript to make sure all relevant references are cited.

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      We have incorporated nearly all of the suggestions from reviewer#3 into the current revision, with few exceptions as listed at the end of this letter. Below are point-to-point response to selected suggestions involving data interpretation and comprehensive text revisions:

        • 5, first paragraph, line 2: replace "oftentimes" with "often" and provide quantitation (see above) *

          Deleted the “oftentimes”. Upon close inspection of our data we conclude that PI4P always showed transient increase of PM signal in early hypoxia.

        • 5, first paragraph, line 6: the claim of "often" should be quantified (see above) *

          Deleted the “often”. PI4P puncta actually were consistently brighter after recovering from hypoxia.

        • 5, second paragraph: the extent of recovery of Lgl is less when Lgl-RFP is coexpressed with PLC-PH-GFP, potentially due to titration of PIP2 by PLC-PH; the authors should comment on this*

          This is a good suggestion from the reviewer. Revised by adding to the end of paragraph:* “Note that in Figure 1B Lgl::RFP recovery appears lower than in wild type, possibly due to the titration of PIP2 by PLC-PH::GFP expression.” *

        • 5, last line: the authors should provide information about the "targeted RNAi screen"; which genes were tested? did any others give relevant phenotypes? a table showing the results of the screen should be provided as supplementary information*

          Added Table S1 which summarize the results of RNAi screen.

        • 11, first full paragraph, line 6: what about PI4KIIIbeta? is the KmATP for this enzyme known?*

          Based on literature, KmATP of PI4KIIIbeat is similar to PI4KIIIa’s (~400uM, Balla and Balla 2006). We added the PI4KIIIb KmATP value to the revised discussion (p11, line22)

        • 11, last paragraph, line 2: what is meant by "etc." is unclear; remove "etc." and include specific information related to what was reported in the literature (with proper references)*

          Revised the sentence to “…that KmATP of PI4KIIIα was measured decades ago using purified PI4KIIIα enzymes from tissues such as bovine brains and uterus (Carpenter and Cantley, 1990)”. The reference (Carpenter and Cantley, 1990) is a review which contains detailed of biochemical characterizations of PI4K kinases from numerous publications.

        • 12, line 3: why do the authors claim that the intracellular pool of PI4P is first synthesized by PI4KIIalpha? what about PI4KIIIbeta? their results do not distinguish between these enzymes*

          We favor the hypothesis that PI4KIIalpha is responsible for synthesizing the intracellular pool of PI4P because the very low KmATP of PI4KIIa. PI4KIIIbeta has high KmATP just like PI4KIIIa.

        • 12, last paragraph, lines 6-7: for the reader, please clarity the mechanism that was invoked to explain how PIP5K can make PIP2 from PI in E. coli (Botero et al., 2019)*

          Revised the sentence to “PIP5K can be sufficient to make PIP2 from PI in E.coli by phosphorylating its third, fourth and fifth positions (Botero et al., 2019)

        • 13, first paragraph, last line: cannot conclude that components of PI4KIIIalpha are "highly interdependent" without testing effect of knockdown of PI4KIIIalpha on Rbo and TTC7, etc.; instead, can conclude that the data are consistent with all of the components acting in the same process; also, delete "the" before "proper"*

          Revised the sentence to “ .. supporting that components in PI4KIIIαa complex may act interdependently for the proper PM targeting in vivo.

        • 14, second paragraph, lines 3-5: expand on this idea; what additional lipids could be important here? are there examples of other proteins that require these additional lipids?*
        • 16, line 6: explain in brief what "pNP plasmid" is and how the multi-RNAi method works (what promoters drive expression of the shRNAs, how many shRNAs are included in the plasmid, etc.)*

          Added a section in Material and Methods to describe in details the generation of pNP constructs and fly stocks.

        • 16, lines 8-3: appropriate references should be included for each stock where available*

          Added references to stocks cy2-Gal4, rbo::GFP and UAS-AT1.03NL1.

        • 16, line 11: explain what UAS-AT1.03NL1 is*

          Added: “UAS-AT1.03NL1(DGRC#117011) was used to express the Drosophila-optimized ATeam ATP sensor AT[NL] in follicle cells (Tsuyama et al., 2017)

        • 16, lines 16-17: Gerry Hammond should not be listed as providing these constructs if he is a coauthor on the manuscript; appropriate references should be cited for these constructs*

          Revised the sentence to “Mammalian constructs of P4M::GFP, P4Mx2::GFP, and PLC-PH::GFP were as previously described (Hammond et al., 2014; Hammond et al., 2012).

        • 17, lines 3-4: sentence fragment "Images were further" is not complete*

          This was a typo, deleted.

        • 19, lines 5-6 from bottom: title doesn't accurately reflect that PI4P doesn't appear to recover in WT control; why is this the case? recovery was observed in other experiments*

          Live recording showed that PM PI4P did recover during reoxygenation (Figure 3A, Movie S7). This particular recording in Figure 3A/Movie S7 was a bit difficult for automatic quantification by our custom software due to that P4Mx2::GFP signal somehow was weak and noisy, resulting in less than “ideal” recovery curves.

        • 22, line 16: fix typo in "uncalibrated"; spell out what "AT[NL] sensor shows*

          Revised to “Heat map of the (uncalibrated) FRET ratio of ATeam ATP sensor AT[NL] in follicle cells

      Simple text revisions have been made for the following suggestions:

        • 3, first paragraph, line 11, and second paragraph, line 2: combine references (Dong, Dong, Lu) with (Bailey and Prehoda, Hong)*
        • 4, 8th line from bottom: PLC-PH is generally referred to as PLCdelta-PH in the literature and should be defined this way the first time "PLC-PH" is used p. 5, 5th line from top: I believe the callout should be to Fig. 1A' rather than Fig. 1B*
        • 5, first paragraph, line 4: delete "to" after "sensor" p. 5, first paragraph, line 7: insert "the" before "intracellular" p. 6, first full paragraph, line 4: change "cells" to "cell" p. 6, second full paragraph, line 1: change "was" to "were" p. 6, second full paragraph, line 4: delete "ref" at beginning of line) p. 7, first paragraph, line 4: explain briefly (either here or in the Methods) the "newly published multi-RNAi tools" reported by Qiao et al., 2018 p. 7, first paragraph, lines 9-10: suggest changing to "...(Fig. 3A,B); the latter...sensors being bound..." p. 7, second paragraph, line 5: replace "is" with "are" p. 7, last paragraph, line 2: "under normal conditions" is vague; replace "normal" with "aerobic" or "normoxic" p. 8, first paragraph, line 4: replace "normal" with "aerobic" or "normoxic" for clarity p. 8, last line: callout to Fig. 5A appears incorrect; should be Fig. S1A instead p. 9, first full paragraph, lines 4 and 5: callouts should be to Fig. 6A (line 4) and 6B (line 5) p. 10, second full paragraph, line 1: insert "human" or "mammalian" after "cultured" p. 10, second full paragraph, line 3: delete "Although" and start sentence with "For reasons unknown, out..." p. 10, second full paragraph, lines 4-5: replace "in HEK293" with "to HEK293" and add a period after "present"; start new sentence on line 5 with "However, our data strongly..." p. 10, third line from bottom: fix typo in "Discussion" p. 11, line 9: replace "require" with "requires" p. 11, line 10: perhaps the authors mean to refer to "PI and PIP kinases" rather than "PIP and PIP kinases" p. 11, line 10: delete comma at end of line and replace with "and" p. 11, first full paragraph, lines 3 and 4: replace "PI4KIIIa" with "PI4KIIIalpha" (two instances) p. 11, first full paragraph, lines 5 and 7: the "m" in "Km" should be a subscript p. 11, first full paragraph, lines 8-9: add "the before "intracellular PI4P pool" and replace "slower" with "more slowly" and "recovers faster" with "recover more quickly" p. 11, first full paragraph, line 10: replace "loss" with "lose" p. 11, last paragraph, line 1: insert "a" after "such" p. 12, line 4: insert "is" after "PM PI4P" and delete "is" after "PI4P pool"; also replace "the transfer" with "this transfer" p. 12, line 5: delete "the before "intracellular"; delete the "is" after "(Dickson et al., 2014)" and replace with a comma"; replace "our data showing the loss" with "our data show loss" p. 12, first full paragraph, line 2: "interconversion" does not require a hyphen" p. 12, first full paragraph, line 5: why is PIP5K in red? p. 12, first full paragraph, line 6: replace "attracted" with "recruited" p. 12, first full paragraph, line 8: insert commas around "however"; also, "knockdown" does not require a hyphen p. 12, last paragraph, line 5: delete "the" before "PI4P" p. 12, 5th line from bottom: replace "are" with "is" p. 12, 4th line from bottom: start sentence with "In this regard," rather than "To this regard," p. 13, first paragraph, line 1: delete "the" and insert "a" before "large" p. 13, second paragraph, line 2: add comma after "localization" p. 13, second paragraph, line 3: delete "the" before "PM PI4P" p. 13, second paragraph, line 10: replace "is" with "are" before "supported" p. 13, third line from bottom: should read "PIP2" rather than "PIP" p. 13, second line from bottom: "must have" is awkward and unclear; perhaps change to "is predicted to have a" p. 13, last line; replace "polybasic proteins or" with "polybasic and" p. 14, first paragraph, line 1: delete "the" before "PM PI4P" p. 14, first paragraph, line 3: replace "domains" with "motifs" p. 14, first paragraph, line 6: insert "the idea" before "that Lgl" p. 14, first paragraph, last line: replace "or" with "and" p. 14, second paragraph, line 2: insert "of" after "plenty" p. 14, last paragraph, line 4: replace "While" with "Although" p. 14, last paragraph, line 6: delete "the" p. 16, line 5: insert comma after "chromosome" p. 16, line 7: insert "Additional stocks used were" p. 16, line 12: fix typo in David Bilder's name p. 16, line 14: italicize lgl::GFP p. 16, line 18: briefly explain what the pGU vector is and how it works p. 16, line 21: delete "the" before "young females"; also, the age of the females should be specified (newly eclosed? 1-3 days old? yeasted or not?) p. 16, 7th line from bottom: replace "ensures" with "ensure" p. 17, line 10: "overexposure" does not require a hyphen; add a comma after "necessary" p. 17, line 12: insert "in" before "visualizing" p. 17, line 14: reword to say "imaged live in a temperature-controlled chamber at 37{degree sign}C." p. 17, line 15: delete "which" after "serum" p. 17, lines 16-17: replace "washout by replaced with chamber with normal serum" with "washed out by replacing with normal serum" p. 17, 10th line from bottom: replace "were" with "was" p. 17, second line from bottom: change "ROIs" to "ROI" p. 18, line 5: replace "were" with "was" p. 19, line 4: replace "the follicular cells" with "follicle cells" p. 19, line 6: fix typo in "reoxygenation" (here and in legends to Figs 2-5, multiple instances) p. 19, line 14: no RNAi is shown in Fig 1, so delete "(WT/RNAi); also, for consistency with other figure legends (e.g., Fig. 3), suggest changing text here to: "P4Mx2, PLC-PH (n=18,18), pLC-PH, Lgl (n=20,20), P4M2, Lgl (n=20,20)" p. 19, 7th line from bottom: should read "hr:min:sec"*
        • 20, line 3: replace "follicular" with "follicle" p. 20, line 5: replace "regulates" with "regulate"; also, it is not clear what is meant by "targeting and retargeting" (would be simpler to replace with "localization") p. 20, Figs 4-6 legends: the numbers of samples examined in these experiments are missing (n=?) p. 20, 10th line from bottom: replace "follicular" with "of follicle" p. 20, 7th line from bottom: delete "changes" p. 20, third line from bottom: replace "shown" with "seen" p. 21, lines 6 and 8: replace "(B)" with "(A')" and "(C)" with "(B)" p. 21, line 11: replace "(B)" with "(A)" and "(C)" with "(B)" p. 21, lines 18 and 23: insert "cells" after "12" p. 21, line 25: bold "(C')" p. 22, line 3: incomplete phrase should be replaced with "regardless of the transient loss of PM PM PIP2" (or "...PM PLC-PH-RFP") p. 22, line 4: replace "PLC-PH::GFP" with "PLC-PH::RFP" p. 22, line 6: replace "express" with "expression" p. 22, line 10: should this be "P4Mx2::GFP"? p. 22, line 12: add comma before "PLC-PH::GFP" pp 23-26: for clarity, delete word "samples" in legends to Movies S2-S16 p. 23: fix typo in "Lgl::mCherry" in legends to Movies S2 and S3 p. 24: change "Movie S09" to "Movie S9" p. 26: change "Time intervals is" to "Time intervals are" in legends to Movies S17-S19 Reviewer#3’s suggestions to improve the figures and movies: - show single-color images in grayscale, which is easier to see on black and helpful for colorblind readers (applies to all figures except Fig. S3); movies and merged still images should be shown in green and magenta for colorblind (not sure if channels in movies are difficult to change)*

      We converted Fig. 1A to gray scale but found it visually inferior to the color version, as the gray images make the temporal differences between green and red channels less pronounced. We will change red channel in movies to magenta color but to convert red channels in all figures requires a very large amount of work to recapture and recrop all the frames used. We hope that reviewer understand our decision to keep colors in figures unchanged.

      - replace colored labels on black boxes with colored labels on white background (Fig. 5A (left), Fig. 5B-D (top), Fig. 6A (left), Fig. 7A-C (left), Fig. S1A (left), Fig. S1C (top), Fig. S2A (left), Fig. S4 (left))

      We have revised the figures accordingly.

      - provide scale bars throughout (Figs 2-7, S1-S4)

      We have revised the figures accordingly.

      - replace pale colored boxes under labels for "hypoxia" and "air" with slightly darker boxes (Fig. 1A-C, Fig. 2A, Fig. 3A, Fig. 5A', Fig. 6A', Fig. S2B)

      We tested many different combination of colors and the current set appears to give the best contrast so far. We decided to keep the original color.

      - provide vertical lines similar to those in Fig. 4A' in all of the time-course graphs and/or making the background colors slightly darker (Figs 2A', 3A', 5A', 6A'); also make the error bars darker (Fig. 1A'-C', Fig. 4, Fig. 5A', Fig. S2B)

      We revised all backgrounds in charts to make them similar to Fig. 4A’.

      - for consistency, label PM index graphs in Fig. 4 and Fig. S2 as Fig. 4A' and Fig. S2A'

      We revised figures 4 and S2 accordingly.

      - why are some of the PM index graphs labeled "PM index" and others labeled "PM index-1" on the Y-axis? this should be explained or changed for consistency

      The mixed use of “PM index” and “PM index-1” are relics due to different versions of software used throughout the project. We revised all graphs to make Y-axis “PM Index” label consistent.

      - "blank diamonds" described in figure legend for Fig. 7B' are barely visible when printed

      Revised Figure 7B’ by filling blank diamonds with grey color to increase their visibility.

      - Fig. 7C is mislabeled (MaLionR label should be replaced with PLC-PH-RFP)

      We corrected this error in revised Figure 7C.

      - in Fig. S3A, it would help to know the size of the cells (i.e., how many were present in the area examined)

      Revised the Figure S3A legend to clarify that each circle covers approximately three to four cells.

      - movies should be referred to in order (current Movies S7 and S8 should be renamed S6 and S7, and current Movie S6 should be renamed S8)

      We appreciate reviewer’s suggestion but decided to keep the movies in current order. This manuscript contains a large number of movies (total of 19) and to make them better organized we purposefully grouped movies based on the RNAi experiments (e.g. all three PI4KIIIa-RNAi movies are named consecutively). Although this makes the call out of two Lgl movies slightly out of order, we consider it a reasonable compromise for easier movie browsing for readers.

      - in Movie S19, "PLC-PH::RFP" is mislabeled "PLC-PH::GFP" (both P4MX2 and PLC-PH are labeled GFP in the movie)

      We renamed to movie file to correct this typo.

      Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reivewer#3 suggested the following movie order changes:

        • 6, second full paragraph, lines 6 and 9: the callouts should be to Movies S5, not Movie S4 p. 7, second paragraph, line 3: change name of Movie S7 to S6, and call out Movie S6 here p. 7, third paragraph, line 2: change name of Movie S8 to S7, and call out Movie S7 here*
        • 8, first paragraph, line 8: change name of Movie S6 to S8, and call out Movie S8 here* We appreciate reviewer’s suggestion but decided to keep the movies in current order. This manuscript contains a large number of movies (total of 19) and to make them more accessible we purposefully grouped movies based on the RNAi experiments (e.g. all three PI4KIIIa-RNAi experiment movies are named consecutively). Although this makes call out of two Lgl movies slightly out of order, we consider it a reasonable compromise for easier movie browsing for readers..

      We are regretful that we are unable to directly evaluate the RNAi knock-down efficiency of several genes such as PI4KIIIa. We have nonetheless been careful to draw the conclusions in the manuscript in accordance with the potential caveat of RNAi experiments. We did directly show that rbo-RNAi directly knocked down the Rbo::GFP (Figure S1C). In addition, although we could not confirm the knock-down of ttc7-RNAi, we showed it can reduced level of Rbo::GFP, which is likely due to an effective knock-down of TTC7 (Figure 6B).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Phosphatidylinositol phosphates (PIPs) are key determinants of membrane identity and regulate crucial cellular processes such as polarization, lipid transfer and membrane trafficking. Despite decades of study, surprisingly little is known about how levels of PIPs are regulated in response to cellular stress. Here, using Drosophila ovarian follicular epithelial cells and human HEK293 cells, the authors show that levels of plasma membrane (PM) PI4P and PIP2 decrease rapidly in response to hypoxia, resulting in loss of polybasic proteins from the PM. These effects are reversed in response to reoxygenation. Similarly, hypoxia leads to acute depletion of ATP levels, which also regenerate following reoxygenation. Using a combination of quantitative image analysis and genetic analysis, they show that PI4KIIIalpha and its binding partners Rbo/ EFR3 and TTC7 are needed to maintain PI4P and PIP2 at the PM under normal and hypoxic conditions, whereas the other two Drosophila PI 4-kinases, Fwd/PI4KIIIbeta and PI4KII, play a less important role in PM PIP homeostasis. Their results suggest that manipulations with indirect effects on PIPs (hypoxia, ATP depletion, ischemia) can have a profound impact on electrostatic charge at the PM, as well as downstream processes that require PM PI4P and PIP2.

      Major Comments:

      1. In general, the authors' conclusions are convincing. However, some of the results are less evident from the still images and graphs provided in the figures than from the movies that accompany the figures. Some suggested improvements are below.
      2. No additional experiments are essential to support the claims of the paper, although some additional quantitation would be helpful to the reader, as detailed below.
      3. Data and methods are generally presented in such a way that they can be reproduced, although some additional details would be helpful, as listed below.
      4. Experiments were adequately replicated, and statistical analysis appears adequate.

      Minor comments:

      1. Although the data are generally quantified quite well, there are two instances in the first full paragraph on p. 5 where this is not the case. First, PM PI4P is described as "oftentimes" as showing a transient increase in the early phase of hypoxia. However, this is not quantified. How often did this occur among the samples examined? How large is the transient increase when it occurs (Fig. 1A' error bars are not obvious on the colored background)? Second, the authors state that the P4Mx2-GFP puncta "often" became brighter after recovery. How often did this occur? No quantitation is provided.
      2. The authors conclude that "PI4KIIalpha and Fwd contribute significantly to the maintenance of PM PI4P" (bottom of p. 7), yet they did not validate their RNAi knockdowns of these two genes, so they do not know whether it is one or both of these PI4Ks that contribute.
      3. In Fig. 4B, a subset of the cells "show failed recovery of PM Lgl::GFP". However, some cells did recover. This average percentage of cells that recovered should be quantified, if possible.
      4. In Fig. 7A, B, the bottom cell in each example lags behind the top cell in recovery of the MaLionR sensor. The frequency of observed cells in each class for 7A, B should be quantified.
      5. In most cases, prior studies were referenced appropriately. However, two previous studies in Drosophila showing the effects of Sktl/PIP2 reduction on localization of polybasic proteins Lgl, Baz/Par-3 and Par-1 were not cited (relevant to the first paragraph of the Introduction, p. 3): Gervais et al., Development (2008), Claret et al. Curr Biol (2014). In addition, two studies showing the importance of Drosophila PI4KIIIalpha in synthesizing PM PI4P and PIP2 were not cited (relevant to the description of this enzyme, top of p. 6): Yan et al., Development (2011), Tan et al., J Cell Sci (2014). Data showing fwd null mutants are not lethal (relevant to top of p. 7) were published in Brill et al., Development (2000).
      6. For the most part, text and figures are clear and accurate. However, there are quite a few typos and grammatical mistakes, as well as instances of lack of clarity in the writing that should be addressed. In addition, there are a number of improvements to presentation of data that would make the figures easier to understand. These are listed below.
      7. Suggestions to improve presentation of data and conclusions are below.

      Suggestions to improve the text:

      p. 3, first paragraph, line 11, and second paragraph, line 2: combine references (Dong, Dong, Lu) with (Bailey and Prehoda, Hong)

      p. 4, 8th line from bottom: PLC-PH is generally referred to as PLCdelta-PH in the literature and should be defined this way the first time "PLC-PH" is used

      p. 5, 5th line from top: I believe the callout should be to Fig. 1A' rather than Fig. 1B

      p. 5, first paragraph, line 2: replace "oftentimes" with "often" and provide quantitation (see above)

      p. 5, first paragraph, line 4: delete "to" after "sensor"

      p. 5, first paragraph, line 6: the claim of "often" should be quantified (see above)

      p. 5, first paragraph, line 7: insert "the" before "intracellular"

      p. 5, second paragraph: the extent of recovery of Lgl is less when Lgl-RFP is coexpressed with PLC-PH-GFP, potentially due to titration of PIP2 by PLC-PH; the authors should comment on this

      p. 5, last line: the authors should provide information about the "targeted RNAi screen"; which genes were tested? did any others give relevant phenotypes? a table showing the results of the screen should be provided as supplementary information

      p. 6, first full paragraph, line 4: change "cells" to "cell"

      p. 6, second full paragraph, line 1: change "was" to "were"

      p. 6, second full paragraph, line 4: delete "ref" at beginning of line)

      p. 6, second full paragraph, lines 6 and 9: the callouts should be to Movies S5, not Movie S4

      p. 7, first paragraph, line 4: explain briefly (either here or in the Methods) the "newly published multi-RNAi tools" reported by Qiao et al., 2018

      p. 7, first paragraph, lines 9-10: suggest changing to "...(Fig. 3A,B); the latter...sensors being bound..."

      p. 7, second paragraph, line 3: change name of Movie S7 to S6, and call out Movie S6 here

      p. 7, second paragraph, line 5: replace "is" with "are"

      p. 7, third paragraph, line 2: change name of Movie S8 to S7, and call out Movie S7 here

      p. 7, last paragraph, line 2: "under normal conditions" is vague; replace "normal" with "aerobic" or "normoxic"

      p. 8, first paragraph, line 4: replace "normal" with "aerobic" or "normoxic" for clarity

      p. 8, first paragraph, line 8: change name of Movie S6 to S8, and call out Movie S8 here

      p. 8, last line: callout to Fig. 5A appears incorrect; should be Fig. S1A instead

      p. 9, first full paragraph, lines 4 and 5: callouts should be to Fig. 6A (line 4) and 6B (line 5)

      p. 10, second full paragraph, line 1: insert "human" or "mammalian" after "cultured"

      p. 10, second full paragraph, line 3: delete "Although" and start sentence with "For reasons unknown, out..."

      p. 10, second full paragraph, lines 4-5: replace "in HEK293" with "to HEK293" and add a period after "present"; start new sentence on line 5 with "However, our data strongly..."

      p. 10, third line from bottom: fix typo in "Discussion"

      p. 11, line 9: replace "require" with "requires"

      p. 11, line 10: perhaps the authors mean to refer to "PI and PIP kinases" rather than "PIP and PIP kinases"

      p. 11, line 10: delete comma at end of line and replace with "and"

      p. 11, first full paragraph, lines 3 and 4: replace "PI4KIIIa" with "PI4KIIIalpha" (two instances)

      p. 11, first full paragraph, lines 5 and 7: the "m" in "Km" should be a subscript

      p. 11, first full paragraph, line 6: what about PI4KIIIbeta? is the KmATP for this enzyme known?

      p. 11, first full paragraph, lines 8-9: add "the before "intracellular PI4P pool" and replace "slower" with "more slowly" and "recovers faster" with "recover more quickly"

      p. 11, first full paragraph, line 10: replace "loss" with "lose"

      p. 11, last paragraph, line 1: insert "a" after "such"

      p. 11, last paragraph, line 2: what is meant by "etc." is unclear; remove "etc." and include specific information related to what was reported in the literature (with proper references)

      p. 12, line 3: why do the authors claim that the intracellular pool of PI4P is first synthesized by PI4KIIalpha? what about PI4KIIIbeta? their results do not distinguish between these enzymes

      p. 12, line 4: insert "is" after "PM PI4P" and delete "is" after "PI4P pool"; also replace "the transfer" with "this transfer"

      p. 12, line 5: delete "the before "intracellular"; delete the "is" after "(Dickson et al., 2014)" and replace with a comma"; replace "our data showing the loss" with "our data show loss"

      p. 12, first full paragraph, line 2: "interconversion" does not require a hyphen"

      p. 12, first full paragraph, line 5: why is PIP5K in red?

      p. 12, first full paragraph, line 6: replace "attracted" with "recruited"

      p. 12, first full paragraph, line 8: insert commas around "however"; also, "knockdown" does not require a hyphen

      p. 12, last paragraph, line 5: delete "the" before "PI4P"

      p. 12, last paragraph, lines 6-7: for the reader, please clarity the mechanism that was invoked to explain how PIP5K can make PIP2 from PI in E. coli (Botero et al., 2019)

      p. 12, 5th line from bottom: replace "are" with "is"

      p. 12, 4th line from bottom: start sentence with "In this regard," rather than "To this regard,"

      p. 13, first paragraph, line 1: delete "the" and insert "a" before "large"

      p. 13, first paragraph, last line: cannot conclude that components of PI4KIIIalpha are "highly interdependent" without testing effect of knockdown of PI4KIIIalpha on Rbo and TTC7, etc.; instead, can conclude that the data are consistent with all of the components acting in the same process; also, delete "the" before "proper"

      p. 13, second paragraph, line 2: add comma after "localization"

      p. 13, second paragraph, line 3: delete "the" before "PM PI4P"

      p. 13, second paragraph, line 10: replace "is" with "are" before "supported"

      p. 13, third line from bottom: should read "PIP2" rather than "PIP"

      p. 13, second line from bottom: "must have" is awkward and unclear; perhaps change to "is predicted to have a"

      p. 13, last line; replace "polybasic proteins or" with "polybasic and"

      p. 14, first paragraph, line 1: delete "the" before "PM PI4P"

      p. 14, first paragraph, line 3: replace "domains" with "motifs"

      p. 14, first paragraph, line 6: insert "the idea" before "that Lgl"

      p. 14, first paragraph, last line: replace "or" with "and"

      p. 14, second paragraph, line 2: insert "of" after "plenty"

      p. 14, second paragraph, lines 3-5: expand on this idea; what additional lipids could be important here? are there examples of other proteins that require these additional lipids?

      p. 14, last paragraph, line 4: replace "While" with "Although"

      p. 14, last paragraph, line 6: delete "the"

      p. 16, line 5: insert comma after "chromosome"

      p. 16, line 6: explain in brief what "pNP plasmid" is and how the multi-RNAi method works (what promoters drive expression of the shRNAs, how many shRNAs are included in the plasmid, etc.)

      p. 16, line 7: insert "Additional stocks used were"

      p. 16, lines 8-3: appropriate references should be included for each stock where available

      p. 16, line 11: explain what UAS-AT1.03NL1 is

      p. 16, line 12: fix typo in David Bilder's name

      p. 16, line 14: italicize lgl::GFP

      p. 16, lines 16-17: Gerry Hammond should not be listed as providing these constructs if he is a coauthor on the manuscript; appropriate references should be cited for these constructs

      p. 16, line 18: briefly explain what the pGU vector is and how it works

      p. 16, line 21: delete "the" before "young females"; also, the age of the females should be specified (newly eclosed? 1-3 days old? yeasted or not?)

      p. 16, 7th line from bottom: replace "ensures" with "ensure"

      p. 17, lines 3-4: sentence fragment "Images were further" is not complete

      p. 17, line 10: "overexposure" does not require a hyphen; add a comma after "necessary"

      p. 17, line 12: insert "in" before "visualizing"

      p. 17, line 14: reword to say "imaged live in a temperature-controlled chamber at 37{degree sign}C."

      p. 17, line 15: delete "which" after "serum"

      p. 17, lines 16-17: replace "washout by replaced with chamber with normal serum" with "washed out by replacing with normal serum"

      p. 17, 10th line from bottom: replace "were" with "was"

      p. 17, second line from bottom: change "ROIs" to "ROI"

      p. 18, line 5: replace "were" with "was"

      p. 19, line 4: replace "the follicular cells" with "follicle cells"

      p. 19, line 6: fix typo in "reoxygenation" (here and in legends to Figs 2-5, multiple instances)

      p. 19, line 14: no RNAi is shown in Fig 1, so delete "(WT/RNAi); also, for consistency with other figure legends (e.g., Fig. 3), suggest changing text here to: "P4Mx2, PLC-PH (n=18,18), pLC-PH, Lgl (n=20,20), P4M2, Lgl (n=20,20)"

      p. 19, 7th line from bottom: should read "hr:min:sec"

      p. 19, lines 5-6 from bottom: title doesn't accurately reflect that PI4P doesn't appear to recover in WT control; why is this the case? recovery was observed in other experiments

      p. 20, line 3: replace "follicular" with "follicle"

      p. 20, line 5: replace "regulates" with "regulate"; also, it is not clear what is meant by "targeting and retargeting" (would be simpler to replace with "localization")

      p. 20, Figs 4-6 legends: the numbers of samples examined in these experiments are missing (n=?)

      p. 20, 10th line from bottom: replace "follicular" with "of follicle"

      p. 20, 7th line from bottom: delete "changes"

      p. 20, third line from bottom: replace "shown" with "seen"

      p. 21, lines 6 and 8: replace "(B)" with "(A')" and "(C)" with "(B)"

      p. 21, line 11: replace "(B)" with "(A)" and "(C)" with "(B)"

      p. 21, lines 18 and 23: insert "cells" after "12"

      p. 21, line 25: bold "(C')"

      p. 22, line 3: incomplete phrase should be replaced with "regardless of the transient loss of PM PM PIP2" (or "...PM PLC-PH-RFP")

      p. 22, line 4: replace "PLC-PH::GFP" with "PLC-PH::RFP"

      p. 22, line 6: replace "express" with "expression"

      p. 22, line 10: should this be "P4Mx2::GFP"?

      p. 22, line 12: add comma before "PLC-PH::GFP"

      p. 22, line 16: fix typo in "uncalibrated"; spell out what "AT[NL] sensor shows

      pp 23-26: for clarity, delete word "samples" in legends to Movies S2-S16

      p. 23: fix typo in "Lgl::mCherry" in legends to Movies S2 and S3

      p. 24: change "Movie S09" to "Movie S9"

      p. 26: change "Time intervals is" to "Time intervals are" in legends to Movies S17-S19

      Suggestions to improve the figures and movies:

      • show single-color images in grayscale, which is easier to see on black and helpful for colorblind readers (applies to all figures except Fig. S3); movies and merged still images should be shown in green and magenta for colorblind (not sure if channels in movies are difficult to change)
      • replace colored labels on black boxes with colored labels on white background (Fig. 5A (left), Fig. 5B-D (top), Fig. 6A (left), Fig. 7A-C (left), Fig. S1A (left), Fig. S1C (top), Fig. S2A (left), Fig. S4 (left))
      • provide scale bars throughout (Figs 2-7, S1-S4)
      • replace pale colored boxes under labels for "hypoxia" and "air" with slightly darker boxes (Fig. 1A-C, Fig. 2A, Fig. 3A, Fig. 5A', Fig. 6A', Fig. S2B)
      • provide vertical lines similar to those in Fig. 4A' in all of the time-course graphs and/or making the background colors slightly darker (Figs 2A', 3A', 5A', 6A'); also make the error bars darker (Fig. 1A'-C', Fig. 4, Fig. 5A', Fig. S2B)
      • for consistency, label PM index graphs in Fig. 4 and Fig. S2 as Fig. 4A' and Fig. S2A'
      • why are some of the PM index graphs labeled "PM index" and others labeled "PM index-1" on the Y-axis? this should be explained or changed for consistency
      • "blank diamonds" described in figure legend for Fig. 7B' are barely visible when printed
      • Fig. 7C is mislabeled (MaLionR label should be replaced with PLC-PH-RFP)
      • in Fig. S3A, it would help to know the size of the cells (i.e., how many were present in the area examined)
      • movies should be referred to in order (current Movies S7 and S8 should be renamed S6 and S7, and current Movie S6 should be renamed S8)
      • in Movie S19, "PLC-PH::RFP" is mislabeled "PLC-PH::GFP" (both P4MX2 and PLC-PH are labeled GFP in the movie)

      Significance

      Overall, the authors do a nice job of showing that hypoxia leads to previously unappreciated effects on levels of PM PI4P and PIP2, resulting in loss of PM association of proteins important for normal cellular physiology. This finding is quite novel. Moreover, the authors provide insight into the identity of the PI4Ks that are responsible for regenerating PM PIP2 following return to normoxia. Their analysis of the dynamics of these changes provides multiple interesting insights, including the potential roles of intracellular pools of PI4P in replenishing PM PIP2 and the observation that intracellular accumulation of PIP2 is occasionally observed in association with the appearance of intracellular PI4P puncta, suggesting a novel route for PIP2 replenishment in response to hypoxic stress. Their results will provide the basis for future studies examining the cellular mechanisms involved. This study will be of interest to those studying phosphoinositide biology as well as cellular responses to hypoxic stress and recovery, such as occur during ischemia and reperfusion. Reviewer expertise: Drosophila molecular genetics, cell biology, developmental biology, phosphoinositides, PIP pathway enzymes, PIP effectors

      Referees cross-commenting

      This session includes the comments of all reviewers.

      Reviewer 3

      I agree with reviewer #1 that the authors did not do a good job of clarifying what they and others had previously shown, and I must confess I didn't carefully examine their previous papers carefully enough before preparing my review. In fact, they previously showed that hypoxia affects localization of Dlg at the plasma membrane and that its recovery depends on PI4KIIIalpha and PIP2 (Lu et al., Development 2021). This is in addition to their previous data showing effects of hypoxia on Lgl (Dong et al., J Cell Biol 2015). Thus, less of the information in the current manuscript is novel than I thought when I initially read it.

      I also agree with reviewer #2 that they need to do a better job of citing the relevant literature and considering the possibility that hypoxia and reduced levels of ATP might affect many different enzymes. In addition, as suggested by reviewer #1, it seems important

      Reviewer 1

      I agree with what Reviewer 3 is suggesting and with reviewer 2 that the authors should do a better job of citing all of the relevant literature. I also appreciate the detailed edits provided by Reviewer 3 - it was very generous of them to do this.

      Reviewer 2

      The points raised by reviewer 1 and 3 with regard to the citing or prior work (from the authors or other labs) also applies to their citing of literature on PI and PI4K signalling. Here too citing or prior work has been less than satisfactory making it difficult to do this.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes the effect of hypoxia on the levels of PI4P and PI45P2 , two key PPIs that are enriched on the inner leaflet of the plasma membrane. These PPIs are synthesized by the sequential phosphorylation of PI by a PI-4 kinase and subsequently a PI4P 5 kinases, both of which use ATP. The relevant PI-4 kinase at the plasma membrane, PI4KIIIa has been conclusively identified previously in mammalian cells by the DeCamilli lab (Nakatsu et.al JCB 2012) and its role in regulating the synthesis of PI4P and PI(4,5)P2 in two Drosophila cell types in vivo shown by two previous studies. Balakrishanan et.al J.Cell Sci 2018 (photoreceptors during PLC signalling) and Basu et.al Dev.Biol 2020 ( in multiple larval cell types ). PI4KIIIa has been shown to exist as a complex of the enzymatic polypeptide, EFR3 and TTC7. The studies by Nakatsu, Balakrishnan and Basu have shown the importance of the complex subunits is regulating PI4P and PI45P2 levels in cultured mammalian cells and Drosophila cell types in vivo.

      In the present study, Lu et. al build on their previous work showing that the polarity protein Lgl undergoes hypoxia induced translocation. They show that hypoxia also induces loss of PI4P and PI45P2 at the plasma membrane in these cells correlated with loss of Lgl localization to the PM. The manuscript then goes on to establish the requirement of the PI4KIIIa complex in regulating Lgl localization as well as PI4P and PI45P2 levels at the plasma membrane during hypoxia and the subsequent recovery of these at the plasma membrane.

      The strength of the manuscript is twofold.

      • (i) The work is done to a high technical standard and the investigators have carried out the measurements of LGL localization, PI4P and PI45P2 levels along with simultaneous measurements of ATP levels in vivo. The work would be strengthened further if the authors could show the level of depletion of PI4K isoforms or PI4KIIIa complex subunits units induced in ovarian tissue under their experimental conditions by the GAL4 drivers used in this study. This is not a persnickety detail as RNAi lines can have very different effectiveness in Drosophila ovarian tissue compared to other fly cell types. This point is, in particular, important in cases where an RNAi line is being used and the conclusion is a lack of impact on a phenotype being studied.
      • (ii) A second strength is that the authors now illuminate a further in vivo cell type where the function of the PI4KIIIa complex in regulating PI4P and PI45P2 levels. This adds to the earlier work of Nakatsu, Balakrishnan and Basu.

      A key difficulty with the current story is the lack of specificity of the phenotype they demonstrate under hypoxia. Of course, hypoxia is expected to deplete cellular ATP levels but PI4KIIIa is not the only enzyme that this lack of ATP will impact. There will be dozens or more other kinases, both protein and lipid kinases whose function will be impacted by the drop in ATP levels. Therefore, it is hard to attribute a specific/particular role to the PI4KIIIa complex under these conditions. The mislocalization of LGL::mCherry while correlated with PI4P and PI45P2 levels at the plasma membrane may be just that- a correlation. It is quite possible, indeed likely, that the mislocalization of LGL-mCherry under hypoxia conditions is due to the reduction of the activity of another lipid or protein kinase due to the drop in ATP levels due to hypoxia (PKC is a possibility too).

      Minor comments:

      The authors must reference all published work on the PI4KIIIa complex in the literature. Some of it is excluded in the present version

      The Drosophila work, particularly cell types used, etc are not accessible to people who are not fly experts. This should be done.

      Significance

      Adds to knowledge on the PI4KIIIa complex. Builds on existing knowledge in the PI4KIIIa field and maybe also cell polarity field.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript takes a closer look at how hypoxia affects the accumulation of PI4P and PI4,5P2 (PIP2) in the plasma membrane of Drosophila ovarian follicular epithelial cells and how ATP depletion similarly affects the localization of the same phospholipids in HEK293 cells. They demonstrate that hypoxia results in the reversible loss of plasma membrane (PM) association of both lipids, with PIP2 disappearing ahead of PI4P, and recovering more slowly than PI4P when oocytes are returned to normoxia. They also show that the intracellular vesicular pools of PI4P are depleted ahead of the PM pools and the PI4P recovery occurs first in PM, then in the vesicles. They show that the disappearance and recovery of the polarity protein Lethal giant larvae (Lgl) parallels that of PIP2 during hypoxia and subsequent normoxia, with a very slight delay. The authors then go on to show the RNAi knockdown of the PM enzyme (PI4KIIIa) that phosphorylates PIP delays the recovery of PI4P at the membrane, with recovery first occurring in the vesicular pools. This knockdown also delays the recovery of PIP2 and, as with recovery of PI4P, the recovery of PIP2 now occurs first in vesicular pools. Lgl recovery follows that of PI4P and PIP2 with RNAi knockdown of PI4KIIIa. The knockdown of all three of the enzymes that phosphorylate PIP to generate PI4P delays recovery of PI4P, PIP2 and Lgl at the membrane even more. The authors show that proteins required for the PM localization of PI4KIIIa have similar effects on the recovery of PM PI4P, PIP2 and Lgl (with delays and recovery of vesicular pools before PM pools). Independently, the authors show that ATP depletion in HEK293 cells result in similar reversible depletion of PI4P, PIP2 and Lgl from the PM. From these studies and their previous findings, the authors conclude that pools of PI4P and PIP2 are likely rapidly turned over in the membrane even during normoxia and that this rapid recovery is dependent on the PM localized enzyme that phosphorylates phosphoinositol.

      Major comments:

      Overall, the data are beautifully presented; it is quite helpful to have a video of each experimental treatment showing the corresponding response of all three molecules that are being monitored. Signal quantification over time is carefully documented. With the exception that a link between hypoxia and depletion of ATP has not been demonstrated here, the key conclusions are convincing. However, as pointed out below (in the significance section), some of the major points have already been published by this group. Their conclusion that hypoxia induces acute and reversible reduction of cellular ATP levels (which are then proposed to affect the activities of the enzymes required for PI4P and, consequently, PIP2 production) was not shown. They did demonstrate that acute depletion of ATP had the same consequences on PM phospholipids as acute hypoxia (in HEK293 cells). And, indeed, it makes sense that hypoxia could affect enzymes required for ATP synthesis, but the authors would have to show that acute hypoxia results in acute reduction in cellular ATP pools to make the links they suggest. This is something they should be able to do in the HEK293 cells now that they have their ATP sensor. Just to note, this group did show that hypoxia can reduce levels of ATP in Drosophila oocytes in their previous paper (Dong et al., 2015, Figure S3), but it is unclear if this is reversible and happens in the time frame of the experiments presented in this current manuscript.

      My suggestions are the following:

      1. The authors need to make it absolutely clear what was already known, including the following: (A) hypoxia reversibly affects PM pools of PI4P, PIP2, and Lgl (and other membrane associated proteins), (B) that hypoxia can affect ATP levels in Drosophila oocytes (although these previous studies do not show anything about the dynamics) and (C) that reducing ATP levels affects PM pools of PI4P, PIP2 and Lgl.
      2. They should demonstrate that acute hypoxia and return to normoxia has acute and reversible effects on cellular ATP levels - they now have the tools to do this, at least in HEK293 cells.

      Minor comments:

      The manuscript is too long and the discussion unnecessarily repeats everything already presented in the results. The authors should find a way to streamline the discussion.

      N values should be given for all figures and experiments, and the N=23/24 versus N=24/24 needs to be explained the first time it is used.

      There are a few mismatches in terms of plural nouns and singular verbs and vice versa sprinkled into the manuscript, so some careful editing would be useful.

      Significance

      I was initially quite excited about the novelty of their findings and the potential insight into the dynamics of PM pools of the two phospholipids that are critical to cell polarity and that play important signaling roles. However, at least a subset of their conclusions were either published in their earlier work or do not necessarily follow from what they have done in this manuscript. Their statement that hypoxia in Drosophila induces acute and reversible depletion of PM PI4P and PIP2 was presented in a previous publication (See Figure 8 of Dong et al., 2015).

      This manuscript would appeal to an audience interested in the mechanisms of cell polarity and phosphoinositide signaling.

      I am a Drosophila developmental geneticist quite familiar with the topics that this paper addresses.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Major comments:

      Are the key conclusions convincing?

      We discuss 4 key conclusions.

      __# 1 __A PRC of the segmentation clock was constructed.

      Although the authors have produced an interesting phase map, the regulation function F(\phi) of the circle map does not give the phase response curve (PRC) (Hoppensteadt & Keener 1982, Guevara & Glass 1982). This holds only when the system is stimulated with very short pulses (ideally Dirac delta), but the experimental pulses here are a quarter of the intrinsic period.

      There are several definitions of the PRC (Dirac pulses PRCs, linear PRCs, etc.). We use the general definition from Izhikievich, 2007: “In contrast to the common folklore, the function PRC (θ) can be measured for an arbitrary stimulus, not necessarily weak or brief. The only caveat is that to measure the new phase of oscillation perturbed by a stimulus, we must wait long enough for transients to subside“.

      The corresponding equation from Izhikievich (section 10.1.3) is

      PRC(θ)= θ_new-θ

      which is equivalent to our Equation 1.

      Hence, the key assumption we make is that after perturbing the system, we are back on the limit cycle as pointed out by Izhikievich. We think this is a reasonable assumption, because the perturbation we impose is relatively weak, despite pulsing for almost one quarter of the intrinsic period. The concentrations of DAPT we used in this current study are just enough to elicit a measurable response, and further lowering the concentration does not result in entrainment within our experiment time (0.5uM, Figure S7B in submitted version of the manuscript). Additionally, we previously reported that periodic pulsing with 2uM DAPT did not result in change of the Notch signaling activity with respect to control samples (Sonnen et al., 2018). Along similar lines, the DAPT drug concentrations we used are much lower compared to what has been used in previous studies aiming to perturb signaling levels, e.g. 100uM and 50uM used in study of segmentation clock in zebrafish embryos (Özbudak and Lewis, 2008 and Liao et al., 2016, respectively), and 25uM used in study of the segmentation clock in mouse PSM cells (Hubaud et al., 2017). Combined, we reason that we apply weak perturbations that allow to extract the PRC of the segmentation clock during entrainment. Additional evidence that indeed we have revealed a meaningful PRC is provided below, please see our response to point #3.

      __# 2 __Furthermore, in eq. 1 T_ext must be the winding number, and the modulus must be in units of

      phase, either one or two pi, for the circle map to be correct. Thus, calling the measured response of the system a PRC is not convincing.

      We thank the reviewer for pointing this out. We indeed rescaled everything to express the PRC in units of phase. We made this more explicit and updated equations throughout the text.

      __# 3 __The system is being entrained. Technically, It would also be easier to get the stroboscopic maps

      in the quasi-periodic regime since all the points in the circle will be sampled. Since no quasi-periodic response was demonstrated, the claim of entrainment is not convincing.

      While, in principle, PRC can be indeed obtained from responses in the “quasi-periodic” regime, such an approach is, in practice, challenging due to the intrinsic noise. The closest approximation to this is the phase response after the first pulse, that we reproduce below and compare to our inferred PRC, where we indeed clearly see a high noise level. Nevertheless, also the PRC based on the 1st pulse is in agreement with the PRC we derived from the entrainment data.

      In the entrained regime, one can get a much more reliable estimate of the phase response despite the noise. The level of noise in the stroboscopic map lowers as the samples approach entrainment (Figure S12), and the entrainment phase itself is a reliable statistical quantity that can be used to infer regions of the PRC as the detuning is varied.

      In addition, and maybe even more importantly, we identify several key features characteristics of entrainment, such as the change of entrainment phase as a function of detuning (Figure 7, Figure S6-S7 in submitted version of the manuscript) and the dependency of the time to entrainment as a function of initial phase (Figure 6). While additional features can be linked, in theory, with entrainment, i.e. period-doubling, higher harmonics (Figure 5), quasi-periodicity, we do not agree with the reviewer that all of these need, or in fact, can be found in the experimental data, in particular because of the influence of the noise. Conversely the positive experimental evidence that we provide for the presence of entrainment, combined with the theoretical framework we develop, justifies, in our view, the conclusions we make.

      __# 4 __The response of the system to external pulses is compatible with a SNIC. This is compatible, but

      it is equally compatible with other explanations. Assuming that the PRC is the same as the regulation function F(\phi), the PRC in Kotani 2012 (PRL 2012 fig. 3C) would be a similar shape as that shown by the authors. Similar models to that in Kotani et al., have been studied, but a SNIC has not been found (an der Heiden & Mackey 1982). It is relatively straightforward to construct a phenomenological model with a SNIC, but having underlying biological insight is not guaranteed. No argument for choosing a SNIC is given, so this emphasis of the paper is not convincing.

      It is true that the mapping of PRCs to oscillators is undetermined, in the sense that many systems could potentially give rise to similar PRCs. That said, there is value in parsimonious models, which often generalize very well despite their simplicity. This explains why in neuroscience, constant sign PRCs are generally associated with SNIC. There is a mathematical reason for this : 1-D oscillators with resetting (such as the quadratic fire-and-integrate model) are the simplest models displaying constant sign PRCs, and are the “normal” form for SNICs. In other words, SNIC bifurcations are among the simplest ones compatible with constant sign PRCs, and we think it is informative to point this out. In our manuscript, we go one step further by actually fitting the experimental PRC with a simple, analytical model that allows us to compute Arnold tongue for any values of the perturbation (contrary to more complex models).

      Other models such as Kotani 2012 can display similar PRC shapes, but they are of mathematically higher complexity, and furthermore it is not clear how such systems might behave when entrained. For instance that model in particular uses delayed differential equations, and as such contains long term couplings, so that a perturbation might have effects over many cycles, which is not consistent with the hypothesis we here make of a relatively rapid return to the limit cycle. Furthermore, for more complex models, PRCs are analytical only in the linear regime, while our model is analytical for all perturbations. That said, we agree that other types of oscillators can be associated with constant sign PRCs, and we have given more details in this part, in particular we better emphasize the Class I vs Class II oscillators as a way to broaden our discussion on PRC, and emphasize the “infinite period” bifurcation category which is more intuitive and further includes saddle node homoclinic bifurcations.

      __# 5 __The work demonstrates coarse graining of complex systems.

      This conclusion is correct, but coarse graining theory-driven analysis and control of dynamical systems has been established for many years. What is new here is that it is applied specifically to the in vitro culture system of the mouse segmentation clock.

      We agree it is new to successfully apply coarse-graining analysis and, importantly, control, to the in vitro culture system of the mouse segmentation clock. We also agree that such an approach has been pioneered and established for many years, especially in (theoretical) physics, but indeed, the key question is whether and how this can be applied to complex biological systems. Insights coming from theoretical considerations on idealized physical systems might not necessarily apply to biology, as already pointed out by Winfree.

      There are still very few examples in biology with coarse graining similar to what we do here. We think there is immense value in demonstrating that quantitative insights, and control of the biological systems, can be obtained without precise knowledge of molecular details, which is still counter-intuitive to many biologists. In this sense, we think our report will be of interest to both colleagues within the field of the segmentation clock and also to anyone interested to in the question, how theory and physics guided approaches can enable novel insight into biological complexity.

      Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Following on the points above, each of these needs to be corrected or re-done, and/or the conclusions need to be modified accordingly.

      We have modified the manuscript in response to all those points.

      # 6 Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation. If the authors wish to make the strong claim of determining a true PRC, Dirac delta-like perturbation needs to be applied, or approximated by short time duration pulses compared to the intrinsic period.

      Please refer to our response to point #1 and #3..

      # 7 *Are the suggested experiments realistic in terms of time and resources? It would help if you could *

      add an estimated cost and time investment for substantial experiments.

      It's not clear to this reviewer if it is feasible to deliver a very short pulse and record a response. But this may not be relevant, see above.

      Please refer to our response to point #1 and #3 .

      Are the data and the methods presented in such a way that they can be reproduced?

      Yes.

      Are the experiments adequately replicated and statistical analysis adequate?

      Yes.

      Minor comments:

      Specific experimental issues that are easily addressable.

      No issues.

      Are prior studies referenced appropriately?

      Yes.

      # 8 Are the text and figures clear and accurate?

      Figure 1D illustrates how a PRC should be obtained, but doesn't show the experimental protocol applied in the paper.

      Figure 1D is a general introduction on the phase description of oscillators and phase response. It demonstrates how a perturbation can change the phase and is not supposed to represent the experimental protocol. We describe how data are analyzed and how phases are extracted in Supplementary Note 1.

      __# 9 __In Figure 5B, 10 uM DAPT, the traces are already synchronized before the pulse train starts,

      which makes the subsequent behavior difficult to interpret.

      It appears here that by chance, the samples were already almost synchronized. We notice however that the establishment of a stable rhythm with the pulses (which here is not a multiple of the natural period) supports entrainment, and is already evident when looking at the timeseries with respect to the perturbation. The temporal evolution of the instantaneous period further confirms this, showing a change in period close to ½ zeitgeber period (which is very different from the natural period of ~140 mins). This also relates to point #35, in reply to both comments we have further expanded this figure to better show the 2:1 entrainment, adding statistics on the measured period and period evolution for a zeitgeber period of 300 mins.

      # 10 Do you have suggestions that would help the authors improve the presentation of their data and Conclusions? The text includes several paragraphs reviewing broad principles of coarse graining and making general conclusions. This is confusing, because, as mentioned above, there is no new general advance in this paper. The interesting contributions here are specific to the applications to the segmentation clock, and the text should be focused on this aspect.

      As commented above for #3 , we respectfully disagree that there is no “new general advance” in this paper. It is far from obvious that a complex ensemble of coupled oscillators implicated in embryonic development would be amenable to such coarse-graining theory. Of note, we still do not have a full understanding of neither the core oscillators in individual cells, nor what slows these down and eventually stops the oscillations, and multiple recent works suggest that both phenomena are under transient nonlinear control (e.g. our own work in Lauschke 2013). It is remarkable that despite this lack of detailed mechanistic insight, general entrainment theory can be applied to the segmentation process at the tissue level. We further show that classical entrainment theory alone is not sufficient to account for the experimental findings. Specifically, we need to account for a period change that we interpret as an internal feedback, an insight that would be impossible without our coarse-graining approach. While the results might of course be specific to the segmentation process, we think our approach motivated by coarse-graining theory and leading to new insights into the process is of general interest. We tried to make these points explicit in our conclusion.

      Reviewer #1 (Significance (Required)):

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Description of the complex mouse segmentation clock in terms of a simple model and its PRC is an interesting, original and non-trivial result. The proposal that the segmentation clock is close to a SNIC bifurcation provides a consistent dynamical explanation of slowing behavior that has been recognized for some time, but not fully understood. This proposal also raises a hypothesis about the behavior of the underlying molecular regulatory networks, which may be tested in the future. The increase or decrease of the intrinsic period due to the zeitgeber period is not expected from theory, pointing to structures in internal biochemical feedback loops, an idea which again may be tested in the future. Also surprising from a theoretical perspective, the spatial gradient of period in the system persisted after entrainment. Although the categorization of the generic behavior is interesting, by its nature there is little from this that might give a typical developmental biologist any conclusions about pathways or molecules. The successes and limits of the theoretical description do nevertheless focus future attention on interesting behaviors.

      # 11 Place the work in the context of the existing literature (provide references, where appropriate).

      Such an analysis of the segmentation clock is based strongly on the experimental system and results in Sonnen et al., 2018, and goes well beyond it in terms of the dynamical analysis. It provisionally categorizes the mouse segmentation clock as a Class I excitable system, allowing its dynamics at a coarse grained level to be compared to other oscillatory systems. In this aspect of simplification, it is similar to approach of Riedel-Kruse et al., 2007 who used a mean-field model of oscillator coupling to explain the synchrony dynamics observed in the zebrafish segmentation clock in response to blockade of coupling pathways, thereby allowing a high-level comparison to other synchronizing systems.

      It is interesting the reviewer sees similarities with the work of Riedel-Kruse et al, which uses a mean-field variable Z that corresponds to a classical approach, as described in Pikovsky’s textbook, to quantify synchronization of oscillators. In our view, while of course we work in the same context of coupled oscillators in the PSM, our approach based on perturbing and monitoring the system’s PRC in real-time provides a novel strategy to gain insight. This is evidenced by the fact that our quantifications of synchronization and insight into the PRC is the basis to exert precise control of the pace and rhythm of segmentation.

      State what audience might be interested in and influenced by the reported findings.

      Developmental biologists, biophysicists

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Developmental biology, somitogenesis, dynamical systems theory, biophysics, cell signaling


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary: This is a beautifully elegant study that tests how previously published theoretical predictions about entraining nonlinear oscillators applies to a biological oscillator, the segmentation clock. The authors use a combination of state of the art experimental techniques, signal processing and analytical theory to reach a series of interesting and novel conclusions.

      They show that the segmentation clock period can be entrained through Notch inhibitor (DAPT) pulses acting as an external clock (referred to as zeitgeber) using a previously developed and sophisticated microfluidic perfusion system. Pulsing DAPT every 120 to 180min can change the internal clock period while entrainment beyond this range leads to higher order coupling to the zeitgeber period, i.e. entrainment of every other pulse. They then perform entrainment experiments where the concentration of DAPT is changed to elicit a change in the strength of interaction between the internal clock and the external stimulus (referred to as zeitgeber strength); interestingly at low strength response to entrainment is more variable leading to entrainment occurring in some samples while others remain unaffected (Figure 4A); overall, higher concentration leads to faster entrainment (Figure 4C). The experimental data is then analysed using stroboscopic maps to reveal that a stable entrainment phase shift is achieved between the internal clock and the external zeitgeber. Phase response curve (PRC) analysis indicates that the system response is not sinusoidal but predominantly characterised by negative PRC, a behaviour consistent with saddle-node on invariant cycle (SNIC); it also reveals that the intrinsic period changes in a non-linear way and that this effect is reversible when external stimulation stops. Finally, a theoretical model is proposed to represent the segmentation clock as a dynamical system; this is based upon Radial Isochron Cycle with Acceleration (ERICA), an extension motivated by the PRC analysis results which are incompatible with a Radial Isochron Cycle (RIC); this model has predictive capability and could be used to design new control strategies for entrainment of the segmentation clock.

      This study makes a series of key conclusions which are of particular importance in understanding the dynamic response of a biological oscillators. Firstly, given it's the characteristics of the dynamic response to entrainment, the segmentation clock is likely close to a SNIC bifurcation and this can explain the tendency for relaxation of the period over time. Secondly, the clock period was changed in a non-linear way in the direction of the zeitgeber period, a finding which is interpreted to indicate the presence of feedback of the segmentation clock onto itself, potentially via Wnt. This makes an excellent prediction that if tested experimentally would greatly improve the impact of the study. It is also noted that the entrainment of the segmentation clock does not abolish spatial periodicity and phase wave emergence suggesting that single cell oscillators can adjust to periodic perturbation while maintaining emergent properties. This is also a significant result that would need to be followed up with experiments and computation however would be best suited to a separate study.

      Major comments:

      __# 12 __The coarse graining is a major point that would need to be clarified since the rest of the analysis

      and theoretical modelling in the paper flow from this. Firstly, the interpretation of the schematic in Figure 1A on experimental data collection is not immediately obvious to the reader, lacks a clear flow between the different panels or steps (which could be numbered for example) and does not have a legend to indicate the different colour mapping.

      We are grateful to the reviewer for this comment. We have implemented in Figure 1A all the changes suggested by the reviewer: we numbered the different steps and have added a colour mapping. In addition we have rephrased the caption of Fig 1A to better connect the experimental steps.

      __# 13 __Secondly, Figure 2A which explicitly addresses coarse graining is not clear enough. Is the

      message here that by excluding the inner parts of the sample with a radial ROI, a similar dynamic response is observed over time?

      Yes, indeed this is the point and we have adjusted the figure and text to explain this better. Our goal is to focus on the quantification of segmentation pace and rhythm. This is best captured by reporters such as LuVeLu, which has maximum intensity in regions where segment forms, and which dynamics is known to be strongly correlated to segmentation (Aulehla et al., 2007; Lauschke and Tsiairis et al., 20132). The global ROI is thus expected to precisely capture these segmentation and clock dynamics and we have now included more validation data and have also edited the text to make this very important point clearer:

      “To perform a systematic analysis of entrainment dynamics, we first introduced a single oscillator description of the segmentation clock. We used the segmentation clock reporter LuVeLu, which shows highest signal levels in regions where segments form \cite{Aulehla_A_2007}. Hence, we reasoned that a global ROI quantification, averaging LuVeLu intensities over the entire sample, should faithfully report on the segmentation rate and rhythm, essentially quantifying 'wave arrival' and segment formation in the periphery of the sample.”

      Figure 2A indeed shows that the dynamics (from the timeseries) is very similar when considering the entire field of view (global ROI) or when considering only the periphery of the 2D-assay (excluding central regions). We modified Figure 2A to clarify this point by indicating each measurement as either global ROI or global ROI minus the diameter of the excluded circular region (e.g. global ROI - 50px). We also emphasized in the caption that timeseries are obtained using global ROI, unless otherwise specified. We included a link (https://youtu.be/fRHsHYU_H2Q) in the caption to a movie of 2D-assay subjected to periodic pulses of DAPT (or DMSO) and corresponding timeseries from global ROI.

      Since the inner part of the sample corresponds to the posterior side how do we interpret similarities and differences between signals with different ROIs?

      As stated above, the global ROI measurements essentially capture the signal at the periphery where segments form and faithfully mirrors segmentation rate and rhythm. We have now included a comparison to the center ROI, also in response to reviewer’s comments, see our response #34.

      The result shows that the period and PRC in the center matches the one found in the periphery, i.e. global ROI. We have shown previously that center and periphery differ in their oscillation phase by 2pi, i.e., one full cycle (Lauschke et al., 2013). We interpret these findings as confirmation of our analysis strategy, i.e. the global ROI allows a very reproducible, unbiased quantification that reports on segmentation clock and period.

      __# 14 __A quantitative analysis of essential coarse-grained properties such as period and amplitude

      should be performed for different ROIs and across multiple samples. As this effectively masks any spatial differences, limitations of this approach should be clearly stated in the Discussion. For example in lines 466-470 where it is difficult to interpret the slowing down tendency and relate back to single cell level.

      As outlined in our response to comment #13 and also #34, we chose an analysis that allows to determine the segmentation pace and rhythm, i.e. segment formation, which is well captured by LuVeLu signal and a global ROI analysis. We agree that a spatially resolved analysis of dynamic behaviour is important (and indeed a gradient of amplitude might be relevant in such context), but we think this is beyond the scope of the current study focused on the system level segmentation clock behaviour. We have revised the discussion as suggested by the reviewer to make this point approach and the need for future studies clearer.

      __# 15 __The functional characterisation of the sample using LFNG, AXIN2 and MESP2 is unclear. The

      images included in Figure 2D representing expression observed when tissue explants are grown within the microfluidic chip are difficult to interpret and would require a more detailed description of anterior-posterior, pillars etc; it is also difficult to view the bright-field since it is presented as a merged image.

      It is particularly difficult to see the somite boundaries for the same reason. In lines 113-117 the authors state that the global oscillation period matches the periodic boundary formation. How do we reach this conclusion from these images? What is the variability between samples?

      If these two issues would be addressed it would increase confidence in the coarse graining argument and thus would strengthen the importance of the findings in the study.

      We thank the reviewer for this feedback, and we have added more quantifications to address this point directly in the modified Figure 2. Importantly, we added the quantification of the rate of segmentation in multiple samples based on segment boundary formation (new Figure 2D) and compared this to the global ROI quantifications using the reporter lines LuVeLu. This data provides clear evidence that the quantification of global ROI reporter intensities closely matches the rate of morphological segment boundary formation. In addition, we show that segment formation and also Wnt-signaling oscillations (Axin2-Achilles) and the segmentation marker Mesp2 (Mesp2-GFP) are all entrained to the zeitgeber period. We have also revised the text to clarify this important validation of our quantitative approach.

      In addition, we provide, in the revised Figure Suppl. 2, details of entrained samples, focusing on the segmenting regions. The brightfield and reporter channels were separated, emphasizing the segment boundaries and the expression pattern of the reporters. For ease of visualization, these samples were also re-oriented so that the tissue periphery (corresponding to anterior PSM) is at the top while the tissue center (corresponding to the posterior PSM) is at the bottom. This now additionally better shows the localization of the different reporters with respect to the segment boundary. We also included supplementary movies showing timelapse of samples expressing either Axin2-GSAGS-Achilles or Mesp2-GFP that were subjected to periodic DAPT pulses, with their respective controls.

      Several minor points could be addressed to improve the manuscript and are listed below:

      # 16 Figure 1 A the colormap and axes for the oscillatory traces should be defined

      We thank the reviewer, and we have modified the figure accordingly (related to point # 12). A colormap and axes for the illustrated timeseries are now included.

      # 17 Strength of zeitgeber is not defined and there is no analytical expression provided; how does it

      relate to DAPT concentration? Is the fact that low DAPT concentration corresponds to weak strength expected or is it a result?

      Zeitgeber strength generally refers to the magnitude of the perturbation periodically applied to an oscillator. With DAPT pulses, our expectation was that both the duration of the pulse and the drug concentration could influence the strength. Practically, the pulse duration was kept constant for all experiments and the concentration was varied. We thus expected that DAPT concentration would indeed be correlated to zeitgeber strength. We have discussed multiple evidence supporting this assumption in the main text, and this is indeed a result. In particular, as explained in the section “The pace of segmentation clock can be locked to a wide range of entrainment periods”, higher DAPT concentration gives rise to faster and better entrainment, as expected from classical theory. In the context of Arnold tongue, weaker zeitgeber strength corresponds to narrower entrainment region, which is experimentally observed (Fig 8F, showing regions where the clock is entrained).

      From a modelling standpoint, Zeitgeber strength corresponds to parameter A which is the amplitude of the perturbation. Possible zeitgeber strength was inferred from the model by matching the experimental entrainment phase with that obtained from the model isophases. As explained in Supplementary Note 2, we tested four concentrations of DAPT (0.5, 1, 2, and 3 uM) respectively corresponding to A values of 0.13, 0.31,0.43, 0.55. As we can see, those A values are not linear in DAPT concentrations, which is expected since multiple effects (such as saturation) can occur.

      __# 18 __In some figures it looks like the amplitude of oscillations may change with DAPT concentration

      and hence zeitgeber strength? Is this expected?

      We have not systematically analyzed the amplitude effect and have, intentionally, focused on the period and phase readout as most robust and faithful parameters to be quantified. Regarding the amplitude of LuVeLu reporter, we are cautious given that it is influenced, potentially, by the (artificial) degradation system that we included in LuVeLu, i.e. a PEST domain. This effect concerns the amplitude, but not the phase and period, explaining our strategy.

      That said, we agree with the referee that DAPT concentrations might change the amplitude of oscillations. Such change could even play a role in the change of intrinsic period (in fact a similar mechanism drives overdrive suppression for cardiac oscillators, Kunysz et al., 1995). But since the change of period can be more easily measured and inferred, we prefer to directly model it instead of introducing a new hypothesis on amplitude/period coupling, at least for this first study of entrainment.

      __# 19 __Figure 2A including the black area creates confusion and it is unclear which ROI is used in the

      rest of the study; consider moving this to a supplementary figure perhaps

      We thank the reviewer for this feedback (related to point #13), and we have modified the figure accordingly. As we responded to point # 13: We modified Figure 2A, by indicating each measurement as either global ROI or global ROI minus the diameter of the excluded circular region (e.g. global ROI - 50px). We also emphasized in the caption that timeseries are obtained using global ROI, unless otherwise specified.

      __# 20 __What type of detrending is used in Figure 2 and throughout (include info in the figure legend)?

      We used sinc-filter detrending, described and validated in detail previously (Mönke et al., 2020), as specified in Supplementary Note 1: Materials and methods > H. Data analysis > Monitoring period-locking and phase-locking: In this workflow, timeseries was first detrended using a sinc filter and then subjected to continuous wavelet transform. We thank the reviewer for pointing out that this detail is lacking in the figure captions, and we have modified the captions accordingly.

      __# 21 __Figure 2D merged images are difficult to read/interpret (see major comments)

      We thank the reviewer for this comment, and we have modified the figure accordingly (please see response to related point #15).

      __# 22 __Kuramoto order parameter is used to quantify the level of synchrony across the different samples

      however it is not defined in the text. Is it also possible to assess variability in each sample? For example how quickly does entrained occur in each sample? How faithfully the peaks of expression beyond 80min (to exclude initial unsynchronised state) match with zeitgeber time? This would help make the point that weak strength leads to a more variable response which is an interesting finding.

      We have now added a mathematical definition of the Kuramoto parameter in Supplementary Note 1.

      A high order parameter corresponds to coherence between samples, as also elaborated in respective figure captions (e.g. in the caption for polar plots in Figure 4D).

      In terms of variability in response to entrainment, we thank the reviewer for the comments, which has prompted us to perform an additional analysis, now included as Figure S13 in the Supplement.

      Briefly, we represent below figures showing how different samples get synchronized with the zeitgeber. To do this, we first represent the zeitgeber signal as a continuous uniformly increasing phase (“zeitgeber time”) with period : . The initial condition for is chosen so that the zeitgeber phase at the moment of last pulse is matching the experimental entrainment phase for each . We plot for each sample (dotted lines) and the zeitgeber phase (magenta line). To quantify how well each sample is following the zeitgeber time, we compute the Kuramoto parameter: . By the end of experiment most samples reach , indicating entrainment. Most samples need zeitgeber cycles to become entrained. For min the entrainment takes much longer (edge of the Arnold tongue). For min there is much variability, which can be explained by the horizontal region in the PRC around the entrainment phase. As suggested by the referee, synchronization is faster for higher DAPT concentration. So those dynamics are indeed consistent with the expectation from classical PRC theory.

      # 23 Do samples change period to Tzeit in similar ways - i.e. patterns over time. It looks like the

      kuramoto order parameter and period drop initially - why?

      We do not have a direct answer as to why the Kuramoto first order parameter and the period drop for the condition the reviewer specified. It has to be noted though that because of how wavelet analysis is done (cross-correlation of the timeseries with wavelets), the period and phase determination at the boundaries of the time series are less reliable (edge effects, see Mönke et al., 2020). Because of this, we should take caution when considering data to and from the first and last pulses, respectively. This was explicitly stated in the generation of stroboscopic maps: “As wavelets only partially overlap the signal at the edges of the timeseries, resulting in deviations from true phase values (Mönke et al., 2020), the first and last pulse pairs were not considered in the generation of stroboscopic maps.

      # 24 In Figure 4C why is the Kuramoto order parameter already higher in the 2uM DAPT conditions at

      the start of the experiment?

      Samples can, by chance, start synchronously and this results in a high Kuramoto first order parameter. Because of this likelihood, it is thus important to interpret the entrainment behaviour of multiple samples using various readouts, in addition to a high Kuramoto first order parameter. We investigated entrainment of the samples based on several measures: multiple samples remaining (or becoming more) synchronous (because each sample actively synchronizes with the zeitgeber), period-locking (where the pace of the samples match the pace of the zeitgeber, which can be distinct from natural pace), and phase-locking (where there is an establishment of a stable phase relationship between the samples and the zeitgeber).

      # 25 Figure 3C and Figure S2 require statistical testing between CTRL and DAPT in each condition

      p-values were calculated for the specified conditions and were added in the caption of the figures. These values are enumerated here:

      • Figure 3C
      • 170-min 2uM DAPT (vs DMSO control): p
      • Figure S2
      • 120-min 2uM DAPT (vs DMSO control): p = 0.064
      • 130-min 2uM DAPT (vs DMSO control): p = 0.003
      • 140-min 2uM DAPT (vs DMSO control): p = 0.272
      • 150-min 2uM DAPT (vs DMSO control): p = 0.001
      • 160-min 2uM DAPT (vs DMSO control): p To calculate p-values, two-tailed test for absolute difference between medians was done via a randomization method (Goedhart, 2019). This confirms that the period of samples subjected to pulses of DAPT is not equal to the controls, except for the 140-min condition (where the zeitgeber period is equal to the natural period, i.e. 140 mins).

      # 26 Figure 3A gray shaded area not clearly visible on the graph

      We have decided to remove the interquartile range (IQR) in the specified figure as it does not serve a crucial purpose in this case. By removing it in Figure 3A, the timeseries of individual samples are now clearer.

      # 27 Figure 6C colour maping of time progression is not clearly visible on the graph; the interpretation

      of this observation is unclear in the text and the figure

      We agree that the low quality of the image is unfortunate, and it seems that our file was greatly compressed upon submission. We have checked the proper quality of figures in the resubmitted version of the manuscript.

      Regarding the interpretation of Figure 6C, we conclude that in our experiments the entrainment phase is an attractor or stable fixed point, in line with theory (Granada and Herzel, 2009; Granada et al., 2009),. We had elaborated this in the text (lines 248-252 of the submitted version of the manuscript): at the same zeitgeber strength and zeitgeber period, faster (or slower) convergence towards this fixed point (i.e. entrainment) was achieved when the initial phase of the endogenous oscillation (φinit) was closer or farther to φent.

      # 28 Figure 7A circular spread not clearly visible on the graph

      Similar to point #27, we have provided a high resolution graph for the re-submission and hopefully resolved this issue.

      # 29 Figure S7A difficult to see the difference between colours

      See point #28.

      # 30 Is it possible to compare the PRC and the plots of period over time during entrainment? The PRC

      is mainly negative (Fig 8A1,A2), in my understanding this means a delay, however the periods seem to decrease over time before entraining to the Tzeit (Fig 3B). Is this reflective of a decrease in Kuramoto parameter and potential de-synchronisation of single cells before re-synchronisation at Tzeit?

      To address this question, we now plot the Phase response with colors indicating pulse number in new Supplementary Figure S13. While capturing the entire PRC as a function of time would require many more experiments (in particular to sample the phases far from entrainment phase), we still clearly see that the PRCs appear to translate vertically as the oscillator is being entrained, i.e. the latter time points are shifted up (down) for T_zeit = 120 (170) min, respectively.

      # 31 Fig 8A What is the importance/meaning of the PRC being similar shape between different

      entrainment periods? Does this reflect that the underlying gene network is the same?

      If one single gene network is responsible for oscillations, we expect from dynamical systems theory that the PRC are not only of similar shape but actually the same, independent of the entrainment period. What is surprising is that the PRC for different entrainment periods do not overlap, and the simplest explanation for this is that the intrinsic period changes with entrainment, all things being kept equal (including the underlying gene networks). This relates to the previous point since we indeed observe that the PRC “translates” vertically with the pulse number for longer periods. The change of period might be due to a long-term regulation as detailed in the discussion.

      # 32 The spatial period gradient and wave propagation under DAPT (Figure S8) should be included in

      the results and not just the discussion.

      We fully agree with the reviewer that both the establishment and the maintenance of a spatial phase gradient is of great interest. However, many more experiments would be required to fully quantify and understand the processes at play here, which we believe to be out of the scope of the current manuscript. To keep the focus of the paper on the global segmentation clock itself, we prefer to keep this figure in Supplement.

      Reviewer #2 (Significance (Required)):

      We currently do not have a detailed understanding of how biological oscillators integrate local signals from their neighbours as well global external signals to give rise to complex patterning that is important for embryonic development. Main bottlenecks that hinder our understanding are lack of real-time endogenous dynamic response together with known global inputs as well as comprehensive models that can explain emergent behaviour in a variety of tissues.

      This study goes a long way in addressing these bottlenecks in the embryonic tissue responsible for somite formation, a dynamical and oscillatory system also known as the segmentation clock. Firstly, they rely on a state-of-the-art previously developed system to entrain endogenous response in live tissue explants using precise microfluidic control. They test the complete range of exogenous perturbation periods and use an existing live reporter (LuVeLu) to monitor endogenous response. They also identify higher order coupling relationships whereby every other LuVeLu peak is entrained through external stimulation.

      As the stimulation system does not control but rather perturb the endogenous response, the observations from LuVeLu provide a unique opportunity in understanding input-output relationships and thus describing the dynamic response of the segmentation clock. Authors propose to study dynamic behaviour of the clock using coarse-graining and focus on describing the overall response over time while amalgamating spatial information. Appropriate coarse-graining is an important strategy in addressing complex problems and is widely used. They use sophisticated methodology such as phase response curves and Arnold tongue mapping to make several important observations. For example the nonlinear shortening and elongation of the period in response to stimulation is particularly interesting since this may indicates a feedback of the clock onto itself potentially via Wnt. Another key observation is that the spatial periodicity and phase wave activity persists in the perturbed conditions suggesting that individual single cell oscillators can adjust their behaviour to external input while retaining coordination with their neighbours. Finally, the authors go on to construct a general dynamical model of the segmentation clock and use this to conclude that the intrinsic period of the oscillator is altered and that the oscillator can be considered excitable.

      This work sheds light onto mechanisms of coordination of Notch activity in assemblies of cells observed in living tissue, an area of research that is important not only for somitogenesis but also for understanding gene expression patterning in many other tissues where Notch plays a critical role, for example in the development of the neural system and organs. As a study of a real-world nonlinear oscillator this work is directly of interest to theoreticians and synthetic biology experts interested in understanding complex patterning and emergence.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript, authors studied the system-level responses of the somite segmentation clock by the coarse-grained theoretical-experimental approach, applying the theory of entrainment to understanding the phase responses of mouse pre-somitic mesoderm (PSM) tissues in the presence of periodic perturbation of Notch inhibitor DAPT generated by micro-fluidics technique. It was demonstrated that the segmentation clock is responsive to diverse range of the perturbation-periods from 120 to 180 min, can be period- and phase-locked, and the efficiency is dependent of the DAPT concentration (input-strength). The authors also observed two cycles of the segmentation-clock ticking in single cycles of 300 or 350 min period-perturbation, suggesting that higher order (2:1 mode) entrainment. They also applied stroboscopic maps to analysis and found that entrainment-phases are dependent of period of DAPT pulses, which is recapitulating theoretical predictions. The estimation of the phase response curve (PRC) of the segmentation clock revealed that the inferred PRC is an asymmetrical and mainly negative function, which represents characteristic features in oscillators that emerge after saddle-node on invariant cycle (SNIC) bifurcation. These results also indicated that the the segmentation clock changed the intrinsic period during entrainment.

      Major comments:

      # 33 I have major concerns about the relevance of the global time-series analysis proposed in Fig.2

      and conclusion about the changes of the intrinsic period during entrainment. The validity of the global time-series analysis should be carefully analyzed, because it could bring artifacts in estimated values of the intrinsic period. The authors concluded (page 3, line 172) that the period calculated by the global analysis represents similar values with the rate of segment formation, but there is no data about the quantification of the periods of segmentation, such as the frequency of Mesp2 reporter expression.

      We thank the reviewer for this feedback. We have now added the quantification of the period of segment formation (new Figure 2E) and show its strong correspondence to the dynamics of reporters used (Lfng, Axin2, and Mesp2). Please see also our response to point #15 with additional comments regarding the validation of the global time-series analysis.

      # 34 Another related issue is the presence of spatial period gradient as mentioned (page 13, line 524).

      One possible approach to circumvent this issue would be "local" time-series analysis; for instance, just focusing on the "putative posterior" regions that are close to source-positions of waves. Authors can re-compute and estimate PRCs by using such a method.

      We thank the reviewer for this suggestion and have accordingly now included the analysis of a localized ROI at the center (center ROI) of the 2D-assays (new Figures S5-S6). We also computed the PRC from center ROIs as shown below. We note strong correspondence between the global ROI and the center ROI.

      # 35 I have another major concern about the evidence of higher order entrainment shown in Fig.5. If

      the 1:2 entrainment is successful, we can expect that the values of observed period is close to the half of the period of pulses; However, the period shown in Fig.5B looks like 185 min longer than the half of 350 min. Is this gap due to the temporal accuracy of time-lapse movies?

      We do not think the discrepancy comes from a problem of temporal accuracy as the temporal accuracy is the same for all movies and there is no reason why there would be a specific issue for this set of experiments. In addition, we have re-analyzed the data to calculate the period from the stroboscopic maps. Mathematically speaking, we take the stroboscopic map as (see PDF) and use this to estimate the period of oscillation in entrained samples , in particular inverting the formula for 1:2 entrainment we have : see PDF.

      The advantage of this method is that it gives a more ``instantaneous” estimation of the period.

      The results are as follows:

      350 10uM: 187 +- 8 min (average across entrained samples from the last zeitgeber period)

      350 5uM: 193 +- 13 min (average across entrained samples from the last zeitgeber period)

      300 2uM: 148 +- 8 min (averaged across entrained samples and from two last periods)

      This additional analysis is in agreement with the wavelet analysis.

      The reviewer is right that for 350 minutes, entrained samples show an observed period that is higher than expected, also based on this new additional analysis. The reason for this is not known. One explanation is the relatively short observation time, especially considering for pulses separated by as much as 350-minutes, i.e. only 3 pulses are applied. [We notice that for 300 minutes pulses, the period converges to 150 mins between the 3rd and the 4th pulse]. We have adjusted the text in the results section to reflect that for 350min entrained samples, the observed period ‘approaches’ the predicted value, while for 300min entrained samples, the observed period is very close to it, i.e. 147mins In addition, we comment that the phase distribution narrows with time, another indication supporting higher order entrainment.

      # 36 Also, authors showed the period evolution towards 1:2 locking with just one condition (350 min).

      Authors can show the data for multiple conditions as in Fig. 3D, at least for 300 min and 325 min pulses and add the data about final entrained period with statistic analysis that supports the difference between the entrained period and the natural period (140 min).

      We thank the reviewer for this feedback and have modified the figure accordingly. In particular, in Figure 5A, we have added the period evolution plot for samples subjected to 300-min periodic pulses of 2uM DAPT (or DMSO for control). Additionally, we have added Figure 5D, which plots the average period in the 300-min and 350-min conditions. We summarize the median average period here with computed p-values:

      • 300-min pulses of 2uM DAPT (or DMSO for control): p-value = 0.191
      • CTRL: 130.39 mins
      • DAPT: 146.45 mins

      • 350-min pulses of 5uM DAPT (or DMSO for control): p-value = 0.049

      • CTRL: 127 mins
      • DAPT: 174.86 mins

      • 350-min pulses of 10uM DAPT (or DMSO for control): p-value = 0.016

      • CTRL: 142.82 mins
      • DAPT: 185.12 mins

      Minor comments:

      # 37 The authors can draw vertical lines indicating the T_zeit in Fig.3B, Fig.4B and Fig.5B in order to

      help comparisons between T_zeit and patterns of period (solid lines).

      We thank the reviewer for this comment. We have accordingly added a horizontal line indicating Tzeit in Figures 3B, 4B, S4A, and S5A (figure panel numbers based on the submitted version of the manuscript). We similarly added a horizontal line indicating 0.5Tzeit in the period evolution plots of 300-min and 350-min conditions in Figures 5A and 5B, respectively.

      # 38 In Fig.5A, the authors can show period evolution in the case of 300 min DAPT-pulses as shown

      in Fig.5B.

      We thank the reviewer for this feedback (related to point #36), and we have modified the figure accordingly.

      # 39 In Fig.6B DAPT panel, the authors can draw the points of phi_ent as shown in Fig.7A.

      We thank the reviewer for this comment, and we have modified the figure accordingly.

      # 40 In Fig. 8F, authors can put the information about DAPT concentration at the right y-axis.

      This is a similar comment as point #17, see above. In brief, we do not know the precise relation between the strength of the perturbation in our model and DAPT concentration, zeitgeber strength was inferred from the model by matching the experimental entrainment phase with that obtained from the model isophases.

      # 41 In Fig. 8G, the PRC in the panel "170 mins" does not have any fixed point (cross sections with

      horizontal lines of "0" phase response). If entrainment is successful, there should be stable and unstable fixed points, but those are absent, although 170 min pulses succeeded in the entrainment as shown in Fig.3D. Authors can explain where the fixed points are.

      The fixed points are indeed defined by the intersection with a horizontal line, but not with the ‘0’ line. They are found where the phase response compensates for the detuning/period mismatch, not at ‘0’ phase response. (See PDF for more details).

      Note however on Fig 8G that we further observe a vertical shift of the PRC, which prompted us to propose a change of the intrinsic period with (as explained in the text when we introduce Figs 8A1-2).

      Another way to visualize fixed points is offered in Fig 16 D-E, where we plot the inferred corrected PTC and the stroboscopic maps: there, fixed points correspond to intersections with the diagonal.

      Reviewer #3 (Significance (Required)):

      Although the phase-analysis has been widely applied to various biological systems, such as circadian clocks, cardiac tissues and neurons, this paper represents the first detailed experimental analysis of the segmentation clock based on the theory of phase dynamics. The major results are inline with theoretical predictions, whereas the suggestion about the SNIC bifurcation is attractive not only to the theoretical researchers but also to the experimental biologists; it has been believed that the segmentation clock consists of negative-feedback oscillator that emerge by Hopf bifurcation, whereas this paper proposes another possibility of the molecular network structure for the clockwork. This issue is related to recently proposed hypothesis about the excitable system in the segmentation clock based on the Yap signaling (Hubaud et al. Cell 171, 668 (2017)). However, unfortunately, discussion about detailed molecular networks are not abundant.

      # 42 Thus, maybe the main readers are computational biologists and systems biologists.

      We thank the reviewer for his/her significance comment. We have added comments on the bifurcation structure of the segmentation clock and on excitable systems in the discussion. While our focus is on coarse-graining so that we do not and cannot infer precise molecular details, we can still infer some properties of the underlying networks. In particular we now cite several papers explaining how systems with tunable periods/excitable are indicative of the interplay between positive and negative feedbacks. We think those considerations are of interest to a broad range of biologists interested in connecting experiments to theory.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, authors studied the system-level responses of the somite segmentation clock by the coarse-grained theoretical-experimental approach, applying the theory of entrainment to understanding the phase responses of mouse pre-somitic mesoderm (PSM) tissues in the presence of periodic perturbation of Notch inhibitor DAPT generated by micro-fluidics technique. It was demonstrated that the segmentation clock is responsive to diverse range of the perturbation-periods from 120 to 180 min, can be period- and phase-locked, and the efficiency is dependent of the DAPT concentration (input-strength). The authors also observed two cycles of the segmentation-clock ticking in single cycles of 300 or 350 min period-perturbation, suggesting that higher order (2:1 mode) entrainment. They also applied stroboscopic maps to analysis and found that entrainment-phases are dependent of period of DAPT pulses, which is recapitulating theoretical predictions. The estimation of the phase response curve (PRC) of the segmentation clock revealed that the inferred PRC is an asymmetrical and mainly negative function, which represents characteristic features in oscillators that emerge after saddle-node on invariant cycle (SNIC) bifurcation. These results also indicated that the the segmentation clock changed the intrinsic period during entrainment.

      Major comments:

      1. I have major concerns about the relevance of the global time-series analysis proposed in Fig.2 and conclusion about the changes of the intrinsic period during entrainment. The validity of the global time-series analysis should be carefully analyzed, because it could bring artifacts in estimated values of the intrinsic period. The authors concluded (page 3, line 172) that the period calculated by the global analysis represents similar values with the rate of segment formation, but there is no data about the quantification of the periods of segmentation, such as the frequency of Mesp2 reporter expression. Another related issue is the presence of spatial period gradient as mentioned (page 13, line 524). One possible approach to circumvent this issue would be "local" time-series analysis; for instance, just focusing on the "putative posterior" regions that are close to source-positions of waves. Authors can re-compute and estimate PRCs by using such a method.
      2. I have another major concern about the evidence of higher order entrainment shown in Fig.5. If the 1:2 entrainment is successful, we can expect that the values of observed period is close to the half of the period of pulses; However, the period shown in Fig.5B looks like 185 min longer than the half of 350 min. Is this gap due to the temporal accuracy of time-lapse movies? Also, authors showed the period evolution towards 1:2 locking with just one condition (350 min). Authors can show the data for multiple conditions as in Fig. 3D, at least for 300 min and 325 min pulses and add the data about final entrained period with statistic analysis that supports the difference between the entrained period and the natural period (140 min).

      Minor comments:

      1. The authors can draw vertical lines indicating the T_zeit in Fig.3B, Fig.4B and Fig.5B in order to help comparisons between T_zeit and patterns of period (solid lines).
      2. In Fig.5A, the authors can show period evolution in the case of 300 min DAPT-pulses as shown in Fig.5B.
      3. In Fig.6B DAPT panel, the authors can draw the points of phi_ent as shown in Fig.7A.
      4. In Fig. 8F, authors can put the information about DAPT concentration at the right y-axis.
      5. In Fig. 8G, the PRC in the panel "170 mins" does not have any fixed point (cross sections with horizontal lines of "0" phase response). If entrainment is successful, there should be stable and unstable fixed points, but those are absent, although 170 min pulses succeeded in the entrainment as shown in Fig.3D. Authors can explain where the fixed points are.

      Significance

      Although the phase-analysis has been widely applied to various biological systems, such as circadian clocks, cardiac tissues and neurons, this paper represents the first detailed experimental analysis of the segmentation clock based on the theory of phase dynamics. The major results are inline with theoretical predictions, whereas the suggestion about the SNIC bifurcation is attractive not only to the theoretical researchers but also to the experimental biologists; it has been believed that the segmentation clock consists of negative-feedback oscillator that emerge by Hopf bifurcation, whereas this paper proposes another possibility of the molecular network structure for the clockwork. This issue is related to recently proposed hypothesis about the excitable system in the segmentation clock based on the Yap signaling (Hubaud et al. Cell 171, 668 (2017)). However, unfortunately, discussion about detailed molecular networks are not abundant. Thus, maybe the main readers are computational biologists and systems biologists.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This is a beautifully elegant study that tests how previously published theoretical predictions about entraining nonlinear oscillators applies to a biological oscillator, the segmentation clock. The authors use a combination of state of the art experimental techniques, signal processing and analytical theory to reach a series of interesting and novel conclusions.

      They show that the segmentation clock period can be entrained through Notch inhibitor (DAPT) pulses acting as an external clock (referred to as zeitgeber) using a previously developed and sophisticated microfluidic perfusion system. Pulsing DAPT every 120 to 180min can change the internal clock period while entrainment beyond this range leads to higher order coupling to the zeitgeber period, i.e. entrainment of every other pulse. They then perform entrainment experiments where the concentration of DAPT is changed to elicit a change in the strength of interaction between the internal clock and the external stimulus (referred to as zeitgeber strength); interestingly at low strength response to entrainment is more variable leading to entrainment occurring in some samples while others remain unaffected (Figure 4A); overall, higher concentration leads to faster entrainment (Figure 4C). The experimental data is then analysed using stroboscopic maps to reveal that a stable entrainment phase shift is achieved between the internal clock and the external zeitgeber. Phase response curve (PRC) analysis indicates that the system response is not sinusoidal but predominantly characterised by negative PRC, a behaviour consistent with saddle-node on invariant cycle (SNIC); it also reveals that the intrinsic period changes in a non-linear way and that this effect is reversible when external stimulation stops. Finally, a theoretical model is proposed to represent the segmentation clock as a dynamical system; this is based upon Radial Isochron Cycle with Acceleration (ERICA), an extension motivated by the PRC analysis results which are incompatible with a Radial Isochron Cycle (RIC); this model has predictive capability and could be used to design new control strategies for entrainment of the segmentation clock.

      This study makes a series of key conclusions which are of particular importance in understanding the dynamic response of a biological oscillators. Firstly, given it's the characteristics of the dynamic response to entrainment, the segmentation clock is likely close to a SNIC bifurcation and this can explain the tendency for relaxation of the period over time. Secondly, the clock period was changed in a non-linear way in the direction of the zeitgeber period, a finding which is interpreted to indicate the presence of feedback of the segmentation clock onto itself, potentially via Wnt. This makes an excellent prediction that if tested experimentally would greatly improve the impact of the study. It is also noted that the entrainment of the segmentation clock does not abolish spatial periodicity and phase wave emergence suggesting that single cell oscillators can adjust to periodic perturbation while maintaining emergent properties. This is also a significant result that would need to be followed up with experiments and computation however would be best suited to a separate study.

      Major comments:

      The coarse graining is a major point that would need to be clarified since the rest of the analysis and theoretical modelling in the paper flow from this. Firstly, the interpretation of the schematic in Figure 1A on experimental data collection is not immediately obvious to the reader, lacks a clear flow between the different panels or steps (which could be numbered for example) and does not have a legend to indicate the different colour mapping. Secondly, Figure 2A which explicitly addresses coarse graining is not clear enough. Is the message here that by excluding the inner parts of the sample with a radial ROI, a similar dynamic response is observed over time? Since the inner part of the sample corresponds to the posterior side how do we interpret similarities and differences between signals with different ROIs? A quantitative analysis of essential coarse-grained properties such as period and amplitude should be performed for different ROIs and across multiple samples. As this effectively masks any spatial differences, limitations of this approach should be clearly stated in the Discussion. For example in lines 466-470 where it is difficult to interpret the slowing down tendency and relate back to single cell level.

      The functional characterisation of the sample using LFNG, AXIN2 and MESP2 is unclear. The images included in Figure 2D representing expression observed when tissue explants are grown within the microfluidic chip are difficult to interpret and would require a more detailed description of anterior-posterior, pillars etc; it is also difficult to view the bright-field since it is presented as a merged image. It is particularly difficult to see the somite boundaries for the same reason. In lines 113-117 the authors state that the global oscillation period matches the periodic boundary formation. How do we reach this conclusion from these images? What is the variability between samples?

      If these two issues would be addressed it would increase confidence in the coarse graining argument and thus would strengthen the importance of the findings in the study.

      Several minor points could be addressed to improve the manuscript and are listed below: -Figure 1 A the colormap and axes for the oscillatory traces should be defined -strength of zeitgeber is not defined and there is no analytical expression provided; how does it relate to DAPT concentration? Is the fact that low DAPT concentration corresponds to weak strength expected or is it a result? - In some figures it looks like the amplitude of oscillations may change with DAPT concentration and hence zeitgeber strength? Is this expected? -Figure 2A including the black area creates confusion and it is unclear which ROI is used in the rest of the study; consider moving this to a supplementary figure perhaps -what type of detrending is used in Figure 2 and throughout (include info in the figure legend) -Figure 2D merged images are difficult to read/interpret (see major comments) -Kuramoto order parameter is used to quantify the level of synchrony across the different samples however it is not defined in the text. Is it also possible to assess variability in each sample? For example how quickly does entrained occur in each sample? How faithfully the peaks of expression beyond 80min (to exclude initial unsynchronised state) match with zeitgeber time? This would help make the point that weak strength leads to a more variable response which is an interesting finding. - Do samples change period to Tzeit in similar ways - i.e. patterns over time. It looks like the kuramoto order parameter and period drop initially - why? -In Figure 4C why is the Kuramoto order parameter already higher in the 2uM DAPT conditions at the start of the experiment? -Figure 3C and Figure S2 require statistical testing between CTRL and DAPT in each condition -Figure 3A gray shaded area not clearly visible on the graph -Figure 6C colour maping of time progression is not clearly visible on the graph; the interpretation of this observation is unclear in the text and the figure -Figure 7A circular spread not clearly visible on the graph -Figure S7A difficult to see the difference between colours -Is it possible to compare the PRC and the plots of period over time during entrainment? The PRC is mainly negative (Fig 8A1,A2), in my understanding this means a delay, however the periods seem to decrease over time before entraining to the Tzeit (Fig 3B). Is this reflective of a decrease in Kuramoto parameter and potential de-synchronisation of single cells before re-synchronisation at Tzeit? -Fig 8A What is the importance/meaning of the PRC being similar shape between different entrainment periods? Does this reflect that the underlying gene network is the same? -The spatial period gradient and wave propagation under DAPT (Figure S8) should be included in the results and not just the discussion.

      Significance

      We currently do not have a detailed understanding of how biological oscillators integrate local signals from their neighbours as well global external signals to give rise to complex patterning that is important for embryonic development. Main bottlenecks that hinder our understanding are lack of real-time endogenous dynamic response together with known global inputs as well as comprehensive models that can explain emergent behaviour in a variety of tissues.

      This study goes a long way in addressing these bottlenecks in the embryonic tissue responsible for somite formation, a dynamical and oscillatory system also known as the segmentation clock. Firstly, they rely on a state-of-the-art previously developed system to entrain endogenous response in live tissue explants using precise microfluidic control. They test the complete range of exogenous perturbation periods and use an existing live reporter (LuVeLu) to monitor endogenous response. They also identify higher order coupling relationships whereby every other LuVeLu peak is entrained through external stimulation.

      As the stimulation system does not control but rather perturb the endogenous response, the observations from LuVeLu provide a unique opportunity in understanding input-output relationships and thus describing the dynamic response of the segmentation clock. Authors propose to study dynamic behaviour of the clock using coarse-graining and focus on describing the overall response over time while amalgamating spatial information. Appropriate coarse-graining is an important strategy in addressing complex problems and is widely used. They use sophisticated methodology such as phase response curves and Arnold tongue mapping to make several important observations. For example the nonlinear shortening and elongation of the period in response to stimulation is particularly interesting since this may indicates a feedback of the clock onto itself potentially via Wnt. Another key observation is that the spatial periodicity and phase wave activity persists in the perturbed conditions suggesting that individual single cell oscillators can adjust their behaviour to external input while retaining coordination with their neighbours. Finally, the authors go on to construct a general dynamical model of the segmentation clock and use this to conclude that the intrinsic period of the oscillator is altered and that the oscillator can be considered excitable. This work sheds light onto mechanisms of coordination of Notch activity in assemblies of cells observed in living tissue, an area of research that is important not only for somitogenesis but also for understanding gene expression patterning in many other tissues where Notch plays a critical role, for example in the development of the neural system and organs. As a study of a real-world nonlinear oscillator this work is directly of interest to theoreticians and synthetic biology experts interested in understanding complex patterning and emergence.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Major comments:

      • Are the key conclusions convincing?

      We discuss 4 key conclusions.

      A PRC of the segmentation clock was constructed. Although the authors have produced an interesting phase map, the regulation function F(\phi) of the circle map does not give the phase response curve (PRC) (Hoppensteadt & Keener 1982, Guevara & Glass 1982). This holds only when the system is stimulated with very short pulses (ideally Dirac delta), but the experimental pulses here are a quarter of the intrinsic period. Furthermore, in eq. 1 T_ext must be the winding number, and the modulus must be in units of phase, either one or two pi, for the circle map to be correct. Thus, calling the measured response of the system a PRC is not convincing.

      The system is being entrained.

      Technically, entrainment requires the existence of quasi-periodic response (outside Arnold tongues). It would also be easier to get the stroboscopic maps in the quasi-periodic regime since all the points in the circle will be sampled. Since no quasi-periodic response was demonstrated, the claim of entrainment is not convincing.

      The response of the system to external pulses is compatible with a SNIC.

      This is compatible, but it is equally compatible with other explanations. Assuming that the PRC is the same as the regulation function F(\phi), the PRC in Kotani 2012 (PRL 2012 fig. 3C) would be a similar shape as that shown by the authors. Similar models to that in Kotani et al., have been studied, but a SNIC has not been found (an der Heiden & Mackey 1982). It is relatively straightforward to construct a phenomenological model with a SNIC, but having underlying biological insight is not guaranteed. No argument for choosing a SNIC is given, so this emphasis of the paper is not convincing.

      The work demonstrates coarse graining of complex systems.

      This conclusion is correct, but coarse graining theory-driven analysis and control of dynamical systems has been established for many years. What is new here is that it is applied specifically to the in vitro culture system of the mouse segmentation clock.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      Following on the points above, each of these needs to be corrected or re-done, and/or the conclusions need to be modified accordingly.

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      If the authors wish to make the strong claim of determining a true PRC, Dirac delta-like perturbation needs to be applied, or approximated by short time duration pulses compared to the intrinsic period.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      It's not clear to this reviewer if it is feasible to deliver a very short pulse and record a response. But this may not be relevant, see above.

      • Are the data and the methods presented in such a way that they can be reproduced?

      Yes.

      • Are the experiments adequately replicated and statistical analysis adequate?

      Yes.

      Minor comments:

      • Specific experimental issues that are easily addressable.

      No issues.

      • Are prior studies referenced appropriately?

      Yes.

      • Are the text and figures clear and accurate?

      Figure 1D illustrates how a PRC should be obtained, but doesn't show the experimental protocol applied in the paper.

      In Figure 5B, 10 uM DAPT, the traces are already synchronized before the pulse train starts, which makes the subsequent behavior difficult to interpret.

      • Do you have suggestions that would help the authors improve the presentation of their data and conclusions?

      The text includes several paragraphs reviewing broad principles of coarse graining and making general conclusions. This is confusing, because, as mentioned above, there is no new general advance in this paper. The interesting contributions here are specific to the applications to the segmentation clock, and the text should be focused on this aspect.

      Significance

      Describe the nature and significance of the advance (e.g. conceptual, technical, clinical) for the field.

      Description of the complex mouse segmentation clock in terms of a simple model and its PRC is an interesting, original and non-trivial result. The proposal that the segmentation clock is close to a SNIC bifurcation provides a consistent dynamical explanation of slowing behavior that has been recognized for some time, but not fully understood. This proposal also raises a hypothesis about the behavior of the underlying molecular regulatory networks, which may be tested in the future. The increase or decrease of the intrinsic period due to the zeitgeber period is not expected from theory, pointing to structures in internal biochemical feedback loops, an idea which again may be tested in the future. Also surprising from a theoretical perspective, the spatial gradient of period in the system persisted after entrainment. Although the categorization of the generic behavior is interesting, by its nature there is little from this that might give a typical developmental biologist any conclusions about pathways or molecules. The successes and limits of the theoretical description do nevertheless focus future attention on interesting behaviors.

      Place the work in the context of the existing literature (provide references, where appropriate).

      Such an analysis of the segmentation clock is based strongly on the experimental system and results in Sonnen et al., 2018, and goes well beyond it in terms of the dynamical analysis. It provisionally categorizes the mouse segmentation clock as a Class I excitable system, allowing its dynamics at a coarse grained level to be compared to other oscillatory systems. In this aspect of simplification, it is similar to approach of Riedel-Kruse et al., 2007 who used a mean-field model of oscillator coupling to explain the synchrony dynamics observed in the zebrafish segmentation clock in response to blockade of coupling pathways, thereby allowing a high-level comparison to other synchronizing systems.

      State what audience might be interested in and influenced by the reported findings

      Developmental biologists, biophysicists,

      Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      Developmental biology, somitogenesis, dynamical systems theory, biophysics, cell signaling

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their excellent comments. The comments raised by the reviewers have tremendously improved our manuscript and allowed us to provide more clarity of our findings. Please find below the point-by-point answers to the reviewer's comments.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The manuscript from Savva et al. focuses on a long-standing and unresolved challenge of metabolic (and not only) health in mammals: the sexual dimorphism. Authors couple transcriptomics and metabolomics to in-depth molecular phenotyping in offspring of dams fed HFD before conception and throughout pregnancy and lactation to isolate molecular determinants of sexually dimorphic response to maternal obesity.

      **Major comments:**

      1. While the manuscript present a compelling and exhaustive amount of data, which accurately describes the sexually dimorphic responses to maternal obesity, it lacks mechanistic insights and I personally think this to mainly be a timing issue. For example, I tried hard to find experimental details on the RNA sequencing and could not find much: when is the RNA sequencing performed? If, as I suspect, the sequencing experiments match the metabolomics experiments, I don't think they add much mechanistic insights onto the observed phenomena. They rather contribute to better describe them. Indeed, both metabolomic and transcriptomic profiles might be consequence of the observed phenotypes, rather than be causative (as the authors try to argue). Are these differences already present at birth? what happens to placenta and fetal tissues?

      ANSWER: We have clarified the experimental setting and added information about the maternal status. We did not collect the placenta and fetal tissues as the main goal of the study was to investigate the offspring metabolism in a longitudinal study, using the animal as its own control. Therefore, we looked at glucose tolerance, insulin sensitivity and lipid profile in the liver at two different timepoints on the same mouse. At end we sacrificed the mouse and collected tissues for further analysis (lipidomic, RNA sequencing and histology).

      Some adult phenotypes - especially metabolic and neurological phenotypes - might also be influenced by different maternal care early postnatal. Are the litters balanced by number and sex ratio? Would cross-fostering maintain the phenotypes?

      ANSWER: We agree with the reviewer that phenotype can be influenced by maternal care early postnatal. Each dam delivered litters in different proportions. Dams on CD delivered in average seven to eight littermates (same ratio of female and male offspring) whereas dams on HFD delivered less (five in average). Some of the offspring died at very early age of unknown reasons. The final number of offspring used for this study was 11 females and 12 males born from dams on CD and 11 females and 10 males born from dams on HFD. We did not perform cross-fostering to limit the stress in the animals.

      Of the two points above, I would love to see more details on the RNA sequencing, as well as placental and fetal tissues analysed. It would be also interesting to know about any litter balancing measure or at least have more statistics on the litter size and sex ratios.

      ANSWER: Each individual was followed over the course of the experimentation for body weight and food intake (six months). However, not all animals were used for every in vivo and ex vivo experiment justified under consideration of the 3Rs, sample throughput capacity and financial constraints (magnetic resonance, lipidomic, qPCR, and tolerance tests). However, all experimentation was performed according to a prior power calculation and published reports (PMID: 30808418, 23446231, 31811898, 25694038 and 31820027). Throughout the study, we opted for randomized experimental design (random selection of the animals for in vivo and ex vivo experiments).

      After weaning, up to five littermates were housed per cage. However, some male individuals had to be separated due to aggressive behaviors. When females showed hierarchical behaviors in the cage, we separated them to be sure that each individual had full access to food. Since mice are social animals, no individual was housed alone.

      Reviewer #2 (Significance (Required)):

      The manuscript from Savva et al. revolves around the unresolved challenge of how sexually dimorphic phenotypes are established. The topic is actual - although already a lot has been published, as acknowledged by the authors as well - and of broad interest to the community of mouse geneticists and physiologists. To understand the molecular underpinnings of sexually dimorphic phenotypes, the authors use in-depth molecular phenotyping in the mouse coupled to metabolomics and transcriptomics. While extremely informative and exhaustive, the actual dataset is - at least for me - purely descriptive, which might reduce its overall impact. I'm a mouse geneticist and a metabolic physiologist and I find the topic of sexual dimorphism extremely interesting.

      **Referee Cross-commenting**

      I generally agree with the other reviewers' comments. I think the ms is interesting and the dataset compelling although to a certain extent overlapping with previously published studies. There is general agreement on the lack of mechanistic data and the authors should definitely address this point.

      ANSWER: In mammals, including humans, biological sex is determined by a pair of sex chromosomes. Genes on the X chromosome (X-linked genes) have distinctive inheritance patterns because they are present in different number between females (XX) and males (XY). Moreover, a plethora of attributes can be determined by sex chromosomes and more importantly by X-linked chromosomes. Therefore, we extracted the sex-linked chromosomes that are affected by the maternal diet or by sex and presented them in figure 5 to give more insight into the molecular underpinning of the sexual dimorphism in metabolism homeostasis.

      We have presented the data in figures 4e-4g and commented on page 12 L300-L315 in the result section and in the last paragraph of the discussion section (page 18).

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this paper Savva et al. explore how maternal obesity influence hepatic metabolism in a sex-specific fashion. They first assess the contribution of the adipose tissue to the development of insulin resistance and glucose intolerance focusing on inflammation and oxidative phosphorylation pathways. Then they proceed to asses if maternal obesity could remodel the hepatic triglyceride levels and phospholipids using proton magnetic resonance spectroscopy and LC-MS lipidomic, respectively. Finally, they explore hepatic lipid metabolism and genes promoting cancer development.

      Despite the methodology is correct and elegant, the study does not explore a possible mechanism of action and some results are contradictory. Indeed, some of the results seems to be driven by the sex of the offspring independently of the maternal feeding.

      There are indeed, some limitations for the authors and editors to consider. To this reviewer the manuscript is difficult to read, particularly the results section in which the data are listed without discussing their relevance or their connection to previous research from other groups. Moreover, the discussion could benefit from an extensive rewrite. Indeed, this section lacks of clarity and references that could help elucidate the novel finding of the authors.

      ANSWER: We have rewritten the manuscript and we believe that it has been extensively improved in clarity. We have added references and commented on the differences observed, if any. We also have added one complete paragraph on the potential role of sex-biased modulators, especially the sex chromosomes XX or XY. These data are presented in figures 4e-4g and commented on page 12 L300-L315 in the result section and in the last paragraph of the discussion section (page 18).

      **Major comments:**

      Line 97: How the authors explain similar weight gain in the F-C/HF vs. F-HF/HF? A large body of literature reports that maternal high fat diet influences offspring weight gain, independently of sex, when compared to maternal standard diet (PMID: 23973955; 29872021; 31076636; 3036829).

      ANSWER: The literature is still controversial, possibly due to slightly different experimental settings (e.g. the exact composition of the control and high fat diet, exposure time to the different diets). In the current study we used a match control diet of the high fat diet to minimize potential diet-derived signaling molecule effect. We found several publications in line with our findings (PMID: 30405201, 31690792 and 29972240). In addition, in our study, we assessed the changes in body weight over a long time in the offspring, whereas only few studies show detailed measurements over time, which makes it more difficult to compare across studies.

      Line 99: Which is the explanation for the reduced body weight in M-HF/HF from birth until 9 week of age? Can the authors show the timeline for food intake?

      ANSWER: Excess gestational weight gain is associated with health risk for both the infant and the mother. A large number of epidemiological studies have demonstrated a direct relationship between birth weight and BMI attained in later life. Lower birth weight seems to be associated with later risk for central obesity, which also confers increased cardiovascular risk. It has been previously demonstrated that offspring born from obese mother have lower body weight at birth than control diet/lean mother’s littermates (PMID: 15116085 and 24936914). Here, offspring after weaning are all put on the HFD until six months of age (before sacrifice).

      We do not have timeline of food intake but we measured average food intake twice a week during three weeks at about 4-month of age, we presented the data in Fig.S1a and in the result section page 5 (Line 104). Interestingly, food intake tended to be induced and reduced by MO in female and male, respectively.

      Line 101: How the authors explain the increase in final weight of the male compared to the female if no differences in total fat, VAT or SAT was observed between the offspring?

      ANSWER: We have presented the graphs in relative amount body fat (% TF), and TF is corrected for the total body weight. Figs.1d-1f. When fat is reported to the body weight males have lower relative fat content than females. However, males are taller (bigger) and have bigger bone and muscle mass than females. Therefore, males weight more than females despite lower relative amount of fat.

      Line 116: The authors state: "The ratio between the total SAT and the Abd SAT revealed that MO redistributed SAT outside of the abdominal region in females but not in males" but Fig.1g displays no significant differences between F-C/HF and F-HF/HF. Please explain.

      ANSWER: Sex differences are observed (at END, MO tended to increase the ratio in F and decrease in M) to a lower level than F. Lower SAT on VAT ratio in obese males than in females is well recognized, however here we observed that MO tends to redistribute fat differently between sexes, and fat distribution has been strongly correlated to metabolic risks.

      Line 128-130: The authors state: "At MID, glucose tolerance was highly diet- and sex-dependent, and males but not females showed impaired glucose tolerance by MO." However, in Fig.1h no significant differences in glucose peak or glucose AUC were observed between M-C/HF and M-HF/F. Please explain. This is correct, however fasting glucose (T0) was higher and T60 and T120 as well. In addition, Ins levels during the OGTT was increased at T0, T30 and T120.

      ANSWER: We have clarified the sentence and agree with the reviewer that the males are not affected by MO but are already insulin resistant when born from lean mothers. We added the quantitative insulin-sensitivity check index (QUICKI) in Fig.1n and demonstrate that indeed males are less insulin sensitive than females, but MO does not impact the insulin sensitivity in both sexes.

      Line 130: OGTT only provides information on insulin secretion and action but does not directly yield a measure of insulin sensitivity. Please rephrase.

      ANSWER: We have measured the insulin level during the OGTT, this information, combined with the glucose disappearance curve gives important information on the ability of pancreatic beta cell to release insulin in response to a glucose load and on the ability of insulin to store the glucose into the cells i.e. insulin sensitivity.

      We have added the quantitative insulin-sensitivity check index (QUICKI) in Fig.1n and confirmed that males are less insulin sensitive than females with no effect of MO. Line 125

      Authors should rephrase the conclusion of the paragraph since MO does not seems to influence fat distribution or insulin resistance. Looking at figure 1 it seems that the only differences observed are driven by the sex of the offspring independently of maternal feeding.

      ANSWER: We have changed the conclusion and focused on the sex differences.

      Do the dams were insulin resistant? Indeed, hyperinsulinemia and insulin resistance are key programming factor of offspring metabolic syndrome.

      ANSWER: We agree that it would have been good to have the insulin levels of the mothers before mating. Unfortunately, we took only the body weight and the glucose level after 2 h fasting. We saw no differences between CD and HFD fed mothers. We included these results in the Figure 1b. Of note, we have performed long term HFD in young female mice for other purposes and noticed that females are resistant to hyperinsulinemia when fed a HFD in a long term, so we assumed that feeding a HFD for 6 weeks before mating would not affect insulin levels.

      Line 162: "There were no significant differences between the sexes. However, it is interesting to note that several pathways were regulated differently between sexes between the C/HF and HF/HF groups." Can the authors rephrase the concept, it is unclear.

      ANSWER: We have revised the sentences and gave some examples in P6, Lines 132-139.

      Line 170: The authors state: "insulin secretion pathway is reduced in males only". How this results are in line with the data reported in Fig.1i were both M-C/HF and M-HF/HF display increased insulin secretion?

      ANSWER: We agree with the reviewer that this is controversial. However, males from obese mothers showed slightly increased insulin levels during the OGTT and slightly reduced QUICKI as compared to males from lean mothers. Moreover, here we measured the total insulin level and not the C-peptide level that is more representative of the “active insulin”. One can be that males have higher insulin level but lower active insulin.

      Lines 174-175: All the genes reported in Fig. 1o, except for LPIN1, do not seems to be altered by MO. Please rephrase.

      ANSWER: Lpin1 and Pdk1 are reduced by MO in females. We have rephrased P6, Lines 142-147

      Line 184: The authors state that the signaling pathways was assessed both at transcriptional and post-transcriptional levels. Where are depicted the data of the post-transcriptional levels?

      ANSWER: We have corrected this sentence and rephrased it.

      Similarly to glucose metabolism and fat depot results, also in the case of the liver steatosis the increased number of lipid droplets seems to be linked to the sex of the animal rather than the maternal diet. Since the authors also investigated inflammatory pathways it could be of interest to assess CD68 infiltration by immunohistochemistry and Picrosirius red staining for the assessment of fibrosis.

      ANSWER: We agree with the reviewer that immune cells infiltration quantification would have been excellent. However, due to the lock down and the moving of our lab we could not performed the IHC.

      Liver histology in Fig.5a (M-C/HFD) is completely different from the one depicted in Fig.4a in terms of steatosis. Can the authors please explain this difference and report the magnification used in Fig.4a. Please also report the scale in Fig.5a.

      ANSWER: We have corrected this mistake and we apologize for the confusion. The Fig.4a described a M-moHF offspring liver. The pictures have been changed and magnification has been added.

      Changes in placental function are thought to be a key link between the maternal and intrauterine milieu and long-term health of the offspring (PMID:24107818). Alterations in placental function and structure in response to obesity and their underlying molecular mechanisms have been explored both in humans and in animal models (PMID:24484739; 22303323; 28291256). Others have shown that maternal hyperinsulinemia is strongly associated with offspring insulin resistance and excess placental lipid deposition and hypoxia (PMID: 28291256). Excessive lipid deposition leads to a lipotoxic placental environment that is associated with increased markers of inflammation and oxidative stress (PMID: 24333048). Can the authors could provide some data?

      ANSWER: We agree with the reviewer that maternal and intrauterine milieu are crucial to determine long-term health status of the offspring. However, in the current study we wanted to explore the sex differences in offspring fed a HFD and born from either lean or obese mothers in the long term.

      We did not focus the current project on the maternal status, but on the offspring and the sexual dimorphism in metabolic risks later in life.

      **Minor:**

      Fig. 2m Acox1 is not reduced by MO in female.

      ANSWER: We have corrected the sentence.

      Supp. TableS1 do not report Pdk1, Lpin1, Nox4 and Prlr.

      ANSWER: The reason why these genes do not appear in the TableS1 is because the table S1 shows all the significantly regulated genes extracted from the KEGG pathways. The genes mentioned above Pdk1, Lpin1, Nox4 and Prlr do not appear in the KEEG pathway.

      Supp. TableS2 do not report PCSK9 and PNPLA3

      ANSWER: Same as above, the Table S2 is based on KEGG pathway analysis and the genes Pcsk9 and Pnpla3 do not appear in the KEGG pathway.

      Reviewer #3 (Significance (Required)):

      The prevalence of obesity during pregnancy continues to increase at alarming rates. This is concerning as in addition to immediate impacts on maternal wellbeing, obesity during pregnancy has detrimental effects on the long-term health of the offspring. This paper is connected to an extended research field aiming at prevent the detrimental effect of maternal obesity on the offspring.

      An important limitation in the ability to design intervention strategies to prevent the detrimental effects of maternal obesity on offspring health is that it is currently unclear which of the many potential variables associated with obesity is the key programming factor mediating the effects on the offspring.

      **Reviewer field of expertise:**

      Molecular Biology, Type 2 Diabetes, Obesity and NAFLD.

      **Referee Cross-commenting**

      I agree with the other reviewers' comments, particularly on the lack of mechanistic data.

      ANSWER: In mammals, including humans, biological sex is determined by a pair of sex chromosomes. Genes on the X chromosome (X-linked genes) have distinctive inheritance patterns because they are present in different number between females (XX) and males (XY). Moreover, a plethora of attributes can be determined by sex chromosomes and more importantly by X-linked chromosomes. Therefore, we extracted the sex-linked chromosomes that are affected by the maternal diet or by sex and presented them in figure 5 to give more insight into the molecular underpinning of the sexual dimorphism in metabolism homeostasis. We have presented the data page 12 L300-L315in the result section and in the last paragraph of the discussion section (page 18).

      Reviewer #4 (Evidence, reproducibility and clarity (Required)):

      Maternal obesity is a common condition in western society. There is abundant literature showing the deleterious metabolic consequences of MO in the offspring. In this manuscript, Savva et al. characterized the transcriptomic and lipidomic profiles of the liver in male and female progeny of female mice that were fed a high-fat diet during and before pregnancy. After weaning, mice were also fed a high-fat diet. They found that both transcription and lipid composition were different in males and females, and they show that females are protected to metabolic and liver disease, whereas males develop insulin resistance, liver steatosis, and are prone to develop liver cancer. The first part of the study where the authors characterize the metabolism of the progeny, including weight, fat mass in the distinct depots, glucose and insulin tolerance, is not novel. Several publications have previously reported these findings (Programming effects of maternal and gestational obesity on offspring metabolism and metabolic inflammation. Sci Rep. 2019 Nov 5;9(1):16027). It was also previously reported in the cited publication, increased liver weight, steatosis and TG content, similar to the results of the present study. Some novelty of the manuscript is the in-depth analysis of the lipid content of the liver in the models used, as well as the transcriptional profile. Despite the substantial amount of data that the authors generated to prove differences between the male and female offspring, there is not, however, any cross analysis that could link both omics data. Overall, as discussed below the results do not support some conclusions. In particular this reviewer has the following concerns and suggestions.

      1. The metabolic status of the obese mothers has a direct impact on the offspring. It was previously reported that differences in glucose tolerance on the mother has a strong impact on the sex dimorphism in the metabolism of the progeny (please see the review: Sex and gender differences in developmental programming of metabolism. Molecular Metabolism 15 (2018) 8-19. What are the levels of glucose tolerance on the mothers used in the study? The weight of the mothers could also be shown.

      ANSWER: We added the body weight and the Dam glucose level after 2 h fasting. We observed an increased BW in obese mother and no differences in glucose level compared to control diet fed mothers. We included these results in the Figure 1b.

      The fact that mice were fed a high-fat diet after weaning could lead to confounding effects. Indeed, the lack of a group of mice fed a normal diet after weaning makes difficult to establish which is the relative contribution to the phenotype of the diet of the progeny, compared to the diet of the mother.

      ANSWER: We published a paper in 2021 (PMID: 33398027) where we show that offspring born from obese mothers have sex specific hepatic modulation and provide molecular evidence of sex dependent in utero metabolic adaptation in the offspring born from obese mothers. In this study offspring were fed the CD after weaning and we demonstrated some reversal effect of CD feeding in male offspring.

      Changes in food intake could also explain some differences.

      ANSWER: We measured average food intake twice a week during three weeks at about 4-month of age, we presented the data in Fig.S1a and in the result section page 5 (Line 104). Males weighed significantly more than females regardless of the maternal diet (Fig.1c), with higher food intake (Fig.S1a). Interestingly, food intake tended to be induced and reduced by MO in female and male, respectively.

      Lipidomics data show major differences between males e and females. However, the impact of these differences in the distinct metabolic phenotypes is not addressed.

      ANSWER: We have reformulated the major lipidomic data to integrate them into the sex dependent metabolic phenotypes we observed.

      The estrogen family and its two respective receptors, ERα and ERβ, have been widely suggested to be protective against obesity, diabetes, and cardiovascular disease. Does the transcriptional profile consistent with changes in the activity of estrogen receptor signaling?

      ANSWER: We have at the expression level of Era (we did not find Erb) in the liver of offspring and presented the data in Figure S1b. Era was higher expressed in females than males, independently of the maternal diet. MO had no effect on the expression level of Era. Interestingly, androgen receptor (Ar) was higher expressed in the liver of F-moC than M-moC, these differences vanished in moHF-offspring because MO tended to reduce and induce the expression level of Ar in female and male, respectively.

      Epigenetic modifications likely underlie the differences between males and females. Histone modifications or DNA methylation analysis could further improve the study.

      ANSWER: We agree with the reviewer that there is likely a sex dependent epigenetic modification. How maternal in utero environment can differently affect female and male offspring born from the same mother remain to be elucidated.

      In the last figure the authors claim that MO prevents HCC, but no data about HCC is shown, only gene expression analysis.

      ANSWER: We agree with the reviewer that showing HCC markers would have strength the conclusion on the possible role of MO and sex in HCC development. However, we did not have the possibility to run western blot or IHC on our livers. Nevertheless, H&E staining showed bigger cell proliferation spots in females than in males and a reduction of the size of these spots in females born from obese mothers as compared to those born from lean mothers, in line with the pathways analysis and gene expression. Further studies focusing on HCC development in obesity would be needed to unravel the mechanism behind.

      Reviewer #4 (Significance (Required)):

      The first part of the study where the authors characterize the metabolism of the progeny, including weight, fat mass in the distinct depots, glucose and insulin tolerance, is not novel. Several publications have previously reported these findings (Programming effects of maternal and gestational obesity on offspring metabolism and metabolic inflammation. Sci Rep. 2019 Nov 5;9(1):16027). It was also previously reported in the cited publication, increased liver weight, steatosis and TG content, similar to the results of the present study.

      **Referee Cross-commmenting**

      I agree with the comments. I also think that the MS is difficult to read. No conexion between the OMICS data.

      ANSWER: The current version of the manuscript has been extensively revised and we believe that it has improved in clarity and in novelty.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Maternal obesity is a common condition in western society. There is abundant literature showing the deleterious metabolic consequences of MO in the offspring. In this manuscript, Savva et al. characterized the transcriptomic and lipidomic profiles of the liver in male and female progeny of female mice that were fed a high-fat diet during and before pregnancy. After weaning, mice were also fed a high-fat diet. They found that both transcription and lipid composition were different in males and females, and they show that females are protected to metabolic and liver disease, whereas males develop insulin resistance, liver steatosis, and are prone to develop liver cancer. The first part of the study where the authors characterize the metabolism of the progeny, including weight, fat mass in the distinct depots, glucose and insulin tolerance, is not novel. Several publications have previously reported these findings (Programming effects of maternal and gestational obesity on offspring metabolism and metabolic inflammation. Sci Rep. 2019 Nov 5;9(1):16027). It was also previously reported in the cited publication, increased liver weight, steatosis and TG content, similar to the results of the present study. Some novelty of the manuscript is the in-depth analysis of the lipid content of the liver in the models used, as well as the transcriptional profile. Despite the substantial amount of data that the authors generated to prove differences between the male and female offspring, there is not, however, any cross analysis that could link both omics data. Overall, as discussed below the results do not support some conclusions. In particular this reviewer has the following concerns and suggestions.

      1. The metabolic status of the obese mothers has a direct impact on the offspring. It was previously reported that differences in glucose tolerance on the mother has a strong impact on the sex dimorphism in the metabolism of the progeny (please see the review: Sex and gender differences in developmental programming of metabolism. Molecular Metabolism 15 (2018) 8-19. What are the levels of glucose tolerance on the mothers used in the study? The weight of the mothers could also be shown.
      2. The fact that mice were fed a high-fat diet after weaning could lead to confounding effects. Indeed, the lack of a group of mice fed a normal diet after weaning makes difficult to establish which is the relative contribution to the phenotype of the diet of the progeny, compared to the diet of the mother.
      3. Changes in food intake could also explain some differences.
      4. Lipidomics data show major differences between males e and females. However, the impact of these differences in the distinct metabolic phenotypes is not addressed.
      5. The estrogen family and its two respective receptors, ERα and ERβ, have been widely suggested to be protective against obesity, diabetes, and cardiovascular disease. Does the transcriptional profile consistent with changes in the activity of estrogen receptor signaling?
      6. Epigenetic modifications likely underlie the differences between males and females. Histone modifications or DNA methylation analysis could further improve the study.
      7. In the last figure the authors claim that MO prevents HCC, but no data about HCC is shown, only gene expression analysis.

      Significance

      The first part of the study where the authors characterize the metabolism of the progeny, including weight, fat mass in the distinct depots, glucose and insulin tolerance, is not novel. Several publications have previously reported these findings (Programming effects of maternal and gestational obesity on offspring metabolism and metabolic inflammation. Sci Rep. 2019 Nov 5;9(1):16027). It was also previously reported in the cited publication, increased liver weight, steatosis and TG content, similar to the results of the present study.

      Referee Cross-commmenting

      I agree with the comments. I also think that the MS is difficult to read. No conexion between the OMICS data.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper Savvaet al. explore how maternal obesity influence hepatic metabolism in a sex-specific fashion. They first assess the contribution of the adipose tissue to the development of insulin resistance and glucose intolerance focusing on inflammation and oxidative phosphorylation pathways. Then they proceed to asses if maternal obesity could remodel the hepatic triglyceride levels and phospholipids using proton magnetic resonance spectroscopy and LC-MS lipidomic, respectively. Finally, they explore hepatic lipid metabolism and genes promoting cancer development.

      Despite the methodology is correct and elegant, the study does not explore a possible mechanism of action and some results are contradictory. Indeed, some of the results seems to be driven by the sex of the offspring independently of the maternal feeding.

      There are indeed, some limitations for the authors and editors to consider. To this reviewer the manuscript is difficult to read, particularly the results section in which the data are listed without discussing their relevance or their connection to previous research from other groups. Moreover, the discussion could benefit from an extensive rewrite. Indeed, this section lacks of clarity and references that could help elucidate the novel finding of the authors.

      Major comments:

      Line 97: How the authors explain similar weight gain in the F-C/HF vs. F-HF/HF? A large body of literature reports that maternal high fat diet influences offspring weight gain, independently of sex, when compared to maternal standard diet (PMID: 23973955; 29872021; 31076636; 3036829).

      Line 99: Which is the explanation for the reduced body weight in M-HF/HF from birth until 9 week of age? Can the authors show the timeline for food intake?

      Line 101: How the authors explain the increase in final weight of the male compared to the female if no differences in total fat, VAT or SAT was observed between the offspring?

      Line 116: The authors state: "The ratio between the total SAT and the Abd SAT revealed that MO redistributed SAT outside of the abdominal region in females but not in males" but Fig.1g displays no significant differences between F-C/HF and F-HF/HF. Please explain.

      Line 128-130: The authors state: "At MID, glucose tolerance was highly diet- and sex-dependent, and males but not females showed impaired glucose tolerance by MO." However, in Fig.1h no significant differences in glucose peak or glucose AUC were observed between M-C/HF and M-HF/F. Please explain.

      Line 130: OGTT only provides information on insulin secretion and action but does not directly yield a measure of insulin sensitivity. Please rephrase.

      Authors should rephrase the conclusion of the paragraph since MO does not seems to influence fat distribution or insulin resistance. Looking at figure 1 it seems that the only differences observed are driven by the sex of the offspring independently of maternal feeding. Do the dams were insulin resistant? Indeed, hyperinsulinemia and insulin resistance are key programming factor of offspring metabolic syndrome.

      Line 162: "There were no significant differences between the sexes. However, it is interesting to note that several pathways were regulated differently between sexes between the C/HF and HF/HF groups." Can the authors rephrase the concept, it is unclear.

      Line 170: The authors state: "insulin secretion pathway is reduced in males only". How this results are in line with the data reported in Fig.1i were both M-C/HF and M-HF/HF display increased insulin secretion?

      Lines 174-175: All the genes reported in Fig. 1o, except for LPIN1, do not seems to be altered by MO. Please rephrase.

      Line 184: The authors state that the signaling pathways was assessed both at transcriptional and post-transcriptional levels. Where are depicted the data of the post-transcriptional levels?

      Similarly to glucose metabolism and fat depot results, also in the case of the liver steatosis the increased number of lipid droplets seems to be linked to the sex of the animal rather than the maternal diet. Since the authors also investigated inflammatory pathways it could be of interest to assess CD68 infiltration by immunohistochemistry and Picrosirius red staining for the assessment of fibrosis.

      Liver histology in Fig.5a (M-C/HFD) is completely different from the one depicted in Fig.4a in terms of steatosis. Can the authors please explain this difference and report the magnification used in Fig.4a. Please also report the scale in Fig.5a.

      Changes in placental function are thought to be a key link between the maternal and intrauterine milieu and long-term health of the offspring (PMID:24107818). Alterations in placental function and structure in response to obesity and their underlying molecular mechanisms have been explored both in humans and in animal models (PMID:24484739; 22303323; 28291256). Others have shown that maternal hyperinsulinemia is strongly associated with offspring insulin resistance and excess placental lipid deposition and hypoxia (PMID: 28291256). Excessive lipid deposition leads to a lipotoxic placental environment that is associated with increased markers of inflammation and oxidative stress (PMID: 24333048). Can the authors could provide some data?

      Minor:

      Fig. 2m Acox1 is not reduced by MO in female. Supp. TableS1 do not report Pdk1, Lpin1, Nox4 and Prlr. Supp. TableS2 do not report PCSK9 and PNPLA3

      Significance

      The prevalence of obesity during pregnancy continues to increase at alarming rates. This is concerning as in addition to immediate impacts on maternal wellbeing, obesity during pregnancy has detrimental effects on the long-term health of the offspring. This paper is connected to an extended research field aiming at prevent the detrimental effect of maternal obesity on the offspring.

      An important limitation in the ability to design intervention strategies to prevent the detrimental effects of maternal obesity on offspring health is that it is currently unclear which of the many potential variables associated with obesity is the key programming factor mediating the effects on the offspring.

      Reviewer field of expertise:

      Molecular Biology, Type 2 Diabetes, Obesity and NAFLD.

      Referee Cross-commenting

      I agree with the other reviewers' comments, particularly on the lack of mechanistic data.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript from Savva et al. focuses on a long-standing and unresolved challenge of metabolic (and not only) health in mammals: the sexual dimorphism. Authors couple transcriptomics and metabolomics to in-depth molecular phenotyping in offspring of dams fed HFD before conception and throughout pregnancy and lactation to isolate molecular determinants of sexually dimorphic response to maternal obesity.

      Major comments:

      1. While the manuscript present a compelling and exhaustive amount of data, which accurately describes the sexually dimorphic responses to maternal obesity, it lacks mechanistic insights and I personally think this to mainly be a timing issue. For example, I tried hard to find experimental details on the RNA sequencing and could not find much: when is the RNA sequencing performed? If, as I suspect, the sequencing experiments match the metabolomics experiments, I don't think they add much mechanistic insights onto the observed phenomena. They rather contribute to better describe them. Indeed, both metabolomic and transcriptomic profiles might be consequence of the observed phenotypes, rather than be causative (as the authors try to argue). Are these differences already present at birth? what happens to placenta and fetal tissues?
      2. Some adult phenotypes - especially metabolic and neurological phenotypes - might also be influenced by different maternal care early postnatal. Are the litters balanced by number and sex ratio? Would cross-fostering maintain the phenotypes?

      Of the two points above, I would love to see more details on the RNA sequencing, as well as placental and fetal tissues analysed. It would be also interesting to know about any litter balancing measure or at least have more statistics on the litter size and sex ratios.

      Significance

      The manuscript from Savva et al. revolves around the unresolved challenge of how sexually dimorphic phenotypes are established. The topic is actual - although already a lot has been published, as acknowledged by the authors as well - and of broad interest to the community of mouse geneticists and physiologists. To understand the molecular underpinnings of sexually dimorphic phenotypes, the authors use in-depth molecular phenotyping in the mouse coupled to metabolomics and transcriptomics. While extremely informative and exhaustive, the actual dataset is - at least for me - purely descriptive, which might reduce its overall impact. I'm a mouse geneticist and a metabolic physiologist and I find the topic of sexual dimorphism extremely interesting.

      Referee Cross-commenting

      I generally agree with the other reviewers' comments. I think the ms is interesting and the dataset compelling although to a certain extent overlapping with previously published studies. There is general agreement on the lack of mechanistic data and the authors should definitely address this point.

  2. Mar 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity):

      The authors present further investigation of the Sox transcription factors in the model Cnidarian Hydractinia. They showcase the Hydractinia as now a relatively technically advanced model system to study animal stem cells, regeneration and the control of differentiation in animal cells. In this study they characterise the neural cells in hydractinia using FACS and sing cell transcriptome sequencing, investigate the sequential expression of SoxB genes in the i-cells and presumptive lineage giving rise to i-cells and investigate the neuronal regeneration making good use of transgenic rules. Finally, they investigate the role of SoxB genes in embryonic neurogenesis.

      There are no major or minor issues effecting the conclusions

      Reviewer #1 (Significance):

      This study helps to confirm the role of an important group of transcription factors is conserved across the metazoan as well as showcasing an exciting model organism for regeneration and stem cell biology. This will of interest to a broad audience of developmental and biologists.

      My own research is in the same field, using a different model system

      Referees cross-commenting

      I agree with the comments from the other reviewers, and am sure the authors can address these adequately with further explanation.

      Reviewer #2 (Evidence, reproducibility and clarity):

      Summary

      Chrysostomou et al. investigate the role of three putative SoxB genes in embryonic neurogenesis in the colonial hydrozoan Hydractinia. They show that SoxB1 is co-expressed with Piwi in the multipotent i-cells and, using transgenics, they show that these Piwi/SoxB1 cells become neurons and gametes, consistent with the cell types that differentiate from i-cells. They further suggest that SoxB2 and SoxB3 are expressed downstream of SoxB1 in the progeny of the i-cells and, using shRNAs, investigate the role of SoxB genes on embryonic neurogenesis. The primary conclusions center on the similarity between neural differentiation in humans and Hydractinia as both systems pattern neurons using sequential expression of SoxB genes during the differentiation of neurons. The manuscript presents a large and diverse set of data derived from analysis of transgenic animals, single-cell sequencing, and investigation of gene function; despite this, the conclusions are either not particularly novel or not well-supported. The co-expression of SoxB1 in Piwi-expressing i-cells appears to be both novel and significant but the implications are not clearly indicated. Additional specific concerns are detailed below.

      Major comments

      1. SoxB genes act sequentially<br /> Knockdown of SoxB2 has already been shown to result in the loss of SoxB3, so the sequential action of SoxB genes in this animal does not seem to be a terribly novel conclusion.

      Sequential expression of Soxb1-Soxb2 has not been demonstrated previously. Flici et al. did show some data on Soxb1 expression but these were not detailed. Furthermore, they have not shown in vivo transition to Soxb2. Our new single-molecule fluorescence in situ hybridization, and the transgenic reporter animals have been developed to address these issues.

      While this manuscript does appear to report the most comprehensive analysis of SoxB1 expression, the evidence for sequential activation of SoxB1 and then SoxB2 in the same lineage (Figure 4) is a bit troubling. Panel A of this figure appears to show complete overlap between SoxB1 and SoxB2, suggesting all the cells in this field are synchronously passing through the transition point from SoxB1 to SoxB2 expression. While this may reflect reality, it would be more convincing to see adjacent cells expressing SoxB1 only or SoxB2 only, reflecting the dynamic progression of cell type specification along the main body axis.

      As shown in Figures 1, Soxb1 is expressed by i-cells (together with Piwi1) in the lower body column of feeding polyps and in germ cells in sexual polyps. These cells do not express Soxb2. Figure 2 shows that Soxb2 is expressed more orally in a population of putative i-cell progeny as they migrate towards the head. These cells still express Soxb1. In the upper part of the body column, just under the tentacle line, there are Soxb2+ cells that do not express Soxb1. Therefore, cells expressing Soxb1 but not Soxb2 are present in the basal part of the polyp, Soxb1+/Soxb2+ double positive cells in the mid body region (i.e., the interface between the two domains where Soxb1+ cells start to express Soxb2 and downregulate Soxb1.), and cells expressing Soxb2 but not Soxb1 in the upper part of the polyp, just under the tentacle line. In Figure 4, we show the interface between these two domains using in vivo imaging of double transgenic reporter animals to visualize the Soxb1 to Soxb2 transition. Indeed, in the mid body area, most Soxb1+ cells also express Soxb2 (Figure 2). Hence, Figure 4 should be seen keeping Figure 2’s data in mind. At the mRNA level, the overlap between the Soxb1 and Soxb2 domains is smaller (Figure 2) than the one shown in Figure 4 because the latter constitutes a lineage tracing, showing fluorescent proteins with a long half-life. Therefore, when i-cells downregulate Soxb1 while starting to express Soxb2, the long half-life of tdTomato results in red fluorescence persisting longer than the mRNA encoding it. We have added cartoons to Figure 4 to indicate the position along the main body axis that are depicted.

      Panel B is more concerning; while the authors have highlighted a cell that does appear to transition from SoxB1+ to SoxB1+/SoxB2+, there are several cells in the background that appear to gain SoxB2 expression without first expressing SoxB1. Do these cells constitute a fundamentally different, SoxB1-indpenendent, lineage of SoxB2+ cells? This would be noteworthy but is not mentioned or characterized.

      The panels included in Figure 4 constitute selected confocal slices of stacks acquired in vivo. During imaging, cells move in three dimensions, making them appear and disappear in given optical planes over time. In other words, the individual time frames shown (T0-T5) were not always found in the same plane due to cell migration in the Z dimension. The cells that appear to gain Soxb2+ w/o having expressed Soxb1 first are an example of such cells. They are probably Soxb2+ cells that had already downregulated Soxb1 and migrated into the respective plane of image. We have added the explanation to Figure 4's legend.

      Figure 7 shows the effect of SoxB1 knockdown (by shRNA) on the number of Piwi-expressing cells, nematocytes, etc but why not show that SoxB2 and SoxB3 are also knocked down in these experiments? Figure S11 shows no effect of SoxB2 and SoxB3 knockdown on SoxB1 expression but why wasn't the reciprocal experiment performed? If SoxB2 and SoxB3 are really downstream of SoxB1, the authors should demonstrate that with the shRNA experiments.

      Our data show that Soxb1 is expressed in i-cells and its KD reduces the number of these stem cells (assessed by expression of Piwi1, an i-cell marker). Because i-cells give rise to all Hydractinia somatic lineages (and to germ cells), focusing specifically on Soxb2+ cells would provide no further insight because all cell types are expected to be affected. Indeed, injection of shRNA targeting Soxb1 resulted in smaller animals with multiple defects, including but not limited to the neural lineage.

      1. Knockdown of SoxB genes resulted in complex defects in embryonic neurogenesis<br /> The manuscript aims to detail the roles of SoxB1, SoxB2, and SoxB3 in embryogenesis but only one of the main figures even shows pre-polyp life stages (Figure 7) and the results presented in in this figure are confusing. The authors suggest that knockdown of SoxB3 had no effect on embryonic neurogenesis but another interpretation of these data is that the SoxB3 shRNA simply did not work. The authors should provide additional support to show that this reagent is working as expected.

      This information is included in Figure S11. Using mRNA in situ hybridization, we show that injection of shRNA targeting Soxb3 causes transcriptional downregulation of Soxb3 but not of Soxb2. The figure also shows the specificities of the shRNAs targeting Soxb1 and Soxb2.

      Further, the results for SoxB1 and SoxB2 knockdown do not support the previous investigation of the role of SoxB2 in neurogenesis (Flici et al 2017). If SoxB1 is upstream of SoxB2, how does knockdown of SoxB1 have such a dramatic effect on RFamide neurons and nematocytes but knockdown of SoxB2 has an effect only on RFamide neurons? Is it possible the SoxB2 shRNA also wasn't working as expected? Can the results of the Flici et al 2017 paper showing SoxB2 knockdown in polyps be recapitulated using these shRNAs? If the point is to argue that embryos and adults (polyps) use fundamentally different mechanisms to drive neurogenesis, then the results presented in Figures 1-6 (which investigate SoxB genes in polyps) can't really be used to make inferences about embryonic neurogenesis. I think the authors have more work to do to demonstrate that embryonic and adult neurogenesis fundamentally differ.

      The Soxb2 shRNA specificity is shown in Figure S11 (i.e., it KD Soxb2 but not Soxb1). We were equally surprised to discover that Soxb2 KD resulted in somewhat different phenotypes than the ones obtained by Flici et al. (2017) in polyps. At this stage, we cannot explain the difference. However, one could speculate that it resulted from slightly different regulation logic between embryonic and adult neurogenesis. More specifically, we propose different priorities for generating neural subtypes as explanation. Unfortunately, shRNAs work only with embryos, and long dsRNA mediated KD works only with polyps. CRISPR/Cas9-mediated KO is feasible in Hydractinia, but knocking out developmental genes, such as these Sox genes, would likely cause embryonic lethality. Other conditional KO/KD approaches are not available for Hydractinia. We believe we have made all possible efforts to clarify the roles of these genes using currently available techniques. Neurogenesis is a complex process that is only partially conserved among different animals and poorly studied in non-bilaterians. Furthermore, it is not possible to answer all questions in one study. As many studies before, our work contributes to the understanding of neurogenesis but also raises new questions. Addressing them is matter for future research. We have toned down the statement in the last sentence of the results and in the discussion and do not claim that embryonic and adult neurogenesis are fundamentally different.

      Minor comments

      Methods: A large bit of data from this manuscript relies on quantitative analysis of cell number but there's not enough information in the methods to understand how quantification was performed. How many slices from the z-stack were analyzed? Were counts made relative to the total tissue area in the X/Y dimension or relative to the number of total nuclei in the same section? How many individuals were examined for each analysis?

      All cell counting analysis was performed using ImageJ/Fiji software. Counts were made relative to the total tissue area in the X/Y dimension (for the shRNA experiments). A Z-stack covering the whole depth of each larva was obtained. Counting was performed on cells positive for the respective cell type marker based on antibody staining and numbers were compared between shControl and shSoxb1/2/3 animals. At least 4 animals were counted per condition.

      Page 11 - "Piwi2low cells, which are presumably i-cell progeny" - how were "high" and "low quantified?

      “High” and “low” were not quantified. This is because i-cells progressively downregulate Piwi genes (i.e., Piwi1 and Piwi2) as they differentiate but this is a continuous process. Hence, it is difficult to put a threshold of Piwi1/Piwi2 protein level below which a cell ceases to be an i-cell while becoming a committed progeny. This is a similar process that is well documented in other animals where stemness markers are gradually downregulated during differentiation.

      Page 13 - "a role in maintaining stemness" - this comment is not totally clear to me. Why would the number of EdU+ cells increase if the role of SoxB1 is to maintain stemness? Wouldn't SoxB1 knockdown then force stem cells to exit their program, resulting in early differentiation of i-cell progeny? This should be clarified.

      KD of Soxb1 resulted in a decrease in the number of i-cells (i.e., Piwi1+ ones), suggesting that the gene is required for stemness maintenance. The increase in the numbers of cells in S-phase in this context was not related to i-cells because most of them were Piwi1-negative (Figure 7B). The identity of the cells in S-phase remains unknown, but a plausible explanation is that i-cell progeny (e.g., nematoblasts; see also next comment) increase their proliferative activity when i-cells numbers are low as a compensatory mechanism. This is merely a speculation. We have rephrased the paragraph to increase clarity.

      Page 13 - "if progenitors are limiting" - if progenitors are limited why would there be an increase in nematocytes?

      We do not have a definitive answer to this question but speculate that nematoblasts (i.e., stinging cell progenitors) account, at least in part, for the excessive proliferation seen under Soxb1 KD. This may constitute a mechanism allowing a depleted i-cell population to recover by self-renewal (instead of differentiation), moving temporarily the proliferation task to committed progeny (e.g., nematoblasts) until i-cell numbers return to normal. However, in the absence of evidence we refrain from expanding on this in the text.

      Figures 1 and 2 claim to show "partial overlap" but they look perfectly overlapping to me. This makes the situation in Figure 4B difficult to interpret.

      Figure 1 shows full overlap between Piwi1 and Sox1 expression and this is reflected in the text. Figure 2 shows no overlap between Soxb1 and Soxb2 in the lower body column (where only Soxb1 is expressed), overlap in the mid body region, and Soxb2 only expressing cells in the upper part of the body, just under the tentacle line. Similarly, the figure shows overlap between Soxb2/Soxb3 under the tentacle line, and predominantly Soxb3 above it in the head region. The small cartoons at the left side of each panel indicate its position along the oralaboral axis. See also our reply to the second part of comment #1.

      Figure 4 - No indication of which part of the animal or which stage is shown in these images.

      We have added cartoons to indicate the area in the polyp from where the images were taken.

      Figure 5 - No indication of where these dissociated cells came from - polyps? Larvae?

      All tissue samples were taken from feeding polyps; this is now mentioned in the Materials and Methods section.

      Panel D is a bit perplexing - what are the "progeny" of Piwi+ cells if not SoxB2+ cells and their derivatives?

      In Panel D, we show three cell fractions. One constitutes i-cells, based on high Piwi1 expression (green fluorescence of the Piwi1::GFP reporter transgene) and morphology; one fraction includes nematocytes, based on the characteristic nematocyst capsule, and one constitutes a mixture of other i-cell progeny. The latter includes different cell types, given that i-cells are thought to contribute to all lineages. They have only dim GFP fluorescence because the Piwi1 promoter-driven GFP shuts down upon i-cell differentiation. Soxb2+ cells are also among them but are not the only i-cell progeny.

      Why are nematocytes but not neurons indicated?

      Neurons are shown on Panels E & F. See also next comment.

      Piwi seems to be maintained in Ncol-expressing cells but not in SoxB2- or RFamide-expressing cells? Does this suggest that Piwi is turned on in i-cells, off in SoxB2-expressing cells, and on again in terminally differentiating nematocytes? This would be quite surprising and should be verified with antibody labeling/imaging in Piwi transgenics to confirm the result. The resolution for Panel M is too low to evaluate this part of the figure.

      The Piwi1i gene is downregulated upon i-cell differentiation. In the Piwi1:GFP reporter animal, residual GFP fluorescence persists post differentiation due to GFP's long half-life. The brightness of which depends on the time elapsed since differentiation. Because nematocytes are short living cells with high turnover, most nematocytes have recently differentiated and are therefore relatively bright green in the Piwi1::GFP animal. Neuron turnover is lower, making most neurons in the same transgenic animal appear dim. The resolution of the imaging flow cytometer is limited because the machine images 1000s of cells per second through all optical channels. However, it is high enough to allow the identification of features such as cell shape, some organelles (e.g., nematocytes), nuclear size and shape, and fluorescence intensity.

      Figure 7 - the low magnification images provide nice overall context but the authors should also provide high magnification panels for the same images. Without them it is not possible to assess "defects in ciliation" or to determine if there are defects in GLWamide neurons from these knockdowns (e.g., neurite vs cell body defects). There's no mention of the fact that SoxB1 knockdown resulted in complete loss of RFamide cells, which is strange. Are there SoxB2-independent populations of RFamide? Panel B could be interpreted multiple ways - downregulation of Piwi in SoxB1 shRNA or upregulation in SoxB2/B3. The authors should provide an image of control shRNA-injected larvae with the same co-labeling of Piwi/EdU for context. From the images, it's not clear that there were differential effects of SoxB2 and SoxB3 on nematocytes.

      The resolution of the images is, in fact, high, allowing it to be blown up on the screen. Even higher magnification of ciliation can be seen in Figure S12. KD of Soxb1 resulted in complete or nearly complete loss of Rfamide+ neurons. We have added this statement to the text as requested. Panel B shows the relative difference in Piwi1+ and S-phase cells between shSoxb1, shSoxb2, and shSoxb3-treated animals. The quantification relative to the control is presented in Figure 7C.

      Figures 6 and S9 - why piwi2 and not piwi1?

      In Figure 6, we co-stained the regenerates with two antibodies: one was a rabbit anti-GFP (to visualize the RFamide+ neurons), and the other was a guinea pig anti-Piwi2 (to visualize icells). The anti-Piwi1 antibody that was used in other images to visualize i-cells was raised in rabbit and could not be used in conjunction with the anti-GFP one.

      Figure S1 - Kayal et al 2018 is the most recent phylogeny of cnidarians and should probably be cited in place of Zapata throughout the manuscript. Independent of this, the polytomy in Figure S1 panel A is not supported by either Zapata or Kayal and should be fixed.

      We have cited Kayal et al. 2018 and revised the tree in Figure S1 as pointed.

      Figure S3 - is this mRNA? Protein? Panels E-G are too small to interpret. Please provide stage/time for cartoons in panel H.

      As per the legend, Panels A, B, D, E, F refer to protein; C is lectin staining (DSA), and G is EdU. The resolution of Panels E-G is actually high, allowing blowing up of the images on the screen to view the details. The stages of the cartoon in Panel H are now provided in the figure legend.

      Figure S11 - please provide images of whole larvae as shown for Piwi knockdown in Fig S9 and some additional support (e.g., qPCR) to demonstrate the shRNAs are actually working.

      Figure S9 represents immunostaining using the anti-Piwi1 antibody. In Figure S11, we show the specificity of the shRNA treatments; we used highly sensitive single-molecule mRNA in situ hybridization. Whole animal imaging is not informative due to the punctuated nature of the single-molecule staining.

      Figure S12 - it's not clear what ciliary "defects" are being shown.

      In the control, cilia are uniformly distributed along the oral-aboral axis whereas in the shSoxb1-injected animals, the pattern is patchy. Additionally, shSoxb1-injected larvae could not swim (planulae swim by coordinated cilia beat).

      Reviewer #2 (Significance):

      Generally, the results are either equivocal or the conclusions are not well supported by the results (as detailed above). The significance of this work to vertebrate neurobiology is somewhat weak. (Especially considering the orthology of these genes to bilaterian SoxB genes is not well supported.) Why not compare these results to other cnidarians - the expression patterns of SoxB1 and SoxB2 in corals and sea anemones seem to differ quite a lot (Shinzato et al 2008; Magie et al 2005), suggesting these genes are almost certainly not behaving in the same way across cnidarians. This is exciting! What's happening in Hydra? Seems like it should be possible to mine the single-cell data set from Siebert et al to test these hypothesized relationships between the Sox genes in another hydrozoan which constantly makes new neurons.

      We have modified the concluding section in the discussion, in line with this comment. See also comment to Reviewer #3.

      Reviewer #3 (Evidence, reproducibility and clarity):

      This paper characterizes the role of Soxb genes in neurogenesis in Hydractinia. The authors use cutting edge approaches including FISH, transgenics, image flow cytometry, FACS and shRNA knock downs to characterize SoxB in Hydractinia. The images are beautiful, the data is sound and the interpretation of the data is appropriate.

      I have only minor suggested listed by section below:

      Abstract<br /> - The abstract and introduction should make clear that this is a colonial animal and the cell migration occurs from the aboral to the oral end of the polyp (not the animal, as there are many oral ends). This is relevant to the interpretation of the data as the polyps do not act in isolation as they interconnected and may communicate via the stolonal network that connects the polyps in the colony.

      We have added a section to the Introduction to address the reviewer's comment. The Abstract, however, is too short to include this explanation.

      • The human disease justification is a relatively weak one and does not need to be included. Using Hydractinia to understand the role of SoxB in the evolution of neurogenesis in animals is enough justification for the study.

      We have adopted the reviewer's comment and modified the statement in the discussion (see also comment to Reviewer #2).

      Introduction<br /> - Instead of Sox phylogenies (the term phylogeny is more appropriate for species trees), consider substituting, for Sox gene trees. And instead of "phylogenetic relation" use the term "orthology"

      This has been done.

      • The number of times the sentences that have the sentiment "....remain unknown." "....little is known.." "...unclear..." , "....difficult to establish...." etc. is distracting and detracts from what IS known about these genes. It is not necessary to continually justify the study throughout the introduction. Instead a clearer description of the background and setting up the question/hypothesis of SoxB paralog subfuctionalization in space and time - would be more informative to the reader.

      We have reduced the number of occasions as recommended.

      • The authors state that there are three SoxB genes in the Hydractinia genome? What genome? For several years there has been multiple papers published by subsets of these authors have used unpublished genome data, but the complete genome has yet to be released to the public. This is especially egregious because they cite their NSF funded EDGE proposal to CEF and UF which is supposed to develop tools to the community, and yet the community at large doesn't have access to the genome. If these data came from the genome, then the genome should be released. If these data came from a previously published transcriptome as in the previous SoxB paper then this should be stated explicitly.

      The Hydractinia genome assembly, annotation, RNA-seq data, and genome browser are now available in the Hydractinia genome project portal at the National Human Genome Research Institute (NIH) website (https://research.nhgri.nih.gov/hydractinia/). The raw data have been deposited in the NCBI Sequence Read Archive (SRA) under BioProject PRJNA807936. This information has been added to the 'Resource availability' section.

      Results<br /> - I assume there was no expression of Soxb2 and Soxb3 in the reproductive polyps? This should be stated explicitly.

      Soxb2 expression in sexual polyps was consistent with the nervous system and with maternal deposition in oocytes. It was not detected in male germ cells. We have added a new in situ hybridization image of Soxb2 to Figure 12.

      • The word "progeny" is used throughout to describe terminally differentiated cells. However, progeny implies offspring, but these are actually later stages of differentiation of the in a cell's ontogeny, thus the term should be changed to "differentiated cells"

      We used "progeny" to indicate that the corresponding cells derived from a specific progenitor cell type. We did try replacing it with "differentiated cells" but this completely changes the meaning of the sentence: first, it does not include the cell of origin info and second, not all progeny are already fully differentiated.

      • Typo on page 11 "This predictable generation of many new neurons provides an opportunity to study neurogenesis in [a ]regeneration." - Remove the "a"

      Corrected.

      • While the regeneration study is interesting, there is nothing revealed about the role of Soxb and there is not a lot of new information revealed about regenerations. Authors should better justify this section or consider omitting.

      These sections demonstrate de novo neurogenesis in head regeneration. This was not known in this animal before.

      Discussion<br /> - The authors assume that in the transgenic lineage, the fluorescent marker in differentiated cells is due to retention of fluorescence, but it is unclear if they can rule out that Soxb2 is still being expressed in those cells" Please clarify.

      We conclude this by comparing the mRNA expression (Figures 1 & 2) with the fluorescent proteins (Figure 3).

      • How did the authors determine that the shSoxb3 knockdown worked? Please discuss relevant controls and validation (either in discussion or methods). This is particularly important given that it didn't have an apparent phenotypic effect.

      The efficacy of all shRNAs determined by in situ hybridization, showing that each shRNA downregulates its own target mRNA but not the others (Figure S11).

      • Again, the connection to human health is a bit of a stretch. Instead, what is most interesting is the similarity of Soxb paralogs acting sequentially as has been found in vertebrates. This suggests a highly conserved mechanism of subfunctionization following gene duplication at the base of animals.

      We agree. This is now also better highlighted in the discussion.

      Figures<br /> - Its very hard to distinguish the overall abundance of Soxb2 and Soxb3 expression along the polyp body axis from the panels figure 2. A lower magnification or larger area in each region would be helpful

      In Figure 2, we performed single-molecule in situ hybridization. While highly sensitive, this method generates spotty images because they highlight single molecules and are not coupled to an enzymatic reaction as in other methods. They mostly looks poor when showing low magnification images. Because a previous study (Flici et al. 2017) has already shown the general expression pattern, we aimed at providing the details of the transition.

      • Figure 4 - either the figure is upside down or the text is upside down. It is also difficult to see the double staining (if any).

      The figure is oriented to position the oral end up. The resolution of the panels is high, enabling blowing-up on the screen. The quality of in vivo time lapse images cannot match that of fixed and antibody stained ones, or of single in vivo images. This is because the animals are imaged for many hours during which they tend to bleach.

      • Figure 5M is difficult to read due to the small print. Consider enlarging and moving it to Supplementary Material

      The size of the text is small but the resolution is very high, enabling blowing up the image on the screen. We thought that the information was important enough to be presented in the main text and given that most readers would use the electronic version we preferred this option on another supplemental figure on top of the 12 we already have.

      Reviewer #3 (Significance):

      This is an interesting and important study because although it is well known that SoxB genes function in neurogenesis in animals, it is unclear how and if subfunctionalization occurs outside of vertebrates. Hydractinia is an excellent model to study SoxB genes because of its colonial organization and continuous development of nerve cells throughout the life of the animal. In addition, it is part of the early diverging cnidarian lineage and thus can provide insight into the relative conservation of SoxB genes across animals.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper characterizes the role of Soxb genes in neurogenesis in Hydractinia. The authors use cutting edge approaches including FISH, transgenics, image flow cytometry, FACS and shRNA knock downs to characterize SoxB in Hydractinia. The images are beautiful, the data is sound and the interpretation of the data is appropriate.

      I have only minor suggested listed by section below:

      Abstract<br /> - The abstract and introduction should make clear that this is a colonial animal and the cell migration occurs from the aboral to the oral end of the polyp (not the animal, as there are many oral ends). This is relevant to the interpretation of the data as the polyps do not act in isolation as they interconnected and may communicate via the stolonal network that connects the polyps in the colony.<br /> - The human disease justification is a relatively weak one and does not need to be included. Using Hydractinia to understand the role of SoxB in the evolution of neurogenesis in animals is enough justification for the study.

      Introduction<br /> - Instead of Sox phylogenies (the term phylogeny is more appropriate for species trees), consider substituting, for Sox gene trees. And instead of "phylogenetic relation" use the term "orthology"<br /> - The number of times the sentences that have the sentiment "....remain unknown." "....little is known.." "...unclear..." , "....difficult to establish...." etc. is distracting and detracts from what IS known about these genes. It is not necessary to continually justify the study throughout the introduction. Instead a clearer description of the background and setting up the question/hypothesis of SoxB paralog subfuctionalization in space and time - would be more informative to the reader.<br /> - The authors state that there are three SoxB genes in the Hydractinia genome? What genome? For several years there has been multiple papers published by subsets of these authors have used unpublished genome data, but the complete genome has yet to be released to the public. This is especially egregious because they cite their NSF funded EDGE proposal to CEF and UF which is supposed to develop tools to the community, and yet the community at large doesn't have access to the genome. If these data came from the genome, then the genome should be released. If these data came from a previously published transcriptome as in the previous SoxB paper then this should be stated explicitly.

      Results<br /> - I assume there was no expression of Soxb2 and Soxb3 in the reproductive polyps? This should be stated explicitly.<br /> - The word "progeny" is used throughout to describe terminally differentiated cells. However, progeny implies offspring, but these are actually later stages of differentiation of the in a cell's ontogeny, thus the term should be changed to "differentiated cells"<br /> - Typo on page 11 "This predictable generation of many new neurons provides an opportunity to study neurogenesis in [a ]regeneration." - Remove the "a"<br /> - While the regeneration study is interesting, there is nothing revealed about the role of Soxb and there is not a lot of new information revealed about regenerations. Authors should better justify this section or consider omitting.

      Discussion<br /> - The authors assume that in the transgenic lineage, the fluorescent marker in differentiated cells is due to retention of fluorescence, but it is unclear if they can rule out that Soxb2 is still being expressed in those cells" Please clarify.<br /> - How did the authors determine that the shSoxb3 knockdown worked? Please discuss relevant controls and validation (either in discussion or methods). This is particularly important given that it didn't have an apparent phenotypic effect.<br /> - Again, the connection to human health is a bit of a stretch. Instead, what is most interesting is the similarity of Soxb paralogs acting sequentially as has been found in vertebrates. This suggests a highly conserved mechanism of subfunctionization following gene duplication at the base of animals.

      Figures<br /> - Its very hard to distinguish the overall abundance of Soxb2 and Soxb3 expression along the polyp body axis from the panels figure 2. A lower magnification or larger area in each region would be helpful<br /> - Figure 4 - either the figure is upside down or the text is upside down. It is also difficult to see the double staining (if any).<br /> - Figure 5M is difficult to read due to the small print. Consider enlarging and moving it to Supplementary Material

      Significance

      This is an interesting and important study because although it is well known that SoxB genes function in neurogenesis in animals, it is unclear how and if subfunctionalization occurs outside of vertebrates. Hydractinia is an excellent model to study SoxB genes because of its colonial organization and continuous development of nerve cells throughout the life of the animal. In addition, it is part of the early diverging cnidarian lineage and thus can provide insight into the relative conservation of SoxB genes across animals.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors present further investigation of the Sox transcription factors in the model Cnidarian Hydractinia. They showcase the Hydractinia as now a relatively technically advanced model system to study animal stem cells, regeneration and the control of differentiation in animal cells. In this study they characterise the neural cells in hydractinia using FACS and sing cell transcriptome sequencing, investigate the sequential expression of SoxB genes in the i-cells and presumptive lineage giving rise to i-cells and investigate the neuronal regeneration making good use of transgenic rules. Finally, they investigate the role of SoxB genes in embryonic neurogenesis.

      There are no major or minor issues effecting the conclusions

      Significance

      This study helps to confirm the role of an important group of transcription factors is conserved across the metazoan as well as showcasing an exciting model organism for regeneration and stem cell biology. This will of interest to a broad audience of developmental and biologists.

      My own research is in the same field, using a different model system

      Referees cross-commenting

      I agree with the comments from the other reviewers, and am sure the authors can address these adequately with further explanation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      Chrysostomou et al. investigate the role of three putative SoxB genes in embryonic neurogenesis in the colonial hydrozoan Hydractinia. They show that SoxB1 is co-expressed with Piwi in the multipotent i-cells and, using transgenics, they show that these Piwi/SoxB1 cells become neurons and gametes, consistent with the cell types that differentiate from i-cells. They further suggest that SoxB2 and SoxB3 are expressed downstream of SoxB1 in the progeny of the i-cells and, using shRNAs, investigate the role of SoxB genes on embryonic neurogenesis. The primary conclusions center on the similarity between neural differentiation in humans and Hydractinia as both systems pattern neurons using sequential expression of SoxB genes during the differentiation of neurons. The manuscript presents a large and diverse set of data derived from analysis of transgenic animals, single-cell sequencing, and investigation of gene function; despite this, the conclusions are either not particularly novel or not well-supported. The co-expression of SoxB1 in Piwi-expressing i-cells appears to be both novel and significant but the implications are not clearly indicated. Additional specific concerns are detailed below.

      Major comments

      1. SoxB genes act sequentially<br /> Knockdown of SoxB2 has already been shown to result in the loss of SoxB3, so the sequential action of SoxB genes in this animal does not seem to be a terribly novel conclusion. While this manuscript does appear to report the most comprehensive analysis of SoxB1 expression, the evidence for sequential activation of SoxB1 and then SoxB2 in the same lineage (Figure 4) is a bit troubling. Panel A of this figure appears to show complete overlap between SoxB1 and SoxB2, suggesting all the cells in this field are synchronously passing through the transition point from SoxB1 to SoxB2 expression. While this may reflect reality, it would be more convincing to see adjacent cells expressing SoxB1 only or SoxB2 only, reflecting the dynamic progression of cell type specification along the main body axis. Panel B is more concerning; while the authors have highlighted a cell that does appear to transition from SoxB1+ to SoxB1+/SoxB2+, there are several cells in the background that appear to gain SoxB2 expression without first expressing SoxB1. Do these cells constitute a fundamentally different, SoxB1-indpenendent, lineage of SoxB2+ cells? This would be noteworthy but is not mentioned or characterized. Figure 7 shows the effect of SoxB1 knockdown (by shRNA) on the number of Piwi-expressing cells, nematocytes, etc but why not show that SoxB2 and SoxB3 are also knocked down in these experiments? Figure S11 shows no effect of SoxB2 and SoxB3 knockdown on SoxB1 expression but why wasn't the reciprocal experiment performed? If SoxB2 and SoxB3 are really downstream of SoxB1, the authors should demonstrate that with the shRNA experiments.
      2. Knockdown of SoxB genes resulted in complex defects in embryonic neurogenesis<br /> The manuscript aims to detail the roles of SoxB1, SoxB2, and SoxB3 in embryogenesis but only one of the main figures even shows pre-polyp life stages (Figure 7) and the results presented in in this figure are confusing. The authors suggest that knockdown of SoxB3 had no effect on embryonic neurogenesis but another interpretation of these data is that the SoxB3 shRNA simply did not work. The authors should provide additional support to show that this reagent is working as expected. Further, the results for SoxB1 and SoxB2 knockdown do not support the previous investigation of the role of SoxB2 in neurogenesis (Flici et al 2017). If SoxB1 is upstream of SoxB2, how does knockdown of SoxB1 have such a dramatic effect on RFamide neurons and nematocytes but knockdown of SoxB2 has an effect only on RFamide neurons? Is it possible the SoxB2 shRNA also wasn't working as expected? Can the results of the Flici et al 2017 paper showing SoxB2 knockdown in polyps be recapitulated using these shRNAs? If the point is to argue that embryos and adults (polyps) use fundamentally different mechanisms to drive neurogenesis, then the results presented in Figures 1-6 (which investigate SoxB genes in polyps) can't really be used to make inferences about embryonic neurogenesis. I think the authors have more work to do to demonstrate that embryonic and adult neurogenesis fundamentally differ.

      Minor comments

      Methods: A large bit of data from this manuscript relies on quantitative analysis of cell number but there's not enough information in the methods to understand how quantification was performed. How many slices from the z-stack were analyzed? Were counts made relative to the total tissue area in the X/Y dimension or relative to the number of total nuclei in the same section? How many individuals were examined for each analysis?

      Page 11 - "Piwi2low cells, which are presumably i-cell progeny" - how were "high" and "low quantified?

      Page 13 - "a role in maintaining stemness" - this comment is not totally clear to me. Why would the number of EdU+ cells increase if the role of SoxB1 is to maintain stemness? Wouldn't SoxB1 knockdown then force stem cells to exit their program, resulting in early differentiation of i-cell progeny? This should be clarified.

      Page 13 - "if progenitors are limiting" - if progenitors are limited why would there be an increase in nematocytes?

      Figures 1 and 2 claim to show "partial overlap" but they look perfectly overlapping to me. This makes the situation in Figure 4B difficult to interpret.

      Figure 4 - No indication of which part of the animal or which stage is shown in these images.

      Figure 5 - No indication of where these dissociated cells came from - polyps? Larvae? Panel D is a bit perplexing - what are the "progeny" of Piwi+ cells if not SoxB2+ cells and their derivatives? Why are nematocytes but not neurons indicated? Piwi seems to be maintained in Ncol-expressing cells but not in SoxB2- or RFamide-expressing cells? Does this suggest that Piwi is turned on in i-cells, off in SoxB2-expressing cells, and on again in terminally differentiating nematocytes? This would be quite surprising and should be verified with antibody labeling/imaging in Piwi transgenics to confirm the result. The resolution for Panel M is too low to evaluate this part of the figure.

      Figure 7 - the low magnification images provide nice overall context but the authors should also provide high magnification panels for the same images. Without them it is not possible to assess "defects in ciliation" or to determine if there are defects in GLWamide neurons from these knockdowns (e.g., neurite vs cell body defects). There's no mention of the fact that SoxB1 knockdown resulted in complete loss of RFamide cells, which is strange. Are there SoxB2-independent populations of RFamide? Panel B could be interpreted multiple ways - downregulation of Piwi in SoxB1 shRNA or upregulation in SoxB2/B3. The authors should provide an image of control shRNA-injected larvae with the same co-labeling of Piwi/EdU for context. From the images, it's not clear that there were differential effects of SoxB2 and SoxB3 on nematocytes.

      Figures 6 and S9 - why piwi2 and not piwi1?

      Figure S1 - Kayal et al 2018 is the most recent phylogeny of cnidarians and should probably be cited in place of Zapata throughout the manuscript. Independent of this, the polytomy in Figure S1 panel A is not supported by either Zapata or Kayal and should be fixed.

      Figure S3 - is this mRNA? Protein? Panels E-G are too small to interpret. Please provide stage/time for cartoons in panel H.

      Figure S11 - please provide images of whole larvae as shown for Piwi knockdown in Fig S9 and some additional support (e.g., qPCR) to demonstrate the shRNAs are actually working.

      Figure S12 - it's not clear what ciliary "defects" are being shown.

      Significance

      Generally, the results are either equivocal or the conclusions are not well supported by the results (as detailed above). The significance of this work to vertebrate neurobiology is somewhat weak. (Especially considering the orthology of these genes to bilaterian SoxB genes is not well supported.) Why not compare these results to other cnidarians - the expression patterns of SoxB1 and SoxB2 in corals and sea anemones seem to differ quite a lot (Shinzato et al 2008; Magie et al 2005), suggesting these genes are almost certainly not behaving in the same way across cnidarians. This is exciting! What's happening in Hydra? Seems like it should be possible to mine the single-cell data set from Siebert et al to test these hypothesized relationships between the Sox genes in another hydrozoan which constantly makes new neurons.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are very grateful to the two referees for their constructive comments and suggestions which have helped improve the quality of our manuscript.

      ------------------------------------------------------------------------------ * Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Ribes et al developed a FACS-based serological assay to detect antibodies against the SARS-CoV-2 spike protein in various hosts. The authors described an assay that is more sensitive and quantitative, allowing the detection of anti-spike antibodies with just a few ul of blood, and highlighted the potential of the assay as an alternative to commercial ELISA-based assays against SARS-CoV-2 spike protein.

      Major concerns *

      1. * On being quantitative analysis - the authors have used 20/130 reference serum from NIBSC as an example in figure 1. How does the RSS of the described assay compare/correlate with the Ab values in WHO standards? This should be included. * Response: We thank the referee for this helpful suggestion, and have now included the information on the IgG BAU in the legend of figure 1, and alluded to the characterisation of the 20/130 by the Expert Committee on Biological Standardization (Mattiuzzo et al., 2020) on lines 410-414 in the main text of the manuscript

      * On sensitivity and specificity - AUC profiles should be performed and included. *

      Response: If the Jurkat-flow test was intended for clinical use, the precise determination

      of the sensitivity and specificity of the test would indeed be absolutely essential. As was already mentioned at the end of the introduction, the Jurkat-S&R-flow test is only destined to be used by research laboratories, for research purposes. This has now also been clarified at the end of the abstract : “Whilst the Jurkat-flow test is ill-suited and not intended for clinical use ….”

      As suggested by the referee, to establish the sensitivity and specificity of a diagnostic test, it is indeed practical to use the Area Under the Receiver Operating Characteristic (ROC) curve (AUC). A ROC curve is a plot of the true positive rate (Sensitivity) in function of the false positive rate (100-Specificity) for different cut-off points of a parameter. Determining properly the sensitivity and specificity of a test thus requires large collections of samples which are known to be either certainly positive or certainly negative, which we did not have access to.

      * Are there any cross-reactivity with the other spike proteins from other CoVs? If so, what is the level of cross-reactivity? *

      Response: To assess cross-reactivity with other CoVs, we would have needed either Jurkat cells expressing the spike proteins from other CoVs, or sera with known reactivity against CoVs. Since we did not have access to such cells or sera, we were not in a position to address such a question.

      * While the authors have showed that the flow-based assay has a more dynamic range, there is insufficient data showing that it is "more sensitive", as stated in the abstract. The authors should reflect this in the text. *

      Response: In the abstract, we do not state that the Jurkat-S&R-flow test is more sensitive than the ELISA, but “at least as sensitive”. On the other hand, we state that it is more sensitive than the HAT test, which it clearly is since there are more than a dozen samples on figure 2 that were positive with either or both ELISA and Jurkat-S&R-flow but were negative by HAT.

      Of note, we have recently described an improved protocol, called HAT-field, which significantly improves the sensitivity of HAT, albeit at the cost of decreased specificity (https://doi.org/10.1101/2022.01.14.22268980)

      * Is trimer or monomer Spike expressed on the surface of the cells? *

      Response: Several studies have shown that, when the spike protein is expressed in human cells after transfection or transduction, it is in its native trimeric form at the cells’ surface and can even cause fusion with cells expressing the ACE2 receptor. This has now been clarified in the introduction section.

      * While there are significant advantages of the flow-based assay, the authors should discuss the limitations of a flow-based assay as a serological assay, especially for sero-surveillance and cohort studies. For instance, HTS application is usually limited for cell-based assays. In addition, while the assay is relatively cheap, it is worth nothing that the cytometer is an expensive equipment that not all laboratories have. *

      Response: We bring the referee’s attention to the fact that those points are discussed at the end of the introduction (line 161-165) : “ Since the Jurkat-flow test calls for the use of both a flow cytometer and cells obtained by tissue culture, it is clearly not destined to be used broadly in a diagnostic context, but its simplicity, modularity, and performances both in terms of sensitivity and quantification capacities should prove very useful for research labs working on characterizing antibody responses directed against SARS-2, both in humans and animal models. “

      Minor concerns*: *

      1. * Figure 1 - text and numbers in the FACS plots are too small. Please adjust. In addition, for some of the FACS plots shown (eg. neg cont and serum 20/130), the population is right at the axis. Please pan the x-axis to allow better visualisation.
      2. Figure 3A - please label axis.
      3. Figure S2 - please label axis.
      4. In general, please check through all figures for axis labels and also adjust the front size. For most, the text is too small. * Response: Sizes of numbers and text increased, and axis labels added in all figures*

      Reviewer #1 (Significance (Required)):

      As already discussed by the authors, there have already been quite a number of studies that have demonstrated the advantages of a flow-based assay for serological analysis for SARS-CoV-2. However, Ribes et al showed a new way to separate out alloreactivity from specific staining, which is important in reducing false positivity in serological assay. As more and more people receive their vaccination, there is a significant interest in immune-monitoring following vaccination. Given the more dynamic range of the flow-based assay, this is one good way to monitor antibody response. *

      Expertise: My research interest focuses on the study of SARS-CoV-2 antibody responses following infection or vaccination. * *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): *

      In this paper, Joly and colleagues make use of a flow cytometry-based assay to measure in a reliable and sensitive manner the presence of IgG, IgA and IgM in blood samples from post-COVID human patients and also from laboratory (mouse and hamster) and domestic animals (dogs and cats). They find that the test is appropriate to detect the presence of humoral immunity in all species tested. The manuscript is clearly written and the Figures are clearly presented. The experiments with rodente deliberately infected with inactivated SARS-CoV-2 shows (Fig. 3) that the method is reliable and able to clearly discriminate positive from negative sera. Interestingly, dogs and cats were sampled from households in which the owners had been found to have passed COVID-19 by PCR. Among this cohort of house animals they find more than 90% seroconversion for dogs and slightly less than 30% of clear seroconversion in cats. We find however that the manuscript would benefit by establishing a clear cut-off value of "Specific Stain" for dogs and cats (Fig. 3). This could be implemented by including data from pre-COVID dog and cat sera or in its defect, sera from those species collected at households in which their owners were vaccinated and did not pass the infection. Another point of criticism that could be resolved is that the channels for flow cytometry in Figure 1 do not seem to be adequately compensated and there is evidence of some cross-contamination between FL1 and FL3. *

      Responses: We thank the referee for bringing our attention to the fact that we had presented the data on sera from cats and dogs in a confusing manner, which led the referee to believe that the sets of samples presented were representative of the population of animals whose owner had tested positive for Covid-19. In fact, for this experiment, which was only ever intended as a preliminary proof of concept that the test could be adapted very simply to companion animals, we used sets of sera which we knew would contain approximately 50 % of positive and 50 % negative samples because they had previously been screened by sero-neutralisation (incidentally, a manuscript by Bessière et al., describing that work on sera from 131 cats and 156 dogs, has very recently been submitted for publication). To prevent possible confusions, we have now reworded the description of this proof of concept experiment, in the legend of figure 3, the text, and the methods section.

      Regarding the question of a clear cut-off value, as when using human samples, we would suggest using a value of 40 for the instruments settings we used, corresponding to an RSS of 20 (i.e. 20 fold the value of the negative control). With such a value, it can be seen that one cat serum would be considered positive whilst showing no neutralising activity, but one dog serum which showed weak neutralising activity would be considered negative. If anything, this example highlights the difficulty in setting a precise cut off value for any biological test.

      Regarding the question of inadequate compensation between channels 1 and 3, this is due to the fact that the Cellquest software does not allow for FL1/FL3 compensation, which is explained in the figure legend (see lines 208-210). We decided to simply draw the gates as they appear on figure 1 because attempts at post-acquisition compensation using the Flowjo software did not give satisfactory results. Incidentally, no compensation is required when samples are acquired on a Fortessa flow cytometer, where mCherry can be excited by a different laser (see figure S1) or if one uses the Jurkat-S&G-flow version of the test as in figure 3D for hamster sera (using Jurkat-GFP as negative control, and secondary antibodies conjugated to Alexa 488).

      Minor points*: *

      *-Figure 1.- Please describe the y- and x-axis. Such as they are is difficult to find out. *

      Response: Done

      * -It would be advisable to mention in Materials and Methods (page 22) how blood was collected from cats and dogs. *

      Response: We thank the referee for highlighting this, and have now provided the information in the relevant method section.

      * -Line 856, page 22, "ad libidum" should be "ad libitum" *

      Response: We thank the referee for spotting this typo, which has been corrected

      * Reviewer #2 (Significance (Required)):

      This is another step in the implementation of flow cytometry tests, instead of ELISA or CLIA serological tests based on the use of recombinant proteins, as a more sensitive and reliable method. The description of the high frequency of human-domestic animal transfer of SARS-CoV-2 will also add to the idea that it is humans who transmit the virus to those animals. *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, Joly and colleagues make use of a flow cytometry-based assay to measure in a reliable and sensitive manner the presence of IgG, IgA and IgM in blood samples from post-COVID human patients and also from laboratory (mouse and hamster) and domestic animals (dogs and cats). They find that the test is appropriate to detect the presence of humoral immunity in all species tested.

      The manuscript is clearly written and the Figures are clearly presented. The experiments with rodente deliberately infected with inactivated SARS-CoV-2 shows (Fig. 3) that the method is reliable and able to clearly discriminate positive from negative sera. Interestingly, dogs and cats were sampled from households in which the owners had been found to have passed COVID-19 by PCR. Among this cohort of house animals they find more than 90% seroconversion for dogs and slightly less than 30% of clear seroconversion in cats.

      We find however that the manuscript would benefit by establishing a clear cut-off value of "Specific Stain" for dogs and cats (Fig. 3). This could be implemented by including data from pre-COVID dog and cat sera or in its defect, sera from those species collected at households in which their owners were vaccinated and did not pass the infection. Another point of criticism that could be resolved is that the channels for flow cytometry in Figure 1 do not seem to be adequately compensated and there is evidence of some cross-contamination between FL1 and FL3.

      Minor points:

      • Figure 1. Please describe the y- and x-axis. Such as they are is difficult to find out.
      • It would be advisable to mention in Materials and Methods (page 22) how blood was collected from cats and dogs.
      • Line 856, page 22, "ad libidum" should be "ad libitum"

      Significance

      This is another step in the implementation of flow cytometry tests, instead of ELISA or CLIA serological tests based on the use of recombinant proteins, as a more sensitive and reliable method. The description of the high frequency of human-domestic animal transfer of SARS-CoV-2 will also add to the idea that it is humans who transmit the virus to those animals.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Ribes et al developed a FACS-based serological assay to detect antibodies against the SARS-CoV-2 spike protein in various hosts. The authors described an assay that is more sensitive and quantitative, allowing the detection of anti-spike antibodies with just a few ul of blood, and highlighted the potential of the assay as an alternative to commercial ELISA-based assays against SARS-CoV-2 spike protein.

      Major concerns:

      1. On being quantitative analysis - the authors have used 20/130 reference serum from NIBSC as an example in figure 1. How does the RSS of the described assay compare/correlate with the Ab values in WHO standards? This should be included.
      2. On sensitivity and specificity - AUC profiles should be performed and included.
      3. Are there any cross-reactivity with the other spike proteins from other CoVs? If so, what is the level of cross-reactivity?
      4. While the authors have showed that the flow-based assay has a more dynamic range, there is insufficient data showing that it is "more sensitive", as stated in the abstract. The authors should reflect this in the text.
      5. Is trimer or monomer Spike expressed on the surface of the cells?
      6. While there are significant advantages of the flow-based assay, the authors should discuss the limitations of a flow-based assay as a serological assay, especially for sero-surveillance and cohort studies. For instance, HTS application is usually limited for cell-based assays. In addition, while the assay is relatively cheap, it is worth nothing that the cytometer is an expensive equipment that not all laboratories have.

      Minor concerns:

      1. Figure 1 - text and numbers in the FACS plots are too small. Please adjust. In addition, for some of the FACS plots shown (eg. neg cont and serum 20/130), the population is right at the axis. Please pan the x-axis to allow better visualisation.
      2. Figure 3A - please label axis.
      3. Figure S2 - please label axis.
      4. In general, please check through all figures for axis labels and also adjust the front size. For most, the text is too small.

      Significance

      As already discussed by the authors, there have already been quite a number of studies that have demonstrated the advantages of a flow-based assay for serological analysis for SARS-CoV-2. However, Ribes et al showed a new way to separate out alloreactivity from specific staining, which is important in reducing false positivity in serological assay. As more and more people receive their vaccination, there is a significant interest in immune-monitoring following vaccination. Given the more dynamic range of the flow-based assay, this is one good way to monitor antibody response.

      My research interest focuses on the study of SARS-CoV-2 antibody responses following infection or vaccination.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: This manuscript documents a very thorough biophysical, structural and functional dissection of interactions between the RNA-binding protein Rrm4 and the endosomal adaptor Upa1 in the filamentous fungus Ustilago maydis. It has been shown previously that the Rrm4-Upa1 interaction is critical for mRNA transport in this system as mRNAs hitchhike on motor-associated endosomes. Here, the authors reveal using modelling that Rrm4 has three MLLE domains, including a cryptic one that had not been identified previously. They then report the crystal structure of MLLE2 and analyze the distribution anf arrangement of the MLLE domains in the protein using SAXS. They then show using pulldowns and isothermal titration calorimetry that MLLE3 is critical for the Upa1 interaction (via the PAM2L domains of Upa1) and that MLLE2 contributes to Rrm4 localization in vivo when the MLLE3-Upa1 interaction is partially impaired. The study suggests that Rrm4 has a platform of MLLE domains for orchestrating Rrm4 function. Overall, this is technically a high quality study. However, a number of points (mostly minor) should be addressed.

      Major comments:

      __A key part of the study if the in vivo work illustrating a role for MLLE2 in regulating Rrm4 localization when the system is sensitized. Some aspects of this part of the work need clarifying.

      a) The authors should show that the abberant staining is indeed microtubule-related with the benomyl experiment that they used in Jankowski et al. 2019. __

      We included this important control in Figure EV5F demonstrating that the aberrant staining is no longer visible after the microtubule inhibitor benomyl treatment

      b) The authors claim from these experiments that MLLE2 contributes to endosomal targeting (as there is ectopic protein on other structures (presumptive microtubules)). However, to make this claim, the authors would need to measure the intensity of the mutant Rrm4 protein on endosomes and/or the colocalization of these Rrm4 variants with endosomes, as they do in other experiments in this paper. Otherwise, it is possible that the MLLE2 deletion has another effect, e.g. increasing protein stability, and thus increasing the likelihood of binding to structures other than endosomes. If available, data on the relative abundance in the cell of the protein expressed from the wild-type control (rrm4-kat) and MLLE2 deletion constructs (e.g. rrm4-m1,2delta-kat) should be provided.

      As indicated by the reviewer, a critical point is identifying a function of MLLE2. Surprisingly, the domain is conserved in evolution, but , we do not see a mutant phenotype under optimal culture conditions. Therefore, we challenged the system and observed the mislocalisation of Rrm4, if the MLLE2 domain is deleted. However, the overall amount of shuttling Rrm4-positive endosomes was not strongly affected according to our kymograph experiments. We observe aberrant staining, which is not seen with the Rrm4 wild-type protein. Thus, under challenging conditions, we do see a function of MLLE2.

      To address the valid point of the reviewer, we quantified the signal intensities in kymographs of the most important Rrm4 variants. As indicated in Figure 5E, we observed that the maximum fluorescence intensity in kymograph signals was reduced when Rrm4 variants are mislocalised to microtubules while the minimum intensities were comparable in all strains. This underlines that a subset of Rrm4 molecules are no longer shuttling through the cell and most likely are attached to microtubules (to prove the involvement of microtubules, we did benomyl treatment which is now shown in Figure EV5F). We also included a Western Blot experiment (Figure EV5G) demonstrating that neither MLLE1 nor MLLE2 deletion impacts the total protein amount of Rrm4. These data support the notion that MLLE2 contributes to endosomal targeting.

      c) Was the data in Figure 5D scored blind of the identity of the samples? Given that the classification has to be done manually, it is important to confirm the phenotypes are robust to blinding (at least for the key comparisons).

      We agree entirely that manual evaluation of microscopic images has to be carried out with utmost care. The phenotype of aberrant microtubule staining is not easily detectable, and it needs an experienced person to quantify this. The data were analyzed by a second experimentalist with experience in evaluating microscopy images to validate the system’s robustness. Notably, the key findings were confirmed in both cases aberrant microtubule staining was only observed when the MLLE domain was mutated. However, the second person reported difficulties in differentiating a bundle of Rrm4 signals or stained microtubules. Therefore, this person quantified higher values with less experience in Rrm4 movement. In essence, we can rely on the key findings. We included the information in the section “Materials and methods” and gave the comparison in Figure EV5H.

      If points b and c are addressed, it should be possible to draw an arrow between the gray question mark protein in Figure 6 and the endosome surface, which is what I assume the authors believe to be case based on their discussion.

      Having addressed both points, we have also improved the model. To this end, we added a second unknown protein component (grey oval with a question mark) that interacts with MLLE2 and the endosomal surface. Thereby the hierarchical order with the accessory role of MLLE2 during endosomal attachment is stressed.

      Minor comments:

      1. The first line of the abstract is quite bold. It is hard to quantify the role of transport vs RNA stability for example, so I suggest this sentence is toned down. Correct, the first line now reads, “Spatiotemporal expression can be achieved by transport and translation of mRNAs at defined subcellular sites”.

      Line 269: change "amount of motile Rrm4-M12delta-Kat positive signals" to "number of motile Rrm4-M12delta-Kat positive signals".

      Changed as mentioned above.

      Figure 3 legend: Insert "Variant" before "amino acids of the FxP and FxxP..." to indicate what is labeled in gray. Change "fond" to "font" in the same sentence.

      Corrected as mentioned above.

      The cartoons of the different protein variants are very helpful but I had problems spotting the Upa1-Pam2L deletions due to the similar gray to the background of the protein. This would perhaps be clearer if the gray used for the background was lighter than it currently is.

      We improved the contrast by reducing the background of Upa1 to a lighter grey tone in all the corresponding figures.

      The residual motility of wild-type Rrm4 when PAM2L1 and PAM2L2 are both mutated (Figure 5C) is reminiscent of what is seen in a complete Upa1 deletion in the group's previous work. It would be helpful to point this out to the reader, as well as the implication that other proteins are contributing to Rrm4's linkage to endosomes. After all, some of these other adaptors might contact MLLE2 of Rrm4.

      We addressed this point by referring to our previous publication with the following sentence: “Comparable to previous reports, we observed residual motility of Rrm4-Kat on shuttling the endosomes if both PAM2L motifs are mutated or if upa1 is deleted. This indicates that additional proteins besides Upa1 are involved in the endosomal attachment of Rrm4 (Pohlmann et al., 2015).”

      Some of the y-axes of the charts should be more descriptive so that the reader can understand the plots even before they consult the legends. For example, in Figure EV4A and EV5D and E, which protein is being to referred to in each 'number of signals' plot should be included. In Figure 5D, 'Hyphae [%]' would be clearer as 'Hyphae with MT staining of Rrm4 [%]'

      We improved this in Figures EV4, 5D and EV5.

      Figure EV5 legend title: this could be misleading as the authors are seeing ectopic MT localization rather than a deficit in microtubule association.

      Corrected to “Deletion of MLLE1Rrm4 and -2 cause aberrant staining of microtubules”.

      Reviewer #1 (Significance (Required)):

      __The Feldbrugge group has previously mapped interactions between Upa1 and Rrm4 (Pohlmann et al., 2015) and some conclusions are corroborated in the paper by Boehm et al. The paper under review is, however, a significant advance due to the identification of the third MLLE domain, detailed biophysical characterization of the interactions, the structural insights, and evidence of a subsidiary role of MLLE2. The work would of course be stronger if the target of MLLE2 had been identified but I think this is beyond the scope of this initial work. To my knowledge, this is one of the most extensive analyses of the interactions mediated by MLLE and PAM domains and will be of interest to others working on these protein features. The work will also appeal to those interested in the links of localizing mRNAs with motor-associated membranes, which is an emerging field.

      Reviewer expertise: I have a long-standing interest in molecular analysis of mRNA trafficking mechanisms. I do not have experience in fungal genetics. __

      **Referee Cross-commenting**

      It seems that we are in agreement that this is solid work and that biochemical and biophysical analysis of the MLLE-PAM interactions will be of significant interest to those working on those domains (or proteins containing those domains). I agree with the comments of the other reviewers and there are clearly some essential minor revisions needed to strengthen the evidence for their conclusions and some clarifications. I think it is a long shot that RNA binding to the RRMs will affect the MLLE-PAM interactions and would require quite a lot of work to show this conclusively. The study would, however, be more impactful if this was shown to be the case, or the target of MLLE2 was found. Nonetheless, I would not say these new avenues of research are necessary to find a home in one of the Review Commons journals.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Devan, Schott-Verdugo et al.

      Summary

      In this study the putative MLLE RNA-binding motifs of the endosomal RNA-binding protein, Rrm4, from Ustilago maydis were examined using structural and genetic analyses. MLLE motifs are conserved in polyA-binding proteins (Pab1/PABPC1) and found also in Rrm4, which was shown to reside on motile endosomes and deliver septin mRNAs for endosome-localized translation during polarized growth. Upa1 on the endosome interacts with Rrm4 via its PAM2L domain that itself interacts with the MLLE domains of proteins like Pab1. Mutations in the known MLLE domain of Rrm4 were earlier shown to affect localization to endosomes. Here, the C-terminal domain of Rrm4 was revealed to have three divergent MLLE motifs using comparative modeling; only two of which were previously predicted. Crystallization and X-ray diffraction analysis of a truncated version of bacterially produced Rrm4, showed MLLE2 is most similar to that of PABPC1 and UBR5, although MLLE1 and 2 are somewhat divergent in the key region of PAM2 binding. Small angle X-ray scattering of recombinant full-length or truncated Rrm4 revealed that the MLLE domains might form a platform that could allow for multiple contacts with different binding partners. In vitro binding studies with different N-terminal GST-tagged versions of the Rrm4 were used to examine for interactions with PAM2 sequences of Upa1 using N-terminal hexa-histidine-SUMO fusions. It was found that Pab1-MLLE interacts with the PAM2, but not PAM2L, domain of Upa1. In contrast, the complete Rrm4 MLLE region (G-Rrm4-NT4) interacted with the PAM2L domain, but not the PAM2 of Upa1. Notably, the interaction with PAM2L required the third MLLE and neither MLLE1 nor MLLE2, nor both. No significant differences in affinity were observed and were similar to that of the Pab1 MLLE. The results also show that the MLLE3 has a higher affinity for the PAM2L2 than PAM2L1 of Upa1.

      To examine the biological role of the Rrm4 MLLEs, U. maydis strains bearing deletions in the domains of Rrm4 were examined for hyphal growth and endosomal transport (latter using Upa1-GFP and Rrm4-mKate2). Only the loss of the MLLE3 domain inhibited polarized growth (as seen with the full deletion of RRM4) and not the deletion of either MLLE1 or 2. Similar results were obtained regarding endosome shuttling. Thus, in line with the biochemical experiments performed the MLLE3 domain alone (of the three identified) is necessary for the biological actions of Rrm4. This suggested the MLLE1 and 2 are not necessary for function under these conditions.

      To examine this further, Upa1 carrying mutations in the PAM2L 1or PAM2L2 domains were examined. It was found that the deletion of both PAM2L domains affected unipolar growth resulting in bipolar growth similar to the deletion of UPA1 alone. This phenotype was observed even upon the deletion of Rrm4 MLLE1 and 2 in the same background as the PAM2L mutants. The mutation of both PAM2L domains led to a reduction in Rrm4-labeled shuttling endosomes, which suggests that these domains help anchor Rrm4 to endosomes. When only the PAM2L1 domain is present in Upa1 there was a larger increase in hyphae with aberrant microtubule staining than upon the loss of PAM2L1. The authors suggest that this indicates PAM2L2 is more important and prescribes an accessory role for MLLE2 in endosome association.

      Comments: Overall, the study seems well conducted. We cannot comment on the structural aspect of the work since this is not our field of expertise. That said, the biochemical and genetic/functional studies appear solid, well thought-out, and clearly presented. No new experiments are necessary to support the general claims of the paper, however, experiments suggested below might make it more revealing with regards to the connection between RNA binding and MLLE-PAM2L interactions (i.e. endosome localization and RNA binding functions).

      1. Line 286 - It reads the they "Next, we investigated the association of Rrm4 -M12D-Kat in strains expressing PAM2L1. Thus, the endosomal attachment was solely dependent on the interaction of MLLE3 with the PAM2L2 sequence of Upa1." Unclear - wouldn't lacking PAM2L1 (and not expressing) fit the logic of the sentence? We corrected this with the sentence, “Next, we investigated the association of Rrm4-M1,2D-Kat in strains expressing Upa1 with mutated PAM2L1”.

      Several questions regarding the specificity of PAM2 vs. PAM2L domains. What happens when you switch/replace the PAM2L1 or 2 of Upa1 with Upa1 PAM2 domains? Are they exclusive? What happens when the MLLE3 of Rrm4 is switched with that of Pab1? And if one does both - does that restore functionality to Rrm4?

      These are very interesting suggestions. Previously, we have shown that a single PAM2L1 or PAM2L2 sequence of Upa1 is sufficient for unipolar growth and recruitment of Rrm4 to endosomes. Please note that Upa1 with mutated PAM2L1 and L2 still contains a PAM2 motif. Furthermore, mutating the PAM2 motif of Upa1 did not affect Rrm4 shuttling or unipolar growth. Thus, switching the domains would mostly address whether the precise location within Upa1 would be important. This is interesting but, unfortunately very labour-intensive and beyond the manuscript’s current scope.

      Switching MLLE3 with MLLE of PAB1 is an interesting approach. One might expect that Rrm4 can be recruited to endosomes again. However, Rrm4 would also interact with numerous other proteins containing PAM2 motifs like deadenylase Not4. Here it would compete with the MLLE of Pab1. Thus, it would be expected that Rrm4 is on the surface, but the protein will be mistargeted to other proteins causing pleiotropic alterations. It will be difficult to judge whether Rrm4 functionality is restored or whether other processes are disturbed. In essence, these are stimulating ideas, but we believe that these experiments are beyond the scope of the current study. In the future, we might address this point by using a heterologous peptide-binding pocket or tethering approach.

      Likewise, what happens if Upa1 only has PAM2L2 instead of only PAM2L1 domains? Does that alter function - perhaps now one can observe a contribution of MLLE1? If it it's there it's likely to have function. Anything known about the post-translational modification of these MLLE or PAM domains? Does it change during unipolar vs. bipolar growth? Perhaps the different MLLE domains are regulated in such a fashion?

      Again also very valid points. Upa1 with two PAM2L2 motifs might interact stronger. The problem is that one PAM2L motif is sufficient for interaction, and we do not see a strong phenotype.

      Currently, we do not know if post-translational modifications regulate the MLLE domains. This could alter the binding affinity or specificity, and by expressing fungal proteins in E. coli, we might have missed this type of regulation. However, we addressed the function of MLLE1 and MLLE2 in U. maydis using a genetic approach. We deleted the corresponding domains and interfered with potential regulation by posttranslational modification. Thus, we cannot exclude post-translational modification, but it appears to be not essential for function. We will address the posttranslational regulation of Rrm4 in more detail in the future.

      Can the authors show whether the binding of mRNA cargo (e.g. Cdc3 mRNA) to the RRM motifs of Rrm4 affects the interaction between any of the MLLE-PAM2L pairs, or vice versa (i.e. does the MLLE-PAM2L interaction affect mRNA binding)?

      In previous studies, we have investigated a version of Rrm4 carrying a mutation in the first RRM motif of Rrm4. According to RNA live imaging, the respective strains exhibit a loss of function phenotype and mRNA transport is strongly affected. However, the endosomal association of Rrm4-mR1-Gfp is not affected, indicating no direct cross-talk between RNA-binding via RRM1 and endosomal attachment via MLLE3. Also, a version of Rrm4 carrying a deletion of all three RRM domains is still shuttling on endosomes. The two functions, i.e. RNA binding and endosomal binding, appears to be carried out by two independent platforms, i.e. three RRMs and three MLLEs, respectively. The overall structure of the protein also reflects this. The RRM domains are structurally clearly separated from the flexible MLLE domains.

      Discussion line 311 It is written that the three MLLE domains "collaborate for optimal functionality..." Perhaps there's a misunderstanding here, but the authors show that MLLE3 domain alone is necessary & sufficient for function, so where is the collaboration? MLLE2 may have an accessory role according to the authors, but we do not know if it is in collaboration with MLLE3 or independent thereof. Since the KD of MLLE3 is not affected by the presence or absence of MLLE1,2 in vitro at least, it may be that they have independent, and not collaborative, roles.

      Correct, we rephrased this more carefully. We omitted the collaboration aspect. It now reads, ”but a sophisticated binding platform consisting of three MLLE domains with MLLE2 and MLLE3 functioning in linking the key RNA transporter to endosomes.”

      Reviewer #2 (Significance (Required)):

      This paper concerns functional domains found in an endosome-localized RNA binding protein, U. maydis Rrm4, which is necessary for localized translation on endosomes and subsequent unipolar growth. Here the authors show using structural, biochemical, and genetic studies that instead of one or two MLLE protein-protein interacting domain in Rrm4 there are three, although one (MLLE3) is necessary and sufficient for full function. This work is for an audience interested in those studying RNA trafficking and its role in cell physiology, which is our expertise. The work is interesting, but it could be made more so especially if a connection was established between the RNA-binding function of the RRM domains and the MLLE-PAM2L interaction(s). At this point it is solid technical work and could be published after minor revisions.

      **Referee Cross-commenting**

      I concur with the comments of the other reviewers in that the work is solid and necessitates minor revisions in order to be published. Clearly, establishing a connection between the RNA-binding function and the MLLE-PAM interactions of Rrm4 would be an interesting and worthy pursuit that might enhance the novelty of the work, but I agree that it could belong to future studies.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      __ Summary: Long-distance subcellular transport of mRNAs is achieved through selective and dynamic interaction with the transport machinery. Using the highly polarized hyphae of Ustilago maydis, the authors previously showed i- that mRNAs can hitchhike on actively transported endosomes for proper distribution, and ii- that the connection between mRNAs and endosomes is mediated by the interaction between a C-terminal MademoiseLLE (MLE) domain of the RNA binding protein Rrm4 and the Upa1 adapter protein. In this study, the authors aimed at more precisely characterizing the structural and molecular bases underlying the Rrm4-Upa1 interaction. Combining structural modeling and X-ray analyses, they discovered a non-canonical, and previously missed, MLE domain (MLE1) in Rrm4, and characterized the structure of the second MLE domains (MLE2) of Rrm4. Through binding assays, they showed that the three MLE domains exhibit different binding properties, and that MLE3 is the only domain capable of binding to the PAM2 domain of Upa1. Consistent with this finding, functional assays performed in U. maydis revealed that MLE3 is the main domain involved in interaction with endosomes and trafficking, MLE1 and 2 having either no or minor functions in this process.

      The manuscript is very-well written, the data are of high quality and clearly presented. A wide range of complementary approaches has been used to molecularly and functionally characterize the different MLE domains of Rrm4. From an "RNA transport" perspective, this manuscript falls short of a main novel findings as the domains characterized in this study (MLE1 and 2) don't have a clear function in connecting mRNAs to the transport machinery. From an "MLE domain" perspective, this work however provides interesting information about non-canonical domains and structures, and about binding and function specificity. As described below, my major concern relates to the role played by the ML2 domain of Rrm4, a role referred to as "accessory" by the authors. __

      __

      Major comments: __

      The authors conclude from their results that ML2 has an accessory role in promoting association with endosomes.

      1- This conclusion is made based on in vivo experiments showing that a form of Rrm4 lacking the M2 domain, in contrast to wild-type Rrm4, aberrantly attached to MTs in a context where the Rrm4-Upa1 interaction mediated by MLE3Rrm4 has been weakened (Upa1-pl2m). Although the results are convincing, their interpretation is less. The authors, indeed, claim that the observed phenotype results from "the static accumulation of Rrm4" due to reduced interaction with endosomes. Why then don't they see a decrease in the motility/transport properties of Rrm4-M2Δ in this context then? Also, do the authors see a decrease in the co-localization of Rrm4-M2Δ with endosomes (which would be expected if the interaction is decreased)? Can the authors perform IP or co-sedimentation experiments to strengthen their hypothesis?

      This is a fair criticism that was also raised by reviewer 1. In the improved version of the manuscript, we now include important control experiments demonstrating that (i) the aberrant localisation is microtubule-dependent (Fig. EV5F) (ii) the mutations do not cause differences in protein amounts of Rrm4 (Fig. EV5G) (iii) the key findings of the aberrant microtubule staining, which were scored manually in microscopic images were verified independently by two persons (Fig. EV5H) and (iv) most importantly, Rrm4 signal intensity is decreased in processive signals of our kymograph analysis (Fig. 5E). We firmly believe that this set of experiments strengthens our conclusion that MLLE2 plays an accessory role in the endosomal attachment (Fig. 6).

      2- Whether MLE2Rrm4 mediates interaction with endosomes through association with Upa1 is unclear, as the binding assays performed in Figure 3 test for association of Rrm4 variants with single isolated domains of Upa1, not with the full-length protein. Assessing the binding of Rrm4-M2Δ variants with Upa1-PL2m would help interpreting the phenotypes described in Figure 5.

      Unfortunately, it is difficult to express full-length Upa1 protein in E. coli due to the presence of extended unstructured regions. To overcome this limitation, we performed yeast two-hybrid experiments with full-length proteins of Rrm4 and Upa1. We were able to recapitulate qualitatively the results observed in vitro using the individual domains.

      Notably, the Rrm4 version carrying a deletion in MLLE1 and MLLE2 interacted with Upa1 versions carrying mutations in PAM2L1 or PAM2L2 (Fig. EV3C), suggesting that both MLLE domains of Rrm4 are dispensable for interaction with Upa1. MLLE3 is sufficient to interact with a single PAM2L sequence of Upa1. This suggests the presence of additional interaction partners for MLLE1 and MLLE2 and is entirely consistent with our genetic and cell biological analysis described in Fig. 5.

      __

      Minor comments: __

      1- The authors have previously characterized the effect of a C-terminal deletion of Rrm4 on Rrm4 motility and binding to Upa1 (Becht et al., 2006; Pohlmann et al., 2015). How their previously-described construct compares to the Rrm4-M3Δ used in this study is unclear (is it the same?).

      It is the identical mutation to allele rrm4GPD from Becht et al. 2006. We indicate the information in the text “(Fig. 4B-C; mutation identical to allele rrm4GPD in Becht et al., 2006).”

      2- page 6, line 141: refer to Fig. 1B rather than Fig. EV1A ?

      We included the reference to Fig. 1B.

      3- page 10, line 274: "Rrm4-Kat was found"

      We corrected this.

      4- page 11, line 286: "in strains expressing Upa1-PAM2L1", replace by "in strains expressing Upa1 with mutated PAM2L1"?

      We corrected this.

      5- The Figures and accompanying legends are overall very clear and detailed. In Figures EV4A and EV5D-E, it would however help if the authors would indicate on the Figure itself, left to each panel which markers/signals is being analyzed (e.g Rrm4-Kat (top) and Upa1-GFP (down) for Figure EV4).

      We clarified this.

      Reviewer #3 (Significance (Required)):

      Active transport of mRNAs along microtubule tracks has been shown to play a key role in the spatio-temporal control of gene expression in various cell types and species. How specific mRNAs mechanistically connect to molecular motors for their transport to their subcellular destination has however for long remained largely unclear. Recent work, including work from the authors, has uncovered that RNAs can hitchhike on membranous organelles through adapter proteins linking mRNAs and RNA binding proteins with trafficking membrane-bound organelles.

      This study aimed at investigating the structural and molecular bases underlying the interaction between RNA binding proteins and endosomes. While their identification and characterization of the MLE1 and MLE2 domains of Rrm4 did not provide significant new insight into the mechanisms involved in the endosome-mediated transport of mRNAs, it uncovered interesting new properties of MLE domains, including structural variations, selective binding and functional specificity. This work should thus be of interest for structural biologists and researchers interested in protein-protein interaction platforms.

      **Referee Cross-commenting**

      Our comments all converge to the idea that this study is solid as it is and requires only minor revision work to support the authors conclusions. Although characterizing further MLE/PAM2 binding specificity and MLE2 interactors would be of great interest and indeed provide a more complete understanding of interaction networks at play, I feel that this is beyond expected revision work.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Long-distance subcellular transport of mRNAs is achieved through selective and dynamic interaction with the transport machinery. Using the highly polarized hyphae of Ustilago maydis, the authors previously showed i- that mRNAs can hitchhike on actively transported endosomes for proper distribution, and ii- that the connection between mRNAs and endosomes is mediated by the interaction between a C-terminal MademoiseLLE (MLE) domain of the RNA binding protein Rrm4 and the Upa1 adapter protein.

      In this study, the authors aimed at more precisely characterizing the structural and molecular bases underlying the Rrm4-Upa1 interaction. Combining structural modeling and X-ray analyses, they discovered a non-canonical, and previously missed, MLE domain (MLE1) in Rrm4, and characterized the structure of the second MLE domains (MLE2) of Rrm4. Through binding assays, they showed that the three MLE domains exhibit different binding properties, and that MLE3 is the only domain capable of binding to the PAM2 domain of Upa1. Consistent with this finding, functional assays performed in U. maydis revealed that MLE3 is the main domain involved in interaction with endosomes and trafficking, MLE1 and 2 having either no or minor functions in this process.

      The manuscript is very-well written, the data are of high quality and clearly presented. A wide range of complementary approaches has been used to molecularly and functionally characterize the different MLE domains of Rrm4. From an "RNA transport" perspective, this manuscript falls short of a main novel findings as the domains characterized in this study (MLE1 and 2) don't have a clear function in connecting mRNAs to the transport machinery. From an "MLE domain" perspective, this work however provides interesting information about non-canonical domains and structures, and about binding and function specificity.

      As described below, my major concern relates to the role played by the ML2 domain of Rrm4, a role referred to as "accessory" by the authors.

      Major comments:

      The authors conclude from their results that ML2 has an accessory role in promoting association with endosomes.

      1- This conclusion is made based on in vivo experiments showing that a form of Rrm4 lacking the M2 domain, in contrast to wild-type Rrm4, aberrantly attached to MTs in a context where the Rrm4-Upa1 interaction mediated by MLE3Rrm4 has been weakened (Upa1-pl2m). Although the results are convincing, their interpretation is less. The authors, indeed, claim that the observed phenotype results from "the static accumulation of Rrm4" due to reduced interaction with endosomes. Why then don't they see a decrease in the motility/transport properties of Rrm4-M2Δ in this context then? Also, do the authors see a decrease in the co-localization of Rrm4-M2Δ with endosomes (which would be expected if the interaction is decreased)? Can the authors perform IP or co-sedimentation experiments to strengthen their hypothesis?

      2- Whether MLE2Rrm4 mediates interaction with endosomes through association with Upa1 is unclear, as the binding assays performed in Figure 3 test for association of Rrm4 variants with single isolated domains of Upa1, not with the full-length protein. Assessing the binding of Rrm4-M2Δ variants with Upa1-PL2m would help interpreting the phenotypes described in Figure 5.

      Minor comments:

      1- The authors have previously characterized the effect of a C-terminal deletion of Rrm4 on Rrm4 motility and binding to Upa1 (Becht et al., 2006; Pohlmann et al., 2015). How their previously-described construct compares to the Rrm4-M3Δ used in this study is unclear (is it the same?).

      2- page 6, line 141: refer to Fig. 1B rather than Fig. EV1A ?

      3- page 10, line 274: "Rrm4-Kat was found"

      4- page 11, line 286: "in strains expressing Upa1-PAM2L1", replace by "in strains expressing Upa1 with mutated PAM2L1"?

      5- The Figures and accompanying legends are overall very clear and detailed. In Figures EV4A and EV5D-E, it would however help if the authors would indicate on the Figure itself, left to each panel which markers/signals is being analyzed (e.g Rrm4-Kat (top) and Upa1-GFP (down) for Figure EV4).

      Significance

      Active transport of mRNAs along microtubule tracks has been shown to play a key role in the spatio-temporal control of gene expression in various cell types and species. How specific mRNAs mechanistically connect to molecular motors for their transport to their subcellular destination has however for long remained largely unclear. Recent work, including work from the authors, has uncovered that RNAs can hitchhike on membranous organelles through adapter proteins linking mRNAs and RNA binding proteins with trafficking membrane-bound organelles.

      This study aimed at investigating the structural and molecular bases underlying the interaction between RNA binding proteins and endosomes. While their identification and characterization of the MLE1 and MLE2 domains of Rrm4 did not provide significant new insight into the mechanisms involved in the endosome-mediated transport of mRNAs, it uncovered interesting new properties of MLE domains, including structural variations, selective binding and functional specificity. This work should thus be of interest for structural biologists and researchers interested in protein-protein interaction platforms.

      Referee Cross-commenting

      Our comments all converge to the idea that this study is solid as it is and requires only minor revision work to support the authors conclusions. Although characterizing further MLE/PAM2 binding specificity and MLE2 interactors would be of great interest and indeed provide a more complete understanding of interaction networks at play, I feel that this is beyond expected revision work.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Devan, Schott-Verdugo et al.

      Summary

      In this study the putative MLLE RNA-binding motifs of the endosomal RNA-binding protein, Rrm4, from Ustilago maydis were examined using structural and genetic analyses. MLLE motifs are conserved in polyA-binding proteins (Pab1/PABPC1) and found also in Rrm4, which was shown to reside on motile endosomes and deliver septin mRNAs for endosome-localized translation during polarized growth. Upa1 on the endosome interacts with Rrm4 via its PAM2L domain that itself interacts with the MLLE domains of proteins like Pab1. Mutations in the known MLLE domain of Rrm4 were earlier shown to affect localization to endosomes.

      Here, the C-terminal domain of Rrm4 was revealed to have three divergent MLLE motifs using comparative modeling; only two of which were previously predicted. Crystallization and X-ray diffraction analysis of a truncated version of bacterially produced Rrm4, showed MLLE2 is most similar to that of PABPC1 and UBR5, although MLLE1 and 2 are somewhat divergent in the key region of PAM2 binding. Small angle X-ray scattering of recombinant full-length or truncated Rrm4 revealed that the MLLE domains might form a platform that could allow for multiple contacts with different binding partners. In vitro binding studies with different N-terminal GST-tagged versions of the Rrm4 were used to examine for interactions with PAM2 sequences of Upa1 using N-terminal hexa-histidine-SUMO fusions. It was found that Pab1-MLLE interacts with the PAM2, but not PAM2L, domain of Upa1. In contrast, the complete Rrm4 MLLE region (G-Rrm4-NT4) interacted with the PAM2L domain, but not the PAM2 of Upa1. Notably, the interaction with PAM2L required the third MLLE and neither MLLE1 nor MLLE2, nor both. No significant differences in affinity were observed and were similar to that of the Pab1 MLLE. The results also show that the MLLE3 has a higher affinity for the PAM2L2 than PAM2L1 of Upa1. To examine the biological role of the Rrm4 MLLEs, U. maydis strains bearing deletions in the domains of Rrm4 were examined for hyphal growth and endosomal transport (latter using Upa1-GFP and Rrm4-mKate2). Only the loss of the MLLE3 domain inhibited polarized growth (as seen with the full deletion of RRM4) and not the deletion of either MLLE1 or 2. Similar results were obtained regarding endosome shuttling. Thus, in line with the biochemical experiments performed the MLLE3 domain alone (of the three identified) is necessary for the biological actions of Rrm4. This suggested the MLLE1 and 2 are not necessary for function under these conditions.

      To examine this further, Upa1 carrying mutations in the PAM2L 1or PAM2L2 domains were examined. It was found that the deletion of both PAM2L domains affected unipolar growth resulting in bipolar growth similar to the deletion of UPA1 alone. This phenotype was observed even upon the deletion of Rrm4 MLLE1 and 2 in the same background as the PAM2L mutants. The mutation of both PAM2L domains led to a reduction in Rrm4-labeled shuttling endosomes, which suggests that these domains help anchor Rrm4 to endosomes. When only the PAM2L1 domain is present in Upa1 there was a larger increase in hyphae with aberrant microtubule staining than upon the loss of PAM2L1. The authors suggest that this indicates PAM2L2 is more important and prescribes an accessory role for MLLE2 in endosome association.

      Comments:

      Overall, the study seems well conducted. We cannot comment on the structural aspect of the work since this is not our field of expertise. That said, the biochemical and genetic/functional studies appear solid, well thought-out, and clearly presented. No new experiments are necessary to support the general claims of the paper, however, experiments suggested below might make it more revealing with regards to the connection between RNA binding and MLLE-PAM2L interactions (i.e. endosome localization and RNA binding functions).

      1. Line 286 - It reads the they "Next, we investigated the association of Rrm4 -M12D-Kat in strains expressing PAM2L1. Thus, the endosomal attachment was solely dependent on the interaction of MLLE3 with the PAM2L2 sequence of Upa1." Unclear - wouldn't lacking PAM2L1 (and not expressing) fit the logic of the sentence?
      2. Several questions regarding the specificity of PAM2 vs. PAM2L domains. What happens when you switch/replace the PAM2L1 or 2 of Upa1 with Upa1 PAM2 domains? Are they exclusive? What happens when the MLLE3 of Rrm4 is switched with that of Pab1? And if one does both - does that restore functionality to Rrm4?
      3. Likewise, what happens if Upa1 only has PAM2L2 instead of only PAM2L1 domains? Does that alter function - perhaps now one can observe a contribution of MLLE1? If it it's there it's likely to have function. Anything known about the post-translational modification of these MLLE or PAM domains? Does it change during unipolar vs. bipolar growth? Perhaps the different MLLE domains are regulated in such a fashion?
      4. Can the authors show whether the binding of mRNA cargo (e.g. Cdc3 mRNA) to the RRM motifs of Rrm4 affects the interaction between any of the MLLE-PAM2L pairs, or vice versa (i.e. does the MLLE-PAM2L interaction affect mRNA binding)?
      5. Discussion line 311 It is written that the three MLLE domains "collaborate for optimal functionality..." Perhaps there's a misunderstanding here, but the authors show that MLLE3 domain alone is necessary & sufficient for function, so where is the collaboration? MLLE2 may have an accessory role according to the authors, but we do not know if it is in collaboration with MLLE3 or independent thereof. Since the KD of MLLE3 is not affected by the presence or absence of MLLE1,2 in vitro at least, it may be that they have independent, and not collaborative, roles.

      Significance

      This paper concerns functional domains found in an endosome-localized RNA binding protein, U. maydis Rrm4, which is necessary for localized translation on endosomes and subsequent unipolar growth. Here the authors show using structural, biochemical, and genetic studies that instead of one or two MLLE protein-protein interacting domain in Rrm4 there are three, although one (MLLE3) is necessary and sufficient for full function. This work is for an audience interested in those studying RNA trafficking and its role in cell physiology, which is our expertise. The work is interesting, but it could be made more so especially if a connection was established between the RNA-binding function of the RRM domains and the MLLE-PAM2L interaction(s). At this point it is solid technical work and could be published after minor revisions.

      Referee Cross-commenting

      I concur with the comments of the other reviewers in that the work is solid and necessitates minor revisions in order to be published. Clearly, establishing a connection between the RNA-binding function and the MLLE-PAM interactions of Rrm4 would be an interesting and worthy pursuit that might enhance the novelty of the work, but I agree that it could belong to future studies.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      This manuscript documents a very thorough biophysical, structural and functional dissection of interactions between the RNA-binding protein Rrm4 and the endosomal adaptor Upa1 in the filamentous fungus Ustilago maydis. It has been shown previously that the Rrm4-Upa1 interaction is critical for mRNA transport in this system as mRNAs hitchhike on motor-associated endosomes. Here, the authors reveal using modelling that Rrm4 has three MLLE domains, including a cryptic one that had not been identified previously. They then report the crystal structure of MLLE2 and analyze the distribution anf arrangement of the MLLE domains in the protein using SAXS. They then show using pulldowns and isothermal titration calorimetry that MLLE3 is critical for the Upa1 interaction (via the PAM2L domains of Upa1) and that MLLE2 contributes to Rrm4 localization in vivo when the MLLE3-Upa1 interaction is partially impaired. The study suggests that Rrm4 has a platform of MLLE domains for orchestrating Rrm4 function. Overall, this is technically a high quality study. However, a number of points (mostly minor) should be addressed.

      Major comments:

      A key part of the study if the in vivo work illustrating a role for MLLE2 in regulating Rrm4 localization when the system is sensitized. Some aspects of this part of the work need clarifying.

      a) The authors should show that the abberant staining is indeed microtubule-related with the benomyl experiment that they used in Jankowski et al. 2019.

      b) The authors claim from these experiments that MLLE2 contributes to endosomal targeting (as there is ectopic protein on other structures (presumptive microtubules)). However, to make this claim, the authors would need to measure the intensity of the mutant Rrm4 protein on endosomes and/or the colocalization of these Rrm4 variants with endosomes, as they do in other experiments in this paper. Otherwise, it is possible that the MLLE2 deletion has another effect, e.g. increasing protein stability, and thus increasing the likelihood of binding to structures other than endosomes. If available, data on the relative abundance in the cell of the protein expressed from the wild-type control (rrm4-kat) and MLLE2 deletion constructs (e.g. rrm4-m1,2delta-kat) should be provided.

      c) Was the data in Figure 5D scored blind of the identity of the samples? Given that the classification has to be done manually, it is important to confirm the phenotypes are robust to blinding (at least for the key comparisons).

      If points b and c are addressed, it should be possible to draw an arrow between the gray question mark protein in Figure 6 and the endosome surface, which is what I assume the authors believe to be case based on their discussion.

      Minor comments:

      1. The first line of the abstract is quite bold. It is hard to quantify the role of transport vs RNA stability for example, so I suggest this sentence is toned down.
      2. Line 269: change "amount of motile Rrm4-M12delta-Kat positive signals" to "number of motile Rrm4-M12delta-Kat positive signals".
      3. Figure 3 legend: Insert "Variant" before "amino acids of the FxP and FxxP..." to indicate what is labeled in gray. Change "fond" to "font" in the same sentence.
      4. The cartoons of the different protein variants are very helpful but I had problems spotting the Upa1-Pam2L deletions due to the similar gray to the background of the protein. This would perhaps be clearer if the gray used for the background was lighter than it currently is.
      5. The residual motility of wild-type Rrm4 when PAM2L1 and PAM2L2 are both mutated (Figure 5C) is reminiscent of what is seen in a complete Upa1 deletion in the group's previous work. It would be helpful to point this out to the reader, as well as the implication that other proteins are contributing to Rrm4's linkage to endosomes. After all, some of these other adaptors might contact MLLE2 of Rrm4.
      6. Some of the y-axes of the charts should be more descriptive so that the reader can understand the plots even before they consult the legends. For example, in Figure EV4A and EV5D and E, which protein is being to referred to in each 'number of signals' plot should be included. In Figure 5D, 'Hyphae [%]' would be clearer as 'Hyphae with MT staining of Rrm4 [%]'
      7. Figure EV5 legend title: this could be misleading as the authors are seeing ectopic MT localization rather than a deficit in microtubule association.

      Significance

      The Feldbrugge group has previously mapped interactions between Upa1 and Rrm4 (Pohlmann et al., 2015) and some conclusions are corroborated in the paper by Boehm et al. The paper under review is, however, a significant advance due to the identification of the third MLLE domain, detailed biophysical characterization of the interactions, the structural insights, and evidence of a subsidiary role of MLLE2. The work would of course be stronger if the target of MLLE2 had been identified but I think this is beyond the scope of this initial work. To my knowledge, this is one of the most extensive analyses of the interactions mediated by MLLE and PAM domains and will be of interest to others working on these protein features. The work will also appeal to those interested in the links of localizing mRNAs with motor-associated membranes, which is an emerging field.

      Reviewer expertise: I have a long-standing interest in molecular analysis of mRNA trafficking mechanisms. I do not have experience in fungal genetics.

      Referee Cross-commenting

      It seems that we are in agreement that this is solid work and that biochemical and biophysical analysis of the MLLE-PAM interactions will be of significant interest to those working on those domains (or proteins containing those domains). I agree with the comments of the other reviewers and there are clearly some essential minor revisions needed to strengthen the evidence for their conclusions and some clarifications. I think it is a long shot that RNA binding to the RRMs will affect the MLLE-PAM interactions and would require quite a lot of work to show this conclusively. The study would, however, be more impactful if this was shown to be the case, or the target of MLLE2 was found. Nonetheless, I would not say these new avenues of research are necessary to find a home in one of the Review Commons journals.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1: General comments:

      Fujimoto and collaborators use Nanopore-based cDNA sequencing for genome-wide transcriptome analysis of a collection of hepatocellular carcinomas (HCCs) and matched normal liver tissues. To improve detection of alternatively spliced isoforms and hybrid transcripts potentially deriving from genomic rearrangements, they develop a dedicated pipeline SPLICE, which they benchmark against available software used for the same analysis. Besides having dual functionality (calls both alternative transcripts and fused transcripts), SPLICE seems to outperform previous software in calling alternative/fused transcripts and accuracy. They use the SPLICE pipeline to call isoforms and gene fusions in normal liver cells and HCCs and perform basic functional validations on novel fusions identified. The manuscript is well written, and the analyses are well performed. Perhaps the benchmarking of the SPLICE pipeline could have been more extensive (i.e., performed on additional independent datasets).

      Major points: 1. Line 149-150: "We compared the results of mapping to the reference genome and the reference transcriptome sequences, and removed candidates if both were inconsistent (removal of mapping errors). " Please specify what "both were inconsistent" means.

      Our reply; Thank you for this comment. The accuracy of fusion gene detection is influenced by mapping errors. To remove possible mapping errors, SPLICE aligned reads to the reference genome and the reference transcriptome sequences and compared the results. If the results are inconsistent (for example, GeneA-GeneB in the reference genome and GeneA-GeneB in the transcriptome genome, or GeneA-GeneB in the reference genome and GeneA in the transcriptome genome), SPLICE considers the candidates as false positive and removes them from the analysis.

                We changed the sentence “We compared the results of mapping to the reference genome and the reference transcriptome sequences, and removed candidates if both were inconsistent (removal of mapping errors).” to “we compared the results of mapping to the reference genome and the reference transcriptome sequences, and removed candidates if both results did not detect same fusion genes (removal of mapping errors).”  (line 150-152).
      
      • Concerning TE-derived novel exons, in principle, this may lead to altered expression of the TE-transcript (as the Authors report for L1-MET) or to altered splicing of the transcript (i.e., other exon/introns could be retained or excluded). Can the Authors assess whether the inclusion of the TE in a transcript enhances its expression or affects the splicing of the "parental" transcript? If so, can they verify if the position of the insertion of the TE has any effect on expression and splicing?*

      Our reply; Thank you very much for this important comment. As the reviewer mentioned, exonization of TE may affect the splicing patterns and gene expression levels of transcripts. To determine the effect of TE on expression levels, we compared the expression levels of transcripts with TE-derived novel exons with those of known transcripts of the gene. We found that the expression levels of transcripts with TE-derived novel exon were lower than those of known transcripts (Figure 1 in the reply). Since the same results were observed in all novel transcripts (Fig. 1E,F), most TE exonization would not affect the expression level of transcripts.

                We then analyzed the effects of TE in the splicing change, we compared the numbers of novel splicing junctions between transcripts with TE-derived novel exons and other transcripts in each gene. The proportions of genes with novel splicing junctions were not significantly different between the transcripts with TE-derived novel exons and others (transcripts with TE-derived novel exons; 9.1% and others; 11.9%)  (Figure 2 in the reply). As observed in L1-*MET* and L2-*RHR1*, transposons can affect expression levels and structures of transcripts, however, their effect would be limited to a part of genes.
      

      Figure 1

      Comparison of expression levels of transcripts with TE-derived novel exon and known transcripts. Only transcripts derived from genes with TE-derived novel exons were compared. The total number of transcripts is shown below the plot. Transcript abundance was measured in reads per million reads (RPM), and log10 converted values for RPM were shown in the violinplot. P-values were calculated by Wilcoxon rank-sum test.

      Figure 2

      Comparison of the percentage of novel splicing junction in transcripts with novel TE-derived exon and other transcripts. The total number of genes are shown below the plot. Transcripts with TE-derived novel exons and other transcripts were compared. P-value was calculated by Fisher’s exact test.

      • Can the Authors explain why the NBEAL1-RPL12 was not detected by SPLICE?*

      Our reply; Thank you for this comment. Although NBEAL1-RPL12 fusion was detected by SPLICE, mapping results to the reference genome and the reference transcriptome were inconsistent and removed from the final result. AsNBEAL1-RPL12 was not validated by PCR (Supplemental Fig. S4B) (line 183-184), we consider that this fusion-gene is a false positive, and filtering of SPLICE successfully removed false-positive fusions.

      • Line 332: Can the Authors explain how the total amount of HVB mRNA was determined in each sample? Is it a relative amount calculated from the sequencing data? If so, it should be made clear in the text that this is a fractional measure.*

      Our reply; Thank you very much for this comment. Expression levels were calculated by log10 converted reads per million reads (log10(RPM)) for each sample. We added the following sentences to the "Expression from HBV" subsection in the Results (line 337-338); “Expression levels were estimated by log10 converted support reads per million reads (log10(RPM)) for each sample.”.

      • Fig4a: please specify if the y-axis "number of support reads" reports library normalized values.*

      Our reply; Thank you for this comment. The values of the y-axis are row read counts. We added the following sentences to the Figure legend (line 348); “Y-axis shows the total number of support reads (raw counts).”.

      • HCCs have more HBV-human genome fusion transcripts than normal liver. Could the authors clarify if these HCC transcripts are selectively found in tumors? or whether they are also expressed in normal liver samples? The paragraph starting from line 356 is confusing, and it is difficult to retrieve the above information for both HBs and HBx fusions.*

      Our reply; We apologize for the confusing description. All HBV-human genome fusion transcripts were selectively expressed in tumor or normal liver. We added the following sentence to the "Expression from HBV" subsection in the Results (line 365-366); “All of these HBV-human genome fusion transcripts were selectively expressed in the HCCs and the livers.”.

      • Figure 4C: what was the control used to calculate the relative viability in these analyses?*

      Our reply; Thank you for this comment. Fig. 4C shows the number of HBV-human fusion transcripts in the six categories. If this comment refers to Fig. 4H, cell lines transfected with the empty vector (pIRES2-AcGFP1-Nuc) was used as controls. This has been described in the "Gene overexpression" subsection of Methods (line 716-717).

      • MYT1L: the Authors report the identification of a novel MYT1L transcript downregulated in HCC, and argue it may have a potential tumor-suppressive function. For the sake of clarity, it will be advisable to show also the differential expression (HCC vs. Liver) of the other transcripts expressed from the same locus.*

      Our reply; Thank you for this important comment. In HCCs and normal livers, only the novel MYT1L transcript was expressed from this locus, and no known transcript of MYT1L was expressed. We changed the sentence “In the MYT1Lgene, a highly-conserved novel exon was detected (Fig. 2E), and this transcript was significantly down-regulated in the HCCs” to “In the MYT1L gene, a highly-conserved novel exon was detected (Fig. 2E), and only a transcript with the novel exon was expressed.” (line 471-472).

      • *

      Minor points: 1. Table S4: there is a typo, correct “secific” in “specific”

      Our reply; Thank you very much for this comment. We corrected the typo of Table S4.

      • *

      • *

      *Reviewer #2: General comments:

      Summary: This is both a presentation of a pipeline for analysis of Nanopore RNA-seq data, as well as an analysis of a cohort of 44 hepatocellular carcinomas against matched-normal liver tissue. It presents a number of quite intriguing results from the long-read RNA analysis, and suggests potential new targets for study in HCC. It is also worth noting that the current version of guppy (6) has functionality to detect primer sequences in the middle of reads and split those reads, which may obviate one of the steps in SPLICE.*

      *Major comments:

      1) The work done in this study used data that was basecalled using guppy 3.0.3. Since that version, I am aware of at least two major upgrades to the base caller accuracy, which would likely also improve the accuracy of isoform resolution. Given that the data is relatively low-coverage and that you have an automated workflow for the analysis, I would recommend re-basecalling using an updated basecaller and re-running your analysis using that. This is especially important given your comments in the paper about splice site misalignment.*

      Our reply; Thank you very much for this important comment. We performed basecalling of a sequence data of MCF7 using the latest guppy v6.0.6 and compared the result with that by guppy v3.0.3. We randomly extracted 1M reads from MCF-7 reads that passed qscore filtering in guppy basecaller. The same reads were extracted and basecalled by guppy v3.0.3. These two data were analyzed by SPLICE.

      The average error rate was 4.6 % for v6.0.6 and 6.8 % for v3.0.3. The number of transcripts was 9,674 for v6.0.6 and 9,329 for v3.0.3. Of these, the number of novel transcripts was 446 and 410, respectively. The number of fusion genes was 2 (BCAS3-BCAS4, and BCAS3-ATXN7) by v6.0.6 and one (BCAS3-BCAS4) by v3.0.3. As the reviewer mentioned, we found that using the latest version of guppy improved the accuracy and detected a larger number of transcripts.

      We added the results to Supplemental Table S12. We also changed the sentences from “Second, our analysis removed the change of splicing sites within 5 bp to remove alignment errors (Fig. 1B). We consider that this cutoff value is necessary due to currently available high-error reads (S____upplemental Data S____2). However, sequencing technologies and basecallers are improving, and in the near future, we should be able to use a smaller cutoff value and identify larger numbers of splicing changes.” to “Second, the accuracy of the analysis depends on the sequencing error rate. Although several filters are used for currently available high-error reads (Fig. 1B and ____Supplemental____ Fig. S1), sequencing errors would affect the accuracy of the result. Sequencing technologies and basecallers are improving, and in the near future, we should be able to identify larger numbers of splicing changes with high accuracy (Supplemental Table S10).” (line 538-542).

      2) You have compared your software to another tool for isoform analysis on Nanopore sequencing data, TALON. But a number of other tools exist for this purpose, including stringtie2, flair and bambu. My own testing has shown that stringtie2 outperforms TALON in terms of concordance with Illumina RNA-seq. It is quite important that you perform a complete comparison of your software to the state of the art for this purpose.

      Our reply; Thank you very much for this important comment. We compared our tool with four tools (TALON, FLAIR, StringTie, and bambu). For this comparison, we used sequence data of MCF-7 and HCC (RK107C). We randomly extracted 1 M reads from MCF-7 and HCC (RK107C) sequence data using Seqtk (v1.3) (params: sample -s1 1000000). Reads were mapped to the reference genome sequence (hg38) with minimap2 (v2.17) (params: -ax splice --MD), and the output SAM files were converted to BAM files and sorted with samtools (v1.7) (Li et al. 2009).

      For benchmarking of TALON (v5.0), we corrected aligned reads with TranscriptClean (v2.0.3) (Wyman and Mortazavi 2018). Next, we ran the talon_label_reads module to flagging reads for internal priming (params: --ar 20). TALON database was initialized by running the talon_initialize_database module (params: --l o --5p 500 --3p 300). Then, we ran the talon module to annotate the reads (params: --cov 0.8 --identity 0.8). To output transcript abundance, we first obtained a whitelist using the talon_filter_transcripts module (params: --maxFracA 0.5 --minCount 5), and then quantified transcripts using the talon_abundance module based on the whitelist. For FLAIR (v1.5), the sorted BAM file was converted to BED12 using bin/bam2Bed12.py. We then corrected misaligned splice sites with the flair-correct module. High-confidence isoforms were defined from the corrected reads using the flair-collapse module (params: -s 3 --generate_map). For benchmarking of StringTie (v2.2.1), Stringtie was performed with input files consisting of long-read alignment and reference annotation (params: -L -c 3). For benchmarking of bambu (v2.0.0), Bambu was performed with input files consisting of long-read alignment, reference annotation and reference genome (hg38) (params: min.readCount = 3). Candidates with low expression levels (support reads As a result, SPLICE identified the third-highest number of transcripts followed by FLAIR and StringTie (Supplemental Fig. S3A). In MCF-7 the concordance rate with IsoSeq MCF-7 transcriptome data was the highest in SPLICE for known transcripts and the second highest in SPLICE for novel transcripts (Supplemental Fig. S3B). These results indicate that SPLICE has sufficient accuracy for analyzing transcript aberrations.

      We added the text to the "Comparison of SPLICE method with other tools" subsection of the Results (line 165-177) and the "Benchmarking" subsection of the Methods (line 640-679). We added the results to Supplemental Fig. S3.

      3) Likewise, for fusion detection, you compare to LongGF. You should also compare to (and cite) JAFFAL.

      Our reply; Thank you very much for this important comment. We compared our tool with the two tools (LongGF and JAFFAL). We used 1 M reads randomly extracted from MCF-7 and HCC (RK107C) sequence data as described above.

                For benchmarking of LongGF (v0.1.2), reads were mapped to the reference genome sequence (hg38) with minimap2 (v2.17) (params: -ax splice --MD), and the output SAM files were converted to BAM files and sorted with samtools (v1.7). We then ran the *longgf* module and obtained the list of fusion genes (params: min-overlap-len 100 bin_size 50 min-map-len 200 pseudogene 0 secondary_alignment 0 min_sup_read 3). For benchmarking of JAFFAL (v2.2), we ran the *JAFFAL.groovy* module with zipped fastq files.
      
                In this comparison, close gene pairs (We added the text to the "Comparison of SPLICE method with other tools" subsection in the Results (line 178-186) and the "Benchmarking" subsection in the Methods (line 667-679). We showed the results in Supplemental Fig. 4.
      

      4) In terms of the source code, I have questions. Why did you use BASH to run the Python code, instead of making this into a Python package? Why did you not use the functionality already available in BioPython for a number of basic sequence data handling tasks? Why is there not even a single function defined anywhere, let alone classes?

      At some level, if it works, it works. But I have serious concerns about the long-term maintainability of the code in its current state.

      Our reply; Thank you very much for this critical comment. As the reviewer mentioned, we think it is better to make a python package and use BioPython for maintenance and long-term maintainability of the code. We have been building our analysis pipeline by trial and error, and at this stage, the current scripts are convenient for us (our group may need to learn software development). We provided a Docker package (see the reply to comment 5)), and this would promote usability.

      5) Also related to the code, it is generally the standard now to create a BioConda package or Docker container for a bioinformatics package. BioConda has the advantage that the BioContainers project automatically generate Docker and Singularity containers from it. Please provide one of these.

      Our reply; Thank you very much for this critical comment. We made a Docker file and provided it from our github page. It is available from the "Installation and usage via Docker" section.

      6) There is some quite nice functional validation work done on some of the DE transcripts that would have been hidden in a gene-level analysis. There is also some nice work on detecting HBV fusion genes. These both contain important results which are not mentioned at all in the abstract. I feel like the abstract as it stands is selling the paper short.

      Our reply; Thank you very much for this important comment. We added the following sentences to the abstract; “Comparison of expression levels identified 9,933 differentially expressed transcripts (DETs) in 4,744 genes. Interestingly, 746 genes with DETs, including LINE1-MET transcript, were not found by the gene-level analysis. We also found that fusion transcripts of transposable elements and hepatitis B virus (HBV) were overexpressed in HCCs. In vitro experiments on DETs showed that LINE1-MET and HBV-human transposable elements promoted cell growth.”.

      7) Fig 5C shows a Venn diagram of fusions detected by short-read vs long-read sequencing, in which there is quite low overlap between these. You make the statement in the paper that "a combination of short- and long-reads can detect more fusion genes". I find it more likely that the short-read ICGC data had much greater depth of coverage than the MinION data you produced, which allowed for the detection of fusions that were expressed at much lower levels. This could be easily tested by downsampling the ICGC data to the same amount of sequence data as was generated on the MinION, and re-creating the Venn diagram with the fusions detected that way.

      Our reply; Thank you very much for this very important comment. We compared the amount of data between our long-reads and the previous short-reads. However, the amounts of data were not quite different (Supplemental Fig. S14A). Therefore, differences in depth are not likely to be the cause of the low overlap. We considered that two possibilities could explain the low overlap. First, most of the fusion genes missed by short-read were very low expression levels, less than 1 reads per million reads (RPM) (Supplemental Fig. S14B), therefore, there are many fusion-genes with low expression levels, and they are difficult to be detected. Second, 28.9 % of transcripts in long-reads lacked 5' region (Supplemental Fig. S5 and Supplemental Fig. S14C,D). Therefore fusion-genes whose breakpoints are located in the 5' region were difficult to detect by long-read.

      We added the following sentences to the "Fusion genes" subsection in the Results (line 400-405); “We considered that two possibilities could explain the low overlap. Since the most of the fusion genes missed by short-reads had very low expression levels (Supplemental Fig. S14B), many fusion-genes with low expression levels would be missed by a single approach. In addition, 28.9 % of transcripts in long-reads lacked 5' region (Supplemental Fig. S5 and Supplemental Fig. S14C, D). Therefore fusion-genes whose breakpoints are located in the 5' region would be difficult to detect by long-read.”. We also added a figure on the amount of data to Supplemental Information (Supplemental Fig. S14A).

      8) Figure 5D is very interesting. What do you conclude from that result? Please comment in the manuscript.

      Our reply; Thank you very much for this important comment. We used samples that used for whole-genome sequencing in our previous study. Therefore, a list of SVs is available. We classified fusion-gene to these supported by SVs (SV detected fusion-genes) and others (no SV detected fusion-genes), and compared the expression levels of them (Figure 5D).

      Whole-genome sequencing can accurately identify clonal (high frequency) SVs, however, would miss sub-clonal (low frequency) SVs. Therefore, we considered that no SV detected fusion-genes were generated by sub-clonal SVs. This result suggests that there are a lot of sub-clonal fusion genes, and their expression levels are lower than clonal fusion genes. Although the functional importance of sub-clonal fusion genes is currently unknown, deeper RNA sequencing would detect a larger number of fusion genes.

                We added the following sentences to the “Fusion genes” subsection in the Results (line 410-412); “This result suggests that there are a lot of sub-clonal fusion genes, and their expression levels are lower than clonal fusion genes. Although the functional importance of sub-clonal fusion genes is currently unknown, deeper RNA sequencing would detect a larger number of fusion genes.”.
      

      *Minor comments:

      1) The manuscript has many small errors in English grammar, spelling and style. I would strongly recommend sending it for copy editing before submitting it to a journal.*

      Our reply; Thank you very much for this comment. Due to the limitation of time, the current version has not been proofread by a native-English speaker. We are planning to review English grammar by a native-English speaker.

      2) Neither the results section nor the methods section describing the sequencing that was performed specify whether it was done on a MinION or PromethION (or flongle). While this is implied elsewhere in the paper, it should definitely be specified in the methods at a minimum.

      Our reply; Thank you for this comment. We used a MinION for sequencing. We added the following sentences to the Method section (line 579-580); “Libraries were sequenced on a SpotON FlowCell MKⅠ(R9.4) (Oxford Nanopore), using the MinION sequencer (Oxford Nanopore)”.

      3) You also write in the introduction that your method, SPLICE, was developed for the MinION specifically. Please comment on its applicability to data generated on the PromethION and flongle Nanopore sequencers.

      Our reply; Thank you very much for this comment. We consider that our method is applicable to data from MinION, PromethION, and flongle. We added the following sentence to the Methods section (line 592-593); “In the present study, we analyzed sequence data from MinION. We consider that our method is applicable to data from MinION, PromethION, and flongle.”.

      4) The volcano plot in Fig 3A is missing its dots.

      Our reply; Thank you very much for this comment. We modified the Fig. 3A.

      *Reviewer #3: General comments:

      Summary: In this manuscript, Kiyose et al have developed and tested a novel methodology for identifying splicing alterations, and fusions, from full-length transcript or long read sequencing data. They apply this approach to liver cancer and paired, non-cancerous liver tissue from a prior publication, and use wet-lab/experimental methods to validate their in silico findings. They conclude that their new methodology, SPLICE, outperforms one existing method, and is uniquely suitable to identifying fusion genes.*

      Major Comments: 1) Figure 1B shows a schematic of common error patterns from MinION cDNA sequencing, and the text of the manuscript describes how the authors' new approach (SPLICE), overcomes several of these, e.g. sequencing errors, artificial chimeras, and mapping errors of highly homologous genes. However, there is a fundamental disconnect between the text and the graphic in Figure 1B. This should either be revised for clarity, or an additional graphic or flowchart placed in the supplementary materials to clearly show *how* SPLICE overcomes each of these limitations.

      Our reply; We apologize for the insufficient explanation in Figure 1. We showed a detailed explanation of the data analysis procedure in Supplemental Fig. S1.

      2) Why was TALON the only alternative approach chosen for validation of SPLICE performance? There are a number of other, more advanced pipelines such as SUPPA2, and IsoformSwitchAnalyzeR. It would strengthen the manuscript, and its conclusions, to incorporate at least one of these methods as a second comparator. This is particularly true for IsoformSwitchAnalyzeR, since Kiyose et al identify a number of differentially expressed transcripts (DETs) for genes that are not differentially expressed.

      Our reply; Thank you very much for this important comment. Another reviewer also requested additional benchmarking, therefore we performed an additional performance comparison for the revised manuscript. As SUPPA2 and IsoformSwichAnalyzeR are used to analyze the annotated output GTF files, and direct comparison with SPLICE is difficult. Since IsoformSwichAnalyzeR recommends StringTie as an annotation soft, we compared using StringTie instead.

      We compared the performance of SPLICE with that of four other methods (TALON, FLAIR, StringTie and Bambu) for splicing variant detection. SPLICE identified the third-highest number of transcripts followed by FLAIR and StringTie (Supplemental Fig. S3A). In MCF-7 the concordance rate with IsoSeq MCF-7 transcriptome data was the highest in SPLICE for known transcripts and the second highest in SPLICE for novel transcripts (Supplemental Fig. S3B).

      We added the text to the "Comparison of SPLICE method with other tools" subsection of the Results (line 165-177) and the "Benchmarking" subsection of the Methods (line 640-665). We added the results to Supplemental Fig. 3.

      3) The Venn diagram in Figure 5C appears to show that conventional short read sequencing identifies 46 fusion genes that are not also detected by long read sequencing. However, this result, and its implications are never addressed in the text.

      Our reply; Thank you very much for this important comment. We apologize for the insufficient explanation. We considered that two possibilities could explain the low overlap. First, most of the fusion genes missed by short-read were very low expression levels, less than 1 reads per million reads (RPM) (Supplemental Fig. S14B), therefore these are many fusion-gene with low expression level and they are difficult to be detected. Second, 28.9 % of transcripts in long-reads lacked 5' region (Supplemental Fig. S5 and Supplemental Fig. S14C,D). Therefore fusion-genes whose breakpoints are located in the 5' region were difficult to detect by long-read.

                We added the following sentences to the "Fusion genes" subsection in the Results (line 400-405); “We considered that two possibilities could explain the low overlap. The most of the fusion genes missed by short-reads had very low expression levels (Supplemental Fig. S14B). This result suggests that there are many missed fusion-genes with low expression levels. In addition, 28.9 % of transcripts in long-reads lacked 5' region (Supplemental Fig. S5 and Supplemental Fig. S14C, D). Therefore fusion-genes whose breakpoints are located in the 5' region would be difficult to detect by long-read.”. We also added a figure on the amount of data to Supplemental Information (Supplemental Fig. S14A).
      

      Minor Comments: 1) On pages 20-21, the language used to describe the HBV and/or HCV postive vs negative materials is very confusing. Please clarify that by "HBV- and HCV-related tissues" you in fact mean "HBV-and HCV-infected samples."

      Our reply; We apologize for the confusing wording. We converted "HBV and HCV-related tissues" to " HBV and HCV-infected samples" in the manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Kiyose et al have developed and tested a novel methodology for identifying splicing alterations, and fusions, from full-length transcript or long read sequencing data. They apply this approach to liver cancer and paired, non-cancerous liver tissue from a prior publication, and use wet-lab/experimental methods to validate their in silico findings. They conclude that their new methodology, SPLICE, outperforms one existing method, and is uniquely suitable to identifying fusion genes.

      Major Comments:

      1. Figure 1B shows a schematic of common error patterns from MinION cDNA sequencing, and the text of the manuscript describes how the authors' new approach (SPLICE), overcomes several of these, e.g. sequencing errors, artificial chimeras, and mapping errors of highly homologous genes. However, there is a fundamental disconnect between the text and the graphic in Figure 1B. This should either be revised for clarity, or an additional graphic or flowchart placed in the supplementary materials to clearly show how SPLICE overcomes each of these limitations.
      2. Why was TALON the only alternative approach chosen for validation of SPLICE performance? There are a number of other, more advanced pipelines such as SUPPA2, and IsoformSwitchAnalyzeR. It would strengthen the manuscript, and its conclusions, to incorporate at least one of these methods as a second comparator. This is particularly true for IsoformSwitchAnalyzeR, since Kiyose et al identify a number of differentially expressed transcripts (DETs) for genes that are not differentially expressed.
      3. The Venn diagram in Figure 5C appears to show that conventional short read sequencing identifies 46 fusion genes that are not also detected by long read sequencing. However, this result, and its implications are never addressed in the text.

      Minor Comments:

      1. On pages 20-21, the language used to describe the HBV and/or HCV postive vs negative materials is very confusing. Please clarify that by "HBV- and HCV-related tissues" you in fact mean "HBV-and HCV-infected samples."

      Significance

      There is somewhat strong significance to this advance. As promising as long read, full-transcript sequencing is for the field, current limitations such as its high error rate have limited applicability, and most of the current analytic pipelines require complementary short read RNA sequencing to be performed in parallel for error correction. The authors assert that SPLICE overcomes these limitations, and to some extent demonstrates this. As a predominantly wet-lab experimentalist in the area of RNA processing, I have the relevant expertise to most rigorously assess the downstream impacts of findings from pipelines such as SPLICE, e.g. the validation experiments shown in the latter portion of the manuscript. These are uniformly strong. Where I was challenged some is in the authors' explanations of how and why SPLICE's specific design, as an algorithm, overcomes the known limitations in current analytic pipelines for long-read sequencing.

      Referees cross-commenting

      I concur with Reviewer 2. I think the 3 of us were broadly enthusiastic, yet raised some of the same concerns. In my view, these concerns should be able to be readily addressed by the authors.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      This is both a presentation of a pipeline for analysis of Nanopore RNA-seq data, as well as an analysis of a cohort of 44 hepatocellular carcinomas against matched-normal liver tissue. It presents a number of quite intriguing results from the long-read RNA analysis, and suggests potential new targets for study in HCC. It is also worth noting that the current version of guppy (6) has functionality to detect primer sequences in the middle of reads and split those reads, which may obviate one of the steps in SPLICE.

      Major comments:

      1. The work done in this study used data that was basecalled using guppy 3.0.3. Since that version, I am aware of at least two major upgrades to the base caller accuracy, which would likely also improve the accuracy of isoform resolution. Given that the data is relatively low-coverage and that you have an automated workflow for the analysis, I would recommend re-basecalling using an updated basecaller and re-running your analysis using that. This is especially important given your comments in the paper about splice site misalignment.
      2. You have compared your software to another tool for isoform analysis on Nanopore sequencing data, TALON. But a number of other tools exist for this purpose, including stringtie2, flair and bambu. My own testing has shown that stringtie2 outperforms TALON in terms of concordance with Illumina RNA-seq. It is quite important that you perform a complete comparison of your software to the state of the art for this purpose.
      3. Likewise, for fusion detection, you compare to LongGF. You should also compare to (and cite) JAFFAL.
      4. In terms of the source code, I have questions. Why did you use BASH to run the Python code, instead of making this into a Python package? Why did you not use the functionality already available in BioPython for a number of basic sequence data handling tasks? Why is there not even a single function defined anywhere, let alone classes?

      At some level, if it works, it works. But I have serious concerns about the long-term maintainability of the code in its current state. 5. Also related to the code, it is generally the standard now to create a BioConda package or Docker container for a bioinformatics package. BioConda has the advantage that the BioContainers project automatically generate Docker and Singularity containers from it. Please provide one of these. 6. There is some quite nice functional validation work done on some of the DE transcripts that would have been hidden in a gene-level analysis. There is also some nice work on detecting HBV fusion genes. These both contain important results which are not mentioned at all in the abstract. I feel like the abstract as it stands is selling the paper short. 7. Fig 5C shows a Venn diagram of fusions detected by short-read vs long-read sequencing, in which there is quite low overlap between these. You make the statement in the paper that "a combination of short- and long-reads can detect more fusion genes". I find it more likely that the short-read ICGC data had much greater depth of coverage than the MinION data you produced, which allowed for the detection of fusions that were expressed at much lower levels. This could be easily tested by downsampling the ICGC data to the same amount of sequence data as was generated on the MinION, and re-creating the Venn diagram with the fusions detected that way. 8. Figure 5D is very interesting. What do you conclude from that result? Please comment in the manuscript.

      Minor comments:

      1. The manuscript has many small errors in English grammar, spelling and style. I would strongly recommend sending it for copy editing before submitting it to a journal.
      2. Neither the results section nor the methods section describing the sequencing that was performed specify whether it was done on a MinION or PromethION (or flongle). While this is implied elsewhere in the paper, it should definitely be specified in the methods at a minimum.
      3. You also write in the introduction that your method, SPLICE, was developed for the MinION specifically. Please comment on its applicability to data generated on the PromethION and flongle Nanopore sequencers.
      4. The volcano plot in Fig 3A is missing its dots.

      Significance

      Nature and significance of the advance: The paper presents several exciting advances in terms of tumour biology. The authors demonstrate how alternative splicing can drive liver cancer, while being undetectable by short-read sequencing. They also show a large number of fusion transcripts that were validated by RT-PCR but were undetectable with short-read sequencing. The analysis method they present, SPLICE, contains a number of smaller advances, but raises major concerns about its capacity to act as a maintainable piece of bioinformatics software.

      Comparison to existing published knowledge: The authors compare the software they present to a single tool in the same class for the two functions it performs (isoform analysis and fusion detection). A more thorough comparison to a broader range of available tools would be better.

      In terms of biology, the authors extensively cite related literature to place their discoveries in context.

      Audience: Cancer researchers, anyone interested in doing isoform-level differential expression analysis or gene fusion detection using Nanopore RNA-seq data.

      My expertise: I am a staff scientist working on developing and testing tools for Nanopore sequencing analysis at a cancer research centre.

      Referees cross-commenting

      I fully agree with all of the comments by the other two reviewers.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Fujimoto and collaborators use Nanopore-based cDNA sequencing for genome-wide transcriptome analysis of a collection of hepatocellular carcinomas (HCCs) and matched normal liver tissues. To improve detection of alternatively spliced isoforms and hybrid transcripts potentially deriving from genomic rearrangements, they develop a dedicated pipeline SPLICE, which they benchmark against available software used for the same analysis. Besides having dual functionality (calls both alternative transcripts and fused transcripts), SPLICE seems to outperform previous software in calling alternative/fused transcripts and accuracy. They use the SPLICE pipeline to call isoforms and gene fusions in normal liver cells and HCCs and perform basic functional validations on novel fusions identified. The manuscript is well written, and the analyses are well performed. Perhaps the benchmarking of the SPLICE pipeline could have been more extensive (i.e., performed on additional independent datasets).

      Major points:

      1. Line 149-150: "We compared the results of mapping to the reference genome and the reference transcriptome sequences, and removed candidates if both were inconsistent (removal of mapping errors). " Please specify what "both were inconsistent" means.
      2. Concerning TE-derived novel exons, in principle, this may lead to altered expression of the TE-transcript (as the Authors report for L1-MET) or to altered splicing of the transcript (i.e., other exon/introns could be retained or excluded). Can the Authors assess whether the inclusion of the TE in a transcript enhances its expression or affects the splicing of the "parental" transcript? If so, can they verify if the position of the insertion of the TE has any effect on expression and splicing?
      3. Can the Authors explain why the NBEAL1-RPL12 was not detected by SPLICE?
      4. Line 332: Can the Authors explain how the total amount of HVB mRNA was determined in each sample? Is it a relative amount calculated from the sequencing data? If so, it should be made clear in the text that this is a fractional measure.
      5. Fig4a: please specify if the y-axis "number of support reads" reports library normalized values.
      6. HCCs have more HBV-human genome fusion transcripts than normal liver. Could the authors clarify if these HCC transcripts are selectively found in tumors? or whether they are also expressed in normal liver samples? The paragraph starting from line 356 is confusing, and it is difficult to retrieve the above information for both HBs and HBx fusions.
      7. Figure 4C: what was the control used to calculate the relative viability in these analyses?
      8. MYT1L: the Authors report the identification of a novel MYT1L transcript downregulated in HCC, and argue it may have a potential tumor-suppressive function. For the sake of clarity, it will be advisable to show also the differential expression (HCC vs. Liver) of the other transcripts expressed from the same locus.

      Minor points:

      1. Table S4: there is a typo, correct "secific" in "specific"

      Significance

      The Authors show that applying long-reads sequencing to the study of the transcriptome, combined with their improved in-house analyses pipeline, leads to the identification of novel transcripts, which are alternative splicing isoforms and transcripts originating from novel gene fusions with potential oncogenic function. This provides a proof of principle study which show the advantages of long-reads sequencing and offers a solid data for further mechanistic studies on liver cancer.

      Referees cross-commenting

      I also agree with the other reviewers. All the concerns expressed by the reviewers seem addressable in a reasonable timeframe.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      Overall we were elated to have received such positive comments on the manuscript, with requests for only minor changes. We have made all suggested changes to clarify or tone down the language as suggested.

      We would like to thank each of the three reviewers for their assessment of our work. We note that all three reviewers agreed the phylogenetic analysis was interesting and convincing. Two of the three reviewers felt the study sufficiently demonstrated roles for Baramicin in the nervous system. We have responded to comments from Reviewer 2 to draw attention to some aspects of the data that they may have been overlooked, which we hope reassures them that our proposal of BaraB and BaraC involvement in the nervous system is robust, coming from different approaches that show consistent results.

      Reviewer 1 and Reviewer 3 compliment the study as being very worthwhile, and for suggesting concrete routes for how an AMP evolved non-immune functions. Both compliment its comprehensiveness, and describe the study as having striking findings that should have broad appeal to audiences interested in the crosstalk between the nervous system and the innate immune system.

      2. Point-by-point description of the revisions

      In the revised manuscript file, we have highlighted all text where changes were made.


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The authors provide convincing evidence for an evolutionary scenario in which duplications of an AMP gene with ancestral immune function led to paralogs specialist for neural functions. They focus on the Baramicin genes, coding for major Toll signalling targets in the context of antifungal defence. Their study uses infection experiments in several Drosophila species, a careful annotation of the Baramicin genes of D. melanogaster, the demonstration of neural expression of BaraB and BaraC, the KD analysis of Bara B revealing lethality and neurological phenotypes, a reconstruction of the evolutionary history of Baramicn genes in Drosophilids and an analysis of the sequence evolution of the IM24 domain providing the neural functions. In general the paper is well written. There are a few places in the manuscript where the language can be improved and one point, which needs clarification: - ine 297: ...,which did not present with... - line 314/315: ...to just 14% that of...to 63% that of - line 459: ..., we this motif... - line 518: What does "... genomic relatedness (by speciation and locus)..." mean? - line 527/528: ...drive behaviour or disease through interactions... - line 532: ... ancestrally encodes distinct peptides involved with either the nervous system or the immune response... line 535: ...with either the nervous system (IM24) or.... Do the data provide enough evidence suggesting that IM24 had a neural function in the ancestor? Ideally the authors should look at neural expression of the Baramicin gene in the ourgroup, S. lebanonensis. The authors later (line571) admit, that they cannot rule out that IM24 is also antimicrobial.

      We thank reviewer #1 for drawing attention to these points. We have made changes to each line to be more concise, clarify our meaning, or fix typos.

      Reviewer #1 (Significance (Required)):

      This is a very comprehensive study, which, to my knowledge for the first time, suggests concrete routes of how an AMP evolved non-immune functions. One of the striking findings of this paper is that duplications and subsequent truncations of the ancestral Baramicin locus linked to specialisation for neural functions occurred independently in different Drosophila lineages.

      We thank reviewer #1 for their very positive comments. We also agree with all suggested changes, including more careful phrasing to emphasize that we have not described a mechanism, just an involvement in the nervous system. For instance, see lines 556-568 are reworked to soften language and explicitly state the ancestral function of IM24 is unknown, and our suggestion that IM24 could underlie Dmel\BaraA interactions with the nervous system is speculation that should be tested.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Hanson and Lemaitre present a genomic and phylogenetic characterization of the Baramicin family of antimicrobial peptide genes in different species. They discover new Baramicin paralogs, united by the presence of an IM24 domain at the N-terminus. They show that among Baramicins, those that are not inducible by infection (which they improperly call non-immune since a protein can be non-inducible by infection and have very important immune functions), are truncated. They propose that an ancestor peptide with immune functions evolved into a neuronal regulator/effector via truncation.

      Although the hypothesis is interesting, the data do not really support it. This manuscript is rather descriptive at this point. The demonstration that IM24 is necessary for neural function is very tenuous. For example, in the paragraphs titled Dmel\BaraB is required in the nervous system during development and Baramicin B plays an important role in the nervous system, I did not find convincing data demonstrating that BaraB is required in the nervous system. The only data that links BaraB to the nervous system is a weak locomotion defect observed in the BaraB mutant. But how many genes, when inactivated, give a locomotion defect? This remains totally unexplained at the molecular level. The authors also mentioned that BaraB is expressed in a subset of mechanosensory neuron cells in the wing. What is the link between this expression and the nubbin phenotype? The authors also mention that data in the literature indicate that BaraC is expressed in glial cells but also in other tissues. Finally, we have no idea what role, if any, these peptides have in the nervous system.

      While the characterization of the Baramicin gene family and its evolution across species is convincing, the link between these AMPs and the nervous system is really too preliminary to be convincing. The manuscript would greatly benefit from being more concise.

      Reviewer #2 (Significance (Required)):

      see above

      We thank reviewer #2 for their fair assessment. We have made edits to soften our phrasing, and to emphasize that we have not described a mechanism, just an involvement, in the nervous system.

      Examples:

      line 270: “integral development role” -> “important for development”

      line 277: “Baramicin B plays an important role in the nervous system“ -> “Baramicin B suppression in the nervous system mimics mutant phenotypes”

      line 532: “Here we demonstrate that the Baramicin antimicrobial peptide gene of Drosophila ancestrally encodes distinct peptides involved with either the nervous system or the immune response.“ -> “Here we demonstrate that the Baramicin antimicrobial peptide gene of Drosophila ancestrally encodes distinct peptides that may interact with either the nervous system (IM24) or invading pathogens (IM10-like, IM22).”

      line 562 new text: “Thus while our results suggest that IM24 of different Baramicin genes might underlie Baramicin interactions with the nervous system, we cannot exclude the possibility that IM24 is also antimicrobial, or even that antimicrobial activity is IM24’s ancestral purpose. Future studies could use tagged IM24 transgenes or synthetic peptides to determine the host binding partner(s) of secreted IM24 from the immune-induced Dmel\BaraA, and/or to see if IM24 binds to microbial membranes.”

      We have also changed all instances of “non-immune Baramicins” to “Baramicins lacking immune induction” or something to that effect (e.g. new Lines 25,464, 469,478-82).

      We also made some small changes to be more concise (e.g. line 387, 447, cut lines 492-495 from previous version, cut lines 506-507 from previous version).

      We have responded below in the reviewer-to-reviewer comments for a few of the specific points raised there, which we hope further assuage some of Reviewer 2’s concerns.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Antimicrobial peptides are main effectors in (insect) immune defenses. It is becoming more and more clear, that AMPs can have pleiotropic effects or even acquire new functions. In the present paper, the authors investigate Baramicin, an antifungal AMP that they described first in publication last year. Here they show that in Drosophila melanogaster Baramicin A, which they described before, has paralogs, that are not immune-inducible. They then show that these paralogs, named BarB and BarC, which are truncated versions of BarA, are expressed in the head and neural tissues. That they have neural functions is supported by targeted gene-silencing experiments. They go on to show, using a comparative approach across Drosophila, that Baramicin A with its antimicrobial function constitutes the ancestral state. Moreover, Baramicin is also enriched in head samples of some of the other Drosophila species they study. This manuscript, which according to the acknowledgements has already been seen by reviewers, is in a very good shape.

      I have only a number of minor points, that might help to clarify the presentation.

      Lines 34-36: I would delete this sentence and replace it with a statement based on the main findings of the manuscript

      We now conclude the abstract with “As many AMP genes encode polypeptides, a full understanding of how immune effectors interact with the nervous system will require consideration of all their peptide products.”

      Lines 56-60. May be tone down a bit. Anti-inflammatory activities of AMPs have been known for a long time. I think the next paragraph makes a very good case what is already known and is hence a nice motivation for the current study.

      Toned down. This part now reads: “However AMPs and AMP-like genes in many species have recently been implicated in non-immune roles in flies, nematodes, and humans, suggesting non-immune functions might help explain AMP evolutionary patterns.”

      Line 125: classical instead of classically

      done

      Line 200: what is a 'novel' time course? I would just describe what has been done.

      Now reads: “We next measured Baramicin expression over development from egg to adult.”

      Line 268: hypomorph, I guess in the literature usually hypomorphic is used.

      done

      Line 279: I would suggest to tone this headline down. This is not a criticism of the paper, but the actual mechanisms of the roles in the nervous system are not studied here.

      Done. Now reads: “Baramicin B suppression in the nervous system mimics mutant phenotypes”

      Line 505: what does not really become clear is whether IM24 plays an important role in the nervous system of fly species that only have BarA.

      Edits from lines 556-568 now help highlight this question.

      Line 540-549. This comparison I find a bit far-fetched, or maybe it needs clarification how doublesex expression is related to Baramicins.

      Being completely honest: the doublesex discussion was requested during previous review at another journal. We agree that it is a bit of a tangent, and so we have removed these sentences.

      Line 584-585. I think that this has been known for much longer from studies in frogs and beetles.

      Our use of “in vivo” might have been a bit squishy here. We have edited this to reflect endogenous loss-of-function study, rather than simply “in vivo,” to clarify our intended sentiment.

      Reviewer #3 (Significance (Required)):

      Overall, I think that this is a very worthwhile and convincing story about the evolution AMPs and how they can acquire new functions. All the main statements are supported by careful experiments and data analysis. The paper does not go into any detail, of how the neurological role of BarB and BarC is achieved, but I think this is beyond the scope of the current manuscript. In short, this is a very worthwhile contribution to the growing literature of the role of AMPs in the nervous system. The authors provide the context of the main published papers in the area in the introduction. As opposed to most papers on this so far, the current manuscript also provides very interesting data on the evolutionary history of the Baramicin genes, both within the main study species, and within other Drosophila species. This paper should appeal to a rather broad audience of researchers interested in innate defenses, AMPs and the crosstalk between the nervous system and the innate immune system.

      My background is insect immunology with a focus on AMPs and evolutionary approach.

      We thank reviewer #3 for their very positive comments. We agree with all suggested changes.

      **Referees cross-commenting**

      This session contains the comments of all reviewers

      Reviewer 3

      Reviewer 2 and I share the view, that the evidence for the effects of BarB and C on the nervous system is rather limited. But I still think, that the paper provides enough new and interesting data that make it a very useful contribution. Though not a neurobiologist, I would assume that providing functional evidence for the role of BarA and B in the nervous system would justify a paper on its own. I agree though, that the relevant sections should be toned down.

      Reviewer 2

      As I mentioned in my review, I found the genomic and phylogenetic analysis interesting and convincing. I therefore totally agréé with reviewers 2 and 3 on that. Whether BarA and B are playing a role in the nervous system and how it does remain speculative. BaraB mutants show locomotion defects. But mutants in mitochondrial genes have locomotion defects. Can we conclude that mitochondria play a role in the nervous system? If I understand correctly, downregulating Bara in neurons only (With Elav-Gal4 driver) does not show the locomotion phenotype. it induces early lethality. How many genes when inactivated in neurons will give rise to such a phenotype? A lot. I really think that the implication of Bara in the nervous system should be seriously toned done and more presented as an hypothesis than a validated fact.

      We would like to note for Reviewer 2 here that it is specifically elav> BaraB-IR that results in lethality, and in weaker gene silencing experiments, adult elav>BaraB-IR flies emerge, and they do suffer locomotor defects. Often, they got stuck in the food shortly after emerging, or would move haphazardly (which was common in flies with nubbin-like wings). We have added explicit mention that elav>BaraB-IR also results in locomotor defects (Line 288-289).

      Our private speculation is that the reason flies fail to emerge from their pupae is because they are so uncoordinated that they sometimes cannot wriggle out of the pupal case before their cuticle hardens. In some instances, both using mutants and RNAi, we observed fully developed adults with mature abdominal pigmentation that died trapped inside their pupal cases.

      We’d also like to emphasize here that despite testing many other Gal4 drivers, including mef2-Gal4 (muscle/myocytes), nubbin-like wings and lethality were only found using elav-Gal4. A role interacting with mitochondria would likely have been revealed using mef2-Gal4, given the importance of mitochondrial function in muscle.

      For BaraC: expression in other tissues (like the rectal pad) could nevertheless be from e.g. nerves innervating the muscles controlling the sphincter. Or it could indeed be entirely unrelated to the nervous system. However we feel the nearly perfect overlap with Repo-expressing cells is a strong argument for a neural role. We also made an effort using RNAi to validate this pattern suggested by scRNAseq, which confirmed a strong knockdown of BaraC-IR with Repo-Gal4 (Fig. 3, Fig. S4).

      We hope these comments clarify for Reviewer 2 why we feel confident in proposing a role for Baramicins in the nervous system, even if we do not investigate a mechanism in this study.

      Reviewer 1

      I agree with reviewer 3 that the main message of the paper providing a concrete scenario of how non-immune functions of AMPs may evolve is an important contribution. A deep investigation of the neural function is definitely going beyond the scope of the paper. Indeed this might be quite tricky. But it would help if the authors could clarify their idea about the ancestral condition. Is there the possibility that IM24 had ancestrally already non-immune function? They are not really clear about this point.

      Reviewer 2

      I agree with the other reviewers that determining the exact role of Bara peptides could be complicated. I just ask that the authors limit themselves to proposing that the peptides have lost their immune function. I stress that this argument is not very strong. It relies solely on the lack of inducibility of these peptides following infection. I still think that the demonstration of the role of Bara in the nervous system is not provided.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Antimicrobial peptides are main effectors in (insect) immune defenses. It is becoming more and more clear, that AMPs can have pleiotropic effects or even acquire new functions. In the present paper, the authors investigate Baramicin, an antifungal AMP that they described first in publication last year. Here they show that in Drosophila melanogaster Baramicin A, which they described before, has paralogs, that are not immune-inducible. They then show that these paralogs, named BarB and BarC, which are truncated versions of BarA, are expressed in the head and neural tissues. That they have neural functions is supported by targeted gene-silencing experiments. They go on to show, using a comparative approach across Drosophila, that Baramicin A with its antimicrobial function constitutes the ancestral state. Moreover, Baramicin is also enriched in head samples of some of the other Drosophila species they study. This manuscript, which according to the acknowledgements has already been seen by reviewers, is in a very good shape.

      I have only a number of minor points, that might help to clarify the presentation.

      Lines 34-36: I would delete this sentence and replace it with a statement based on the main findings of the manuscript

      Lines 56-60. May be tone down a bit. Anti-inflammatory activities of AMPs have been known for a long time. I think the next paragraph makes a very good case what is already known and is hence a nice motivation for the current study.

      Line 125: classical instead of classically

      Line 200: what is a 'novel' time course? I would just describe what has been done.

      Line 268: hypomorph, I guess in the literature usually hypomorphic is used.

      Line 279: I would suggest to tone this headline down. This is not a criticism of the paper, but the actual mechanisms of the roles in the nervous system are not studied here.

      Line 505: what does not really become clear is whether IM24 plays an important role in the nervous system of fly species that only have BarA.

      Line 540-549. This comparison I find a bit far-fetched, or maybe it needs clarification how doublesex expression is related to Baramicins.

      Line 584-585. I think that this has been known for much longer from studies in frogs and beetles.

      Significance

      Overall, I think that this is a very worthwhile and convincing story about the evolution AMPs and how they can acquire new functions. All the main statements are supported by careful experiments and data analysis. The paper does not go into any detail, of how the neurological role of BarB and BarC is achieved, but I think this is beyond the scope of the current manuscript.

      In short, this is a very worthwhile contribution to the growing literature of the role of AMPs in the nervous system. The authors provide the context of the main published papers in the area in the introduction. As opposed to most papers on this so far, the current manuscript also provides very interesting data on the evolutionary history of the Baramicin genes, both within the main study species, and within other Drosophila species.

      This paper should appeal to a rather broad audience of researchers interested in innate defenses, AMPs and the crosstalk between the nervous system and the innate immune system.

      My background is insect immunology with a focus on AMPs and evolutionary approach.

      Referees cross-commenting

      This session contains the comments of all reviewers

      Reviewer 3

      Reviewer 2 and I share the view, that the evidence for the effects of BarB and C on the nervous system is rather limited. But I still think, that the paper provides enough new and interesting data that make it a very useful contribution. Though not a neurobiologist, I would assume that providing functional evidence for the role of BarA and B in the nervous system would justify a paper on its own. I agree though, that the relevant sections should be toned down.

      Reviewer 2

      As I mentioned in my review, I found the genomic and phylogenetic analysis interesting and convincing. I therefore totally agréé with reviewers 2 and 3 on that. Whether BarA and B are playing a role in the nervous system and how it does remain speculative. BaraB mutants show locomotion defects. But mutants in mitochondrial genes have locomotion defects. Can we conclude that mitochondria play a role in the nervous system? If I understand correctly, downregulating Bara in neurons only (With Elav-Gal4 driver) does not show the locomotion phenotype. it induces early lethality. How many genes when inactivated in neurons will give rise to such a phenotype? A lot. I really think that the implication of Bara in the nervous system should be seriously toned done and more presented as an hypothesis than a validated fact.

      Reviewer 1

      I agree with reviewer 3 that the main message of the paper providing a concrete scenario of how non-immune functions of AMPs may evolve is an important contribution. A deep investigation of the neural function is definitely going beyond the scope of the paper. Indeed this might be quite tricky. But it would help if the authors could clarify their idea about the ancestral condition. Is there the possibility that IM24 had ancestrally already non-immune function? They are not really clear about this point.

      Reviewer 2

      I agree with the other reviewers that determining the exact role of Bara peptides could be complicated. I just ask that the authors limit themselves to proposing that the peptides have lost their immune function. I stress that this argument is not very strong. It relies solely on the lack of inducibility of these peptides following infection. I still think that the demonstration of the role of Bara in the nervous system is not provided.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Hanson and Lemaitre present a genomic and phylogenetic characterization of the Baramicin family of antimicrobial peptide genes in different species. They discover new Baramicin paralogs, united by the presence of an IM24 domain at the N-terminus. They show that among Baramicins, those that are not inducible by infection (which they improperly call non-immune since a protein can be non-inducible by infection and have very important immune functions), are truncated. They propose that an ancestor peptide with immune functions evolved into a neuronal regulator/effector via truncation.

      Although the hypothesis is interesting, the data do not really support it. This manuscript is rather descriptive at this point. The demonstration that IM24 is necessary for neural function is very tenuous. For example, in the paragraphs titled Dmel\BaraB is required in the nervous system during development and Baramicin B plays an important role in the nervous system, I did not find convincing data demonstrating that BaraB is required in the nervous system. The only data that links BaraB to the nervous system is a weak locomotion defect observed in the BaraB mutant. But how many genes, when inactivated, give a locomotion defect? This remains totally unexplained at the molecular level. The authors also mentioned that BaraB is expressed in a subset of mechanosensory neuron cells in the wing. What is the link between this expression and the nubbin phenotype?

      The authors also mention that data in the literature indicate that BaraC is expressed in glial cells but also in other tissues.

      Finally, we have no idea what role, if any, these peptides have in the nervous system.

      While the characterization of the Baramicin gene family and its evolution across species is convincing, the link between these AMPs and the nervous system is really too preliminary to be convincing. The manuscript would greatly benefit from being more concise.

      Significance

      see above

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors provide convincing evidence for an evolutionary scenario in which duplications of an AMP gene with ancestral immune function led to paralogs specialist for neural functions. They focus on the Baramicin genes, coding for major Toll signalling targets in the context of antifungal defence. Their study uses infection experiments in several Drosophila species, a careful annotation of the Baramicin genes of D. melanogaster, the demonstration of neural expression of BaraB and BaraC, the KD analysis of Bara B revealing lethality and neurological phenotypes, a reconstruction of the evolutionary history of Baramicn genes in Drosophilids and an analysis of the sequence evolution of the IM24 domain providing the neural functions. In general the paper is well written. There are a few places in the manuscript where the language can be improved and one point, which needs clarification:

      • line 297: ...,which did not present with...
      • line 314/315: ...to just 14% that of...to 63% that of
      • line 459: ..., we this motif...
      • line 518: What does "... genomic relatedness (by speciation and locus)..." mean?
      • line 527/528: ...drive behaviour or disease through interactions...
      • line 532: ... ancestrally encodes distinct peptides involved with either the nervous system or the immune response... line 535: ...with either the nervous system (IM24) or.... Do the data provide enough evidence suggesting that IM24 had a neural function in the ancestor? Ideally the authors should look at neural expression of the Baramicin gene in the ourgroup, S. lebanonensis. The authors later (line571) admit, that they cannot rule out that IM24 is also antimicrobial.

      Significance

      This is a very comprehensive study, which, to my knowledge for the first time, suggests concrete routes of how an AMP evolved non-immune functions.<br /> One of the striking findings of this paper is that duplications and subsequent truncations of the ancestral Baramicin locus linked to specialisation for neural functions occurred independently in different Drosophila lineages.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for carefully reading our manuscript. We found their comments to be incredibly thoughtful and constructive and greatly appreciate their feedback. We are confident that addressing the reviewers’ concerns has strengthened our manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): Camuglia, Chanet and Martin investigate the mechanisms that control cell division orientation in vivo, using the mitotic domains (MDs) in the head of the Drosophila embryo as their main model system. They find that cells in the head mitotic domains rotate and align their spindles within 30 degress of the anterior-posterior axis of the embryo. The Pins protein, implicated in spindle orientation in other systems, is planar polarized in mitotic cells. Pins polarization precedes spindle rotation and is correlated with the division angle (but cell shape is not, violating Hertwig's rule). Overexpression of myristoylated Pins results in uniform Pins distribution on the membrane and affects spindle orientation. alpha-catenin RNAi (but not canoe RNAi) disrupts Pins polarity and spindle orientation in MDs 1, 3 and 5. Low dose CytoD injections (which should disrupt force transmission) also result in defective Pins polarity and spindle orientations. Finally, mechanical isolation by laser ablation also disrupts spindle orienttion. The authors find that preventing mesoderm invagination by snail dsRNA disrupts Pins polarity and spindle orientation in the head. MAJOR 1. Is there a certain chirality in the rotation of the spindles? From Movie 1, it seems like in MDs 1 and 3 at least, a majority of spindles on the right side of the embryo rotate clockwise, while spindles on the left side rotate counter-clockwise? Is that so, and in that case, are there geometric/molecular considerations that could explain that chirality?

      We thank the reviewer for pointing this out. They are correct in that there is a tilt to the spindle orientation relative to the AP axis. To illustrate this tilt, we performed our spindle analysis separately on the right and left sides of MD1 and found that spindles on the left side align with an average division angle of about 30from the AP axis whereas spindles on the right side align with an average division angle of -30from the AP axis. To determine whether spindles on either side rotated with a certain chirality, we found there was no preference in rotating clockwise or counterclockwise on the left and right sides (on the left side of MD1 53% of measured spindles rotated counterclockwise and 47% rotated clockwise, on the right side 46% rotated counterclockwise and 54% clockwise). We have added this data as Fig. 1I-J and discussed in the Results lines 134-145.

      1. The authors are experts in mesoderm invagination, and understandably concentrate on the role that forces from that process may have in the orientation of head MD divisions. However, the cephalic furrow forms much closer to the head MDs, and in an orientation that might also explain the alignment of spindles in the head. Is cephalic furrow formation important for Pins polarity and spindle orientation in the head MDs?

      This was certainly a possibility, but our experimental results strongly argues that mesoderm invagination is most relevant.

      1) Perturbing the ventral furrow (e.g. by Snail depletion) does not block the cephalic furrow (Vincent et al., 1997; Leptin and Grunewald, 1990), but does block mesoderm invagination. Snail depletion strikingly disrupted spindle orientation and Pins localization, which suggests mesoderm is most important.

      2) In addition, depletion of -catenin blocks ventral furrow invagination but not cephalic furrow formation. We see a disruption in spindle orientation and Pins localization in -catenin RNAi, which suggests cephalic furrow itself cannot orient spindles.

      3) Furthermore, light sheet imaging of the Drosophila embryo has shown that the head region of the embryo undergoes tissue movement in the direction of the cell division and that this is associated with mesoderm invagination (Streichan et al., 2018; Stern et al., 2022).

      See movies here: https://www.youtube.com/watch?v=kC11Upr30JY

      To further test the importance of mesoderm invagination, we will perform additional ablation experiments trying to disrupt forces transmitted to the mitotic domains from distinct directions. Once we get this experimental result we will include language in the Discussion that will summarize the experimental results and the weight of the evidence for the roles of either ventral or cephalic furrow.

      1. Does expression of myristoylated Pins affect mesoderm invagination (or cephalic furrow formation)? From Table S1 it seems that a maternal Gal4 driver was used to express myristoylated Pins, which could affect other tissues in the embryo. So it is in principle possible that effects of myristoylated Pins on mesoderm internalization/cephalic furrow formation could affect cell division orientation much like sna loss of function does, but in a mechanism that does not depend on Pins polarity. There is definitely an effect on mesoderm invagination in alpha-catenin RNAi (but not in canoe RNAi) embryos, so I wonder if the effect could be consistently through defects in mesoderm invagination (or cephalic furrow formation), and Pins polarity is really dispensable for spindle orientation. Are there head-specific Gal4 drivers that could be used to drive myristoylated Pins exclusively in the head?

      We apologize that we did not clarify this in the text. Maternal overexpression of myr-Pins does not obviously disrupt mesoderm internalization/cephalic furrow formation. But, we do see that targeted disruption of mesoderm internalization via a Snail depletion affects the orientation of division. Note that our paper demonstrates the effect of force transmission on Pins polarity and division orientation, which is new and the main conclusion. The role of these divisions in morphogenesis is more complicated and is beyond the scope of this study.

      In response to this comment we: 1) added language in the Results that states that gastrulation proceeds in myr-Pins expressing embryos (lines 206-208), 2) Added to the Discussion of the role of these oriented divisions to morphogenesis (lines 443-449), and 3) will add a figure showing ventral furrow and cephalic furrow formation in embryos ectopically expressing the myr-Pins.

      1. Related to the previous point, does mechanical isolation by laser ablation (Figure 6I-N) affect Pins polarity? This experiment could alleviate some of my concerns above, as it certainly does not (should not?) disrupt neither mesoderm invagination nor cephalic furrow formation.

      We agree that it would be useful to look at Pins polarity in laser ablated embryos. Currently, we have been unable to analyze Pins polarity after laser ablation, because the ablation to fully isolate the mitotic domain has bleached our Pins::GFP signal. Also, we have shown that Pins polarity is disrupted by 1) alpha-catenin-RNAi, 2) low dose CytoD injection, and 3) Snail depletion, all of which are expected to disrupt force generation and transmission through tissues.

      In response to the reviewer comment, we will determine if Pins::GFP can be analyzed in less aggressive (directional) laser ablations. Again, remember that myr-Pins does not affect mesoderm internalization and that Snail depletion affects Pins polarity.

      MINOR 1. Figure S5: I am a bit confused about the role of Toll 2, 6, 8 in orienting spindle orientation. In Figure S5D it seems that dsRNA treatment against these genes does not disrupt spindle orientation, but Figure S5F shows quite a significant (p=0.0057) effect in triple mutants. The authors favor the idea that Toll receptors do not affect spindle orientation, but the difference with the mutant should be addressed. Furthermore, what happens in MDs 3, 5 and 14 (if the germband extension defect does not affect those divisions)? Is there a difference between dsRNA and triple mutant embryos in these other MDs?

      We think this is a great point. We stated in the text that TLRs are not solely responsible (line 247) for spindle orientation as they do not recapitulate the random pattern of division seen in the myr-Pins expression condition. We acknowledge the differences between the dsRNA injection and TLR triple mutant in the manuscript (lines 242-247), but our data show a greater importance for the role of force transmission. We favor the idea that other mechanisms contribute to spindle orientation because of the small effect of mutating all three Tolls and the dramatic effects of depleting AJs, inhibiting actin (with CytoD), laser ablation, and blocking mesoderm invagination. The planned laser ablation experiments (described above) will also contribute to addressing this point.

      1. No statistical analysis is provided for any of the differences in polarity between Pins and Gap43, and this should be done to demonstrate the significance of the polarization of Pins. Also, particularly for MD14, they should compare anterior vs. posterior polarity, as based on the images in Figure 2H it is not clear that there is a difference between the anterior and posterior side of cells.

      We thank the reviewer for this point. We have added the statistical comparison.

      1. Figure 2A-D: the authors propose that Pins localizes preferentially to the posterior end of cells (instead of both anterior and posterior ends) in MDs 1, 3 and 14 (and anterior in MD 5). How is the asymmetry in the distribution of Pins along the AP axis accomplished, and is there any significance to it? This should be discussed in a bit more detail (currently no potential mechanisms provided in the discussion, just an acknowledgment of the question).

      __We agree the localization of Pins to the posterior end of cells in MDs 1, 3, and 14 and anterior end in MD 5 is of great interest. The details and further mechanism of this preferential localization are beyond the scope of this paper, but we have added an acknowledgment of the question and discuss possible models that could explain the result (lines 458-460). __TYPOS 1. Line 49: "one daughter cells" should be "one daughter cell". 2. Line 193: "rotation. (Figure 3E-F)." should be "rotation (Figure 3E-F)." 3. Lines 232-237: please review. 4. Line 238: "epithelia cells" should be "epithelial cells".

      We thank the reviewers for carefully reading our manuscript. We have fixed the typos mentioned.

      Reviewer #1 (Significance (Required)): This is the first study to my knowledge that demonstrates the role of mechanical forces in polarizing Pins, and provides a nice model to further investigate how mechanical forces generated in one tissue may affect cell division orientation in distant ones. The paper is clear, well written, and quantitative analysis is present for most results. I have some issues with the statistics (or lack thereof) for a couple of results, and potential alternative interpretations for some experiments that in my opinion should be addressed prior to publication. Specifically, it is not clear to me if Pins polarity is at all necessary for spindle orientation in any of the examined MDs.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): Overview: In this manuscript, Camuglia et al. show Pins/LGN, which is understood to drive spindle orientation, can localize asymmetrically (with respect to the tissue plane) in the Drosophila embryo. Experimental work (including drug treatments, laser ablation, and knockdowns) lead the authors to propose that this asymmetry is driven by tissue-level tension. The findings are quite interesting and the manuscript is well-written overall. Major Comments: • The authors propose that localization is driven by tissue-level tension, but the direction of the tension isn't clear from the experimental work. For example, the laser ablation experiments cut around the entire perimeter of the mitotic domain, rather than along just one tension axis. Similarly, the finding that disruption of the ventral furrow (by Snail RNAi) interferes with spindle orientation in the head is very puzzling; the furrow is A) outside the embryonic head and B) runs in the parallel direction to the divisions considered. The authors need to address the directionality of tension experimentally.

      We thank the reviewer for this comment and agree that better defining the direction of tension would strengthen our manuscript. We showed that blocking mesoderm invagination with Snail depletion disrupts spindle orientation, despite Snail not being required for cephalic furrow formation (refs). Recent light sheet data has shown that mesoderm invagination is associated with global movements throughout the embryo. Furthermore, the ventral furrow extends into the head region just past the anterior of MD5. To address the reviewer’s comments, we plan to: 1) Perform directional laser ablations to determine the directionality of the tension that orients the spindle, 2) Analyze strain rates in the mitotic domains prior to and during division, and 3) Add to our Discussion more about what is said in the literature about the movements that occur in the head during mesoderm invagination.

      • As acknowledged in the text, the asymmetric enrichment of Pins in MD14 is fairly weak. Since the cells being examined here border a divot in the tissue, and might therefore be curving relative to the focal plane, it would be good to rule out the possibility that some of the asymmetry in Pins intensity is just a consequence of cell/tissue geometry. One way this could be achieved is by showing multiple focal planes.

      Good point. We do not think that the asymmetric Pins enrichment in MD14 is due to tissue geometry or junction tilt. 1) MD14 divides ~10-15 minutes after mesoderm invagination is completed, so the cells do not border a divot (as seen with Gap43::mCh, Fig. 2I). The cells do round up, which can be seen as gaps between cells (Fig. 3E). 2) We compare Pins to GapCh and only see an enrichment with Pins (Fig. 2H-K). If the enrichment was due to tissue curvature or junction orientation relative to imaging axis, we would see the same enrichment in GapCh. 3) Expression of myr-Pins randomizes spindle orientation in MD14 (Fig. 3M, N).

      • In Figure 3I (and 3M?), it appears that there are fewer cell divisions in the presence of myr-Pins. Is this the case? Since cell shapes change during division, and cell shapes influence tissue tension, an increase in cell divisions could lead to a change in tissue tension. This would be important to address, since tissue tension plays an important role in the proposed model.

      These images are not taken at the same point of MD1 division ‘wave’, there are the same number of divisions in each condition. These mitotic domains exhibit a ‘wave’ of cell division (Di Talia and Wieschaus, 2012), and so the number of divisions in each image reflect the timing at which we captured the image. Quantifications involved divisions throughout this wave, but we have chosen images for figures which are most representative of what we see. We will add this to the text in the final version of the manuscript.

      • The alpha-catenin and Canoe results are a bit confusing: - The rose plot in Figure 4D doesn't show a random distribution of spindle angles, but rather a modest change; most spindles still orient in the normal range. The p value in the figure legend (0.0012) is very different from the one in the figure (5.8284e-04). - Alpha-catenin is the strongest way to disrupt AJs, but A) the epithelium appears to be intact in the knockdown condition and B) spindle orientation is impacted but not randomized. Does this mean that the knockdown is incomplete? Or is Cadherin-mediated adhesion (in which alpha-catenin participates) only partially responsible for force transduction?

      We acknowledge that perturbation using ____alpha-cat RNAi does not recapitulate the complete disruption of division orientation seen in embryos expressing myr-Pins. This is likely due to the variability in the strength of RNAi knockdown, which is observed for most RNAi lines that we use. To address the reviewer’s comment, we have added rose plots for individual embryos showing extremes in the severity of division orientation disruption (Fig. 4E and F). For the main plot (Fig. 4D), we have included all the data that we took because we obviously did not want to pick and choose which embryos were used for analysis. So Fig. 4D includes all the variability.

      • Given that previous studies implicate Canoe in Pins localization, it seems important to lock down the question of whether Canoe is participating in the mechanism described in this paper. How do the authors know the extent of Canoe knockdown? As suggested by the alpha-catenin results (described above), is it possible that Canoe knockdown is simply not strong enough to impact spindle orientation? Aren't there genetic nulls available? We thank the reviewer for bringing these points to our attention. There are certainly genetic nulls available (Sawyer et al., 2009), but the experiment suggested by the reviewer would not establish the necessity of Canoe in mitotic domain cells. This is because Canoe nulls severely disrupt mesoderm invagination (Sawyer et al., 2009; Jodoin et al., 2015), as well as affecting junctions in the ectoderm during germband extension (Sawyer et al., 2011). Therefore, we would not be able to distinguish what effect of Canoe would be responsible for the spindle orientation using a null mutation. We did better experiments, we used 1) a mutant which specifically compromised mesoderm invagination (snail), 2) laser isolation to show the importance of external force transmission in orienting mitotic domain divisions, and 3) RNAi to deplete Canoe so that mesoderm invagination initiates and pulls on the ectoderm, but where there is clearly compromised Canoe function. This treatment did not cause any effect on spindle orientation arguing against a role of Canoe in this case. In response to the reviewers comment, we added language to the Results to indicate that it is possible that the Canoe knockdown is not strong enough and our rationale for why we did not perform the experiment in a Canoe null (lines 279-282).

      Minor Comments:

      • It can be difficult to interpret some of the spindle orientation data since the AP axis is vertical in the diagrams but horizontal in the rose plots. Can one of these be flipped so they go together?

      We thank the reviewer for this suggestion and have flipped the rose plots so they match the images. Note that because of the large size of the figures, we have had to consistently orient anterior towards the top, which we establish at the beginning of the Results.

      • Figure S3 is important information for the reader and should be ideally moved into the main paper. - Protein localizations referred to in text should be annotated on images, as they can be hard to see.

      We disagree that S3 should be included in the main paper. The myr-Pins reagent has been used previously so the information in S3 is not new (Chanet et al., 2017).

      • There are some discrepancies between figures, legends and text. - p-values differ between figures, legends, and/or text. - Fluorescent markers are labelled differently in figures and legend (CLIP170 in Figure 1) - Graphs appear to show that MD3 polarizes on posterior side, but figure legend says anterior in Figure S1. Vice versa for MD5.

      We thank the reviewer for catching these typos. We have fixed these issues.

      • Ideally, multichannel image overlays should be shown along with individual channels (b/w). However, it is appreciated that the fluorescent signals are exceptionally weak in this study, presenting a challenge to presentation and to quantification.

      We agree the overlays would be nice. However, the Pins::GFP signal is weak compared to the tubulin and Gap43 signals, the merge does not provide more clarity, and the figures are already quite large. Therefore, we have only included the separated the images.

      • Graph axes depicting spindle orientation would be more clear if shown in degrees, instead of normalized or in radians.

      We thank the reviewer for this suggestion. We have changed the graph axes to be in degrees.

      Reviewer #2 (Significance (Required)): Several recent studies have demonstrated that division orientation (in the tissue plane) is governed by tissue level tension. Remarkably, it appears that diverse mechanisms link tension with spindle orientation. Here the authors provide the first in vivo evidence connecting tension to the asymmetric localization of Pins, an important and evolutionarily conserved spindle orientation factor.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): This beautiful manuscript uncovers a role for planar polarized PINS/LGN in orienting the mitotic spindle in Drosophila epithelia. In response to morphogenetic forces acting on adherens junctions, PINS/LGN localises to junctions in a planar polarized fashion to orient the spindle, and de-polarization of PINS/LGN prevents planar spindle orientation. The experiments are very well performed and the findings are robust. The conclusions are well supported by the data. Reviewer #3 (Significance (Required)): These important findings mirror previous work in human cell culture, but crucially reveal that the same phenomenon occurs in vivo in the Drosophila embryo. Thus, the findings underscore the highly conserved nature and in vivo relevance of this phenomenon.

      We thank this reviewer for reading the manuscript and their encouraging words.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This beautiful manuscript uncovers a role for planar polarized PINS/LGN in orienting the mitotic spindle in Drosophila epithelia. In response to morphogenetic forces acting on adherens junctions, PINS/LGN localises to junctions in a planar polarized fashion to orient the spindle, and de-polarization of PINS/LGN prevents planar spindle orientation. The experiments are very well performed and the findings are robust. The conclusions are well supported by the data.

      Significance

      These important findings mirror previous work in human cell culture, but crucially reveal that the same phenomenon occurs in vivo in the Drosophila embryo. Thus, the findings underscore the highly conserved nature and in vivo relevance of this phenomenon.

      Referees cross-commenting

      this session contains comments of all reviewers

      Reviewer 2

      My biggest concern was that the direction of tension isn't obvious. I was particularly puzzled over the ventral furrow experiments, since I'm not clear on how that manipulation impacts the head. I agree with Reviewer #1 that it makes more sense to disrupt the cephalic furrow, but I'm not sure how to do that.

      Reviewer 1

      Agreed. I guess the question is whether there are cephalic furrow mutants in which mesoderm invagination is not affected. If so, those would be ideal.

      Reviewer 3

      Hi both. I understand your comments, but I felt that the direction of tension was apparent from the spindle orientation and the cell division axis itself. So, I wasn't concerned about using the snail mutant to prevent gastrulation and thus abolish forces generally.

      Reviewer 2

      I see. Well I certainly suspect that you and the authors are correct - and I'm enthusiastic about that! - but I'm concerned that using the direction of division to define the direction of tension is getting a little bit circular with the argument. I noticed that their ablation experiments aren't directional; instead they isolate the entire MD. Reviewer 1, as an expert in ablations, do you think it would make sense to make cuts that are only AP or DV?

      Reviewer 1

      I agree with Reviewer 2 about the circularity of the argument. I was going to propose AP vs DV cuts in sna mutants,with the idea that the wound healing response to those would pull in specific directions. My concern is that It won't be an effect of the same magnitude as the entire mesodermal placode going in, but maybe worth trying?

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Overview:

      In this manuscript, Camuglia et al. show Pins/LGN, which is understood to drive spindle orientation, can localize asymmetrically (with respect to the tissue plane) in the Drosophila embryo. Experimental work (including drug treatments, laser ablation, and knockdowns) lead the authors to propose that this asymmetry is driven by tissue-level tension. The findings are quite interesting and the manuscript is well-written overall.

      Major Comments:

      • The authors propose that localization is driven by tissue-level tension, but the direction of the tension isn't clear from the experimental work. For example, the laser ablation experiments cut around the entire perimeter of the mitotic domain, rather than along just one tension axis. Similarly, the finding that disruption of the ventral furrow (by Snail RNAi) interferes with spindle orientation in the head is very puzzling; the furrow is A) outside the embryonic head and B) runs in the parallel direction to the divisions considered. The authors need to address the directionality of tension experimentally.
      • As acknowledged in the text, the asymmetric enrichment of Pins in MD14 is fairly weak. Since the cells being examined here border a divot in the tissue, and might therefore be curving relative to the focal plane, it would be good to rule out the possibility that some of the asymmetry in Pins intensity is just a consequence of cell/tissue geometry. One way this could be achieved is by showing multiple focal planes.
      • In Figure 3I (and 3M?), it appears that there are fewer cell divisions in the presence of myr-Pins. Is this the case? Since cell shapes change during division, and cell shapes influence tissue tension, an increase in cell divisions could lead to a change in tissue tension. This would be important to address, since tissue tension plays an important role in the proposed model.
      • The alpha-catenin and Canoe results are a bit confusing:
        • The rose plot in Figure 4D doesn't show a random distribution of spindle angles, but rather a modest change; most spindles still orient in the normal range. The p value in the figure legend (0.0012) is very different from the one in the figure (5.8284e-04).
        • Alpha-catenin is the strongest way to disrupt AJs, but A) the epithelium appears to be intact in the knockdown condition and B) spindle orientation is impacted but not randomized. Does this mean that the knockdown is incomplete? Or is Cadherin-mediated adhesion (in which alpha-catenin participates) only partially responsible for force transduction?
        • Given that previous studies implicate Canoe in Pins localization, it seems important to lock down the question of whether Canoe is participating in the mechanism described in this paper. How do the authors know the extent of Canoe knockdown? As suggested by the alpha-catenin results (described above), is it possible that Canoe knockdown is simply not strong enough to impact spindle orientation? Aren't there genetic nulls available?

      Minor Comments:

      • It can be difficult to interpret some of the spindle orientation data since the AP axis is vertical in the diagrams but horizontal in the rose plots. Can one of these be flipped so they go together?
      • Figure S3 is important information for the reader and should be ideally moved into the main paper.
        • Protein localizations referred to in text should be annotated on images, as they can be hard to see.
      • There are some discrepancies between figures, legends and text.
        • p-values differ between figures, legends, and/or text.
        • Fluorescent markers are labelled differently in figures and legend (CLIP170 in Figure 1)
        • Graphs appear to show that MD3 polarizes on posterior side, but figure legend says anterior in Figure S1. Vice versa for MD5.
      • Ideally, multichannel image overlays should be shown along with individual channels (b/w). However, it is appreciated that the fluorescent signals are exceptionally weak in this study, presenting a challenge to presentation and to quantification.
      • Graph axes depicting spindle orientation would be more clear if shown in degrees, instead of normalized or in radians.

      Significance

      Several recent studies have demonstrated that division orientation (in the tissue plane) is governed by tissue level tension. Remarkably, it appears that diverse mechanisms link tension with spindle orientation. Here the authors provide the first in vivo evidence connecting tension to the asymmetric localization of Pins, an important and evolutionarily conserved spindle orientation factor.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Camuglia, Chanet and Martin investigate the mechanisms that control cell division orientation in vivo, using the mitotic domains (MDs) in the head of the Drosophila embryo as their main model system. They find that cells in the head mitotic domains rotate and align their spindles within 30 degress of the anterior-posterior axis of the embryo. The Pins protein, implicated in spindle orientation in other systems, is planar polarized in mitotic cells. Pins polarization precedes spindle rotation and is correlated with the division angle (but cell shape is not, violating Hertwig's rule). Overexpression of myristoylated Pins results in uniform Pins distribution on the membrane and affects spindle orientation. alpha-catenin RNAi (but not canoe RNAi) disrupts Pins polarity and spindle orientation in MDs 1, 3 and 5. Low dose CytoD injections (which should disrupt force transmission) also result in defective Pins polarity and spindle orientations. Finally, mechanical isolation by laser ablation also disrupts spindle orienttion. The authors find that preventing mesoderm invagination by snail dsRNA disrupts Pins polarity and spindle orientation in the head.

      Major

      1. Is there a certain chirality in the rotation of the spindles? From Movie 1, it seems like in MDs 1 and 3 at least, a majority of spindles on the right side of the embryo rotate clockwise, while spindles on the left side rotate counter-clockwise? Is that so, and in that case, are there geometric/molecular considerations that could explain that chirality?
      2. The authors are experts in mesoderm invagination, and understandably concentrate on the role that forces from that process may have in the orientation of head MD divisions. However, the cephalic furrow forms much closer to the head MDs, and in an orientation that might also explain the alignment of spindles in the head. Is cephalic furrow formation important for Pins polarity and spindle orientation in the head MDs?
      3. Does expression of myristoylated Pins afect mesoderm invagination (or cephalic furrow formation)? From Table S1 it seems that a maternal Gal4 driver was used to express myristoylated Pins, which could affect other tissues in the embryo. So it is in principle possible that effects of myristoylated Pins on mesoderm internalization/cephalic furrow formation could affect cell division orientation much like sna loss of function does, but in a mechanism that does not depend on Pins polarity. There is definitely an effect on mesoderm invagination in alpha-catenin RNAi (but not in canoe RNAi) embryos, so I wonder if the effect could be consistently through defects in mesoderm invagination (or cephalic furrow formation), and Pins polarity is really dispensable for spindle orientation. Are there head-specific Gal4 drivers that could be used to drive myristoylated Pins exclusively in the head?
      4. Related to the previous point, does mechanical isolation by laser ablation (Figure 6I-N) affect Pins polarity? This experiment could alleviate some of my concerns above, as it certainly does not (should not?) disrupt neither mesoderm invagination nor cephalic furrow formation.

      Minor

      1. Figure S5: I am a bit confused about the role of Toll 2, 6, 8 in orienting spindle orientation. In Figure S5D it seems that dsRNA treatment against these genes does not disrupt spindle orientation, but Figure S5F shows quite a significant (p=0.0057) effect in triple mutants. The authors favor the idea that Toll receptors do not affect spindle orientation, but the difference with the mutant should be addressed. Furthermore, what happens in MDs 3, 5 and 14 (if the germband extension defect does not affect those divisions)? Is there a difference between dsRNA and triple mutant embryos in these other MDs?
      2. No statistical analysis is provided for any of the differences in polarity between Pins and Gap43, and this should be done to demonstrate the significance of the polarization of Pins. Also, particularly for MD14, they should compare anterior vs. posterior polarity, as based on the images in Figure 2H it is not clear that there is a difference between the anterior and posterior side of cells.
      3. Figure 2A-D: the authors propose that Pins localizes preferentially to the posterior end of cells (instead of both anterior and posterior ends) in MDs 1, 3 and 14 (and anterior in MD 5). How is the asymmetry in the distribution of Pins along the AP axis accomplished, and is there any significance to it? This should be discussed in a bit more detail (currently no potential mechanisms provided in the discussion, just an acknowledgment of the question).

      Typos

      1. Line 49: "one daughter cells" should be "one daughter cell".
      2. Line 193: "rotation. (Figure 3E-F)." should be "rotation (Figure 3E-F)."
      3. Lines 232-237: please review.
      4. Line 238: "epithelia cells" should be "epithelial cells".

      Significance

      This is the first study to my knowledge that demonstrates the role of mechanical forces in polarizing Pins, and provides a nice model to further investigate how mechanical forces generated in one tissue may affect cell division orientation in distant ones. The paper is clear, well written, and quantitative analysis is present for most results. I have some issues with the statistics (or lack thereof) for a couple of results, and potential alternative interpretations for some experiments that in my opinion should be addressed prior to publication. Specifically, it is not clear to me if Pins polarity is at all necessary for spindle orientation in any of the examined MDs.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      'The authors do not wish to provide a response at this time.'

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Hattori et al. assessed the role of astrocytic CD38 by generating astrocyte-specific conditional CD38 knockout mice and discovered defects in social memory, synapse, and spine density in the mPFC. They further showed that conditioned media from CD38-deficient astrocytes are defective in promoting synapse formation. A known astrocyte-derived synapse promoting protein, Sparcl1, is reduced in the conditioned medium from CD38 KO astrocytes and pharmacological experiments suggest that CD38 and calcium signaling regulates Sparcl1 secretion by astrocytes.

      The discoveries are novel and important and will be of broad interest to readers. However, the following concerns need to be addressed to improve the manuscript.

      Major comments:

      1. It's unclear if experiments were conducted while the experimenters are blinded to the genotype of the mice. This is essential for behavior tests.
      2. Hippocampus is also important for memory formation. Do synapse and spine densities change in the hippocampus?
      3. The proposed model of CD38 inducing Ryr3-mediated calcium release from internal stores is interesting. However, the Barres database showed that Ryr3 is not expressed by mouse astrocytes. Could the authors demonstrate the presence of Ryr3? That's a key link in their model that hasn't been demonstrated to operate in astrocytes.
      4. The authors demonstrated reduced synapse and spine density in mPFC. Interestingly, a battery of behavior tests showed no defect, except for the social memory test. Reducing synapses in mPFC should affect a range of behaviors. Why that is not the case here?
      5. The authors only tested very short-term memory (30 minutes delay). Does CD38 regulate long-term memory? It would be important to know but I realize that a single paper cannot address all questions and therefore do not think addressing this point is a prerequisite for publication.

      Minor comments:

      1. Fig. 2F, multiple comparison adjustment is needed.
      2. Fig. 3A, scale bar is 10 micrometers, not millimeters
      3. Fig. 4C, D, it is unclear if the quantification is normalized to actin loading control. BDNF levels appear lower in KO, though not significantly different, raising the question of whether an equal amount of samples was loaded.
      4. Need to validate whether CD38 levels are reduced in P42-46-injected adult knockout before concluding that CD38 is required only during development

      Significance

      Astrocytic contribution to social memory has not been reported. This study is thus the first report on the role of astrocytes in social memory. Their discovery of CD38-regulation of Sparcl1 release is also novel and important for synapse formation, although more evidence is needed to support this point (see major comments above). This study will be of broad interest to neuroscientists. I have expertise in cellular and molecular neurobiology and can evaluate all parts of the paper.

      Referees cross-commenting

      I agree with the issues that the other reviewers pointed out, especially the need for improving data reporting and consistency/accuracy. Overall, I think this manuscript has potential and the issues are addressable.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary

      In their manuscript, Hattori et al., put forward evidence that the knock-out of CD38 expression in astrocytes at approximately post-natal day 10 (referred to as CD38 AS-cKO P10) leads to a specific deficit in social memory in adult mice, while other types of memory remain unaltered. Using immunohistochemistry (IHC), the authors found a reduced number of excitatory synapses in the medial prefrontal cortex (mPFC) of CD38 AS-cKO P10 mice. Switching to in vitro primary cell culture models, the authors identify the astrocyte secreted protein SPARCL1 as a relevant synaptogenic factor. Using pharmacological dissection of relevant signaling pathways, Hattori et al., propose that cADPR formation and calcium released from intracellular stores, is essential for SPARCL1 secretion from astrocytes. Finally, the authors analyzed the transcriptome of primary CD38 KO astrocytes using bulk mRNA sequencing, and found that genes related to calcium signaling were downregulated in these cells.

      Major commments:

      • Are the key conclusions convincing?
        1. From a global perspective, the multiple lines of evidence provided by the authors strongly suggest that expression of CD38 in astrocytes is important for synaptogenesis in the mPFC of P10 mice, with ablation of CD38 and reduced synapse formation leading to social memory deficits at P70. However, the data concerning the role of astrocyte-secreted SPARCL1 is not particularly strong: further experiments are needed to support this claim (see below).
      • Are the claims preliminary or speculative?
        1. As it stands, there is no proof that the claimed astrocyte-specific deletion of CD38 is actually astrocyte specific. This evidence is crucial: without it the reported effects could be due to non-specific CD38 knock-out in other CNS cells. In this respect, the Western Blot in Supplementary Figure 1A does not provide information on astrocyte-specific deletion, merely that CD38 was globally reduced in the mPFC. Interestingly, the authors have previously published data (Hattori et al., 2017, 10.1002/glia.23139) showing that CD38 expression is mostly astrocyte-specific, peaking at p14, which coincides with the peak period of synaptogenesis. The degree of CD38 heterogeneity is also an issue that I think the authors need to consider. Do they information on this? Is CD38 expressed in every astrocyte of the CNS, or are there some astrocytes that are CD38 negative at P14? Is the mPFC a region specifically enriched in CD38 positive astrocytes and does this explain the observed behavioral deficit? I think if this is known, the authors should mention it in the "Introduction" or "Discussion". If this is not known, maybe the authors could provide data addressing the issue.
        2. I think the authors should take more caution in claiming that SPARCL1 is the main factor secreted through the CD38 signaling pathway and responsible for increased synaptogenesis. This is for several reasons, all centered on data displayed in Figure 4 and Supplementary Figure 6:
          • a) Western Blot (WB) data: The "Materials and Methods" section for WB does not indicate how protein loading and transfer efficiency were controlled for. Normalizing to β-Actin levels is an acceptable way to control for loading and transfer efficiency when using cell lysates. However, in the absence of such an abundant structural protein in conditioned media it is unclear how loading and transfer was controlled for under these conditions. Do the authors normalized the CD 38 KO AS ACM data by expressing protein levels relative to those from WT AS ACM? Is BDNF being used as a control, based on proteomics data? If so, why is proteomics data not given in the manuscript and why is this control not shown for all ACM blots? I realize that (quantitative) blotting using ACM is difficult, but I am also not convinced that the methodology used is sufficiently rigorous. Simple steps to give confidence would be Coomassie staining of gels both before and after membrane transfer, to show that i) the total protein amount loaded was the same in each lane of the gel and ii) the transfer to the nitrocellulose membrane was complete. In addition, Ponceau S staining of the nitrocellulose membrane should also have been performed and displayed, to show (roughly) equal amounts of protein were transferred for each lane. In summary, the WB data quantification needs to be better controlled. The values of the Y axis in these graphs (and throughout the manuscript) are simply too small to be read properly. Finally, I want to highlight the general lack of precision regarding the nature of the replication unit (the "n"). For example, the legend of Figure4C-D states "n = 6", but we have no idea if these are 6 independent primary cultures originating from 6 mice, 6 independent cultures from the same mouse, 6 repeats of the Western Blot using the same sample etc. This issue is valid for the whole manuscript: in my opinion, the authors should be more much careful when it comes to these crucial elements of scientific reporting.
          • b) While the data hint at an important role of SPARCL1 in synapse formation, when the authors tested if ACM from CD38 KO astrocytes supplemented with exogenous SPARCL1 could rescue synapse formation, the effect was incomplete, with only a trend to an increase in synapse number (Figure 4J-K). Perhaps the authors simply forgot to indicate the statistical significance of differences between the experimental groups (Figure 4K)? However, if there really were no statistically significant differences observed, the authors should reduce the strength of their conclusions regarding SPARCL1. This protein may well be pro-synaptogenic but, as it stands, other factors could well be in play. Perhaps the authors should have tried higher concentrations of SPARCL1 to further boost synaptogenesis? In this respect, the SPARCL1 knockdown (KD) experiment in Supplementary Figure 6B-D is an important addition, but should be supplemented by rescue with an siRNA-resistant recombinant SPARCL1? If SPARCL1 is a major player in synaptogenesis, the prediction is that synapse numbers would be close to wild type levels with this approach.
          • c) In my opinion, there are also issues with the data displayed in Figure 4H-I. The authors want to convince the reader that SPARCL1 is mostly an astrocytic protein using immunohistochemistry on mouse mPFC sections, co-labelled with antibodies against neuronal and astrocytic markers. In these panels, we are presented with images showing a few cells, in which it seems SPARCL1 is absent from NeuN positive cells, present in WT astrocytes and reduced in CD38 AS-cKO P10 astrocytes. However, the numbers of cell counted and lack of quantification severely impact on the strength of this conclusion. In my opinion, the authors should have quantified their IHC data by counting cells and establishing the ratios of SPARCL1 positive over NeuN or S100β positive cells, in both control and CD38 AS-cKO P10 animals. This experiment would provide critical information that the conditional gene targeting strategy is robust. The authors should also consider quantifying the intensity of the SPARCL1 signal in astrocytes. This is recommended as the image displayed in Figure 4I for the CD38 AS-cKO is problematic: are the authors really claiming that the reduction in SPARCL1 expression following cKO of CD38 in astrocytes is at best only partial? Is 11 days between the first tamoxifen injection and tissue fixation actually sufficient to allow for CD38 turnover? With low levels of protein turnover, the possibility exists that residual levels of CD38 are still sufficient to impact SPARCL1 levels. What would happen if there is a greater interval between tamoxifen administration and tissue recovery? Would levels of synaptogenesis be further reduced? Is this an issue of production versus secretion or a combination of factors?
        3. The heatmap (Figure 5E-F) is simply too small to interpret. The color choice is also not accessible for colorblind readers. The authors might consider displaying this heatmap in a separate figure. The authors should also provide a supplementary table where all the genes detected are listed along with their respective counts. Furthermore, it is surprising that the authors only found genes being downregulated in CD 38 KO astrocytes. Were they really no genes up-regulated? The authors might also want to indicate the genes belong to each of the ontological categories listed in Figure 5F. On p. 11, Figure 5E: The authors should indicate in the main text they performed bulk RNA-sequencing and not another type of RNA sequencing (like single cell RNA sequencing for instance). The authors indicate n = 2 but we have no indications of the nature of the replicate (also see earlier comments). Please amend.
      • Are additional experiments necessary? I think supplementary experiments are essential to support the claims of the paper. Most are described in the section above, but to summarize:
        1. Show data to prove that the CD38 AS-cKOP10 model is astrocyte-specific and leads to a total loss of CD38 in these cells.
        2. WB data: The issue of protein loading and transfer efficiency should be dealt with. Quantifications should be revisited.
        3. The authors should quantitatively analyze the different IHC performed in Figure 4H-I.
        4. The authors should provide more information on their RNA sequencing data: list of genes detected with their FPKM values etc. The authors should display the RNA sequencing data in a separate figure, allowing the heatmap to be enlarged.
        5. LC-MS/MS data: the authors should provide the list of all the proteins they identified in their LC-MS/MS experiment. As a supplementary table for instance? The majority of these experiments should be able to be performed with pre-existing samples/tissue slices. If not, the experimental pipeline necessary exits and these supporting experiments should not be too burdensome.
      • Data and methods presentation Methods: The authors need to work on this aspect of the manuscript. Most of the important details are already described, but some crucial ones are missing, while the phrasing used to describe methods is sometimes misleading. I will give some examples here, but this is not an exhaustive list. The fact that the manuscript is riddled with small mistakes, inconsistencies and/or oversights makes it difficult to read and creates a negative impression. The whole manuscript would benefit from a thorough proof-reading, preferably by a native speaker.
        1. in the "Immunohistochemistry and Synaptic Puncta Analysis" section on p. 21-22, we have no indication of which antibodies against "GFAP, NDRG2, VGlut1, PSD95, S100β, NenN(?) and SPARCL1" were used. It is standard practice to indicate the company, product number and lot number. The authors must also indicate the dilution at which they use these antibodies. On p.22, the authors write the cells were incubated with "Alexa- or Cy3-conjugated secondary antibodies". The excitation wavelengths of the Alexa dyes used need to be given.
        2. The authors need to provide more details on the microscope they used. Merely writing "using a 63× lens on a fluorescence microscope" (p.23) is insufficient.
        3. In the "LC-MS/MS" method the authors wrote: "Briefly, these proteins were reduced, alkylated, and digested by trypsin". I think that in the reduction and alkylation steps, chemicals other than trypsin were actually used. This sentence should be modified to reflect this.
        4. p.19: "uM" is written when the authors very likely mean "µM". Please check the whole manuscript for repeat examples. I know this is often lab "short-hand", but it should be avoided in scientific publications.
        5. The authors should be careful when describing their data to always indicate whether they referring to experiments performed using cultured astrocytes or not. As it stands, the text is confusing: for instance, when describing RNA-sequencing data in Figure 5, the main text appears to indicate that these astrocytes were acutely isolated from adult mice, when in fact they were obtained from primary cultures. Given concerns in the literature about potential differences between acutely isolated and cultured astrocytes (Foo et al., Neuron, 2011), this is essential. Data presentation: The figures appear to have been produced in a rush - and almost have a "screenshot" feel to them. This is not a scientific issue per se, but does impact on the overall impression given by the manuscript. The following is a non-exhaustive list of issues with the figures. I list the major ones that the authors should correct.
        6. Almost all Y axis labels are too small. The authors should comply to the basic journal requirements in terms of font sizes. Some axes do not end on a tick (e.g. Figure 3R). This is not dramatic, but should be corrected. Globally, the authors need to display bigger bar plots - most of them are extremely hard to read. Labeling should also be checked: Figure 4K, the Y axis label indicates values displayed are in %, when I think the axis graduation displays ratio values. Some of the IHC pictures are also too small to be easily interpreted.
        7. The heatmap in Figure 5E is impossible to read and, as such, has little or no value for the manuscript.
        8. Scale bars: where is the scale bar in Figure 2A? Figure 3A-H: Is the scale bar really representing 10 millimeters? Supplementary Figure 3A: scale bar is missing. Please check for similar issues throughout the manuscript.
        9. Figure Legends are problematic, and often contain incorrect or incomplete information. Examples include: Supplementary Figure 1: The description of panels J, L and N appears to be missing. Please also use the Greek letter beta and not 'b' for S100β. Supplementary Figure 5: I think the term "KO" is missing after CD 38 in the legend title. Figure 3: why state that nuclei were counterstained with DAPI in Figure 3P,Q, when this precision is not given for panels Figure 3A-H? Figure 3A-H: If the authors choose to explicitly state PSD95 is a post-synaptic marker, why not indicate that VGlut1 is a pre-synaptic marker? Same issue in Supplementary Figure 4.
        10. There are multiple instances of panels being wrongly referred to in the main text. On p.10, Figure 4H is referenced, when I think the authors mean Figure 4I; on p.10, Figure 4I-J are referred to when the authors clearly describe data found in Figure 4J-K. These types of mistakes are problematic and recur throughout the manuscript.
      • Statistical analysis As mentioned above, the exact nature of the replicates is often not stated, when the "n" number is indicated. The authors must correct this issue and give the information either at the appropriate point in the main text or in the figure legend.

      The authors should also be more consistent in the way they indicate which statistical tests were performed. This should also be indicated either at the appropriate point in the main text or in the figure legend. Furthermore, care should be taken to ensure statistics are presented in an appropriate manner: at the end of legend for Figure 4, it is indicated #p < 0.05 vs. CD38 KO ACM. This hashtag symbol is completely absent from the figure. In Figure 4F-G, the lack of statistical symbols seems to indicate no statistical tests were performed on these data, when the legend covering these panels states "*p < 0.05 versus P70", indicating some tests were done. We cannot interpret this panel without knowing which comparisons were done exactly and which were significant.

      In the "Materials and Methods", the authors give no indication that the assumptions of the statistical test they used were met (normality of data distribution for t-tests, homogeneity of variances for ANOVA...). This needs to be checked, and if not met, appropriate non-parametric tests should be used instead.

      Minor commments:

      • Specific experimental issues that are easily addressable. Most of the experimental issues that need to be addressed are given in previous sections and should be easily addressable.
      • Citation of previous studies? Adequate
      • Clarity and accuracy of text and figures There are issues with the clarity and accuracy of text and figures - which are described above. The text is also often problematic in its phrasing and other, more fundamental aspects. For instance, the authors spent a considerable amount of time speaking about the role of oxytocin, when they only performed one measurement of oxytocin levels in mice.
      • Suggestions to improve the presentation of data and conclusions? All my suggestions to improve the presentation of data can found in previous sections. As for improving the authors presentation of their conclusions, the authors should make a considerable re-drafting effort, particularly for the "Discussion", which lacks clarity in how supporting arguments are built and presented. For example, on p.13, I am confused with the argument made by the authors. Their data are focused on synapses onto pyramidal neurons of the mPFC, but here the discussion states that the behavioral phenotype they observed in CD38 AS-cKOP10 might be explained by a lack of mPFC neurons synapsing onto neurons in the Nucleus Accumbens (assuming that "NAc" really refers to this brain region, as the definition is missing from the text). I think the authors should make it clear if this is their interpretation of their own result, which essentially renders their focus on mPFC pointless, or a speculation on possible other mechanisms that could also explain their behavioral results. Personally, given the data shown, I believe the authors should focus on explaining how their data in mPFC might explain the behavioral output observed. The authors could also provide perspectives on how the hypothesis laid down in this paragraph would be tested. When the authors write on p.14 "We identified SPARCL1 as a potential molecule for synapse formation in cortical neurons" why use the word "potential"? Does this mean the authors consider their data on SPARCL1 (one of the key messages of the paper) invalid? If the authors themselves think the role of SPARCKL1 is ambiguous based on their own data, they should perform further experiments. P. 13, the authors write: "Moreover, many studies have shown that astrocyte-specific molecules, including extracellular molecules such as IL-6, are involved in memory function"; Interleukin 6 (Il-6, abbreviation not defined in the manuscript) is definitely not an astrocyte-specific molecule (see, for example, Erta et al., 2021 10.7150/ijbs.4679).

      Significance

      NATURE AND SIGNIFICANCE OF THE ADVANCE: I think that despite the issues described above, this manuscript, once revised, could have a strong impact in the field. It would fuel the current paradigm shift which puts astrocytes at the forefront of neuronal circuit wiring during development with links to adult behavior. By identifying clear molecular targets involved in astrocyte-driven synaptogenesis, this article could help the clinical field to find new druggable targets, which may help reverse aging-related cognitive decline.

      COMPARISON TO EXISTING PUBLISHED KNOWLEGDE: This work adds new data in the specific and growing line of research that study how astrocytes control synaptogenesis. Recent reviews have summarized advances in this field (Shan et al., 2021, 10.3389/fcell.2021.680301; Baldwin et al., 2021, 10.1016/j.conb.2017.05.006).

      AUDIENCE: Neuroscientists in general, clinicians interested in cellular and molecular causes of neurodevelopmental disorders leading to social dysfunctions.

      REVIEWER EXPERTISE: Astrocyte biology; Astrocyte-neuron interactions and synapse assembly; Neuronal circuit formation and plasticity

      Referees cross-commenting

      After careful reading of the other comments, I feel that there is considerable agreement/overlap between the reviewers on the main issues with this manuscript. Perhaps the major difference relates to the amount of further work necessary for the manuscript to be publication ready.

      As Reviewer 3 rightly points out, this is always a moot point: how much is it reasonable for reviewers to ask authors to do? While I agree with all of Reviewer 1's comments regarding the rigour of the mass-spec/western blot analysis, it seems to me that from a molecular/cell biological point of view, the key issue is whether Sparcl1 is a synaptogenic factor released from astrocytes following CD38/cADPR/calcium signaling (irrespective of whether other factors may be in play); and whether raising Sparcl1 levels is sufficient to recover spine morphology and synapse numbers. Of course, if these experiments were performed in vivo using AAV-mediated overexpression of Sparcl1, it is also reasonable to think that the deficit in social memory may be reversed on testing.

      The issues of whether there is a difference in observable behavioral phenotypes between the astrocyte-specific and constitutive CD38 knock-outs is an interesting one, as is why there is only a deficit in social memory seen following astrocyte-specific CD38 ablation. These issues should at least be discussed.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In their work submitted for review, Hattori et al. identify an astrocyte enriched protein (CD38) as important for social memory tasks in mice. The authors developed a conditional KO model to remove CD38 specifically in astrocytes using the GLAST-CreERT2 line crossed to a CD38 floxed line. The investigators use a three-chamber social approach test to show that loss of CD38 leads to reduced interaction time with a novel social stimulus only when the animal is given a break between test periods. The authors test whether changes in neuronal morphology or synapses in the medial prefrontal cortex (mPFC), a region important for social memory, can account for their behavioral phenotype. The researchers found that mPFC neurons in their conditional CD38 KO (cKO) animals have significantly less mature spines than wild-type (WT) controls. The authors then claim that this reduction in mature spines correlates with a reduction in VGluT1 positive excitatory synapse density in mPFC of cKO vs. WT. Next, the investigators use mass spectrometry of astrocyte conditioned media and neuronal cultures treated with astrocyte conditioned media to test whether a known astrocyte secreted synaptogenic factor, Sparcl1/Hevin, could underlie their reported changes in synapse density in their cKO animals. Finally, the authors use pharmacological inhibitors against different components of the CD38 signaling pathway to test whether CD38 regulates Hevin secretion by astrocytes. While the reported behavioral phenotype is interesting, this reviewer has several major concerns with the data claiming that reduction in Sparcl1/Hevin is underlying synaptic phenotypes in the CD38 cKO. Therefore, the paper is not suitable for publication without addressing the concerns listed below.

      Major Concerns:

      Synapse analysis in vivo: For the analysis of VGluT1 excitatory synapses in mPFC, it is not clear how the statistical analysis was performed. From the plotted error bars, it seems that the investigators used individual z-projections as the n for a t-test. This is inappropriate for this analysis as it would overinflate the N and down the p-value. It would be more appropriate to plot and compare animal averages between conditions or use a test that can account for the fact that there are repeated measures taken from the same animal. Additionally, the authors note a decrease in VGlut1+ puncta in the global CD38 KO but no change in the protein levels in both the global and cKO.

      Synapse analysis in vitro: The authors are missing key experimental controls for their analysis of synapse induction by astrocyte conditioned media. Firstly, the authors do not include a condition of neurons cultured alone without astrocytes or astrocyte conditioned media treatments. This is critical to this experiment because, without this control, it is impossible to assess the effectiveness of the astrocyte conditioned media or any recombinant protein treatments on synapse formation. Secondly, the authors give very few details and no supporting data about the purity of their neuronal cultures. This is critical to this experiment because any contaminating astrocytes in their cultures could severely skew the data for any given condition. Finally, the authors do not specify how they determined the doses for astrocyte conditioned media and Hevin treatments. The researchers give no details on how the astrocyte conditioned media was collected or treated before adding onto neurons. For this experiment to be viable, the researchers must collect the conditioned media in serum-free media, determine the protein concentration of their samples, and the dose-response to the astrocyte conditioned media must be performed to determine the optimal dose for each batch. When comparing between genotypes, this type of quality control is critical to assess whether there is, in fact, a difference in their synaptogenic capacity.

      Western blots: All western blot quantification of astrocyte conditioned media should include total protein normalization. The authors do not describe how they normalize the astrocyte conditioned media blots, but without a total protein stain to normalize, it is impossible to be sure the same amount of protein was loaded into the gel for each lane. In Figure 3L, the western blot data showing the expression of VGluT1 and PSD95 should be improved, and a better representation is recommended. It is also strange that the CD38 cKO has no expression because CD38 is also expressed in endothelial cells. Why not isolate astrocytes from CD38 KO? Also, for VGluT1 and PSD95 western blots, it would be better to test mPFC lysates rather than whole cortical lysates. Astrocyte morphogenesis: Since the astrocyte-specific deletion of CD38 from P10 impairs postnatal development of astrocytes, the authors should investigate if the impaired synaptogenesis seen in later stages is due to impaired astrocyte morphogenesis or the defect in the secretion of synaptogenic proteins like Sparcl1/Hevin or thrombospondins.

      Mass spectrometry: There is no information about how many samples were used for mass spectrometry. This reviewer is concerned that this experiment may be underpowered given that other published datasets have identified significantly more proteins in wild-type ACM (about double than what was identified here). There needs to be a quality assessment of the ACM to help ensure the production protocol can capture the full extent of proteins secreted by cultured astrocytes.

      RNA sequencing: RNA sequencing results seem underpowered, and an accurate description of their collection methods is missing. It also seems to this reviewer that any prolonged culturing of the astrocytes would lead to additional transcriptional changes independent of their genetic manipulation. To avoid confounds due to culture artifacts, it might be cleaner to FACS sort astrocytes using a fluorescent reporter such as the Aldh1l1-eGFP line or RTM in their GLAST-creERT2 model. In the latter case, this could also provide data on the specificity of their recombination, which is lacking elsewhere in the manuscript.

      Comparison between astrocyte-specific cKO and global KO: Considering the abundant expression of CD38 in astrocytes compared to other cell types in the brain, I am wondering whether the comparison between the current astrocyte-specific CD38 cKO and the previous constitutive CD38 KO mice would provide a different phenotype with respect to its importance in synaptic function in neural circuits that mediate social behaviors in various brain regions. The authors note the importance of CA1, CA2, and NAC in social memory, but they only assessed synapses in mPFC. Multiple studies, including one from the authors, have reported that constitutive CD38 KO mice exhibit impaired social behaviors. Expanding beyond what is already known would require better spatial and temporal regulation of CD38 expression than presented here.

      Rescue experiments: The authors claim that reduced levels of Hevin secretion are responsible for reducing intracortical synapses in mPFC and the inability of their CD38 KO ACM to stimulate synapse formation. However, Hevin has primarily been linked to the formation of VGluT2+ synapses with only a transient effect on VGluT1+ synapses. Furthermore, Hevin's synaptogenic effect in astrocyte conditioned media is masked by its homolog Sparc. To claim that Hevin is responsible for reducing VGluT1+ synapses in mPFC the authors need to do a rescue experiment by expressing hevin in CD38 KO through AAVs brains or intracortical injections of recombinant Hevin.

      Other synaptogenic factors: The authors focus on Sparcl1/Hevin; however, other synaptogenic factors have been reported to affect VGluT1+ excitatory synapse formation and development directly. Notably, thrombospondins have been shown to regulate the formation of this specific synapse type through their receptor a2d1. The authors do not report any investigation into this family of factors despite their clear link to VGluT1+ synapse development.

      Effect of CD38 cKO on astrocyte numbers: The authors note that CD38 cKO alters GFAP expression; however, they also report a decrease in the number of GFAP+ and S100ꞵ+ cells without a change in NDRG2+ cells. The authors should address this discrepancy in astrocyte numbers with additional known markers such as Sox9.

      MBP quantification: The authors previously reported changes in MBP expression and oligodendrocyte maturation in the global CD38 KO animals. However, there is no quantification of the MBP staining in the cKO in supplementary figure 1. It would be important to verify that white matter structures developed properly in their cKO model, especially in mPFC.

      Minor Concerns:

      1. SPARCL1 annotation should be Sparcl1.
      2. Avoid repetition of the same sentences in multiple places. E.g., The sentence- "Social behavior is essential for the health, survival, and reproduction of animals" is repeated both in the abstract and introduction.
      3. The introduction needs to be thoroughly revised. In the first paragraph, a description of various studies(Fmr/Mecp2) which indicated the importance of synaptic function in neural circuits that mediate social behaviors in various brain regions could be presented later part of the introduction in a very concise manner since the article doesn't cover anything related to these genes. This part can be presented along with the narration of CD38, where authors described its importance in social behavior. Introduce the importance of social behavior and their behavioral paradigm, especially what social memory is and what brain regions are important for it.
      4. Introduction feels too short and abrupt.
      5. In Figures 2 and 3, Are the spine numbers/density/synapses affected in the p42 ctrl/CD38 AS-cKO group compared to the p10 ctrl/CD38 AS-cKO group?
      6. In Figure 2; The authors should compare both the behavioral phenotype seen in two different tamoxifen injection/time points with the respective constitutive CD38 KO mice data.
      7. In Figures 3 and 4, the authors should analyze the spine numbers/density both in WT or CD38 KO ACM treated experiments and Sparcl1 KD/Sparcl1 treated rescue experiments?
      8. The discussion section needs to be revised to reflect better the conclusions drawn from the data without overstatement.

      Significance

      Understanding the mechanisms underlying control of behaviors is important and linking non-neuronal cell types to behavioral processes is novel and timely. However, the study at its current state lacks important controls, and interpretations are overstated and often too targetted to a favorite mechanism. These concerns limit the impact of the study and reduces its significance.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      RC-2022-01245 Willemsen et al., 2022

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Willemsen et al studied the contribution of PSMB4 and PSMB8 on proteasomal activity in adipocyte tissue/cells. Mutations in PSMB4/8 have been associated with metabolic diseases that lead to inflammation and proteotoxicity. They used mice and murine cell lines to assess the abundance and activity of the proteasome as well as the stress response upon depletion of psmb4 and regulatory factors. The study is interesting and could provide new insight into the role of specific proteasomal subunits of the immunoproteasome on the metabolism and associated diseases. However, there are a number of issues on the presented data that should be addressed. See below.

      RESPONSE: We thank the reviewer for her/his positive remarks.

      Major comments:

      1. Most of the presented data are secondary data (graphs) and the differences between the experimental conditions (e.g. siRNA) are sometimes only minor and statistical analyses lacking or only indicated with a letter (a, b,..), but its meaning not explained. RESPONSE: We apologize if some of the statistics were difficult to see. Using letter designations for groups of indifference declutters the figure compared to using asterisks to indicate significant differences between groups. We have now made sure the statistical analyses are emphasized in the methods and figure legends.

      The authors should introduce all tested marker genes e.g. the genes that were analyzed in figures 2H/I, 3D-F, 4A/D. What was the hypothesis and do they represent all or a selected set of genes of the integrated stress response?

      RESPONSE: We apologize if the relevance of these markers was left unclear. We have now introduced the marker genes in the text.

      Figure 1A: RNA levels were analyzed yet not protein levels. Why not?

      RESPONSE: As we performed loss of function experiments later anyway, we decided not to venture into more descriptive analyses. The fact that Psmb4 and Psmb8 are robustly expressed in adipocytes was enough for us to justify studying their function further.

      Figure 2C: This figure is problematic. Apart from the smear on the right side, a loading control is missing. This is essential to quantify signal intensities. Moreover, on the left side, the intensities of lanes 1 and 2 are very different, yet are both controls and were used for the conclusion that proteasomal activity is reduced upon siPsmb4 (lanes 3 and 4 - that do not differ from lane 2). In addition: from which day were the data collected? This information is missing, but important as Figure 2E shows opposing proteasomal activity on day 3 and day 5.

      RESPONSE: We agree with the reviewer that the duplicate of the scrambled control cells showed variation. Therefore, we have now repeated the experiment and replaced the figure (now figure 2A). The outcome is not affected, as knockdown reduces proteasomal activity and leads to abnormal proteasome formation in the native PAGE. Regarding the internal loading, it is common practice to display both in-gel activity and immunoblot on the membrane as is. For recent examples in the literature please see VerPlank et al., PNAS 2019 (10.1073/pnas.1809254116) or Yazgili et al., Cell Press STAR Protocols 2021 (10.1016/j.xpro.2021.100526). Obviously, the same amount of protein was loaded, and this is also seen in the immunoblot. As this is a native PAGE, there is no beta-tubulin or other commonly used loading controls for immunoblots. Furthermore, we apologize for the missing information, this experiment was performed using day 5 mature cells. This information is now included in the figure legend.

      Figure 2D: tubulin was used as loading control, yet the signal of tubulin in lane 1 is by far weaker compared to the other lanes. How does that affect the quantification (missing) of Nfe2I1?

      RESPONSE: We have now included the quantifications, which do not affect the outcome of the experiments and the conclusions drawn.

      Figure 3C and Fig 2E (control vs siPsmb4) contradict each other. Please clarify.

      RESPONSE: We respectfully point out that the reviewer might have overlooked that 2E shows the time course and 3C only shows day 5 data – both in 2E and 3C, day 5 total proteasomal activity is (insignificantly) increased. Hence, the panels do not contradict each other.

      Figure 3B: Issues with the loading control: tubulin signals are in first 3 lanes much weaker. Where is the quantification for that data set that takes the fluctuations of the tubulin signals into account?

      RESPONSE: We have now included the quantification, which does not affect the outcome of the experiments and the conclusions drawn. The quantifications can be found in figure S3.

      Minor comments:

      Why did the authors not use human adipocyte cells and performed all experiments in murine cells?

      RESPONSE: The advantage of the cell lines used lies in the ability to study both brown and white features as well studying aspects of adipogenesis and thermogenesis simultaneously. Based on this comment and the comment of reviewer #2, we have reproduced our findings in 3T3-L1 adipocytes, in the hope of strengthening our study. These data are shown in the new Supplementary figure 4.

      In which cell/tissue is Psmb4 expressed?

      RESPONSE: Thank you for this question, we have now measured Psmb4 expression in a panel of mouse tissues. As shown in figure S1, Psmb4 is ubiquitously expressed in all tissues measured with the highest levels in kidney and liver, followed by brown fat.

      Figure 4G: information on the different colors is missing.

      RESPONSE: Thank you for bringing this to our attention. We have now included a legend.

      The result section appears to have been restructured as sections do not build up on each other well. This should be corrected.

      RESPONSE: We appreciate this critical feedback. We have now improved the flow for an enhanced reading experience.

      There are a number of grammatical errors or doubling of words/phrases e.g. bottom of page 1: "In addition, PRAAS patients display suffer from..." or on page 2: "Adapting proteasomal activity to the needs of the UPS..." This statement does not make sense. Maybe the authors mean "proteolytic demands"?

      RESPONSE: Thank you, we have fixed the remaining typos.

      Although, the UPS is a proteostasis node, the authors should avoid statements such as "We show that proteostasis and lipid metabolism are intricately linked..." Better is "UPS activity and lipid metabolism..." Or the authors should expand their analysis to protein synthesis, folding and additional clearance pathways.

      RESPONSE: Thank you, we have specified our statement regarding proteostasis and UPS.

      Reviewer #1 (Significance (Required)):

      This study links clinical research with basic science and if the authors address the above mentioned issues this work will provide new insight into the role of the UPS in the lipid metabolism.

      target audience: clinical scientist on lipid metabolism and basic researchers on the UPS and associated pathologies

      my expertise: UPS

      RESPONSE: We are very thankful for these positive concluding remarks.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The molecular mechanism by which proteasome mutations cause lipodystrophy in PRAAS, which is caused by proteasome dysfunctions, has not been well understood. However, it was shown that Psmb4 (β4), a component without enzymatic activity, is required for the formation and maintenance of adipocyte function. In the proteasome dysfunction state of Psmb4 deficiency, the expression of Nfe2l1 was enhanced for proteostasis, but it could not complement the adipocyte formation defect. We showed that repression of Arf3, which is associated with stress response and is markedly expressed in this situation, resulted in the recovery of inflammation and adipogenesis.

      Major comments

      1. The Graphical abstract seems to indicate that Loss of PSMB4 activates NFE2L1 and ATF3, resulting in the suppression of proteotoxicity and Inflammation. In the case of NFE2L1, this is correct, but in the case of ATF3, as shown in Fig. 4D, ATF3 acts in a promotive manner on Inflammation, causing misleading to the reader. RESPONSE: Thank you, we agree that this aspect of the graphical abstract was partially misleading. We hope the new version now makes more sense.

      Fig. 2 shows the rise of Nfe2l1 and the restoration of proteasomal capacity on Day 5 (Fig. 2E). [Nfe2l1 is cleaved, and initiates the transcription of proteasome subunits, which results in restoration or heightening of proteasomal capacity (16,17).] It is known that the brown adipocyte mount an adaptive response to overcome UPS dysfunction, and the transcription of proteasome subunits was increased in this experimental system. However, there are no results showing that the transcription of proteasome subunits is actually increased in this experimental system.

      RESPONSE: Thank you for pointing this out. In Psmb4 KD cells, we see an increase in Psmd2 protein levels (Fig. 3B). In addition, we see a small increase in expression levels of various proteasome subunits. We have now included a graph showing these expression levels (Fig 3C).

      Are Psmb4KO mice available? If yes, are there any symptoms? Is there any change in proteasome activity, etc.?

      RESPONSE: We do not have a Psmb4 KO mouse model, yet, and to the best of our knowledge, none is available. We agree with the reviewer that it would be insightful to study Psmb4 in an in vivo model, but in this project, we have used a cell model to study the cell intrinsic mechanisms of Psmb4.

      4A. In PRAAS patients, most of the lipodystrophy occurs in white adipocytes, but if Psmb4 deficiency is induced in white adipocytes, do they show the same dynamics?

      RESPONSE: Thank you for your stimulating question. We have repeated our Psmb4 KD experiments in 3T3-L1 cells, to study the dynamics in a white adipocyte model. We found that also in 3T3-L1 cells, Psmb4 knockdown disrupts adipogenesis. The results can be found in Supplemental figure 4.

      4B. The first mention of heat production was made, but it was not clear how much the patient's cyclic fever symptoms were related to changes in brown adipocyte function.

      RESPONSE: Our data suggest that aberrant brown adipose tissue does not contribute to cyclic fevers in PRAAS patients. We elaborate on this in the Discussion.

      minor points

      fig2; The numbering of the figure is not correct.

      RESPONSE: Thank you, we have corrected the error.

      Fig. 2: (day 5) in the figure legend of (E) is unnecessary.

      RESPONSE: Thank you but based on the other reviewers’ questions we think it is important to indicate the stage of the differentiation.

      Fig. 4 The figure legend in (A) and (B) are switched.

      RESPONSE: Thank you, we have corrected the error.

      In Fig. 4 (G), there are n = 8 and n = 6 in the figure legend, which is difficult to understand.

      RESPONSE: Thank you, we have corrected the error.

      Reviewer #2 (Significance (Required)):

      It has been thought that the accumulation of defective proteins caused by proteasome dysfunction stresses cell metabolism and leads to lipodystrophy, but the detailed mechanism has not been understood. In this paper, we have clarified a part of the mechanism that links the accumulation of ubiquitinated proteins caused by proteasome dysfunction to the disruption of proteostasis, inflammation and adipogenesis. The results of the study, which showed the relationship between intracellular proteostasis, inflammation and lipid metabolism, will help us understand not only PRAAS patients but also abnormal lipid metabolism, obesity, induction of inflammation, and chronic inflammation with persistent inflammation.

      RESPONSE: We are very thankful for these positive concluding remarks.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this study Willemsen et al. investigated the role of proteasome subunit beta 4 and 8 (PSMB4/8) in immortalized brown (pre)adipocytes regarding adipogenesis ability, inflammation, function, and proteostasis. The group showed that Psmb4/8 are expressed in brown adipose tissue and adipocytes but that they are differently regulated. The loss of PSMB8 had no effect on brown adipose tissue/adipocyte function. In contrast, knock-down of PSMB4 altered proteostasis, which was partially compensated by NFE2L1, as well as reduced adipocyte differentiation, lipid accumulation, and beta adrenergic-stimulated glycerol release in immortalized brown adipocytes. The group further demonstrated that the effect of PSMB4 knock-down on impaired adipogenesis was mediated via Atf3 activation.

      The manuscript is well-written with clearly structured text and figures. The data and methods are presented in a way that makes it easy to reproduce the experiments. The statistical analysis is adequate.

      Some suggestions:

      1. Since you stated in 3.1. Result section that Psmb4/8 are robustly expressed in BAT, it would be interesting to directly compare the expression of Psmb4/8 in BAT. RESPONSE: We thank the reviewer for this interesting suggestion. The comparison is now included in figure S3.

      Please normalize the glycerol release to protein content (Fig 2J, 4F). It would be also interesting to show whether the reduced glycerol release is due to reduced TG content and/or lipolytic activity. Therefore, you should determine the expression of lipases (e.g. Atgl, Hsl) in adipocytes.

      RESPONSE: We thank the reviewer for this interesting suggestion. We have looked at the expression of lipases. Psmb4 knockdown did not alter the expression of lipases, which indicates that the reduced glycerol release is rather due to reduced TG content than due to the absence of lipases. The comparison is now included as Figure 4G. For the glycerol assays, we have normalized glycerol release to protein content. Comparing an undifferentiated (in this case the cells with silencing of Psmb4) vs differentiated cells (in this case the scrambled siRNA control cells) will result in many fundamental differences. Specifically, the lipid to protein ratio is very different, much higher in mature adipocytes, obviously. This obscures some if the differences in lipolysis when glycerol release is normalized to protein levels. Therefore, we have included a figure, in which we show the fold change. Interestingly, this way in the Psmb4 knockdown cells, it is evident that they become refractory to norepinephrine stimulation, and this is rescued when Atf3 is silencing, too.

      Please define early/late transfection - on which day of differentiation was Psmb8 silenced? (Fig S2)

      RESPONSE: Psmb8 was silenced on day(-1). We have now added this information.

      Reviewer #3 (Significance (Required)):

      This study clearly demonstrated that proteasome dysfunction via impaired PSMB4 action modulates brown adipocytes differentiation, function, and health. In this study a novel link between dysfunctional proteostasis and impaired lipid metabolism was identified.

      RESPONSE: We are very thankful for these positive concluding remarks.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this study Willemsen et al. investigated the role of proteasome subunit beta 4 and 8 (PSMB4/8) in immortalized brown (pre)adipocytes regarding adipogenesis ability, inflammation, function, and proteostasis. The group showed that Psmb4/8 are expressed in brown adipose tissue and adipocytes but that they are differently regulated. The loss of PSMB8 had no effect on brown adipose tissue/adipocyte function. In contrast, knock-down of PSMB4 altered proteostasis, which was partially compensated by NFE2L1, as well as reduced adipocyte differentiation, lipid accumulation, and beta adrenergic-stimulated glycerol release in immortalized brown adipocytes. The group further demonstrated that the effect of PSMB4 knock-down on impaired adipogenesis was mediated via Atf3 activation.

      The manuscript is well-written with clearly structured text and figures. The data and methods are presented in a way that makes it easy to reproduce the experiments. The statistical analysis is adequate.

      Some suggestions:

      Since you stated in 3.1. Result section that Psmb4/8 are robustly expressed in BAT, it would be interesting to directly compare the expression of Psmb4/8 in BAT. Please normalize the glycerol release to protein content (Fig 2J, 4F). It would be also interesting to show whether the reduced glycerol release is due to reduced TG content and/or lipolytic activity. Therefore, you should determine the expression of lipases (e.g. Atgl, Hsl) in adipocytes.

      Please define early/late transfection - on which day of differentiation was Psmb8 silenced? (Fig S2)

      Significance

      This study clearly demonstrated that proteasome dysfunction via impaired PSMB4 action modulates brown adipocytes differentiation, function, and health. In this study a novel link between dysfunctional proteostasis and impaired lipid metabolism was identified.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The molecular mechanism by which proteasome mutations cause lipodystrophy in PRAAS, which is caused by proteasome dysfunctions, has not been well understood. However, it was shown that Psmb4 (β4), a component without enzymatic activity, is required for the formation and maintenance of adipocyte function. In the proteasome dysfunction state of Psmb4 deficiency, the expression of Nfe2l1 was enhanced for proteostasis, but it could not complement the adipocyte formation defect. We showed that repression of Arf3, which is associated with stress response and is markedly expressed in this situation, resulted in the recovery of inflammation and adipogenesis.

      Major comments

      1. The Graphical abstract seems to indicate that Loss of PSMB4 activates NFE2L1 and ATF3, resulting in the suppression of proteotoxicity and Inflammation. In the case of NFE2L1, this is correct, but in the case of ATF3, as shown in Fig. 4D, ATF3 acts in a promotive manner on Inflammation, causing misleading to the reader.
      2. Fig. 2 shows the rise of Nfe2l1 and the restoration of proteasomal capacity on Day 5 (Fig. 2E). [Nfe2l1 is cleaved・・・・, and initiates the transcription of proteasome subunits, which results in restoration or heightening of proteasomal capacity (16,17).] It is known that the brown adipocyte mount an adaptive response to overcome UPS dysfunction, and the transcription of proteasome subunits was increased in this experimental system. However, there are no results showing that the transcription of proteasome subunits is actually increased in this experimental system.
      3. Are Psmb4KO mice available? If yes, are there any symptoms? Is there any change in proteasome activity, etc.? In PRAAS patients, most of the lipodystrophy occurs in white adipocytes, but if Psmb4 deficiency is induced in white adipocytes, do they show the same dynamics? The first mention of heat production was made, but it was not clear how much the patient's cyclic fever symptoms were related to changes in brown adipocyte function.

      Minor points

      fig2; The numbering of the figure is not correct.

      Fig. 2: (day 5) in the figure legend of (E) is unnecessary.

      Fig. 4 The figure legend in (A) and (B) are switched.

      In Fig. 4 (G), there are n = 8 and n = 6 in the figure legend, which is difficult to understand.

      Significance

      It has been thought that the accumulation of defective proteins caused by proteasome dysfunction stresses cell metabolism and leads to lipodystrophy, but the detailed mechanism has not been understood. In this paper, we have clarified a part of the mechanism that links the accumulation of ubiquitinated proteins caused by proteasome dysfunction to the disruption of proteostasis, inflammation and adipogenesis. The results of the study, which showed the relationship between intracellular proteostasis, inflammation and lipid metabolism, will help us understand not only PRAAS patients but also abnormal lipid metabolism, obesity, induction of inflammation, and chronic inflammation with persistent inflammation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Willemsen et al studied the contribution of PSMB4 and PSMB8 on proteasomal activity in adipocyte tissue/cells. Mutations in PSMB4/8 have been associated with metabolic diseases that lead to inflammation and proteotoxicity. They used mice and murine cell lines to assess the abundance and activity of the proteasome as well as the stress response upon depletion of psmb4 and regulatory factors. The study is interesting and could provide new insight into the role of specific proteasomal subunits of the immunoproteasome on the metabolism and associated diseases. However, there are a number of issues on the presented data that should be addressed. See below.

      Major comments:

      • Most of the presented data are secondary data (graphs) and the differences between the experimental conditions (e.g. siRNA) are sometimes only minor and statistical analyses lacking or only indicated with a letter (a, b,..), but its meaning not explained.
      • The authors should introduce all tested marker genes e.g. the genes that were analyzed in figures 2H/I, 3D-F, 4A/D. What was the hypothesis and do they represent all or a selected set of genes of the integrated stress response?
      • Figure 1A: RNA levels were analyzed yet not protein levels. Why not?
      • Figure 2C: This figure is problematic. Apart from the smear on the right side, a loading control is missing. This is essential to quantify signal intensities. Moreover, on the left side, the intensities of lanes 1 and 2 are very different, yet are both controls and were used for the conclusion that proteasomal activity is reduced upon siPsmb4 (lanes 3 and 4 - that do not differ from lane 2). In addition: from which day were the data collected? This information is missing, but important as Figure 2E shows opposing proteasomal activity on day 3 and day 5.
      • Figure 2D: tubulin was used as loading control, yet the signal of tubulin in lane 1 is by far weaker compared to the other lanes. How does that affect the quantification (missing) of Nfe2I1?
      • Figure 3C and Fig 2E (control vs siPsmb4) contradict each other. Please clarify.
      • Figure 3B: Issues with the loading control: tubulin signals are in first 3 lanes much weaker. Where is the quantification for that data set that takes the fluctuations of the tubulin signals into account?

      Minor comments:

      • Why did the authors not use human adipocyte cells and performed all experiments in murine cells?
      • In which cell/tissue is Psmb4 expressed?
      • Figure 4G: information on the different colors is missing.
      • The result section appears to have been restructured as sections do not build up on each other well. This should be corrected.
      • There are a number of grammatical errors or doubling of words/phrases e.g. bottom of page 1: "In addition, PRAAS patients display suffer from..." or on page 2: "Adapting proteasomal activity to the needs of the UPS..." This statement does not make sense. Maybe the authors mean "proteolytic demands"?
      • Although, the UPS is a proteostasis node, the authors should avoid statements such as "We show that proteostasis and lipid metabolism are intricately linked..." Better is "UPS activity and lipid metabolism..." Or the authors should expand their analysis to protein synthesis, folding and additional clearance pathways.

      Significance

      This study links clinical research with basic science and if the authors address the above mentioned issues this work will provide new insight into the role of the UPS in the lipid metabolism.

      target audience: clinical scientist on lipid metabolism and basic researchers on the UPS and associated pathologies

      my expertise: UPS

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC- 2021-01102

      Corresponding author(s): Rita Tewari; Mohammad Zeeshan

      1. General Statements [optional]

      We wish to thank the reviewers and the Editor for their constructive comments and valuable suggestions to improve our manuscript. We have addressed as far as possible all comments and concerns and we hope that this revised manuscript, with additional new data, will be acceptable for publication. Please find below detailed responses (in italicized red text) to all specific points raised by the reviewers.

      2. Point-by-point description of the revisions

      This section is mandatory. Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The study led by Dr. Zeeshan analyzes nine mouse Plasmodium parasites kinesin by determining their expression pattern and subcellular location in various stages of the parasites in the mammalian and mosquito host. The genetic and phenotypic analyses of all nine kinesins indicate that most are critical for parasite development in the mosquito host, except for Kinesin 13 being the only kinesin essential during the intraerythrocytic development cycle. The authors presented an in-depth analysis on kinesin 13 and 20, using an impressive pallet of molecular techniques such as promotor swapping, chromatin immunoprecipitation, and global transcriptomic analysis using RNAseq, as well as numerous microscopy techniques such as live fluorescence imaging, expansion microscopy, and electron microscopy. This comprehensive study provides an outstanding amount of data on Kinesins in Plasmodium parasites that would be best showcased with a rethinking of the manuscript structure and a more insightful discussion section that directed most of my comments in the review the manuscript. I believe no additional experiments are needed assuming that the authors will link Kinesin 13 and or 20 to the IMC formation in future work.

      Authors’ response: We are pleased that the reviewer likes this comprehensive study and believes that no additional data are required. We have now reorganized the manuscript with more focus on kinesin 13 and kinesin 20.

      **Major Comments:**

      •The current manuscript shows the " Location and function of Plasmodium kinesins" as the title suggests; however, I strongly recommend the authors consider alternative storytelling focusing on Kinesin 13 and 20.

      Authors’ response: We thank the reviewer for this recommendation. We have changed both the organization of the text and the title of the manuscript to focus on kinesin-13 and -20. The new title of the manuscript is “Key roles for kinesin-13 and kinesin-20 in malaria parasite proliferation, polarity, and transmission revealed by genome-wide functional analysis”.

      The author provides in-depth phenotypical analysis resulting in the most innovating and exciting data. In addition, the discussion section from lines 592 to 634 was fascinating compared to the following section (see details comments for Discussion section below). Authors’ response: We are very pleased that the reviewer considers the phenotypic analysis to be the most innovative and exciting data, and that they appreciated the discussion section of lines 592 to 634.

      The following significant comments are related to figures where I believe a restructuration is most needed to bring clarity to the paper.”

      •Figure 1. I suggest the authors move Figure 4A to figure 1; Figure1C should move to supplementary information

      Authors’ response: As suggested by the reviewer, old Figure 4A is now moved to Figure 1(as Figure 1D) and old Figure 1C is moved to supplementary information (as S3 Fig)

      except for Kinesin 13 and 20 data to center the paper's focus on these two proteins

      Authors’ response: The kinesin-13 and -20 data are now given prominence, as Figures 3 and 4 (kinesin-20) and Figures 5, 6, and 7 (kinesin-13).

      I would also present the kinesin data in the current Figure4A not by numeric order but by biological relevance. All the "normal" together and so on

      Authors’ response: We thank the reviewer for this suggestion; in Figure 1D (previously 4A) the data are now presented in the order of biological relevance.

      •Figure 2: Kinesin 5 and 8X have the same results. I suggest the authors present only one in the same manuscript and place the other one in Supplementary information.

      Authors’ response: kinesin-5 data are now part of S6 Fig in supplementary information and only kinesin-8X data are retained as part of Figure 2.

      I would recommend adding the little schematic used in Figure1C to help the reader quickly identify the parasite stages presented in the figures

      Authors’ response: A schematic is included in Figure 2C for clarification, as suggested.

      •Figure4: Panels B to E should be a supplementary information

      Authors’ response: These panels have now been moved to supplementary information as S5 Fig.

      •Figure 5: Panels H to J should be supplementary information

      Authors’ response: Panels H to J have been moved to supplementary information as part of S8 Fig.

      and I strongly recommend the authors to present data by stages; therefore, I would remove panels F and G and replace them with Figure 6A, the expansion microscopy represents the data in Figure 4B, C, D, and E beautifully

      Authors’ response: we thank the reviewer for this suggestion; expansion microscopy data are now incorporated into the new Figure 3, and the old panels F and G are now part of S8 Fig in the supplementary information.

      •Figure 6B: It is challenging to identify the layout between WT and delta-kinesin 20. All annotations on the EM data cover the data itself. I recommend drawing a representative schematic to guide the reader for identification of ultrastructure

      Authors’ response: we have now included a schematic diagram as Figure 4B, to highlight the key ultrastructural features and facilitate their identification by the reader.

      •Figure 8: Panel C and D should be supplementary information and replaced by the accurate colocalization data of Kinesin 13 presented in Supp figure 5

      Authors’ response: the kinesin-13 colocalization data are now in Figure 5, and the previous Figure 8 Panels C and D have been moved to supplementary information.

      In addition, comment line 442 is also actual for the ookinete. The true colocalization is with tubulin in male gamete and gametocytes in figure 5A/B

      Authors’ response: We agree with the reviewer; the colocalization data with tubulin in male gamete and gametocytes are now presented in Figures 6A and B.

      Figure 9: Panel F to J go to supplementary information and replace with the data in figure 10

      Authors’ response: We understand the reviewer’s concerns, however, we would like to include these data in the main figure because they provide important information on the differential regulation of transcripts involved in axoneme biogenesis and chromosome dynamics.

      Figure 10: Could be a great abstract figure in the current state. As a model figure, I would recommend incorporating more details

      Authors’ response: We have removed this figure (and therefore there are now seven rather than ten figures in the revised manuscript). We would, of course, be happy to use it as part of an abstract if required.

      **Minor Comments:**

      I will address my following minor comment by Line number rather than section:

      Figure 1C: It is unclear if the black square is an actual picture or a black square. I would suggest the authors present the absence of data by a white square or a bar

      Authors’ response: For this figure (now S3 Fig, previously Figure 1C), we have added the scale bars on the dark squares to indicate that these are actual pictures that show the absence of signal.

      Line 96: " a final synchronized round of S-phase" The classical mitotic terminology is poorly used in the field of Plasmodium mitosis due to the absence of canonical cell cycle checkpoint. I would recommend the authors rephrase as " a final synchronized round of DNA replication."

      Authors’ response: We thank the reviewer for this suggestion. We have now deleted this sentence as part of an effort to make the introduction more concise.* *

      Line 149-151: Could the authors indicate what stage of the life cycle the work was done?

      Authors’ response: We now indicate the stages in line number 127.

      Line 161: Missing space between the word "parasite and cell"

      Authors’ response: We have deleted this sentence while revising the introduction to make it more concise.

      Line 163: " These findings will inform a strategy ..." Could the authors explain in greater detail how the study is informative for targeting MT motors for therapeutic. I would argue with the authors that it is an overstatement since the paper did not provide structural data on kinesin as a foundation for drug discovery.

      Authors’ response: The sentence is now modified to remove this overstatement, in lines number 134-136.

      Line 368: What was the reasoning for examining whether other kinesin genes' expression is misregulated in delta Kinesin 20?

      Authors’ response: The main reason was the expression of other kinesins expressed in the cytoplasmic compartment of ookinete stages, such as kinesin-X3 and kinesin-13; and kinesin-13 that has a key role in MT organization during ookinete development. Therefore, we expected that the expression of other kinesins including kinesin-13 may be coordinated with that of kinesin-20 and modulated in the kinesin-20 knockout. We have added a sentence for clarification, lines 332-336.

      Line 515: Could the authors define what is a nuclear pole?

      Authors’ response: Nuclear pole is a synonym for spindle pole, which is in general usage with reference to electron micrographs. It serves as a microtubule-organizing center (MTOC) for mitotic spindles.* *

      Line: 576 - 579: The authors mention the absence of the IFT component for flagellum assembly due to the assembly of the axoneme in the cytoplasm. It is known that kinesin-2 is required for the anterograde transport in organism building cilia and flagella using IFT. In the current study, kinesin 2 is not part of the nine kinesins; therefore, it is unclear why the authors made these comments and did not reflect on them. I would suggest removing it or comment it.

      Authors’ response: it is well-established that axoneme assembly in Plasmodium gametocytes occurs in the cytoplasm, which does not require IFT, and the absence of a kinesin-2 gene is consistent with that process. In contrast, the location of kinesin-8B, kinesin-X4, and kinesin-13 suggests that they are involved in this non-canonical axoneme assembly. For clarification, we have added a sentence at line numbers 521-522.

      Line 546-560: this entire section of discussion would be best in a review paper. It is a well-written summary of the current literature with no discussion related to the data on the present study; therefore, I suggest the authors remove it from the discussion.

      Authors’ response: This section is now largely removed from the discussion except for a few relevant sentences at lines 509-515.

      Line 561 – 571: Great summary of the Kinesin-13 work without discussion.

      Authors’ response: This part (now at line numbers 595-602) has now been modified so that it is more relevant to the discussion.

      Line 572: What do the authors mean by " these findings"?

      Authors’ response: We have explained the meaning of “these findings” (line number 603).

      Line 573 – 589 (assuming 673-689): The authors miss the opportunity to elaborate on how the depletion of kinesin protein could impact the global transcriptome. Are we looking at downstream effects? I strongly recommend the authors resolve the lack of discussion related to the RNAseq data in the study.

      Authors’ response: we have now improved the discussion of the transcriptome data (line numbers 610-613).

      Reviewer #1 (Significance (Required)):

      This study is a tremendous amount of work done rigorously and will advance our knowledge in the biology of Plasmodium parasites. We are in urgent need to develop innovative ways to block the replication and transmission of Plasmodium spp. and it can happen only through advancing our knowledge in the basic biology of the parasite.

      Authors’ response: We thank the reviewer for their positive and encouraging comments.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      **Summary:** In this study, Zeeshan et al used live-cell imaging, ultrastructure expansion microscopy, and electron microscopy, gene deletion, genetic knockdown, RNA-seq, ChIP-seq analyses, and matrigel substrate to examine the subcellular localization and the function of Plasmodium kinesins throughout the P. berghei life cycle. They find that Kinesin-13 is the only kinesin essential for both asexual blood stages and sexual stages.

      This manuscript represents a lot of work by the authors. The data appear rigorous and well-executed. The data are clearly presented and the writing is clear. I have only minor comments that may improve the reader's comprehension.

      Authors’ response: We thanks the reviewer for their positive appreciation of our work.

      **Major comments:**

      Figure 2C:

      The ChIP-seq experiments examined the kinesin-5 and -8x binding site at the chromosome at 6 mpa. Did the authors do any tests at other time points post-activation?

      Authors’ response: We sampled only at 6 mpa because at this time point the expression of kinesin-5 and -8X is high, facilitating the ChIP-seq analysis using anti-GFP antibodies. We now include additional ChIP-seq data in S6 Fig.* *

      Figure 4:

      The authors conclude that kinesin-x3 and kinesin-x4 are non-essential for the P.berghei life cycle. Does deletion of kinesin-x4 affect the length of the flagella?

      Authors’ response: We observed no obvious change in the length of flagella after these deletions.

      Oocyte size: To the non-specialist, it is difficult to reconcile the images in panel E with the conclusions in panel A. Based on the images, it looks like only knocking out of kinesin-8x seems to affect oocyst size. Can the authors clarify and provide graphs of the quantification of oocyte size?

      Authors’ response: We agree with the reviewer that only the knockout of kinesin-8X affects oocyst size. Similar data, obtained using live-cell fluorescence imaging and electron microscopy, have been described and discussed earlier in Zeeshan et al, 2019 PLOS Pathogens (PMID: 31600347).

      **Minor comments:**

      line 190: typo, kinesin-x4

      Authors’ response: This typo has now been corrected (line 162).

      Figure 3: what do the arrows mean?

      Authors’ response: Arrows indicate the pellicle and axonemes that are mentioned in the figure legend (current Figures 2A and B).

      Figure 4F:

      1. Typo, scale bar, um.

      Authors’ response: We have corrected this (please see the legend for current S5D Fig).* *

      1. Does deletion of kinesin-5 show a significant difference?

      Authors’ response: Yes, the number of infective sporozoites in salivary glands is significantly reduced following kinesin-5 deletion (as published previously in the manuscript of Zeeshan et al 2020 [PMID: 33154955]).

      Reviewer #2 (Significance (Required)):

      The study provides comprehensive information on the diverse subcellular location and functions of P. berghei kinesins throughout the P. berghei life cycle. That is useful to exploit the therapeutic targets against malaria.

      The main findings are that kinesin-13 genetic knockdown affected MT dynamics during spindle formation and axoneme assembly in male gametocytes and subpellicular MT organization in ookinetes. In addition, Kinesin-13 shows different binding to kinetochores during the gametogenesis and ookinete development, suggesting other proteins may regulate kinesin-13 binding to kinetochores at various stages. The underlying mechanism will help to better understand the role of kinesin-13 in the parasite life cycle.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this manuscript the authors show a huge ambition to catalog biological functions of Plasmodium kinesins. This was done by generating transgenic cell lines where kinesins were deleted and/or tagged with GFP that served as a tool to gather as much biological information on each kinesin isoform. On one side I find this manuscript highly impressive in terms of the amounts of data and information. In particular, the cell biology and microscopy results are of high quality and certainly provide useful information to the research community. I am fairly convinced that most results genuinely represents the individual biological aspects of the kinesins in the best possible way. Unfortunately, I have major reservations about the presentation of these results in the compiled manuscript. In my view the authors were overambitious about the volume and diversity of data that they wished to present, which opened a lot of questions about the depth and quality of each of the experimental effort. There is 10 figures which is highly nonstandard for a scientific publication to start with and yet there is, in my view, major gaps in some results descriptions, data presentations e.t.c. Perhaps, because of this huge ambition the data are presented in a highly superficial manner often lacking negative and positive controls. Unfortunately that creates many doubts about the overall quality of the results and as such the interpretations. In my view the authors might be well advised to separate this large body of work into several publications each focusing on more tangible biological problem in the more in-depth manner. This would give the reader (me) better confidence about the validity of the statements made in this manuscript.

      I can give few examples of such discrepancies but cannot account for all.

      1.The authors created GFP-tagged transgenic cell lines for each of the 9 kinesins and generated life cell images for each of the line across multiple stages of the entire plasmodium life cycle. This is an impressive amount of work and data. It is certainly useful to see that in life cell imaging the different kinesins isoforms can be detected in different sets of developmental stages some diffused in the cytoplasm and some associated with the nucleus. Even though these results are impressive, there are based solely on life cell imaging that rely on a certain level of detection limit and GFP visibility. One can imagine that a kinesin may still be expressed in a developmental stage and not detected by life cell imaging.

      Authors’ response: We agree with the reviewer that live-cell imaging has a detection limit for the signal and we cannot rule out the possibility of expression below this limit. We also used immunofluorescence assays (IFAs) to confirm the presence or absence of the proteins at least in the asexual blood- and gametocyte stages. However, our focus was to examine expression by live-cell imaging during the transmission stages, and hence only those data were given in the manuscript.

      I believe that some other detection methods such a western blot, immunoprecipitation e.t.c. should be provided to truly demonstrate that an individual isoform of a kinesin IS of ISNOT expressed. Without that the Figure 1B is overstated.

      Authors’ response: Some western blots to confirm expression of the intact kinesin-GFP fusion protein has been published previously: for kinesin-8X (PMID: 31600347), kinesin-5 (PMID: 33154955), and kinesin-8B (PMID: 31600347). We now provide immunoprecipitation data for six kinesin-GFP fusion proteins, performed using GFP-trap antibody and with identification by mass spectrometry. These results (S2 Fig) clearly show the presence of the respective kinesins fused to GFP in the immunoprecipitates (S2 fig).

      Moreover, the authors claim that the punctuate signal in the nucleus corresponds to spindle. I do not see any supporting evidence for this in this figure.

      Authors’ response: We have previously provided IFA data using anti-tubulin antibodies (for detection of spindle MT) that clearly show co-localization with nuclear kinesins (kinesin-5 and kinesin-8X). For more detailed information please see Zeeshan et al., 2019, PLOS Pathogen (kinesin-8X; PMID: 31600347) and Zeeshan et al., 2020 Front. Cell. Infect. Microbiol (kinesin-5; PMID: 33154955).

      2.For the analysis of kinesin 5 and 8x the authors note two types of experiments. First they created a "cross" between the two cells lines. Second, the authors carry out ChIP-Seq to show that the proteins localize to the centromere. This could be an impressive result unfortunately there is very little if any information about it. Genetic crosses in Plasmodium are not standard techniques that one can assume works all the time. I believe there should be more evidence that the presented images come from a true genetic cross.

      Authors’ response: We now provide images obtained using both single and dual fluorescence in in the same panel, which show the signal of individual kinesins in different cells as well as both signals in one cell (please see S6A Fig). The two lines, one expressing a GFP-tagged protein and the other a mCherry-tagged protein are crossed by feeding gametocytes together to the mosquito where fertilization and genetic recombination takes place. This genetic cross follows Mendelian rules, producing parental single and two recombinant lines (1:2:1 ratio). The lines are not pure clones but contain parasites that express either both or single fluorescence signals.

      The least the authors could show that the florescence signal for both channels come from genuine integrations of the GFP proteins to their target kinesins by PCR or genome sequencing.

      Authors’ response: We have also confirmed the presence of genes for each tagged protein by integration PCR in these crossed lines, and by live-cell imaging, as shown in S6A Fig.

      Similarly for the chip-seq, there is a need to provide much detailed information about the entire results with a particular clarity about the position of the peaks in respect to projected centromeres. In addition the ChIP-Seq analysis should be supported by data along with positive and negative controls to truly show the kinesins associations with the centromeres.

      Authors’ response: We provide the positive and negative controls for the ChIP-seq data (please see S7 Fig). The raw data and further details are deposited in the NCBI Sequence Read Archive with accession number: PRJNA731497.

      1. In the middle part, the author present rater impressive analyses of several kinesin deletion trains and their effect on the development of the mosquito stages. In particular, they demonstrate the effect of kinesin 20 on ookinete development. Yet in the next paragraph they present RNA seq analysis of the kinesin 20 deletion on gametocyte induction, in which kinesis 20 should not have any effect; judging from the presented phenotypic assay. This experiment seems out of context as it is unclear why this assay was done and what is the outcome. The authors identified a small group of differentially expressed genes seemingly unrelated to neither kinesin function nor gametocyte induction. This experiment does not make sense to me in the context of the rest of the paper.

      Authors’ response: We agree with the reviewer about the effect of kinesin-20 deletion on ookinete development and the RNAseq analysis of the gametocyte stage. This is because kinesin-20 expression starts in female gametocytes and continues into later stages including ookinetes. We know that in Plasmodium there is translational repression in female gametocytes, which de-repress only in the zygote after fertilization, leading to translation of many proteins in the zygote. We wished to see whether there was a role of kinesin-20 in translational de-repression. Our transcriptomic data showed no role of kinesin-20 in this process. We have added a sentence for clarification in lines 338 -342.

      Reviewer #3 (Significance (Required)):

      As mentioned above, these three examples represent some of the discrepancies not necessarily about the data quality and fidelity but rather a confusing character of the entire study. From this perspective I have two types of problems with this manuscript. First, while reading this manuscript, lacking key controls and detailed description of some of the analyses, made me loose interest as well as confidence in other parts of the studies which may or may not be solid. Second, I struggle to see the key purpose of the presentation. Instead the manuscript seems to be a compilation of very diverse data some of which are interesting but other out of context, confusing and not connected to the rest of the study.

      Authors’ response: We are sorry for the confusion, but we have now streamlined the data presented, to focus in the manuscript mainly on two kinesins, kinesin-13 and kinesin-20. We provide the relevant controls and a detailed description of the analyses. All experiments were repeated at least three times, and, where appropriate, figure legends provide information on the number of samples and repeats. We hope the reviewer finds the manuscript clearer now.

      Overall I wish to reiterate that I believe that there are a lot of very good experimental results in this study but unfortunately many of these get lost in the overall presentation that is often superficial or out of context. My general impression is that the authors are trying to show too much, "too fast" and as such many of the presented results remain questionable. The author are likely able to correct all these discrepancies but this might not be possible to do in the manuscript.

      Authors’ response: We are thankful to the reviewer for finding a lot of very good experimental data and hope that the revised manuscript, with more focus on two kinesins, will give more confidence in our work to the reviewer.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript the authors show a huge ambition to catalog biological functions of Plasmodium kinesins. This was done by generating transgenic cell lines where kinesins were deleted and/or tagged with GFP that served as a tool to gather as much biological information on each kinesin isoform. On one side I find this manuscript highly impressive in terms of the amounts of data and information. In particular, the cell biology and microscopy results are of high quality and certainly provide useful information to the research community. I am fairly convinced that most results genuinely represents the individual biological aspects of the kinesins in the best possible way. Unfortunately, I have major reservations about the presentation of these results in the compiled manuscript. In my view the authors were overambitious about the volume and diversity of data that they wished to present, which opened a lot of questions about the depth and quality of each of the experimental effort. There is 10 figures which is highly nonstandard for a scientific publication to start with and yet there is, in my view, major gaps in some results descriptions, data presentations e.t.c. Perhaps, because of this huge ambition the data are presented in a highly superficial manner often lacking negative and positive controls. Unfortunately that creates many doubts about the overall quality of the results and as such the interpretations. In my view the authors might be well advised to separate this large body of work into several publications each focusing on more tangible biological problem in the more in-depth manner. This would give the reader (me) better confidence about the validity of the statements made in this manuscript.

      I can give few examples of such discrepancies but cannot account for all.

      1.The authors created GFP-tagged transgenic cell lines for each of the 9 kinesins and generated life cell images for each of the line across multiple stages of the entire plasmodium life cycle. This is an impressive amount of work and data. It is certainly useful to see that in life cell imaging the different kinesins isoforms can be detected in different sets of developmental stages some diffused in the cytoplasm and some associated with the nucleus. Even though these results are impressive, there are based solely on life cell imaging that rely on a certain level of detection limit and GFP visibility. One can imagine that a kinesin may still be expressed in a developmental stage and not detected by life cell imaging. I believe that some other detection methods such a western blot, immunoprecipitation e.t.c. should be provided to truly demonstrate that an individual isoform of a kinesin IS of ISNOT expressed. Without that the Figure 1B is overstated. Moreover, the authors claim that the punctuate signal in the nucleus corresponds to spindle. I do not see any supporting evidence for this in this figure.

      2.For the analysis of kinesin 5 and 8x the authors note two types of experiments. First they created a "cross" between the two cells lines. Second, the authors carry out ChIP-Seq to show that the proteins localize to the centromere. This could be an impressive result unfortunately there is very little if any information about it. Genetic crosses in Plasmodium are not standard techniques that one can assume works all the time. I believe there should be more evidence that the presented images come from a true genetic cross. The least the authors could show that the florescence signal for both channels come from genuine integrations of the GFP proteins to their target kinesins by PCR or genome sequencing. Similarly for the chip-seq, there is a need to provide much detailed information about the entire results with a particular clarity about the position of the peaks in respect to projected centromeres. In addition the ChIP-Seq analysis should be supported by data along with positive and negative controls to truly show the kinesins associations with the centromeres.

      3.In the middle part, the author present rater impressive analyses of several kinesin deletion trains and their effect on the development of the mosquito stages. In particular, they demonstrate the effect of kinesin 20 on ookinete development. Yet in the next paragraph they present RNA seq analysis of the kinesin 20 deletion on gametocyte induction, in which kinesis 20 should not have any effect; judging from the presented phenotypic assay. This experiment seems out of context as it is unclear why this assay was done and what is the outcome. The authors identified a small group of differentially expressed genes seemingly unrelated to neither kinesin function nor gametocyte induction. This experiment does not make sense to me in the context of the rest of the paper.

      Significance

      As mentioned above, these three examples represent some of the discrepancies not necessarily about the data quality and fidelity but rather a confusing character of the entire study. From this perspective I have two types of problems with this manuscript. First, while reading this manuscript, lacking key controls and detailed description of some of the analyses, made me loose interest as well as confidence in other parts of the studies which may or may not be solid. Second, I struggle to see the key purpose of the presentation. Instead the manuscript seems to be a compilation of very diverse data some of which are interesting but other out of context, confusing and not connected to the rest of the study.

      Overall I wish to reiterate that I believe that there are a lot of very good experimental results in this study but unfortunately many of these get lost in the overall presentation that is often superficial or out of context. My general impression is that the authors are trying to show too much, "too fast" and as such many of the presented results remain questionable. The author are likely able to correct all these discrepancies but this might not be possible to do in ne manuscript.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary: In this study, Zeeshan et al used live-cell imaging, ultrastructure expansion microscopy, and electron microscopy, gene deletion, genetic knockdown, RNA-seq, ChIP-seq analyses, and matrigel substrate to examine the subcellular localization and the function of Plasmodium kinesins throughout the P. berghei life cycle. They find that Kinesin-13 is the only kinesin essential for both asexual blood stages and sexual stages.

      This manuscript represents a lot of work by the authors. The data appear rigorous and well-executed. The data are clearly presented and the writing is clear. I have only minor comments that may improve the reader's comprehension.

      Major comments:

      Figure 2C:

      The ChIP-seq experiments examined the kinesin-5 and -8x binding site at the chromosome at 6 mpa. Did the authors do any tests at other time points post-activation?

      Figure 4:

      The authors conclude that kinesin-x3 and kinesin-x4 are non-essential for the P.berghei life cycle. Does deletion of kinesin-x4 affect the length of the flagella?

      Oocyte size: To the non-specialist, it is difficult to reconcile the images in panel E with the conclusions in panel A. Based on the images, it looks like only knocking out of kinesin-8x seems to affect oocyst size. Can the authors clarify and provide graphs of the quantification of oocyte size?

      Minor comments:

      line 190: typo, kinesin-x4

      Figure 3: what do the arrows mean?

      Figure 4F:

      1. typo, scale bar, um.

      2. Does deletion of kinesin-5 show a significant difference?

      Significance

      The study provides comprehensive information on the diverse subcellular location and functions of P. berghei kinesins throughout the P. berghei life cycle. That is useful to exploit the therapeutic targets against malaria.

      The main findings are that kinesin-13 genetic knockdown affected MT dynamics during spindle formation and axoneme assembly in male gametocytes and subpellicular MT organization in ookinetes. In addition, Kinesin-13 shows different binding to kinetochores during the gametogenesis and ookinete development, suggesting other proteins may regulate kinesin-13 binding to kinetochores at various stages. The underlying mechanism will help to better understand the role of kinesin-13 in the parasite life cycle.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The study led by Dr. Zeeshan analyzes nine mouse Plasmodium parasites kinesin by determining their expression pattern and subcellular location in various stages of the parasites in the mammalian and mosquito host. The genetic and phenotypic analyses of all nine kinesins indicate that most are critical for parasite development in the mosquito host, except for Kinesin 13 being the only kinesin essential during the intraerythrocytic development cycle. The authors presented an in-depth analysis on kinesin 13 and 20, using an impressive pallet of molecular techniques such as promotor swapping, chromatin immunoprecipitation, and global transcriptomic analysis using RNAseq, as well as numerous microscopy techniques such as live fluorescence imaging, expansion microscopy, and electron microscopy. This comprehensive study provides an outstanding amount of data on Kinesins in Plasmodium parasites that would be best showcased with a rethinking of the manuscript structure and a more insightful discussion section that directed most of my comments in the review the manuscript. I believe no additional experiments are needed assuming that the authors will link Kinesin 13 and or 20 to the IMC formation in future work.

      Major Comments:

      •The current manuscript shows the " Location and function of Plasmodium kinesins" as the title suggests; however, I strongly recommend the authors consider alternative storytelling focusing on Kinesin 13 and 20. The author provides in-depth phenotypical analysis resulting in the most innovating and exciting data. In addition, the discussion section from lines 592 to 634 was fascinating compared to the following section (see details comments for Discussion section below).

      •The following significant comments are related to figures where I believe a restructuration is most needed to bring clarity to the paper."

      •Figure 1. I suggest the authors move Figure 4A to figure 1; Figure1C should move to supplementary information except for Kinesin 13 and 20 data to center the paper's focus on these two proteins. I would also present the kinesin data in the current Figure4A not by numeric order but by biological relevance. All the "normal" together and so on

      •Figure 2: Kinesin 5 and 8X have the same results. I suggest the authors present only one in the same manuscript and place the other one in Supplementary information. I would recommend adding the little schematic used in Figure1C to help the reader quickly identify the parasite stages presented in the figures.

      •Figure4: Panels B to E should be a supplementary information

      •Figure 5: Panels H to J should be supplementary information, and I strongly recommend the authors to present data by stages; therefore, I would remove panels F and G and replace them with Figure 6A, the expansion microscopy represents the data in Figure 4B, C, D, and E beautifully.

      •Figure 6B: It is challenging to identify the layout between WT and delta-kinesin 20. All annotations on the EM data cover the data itself. I recommend drawing a representative schematic to guide the reader for identification of ultrastructure.

      •Figure 8: Panel C and D should be supplementary information and replaced by the accurate colocalization data of Kinesin 13 presented in Supp figure 5. In addition, comment line 442 is also actual for the ookinete. The true colocalization is with tubulin in male gamete and gametocytes in figure 5A/B.

      •Figure 9: Panel F to J go to supplementary information and replace with the data in figure 10.

      •Figure 10: Could be a great abstract figure in the current state. As a model figure, I would recommend incorporating more details

      Minor Comments:

      I will address my following minor comment by Line number rather than section:

      Figure 1C: It is unclear if the black square is an actual picture or a black square. I would suggest the authors present the absence of data by a white square or a bar.

      Line 96: " a final synchronized round of S-phase" The classical mitotic terminology is poorly used in the field of Plasmodium mitosis due to the absence of canonical cell cycle checkpoint. I would recommend the authors rephrase as " a final synchronized round of DNA replication."

      Line 149-151: Could the authors indicate what stage of the life cycle the work was done?

      Line 161: Missing space between the word "parasite and cell"

      Line 163: " These findings will inform a strategy ..." Could the authors explain in greater detail how the study is informative for targeting MT motors for therapeutic. I would argue with the authors that it is an overstatement since the paper did not provide structural data on kinesin as a foundation for drug discovery.

      Line 368: What was the reasoning for examining whether other kinesin genes' expression is misregulated in deltaKinesin 20?

      Line 515: Could the authors define what is a nuclear pole?

      Line: 576 - 579: The authors mention the absence of the IFT component for flagellum assembly due to the assembly of the axoneme in the cytoplasm. It is known that kinesin-2 is required for the anterograde transport in organism building cilia and flagella using IFT. In the current study, kinesin 2 is not part of the nine kinesins; therefore, it is unclear why the authors made these comments and did not reflect on them. I would suggest removing it or comment it.

      Line 546-560: this entire section of discussion would be best in a review paper. It is a well-written summary of the current literature with no discussion related to the data on the present study; therefore, I suggest the authors remove it from the discussion.

      Line 561 - 571: Great summary of the Kinesin-13 work without discussion.

      Line 572: What do the authors mean by " these findings"?

      Line 573 - 589: The authors miss the opportunity to elaborate on how the depletion of kinesin protein could impact the global transcriptome. Are we looking at downstream effects? I strongly recommend the authors resolve the lack of discussion related to the RNAseq data in the study.

      Significance

      This study is a tremendous amount of work done rigorously and will advance our knowledge in the biology of Plasmodium parasites. We are in urgent need to develop innovative ways to block the replication and transmission of Plasmodium spp. and it can happen only through advancing our knowledge in the basic biology of the parasite.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01015

      Corresponding author(s): Jordan, Raff

      1. General Statements [optional]

      We thank the reviewers for their thoughtful and constructive comments and have now revised our manuscript accordingly. We apologise that it has taken so long to send in these revisions, but this is in part because both first authors have now left the lab.

      2. Point-by-point description of the revisions

      Reviewer #1

      This reviewer was generally supportive. They note that it is unfortunate that our data suggests the CP110/Cep97 complex does not play a major part in controlling daughter centriole growth—although we believe that this is an important negative result—but feel that other aspects of our data are interesting. They requested no further experiments, but did comment that it would be interesting to determine when g-tubulin is incorporated into growing centrioles. Unfortunately, we cannot test this as the centrioles in these embryos recruit large amounts of g-tubulin to their PCM, so we cannot specifically assay the small amount of protein in the centriolar fraction.

      Reviewer #2

      Major Points:

      __Figure 1: The reviewer notes that Sas-4 and CP110 have antagonistic roles in promoting/repressing centriole growth and asks if Sas-4 is involved in promoting centriole elongation and whether it also oscillates. __It is unclear if Sas-4 directly promotes centriole elongation in flies. We have previously shown that centriolar Sas-4 levels do oscillate during S-phase, but with a timing that is distinct from CP110/Cep97 (Novak et al., Curr. Biol., 2014). These observations do not shed much light on the potential antagonistic relationship between CP110/Cep97 and Sas-4, so we do not comment on this here.

      Figure S1B: The reviewer requests that we image the centrioles with greater laser intensity to test whether some residual CP110 or Cep97 protein can be recruited in the absence of the other protein. The quantification of this data suggests that some residual CP110 or Cep97 can still be recruited to centrioles in the absence of the other (Graphs, Figure S1B,C), so we do not think it necessary to repeat this experiment at higher laser intensity to further test this point. We now state that the centriolar recruitment of one protein may not be completely dependent of the other (p6, para.2).

      Figure 3: The reviewer questions whether the reduction in CP110/Cep97 levels at the mother centriole that we observe during S-phase could be due to photobleaching. This is an interesting point that we now analyse in more detail (p8, para.2). We do not think the decrease in mother centriolar CP110/Cep97 levels is due to photobleaching as our new analysis (which includes more data points during mitosis) strongly suggests that centriolar levels on the mother rise again at the start of the next cycle (New Figure 3C,D).

      The reviewer asks whether the CP110/Cep97 oscillations occur at the tip of the growing centriole, and whether we can use super-resolution imaging to address this. A large body of evidence indicates that CP110/Cep97 are highly concentrated at centriole distal tips, and all our experiments suggest that it is this fraction that is oscillating. In Figure 3, for example, we use Airy-scan super-resolution imaging to follow the oscillation on Mother and Daughter centrioles in living embryos. Although the resolution in these experiments is not as high as we can achieve using 3D-SIM in fixed cells, it seems reasonable to assume that the dots of fluorescence we observe oscillating on these centrioles (Fig. 3) are the same fluorescent dots we observe localised at the distal tips of the mother and daughter using 3D-SIM in fixed cells (Fig. 1A).

      The reviewer requests additional quantification of the western blots shown in Figure S1 that we use to judge relative expression levels. As we now describe in more detail in the M&M, these ECL blots are very sensitive, but highly non-linear, so we usually estimate relative expression levels by comparing serial dilutions of the different fractions (see, for example, Figure 1B, Franz et al., JCB, 2013). As we now clarify, the key point is not precisely by how much these proteins are over- or under-expressed, but that we observe a similar oscillatory behaviour when they are either over- or under-expressed.

      __The reviewer points out that our statement that the CP110/Cep97 oscillation is entrained by the Cdk/Cyclin oscillator (CCO) is too strong as it is based only on a correlation. __We agree and apologise for this overstatement. To address this, we have now perturbed the CCO by halving the dose of Cyclin B (New Figure 5E—H). This extends S-phase length and we now show that the period of the CP110/Cep97 oscillation is also extended. This suggests that the CCO directly influences the period of the CP110/Cep97 oscillation.

      The reviewer notes that our conclusion that the centriole cartwheels are longer or shorter when CP110 or Cep97 are absent or overexpressed, respectively, is based only on Sas-6-GFP fluorescence intensity. They ask if this fluorescence intensity perfectly reflects cartwheel length, and if we can confirm these conclusions using EM. Sas-6 is the main structural component of the cartwheel, so the amount of Sas-6 at the centriole should be proportional to cartwheel length, and we have published two papers that support this conclusion and that use the incorporation of Sas-6 as a proxy to measure cartwheel length (Aydogan et al., JCB, 2018; Aydogan et al., Cell, 2020). Importantly, our previous EM studies support our conclusions about the relationship between cartwheel length and CP110/Cep97 levels: the centrioles in wing-disc cells are slightly longer in the absence of CP110 and slightly shorter when CP110 is overexpressed (Franz et al., JCB, 2013). The new findings reported here provide a potential explanation for this EM data, which was puzzling at the time.

      Minor Points:

      Figure 1C: The reviewer noted that our schematic illustrations in this Figure could be misleading____. We agree and have now redrawn them.

      Reviewer #3

      Major points:

      The reviewer requested that we clarify our use of the term oscillation, pointing out that oscillations are repetitive variations in levels/activity over time, whereas the “oscillations” we describe here occur during each round of centriole assembly. This is a fair point, and one that is often debated in the oscillation field, with many believing that too many biological processes are termed “oscillations”, when they are not truly driven by the passage of time. To avoid any ambiguity, we now no longer describe the behaviour of CP110/Cep97 as an oscillation (although, for ease of discussion, we still use the term in this letter).

      The reviewer thought that the data we show in Figure 1 was not relevant as we largely analyse centrioles in living embryos whereas the data in Figure 1 is derived from fixed wing-disc cells—and similar fixed-cell data has been shown in previous studies. The reviewer suggests we use super-resolution methods to analyse Cp110/Cep97 dynamics in the syncytial embryo, and show this relative to Sas-6 and Plk4. They ask if Plk4 and CP110/Cep97 colocalise at any time. While CP110/Cep97 localisation has been analysed by super-resolution microscopy previously (e.g. Yang et al., Nat. Comm., 2018; LeGuennec et al., Sci. Adv., 2020), CP110/Cep97 was a minor part of these studies and our data is the first to show that this complex sits as a ring on top of the centriole MTs in fly centrioles (that lack the complex distal and sub-distal appendages present in the previously analysed systems). As this localisation is important in thinking about how CP110/Cep97 might influence centriole MT growth, we would like to include it. We cannot show this detail in living embryos as the movement of the centrioles reduces resolution and we cannot observe the ring structure.

      Although we do use Airy-scan super-resolution microscopy to study CP110/Cep97 dynamics in living embryos (Figure 3), we cannot do this in two colours (to compare these dynamics to Sas-6 or Plk4 dynamics) as red-fluorescent proteins bleach too quickly. We now show the relative dynamics of CP110/Cep97 and Plk4 recruitment using standard resolution microscopy (New Figure S2). While it is well established that Plk4 and CP110/Cep97 are concentrated at opposite ends of centrioles, they are all recruited to the nascent site of daughter centriole assembly, effectively “colocalising” at this timepoint. This could provide an opportunity for the crosstalk we observe here, and we now mention this possibility (p17, para.1).

      The Reviewer questioned whether the loading of Sas-6-GFP onto centrioles can be used as a proxy for cartwheel length, pointing out that Sas-6 could load into centrioles in a way that does not change the cartwheel structure, and that EM is required to test this. As described in our response to Reviewer #2, Sas-6 is the main structural component of the cartwheel, and we have published two papers that use the incorporation of Sas-6 into the cartwheel as a proxy to measure cartwheel length (Aydogan et al., JCB, 2018; Aydogan et al., Cell, 2020). While we cannot exclude that Sas-6 might also associate with the cartwheel in a way that does not involve its incorporation into the cartwheel, it is not clear how EM might address this question. Moreover, even if such a fraction existed, it should not affect our conclusions—as long as Sas-6 is binding to the cartwheel in some way, then the amount bound should remain proportional to the length of the cartwheel. Perhaps the reviewer is suggesting that we perform an EM time course of cartwheel growth to back up our conclusions from the Sas-6 incorporation assay? If so, we think this impractical. The changes in cartwheel length shown in Figure 6 are revealed from analysing several thousand images of centrioles compared at precise relative time points. Such an analysis cannot be done in fixed embryos by EM.

      Similar to the point above, the reviewer notes that we use the length of the cartwheel to infer centriole MT length, but we never directly measure MT length. They suggest we perform either an EM analysis or use MT markers to directly measure the kinetics of centriole MT growth. In flies (and many other organisms), the centriole MTs grow to the same length as the centriole cartwheel (Gonzalez, JCS, 1998), so we can be confident that the final length of the cartwheel reflects the final length of the centriole MTs. Moreover, we previously measured the distance between the mother centriole and the GFP-Cep97 cap that sits at the distal tip of the centriole MTs as a proxy for centriole MT length, and found that the inferred kinetics of MT growth were similar to the kinetics of cartwheel growth (inferred from Sas-6 incorporation) (Aydogan et al., 2018). This manual analysis was very time consuming, and we have tried to implement computational analysis methods, but so far without success. For similar reasons to those described in the point above, it is not feasible to accurately measure centriole MT growth kinetics by EM (nobody has been able to do this). Moreover, the centrosomes in these embryos are associated with too much tubulin and the centriole MTs are not yet modified (e.g. by acetylation) as the cycles are so fast—so we cannot directly stain the centriole MTs in fixed embryos. We have now toned down our conclusions about MT length throughout the paper, and we make it clear that we cannot directly measure this.

      All of the experiments shown here are performed in the presence of endogenous untagged proteins, and the reviewer wonders if recruitment dynamics might be influenced by competition for binding from the endogenous protein. We have compared the behaviour of many centriole and centrosome proteins in the presence and absence of the untagged WT protein. In all cases, less tagged-protein binds to centrioles/centrosomes in the presence of untagged protein, presumably due to competition. Apart from this, however, we usually observe no real difference in overall dynamics and in Reviewer Figure 1 (see below) we show that CP110-GFP and GFP-Cep97 both oscillate even in the absence of any endogenous protein. As we feel this result is not very surprising, we do not show it in the manuscript.

      The reviewer correctly noted that our data was not strong enough to conclude that the CP110/Cep97 oscillation is influenced by the CCO. This was also raised by Reviewer #2 and, as described above (p2, para.3 above), we have now performed additional experiments to more directly demonstrate this point (new Figure 5G—H).

      The reviewer requests more discussion of why our conclusion that CP110/Cep97 levels oscillate on the growing daughter centrioles during S-phase is different to that reached by Dobbelaere et al, (Curr. Biol., 2020), who conclude that Cep97-GFP only starts to incorporate into the new daughter centrioles late in S-phase when the daughters are fully grown. We have discussed this discrepancy with these authors and they kindly shared their reagents with us (so our endogenous Cep97-GFP oscillation data comes from the same line they used in their experiments), but we have not come to a clear conclusion on this point. We have shown robust oscillations for CP110 and Cep97 by quantifying many hundreds of centrioles using multiple transgenes (both over- and under-expressed) in multiple backgrounds. Cep97 dynamics were a very minor part of the Dobbelaere et al., study, and they analysed a much smaller number of centrioles. We now briefly mention this discrepancy (p9, para.1), but do not discuss it in detail as we have no definitive explanation for it.

      The reviewer requests more experiments or more discussion to address the mechanism(s) of crosstalk between CP110/Cep97 and Plk4, and they suggest several avenues for further investigations. These are excellent ideas, and we are working hard on these approaches. These are all long-term experiments, however, and we feel it is important that the field be made aware of these surprising findings as soon as possible, as others may be better-placed to provide mechanistic insight into how this system ultimately works. We now briefly mention some of the future directions the reviewer highlights in the Discussion.

      The reviewer thought we should highlight the previous publications showing that Plk4-induced centriole amplification requires CP110 and that Plk4 can phosphorylate CP110. These studies (Kleylein-Sohn et al, Dev. Cell, 2007; Lee et al., Cell Cycle, 2017) were mentioned, but we now discuss them more prominently (p17, para.2).

      Minor Points:

      The reviewer raised a number of minor concerns that we have now addressed: (1) We discuss the model the reviewer suggests; (2) we no longer state that the crosstalk between CP110/Cep97 and Plk4 is unexpected; (3) We have clarified our description of the shift in timing of the peak levels of CP110/Cep97, which we no longer refer to as an oscillation; (4) We define mNG as monomeric Neon Green; (5) We have changed our schematics in Figure 1 as suggested by the reviewer; (6) We have corrected the mistake in the legend to Figure 8.

      Reviewer #4

      Major points:

      1. The reviewer noted that the amplitude of the CP110/Cep97 oscillations depended on protein expression levels, so the oscillations might not reflect the behaviour of the endogenous proteins. They requested that we either repeat our experiments with CRISPR knock-in alleles, or conduct experiments with the lines driven by the endogenous promotors but in their respective mutant backgrounds. We have not generated CRISPR knock-ins for CP110/Cep97, but have done so for many other centriole/centrosome proteins (>8) and found that most such lines are expressed at higher or lower levels than the endogenous allele (and sometimes very significantly so). This is also true for our standard transgenic lines, where genes are expressed from their endogenous promoters, but are randomly integrated into the genome. The blots in Figure 4 show that CP110-GFP and GFP-Cep97 expressed from a ubiquitin (u) promoter or from their endogenous promoters (e) are expressed at ~2-5X higher or ~2-5X lower levels than the endogenous proteins, respectively. As we observe CP110/Cep97 oscillations in all cases, it seems unnecessary to generate new CRISPR knock-ins (that are also likely to be somewhat over- or under-expressed) to show this again. As the reviewer asks, we show that Cep97-GFP and CP110-GFP still oscillate in in the absence of the endogenous proteins (Reviewer Figure 1). As this does not seem a surprising result, we do not show this in the main manuscript. In the same point the reviewer requests that we use antibody staining in fixed embryos to show that the untagged proteins also oscillate. Analysing protein dynamics is much harder in fixed embryos, as the levels of fluorescent staining are more variable and we can only approximately infer relative timing, rather than precisely measuring it (as we can in living embryos). Moreover, as both proteins in the CP110/Cep97 complex exhibit a very similar oscillatory behaviour when tagged with either GFP or RFP (e.g. Figure 2C), and this behaviour is distinct to that observed with several other GFP- or RFP-tagged centriole proteins (e.g. Novak et al., Curr. Biol., 2014; Conduit et al., eLife, 2015; Aydogan et al., JCB, 2018; Aydogan et al., Cell, 2020) it seems very unlikely that this behaviour is induced by the GFP (or RFP) tag.

      The reviewer also suggests that we show the data with the endogenous promoter before we show the data with the ubiquitin promoter. As we now explain better (and show in Figure 4), this seems unnecessary as the proteins expressed from the ubiquitin promotor are probably actually expressed at levels that are more similar to the endogenous protein.

      The reviewer questions whether the oscillations we observe might be due to the centrioles simply moving up and down in the embryo during the cell cycle, and they suggest we monitor Asl behaviour to rule this out. We have previously shown that Asl-GFP levels do not oscillate; they remain constant throughout the cell cycle on old-mother centrioles, and grow approximately linearly throughout S-phase on new-mother centrioles (see Figure 1D in Novak et al., Curr. Biol., 2014).

      We were not sure we understood this point properly, so we copy the reviewers comment in full here: ____The authors mention (for instance on p. 3) that the inner cartwheel and the surrounding microtubules assemble at opposite ends of the daughter centriole. However, my understanding is that the short centrioles present in the fly embryo have an inner cartwheel that extends throughout the organelle, such that it might be moot to make a distinction between the two ends in this case. Moreover, it is also my understanding that this inner cartwheel is itself surrounded by microtubules, so that microtubule assembly might not be expected to occur strictly at the distal end no matter what. The reviewer is correct that Drosophila centrioles are short (~150nm) and that the cartwheel extends throughout the centriole. We think the reviewer is suggesting that it may not be relevant therefore whether the cartwheel and centriole MTs grow from opposite ends—as the activities that govern their growth may not be spatially separated? However, because cartwheels grow preferentially from the proximal-end (Aydogan et al., JCB 2018) while centriole MTs are assumed to grow preferentially from the distal (plus) end, there is an intrinsic problem in ensuring they grow to the same size—no matter how short or long the centrioles are. The reviewer is correct that one possible solution to this problem is that the centriole MTs actually grow from their minus ends, but this is not widely accepted (or even proposed). We have tried to explain this issue more clearly throughout the revised manuscript.

      The reviewer points out that the schematic illustrations in Figure 1A and 1C are inaccurate and unhelpful. We agree and have now redrawn these.

      The reviewer asks that we provide information about the eccentricities of the centrioles in the different datasets used to calculate the protein distributions shown in Figure 1, particularly as the data for Sas-4-GFP and Sas-6-GFP were obtained previously using a different microscope modality, making comparisons complicated. The point that comparing distance measurements across different datasets is difficult is an important one, and we now state that such comparisons should be treated with caution. However, we have not provided information on the distribution of centriole eccentricities in the different experiments as it wasn’t clear to us how this information could be used to make such comparisons more accurate (presumably the reviewer is suggesting we could apply a correction factor to each dataset?). The very tight overlap in the positioning of CP110/Cep97 fusions (Figure 1C) strongly suggests that any difference in the average centriole eccentricities of the different populations of centrioles analysed, which are already tightly selected for their en-face orientation (i.e. eccentricity

      The reviewer requested that we show the “noisy data” we obtained during mitosis that we excluded from our analysis in Figure 3. As we now explain in more detail (p8, para.2), there are two reasons why the data for mitosis in this experiment is “noisy”: (1) The protein levels on the centrioles are low in mitosis and the centrioles are more mobile, so they are hard to track; (2) The Asl-mCherry marker used to identify the mother centriole starts to incorporate into the daughter (now new mother) centriole during mitosis, making it difficult to unambiguously distinguish mothers and daughters. As a result, we cannot track and assign mother/daughter identity to very many centrioles during mitosis—although we now include some extra data-points during mitosis for the centrioles where we could do this (revised Figure 3C,D). Importantly, it is clear that this “noisy” data hides no surprises: one can see (Figure 3C,D) that the signal on the centrioles is simply low during mitosis and then starts to rise again as the embryos enter the next cycle. This is confirmed in the normal resolution data (Figure 2B,C; Movies S1 and S2) where we can track many more centrioles due to the wider field of view and because we do not have to discard centrioles in mitosis that we cannot unambiguously assign as mothers or daughters.

      The reviewer requests that we conduct a super-resolution Airy-scan analysis of CP110/Cep97 driven from their endogenous promoters (eCP110 or eCep97) to ensure that the oscillations we see with these lines (shown in Figure 4C,D) are also occurring at the daughter centriole—as we already show for the oscillations observed with the uCP110 and uCep97 lines (shown in Figure 4C,D, and analysed at super-resolution on the Airy-scan in Figure 3). This is technically very challenging as super-resolution techniques require a lot of light and the centriole signal in the eCP110/Cep97 embryos is very dim compared to uCP110/Cep97 embryos (Figure 4C,D). We have managed to do this for eCep97-GFP and confirmed that—even in these embryos that express Cep97-GFP at much lower levels than the endogenous protein (Figure 4A)—the “oscillation” is primarily on the daughter (Reviewer Figure 2). As this data is very noisy, and as the ubiquitin uCP110/Cep97 lines express these fusions at levels that are closer to endogenous levels (Figure 4A,B), we do not show this data in the main text.

      The reviewer also asks for clarification as to why we use the Airy-scan for some experiments and 3D-SIM for others. As we now explain (p8, para.1), 3D-SIM has better resolution than the Airy-scan, but it takes more time and requires more light—so we cannot use it to follow these proteins in living embryos. Thus, for tracking CP110/Cep97 throughout S-phase in living embryos we had to use the Airy-scan.

      The reviewer questions why in some experiments we analyse the behaviour of 100s of centrioles, whereas in others the numbers are much smaller (1-14 in Figure 3—note, the reviewer quoted this number as coming from Figure 4, but it actually comes from Figure 3, so we have assumed they mean Figure 3). We apologise for not explaining this properly. The super-resolution experiments in Figure 3 are performed on a Zeiss Airy-scan system, which has a much smaller field of view than the conventional systems we use in other experiments. Thus, we inherently analyse a much smaller number of centrioles in these experiments. In addition, as explained in point 6 above, in these experiments we need to analyse mother and daughter centrioles independently, and in many cases we cannot unambiguously make this assignment, so these centrioles have to be excluded from our analysis.

      The reviewer questions why we selected the 10 brightest centrioles for the analysis shown in Figure S1B,C (note, the reviewer states Figure S2 here, but it is the data shown in Figure S1B,C that is selected from the 10 brightest centrioles, so we assume this is the relevant Figure). We apologise for not explaining this properly. In these mutant embryos very little CP110-GFP localises to centrioles in the absence of Cep97, and vice versa, so we cannot track centrioles using our usual pipeline and instead have to select centrioles using the Asl-mCherry signal. As the difference between the WT and mutant embryos is so striking, we simply selected the brightest 10 centrioles (based on Asl-mCherry levels) in both the WT and mutant embryos for quantification. We could select more centrioles, or select centrioles based on different criteria, but our main conclusion—that the centriolar localisation of one protein is largely dependent on the other—would not change.

      The reviewer also questioned why we performed the analysis shown in Figure S2 (new Figure S3) during S-phase of nuclear cycle 14, when the rest of the manuscript focuses on nuclear cycles 11-13. We apologise for not explaining this properly. In cycles 11-13 centriolar CP110/Cep97 levels rise and fall during S-phase, whereas both proteins reach a sustained plateau during the extended S-phase (~1hr) of nuclear cycle 14—making it easier to analyse CP110/Cep97 levels in embryos when their centriole levels are maximal. We now explain this.

      The reviewer requests that we quantify the western blots shown in Figure 4 in the same way we do in figure 8. To do this we would need to perform multiple repeats of these blots and we did not perform these because the blots shown in Figure 4 largely recapitulate already published data (Franz et al., JCB, 2013; Dobbelaere et al., Curr. Biol., 2020). Moreover, as described in our response to Reviewer #2, these ECL blots are very sensitive, but highly non-linear, so we always compare multiple serial dilutions of the different extracts to try to estimate relative levels of protein expression. We now explain this in the M&M.

      The reviewer suggests the data shown in Figure 8 is a “straw man”: we really want to test whether modulating CP110/Cep97 levels modulates centriolar Plk4 levels, but instead we test how they modulate cytoplasmic Plk4 levels. The language here is harsh, as it suggests that our intention was to mislead readers into thinking that we have addressed a relevant question by addressing a different, irrelevant, one. We apologise if we have missed something, but we believe we do perform exactly the experiment that the reviewer thinks we should be doing—quantifying how centriolar Plk4 levels change when we modulate the levels of CP110 or Cep97 (Figure 7). It is clear from this data that modulating the levels of CP110/Cep97 does indeed modulate the centriolar levels of Plk4. In Figure 8 we seek to address whether this change in centriolar Plk4 levels occurs because global Plk4 levels in the embryo are affected—a very reasonable hypothesis, which this experiment addresses quite convincingly (although negatively).

      Minor Points:

      The reviewer highlights a small number of mistakes and omissions, all of which have been corrected.

      Finally, we would like to thank the reviewers again for their detailed comments and suggestions. We hope that you and they will agree that the changes we have made in response to these comments have substantially improved that manuscript and that it is suitable for publication in The Journal of Cell Science.

      Sincerely,

      Jordan Raff

      __Reviewer Figure 1. CP110/Cep97 dynamics remain cyclical even when Cep97-GFP and CP110-GFP are expressed from their endogenous promotors in the absence of any endogenous protein. __Graphs show how the levels (Mean±SEM) of centriolar CP110/Cep97-GFP change during nuclear cycle 12 in (A) Cep97-/- embryos expressing eCep97-GFP or (B) CP110-/- embryos expressing eCP110-GFP. CS=Centrosome Separation, NEB=Nuclear Envelope Breakdown. N≥11 embryos per group, average of n≥15 centrioles per embryo.

      __Reviewer Figure 2. ____The cyclical recruitment of Cep97-GFP expressed from its endogenous promoter occurs largely at the growing daughter centriole. __The graph quantifies the fluorescence intensity (Mean±SD) acquired using Airy-scan microscopy of eCep97-GFP on mother (dark green) and daughter (light green) centrioles in individual embryos over Cycle 12. CS = Centrosome Separation, NEB = Nuclear Envelope Breakdown. Data was averaged from 3 embryos as the number of centriole pairs that could be measured was relatively low (total of 2-8 daughter and mother centrioles per time point; in part due to the much dimmer signal of eCep97-GFP in comparison to uGFP-Cep97).

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The authors report that CP110 and Cep97 localize near the distal end of centrioles in Drosophila embryos. CP110 and Cep97 tagged with GFP exhibit an oscillatory distribution, with levels on the daughter centriole being maximal in mid S-phase. These oscillations correlate with cell cycle progression. The authors also show that modulating CP110 or Cep97 levels changes the rate at which Sas6-GFP incorporates in the daughter centriole, as well as aspects of the previously reported oscillatory behavior of Plk4.

      These results could be of potential interest if the stated conclusions were backed up by more convincing data than that which is provided at present. The issues delineated hereafter must be addressed in full before further consideration of the manuscript.

      Major points

      1) The oscillatory amplitude of CP110/Cep97 tagged with GFP is much smaller when expression is driven by the endogenous promoters than upon overexpression (see Figure 4), raising the possibility that oscillation might not reflect, or only reflect in part, the behavior of the endogenous proteins. To address this issue, the authors could GFP tag the endogenous loci using CRISPR/Cas9. If this is too demanding, they should at the minimum conduct experiments with the extant lines driven by the endogenous promoters, but in the background of the available CP110 or Cep97 null mutants. Moreover, the authors should stain staged wild-type embryos with antibodies against CP110 and Cep97 to ensure that the mild oscillations reported in Figure 4 do not merely reflect the behavior of the tagged proteins, for example due to the presence of GFP. Related to this point, the authors should considering showing first the data with CP110-GFP GFP-Cep97 driven from the endogenous promoters (current Figure 4), perhaps relegating the results upon overexpression (current Figure 2) to a Supplementary Figure.

      2) In repeating the above experiments, the authors must ensure that potential mild oscillations do not simply reflect the fact that centrioles are located at a slightly different distance from the coverslip as a function of cell cycle stage. This could be addressed for example by simultaneously imaging a mother centriole marker such as Asl-mCherry.

      Other important points

      3) The authors mention (for instance on p. 3) that the inner cartwheel and the surrounding microtubules assemble at opposite ends of the daughter centriole. However, my understanding is that the short centrioles present in the fly embryo have an inner cartwheel that extends throughout the organelle, such that it might be moot to make a distinction between the two ends in this case. Moreover, it is also my understanding that this inner cartwheel is itself surrounded by microtubules, so that microtubule assembly might not be expected to occur strictly at the distal end no matter what.

      4) Partially related to the point above, the schematic representations in Figure 1 are somewhat confusing. The schematic in Figure 1A represents CP110/Cep97 strictly at the distal end of the centriole, yet the actual immunofluorescence data on the left suggests that CP110/Cep97 are in fact present very close to Asl-mCherry. This apparent difference must be resolved. Moreover, Figure 1C seems to indicate that all the depicted proteins are present throughout the centriole, which I guess is not what the authors wanted to convey.

      5) For the quantification of the data reported in Figure 1, the authors considered only centrioles for which CP110/Cep97 ring eccentricity was less than 1.2, to ensure that only near top views are considered (see p. 23). This is entirely reasonable, but the authors should report the distribution of eccentricities in the data set for the two proteins, and compare them to those of the Sas6-GFP and Sas4-GFP data set, all the more since the latter two were obtained previously with a different microscope modality, potentially complicating thorough comparisons. A slight difference in the fraction of centrioles with a slight tilt could easily skew the data when dealing with such small dimensions.

      6) In Figure 3, the authors chose not to report the "Noisy data" observed during mitosis. While it is understandable that the data is noisier at this stage, it must nevertheless be reported, as this may have bearing on assessing oscillations between cycles 12 and 13.

      7) The authors should conduct Airy-scan analyses of CP110/Cep97 oscillations driven from the endogenous promoters, to ensure that the variations across the cell cycle reported in Figure 4 reflect changes in the daughter centriole. Moreover, it was not clear why the authors used the Airy-scan for some super-resolution experiments and 3D-SIM for others.

      8) Why are solely 1-14 centrioles per embryo considered in the experiments reported in Figure 4 as compared to over 100 per embryo in Figure 2? And how were these centrioles chosen? This needs to be explained, justified and, potentially, rectified.

      9) Likewise, why are only the 10 brightest centriole pairs in each embryo retained for the analysis reported in Figure S2? And would the conclusion differ if more centrioles than that were included? Moreover, S phase of cycle 14 is analyzed in Figure S2 for Sas6-GFP, whereas the remainder of the manuscript analyzes CP110/Cep97 during cycles 11 through 13 (with an emphasis on cycle 12). This must be resolved.

      10) The Western blots in Figure 4A, 4B, as well as in Figure S1A, should be quantified in the same manner as those in Figure 8C, to achieve a better assessment of the differences in protein levels between conditions.

      11) The set up for the experiment reported in Figure 8 comes across as a straw man. What one would really like to find out is whether levels of Plk4 at centrioles are modulated by CP110/Cep97 levels, as the authors themselves acknowledge. Since this does not appear to be feasible, the authors set out to test whether cytoplasmic levels of Plk4 differ, finding that this is not the case. Since this experiment does not address what should be tested, it could be reported as a Supplementary Figure, not as the last main figure of the manuscript.

      Minor points

      • The authors forgot to mention the Tang et al. paper (doi: 10.1038/ncb1889) when referring to Sas-4/CPAP (for instance on p. 4).
      • On p. 9, the authors conclude that the "recruitment of CP110/Cep97 to centrioles is regulated by the CCO". Figure 5 shows that the two correlate, not that the latter regulates the former. A related comment holds for the discussion (bottom of p. 13).
      • It is not clear why the authors sometimes report SDs (Figure 7) and sometimes SEMs (Figure 3), or fail to report what is being shown (Figure 2). This needs to be clarified.
      • The legend of Figure 8A mentions Pie charts and other things that are not featured in the current rendition of the figure.

      Significance

      These results could be of potential interest if the stated conclusions were backed up by more convincing data than that which is provided at present.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      SUMMARY

      This study uses nuclear cycles 11-13 of Drosophila embryos to show the dynamics of the distal centriole localizing CP110/Cep97 complex during the predicted time of MT assembly during new centriole assembly. Continuing from prior work from this group, the authors find that the increase and decrease in CP110/Cep97 at new centrioles correlates with the timing of Cdk/Cyclin oscillations (CCO). The authors find that increased or decreased levels of CP110/Cep97 changes the dynamics of SAS6 and Plk4 levels. The authors suggest that there is crosstalk between the distal localizing CP110/Cep97 complex and the proximal localizing Plk4 and SAS6 proteins required for early centriole assembly.

      MAJOR

      Overall, the results are potentially interesting but I believe that there a number of instances in this manuscript where the conclusions need to either be strengthened with further experiments or toned down to reveal exactly what is shown in the manuscript.

      CP110/Cep97 OSCILLATIONS

      Because oscillations are repetitive variation in levels/activity with time, I think the manuscript needs to either use other terms that accurately describe what is measuring here or it should be defined what the authors are calling an oscillation. CP110/Cep97 only increases and then decreases during a single new centriole assembly and maturation event and I think that this should be clearly describe it this way.

      LOCALIZATION OF CP110/Cep97 TO DISTAL END OF CENTRIOLES

      Based on the existing published studies, it is clear that CP110/Cep97 localizes to the distal end of centrioles. Figure 1 does not show distal centriole localization in daughter centrioles of the syncytium that are the subject of this manuscript though. Its shows radial localization in the mother centriole of the fly wing. Figure 1 therefore has not relevance to the rest of the manuscript and has already been shown in prior studies.

      My suggestion would be that this figure should study the dynamic localization of CP110/Cep97 at daughter centrioles during new centriole assembly in the syncytium. Moreover, this should localize these proteins relative to SAS6 and Plk4 that are the subject of the manuscript. Are there localization dynamic changes during the oscillation? Are there times when these proteins do co-localize?

      SAS6 AND CW CONCLUSIONS

      The current manuscript routinely equates SAS6 levels to cartwheel growth. This is overstated and EM is required to understand whether this is truly impacting the actual cartwheel structure. Loading more sas6 protein doesn't necessarily mean the cartwheel structure changed.

      CONNECTION BETWEEN OSCILLATIONS AND MT GROWTH?

      Much as above, the manuscript infers MT growth without ever showing it. How does all of this relate to centriole length and growth dynamics.? Page 8 refers to prior work but it seems like this is necessary with EM or MT markers. Having this comparison seems important to the conclusion that MTs do not stop growing when CP110/Cep97 levels reach a threshold level at the distal end.

      The following statement is overstated when the data for MT growth are not even presented in this study. "...our findings essentially rule out the possibility that centriole MTs stop growing when a threshold level of CP110/Cep97 accumulates at the centriole distal end." To make such arguments in this study the manuscript would need to include EM and / or MT staining.

      ENDOGENOUS UNTAGGED PROTEIN AFFECTING DYNAMICS?

      The manuscript shows protein dynamics under conditions of both overexpressed and expression under the endogenous promoter. However, I believe that both of these conditions are also in the presence of untagged protein expression.(?). If so, is it possible that the dynamics represent competition for binding relative to the endogenous, untagged protein? I think this point should at least be discussed.

      CP110/Cep97 "INFLUENCED" BY CCO

      While I agree that it is likely to be the case that CP110/Cep97 rise and fall at the daughter centriole correlates with CCO, this study does not directly test if CCO changes impact CP110/Cep97 dynamics. Stating that "CP110/Cep97 oscillation is strongly influenced by the activity of the core Cdk/Cyclin cell cycle oscillator (CCO)" is overstated. Is does correlate though.

      DISTINCTION FROM PRIOR STUDIES

      Dobbelaere 2020 argue that CP110/Cep97 gets to the centriole distal end in late S phase. How could this be considering the data presented in this study? Need discussion of this point. Could Dobbelaere be following the dynamics of the core / basal levels and missed the dynamics that are found in this study? I think a discussion of the Cep97 functions needs to be provided.

      MECHANISM OF CROSS TALK

      How two apparently spatially separated complexes influence each other should be more mechanistically addressed through either or both experimentation and / or discussion. Obviously the impact of this study would greatly benefit by showing how they are associated and influence each other. CP110 is a phospho target of Plk4. Does this occur in the fly syncytium? Do these interact? What is the timing of the interaction and phosphorylation? Are the changes to SAS6 levels actually the result of Plk4 changes? At this point, these concepts are not tested.

      BACKGROUND

      In its current form the prior results that 1) Plk4-induced centriole amplification requires CP110 and 2) Plk4 phosphorylates CP110 is important for centriole assembly in some systems is not highlighted in this manuscript as further support for the model of interplay between CP110/Cep97, Plk4 and SAS6.

      REPRODUCTION OF DATA

      I believe that the data and methods are of high quality and described in such a way that they can be reproduced.

      MINOR

      ALTERNATIVE MODEL

      Because CP110 is a target of Plk4, I wonder if the elevated expression of CP110 sequesters Plk4 away from its cartwheel functions (Ana2/STIL/SAS5 phosphorylation) and this is therefore affecting SAS6 levels?

      OVERSTATED CROSSTALK

      The text states a "...reveals an unexpected crosstalk between proteins that are usually thought to influence the proximal end of the CW and the distal end of centriole MTs." This is true but there are enough data in the literature to suggest that CP110/Cep97 influence centriole assembly that would indicate that this is not "unexpected".

      PAGE 11 - SHIFT IN PEAK

      I could not find the data clearly showing that there was a shift in "the Plk4 oscillation to later in S-phase". Are the authors referring to the plateau in levels? Please explain further.

      WHAT IS "Plk4-NG"?

      I assume Neon Green but I don't see the definition.

      FIGURE 2

      A schematic of the system used for image averaging would help the reader to understand that these "oscillations" represent the mother and daughter centriole together and that each "oscillation" represents one event of the daughter centriole only increasing in CP110/CEP97 levels and then decreasing after peak intensity.

      FIGURE 5 and 8

      I think these could be supplemental images. I was unable to figure this out but something is wrong with the legend in Figure 8. (A) is referencing items that I cannot find in the figure.

      Significance

      This study's advance is an expansion of the authors' prior work showing that during the fly nuclear cycles centriole assembly proteins increase and then reduce in what the authors call an oscillation. Here they show that the CP110/Cep97 complex also oscillates and somehow influences the levels of Plk4 and SAS4 that typically reside at the proximal end of the centriole. This is consistent with prior work indicating that, in some systems, CP110/Cep97 influence centriole duplication and assembly.

      I believe that with additional experiments to strengthen the conclusions and toned down concluding statements this will be of interest to the centriole, centrosome, and cilia community. My research expertise is also in this community but I am not a Drosophila researcher. I do appreciate the beauty of this system that the authors use.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this study, Aydogan, Hankins, and colleagues, present an interesting work that follows up on their article "An Autonomous Oscillation Times and Executes Centriole Biogenesis" published last year in Cell. In this new study, they analyzed the distal complex consisting of CP110/Cep97 in the centriole of Drosophila embryos. They first demonstrated their oscillatory recruitment at the distal tip of the daughter centriole and they proposed that this protein complex is implicated in the control of centriole growth timing. They also demonstrated the importance of the crosstalk between CP110/Cep97 and Plk4 and its impact on cartwheel growth. This paper proposes a compelling model explaining how centriole growth is regulated. This manuscript is very well written and the data is of high quality. However, some point needs to be clarified before publication:

      Major points:

      • Figure 1: Since SAS-4 and CP110/CEP97 are only 5nm apart, SAS-4/CPAP is thought to have an antagonistic function to CP110 in the regulation of centriolar growth, and Plk4 can phosphorylate CPAP (DOI: 10.1038/emboj.2010.118), do the authors think that SAS-4 might also be involved in cartwheel/centriole elongation? Does SAS-4 oscillate?
      • Figure S1B: The reduction in the intensity of CP110 in Cep97-/- and of Cep97 in CP110-/- is very obvious, nevertheless it is surprising that the cytoplasmic background, even reduced, is not visible, the images are completely dark. Would it be possible to image with a higher laser power or boost the intensity to see if a small amount is present at centrioles?
      • Figure 3: The authors indicate that "uGFP-CP110 or uGFP-CEP97 levels remained relatively constant on the mother". However, the intensity clearly decreases over time. Can the authors explain this result, is it due to photobleaching?
      • Do the oscillations of CP110 and Cep97 occur at or around the tip of the growing centriole? Would it be possible to use super-resolution at different stages of the S-phase to answer this question?
      • The authors indicate that the level of overexpression of CP110-GFP and Cep97-GFP is 2.5X compared to their endogenous proteins (based on the western blot in Figure S1). Nevertheless, it seems that the overexpression of CP110 is more important. Quantification is necessary here.
      • The authors proposed that "The CP110/Cep97 oscillation is entrained by the Cdk/Cyclin cell cycle" because they observed a strong and significant correlation between the timing of the CP110/Cep97 peak and S-phase length for both uGFP-Cep97 and uCP110-GFP at all nuclear cycles. It seems to me that this correlation is not sufficient for this statement. If it is not possible to inhibit the CCO to check its impact on CP110/Cep97, this statement should be mitigated.
      • Figure 6: According to your results, cartwheels are longer in absence of CP110 or CEP97 and opposite in overexpression situations. Does the intensity perfectly reflect the length of the cartwheel? is the centriole longer? Could you confirm your observation on cartwheel/centriole length using electron microscopy?

      Minor points:

      • Figure 1C: as the authors show that CP110 and Cep97 are localizing at the distal end of the centriole, I suggest that they place CP110 and Cep97 distally and not at the level of the cartwheel, this representation can be misleading and suggest that CP110 and Cep97 are part of the cartwheel/MT connection.

      Significance

      The results presented are new and quite unexpected. This work allows a better understanding of phenotypes previously observed. I believe that this work will have an important impact in the field as it brings a whole new vision on the regulation of centriole growth. This article is primarily aimed at centriole/centrosome/cilia fields but may be of interest to a broader cell biology audience.

      My field of expertise is centriole/cilia biology

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript is a continuation of the previous articles of the authors (Aydogan et al., JCB 217:1233, 2018; Aydogan et al., Cell 181:1566, 2020). They reported that Plk4 initiates and times the growth of the cartwheel at the proximal end during early divisions of the Drosophila embryos. In this manuscript, they investigated roles of the CP110/Cep97 complex in the centriole growth control at the distal end of the centriole. The daughter centriole levels of the CP110/Cep97 complex oscillate in S phase in a similar manner to those of Plk4. The CP110/Cep97 oscillation is entrained by the core Cdk/Cyclin cell cycle oscillator but not by Plk4. Rather, the centriolar levels of Plk4 increased in the CP110 and Cep97 deletion embryos. The experiments seem to be carefully carried out, data are nicely presented, and manuscript is clearly written.

      Significance

      I agree with their interpretation that the CP110/Cep97 oscillation does not appear to play a major part in determining the period of daughter centriole growth during early divisions of the Drosophila embryos. The CP110/Cep97 complex seems to have a limited role in the centriole length control. The CP110/Cep97 complex may be important to prevent centrioles from over-elongating after the initial growth of centrioles.

      As suggested in the manuscript, phosphorylation may be a regulatory mechanism for CP110 behaviors at the centrioles. It was previously reported that CP110 is a substrate of the cell cycle kinases, such as Cdk2 (Chen et al., Dev Cell 3:339, 2002) and Plk4 (Lee et al., Cell Cycle 16:1225, 2017). Phosphorylation may be required for recruitment or removal of CP110 at the centrioles. Nonetheless, it is hard to interpret the functional significance of the S phase oscillation of the CP110/Cep97 complex with the data in the manuscript.

      It is unfortunate to conclude that the CP110/Cep97 complex may not be a major player for controlling the centriole growth. However, the manuscript includes other interesting observations. For example, they presented data supporting that the SAS6 protein is added at the proximal side of the centrioles, which is opposite to the microtubule growth. Microtubules in the daughter centrioles may assemble at the minus end rather than the plus end. It would be interesting to determine when γ-tubulins are recruited to the growing centrioles.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      This manuscript was evaluated at Review Commons by four individual reviewers. There was a consensus amongst reviewers that the localization behavior of altORF peptide to the Golgi is a compelling observation and that, with some additional characterization, would provide an effective cell biological tool for use in labeling and studying the Golgi. Our primary goal for this paper was to explore this surprising alternative protein hidden within the sequence of a centromere component and to establish this peptide as a cell biological tool that can be used to study the Golgi. However, the reviewers also highlighted some interesting open questions regarding the nature of this peptide. Below we summarize these core comments our current changes and plans

      • Where within the Golgi does the peptide localize? In the work currently included in the paper, we demonstrate that the altORF peptide robustly colocalizes with markers for the Golgi (GM130/TGN46), but not with markers for the Endoplasmic Reticulum (KDEL). However, the resolution at which we imaged the localization of the peptide was not sufficient to determine in which compartment of the Golgi the peptide resides. To address reviewer comments on the specificity of the peptide’s localization within the Golgi, we will attempt to use higher resolution imaging such as confocal or spinning disk microscopy to attempt to better resolve this.
      • How does the peptide target to the Golgi? In this manuscript, we show that the localization of the altORF peptide relies on a Cysteine residue present within in a minimal 10 amino acid sequence. Through treatment with 2-Bromopalmitate (2-BP; a palymityltransferase inhibitor) to disrupt its localization, our work suggests that the peptide is palmitoylated. In addition to this observation, the reviewers asked for an additional demonstration that this peptide is palmitoylated in cells. To test this, we have attempted to identify this modification using mass spectrometry of the isolated (IP) GFP tagged peptide from cells. However, we were unable to identify peptides that coincide with the modified peptide cysteine residue. Secondly, we have attempted to identify the modification using Click-chemistry labeling strategy, but this has proved to be technically challenging and infeasible. As an alternative approach for the revised version, we will attempt to perform hydroxylamine treatment followed by SDS-PAGE analysis to determine whether this results in a shift in migration of the GFP tagged altORF, as suggested by a reviewer, to provide additional evidence that the peptide is modified.
      • Can this peptide be used to ectopically target proteins to the Golgi? The reviewers asked whether the altORF peptide can be used to ectopically target proteins to the Golgi. In this manuscript, we demonstrate that the peptide sequence is sufficient to target both GFP and the Halo tag (two very different proteins) to the Golgi, and can be tagged at either terminus of the peptide, suggesting that it can be used as a powerful strategy to recruit other proteins to the outer surface of the Golgi. We have emphasized this point in the updated version that is included in this revision.
      • Does this peptide alter Golgi structure? For this peptide to provide a useful cell biological marker, it would be preferential for it not to alter cellular physiology. Our work demonstrates that expression of the altORF peptide does not affect the growth of cultured cells. For this updated version, we have performed additional analysis to test whether induced expression of the altORF peptide alters the structure of the Golgi or the localization of other Golgi-associated proteins. Based on a qualitative analysis of these cells, we do not detect any obvious changes in Golgi organization or morphology. This is now included as Supplemental Figure 2D.
        • Is this peptide expressed in human cells? *We have analyzed published ribosome profiling data that suggests that this altORF can be translated, although it is produced to a much lower degree than the full-length CENP-R protein. The short length of the peptide as well as the nature of the amino acid sequence makes this peptide highly challenging to identify via mass spec. It is also possible that this peptide would be expressed in different cell types in the human body, but not robustly expressed in cultured cells. We believe that these are beyond the scope of this paper. However, we now comment on these important points in the updated version.
        • Is this peptide “functional”? *Based on our deliberate analysis of the evolutionary conservation of this altORF within the CENPR transcript, it is clear that this peptide acquired the ability to localize to the Golgi only recently during evolution (only old world primates have this capacity). We believe that this peptide represents a great example of evolution in action, with minor sequence changes resulting in the acquisition of a new capacity and trait. However, as this peptide is not broadly conserved across mammals, it is unlikely to facilitate a core biological function that can be analyzed in cell culture. It is certainly possible that this peptide would contribute to a feature of human biology on the organismal level, but it is not feasible to test this experimentally. The functional nature of this peptide, and particularly the recent evolutionary acquisition of this novel trait are interesting points that we have now commented on in the updated manuscript (text changes in blue).
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #4

      Evidence, reproducibility and clarity

      The manuscript reports the characterisation of a 37 amino acid alternative open reading frame (altORF) within the RNA of the centromere protein, CENP-R. The resulting peptide, when expressed in different cell lines fused to GFP, localises on the Golgi complex, exposed on the cytosolic face of Golgi membranes. It remains associated with the Golgi complex under conditions inducing fragmentation or dispersal of the Golgi complex such as mitosis and BFA. The authors identify in aa 5-14 the minimal Golgi targeting motif and in cysteine 11 a key aa for the targeting. They suppose that palmitoylation may be involved in Golgi targeting as palmitoylation inhibitors prevent its Golgi targeting. The data are clearly presented and sustain the conclusions.

      Significance

      Though the identification of a Golgi targeting motif is of potential interest, the manuscript appears to be at a preliminary stage as it fails to provide any data on the possible function of the altORF of CENP-R palmitoylation or even evidence for its existence in the cells used in the manuscript. The authors appear to be aware of the limits of their study as they conclude their study led to the identification of an "easy-to-use Golgi labeling construct". Also in this scenario, however, some key information are missing: the actual sub-Golgi localisation of the probe, its possible impact on Golgi structure and function, and the formal proof that it is palmitoylated.

      Referess cross-commenting

      I see all the reviewers agree that the manuscript has major limits. Overcoming these limits wold require years if one had to provide proofs for the existence and for the physiological relevance of this alternative ORF, and months to provide the missing information that have been highlighted by the reviewers to consider "just" the technical aspect of this altORF as a possible Golgi reporter/targeting sequence.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      In this study, the authors characterized a potential alternative open reading frame close to the CENP-R open reading frame that had previously been found by ribosome profiling. Its 37 amino acid peptide sequence was included in a proteomic database and is conserved in primates. Transfection of different cell lines with the GFP-tagged peptide was used for immunofluorescence and proteolytic cleavage by a cytosolic protease was used to show that it localizes to the cytoplasmic face of Golgi membranes throughout the cell cycle and Brefeldin treatment had no influence on fragmentation or reformation of the Golgi stacks. The specific localization could be confirmed using different cell lines. The use of numerous truncation mutants allowed to narrow down the minimal Golgi recognition sequence to a 10 amino acid stretch including a species-specific conserved cysteine that required palmitoylation. From these data and comparison with similar sequences in other species the authors determined a consensus sequence for this Golgi targeting sequence in primates.

      Major comments:

      1. Without ultrastructural analysis it is always difficult to judge whether a localization is limited to just one organelle. Immunofluorescence alone gives no clear answer in particular when organelles differ in size and form from cell to cell. In particular when the authors claim that the peptide may serve as a marker. For example when you are working on secretion it is important to distinguish membranes derived from ER exit sites (ERES), the ER-Golgi intermediate compartment (ERGIC), the Golgi itself and Golgi-derived vesicles. I therefore recommend to add a subcellular fractionation by which numerous fractions can be analyzed by a gel in parallel using markers for all the above mentioned different membrane origins.
      2. Is it possible to confirm the in vivo existence of this peptide? There are probably no specific antibodies available, but it should be possible to detect the peptide in enriched Golgi membrane fractions by mass spectrometry.
      3. It would be interesting to reveal the potential in vivo role of this peptide, when it exists. The authors failed to identify potential interaction partners by IP-MS, so I wonder whether its role may be different by controlling the Golgi association of other well known Golgi interactors like GM130, Golgin or GORASP proteins. Is their Golgi association altered in the presence of the peptide?
      4. Finally the authors determined a consensus sequence which they claim to be a Golgi targeting sequence, however when this is true one would expect that there are other proteins in the cell that use this consensus sequence as targeting sequence. The authors only show that the consensus is conserved among the same alternative open reading frame in primates, but to serve as a Golgi targeting sequence it should be possible to identify unrelated other proteins using this consensus by bioinformatics. What happens when an otherwise differently addressed protein is attached to this Golgi sequence, is it mislocalized?

      Minor comments:

      There are a couple of typos and smaller issues - In the Introduction line 2 the citation is missing and skip the "a" in line 7. - In the Results and Discussion section page 5, line 5 "In our ongoing work, we..." - In the same section close to the end in the second from the last paragraphs Figure 5B should be Figure 5C - In the Methods section check the temperature specifications: 4{degree sign}C or 37{degree sign}C, not 4C or 37C - Also in the Methods: there are no secondary antibodies recognizing complete animals (antiMouse or antiRabbit)! The antibodies are directed either against IgG or IgM (e.g. anti-Mouse IgG) - Some subscripts are missing: MgCl2 not MgCl2, NaN3 not NaN3 - Also on the last page of the Methods section the antibody is specific for TGN46, not TGN146 - Last paragraph: for concentrations use μM not uM (also in the Fig.4 legend) - The end of the second from the last sentence is missing. - In the References, is the citation for the Samandi et al. manuscript correct, just one number? - Legend to Suppl. Fig 3: Golgi (capital letter), (~) is missing in figure - Suppl. Fig1B use Courier also for peptide sequence, this will omit alignment problems

      Significance

      Overall, this study is interesting and may provide a helpful tool for cell biologists working on trafficking projects (like myself) in particular because a general Golgi targeting sequence is missing. For techniques like RUSH (Retention using specific hooks) which can be used to synchronize secretory protein traffic reliable and highly specific targeting sequences are required. I am supportive of this study, however, to be useful for the audience the authors need to provide more examples demonstrating the targeting efficiency.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors have identified a alternately translated region with in the mRNA of CENP-R that encodes a small 37 aa peptide that localizes to the perinuclear Golgi region. The main premise here is the this peptide can be used as a novel Golgi marker. The peptide seems to localize peripherally to membranes and interacts with cis and TGN elements based on light microscopy. Mutational analysis indicates that a cysteine residue within a 10 aa region is critical and defines a minimal consensus sequence required for Golgi localization. Evidence is presented based on inhibition by 2-bromopalmitate that C11 is palmitoylated.

      Significance

      If this peptide probe is to be used as a Golgi-specific marker, there are several major issues that have not been addressed. The first is whether it actually binds to Golgi elements and if so, what are the specific elements? The light microscopy images are not of high enough resolution to determine if the peptide interacts with cis or TGN Golgi. The BFA experiments suggest it interacts with the TGN or some other associated vesicular compartment since staining fragments into vesicles and does not get integrated into the ER (Fig. 2B). The authors would have to use higher resolution confocal imaging or, more preferably, immuno-EM to identify exactly where the peptide is located.

      The second issue is the conclusion that the peptide is palmitoylated, which is only based on partial loss of 'Golgi' staining after 2-BP treatment (Fig. 4D). More conclusive evidence is required such as incorporation of radiolabelled or click-palmitate probes into peptide, or band shift after hydroxylamine treatment. In regard to the last point, the protein seems to migrate as a doublet on SDS-PAGE (Fig. 2D) suggesting some type of modification or cleavage that is not commented on.

      Lastly, I would be unlikely to use this as a Golgi probe for the reasons described above, as well as the fact that there is nothing known about the biological function of the peptide (this is potentially the most interesting aspect that is seemingly ignored). If you express the peptide what impact does it have on Golgi structure and function? I could envision that its binding to a Golgi element(s) could affect one of myriad functions that rely on Golgi activity.

      Referees cross-commenting

      This is more of a technical report that does not address the function of the peptide within the Golgi complex. Without this information, and identification of the compartments involved, I don't see the advantage of the probe compared to other methods. As one reviewer mentioned, this seems to be a preliminary study that is difficult to assess given the limited and ambiguous results.

    5. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The authors note that we currently lack a robust targeting signal to direct proteins to the cytoplasmic face of Golgi membranes. The presented work clearly identifies a novel Golgi targeting sequence rich in aromatic/hydrophobic/basic residues and with a key critical cysteine (C11). One can imagine a situation where the non-cysteine residues provide an underlying affinity for cell membranes and thereby allow access to membrane-associated zDHHC S-acyltransferases. I guess the unknown question is whether Golgi specificity comes from the amino acid sequence per se (mediating specific interaction with components of Golgi membranes) or instead by specific recognition of the cysteine by Golgi-localised zDHHC enzymes. It might be worth discussing this in the paper although this should not detract from the main focus/message of the paper- the identification of a Golgi targeting peptide. Data is compelling and support the conclusions of the paper. Although much of the data is not quantified, the data provided is convincing.

      Significance

      Interesting advance for researchers in the general membrane trafficking area and S-acylation field. Provides new information that can be used to target proteins of interest to the Golgi. I note that restriction of an S-acylated peptide at the Golgi is unusual as S-acylation is usually followed by trafficking to the plasma membrane. My expertise is in S-acylation and protein trafficking

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      *The reviewers are enthusiastic. They agree with the claims made and comment favorably with regard to the impact as well on the short- and long-term potential for translation. All three go out of their way to emphasize positive aspects. A variety of questions were raised and we submit a complete revision with point-by-point replies that addresses all of these. This includes addressing tumor organoid (tumoroid) plasticity (reviewer #1) and composition/heterogeneity (reviewer #3) by incorporating single cell data as well as other analyses. We thank the reviewers for the thorough feedback. The additional data, analyses and clarifications strengthen the study. *

      To keep the rebuttal as short as possible we have only copied the reviewers’ concerns/questions, not the favorable comments. The copied remarks are in highlighted. Our replies are in italics. Each question is accompanied by a reply and a brief description of changes made in response.

      2. Point-by-point description of the revisions

      __Reviewer #1, Major Comment #1: “__The authors provided a foundational validation of their organoids through various methods, and their protocol stands to impact the field of RMS biology. To validate the organoids as recapitulating the primary human tumors, the authors perform analysis on the bulk organoid and bulk human primary tumors. The authors showed through sequencing efforts that the bulk mutational profile and transcriptional profiles do not dramatically change between the parent tumor and organoids. This analysis was done well; however, the authors fail to rigorously illustrate that the organoids maintain tumor cell heterogeneity of the primary human tumors. To rigorously validate the organoid system, the authors should illustrate the organoid culture conditions do not alter the heterogeneity of cells (cell plasticity) compared to that of the primary tumor. A formal assessment of the cellular plasticity in the organoids to the primary tumor would determine how the organoid system either maintains or shifts the cancer cell plasticity because of changes in microenvironment (Oncogene, 2020, 39: 2055-2068). The addition scRNA-seq would illustrate whether the organoids maintain the same populations as the primary tumor or bias for the propagation of specific cell populations at a single cell level and provide more rigorous information about every cell type present.”

      Reply: The reviewer’s question is whether the tumor cells in the tumoroid culture have the same degree of plasticity and are therefore as heterogeneous in culture as they are in the tumors that they are derived from. We agree that evaluating the heterogeneity of tumor cells in the tumoroid culture is desirable. This would ensure that the procedure has not simply selected for a single type of tumor cell. We have therefore generated single-cell RNA sequencing (scRNA-seq) data of tumoroid cells as suggested. It is important to point out that a complete inventory of RMS tumor cell heterogeneity by scRNA-seq has not been published as yet. Such an undertaking, i.e., scRNA-seq of a large cohort of RMS tumors, is an entire study in itself and lies outside the scope of this study. It would also not be feasible due to limited sample material for many of the tumors used here. Nevertheless, as is being alluded to by the reviewer, there is ample evidence of tumor cell heterogeneity in primary RMS tumors based on previous studies using immunohistochemistry (for example the well-known heterogeneity in expression of RMS marker proteins such as Myogenin, MyoD1 and Desmin). As shown in new Fig. 2D, when cultured as tumoroid models, examples from both of the main tumor types (FP-RMS sample RMS127 and FN-RMS sample RMS444) show a large degree of heterogeneity in expression of the known, heterogeneously expressed tumor cell markers Myogenin (MYOG gene), MyoD1 (MYOD1 gene) and Desmin (DES gene). Comparison with the cell cycle marker Ki-67 (MKI67 gene) shows that this heterogeneity is not due the cells being present in different cell cycle phases. Tumor cell heterogeneity in the tumoroid culture is further indicated by the heterogeneous CNV patterns derived from the tumoroid scRNA-seq data (new Suppl. Fig. 1B).

      Both the CNV analysis and the scRNA-seq marker gene expression indicate that the tumoroid culture conditions neither stringently select for a single type of tumor cell, nor drive the tumor cells into a uniform expression pattern phenotype, consistent with maintaining plasticity, even after the 7 (RMS127) and 5 (RMS444) passages. These are good indications of retained plasticity/heterogeneity. Additionally, we make it clear in the revised version that a more exhaustive answer would benefit from having a complete cohort of tumor scRNA-seq data to first determine the degree of heterogeneity exhibited by RMS tumors for all genes.

      The related question of tumoroid cellular composition, with regard to the presence of non-tumor cells, is addressed in response to reviewer #3, major comment #1.

      Changes: Addition of a new Fig. 2D and a new Suppl. Fig. 1B with figure captions. Additional text in the Results and the Discussion sections. Additions to the Methods for the generation and analysis of the scRNA-seq data.

      Reviewer #1, Major Comment #2: “The authors took great strides to show that the organoids respond to therapeutics similarly to primary tumors. However, Figure 5A could be more transparent with more data labelled in the graph instead of just in the app and the implications of the variable responses could have been explored in the discussion section. Furthermore, for this model to be clinically relevant for pharmacokinetic studies, propagation in mice needs to be shown.”

      Reply:

      • We have made Fig. 5A more transparent by adding the drug names.
      • The different response between FN-RMS and FP-RMS subtypes for certain drugs is known and the implication that the models reflect this is discussed more thoroughly now as suggested.
      • We agree that animal experiments are imperative for pharmacokinetic studies of new drugs. However, most of the drugs that were included here, especially the ones highlighted, have already been evaluated in early phase clinical trials in adults and/or children. The pharmacokinetic data for humans is therefore already available for these drugs, making additional animal studies for pharmacokinetics of these drugs redundant. For future studies, various types of animal studies are likely to be required and we make this clear in the Discussion, also emphasizing that in general, tumoroid models do propagate in mice. To address this specifically for RMS, we have started a collaboration to generate PDX mouse models derived from RMS tumor and tumoroid samples in parallel. Anecdotally we can state here that at least 50% propagate. However, since we wish to investigate this systematically and with a complete set of tumoroid models, it is not prudent to wait for these results before publishing the current study. This would delay making the protocols, findings and tumoroid models available to the scientific community and as our (and many other groups’) work exemplifies, tumoroid models can yield important findings on their own. Changes: Drug names added to rows in Fig. 5A. The Discussion has been expanded to include the differential response of tumor subtypes and tumoroids to different drugs and to include the uses (including pharmacokinetics) of different types of models has been expanded.

      Reviewer #1, Major Comment #3: “Figure 1 is well put together to graphically demonstrate the process by which organoids were obtained and manipulated. Figure 1B, however, as a graphical summary is a little confusing, and the information would be greatly enhanced by the addition of a comprehensive table. Furthermore, additional information could be added to the table to make it a more inclusive and impactful addition to the paper.”

      Reply: We agree.

      Changes: A new Table 1 has been added as a separate file with a corresponding revised legend in the main document.

      Reviewer #1, Major Comment #4: “It is quite impactful that the authors were able to actively engineer the organoids with CRISPR/Cas9 and accurately delete TRP53, but controls were not represented in the figure. The experiment should have included a sgRNA targeting a pan-essential gene as a positive control and a non-targeting sgRNA as a negative control. We recommend addition of both controls to the experiment outlined in Figure 6 to increase the validity and rigor of the data presented.”

      Reply:* We respectfully note that all appropriate controls were done. This included a non-targeting sgRNA as negative control (see Methods lines 1137 to 1140). As also explained in Fig. 6A, the strategy for generating a P53 knock-out involved selection through nutlin-3 exposure, whereby cells wildtype for P53 are selected against. As described (Methods lines 1144 to 1146), cells transfected with the non-targeting sgRNA plasmid indeed died upon nutlin-3 exposure. A sgRNA against a pan-essential gene was not included in this strategy since the nutlin-3 already kills all cells with a wildtype P53. Finally, we draw attention to the fact that the success of the approach was assessed by Western Blotting (Fig. 6B) and Sanger sequencing (Suppl. Fig. 6A). *

      Changes: None.

      Reviewer #1, Major Comment #5: “Although the authors provide an insight into a useful preclinical RMS model, the paper lacks mechanistic insight besides cursory description of the model.”

      Reply: Insight into a wide variety of different molecular and cellular mechanisms will be exciting to explore in future studies. This publication is indeed focused on describing an approach that works for RMS, and therefore showing for the first time that this works systematically for mesenchymal-derived tumors. In addition, the study describes key characteristics of the tumoroid models that are important to establish their validity as models and that are essential to demonstrate before making the tumoroid models available to the wider scientific community in order to perform the further mechanistic analyses. The word cursory is in contrast to the many positive comments made by this reviewer and the other two reviewers with regard to the extensive characterization.

      Changes: None.

      Reviewer #1, Minor Comment #1: “Figure 3C and 4B are not transparent in their labels and could be altered so that every line has an associated gene in the publication. Furthermore, there are sample specific differences that could be explored in the discussion.”

      Reply: We agree.

      Changes: Gene names have been added for every row in both figures. The Discussion now incorporates the observed differences.

      Reviewer #1, Minor Comment #2: “In Supplementary Figure 1, higher magnification inserts are needed to get a closer look at the IHC. Furthermore, the white balance is not the same in all the images and needs to be corrected prior to publication. The difference in white balance can clearly be seen in the last panels depicting IHC for RMS335, where the MYOD1 staining has a yellow background whereas the H&E staining has a white background.”

      Reply: We agree.

      Changes: Higher magnification inserts have now been provided throughout Suppl. Fig. 1A. The white balance has been corrected.

      Reviewer #1, Minor Comment #3____: “The authors mentioned in line 202 that some of their organoids contain the novel fusion of PAX3 and WWTR1, but this fusion is not indeed novel as it has previously been seen in biphenotypic sinonasal sarcoma (Am J Surg Pathol 2019, 43:747-754).”

      Reply: We rephrased this to clarify that this is the first report of such a fusion in RMS, rather than in general.

      Changes: The corresponding sentence has been rephrased.

      Reviewer #1, Comment within the Significance Statement: “The authors state that this is the first system to use organoids but should recognize the advances demonstrated by Manzella et al. (Nat Commun, 2020, 11:4629). Additionally, the authors state that this is the first demonstration of pre-clinical models harboring FGFR4V550L mutations; this fails to recognize the prior reported work by several groups (Chen et al., Cancer Cell, 2013, 24:710-24; Manzella et al., Nat Commun, 2020, 11:4629; McKinnon et al., Oncogene, 2018, 37:2630-2644).”

      Reply:* We had in fact already recognized the advances described by Manzella et al. which was referenced in two places in the original submission (current lines 100 and 388). We thank the reviewer for pointing out the previous work done on an RMS cell line that harbored an FGFR4 p.V550L mutation. *

      Changes: We rephrased the corresponding passages concerning the FGFR4 mutation.

      We thank reviewer #1 for all the comments. This has resulted in many improvements.

      Reviewer #2: W____e thank reviewer #2 for the positive comments. There are no major/minor queries to address.

      Reviewer #3, Major Comment #1: “The authors describe the models derived as organoids/tumoroids implying that multiple cell types are represented potentially recreating the tumor microenvironment. Can the authors comment more specifically and demonstrate the extent to which cell types in addition to the tumor cells are represented, viable and are organized through analyses of the original and tumoroid sections (extend fig 2C/supplementary fig) and via analyses of the RNAseq data?”

      Reply: We use the term tumor organoid or tumoroids as coined by the field in general. This indeed indicates a degree of self-organization such as the three-dimensional growth in spheres and the propagation of a heterogeneous population of tumor cells (see comment #1, reviewer 1) for example. In general, however, tumoroids do not include growth of a non-tumor cell microenvironment inter-woven with the (different types of) tumor cells. Exceptions to this are very early passage tumoroids that are not yet stable and which may still contain non-tumor cells, or specialized co-culture conditions that are currently being actively sought to allow for co-culture of tumor cells within a non tumor cell microenvironment. It is therefore not anticipated that late passage tumoroid models will have non-tumor cells. The basis of the technology is that the defined set of growth factors in the medium mimics the tumor stimulating conditions of the non-tumor cell microenvironment. Since the mixed presence of tumor and non-tumor cells generally gives rise to one (frequently the non-tumour cell) outgrowing the other, it is often considered the hallmark of an unsuccessful tumoroid.

      The reviewer therefore raises an important point that we have failed to make clear. We have addressed this in two ways. We emphasize that the scRNA-seq data that are now included in response to reviewer #1, comment #1 do not indicate the presence of any non-tumor cells (as expected). In addition, this aspect of tumor organoid technology is explained better in the Introduction.

      Changes: The results section has been expanded with the description of the scRNA-seq data emphasizing the expected lack of non-tumor cells and the introductory section on tumor organoid technology has been improved to make it clear that currently this generally involves growth of different types of tumor cells only.

      Reviewer #3, Major Comment #2: “Does the quantification from the RT-qPCR analyses for the MYOD1, MYOG and Desmin of the models match that in the samples from which they were derived? Does the RNAseq that was performed on tumor and the culture at the time of the drug screen tie in with this?”

      Reply: The answer is yes. The figure below shows tumor and tumoroid bulk RNA seq of those genes also analyzed by RT-qPCR (i.e., DES, MYOG, and MYOD1). Note that this is also the same stage as for the drug screening. As can be appreciated, the expression of these markers is generally very comparable between tumors and the derived tumoroid models. Note that this also constitutes a nice independent (albeit indirect) verification of the similar degree of heterogeneity issue raised by Reviewer #1 (comment #1). Expression of the markers was lower in the tumoroid models of RMS000HQC and RMS000ETY compared to the primary tumor. In line with this, expression of these genes was also already lower in the early passages of the culture as determined by RT-qPCR (Fig. 2A). Nevertheless, copy-number analysis inferred from whole-genome sequencing showed that the resulting tumoroid models are indeed tumor cells (Suppl. Fig. 2A top panel and Suppl. Fig. 2B lower panel).

      We therefore conclude that the expression of probed marker genes is generally comparable between tumor and tumoroid and that early passage RT-qPCR based expression analysis of these markers can be reflective of the expression in the fully established model.

      *- Rebuttal letter includes corresponding figure here - *

      Changes: None. The expression data are already available within the interactive browser-based Shiny App.

      Reviewer #3, Major Comment #3: “How do the frequencies of SNVs compare with recent studies? Or are the numbers in the risk groups not appropriately represented?”

      Reply: The SNV frequencies are quite comparable to recent studies, with similar differences between risk groups, all as depicted in the new Suppl. Fig 2E. The SNV frequency was calculated from our WGS data following the procedure from the most recent report in pediatric cancer (https://www.biorxiv.org/content/10.1101/2021.09.28.462210v1). Across tumor and tumoroid models we found a somatic mutation frequency of SNVs with a VAF of >0.3 ranging from 0.03 to 1.92 mut/MB (median 0.70 mut/MB) which is comparable to the reported somatic mutation frequency in the afore-mentioned study (median 0.9 mut/MB in RMS). Concerning the risk groups, a recent study (https://pubmed.ncbi.nlm.nih.gov/31699828/) found a significant difference in the tumor mutational burden between fusion-negative (FN) and fusion-positive (FP) RMS (2.6 mut/MB vs. 1.0 mut/MB, respectively) with a higher mutational burden associated with poorer outcome. In our study, the FN-RMS tumoroid models also show a higher mutation frequency compared to the FP-RMS tumoroid models (FN 4 vs. FP 15, p = 0.02, Wilcoxon). Such a difference is also found between the original tumors but without statistical difference (FN 4 vs. FP 15, p = 0.15, Wilcoxon) likely related to the small sample sizes. This underscores the representative nature of the tumoroid models and is of obvious interest to include. We have made the appropriate changes.

      Changes: To include these analyses in the manuscript, we added a new Suppl. Fig. 2E with corresponding Suppl. Fig. legend and a new paragraph in the main text.

      Reviewer #3, Minor Comment #1: “The number of models and success rates would be useful to indicate in the abstract.”

      Reply: We agree.

      Changes: We added this information to the abstract.

      Reviewer #3, Minor Comment #2: “It would be helpful to define the SBS1, 5,and 18 in the figure legends. Do the age related signatures in any way correlate with patient age or aggressivness of tumors?”

      Reply:

      • Agreed. The definitions of SBS1, 5, and 18 have now been included the legends of Fig. 3B and 4A.
      • The age-related signatures SBS1 (but not SBS5) shows a weak albeit significant correlation with patient age only in RMS tumoroid models but not in RMS tumors. Furthermore, concerning aggressiveness, FP-RMS tumors and tumoroid models show a significantly higher contribution of SBS1 (but not SBS5) to their overall somatic mutation frequency compared to FN-RMS tumors and tumoroid models. However, since FP-RMS tumor samples were obtained from older patients (median 14 years versus median 6 years in FN-RMS tumor samples), this observation could also be related to the patient-age and not primarily to the fusion-status. The heterogeneity of samples (e.g., primary therapy-naïve samples versus relapse and therefore pre-treated samples) and the relatively low sample number could be explanations for the lack of a stronger correlation in general. Changes: Added definitions of SBS1, 5, and 18 in the legends of Fig. 3B and 4A. Added text in the Results section to indicate the observed correlations.

      Reviewer #3, Minor Comment #3: “Page 13 line 300 just because the RH30 cell line has TP53 mutation doesn't mean that it was acquired in culture - unless there is specific evidence that supports this.”

      Reply:* We thank the reviewer for this rectification. To our knowledge, there is indeed no specific evidence that this cell line acquired the TP53 mutation during culturing or whether the mutation was already present in the primary tumor the cell line was derived from. *

      Changes: The corresponding statement has been removed.

      We thank reviewer #3 for all the comments. This has resulted in many improvements.

      Besides the changes described above, additional minor changes were made:

      *We have moved the interactive, browser-based Shiny app to a server that is managed by our institute instead of having it hosted on shinyapps.io. We include the new URL in line 556. *

      The data upload to the European Genome-Phenome Archive (EGA) of the data from the initial submission has been completed and the raw sequencing data can now be accessed. The data upload of the scRNA-seq data generated for the revision is currently ongoing. We have therefore adapted and renamed the “Bulk sequencing data availability” section in the Methods in the manuscript (lines 1043 to 1050).

      We updated the code available at https://github.com/teresouza/rms2018-009* following the additional analyses performed for the revision. *

      Supplementary Table 1: The values for row “RMS000FLV” for columns “sample_body_site” and “primary_site_specific” were corrected as this tumor was located in the upper leg and not the upper arm of the patient. Furthermore, we added patient numbers as in the new Table 1 and corrected spelling errors. This does not change any of the conclusions in the manuscript.

      Figure 6A: The protein “P53” was spelled without capital “P” in the initial version. We corrected this.

      We included the recently described Zebrafish RMS PDX models (https://pubmed.ncbi.nlm.nih.gov/31031007/) in the Discussion of RMS models. See lines 507 to 510.

      With the addition of Fig. 2D, the figure legends of Fig. 2A and 2B were moved to the side (Fig. 2A) or below (Fig. 2B) the figure. With the addition of the single-cell copy-number plots, Suppl. Fig. 1 was divided in Suppl. Fig. 1A and 1B.

      Some of the original scale bars in Fig. 2C and Suppl. Fig. 1A were incorrectly labelled and this has now been corrected. This does not change any of the conclusions.

      Minor corrections in the sections Affiliations, Financial support, Author contributions and Conflict of Interests.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Meister et al., describe their methodology in establishing what they term organoid or tumoroid 2D/3D cultures derived from samples of rhabdomyosarcoma (RMS) patient tumours. They have success to varying degrees across the subtypes with greater success in those more clinically aggressive. Their analyses of markers, the somatic genetics and gene expression profiles suggest that they are largely representative of RMS and the tumor samples from which they were derived. Their utility in drug screening and manipulation by knocking out TP53 by CRISP/Cas9 is also demonstrated. The conclusion is that this represents a useful approach for generating patient derived models and a unique resource for preclinical analyses and other research into RMS.

      This is a major piece of work that is well written and presented. The link to interrogate the data worked. I have only a few comments.

      Major comments

      The authors describe the models derived as organoids/tumoroids implying that multiple cell types are represented potentially recreating the tumor microenvironment. Can the authors comment more specifically and demonstrate the extent to which cell types in addition to the tumor cells are represented, viable and are organized through analyses of the original and tumoroid sections (extend fig 2C/supplementary fig) and via analyses of the RNAseq data?

      Does the quantification from the RT-qPCR analyses for the MYOD1, MYOG and Desmin of the models match that in the samples from which they were derived? Does the RNAseq that was performed on tumour and the culture at the time of the drug screen tie in with this?

      How do the frequencies of SNVs compare with recent studies? Or are the numbers in the risk groups not appropriately represented?

      Minor comments

      The number of models and success rates would be useful to indicate in the abstract.

      It would be helpful to define the SBS1, 5,and 18 in the figure legends. Do the age related signatures in any way correlate with patient age or aggressivness of tumors?

      Page 13 line 300 just because the RH30 cell line has TP53 mutation doesn't mean that it was acquired in culture - unless there is specific evidence that supports this.

      Significance

      The significance of this study is in describing how a relatively large number of models of RMS were established plus increasing awareness of the biobank resource and associated data that has been created. The approach, although used in more ad hoc reports of smaller numbers of RMS, represents a useful development for mesenchymal tumors versus the more established development of such models in epithelial cancers. Although a lower success rate than xenografts, it is a useful and practical cost-effective alternative for preclinical testing and research. Likely interest to a speciaclist audience for those involved in the RMS, sarcoma and pediatric cancer field.

      Referees cross-commenting

      OK with the balance of comments for the authors to address. I think the extent to which they are prepared to address the heterogeneity issue, and the results of this for the models, is likely to affect the impact of their study.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript describes the possibility to generate a collection of pediatric rhabdomyosarcoma (RMS) tumor organoid models comprising broad spectrum subtypes from highly aggressive to extremely rare. The authors were able very successfully establish 19 RMS models from 46 pediatric RMS patient samples with 41 % efficiency. All RMS tumoroid models were thoroughly characterized and retained the molecular characteristics of the tumor they are derived from as well as they displayed genetic stability over time. Most of the tested tumors showed long-term propagation potential, reaching passage 40 and remaining stable. Though, establishing time for RMS tumoroid models varied with a median time from acquisition of the tumor sample to successful drug screening being 81 days, highly aggressive tumors were established in as little as 27 days. Also, authors shown us in elegant manner the suitability of RMS tumoroid models for research in two specific ways: via drug screening and CRISPER/Cas9 genome editing.

      Significance

      In summary, the author's work made significant progress in 3D culture and tumor organoid models of mesenchymal origin, being the first collection of tumoroid models from mesenchymal malignant tumors and the second thoroughly characterized tumoroid collection specific for pediatric cancers. Without doubt, biobanked collection of RMS tumoroids will be useful for drug screening as well as molecular editing. Also, these models will be a useful resource for future research and in preclinical and clinical testing therapeutics for RMS. In the future, organoids generated from patients with RMS may lead to precise and personalized treatment.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Meister et al. set out to develop a new organoid preclinical model of rhabdomyosarcoma (RMS). The authors comment that this system would be beneficial for preclinical modeling because it has the ability to maintain the tumor's molecular characteristics. The authors then proved that organoids derived from multiple RMS subtypes resembled their parent tumors using RT-qPCR for characteristic markers, histopathology, copy number profiles, mutational signature analyses, and transcriptional profiling. Importantly, the authors performed long term studies to show that the organoids remain stable over multiple passages and do not change their mutational landscape dramatically. Finally, the authors tested their organoids with known RMS therapeutics and for their ability to be engineered with the CRISPR/Cas9 system. Not surprisingly, the authors found their organoids sensitive to known RMS therapeutics and were successfully able to generate TP53-/- organoids with CRISPR/Cas9, underscoring this organoid system in translatable use. This report nicely describes a method for the establishment of human RMS organoid culture systems that can be leveraged for preclinical testing.

      Major Comments:

      1. The authors provided a foundational validation of their organoids through various methods, and their protocol stands to impact the field of RMS biology. To validate the organoids as recapitulating the primary human tumors, the authors perform analysis on the bulk organoid and bulk human primary tumors. The authors showed through sequencing efforts that the bulk mutational profile and transcriptional profiles do not dramatically change between the parent tumor and organoids. This analysis was done well; however, the authors fail to rigorously illustrate that the organoids maintain tumor cell heterogeneity of the primary human tumors. To rigorously validate the organoid system, the authors should illustrate the organoid culture conditions do not alter the heterogeneity of cells (cell plasticity) compared to that of the primary tumor. A formal assessment of the cellular plasticity in the organoids to the primary tumor would determine how the organoid system either maintains or shifts the cancer cell plasticity because of changes in microenvironment (Oncogene, 2020, 39: 2055-2068). The addition scRNA-seq would illustrate whether the organoids maintain the same populations as the primary tumor or bias for the propagation of specific cell populations at a single cell level and provide more rigorous information about every cell type present.
      2. The authors took great strides to show that the organoids respond to therapeutics similarly to primary tumors. However, Figure 5A could be more transparent with more data labelled in the graph instead of just in the app and the implications of the variable responses could have been explored in the discussion section. Furthermore, for this model to be clinically relevant for pharmacokinetic studies, propagation in mice needs to be shown.
      3. Figure 1 is well put together to graphically demonstrate the process by which organoids were obtained and manipulated. Figure 1B, however, as a graphical summary is a little confusing, and the information would be greatly enhanced by the addition of a comprehensive table. Furthermore, additional information could be added to the table to make it a more inclusive and impactful addition to the paper.
      4. It is quite impactful that the authors were able to actively engineer the organoids with CRISPR/Cas9 and accurately delete TRP53, but controls were not represented in the figure. The experiment should have included a sgRNA targeting a pan-essential gene as a positive control and a non-targeting sgRNA as a negative control. We recommend addition of both controls to the experiment outlined in Figure 6 to increase the validity and rigor of the data presented.
      5. Although the authors provide an insight into a useful preclinical RMS model, the paper lacks mechanistic insight besides cursory description of the model.

      Minor Comments

      1. Figure 3C and 4B are not transparent in their labels and could be altered so that every line has an associated gene in the publication. Furthermore, there are sample specific differences that could be explored in the discussion.
      2. In Supplementary Figure 1, higher magnification inserts are needed to get a closer look at the IHC. Furthermore, the white balance is not the same in all the images and needs to be corrected prior to publication. The difference in white balance can clearly be seen in the last panels depicting IHC for RMS335, where the MYOD1 staining has a yellow background whereas the H&E staining has a white background.
      3. The authors mentioned in line 202 that some of their organoids contain the novel fusion of PAX3 and WWTR1, but this fusion is not indeed novel as it has previously been seen in biphenotypic sinonasal sarcoma (Am J Surg Pathol 2019, 43:747-754).

      Significance

      As has been mentioned previously, this research is impactful to the field of RMS biology because the authors were successfully able to use organoid technology, which has not previously been reported. The authors do a great job of listing current RMS modelling techniques and explaining how their system addresses the pitfalls of the others. Furthermore, this protocol could be expanded to the development of other organoid systems for other sarcomas. The rhabdomyosarcoma field and larger sarcoma community would be keenly interested in this work. It is clear that this system has the potential for use in pre-clinical settings as well as in high-throughput screens, but further validation and increased rigor is required on both fronts.

      It is astounding and the authors should be complimented that they were able to show a median time from patient to drug screen was 81 days! This has enormous potential such as rapid translation of therapies and personalized medicine. That said, the authors must first refine the heterogeneity of the organoids and demonstrate how the organoids reflect the phenotypic and cellular plasticity of the parent tumors. Furthermore, the authors ought to be careful when making priority claims. The authors state that this is the first system to use organoids but should recognize the advances demonstrated by Manzella et al. (Nat Commun, 2020, 11:4629). Additionally, the authors state that this is the first demonstration of pre-clinical models harboring FGFR4V550L mutations; this fails to recognize the prior reported work by several groups (Chen et al., Cancer Cell, 2013, 24:710-24; Manzella et al., Nat Commun, 2020, 11:4629; McKinnon et al., Oncogene, 2018, 37:2630-2644).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): *

      In this manuscript by Wang and colleagues, the authors analyse single-cell RNA-seq (scRNAseq) data by applying transition path theory to infer gene regulatory network (GRN) changes along the transition (reaction coordinate, trajectory) between free energy stable states (i.e. cell types). The work aims to understand how stable cell types, and their regulatory programs (combination of active and repressed genes) switches during differentiation/reprogramming/response (i.e. cell phenotypic transition/CPT). The premise of the work is to assess whether genes within GRNs undergo step-wise repression, state-change and activation (& vice-versa; analogous to SN1) or concurrently regulate gene expression (analogous to SN2). The GRNs are inferred based on highly variable genes and their expression dynamics from RNA velocity over CPT, across 3 scRNA-seq datasets.

      The authors first analyse public scRNA-seq dataset of 3003 human A549 adenocarcinomic basal epithelial cells treated with TGF-b for 0hrs, 8hrs, 1 day and 3 days (4 timepoints). The authors select two stable states (Day0-untreated; Epithelial and Day 3-treatment; Mesenchymal) using local kernel densities and set transition paths using Dijkstra shortest path, dividing state space into Voronoi cells (i.e. reaction coordinate value), and constructed single-cell GRNs based on RNA velocity differences (n=500 genes) and a linear model (from Qiu et al). This GRN is based on expression and velocity estimates, and does not distinguish direct from indirect regulation. Calculating interaction frequency (edges) across two stable states over 4 louvain clusters, the authors find global increase in effective edges that correlates with increased active genes; but with variable trend within inter-cluster edges. To quantify the concerted GRN changes between clusters, the authors utilise a "frustration" score (from Tripathi et al 2020). The average frustration score increases and peaks at day 1 treatment, followed by a decline over terminal stable state (day 3-treatment); similar to interaction frequency trends. The author also separately measure network heterogeneity and repeat analysis using alternative transition matrix. The authors conclude that EMT proceeds through concerted regulation of multiple genes first with an increase in inter-cluster edges, frustration and heterogeneity followed by a decrease into final stable state. The authors apply the analysis to scRNA-seq data from (i) pancreatic endocrine differentiation where Ngn3-low progenitors give rise to Ngn3-high, then Fev-high and into glucagon producing a-endocrine cells; (ii) dendate gyrus; radial glial cell differentiation into nIPCs, neuroblast, immature granule and mature granule cells. In both cases, the authors observe concerted regulation with initial increase in inter-community edges, heterogeneity during differentiation followed by decrease towards final stable state. **

      The study and ideas in the manuscript are interesting and the methods would be potentially be useful. However, there are a few specific and general comments stated below, which the authors should try to address.

      1 • P4: "RC increases first and reaches a peak when cells were treated with TGF-β for about one day, then decreases (Fig. 1G)". It would be better to label the figure with the treatment information. *

      Reply: Thanks for your advice. In the revised manuscript, we analyzed two additional datasets, and moved the EMT result in the supplemental Fig. EV8. In the new Fig. 1d, we marked the cell types along the reaction coordinate.

      *2 • Fig. 1G and EV1D: Why are the trends different? *

      Reply: In the original figures, ____Fig____.1g is the frustration score and EV1D shows the variation of pseudo-Hamiltonian along the reaction coordinate. The frustration score is the focus of this work. We also calculated the pseudo-Hamiltonian since it has been used in the literature. However, we realized that showing both of the results might lead to confusion, so we deleted all pseudo-Hamiltonian results in the revised manuscript.

      * 3 • How is the appropriate community/cluster/Louvain resolution selected? This can have a major impact on number of cell states, types and transition path from initial to final state. *

      Reply: The number of cell states, types and transition path from initial to final state____ are not determined from the community/cluster/Louvain analyses. For the EMT data, we assume most cells in the initial treatment time are epithelial cells, and those in the final time point are mesenchymal cells. For other datasets, we followed the original publications to assign cell types based on known marker expression.

      The Louvain method was applied to coarse grain the gene regulation network, and it does not affect the number of cell states, types and transition path, which were determined separately. To address the reviewer’s question, we also use the Leiden method to adjust the resolution ____(1)____. The resolution does not affect the result. The results are added to Fig. EV12. We tried three different resolution values 0.8,1.0 and 1.2. The number of inter-community edges consistently shows the trend that it increases first then decreases.

      Figure EV12 Cell-specific variation of the number of effective inter-community edges between communities calculated with different resolution parameter values for dentate gyrus neurogenesis (a), pancreatic endocrinogenesis (b), and bone marrow marrow hematopoiesis (c). Each dot represents a cell and the color represents the number of inter-community edges____.

      • * What effect does the Louvain resolution have on e.g. frustration scores? * Reply: The resolution of community division algorithm doesn’t affect the frustration scores, since the frustration score is based on the gene-gene interactions instead of community assignment.

      • * The authors match resolution to samples/timepoints/known prior cell types i.e. 3-4 communities. However it is unclear whether this is enough to describe entire differentiation/transition process. * Reply: This is a good question. In one above reply we have explained how the cell types were determined____. We also agree with the reviewer that these coarse-grained communities cannot reflect the overall heterogeneity and dynamics of the whole process. Notice in most of our analyses (e.g., reaction coordinate and transition paths), we treated the transition as continuous and the distribution of single cell data points in all datasets cover the whole space that involved in cell phenotype transition. The coarse-grained analyses are for further mechanistic insights on how gene regulatory networks are reorganized during the transition process.

      • * Gene selection: The selection based on minimum 20 counts as highly expressed genes is arbitrary and dependent on sequencing depth. Perhaps the authors could show distribution of gene counts for the datasets and have a data-driven filtering criteria * Reply: Thanks for the advice. The number 20 is a default value suggested in the package (scVelo) we use, and in another package dynamo the default number is 30. Following the reviewer’s suggestion (together with the next question on the influence of all highly variable genes), we looked for a data-drive filtering criterion. The method has been described in different tools ____(2-4)____. We first grouped the genes into 20 bins by their mean expression values, and____ scaled their dispersions by subtracting the mean of dispersions and dividing standard deviation of dispersions____. Figure EV9 shows the distribution of the minimum shared counts. ____As one can see, most genes counts are larger than 10, and using a smaller value causes error in the following velocity analysis. Therefore we set the minimum shared counts as 10 in the new results.

      Figure EV9 Shared counts distribution of the datasets. (a) Dentate gyrus neurogenesis; (b) Pancreatic endocrinogenesis; (c) Bone marrow hematopoiesis.

      • * The choice of 500 variable genes (for human A549 cells) is also quite arbitrary. Perhaps, the authors could compare how additional genes (all highly variable genes) affects their analysis and interpretation. * Reply: ____Thanks. Following previous question on shared counts and ____data-driven filtering criteria____,____ we take all the highly variable genes into consideration. The details of gene selection and binarization are given in the Materialss and Methods (Materials and Methods 2) section.

      • * How are other factors (sequencing depth, genes detected, #of cell types, multiple branches) affects the connectivity between communities at different phases of transition/development? * Reply: This is a good question. The A549 EMT dataset has a sequence depth of 40000-50000. The ____dentate gyrus neurogenesis dataset____ has a sequence depth of 56,700 reads. A saturation depth would be close to 1,000,000, but there is a compromise between cell number and depth. There are genes that are not detected even under the saturation reads setting. That is why the preprocessing is needed. On the other hand, the network we inferred include both direct and indirect interaction, so the influence of sequence depth and gene number detected can be reduced to a certain extent. We used a random subset of the selected gene and performed the same analyses. The results are consistent with what we obtained using all the genes (Fig. EV11b). With the new gene selection criteria (Materials and Method 2), our analyses are not related with the number of cell types.

      We did analysis on another beta branch of pancreatic endocrinogenesis data. The other branches show the same results (Fig. EV4). There are two additional branches in the pancreatic endocrinogenesis dataset. It has been reported that the RNA velocity estimation for the epsilon branch is incorrect ____(3)____. There are too few cells in the delta branch for reliable analyses. Therefore we didn’t present results for these two branches.

      Figure EV4 Analyses on the branch of glucagon producing β-cells in pancreatic endocrinogenesis.

      (a) Transition graph based on RNA velocity.

      (b) The RCs and corresponding Voronoi cells. The large colored dots represent the RC points (start from blue and ends in red). The small dots represent cells with color as cell type.

      (c) Frustration score along the RCs.

      (d) Cell-specific variation of effective intercommunity regulation. Each dot represents a cell. Color represents the number of effective intercommunity edges within each cell in the GRN.

        • Are the velocity graph, transition matrix and further shortest path estimation derived in a reduced latent space, and if so, how much (nPCs) and what impact does it have. Presumably, the density estimation is not performed in expression space. Reply: Yes. ____The calculation of transition matrix is based on neighbor information. The calculation of neighbors was in the reduced latent space in scVelo and Dynamo. We performed the same analysis by varying number of principal components. The results are similar because the first several components account for large proportion of variance. Figure R1 shows the results of dentate gyrus neurogenesis with the number of principal components being 10, 20 and 30, respectively. In the revised manuscript, we delete the step of using density estimation constrain to simplify the procedure. __Figure R1 Frustration scorer along RCs (left) and cell specific variation of number of effective intercommunity edges (Each dot represents a cell and color represents the number of effective intercommunity edges) in the GRN within each cell (right) when using different number of PCs in analyses (dentate gyrus neurogenesis): (a) number of PCs is 10.*__

      (b) number of PCs is 20. (c) number of PCs is 30

      * - The figure legends and labels were hard to read. These should be improved for better readability. *

      Reply: Thanks. We modified the figure legends and labels.

      * - A suggestion would be move the initial results section to methods and highlight the biological interpretation. *

      Reply: Thanks for your advice. We moved large part of this section to the Materials and Methods.

      *The authors could highly which GRN and representative genes/edge pairs are highest ranked within inter-community and to overall final stable states. *

      Reply: Thanks. We list some representative gene pairs in the Table. EV 2&EV 3 &EV 4 for different datasets. And we performed gene enrichment analysis for each community.

      * - How does the GRN inference compare to current state-of-the-art GRN inference scRNA-seq methods? *

      Reply: we used the method GRISLI to perform the same analysis ____(5)____. The results are similar to what obtained with our current method (Figure EV6). We want to emphasize that the focus of this work is not on another GRN inference method, but discussing some general principles of GRN reorganization during a cell phenotypic transition process.

      Figure EV6 Analyses of datasets of dentate gyrus neurogenesis (a), pancreatic endocrinogenesis (b), and hematopoiesis (c) based on GRN inferred with GRISLI.

      (a) Frustration score along the RCs of dentate gyrus neurogenesis (left) and cell-specific variation of the number of inter-community edges (right). Each dot represents a cell and color represents the number of inter-community edges in GRN within each cell.

      (b) Same as in panel (a), except for pancreatic endocrinogenesis.

      (c) Same as in panel (a), except for hematopoiesis.

      * - How do extremely noisy/stochastic genes vary in metrics between final stable states? How are the metrics affected by number of cells and stochasticity of expression within a given cluster/community. *

      Reply: To address this question, we selected two genes, Id2 and Cdkn1c, with high variance and compare their distributions in the initial and final states. ____The gene distributions show significant shift between the Ngn3 low EP cells and Alpha cells (Fig. R2 a &b left).____ Then we randomly selected a subset (half) of cells and compared the distributions of these high-variance genes in the sub-population (Fig. R2 a&b right). The results are similar to the full-set results.

      Fig. R2 Comparison of gene distribution in the initial and final states in pancreatic endocrinogenesis. (a) Comparison of the distribution of gene Id2 at the initial and final states (left), and in the randomly selected sub-population at the initial and final states (right). (b) Comparison of the distribution of Cdkn1c at the initial and final states (left), and in the randomly selected sub-population at the initial and final states (right).

      * - Given that the author's approach includes both direct and indirect genes effects, the authors could further prune genes based on existing TF databases or protein-protein validated networks. *Reply: This is a good suggestion. We will work on this idea in future work. As we mentioned, due to constrains of data quality, only tens of transcription factors can be analyzed in these dataset. We list some regulations of transcription factors inferred with current method in Table EV1.

      • *It is unclear which GRNs are already known and which ones are novel and biologically relevant * Reply: We compare some regulations inferred with the method and compare these interactions w____ith some references in Table. EV1____.

      * - It would be good for authors to comment when there are multiple bifurcations instead of A-B transitions. Particularly in datasets with multiple discrete stable states. *Reply: This is a good question.____ In our analysis, we focus on the transition from one stable state to another stable state. For transition process with multiple bifurcations like____ the pancreatic endocrinogenesis, the results are similar across different branches. For the transition that goes through multiple discrete stable states, for example, a transition from state A____à____B____à____C, we expect to observe two peaks in the frustration score and the number of inter-community edges. We added some discussions in the Discussion section.

      • *Another suggestion would be to highlight gene expression of selected markers based on f-regression and mi over the trajectory * Reply: As we modified the criteria of gene selection, we plotted trajectories of some high-variance genes versus the reaction coordinate obtained with different datasets in Fig. EV10 based on current criteria.

      Figure EV10 ____Typical trajectories of high variance genes versus RCs of dentate gyrus neurogenesis (a), pancreatic endocrinogenesis (b) and bone marrow ____hematopoiesis ____(c).

      * - If possible, a proof of principle could be re-analysis of a perturbation scRNA-seq dataset (e.g. where one path/transition path is stalled) *

      Reply: Thanks. This is a really a good suggestion. We will perform more systematic studies in future work.

      * Reviewer #1 (Significance (Required)): Nature and significance of advance: The study and ideas in the manuscript are interesting and the methods would be potentially be useful to community. Compare to existing published knowledge: *

      *Audience: Predominantly computational audience *

      *Your Expertise: PI with background in experimental, computational biology and expertise in single-cell genomic tools and developmental biology *

      *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Understanding the cellular and molecular basis of cell type or cell state transitions occurring during development or reprogramming is a fundamental challenge. scRNA-seq has provided a window into gene expression programs across thousands of cells undergoing such transitions. Wang and colleagues leverage scRNA-seq and develop an approach to reverse engineer gene regulatory network underlying cells along a path from one cell type/state to another, and characterize community-level properties of this network associated with various stages of the cell phenotype transition. The study is innovative and rigorous, and their results point to how intercommunity interactions increase and then decrease, indicating a concerted regulatory rewiring that orchestrates transitions. Application of their approach to three different datasets also shows that this trend is consistent across three different transitions and maybe a general trend. However, there are some major and minor concerns that need to be addressed.

      **Major comments and questions**

      1. The analogy to SN1 and SN2 mechanisms of chemical bond formation is very nice.
      2. What is the basis for the two statements made in paragraph 3 of Introduction (beginning with "A question arises ...") about transitions being sequential or concurrent? Please *Reply: Thanks. We added references in this paragraph.

      * 2.1. Provide references to previous experimental and computational studies that have investigated developmental and reprogramming gene expression programs. *

      Reply: Thanks. We added a paragraph in the Introduction.

      *

      2.2. Describe specific examples of findings that support the two possible transitions highlighted here. Why couldn't transitions happen through an entirely gradual process involving changes to overlapping subsets of genes. *

      Reply: Thanks. In the review paper of Naomi Moris et. al., they proposed the hypothesis that cell phenotype transition is similar to a chemical reaction ____(6)____. Thus we extrapolate this hypothesis and test it in our study. For the example of SN1 mechanism, ____Kalkan et al. showed that mouse embryonic stem cells can exit from ____naïve pluripotency____ but remain uncommitted ____(7)____.

      Just like the SN1 and SN2 mechanisms are two extremes in chemical reactions and there are cases lie in between, for cell phenotypic transitions we agree with the reviewer that such gradual process may exist. Actually the result in Fig. EV4d shows that the frustration score remains flat for the Fev+ ____à____ Beta transition, suggesting a possible gradual process. With the analyses provided in this work, such as the reaction coordinate, frustration score, heterogeneity, and inter-/intra- community edges, one may perform more systematic studies on a larger number of datasets and enumerate/classify possible patterns of transitions.

      • Please make plots of the number of effective intra-community edges vs. number of active genes to support the statement that these two numbers are correlated. *

      Reply: We plotted the corresponding intra-community active genes and calculated its correlation coefficient with the number of effective intra-community edges in dentate gyrus neurogenesis (Fig. EV1d). ____The correlation coefficients are 0.91,0.96, 0.99 and 0.96 for community 0, 1, 2 and 3 separately.

      * A bunch of notations are not clear:

      4.1. What is the "r" in "strongest intercommunity interactions at r = 10 (Fig. 1F)"? Is it the same as the "r" mentioned in the Methods section? *

      Reply: r____ is the index number of the discretized reaction coordinate. We added it when we define the reaction coordinate. We modified the conflict usage of r in Materials and Method 4.

      4.2. What is "s_i" in "cell-specific effective matrix, Fbar_ij = (2*s_i - 1)*F_ij"? Also, that description of F_ij, f_ij, and H should be moved to the Methods section, and a more high-level, intuitive description should instead be included in this Results paragraph. Reply: represent the binarized gene expression state. is 0 for when gene is in low expression level (silence) and is 1 when gene is in high express level (active). We modified this part following your advice.

      * How were the h_f and h_m thresholds chosen? *

      Reply: and are based on the distribution of each dataset. Following suggestions from another reviewer, we modified this part. All the highly variable genes were selected and the genes were binarized with the Silverman’s bandwidth method and ____K____means (Materials and Methods 2).

      * What is the "density of each single cell" ("_t")? The formulation of the penalty of the distance between cells i and j (the expression with -logP_ij...) is unclear. What is the intuition behind it? What is r? How were the values of r (0.5 and 0.8) chosen? *

      Reply: The probability density of cells in the expression space is based on the kernel density estimation. Intuitively, a region in the expression space with more cells is more likely passed by more cell trajectories. The values are based on the distribution of kernel density estimation in different datasets.

      In the modified manuscript, we used trajectory simulation and deleted this assumption for simplification.

      * One of the reasons the authors state to justify the choice of PLSR is "In the scRNA dataset, the number of genes is often comparable to or larger than the number of cells." This is not true most of the time. In nearly all recent studies, the number of cells is way larger than the number of genes measured. *

      Reply: The PLSR method definitely can be used for the data whose number of cells is larger than the number of genes. Also the PLSR method was applied on cells that are the k nearest neighbors of each reaction coordinate, which are a subset of the whole dataset (Materials and Methods 5). While we mainly presented results with the PLSR method, in this revised manuscript we also added results with another method of GRISLI (Materials and Methods 9). The results are similar with what we obtained with PLSR.

      * There is a fleeting reference to a nice previous finding that supports their observations: "several lines of evidence support that EMT proceeds through a concerted mechanism. Indeed, both in vivo and in vitro studies have identified intermediate states of EMT that have co-expressed epithelial and mesenchymal genes (Pastushenko et al, 2018; Zhang et al, 2014)". The authors should thoroughly survey the literature related to EMT transition, development of pancreatic endocrine cells, and development of the granule cell lineage in dentate gyrus, to find more previously identified molecular/cellular features relevant to cell state/type transitions, compared and contrasted with findings from this study. *

      Reply: Thanks. We added references on these cell phenotype transitions and modified the corresponding part. We do want to point out that the main focus of this work is that all these processes share a common feature of transient increase of intercommunity interactions.

      * What is the "dynamo" package, which is supposed to contain a Python notebook? As of now, the code and data have not been made available. Both need to be released along with thorough documentation on how to run the code to reproduce the analyses described here. *Reply: Thanks. Dynamo is a python package accompanying our recent publication ____(8)____. We uploaded the code on Github and added the link of Dynamo.

      * **Minor comments and questions**

      1. Replace "confliction" throughout the manuscript with "conflict" or "conflicting" as appropriate. *

      Reply: Thanks. We modified them.

      * Paragraph two of the Introduction (beginning with "Another example of transitions ...") is missing multiple references, esp. for the last four sentences. *

      Reply: Thanks. We added references.

      * There are direct quotes from previous papers like "predicts the future state of individual cells on a timescale of hours". The authors are highly encouraged to check for usage of exact phrasing using available text software such as iThenticate. *

      Reply____: ____Thanks a lot for pointing out this severe mistake. We re-edited the manuscript and checked with iThenticate. *

      *

      • "Each community contains both E and M genes": what does this mean? *

      Reply: The E (M) genes are defined as those genes that are active or have high expression levels in epithelial (mesenchymal) state or sample. As we reorganized the manuscript, we add this explanation for all datasets in the caption of Fig.1i.*

      *

      • Reference to Qui 2021 is missing in the "Path analysis" subsection under Methods. *

      Reply: We added it in the Methods.

      * Fix: "transition between the cells that their sample time points are successive" in Methods. *

      Reply: Thanks. ____We modified it.

      * In Methods, under "Network inference", it is "partial least square regression" (not *least* s square). *

      Reply: Thanks. We modified it.

      * Figure 1: The cyan, magenta, and lime in 1C are very hard to see and, perhaps, the grey of the points can be made lighter. Also, change the red and green colors for the arrows in 1I to something else. These colors are not colorblind-friendly. *

      Reply: Thanks. We re-plotted the figures and changed the colormap.*

      *

      • Periods and commas are missing at several places. Reply: Thanks. We modify these and re-edit the manuscript.

      Reviewer #2 (Significance (Required)):

      The study uses RNA-velocity calculated from scRNA-seq data in an inventive way to characterize paths that reflect cell phenotype transitions. Then, a sparse gene regulatory network is reverse engineered from the data and the community structure within this network is examined at various stages along the transition to make observations about inter- and intra-community regulation and network "frustration". However, the study lacks the context of existing literature in terms of previous work studying cell transitions both experimentally and computationally. Adding this context (as suggested in the comments) will considerably improve the utility and significance of the findings. Overall, this study will be of broad interest to researchers interested in development and reprogramming as well as computational scientists developing and applying methods for scRNA-seq data analysis, trajectory inference, and network reconstruction. All the comments and questions raised here are based on my background and expertise in omics data (including scRNA-seq) analysis and network biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The authors analyze three datasets of Single cell RNA velocity measured during phenotypic transition. They infer the gene regulatory network in each case and characterize the transition between the initial and final expression states (in which different sets of genes are expressed). Their motivating question was to find whether during such transitions first genes characterizing the initial state are no longer expressed and only then the genes associated with the final state start expressing or alternatively there is gradual transition through an intermediate state in which subsets of both initial and final state genes are transiently expressed.

      They define a measure of regulatory frustration representing the mismatch between regulatory signals a gene receives and its current expression state. They conclude that phenotypic transitions involve transient interactions between otherwise non-interacting gene modules and a temporary increase of gene frustration, which is relaxed once the final expression state is reached.

      The study uses of advanced inference and machine learning methods.

      I find the question studied in this manuscript interesting, opening avenue to further questions and studies and relevant to different scientific communities. Personally I think that the focus of the paper should be the exposition of the methods used this manuscript would benefit from a longer format, but that depends of course on the journal they are aiming at. *

      *

      Statistical analysis is missing. Especially since the authors mention the potential of over-fitting due to large number of genes (on the order of the number of cells) - I think the authors should provide a sensitivity analysis testing how sensitive are the conclusions to the choice of cells or genes by applying the methods to subsets of the cells / genes. *

      Reply: Thanks. For the subset of cells, we randomly selected cells from the dataset and performed the analyses (Fig. EV11a). For the subset of genes, we selected a subset of genes randomly and performed the analyses (Fig. EV 11b). We found the results are not affected. We also perform another statistical analysis by varying the value of resolution in community detection algorithm. And we found that the conclusion on variation of inter-community edges is not affected (Fig. EV12).

      Figure EV11 Statistical analyses of dentate gyrus neurogenesis. Each dot represents a cell and color represents the number of inter-community edges.

      (a) Frustration score along the RCs (left) and cell-specific variation of the number of inter-community edges (right) of a randomly selected sub-population of 2000 cells (from a total of 3184 cells);

      (b) Frustration score along the RCs (left) and cell-specific variation of the number of inter-community edges) (right) of cells on the space of 400 randomly selected genes (from a total of 678 genes).

      *What is the meaning of the distribution in the frustration plots? *

      Reply: For each cell we calculated a frustration score. Therefore for cells in each Voronoi cell (which is a geometric cell, don’t be confused with the biological “cells”) along the reaction coordinate (Fig.1d, Fig. 2b &2g), we obtained a distribution of the frustration scores.*

      In general, the conclusions are well-justified, but I think some statements in the discussion are inaccurate: "intercommunity interactions of a GRN are indeed minimized' - are they minimal or are they only lower at the stable states? There are two stable states - for which of them is intercommunity interaction lower? *

      Reply: Thank. We agree with the reviewer and modified the writing. Comparing with the transition state, the number of intercommunity interactions is less for the stable states. ____The datasets' quality are not high enough for us to investigate whether ____"intercommunity interactions of a GRN are indeed minimized”.*

      It is written in the discussion that 'for all three datasets frustration decreases with differentiation', but then Fig. 1g shows the opposite (final state is more frustrated than initial state). It is interesting to discuss the differences between the datasets analyzed in that respect and what could cause transition to a more frustrated state. I suggest that the authors also refer in the discussion to related questions and possible follow-up studies, such as: what determines the duration of the phenotypic transition? A relevant number is the switching time of a single gene. *

      Reply: Good suggestion. Compared to other datasets, we found that the result of EMT shows larger variances. The relative difference of the frustration score is also affected by the GRN inference algorithm. For example, the difference between initial and final frustration scores of the pancreatic endocrinogenesis is more significant when using the GRISLI method (Figure EV6b). Given these, the trend that the frustration scores in the transition states transiently increase keep consistent.

      Our conclusion is limited by the quality of the data. So we delete this part of discussion in the manuscript.

      Qiu et al. have shown that splicing-based ____RNA velocities are relative, while metabolic-labeling-based RNA velocities are more quantitative and accurate____(8)____. We will re-analyze this problem if data with metabolic labeling becomes available.

      * The authors mention at the end that the networks can often reach multiple final states from a common initial states. Do such transitions share some of their path (and in particular the intermediate frustrated state)? Given the intermediate connected state, it would be interesting to characterize the network stability to perturbations. *

      Reply: This is a very important question. To reliably address these questions, we need higher quality data. We plan to characterize the network stability to perturbations in future studies, while in our recent paper using a full nonlinear modeling framework____(8)____, we performed in silico perturbations.

      * While interesting, the manuscript itself is unfortunately hard to read and would benefit from major editing, including better exposition of the science and language editing. *

      Reply: Thanks. We revised the manuscript extensively.*

      Methods: Description of PCA and 'revised finite temperature string method' are missing in the Methods section. *

      Reply:____ Thanks. PCA is used in RNA velocity analysis for dimension reduction. We added this in Materials and Methods 3. The revised string method is in Materials and Methods ____4.

      *

      Some examples:

      Figure captions are very short and often non-informative. Some variables are not defined (or only defined later on) and the reader then needs to guess their meaning: it took me a while to understand what is 'r' in Fig. 1f and what 'r=10' (p. 4) means. *

      Reply: Thanks. ____r____ represents the index number of reaction coordinates. We added this in the manuscript where we define reaction coordinates.*

      p. 4: what are 'f' (as opposed to F) and 's_ij' and 's_j' (expression states?) Or is fs_ij one variable? What does a Hamiltonian of a cell mean (p. 4, bottom)? *

      Reply: is the regulation of gene ____j on gene i, and is the expression state of gene i (0 for silence, and 1 for active expression). is the frustration value of regulation from gene j to gene i.

      The pseudo Hamiltonian value is proposed in the literature as an analogy of ____the magnetic systems following the work of Boolean model in EMT ____(9)____. A high Hamiltonian value indicates that the cell is in an unstable state. In the original manuscript we included this quantity since it has been discussed in the literature. However we found it causes confusion and is not necessary for our discussions, so we removed the pseudo-Hamiltonian results in the revised manuscript. * P. 4: how are 'E and M genes' defined? *

      Reply: The E (M) genes are defined as those genes that are active or have high expression levels at the epithelial (mesenchymal) state or sample. We explained our general strategy in the caption of Fig.1i . * What does 'network heterogeneity' (p. 5) mean? *

      Reply: Network heterogeneity measures how homogenously the connections are distributed among the genes____(10)____. A high heterogeneity ____means that some genes have high degree of connectivity (the so-called hubs), while some have low degree of connectivity.

      *

      Fig. 1 is too tiny and hard to read and details are missing. *

      Reply: Thanks. We modified this figure and caption.*

      A glossary for all the acronyms used would be very helpful. *

      Reply: Thanks. We added glossary in the manuscript.*

      Language (some examples):

      p. 5 bottom: Another system is on development... invitro -> in vitro

      p. 6: 'measure on developmental potential' -> measure of... *

      Reply: Thanks. We modified these and re-edited the whole manuscript.*

      Reviewer #3 (Significance (Required)):

      This study presents a methodological advance in demonstrating the application of data analysis methods to study developmental phenotypic transitions. High throughput measurements and computation power available today enable putting to test theoretical conjectures, as made by Waddington. I think this is a promising line of research, which could be used to further develop the computational methods as well as to further our understanding of developmental transitions and potentially develop associated mathematical modeling frameworks.

      This study should be of interest to a diverse readership composed of developmental biologists as well as to quantitative biologists and CS researchers applying optimization techniques and data analysis methods to high-throughput biological data.

      I am not an expert on the computational methods applied in this manuscript and hence cannot assess their correct use and statistical analysis.

      *

      1. Traag VA, Waltman L, & van Eck NJ (2019) From Louvain to Leiden: guaranteeing well-connected communities. Scientific Reports 9(1):5233.
      2. Stuart T, et al. (2019) Comprehensive Integration of Single-Cell Data. Cell 177(7):1888-1902.e1821.
      3. Bergen V, Lange M, Peidli S, Wolf FA, & Theis FJ (2020) Generalizing RNA velocity to transient cell states through dynamical modeling. Nature Biotechnology 38(12):1408-1414.
      4. Wolf FA, Angerer P, & Theis FJ (2018) SCANPY: large-scale single-cell gene expression data analysis. Genome Biology 19(1):15.
      5. Aubin-Frankowski P-C & Vert J-P (2020) Gene regulation inference from single-cell RNA-seq data with linear differential equations and velocity inference. Bioinformatics (Oxford, England) 36(18):4774-4780.
      6. Moris N, Pina C, & Arias AM (2016) Transition states and cell fate decisions in epigenetic landscapes. Nature reviews. Genetics 17(11):693-703.
      7. Kalkan T, et al. (2017) Tracking the embryonic stem cell transition from ground state pluripotency. Development 144(7):1221-1234.
      8. Qiu X, et al. (2022) Mapping Transcriptomic Vector Fields of Single Cells. Cell 185(4):690-711.
      9. Font-Clos F, Zapperi S, & La Porta CAM (2018) Topography of epithelial–mesenchymal plasticity. Proceedings of the National Academy of Sciences 115(23):5902-5907.
      10. Gao J, Barzel B, & Barabási A-L (2016) Universal resilience patterns in complex networks. Nature 530(7590):307-312.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The authors analyze three datasets of Single cell RNA velocity measured during phenotypic transition. They infer the gene regulatory network in each case and characterize the transition between the initial and final expression states (in which different sets of genes are expressed). Their motivating question was to find whether during such transitions first genes characterizing the initial state are no longer expressed and only then the genes associated with the final state start expressing or alternatively there is gradual transition through an intermediate state in which subsets of both initial and final state genes are transiently expressed.

      They define a measure of regulatory frustration representing the mismatch between regulatory signals a gene receives and its current expression state. They conclude that phenotypic transitions involve transient interactions between otherwise non-interacting gene modules and a temporary increase of gene frustration, which is relaxed once the final expression state is reached.

      The study uses of advanced inference and machine learning methods.

      I find the question studied in this manuscript interesting, opening avenue to further questions and studies and relevant to different scientific communities. Personally I think that the focus of the paper should be the exposition of the methods used this manuscript would benefit from a longer format, but that depends of course on the journal they are aiming at.

      Statistical analysis is missing. Especially since the authors mention the potential of over-fitting due to large number of genes (on the order of the number of cells) - I think the authors should provide a sensitivity analysis testing how sensitive are the conclusions to the choice of cells or genes by applying the methods to subsets of the cells / genes.

      What is the meaning of the distribution in the frustration plots?

      In general, the conclusions are well-justified, but I think some statements in the discussion are inaccurate: "intercommunity interactions of a GRN are indeed minimized' - are they minimal or are they only lower at the stable states? There are two stable states - for which of them is intercommunity interaction lower?

      It is written in the discussion that 'for all three datasets frustration decreases with differentiation', but then Fig. 1g shows the opposite (final state is more frustrated than initial state). It is interesting to discuss the differences between the datasets analyzed in that respect and what could cause transition to a more frustrated state. I suggest that the authors also refer in the discussion to related questions and possible follow-up studies, such as: what determines the duration of the phenotypic transition? A relevant number is the switching time of a single gene.

      The authors mention at the end that the networks can often reach multiple final states from a common initial states. Do such transitions share some of their path (and in particular the intermediate frustrated state)? Given the intermediate connected state, it would be interesting to characterize the network stability to perturbations. While interesting, the manuscript itself is unfortunately hard to read and would benefit from major editing, including better exposition of the science and language editing.

      Methods: Description of PCA and 'revised finite temperature string method' are missing in the Methods section.

      Some examples:

      Figure captions are very short and often non-informative. Some variables are not defined (or only defined later on) and the reader then needs to guess their meaning: it took me a while to understand what is 'r' in Fig. 1f and what 'r=10' (p. 4) means.

      p. 4: what are 'f' (as opposed to F) and 's_ij' and 's_j' (expression states?) Or is fs_ij one variable? What does a Hamiltonian of a cell mean (p. 4, bottom)?

      P. 4: how are 'E and M genes' defined?

      What does 'network heterogeneity' (p. 5) mean?

      Fig. 1 is too tiny and hard to read and details are missing.

      A glossary for all the acronyms used would be very helpful.

      Language (some examples):

      p. 5 bottom: Another system is on development... invitro -> in vitro

      p. 6: 'measure on developmental potential' -> measure of...

      Significance

      This study presents a methodological advance in demonstrating the application of data analysis methods to study developmental phenotypic transitions. High throughput measurements and computation power available today enable putting to test theoretical conjectures, as made by Waddington. I think this is a promising line of research, which could be used to further develop the computational methods as well as to further our understanding of developmental transitions and potentially develop associated mathematical modeling frameworks.

      This study should be of interest to a diverse readership composed of developmental biologists as well as to quantitative biologists and CS researchers applying optimization techniques and data analysis methods to high-throughput biological data.

      I am not an expert on the computational methods applied in this manuscript and hence cannot assess their correct use and statistical analysis.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Understanding the cellular and molecular basis of cell type or cell state transitions occurring during development or reprogramming is a fundamental challenge. scRNA-seq has provided a window into gene expression programs across thousands of cells undergoing such transitions. Wang and colleagues leverage scRNA-seq and develop an approach to reverse engineer gene regulatory network underlying cells along a path from one cell type/state to another, and characterize community-level properties of this network associated with various stages of the cell phenotype transition. The study is innovative and rigorous, and their results point to how intercommunity interactions increase and then decrease, indicating a concerted regulatory rewiring that orchestrates transitions. Application of their approach to three different datasets also shows that this trend is consistent across three different transitions and maybe a general trend. However, there are some major and minor concerns that need to be addressed.

      Major comments and questions

      1. The analogy to SN1 and SN2 mechanisms of chemical bond formation is very nice.
      2. What is the basis for the two statements made in paragraph 3 of Introduction (beginning with "A question arises ...") about transitions being sequential or concurrent? Please

      2.1. Provide references to previous experimental and computational studies that have investigated developmental and reprogramming gene expression programs.

      2.2. Describe specific examples of findings that support the two possible transitions highlighted here. Why couldn't transitions happen through an entirely gradual process involving changes to overlapping subsets of genes.

      1. Please make plots of the number of effective intra-community edges vs. number of active genes to support the statement that these two numbers are correlated.
      2. A bunch of notations are not clear:

      4.1. What is the "r" in "strongest intercommunity interactions at r = 10 (Fig. 1F)"? Is it the same as the "r" mentioned in the Methods section?

      4.2. What is "s_i" in "cell-specific effective matrix, Fbar_ij = (2s_i - 1)F_ij"? Also, that description of F_ij, f_ij, and H should be moved to the Methods section, and a more high-level, intuitive description should instead be included in this Results paragraph.

      1. How were the h_f and h_m thresholds chosen?
      2. What is the "density of each single cell" ("⍴_t")? The formulation of the penalty of the distance between cells i and j (the expression with -logP_ij...) is unclear. What is the intuition behind it? What is r? How were the values of r (0.5 and 0.8) chosen?
      3. One of the reasons the authors state to justify the choice of PLSR is "In the scRNA dataset, the number of genes is often comparable to or larger than the number of cells." This is not true most of the time. In nearly all recent studies, the number of cells is way larger than the number of genes measured.
      4. There is a fleeting reference to a nice previous finding that supports their observations: "several lines of evidence support that EMT proceeds through a concerted mechanism. Indeed, both in vivo and in vitro studies have identified intermediate states of EMT that have co-expressed epithelial and mesenchymal genes (Pastushenko et al, 2018; Zhang et al, 2014)". The authors should thoroughly survey the literature related to EMT transition, development of pancreatic endocrine cells, and development of the granule cell lineage in dentate gyrus, to find more previously identified molecular/cellular features relevant to cell state/type transitions, compared and contrasted with findings from this study.
      5. What is the "dynamo" package, which is supposed to contain a Python notebook? As of now, the code and data have not been made available. Both need to be released along with thorough documentation on how to run the code to reproduce the analyses described here.

      Minor comments and questions

      1. Replace "confliction" throughout the manuscript with "conflict" or "conflicting" as appropriate.
      2. Paragraph two of the Introduction (beginning with "Another example of transitions ...") is missing multiple references, esp. for the last four sentences.
      3. There are direct quotes from previous papers like "predicts the future state of individual cells on a timescale of hours". The authors are highly encouraged to check for usage of exact phrasing using available text software such as iThenticate.
      4. "Each community contains both E and M genes": what does this mean?
      5. Reference to Qui 2021 is missing in the "Path analysis" subsection under Methods.
      6. Fix: "transition between the cells that their sample time points are successive" in Methods.
      7. In Methods, under "Network inference", it is "partial least square regression" (not least s square).
      8. Figure 1: The cyan, magenta, and lime in 1C are very hard to see and, perhaps, the grey of the points can be made lighter. Also, change the red and green colors for the arrows in 1I to something else. These colors are not colorblind-friendly.
      9. Periods and commas are missing at several places.

      Significance

      The study uses RNA-velocity calculated from scRNA-seq data in an inventive way to characterize paths that reflect cell phenotype transitions. Then, a sparse gene regulatory network is reverse engineered from the data and the community structure within this network is examined at various stages along the transition to make observations about inter- and intra-community regulation and network "frustration". However, the study lacks the context of existing literature in terms of previous work studying cell transitions both experimentally and computationally. Adding this context (as suggested in the comments) will considerably improve the utility and significance of the findings. Overall, this study will be of broad interest to researchers interested in development and reprogramming as well as computational scientists developing and applying methods for scRNA-seq data analysis, trajectory inference, and network reconstruction. All the comments and questions raised here are based on my background and expertise in omics data (including scRNA-seq) analysis and network biology.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this manuscript by Wang and colleagues, the authors analyse single-cell RNA-seq (scRNAseq) data by applying transition path theory to infer gene regulatory network (GRN) changes along the transition (reaction coordinate, trajectory) between free energy stable states (i.e. cell types). The work aims to understand how stable cell types, and their regulatory programs (combination of active and repressed genes) switches during differentiation/reprogramming/response (i.e. cell phenotypic transition/CPT). The premise of the work is to assess whether genes within GRNs undergo step-wise repression, state-change and activation (& vice-versa; analogous to SN1) or concurrently regulate gene expression (analogous to SN2). The GRNs are inferred based on highly variable genes and their expression dynamics from RNA velocity over CPT, across 3 scRNA-seq datasets.

      The authors first analyse public scRNA-seq dataset of 3003 human A549 adenocarcinomic basal epithelial cells treated with TGF- for 0hrs, 8hrs, 1 day and 3 days (4 timepoints). The authors select two stable states (Day0-untreated; Epithelial and Day 3-treatment; Mesenchymal) using local kernel densities and set transition paths using Dijkstra shortest path, dividing state space into Voronoi cells (i.e. reaction coordinate value), and constructed single-cell GRNs based on RNA velocity differences (n=500 genes) and a linear model (from Qiu et al). This GRN is based on expression and velocity estimates, and does not distinguish direct from indirect regulation. Calculating interaction frequency (edges) across two stable states over 4 louvain clusters, the authors find global increase in effective edges that correlates with increased active genes; but with variable trend within inter-cluster edges. To quantify the concerted GRN changes between clusters, the authors utilise a "frustration" score (from Tripathi et al 2020). The average frustration score increases and peaks at day 1 treatment, followed by a decline over terminal stable state (day 3-treatment); similar to interaction frequency trends. The author also separately measure network heterogeneity and repeat analysis using alternative transition matrix. The authors conclude that EMT proceeds through concerted regulation of multiple genes first with an increase in inter-cluster edges, frustration and heterogeneity followed by a decrease into final stable state. The authors apply the analysis to scRNA-seq data from (i) pancreatic endocrine differentiation where Ngn3-low progenitors give rise to Ngn3-high, then Fev-high and into glucagon producing -endocrine cells; (ii) dendate gyrus; radial glial cell differentiation into nIPCs, neuroblast, immature granule and mature granule cells. In both cases, the authors observe concerted regulation with initial increase in inter-community edges, heterogeneity during differentiation followed by decrease towards final stable state.

      The study and ideas in the manuscript are interesting and the methods would be potentially be useful. However, there are a few specific and general comments stated below, which the authors should try to address.

      • P4: "RC increases first and reaches a peak when cells were treated with TGF-β for about one day, then decreases (Fig. 1G)". It would be better to label the figure with the treatment information. • Fig. 1G and EV1D: Why are the trends different? • How is the appropriate community/cluster/Louvain resolution selected? This can have a major impact on number of cell states, types and transition path from initial to final state. • What effect does the Louvain resolution have on e.g. frustration scores? • The authors match resolution to samples/timepoints/known prior cell types i.e. 3-4 communities. However it is unclear whether this is enough to describe entire differentiation/transition process. • Gene selection: The selection based on minimum 20 counts as highly expressed genes is arbitrary and dependent on sequencing depth. Perhaps the authors could show distribution of gene counts for the datasets and have a data-driven filtering criteria • The choice of 500 variable genes (for human A549 cells) is also quite arbitrary. Perhaps, the authors could compare how additional genes (all highly variable genes) affects their analysis and interpretation. • How are other factors (sequencing depth, genes detected, #of cell types, multiple branches) affects the connectivity between communities at different phases of transition/development? • Are the velocity graph, transition matrix and further shortest path estimation derived in a reduced latent space, and if so, how much (nPCs) and what impact does it have. Presumably, the density estimation is not performed in expression space.

      • The figure legends and labels were hard to read. These should be improved for better readability.
      • A suggestion would be move the initial results section to methods and highlight the biological interpretation. The authors could highly which GRN and representative genes/edge pairs are highest ranked within inter-community and to overall final stable states.
      • How does the GRN inference compare to current state-of-the-art GRN inference scRNA-seq methods?
      • How do extremely noisy/stochastic genes vary in metrics between final stable states? How are the metrics affected by number of cells and stochasticity of expression within a given cluster/community.
      • Given that the author's approach includes both direct and indirect genes effects, the authors could further prune genes based on existing TF databases or protein-protein validated networks.
      • It is unclear which GRNs are already known and which ones are novel and biologically relevant
      • It would be good for authors to comment when there are multiple bifurcations instead of A-B transitions. Particularly in datasets with multiple discrete stable states.
      • Another suggestion would be to highlight gene expression of selected markers based on f-regression and mi over the trajectory
      • If possible, a proof of principle could be re-analysis of a perturbation scRNA-seq dataset (e.g. where one path/transition path is stalled)

      Significance

      Nature and significance of advance: The study and ideas in the manuscript are interesting and the methods would be potentially be useful to community.

      Compare to existing published knowledge: -

      Audience: Predominantly computational audience

      Your Expertise: PI with background in experimental, computational biology and expertise in single-cell genomic tools and developmental biology

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We are very grateful to the three referees for their constructive comments and suggestions which have helped improve the quality of our manuscript.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      In the publication HAT-field: a very cheap, robust and quantitative point-of-care serological test for Covid-19 by Joly and Ribes the authors describe an adaption and an improved protocol to their previously published haemagglutination based test to detect antibodies to SARS-CoV-2 in patient blood (Towsend et al., 2021). In detail, they analyzed the effect of several adaptions including buffer optimization, plate coating, usage of patient whole blood instead of washed RBCs and plasma. Additionally they tested different temperatures and stability of the reagents, namely the nanobody-RBD construct IH4-RBD. For validation they compared their optimized HAT-field assay with Jurkat-S&R as a FACS-based assay.

      Major comments:

      Introduction: This section is rather short and could benefit from a broader overview of currently established methods and assays to detect appropriate immune responses against SARS-CoV-2. The author are advised to summarize the current literature in the field more comprehensively and not only focus on their own work.

      Response: Hundreds of different tests to monitor immune responses against SARS-CoV-2 have been described to date, and the literature on these various tests is vast, with new articles coming out almost on a daily basis. We would not feel either that the introduction of our rather technical paper would benefit from being lengthened by such a review of the current literature, or even competent to carry out such a summary. Following the referee’s suggestion, we have, however, introduced a new sentence and given three references providing relatively recent overviews on the subject of immune-monitoring.

      Cross-reactivity with IH4-RBD. In Figure 6, the authors highlight the samples in red and orange that showed cross-reactivity with IH4-RBD. In their discussion, however, the authors state that only 2 of 60 (3%) were cross-reactive. In making this statement, they ignore the proportion of cross-reactive samples that were also positive in the Jurkat S&R assay. Therefore, the authors should acknowledge in the discussion that the actual number of cross-reactive samples was higher.

      Response*: The statement in the discussion about 2 cross reactive samples out of 60 concerns the results obtained after an incubation of one hour under normal gravity, and not the two red dots in each of the three graphs of figure 6, which correspond to the two negative samples which gave false-positive results in HAT plasma titrations after spinning (Figure 6C), for which we correctly state in the discussion that 12 samples showed cross-reactivity on IH4 alone. The data presented in Figure 6B corresponds to HAT-field after spinning, for which we correctly state in the discussion that 5 out of 60 showed cross-reactivity (4 orange dots + 1 red dot, the second red dot having a score of 0, in accordance with the fact that this sample showed no cross reaction on IH4 alone in HAT-field after spinning). *

      *To try to prevent this possible confusion, we have now clarified what data we are referring to at the start of that paragraph in the discussion. *

      Quantitative Assay. Since the HAT assay does not allow determination of the absolute number of antibodies reactive to SARS-CoV-2 in the blood samples, the authors should refrain from claiming that the HAT-field is a quantitative assay.

      Response*: Since immune sera are inherently polyclonal, they contain a multitude of different types of antibodies of different affinities and avidities, and we are not aware of any technique that allows to determine the “absolute number” of antibodies directed against a given antigen in such samples. *

      *For many serological tests, including ELISA and the initial protocol of HAT, serum or plasma titrations are used as a means to obtain what is widely considered as a quantitative evaluation of the amounts of antibodies in blood samples. Even FACS-based assays such as the Jurkat-S&R-flow test we have used, are commonly considered as quantitative but those only provide relative results and not absolute numbers. *

      We perceive that the close correlations we find between the results of the HAT-field protocol and those of the Jurkat-S&R-flow test as well as with serum titrations using the standard HAT protocol warrants considering the results of HAT-field as being as quantitative as those obtained with all those other tests.

      Morphological read out For field application, the morphological description of the observed deposits ("teardrop" vs. "button") could be problematic and might lead to bias depending on the user. Thus, the authors should provide a clearer description for phenotype classification.

      Response: We have now introduced a specific paragraph detailing how to score HAT assays in the Methods section, as well as a new figure providing a graphic description of positive, partial and negative RBCs deposits.

      Minor comments: Title: the authors should remove "very"

      Response*: We have now removed the word ‘very’ from the title, and thank the referee for this helpful suggestion. *

      By the way: What are the costs of IH4-RBD for a 96 well plate? Who will produce this reagent? Is the sequence of the IH4 fully disclosed?

      Response*: As specified in our original paper (see Townsend et al. 2021), the plasmid coding for the IH4-RBD is available upon request from Alain Townsend (Oxford, UK). Furthermore, his laboratory funded the production of 1 gram of the IH4-RBD reagent by a commercial company, and professor Townsend has been graciously sending aliquots of 1 mg of this reagent, which suffice for several thousand tests, to all the laboratories that have requested it from him. *

      *In its initial format, HAT only required 100 ng of IH4-RBD per well, corresponding to a cost of 0.0027 £ per well. For the HAT-field protocol, 5 times more reagent is needed, thus bringing the cost of the reagent to 1.5 cts per test, to which one would have to add a similar cost for the IH4-reagent alone. This would thus bring the cost of the two reagents to approximately 3 cts, which is still lower than the price of any of the cheap disposable plasticware necessary for the test (lancet, pipet, plastic tube and portion of a plate). *

      The sequence of the IH4 nanobody is indeed fully disclosed (see figure 1 of Townsend et al. 2021), and has actually been protected by a patent ( US9879090B2 ). Whilst IH4 can be used freely for research purposes, licensing rights would have to be taken into consideration by any health authority wishing to use the technique broadly, or for any commercial distribution.

      The usage of the CR3022 as positive control for neutralizing antibodies should be reconsidered since this antibody does not confer viral neutralization. Other well describe antibodies blocking the ACE2:RBD interface might be better suited.

      Response*: CR3022 was the one that we had at our disposal, but other mAbs can certainly be used instead of as positive controls, and this is actually indicated in the detailed HAT-field protocol provided. Since the use of a positive control is only to ensure that the IH4-RBD has not been degraded and works as well as expected, and that any negative samples are not due to a very rare glycophorin mutation that could prevent IH4 from binding to it at the surface of RBCs, we are not sure why using a mAb with neutralizing activity would necessarily be better than the CR3022 mAb. *

      Figure 2: Please state the concentration of IH4-RBD used. As stated in the figure legends for Figure 2 B, the authors should show the result all 4 replicates (incl. SD)

      Response: The concentration of IH4-RBD was 1 m*g/ml, i.e. the normal concentration for standard HAT tests. This was already indicated in the Methods section, but has now been added to the legend of Figure 2. *

      Whilst 4 experiments were indeed carried out, which all gave similar results, i.e. showed that using PBS-N3 or PBN did not hinder HAT performance, but could instead result in a slight increase in HAT sensitivity, those various experiments were not all exact replicates of the experiment shown on figure 2. Furthermore, performing of those various experiments was spread over a period of over a year, using different reagents, thus precluding numerical comparisons between the various results. We have clarified this issue by rewording the final statement to “Comparable results were obtained in four similar experiments.”

      Figure 3: Although the authors showed stability of IH4-RBD at 2 µg/ml they do not provide data for the stabilities at higher dilutions. As the authors suggest to predistribute the IH4-RBD in plates they should at least discuss this issue.

      We thank the referee for raising this valid point, which has now been discussed in the paragraph entitled “Practical considerations for performing HAT assays” in the Methods section: “One aspect that will have to be considered for the design and use of such individual strips of wells will be to ensure that, upon storage, the various dilutions of IH4-RBD are as stable in such strips as the working stocks of IH4-RBD (2 mg/ml) tested in Figure 3.”

      Figure 6/Supplementary Figure 1 and 3 The presentation of the data is not accurate, as many of the points (samples) are obviously identically positioned in the graph. The authors should choose a different representation of their data. E.g. they could adjust the size of the points to the number of overlapping samples.

      Response: We thank the referee for raising this issue, which was also pointed to by referee #2. This apparent inaccuracy is due to the fact that, on these plots, the scales for both x and Y axes used discrete values, which indeed results in multiple points overlapping on top of one another. This was resolved by adding numbers next to the positions where several dots overlapped

      Wording / text length In the current manuscript the text is very long. Thus, the authors should shorten it to report the essential findings more appropriately. Additionally they should check for correct English wording.

      Response*: We thank the referee for this remark, which helped us realize that the excessive length of the manuscript was mostly due to an extensive discussion of highly technical and practical points. The corresponding paragraphs were indeed out of place in the general discussion, and have not been deleted but have been moved to the Methods section since we feel that they contain very important information for people who would actually start to performing HAT assays. *

      Reviewer #1 (Significance (Required)):

      In summary, the authors describe the HAT-field test as a simple PoC test for the detection of SARS-CoV-2 antibodies in patients. Because of its ease of use and robustness, the test appears to be particularly well suited for use in countries with underdeveloped health care or limited testing facilities, as also reported previously. The value of this manuscript lies mainly in the detailed description of the protocol and its validation. In this context, the adaptations described are certainly useful and helpful from a practical point of view, but do not provide significant new scientific insights. In light of these considerations, we recommend that this work be submitted to an appropriate journal specializing in the publication of such methods

      Expertise The reviewers have established and published different serological assays to monitor immune responses against SARS-CoV-2

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this paper, the authors developed a feasible protocol for an affordable point-of-care serological test for SARS-CoV-2. This method was adapted from the HAT plasma titration test that the authors previously published. Specifically, the test utilizes a 96-well plate pre-coated with the RBD of SARS-CoV-2 spike glycoprotein fusing to a red blood cell targeting nanobody (IH4). By adding microliters amount of the blood or plasma samples to the plate, it allows the detection of antibodies against RBD by measuring the level of hemagglutination. In the current upgraded protocol (so called HAT-field), the authors made major modifications including optimizations of buffer and experimental protocol and the use of pre-titrated IH4-RBD on the plate, which collectively helped to lower the sample consumptions, improved the stability and the sensitivity of detection, and made the test more user-friendly under non-clinical settings.

      Major comments: My major concerns are related to the robustness and quantitative capability of this approach. Specifically: It seems that multiple variables may impact the results. These include volume of droplets, the presence/absence of serum IH4 or BSA cross-reactive antibodies, and the amount (%) of red blood cells which may vary substantially among samples. Could you find a way to normalize the results (e.g., the discrepancy shown in Figure 6) instead of only leaving them as false-positives or false-negative?

      Response*: Regarding the volume of the droplets, in other words, the amount of blood collected and used in an assay, two sentences in the manuscript underline the fact that this is not a critical variable: *

      In the Results section “the precise volume of blood collected is not critical; it may vary by as much as 30% with no detectable influence on the results.”

      In the discussion: “On this subject, we have found that increasing the amount of whole blood per well (in other words using blood that is less dilute) has very little influence over the HAT-field results, and, if anything, adding more blood can sometimes reduce the sensitivity, albeit never by more than 1 dilution.”

      Consequently the % of RBCs in samples seem unlikely to influence the HAT-field scores significantly. This is supported by the fact that, although men tend to have higher hematocrits than women, we have not noticed any detectable difference between men and women in the correlation of the HAT-field scores with those of the Jurkat-S&R-flow test.

      We are not sure that we fully understand what discrepancy shown in Figure 6 the referee is pointing to, but if it is about the increase in the number of samples found to be cross reacting on IH4 alone when the sensitivity increases, in the discussion, we propose to perform tests using titrations of the IH4 nanobody alone simultaneously to using the IH4-RBD reagent, so as to minimize the number of samples that would be identified as false positives if only one concentration of IH4 alone was used as negative control. Comparing the titers obtained with IH4-RBD and IH4 alone will then provide some level of normalization for the samples cross reacting on IH4. As for the hypothetical presence of antibodies cross reacting on BSA alluded to by the referee, since such antibodies would not bind to RBCs, we do not think they would affect the HAT results.

      Second, the score of the HAT-field ranges from 0 - 8. However, based on the current manuscript, it is not clear how the scoring and scaling works. How is the noise (non-specific antibody signal) defined here?

      Response: We have now introduced a specific paragraph and a new figure detailing how to score HAT assays in the Methods section.

      In addition, it is unclear how to translate the HAT-field score into a meaningful measure of protection by serum antibodies.

      Response*: Documenting the correlation between HAT-field scores and levels of protection against SARS-CoV-2 infections and/or Covid-19 severity would indeed be extremely interesting. This would, however, require setting up a large scale clinical trial carried out over several months. This type of work could only be carried out by a large consortium including clinicians or even preferably a national health agency. This was, however, far beyond the reach of this initial project, which was based on the work of a single person on a shoestring budget. *

      Can you provide more evidence to demonstrate that the test is quantitative? For example, performing additional orthogonal experiments to better validate the scoring and generate a correlation function?

      Response*: Inasmuch as it would have been very interesting to perform additional serological tests from commercial sources on the samples of our cohort, such tests are all very expensive (e.g. ca. 500 € for one ELISA plate). This was in fact the main reason for developing the Jurkat-S&R-flow test in the first place, since it is much cheaper, more modular, and at least as sensitive as ELISA (see Maurel Ribes et al. 2021). The funds for this whole project came from a single 15 k€ grant obtained from the ANR, and we simply did not have access to the funds, or to the human resources to carry out such experiments based on commercial serological tests. *

      Minor comments: Figure 6: are all results included? To me, it does not seem that all 60 samples data were included in the plot.

      Response: We thank the referee for raising this issue, which was also pointed to by referee #1. This apparent inaccuracy is due to the fact that the scales for both x and Y axes used discrete values, which results in multiple points overlapping on top of one another. This was resolved by adding numbers next to the positions where several dots overlapped.

      There are several redundant statements in the discussion and results section. Please make the text more concise.

      Response: The discussion has now been shortened considerably, mostly by moving the paragraphs pertaining to technical considerations to the Methods section.

      Reviewer #2 (Significance (Required)):

      The current paper is built upon the improvement of previous published work. In addition, there are similar approaches that have been published. It was unclear if the current method is superior to other works.

      Response: Whilst we have made no statement regarding whether the method we describe is superior to other methods, we are pretty confident that very few alternatives will be as frugal and simple as the HAT-field protocol described here. As alluded to in the final paragraph of the discussion, two recent reports have described that HAT could be performed on cards rather than in V-shaped wells, with semi-quantitative results being obtained in minutes. If such card-based approaches turn out to provide sensitivity and reliability comparable to those of the HAT-field protocol, they will certainly represent very interesting alternatives. As stated in our manuscript, we would be very interested if the comparative evaluation of the two approaches could be carried out by one or several independent third party.

      My research involves the development of antiviral antibody therapeutics. This method may be used as a point-of-care tool for the measurement of serologic response to RBD in less developed countries. However, due to the high vaccination rate and large infected populations, the overall needs for such detection drastically decrease. The significance of the work and utilities of the test may expand with more experiments related to the variants.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      This paper describes a low-cost robust and quantitative serological test based on haemgglutination, which could be used in resource limited settings for evaluating population-based and vaccine induced immunity. Neutralising antibodies to the receptor binding domain (RBD) on the SARS CoV-2 spike protein are an immunological correlate of protection. The HAT has a single reagent the RBD domain of SARS CoV-2 linked to a monomeric anti-erythrocyte single domain nanobody. When human polyclonal serum antibodies bind to the RBD they cross-link and agglutinate human red blood cells, resulting in haemagglutination which can be read visually.

      This paper thoroughly evaluate the stability of the HAT reagents used to measure human and monoclonal antibodies examining the robustness of the HAT reagent. It provides a comprehensive protocol for conducting field based HAT with limited reagents. The test can evaluate is subjects have been infected using a simple finger prick to detect RBD specific antibodies. The field HAT can also be used to define people that can be susceptible to reinfection or in need of vaccination, With the use of RBDs from the variants of concern the test can be rapidly adapted to evaluate antibodies as new variants arise to evaluate surrogate correlates of protection to allow timely evaluation of vaccine effectiveness and predict the need for vaccine booster doses. The data are very comprehensively presented with good figures demonstrating the most appropriate buffer to store the IH4-RBD reagent and the robustness of the HAT over time at different temperatures. No additional experiments are needed and suitable numbers of replicates are included. All data, methods and reagents are comprehensively described.

      Minor comments: The paper is well written but rather long in places and may have benefited from being more succinct.

      Response: The excessive length of the manuscript was mostly due to an extensive discussion of highly technical and practical points. The discussion has now been shortened considerably, mostly by moving the paragraphs pertaining to technical considerations to the Methods section.

      Panels in figures could be labelled as A, B, C etc to help in identifying the correct panel..

      Response: We thank the referee for this helpful suggestion, which we have followed.

      I would avoid the use of experiment and project and refer to next we confirmed... or in this paper or our results show Please make sure all abbreviation are defined upon first use. Perhaps include early in the paper that most of the work was conducted with the Wuhan RBD

      Response: We thank the referee for these helpful suggestions, which we have followed to the best of our abilities. The abstract now contains a mention of the fact the work on optimizing the protocol was carried out with the IH4-RBD carrying the Wuhan version.

      Figure 2: I would suggest placing either a solid line between the two halves of the plates to make it easier for the reader to differentiate between the two antibodies. It also would have been easier to read if the bottom PBS, PBS-N3 and PBN were at 45 degree angle. In B include the serum name (e.g. serum 197).

      Response: We thank the referee for these helpful suggestions, which we have followed.

      Legend to figure 4: please include the serum numbers after covid-19 patients. Perhaps include arrows to demonstrate the dilutions of serum and IH4-RBD in the figure.

      Page 6 it might be easiest to use the same times as in figure 6 and use for example more than one year in the discussion

      Response: We thank the referee for these helpful suggestions, which we have all followed.

      Legend figure 6 perhaps replace dots with circles page 10 include the R values from figure 6 in the description of results.

      Response: We are grateful to the referee for these helpful suggestions, but have not followed them since we do not feel that these changes would be real improvements.

      Page 12 of note perhaps this can be moved to the methods ?

      Response: This, and several other paragraphs of the Discussion, have now been moved to the Methods section.

      Supplementary figure 2 A can be seen, is something missing here?

      Response: An s was indeed missing : “A can be seen” corrected to “As can be seen “

      *

      Reviewer #3 (Significance (Required)):

      This paper describes a simple rapid field test for evaluating antibodies to the receptor binding domain of the spike of SARS CoV-2 using the Wuhan and delta variant. Whilst high income countries can provide booster doses and extensive testing (either lateral flow or RT-PCR based) and contact racing to control the waves of the pandemic, low income countries have had limited access to Covid vaccine and the extent of previous waves of the pandemic in the populations are unknown.

      This paper describes a robust and simple test for investigating human antibodies to SARS-CoV-2 which could be performed in resource limited settings providing a very useful tool for monitoring infection in the community and potentially for prioritising this scarce COVID-19 vaccines available.

      This study builds upon the work conducted on the HAT and has extensively studied and optimised the test so that it could be used globally. This paper provides a comprehensive protocol and has simplified the test to ensure it could be used in LMICs.

      This paper would be of great interest to a wide scientific audience who are interested in a rapid low-cost test to evaluate population based and vaccine induced immunity.

      Reviewer: serological assays for use in virology and vaccinology. Suitable competence to review the whole paper *

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This paper describes a low-cost robust and quantitative serological test based on haemgglutination, which could be used in resource limited settings for evaluating population-based and vaccine induced immunity. Neutralising antibodies to the receptor binding domain (RBD) on the SARS CoV-2 spike protein are an immunological correlate of protection. The HAT has a single reagent the RBD domain of SARS CoV-2 linked to a monomeric anti-erythrocyte single domain nanobody. When human polyclonal serum antibodies bind to the RBD they cross-link and agglutinate human red blood cells, resulting in haemagglutination which can be read visually.

      This paper thoroughly evaluate the stability of the HAT reagents used to measure human and monoclonal antibodies examining the robustness of the HAT reagent. It provides a comprehensive protocol for conducting field based HAT with limited reagents. The test can evaluate is subjects have been infected using a simple finger prick to detect RBD specific antibodies. The field HAT can also be used to define people that can be susceptible to reinfection or in need of vaccination, With the use of RBDsfrom the variants of concern the test can be rapidly adapted to evaluate antibodies as new variants arise to evaluate surrogate correlates of protection to allow timely evaluation of vaccine effectiveness and predict the need for vaccine booster doses. The data are very comprehensively presented with good figures demonstrating the most appropriate buffer to store the IH4-RBD reagent and the robustness of the HAT over time at different temperatures. No additional experiments are needed and suitable numbers of replicates are included. All data, methods and reagents are comprehensively described.

      Minor comments:

      The paper is well written but rather long in places and may have benefited from being more succinct.

      Panels in figures could be labelled as A, B, C etc to help in identifying the correct panel..

      I would avoid the use of experiment and project and refer to next we confirmed... or in this paper or our results show

      Please make sure all abbreviation are defined upon first use.

      Perhaps include early in the paper that most of the work was conducted with the Wuhan RBD

      Figure 2: I would suggest placing either a solid line between the two halves of the plates to make it easier for the reader to differentiate between the two antibodies. It also would have been easier to read if the bottom PBS, PBS-N3 and PBN were at 45 degree angle. In B include the serum name (e.g. serum 197).

      Legend to figure 4: please include the serum numbers after covid-19 patients. Perhaps include arrows to demonstrate the dilutions of serum and IH4-RBD in the figure.

      Page 6 it might be easiest to use the same times as in figure 6 and use for example more than one year in the discussion Legend figure 6 perhaps replace dots with circles page 10 include the R values from figure 6 in the description of results.

      Page 12 of note preps this can be moved to the methods Supplementary figure 2 A can be seen, is something missing here?

      Significance

      This paper describes a simple rapid field test for evaluating antibodies to the receptor binding domain of the spike of SARS CoV-2 using the Wuhan and delta variant. Whilst high income countries can provide booster doses and extensive testing (either lateral flow or RT-PCR based) and contact racing to control the waves of the pandemic, low income countries have had limited access to Covid vaccine and the extent of previous waves of the pandemic in the populations are unknown.

      This paper describes a robust and simple test for investigating human antibodies to SARS-CoV-2 which could be performed in resource limited settings providing a very useful tool for monitoring infection in the community and potentially for prioritising this scarce COVID-19 vaccines available.

      This study builds upon the work conducted on the HAT and has extensively studied and optimised the test so that it could be used globally. This paper provides a comprehensive protocol and has simplified the test to ensure it could be used in LMICs.

      This paper would be of great interest to a wide scientific audience who are interested in a rapid low-cost test to evaluate population based and vaccine induced immunity.

      Reviewer: serological assays for use in virology and vaccinology. Suitable competence to review the whole paper

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, the authors developed a feasible protocol for an affordable point-of-care serological test for SARS-CoV-2. This method was adapted from the HAT plasma titration test that the authors previously published. Specifically, the test utilizes a 96-well plate pre-coated with the RBD of SARS-CoV-2 spike glycoprotein fusing to a red blood cell targeting nanobody (IH4). By adding microliters amount of the blood or plasma samples to the plate, it allows the detection of antibodies against RBD by measuring the level of hemagglutination. In the current upgraded protocol (so called HAT-field), the authors made major modifications including optimizations of buffer and experimental protocol and the use of pre-titrated IH4-RBD on the plate, which collectively helped to lower the sample consumptions, improved the stability and the sensitivity of detection, and made the test more user-friendly under non-clinical settings.

      Major comments:

      My major concerns are related to the robustness and quantitative capability of this approach.

      Specifically:

      It seems that multiple variables may impact the results. These include volume of droplets, the presence/absence of serum IH4 or BSA cross-reactive antibodies, and the amount (%) of red blood cells which may vary substantially among samples. Could you find a way to normalize the results (e.g., the discrepancy shown in Figure 6) instead of only leaving them as false-positives or false-negative? Second, the score of the HAT-field ranges from 0 - 8. However, based on the current manuscript, it is not clear how the scoring and scaling works. How is the noise (non-specific antibody singal) defined here? In addition, it is unclear how to translate the HAT-field score into a meaningful measure of protection by serum antibodies. Can you provide more evidence to demonstrate that the test is quantitative? For example, performing additional orthogonal experiments to better validate the scoring and generate a correlation function?

      Minor comments:

      Figure 6: are all results included? To me, it does not seem that all 60 samples data were included in the plot.

      There are several redundant statements in the discussion and results section. Please make the text more concise.

      Significance

      The current paper is built upon the improvement of previous published work. In addition, there are similar approaches that have been published. It was unclear if the current method is superior to other works. My research involves the development of antiviral antibody therapeutics. This method may be used as a point-of-care tool for the measurement of serologic response to RBD in less developed countries. However, due to the high vaccination rate and large infected populations, the overall needs for such detection drastically decrease. The significance of the work and utilities of the test may expand with more experiments related to the variants.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the publication HAT-field: a very cheap, robust and quantitative point-of-care serological test for Covid-19 by Joly and Ribes the authors describe an adaption and an improved protocol to their previously published haemagglutination based test to detect antibodies to SARS-CoV-2 in patient blood (Towsend et al., 2021). In detail, they analyzed the effect of several adaptions including buffer optimization, plate coating, usage of patient whole blood instead of washed RBCs and plasma. Additionally they tested different temperatures and stability of the reagents, namely the nanobody-RBD construct IH4-RBD. For validation they compared their optimized HAT-field assay with Jurkat-S&R as a FACS-based assay.

      Major comments:

      Introduction: This section is rather short and could benefit from a broader overview of currently established methods and assays to detect appropriate immune responses against SARS-CoV-2. The author are advised to summarize the current literature in the field more comprehensively and not only focus on their own work.

      Cross-reactivity with IH4-RBD. In Figure 6, the authors highlight the samples in red and orange that showed cross-reactivity with IH4-RBD. In their discussion, however, the authors state that only 2 of 60 (3%) were cross-reactive. In making this statement, they ignore the proportion of cross-reactive samples that were also positive in the Jurkat S&R assay. Therefore, the authors should acknowledge in the discussion that the actual number of cross-reactive samples was higher.

      Quantitative Assay. Since the HAT assay does not allow determination of the absolute number of antibodies reactive to SARS-CoV-2 in the blood samples, the authors should refrain from claiming that the HAT-field is a quantitative assay.

      Morphological read out For field application, the morphological description of the observed deposits ("teardrop" vs. "button") could be problematic and might lead to bias depending on the user. Thus, the authors should provide a clearer description for phenotype classification.

      Minor comments:

      Title: the authors should remove "very" By the way: What are the costs of IH4-RBD for a 96 well plate? Who will produce this reagent? Is the sequence of the IH4 fully disclosed?

      The usage of the CR3022 as positive control for neutralizing antibodies should be reconsidered since this antibody does not confer viral neutralization. Other well describe antibodies blocking the ACE2:RBD interface might be better suited.

      Figure 2: Please state the concentration of IH4-RBD used. As stated in the figure legends for Figure 2 B, the authors should show the result all 4 replicates (incl. SD)

      Figure 3: Although the authors showed stability of IH4-RBD at 2 µg/ml they do not provide data for the stabilities at higher dilutions. As the authors suggest to predistribute the IH4-RBD in plates they should at discuss this issue.

      Figure 6/Supplementary Figure 1 and 3 The presentation of the data is not accurate, as many of the points (samples) are obviously identically positioned in the graph. The authors should choose a different representation of their data. E.g. they could adjust the size of the points to the number of overlapping samples.

      Wording / text length In the current manuscript the text is very long. Thus, the authors should shorten it to report the essential findings more appropriately. Additionally they should check for correct English wording.

      Significance

      In summary, the authors describe the HAT-field test as a simple PoC test for the detection of SARS-CoV-2 antibodies in patients. Because of its ease of use and robustness, the test appears to be particularly well suited for use in countries with underdeveloped health care or limited testing facilities, as also reported previously. The value of this manuscript lies mainly in the detailed description of the protocol and its validation. In this context, the adaptations described are certainly useful and helpful from a practical point of view, but do not provide significant new scientific insights. In light of these considerations, we recommend that this work be submitted to an appropriate journal specializing in the publication of such methods

      Expertise The reviewers have established and published different serological assays to monitor immune responses against SARS-CoV-2

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Estrach and colleagues seek to identify the ECM components that are key to regulating hair follicle stem cell (HFSC) activation using the highly-characterized mouse hair follicle as a model. They first use a targeted approach to examine key ECM components expressed by HFSC and find that Fibronectin (FN) is highly expressed. Further, wholemount analysis of the hair follicle reveals a meshwork of FN enveloping the hair follicle. They hypothesize that FN is a fundamental regulator of hair follicle (HF) cycling and then proceed to carry out longterm studies required to examine hair follicle cycling and knockout FN with two different HFSC Cre lines (Lrig1 and Krt19), as well as integrin coreceptor SLC3A2. They clearly show that absence of Fibronectin (FN) and SLC3A2 is detrimental to hair follicle stem cell activation and cycling (FN) and hair follicle identity (SLC3A2).

      Overall comments:

      The authors use the tail hair follicles as a model similarly to the highly-characterized, synchronous back skin hair follicles. However, the tail hair follicles are asynchronous (Braun et al. 2003, PMID: 12954714), thus reporting the age of the mouse from which the tail whole mounts came from is not sufficient to claim a HF cycle disorder - HF should be imaged in an unbiased manner and subsequently quantified for phase. The manuscript would greatly benefit from including more information in the figure legends, such as age of mice, number of mice and HF quantified, as well as what the error bars represent. Further, in samples where many HF were counted per mouse, these should be averaged and then the average per mouse displayed; super plots would be great to use here.

      Major comments:

      1. In Figure 1, the use of tail whole mount images indeed provides striking display of the fibronectin meshwork that envelops the hair follicle. However, addition of a marker of the regenerative phase (e.g. proliferation) and resting phase would provide more convincing evidence that this is the particular phase of the hair cycle that you have captured, especially given my overall comment regarding the asynchronous nature of the tail HF cycle.
      2. The authors show that FN is expressed in early-mid anagen and conclude that FN is a regenerative signal. This claim should be substantiated with FN staining on more time points across the HF cycle to substantiate the argument that it is a regeneration-specific signal, found only in the telogen-anagen transition.
      3. Lrig1-cre and K19-cre-mediated FN knockout result in HF that are thinner at D158 - this is not immediately apparent from histological sections. Can you use your thick sections to give better perspective?
      4. The authors measure the width of the infundibulum from lightsheet microscope images. It is a bit difficult to position whole tissues using this technique, and the images that are shown are not from the same perspective, and thus measurement of the width is not accurate from these images. I suggest either removing this analysis or using more comparable images. Further, if this is a true phenotype, can you speculate on what the thickened infundibulum might mean?
      5. The authors then show mislocalization of Lrig1+ cells to the infundibulum in absence of FN. Are other stem markers localized to the infundibulum or outside of the bulge? Further, what might the mislocalization of Lrig1+ cells might mean?
      6. Please explain your conclusion after Figure 3i and at the end of the manuscript that states that FN is required for stem cell anchorage. I think that a very plausible explanation is that FN is required for stem cell function and identity, but anchorage of the SC lacks sufficient evidence. Further, your only evidence to support the anchoring theory only comes from expression of Lrig1 in FN knockout and no other markers. Are they also mislocalized? Please either tone down this conclusion on SC anchorage or provide stainings for more SC markers to show mislocalization in absence of FN.
      7. In Figure 3l-o, you examine proliferation on the control vs the conditional deletion of FN in D30 and D158. However, in D30, these tissues are not at all directly comparable since one is obviously in anagen and the knockout in telogen. You must compare the anagen knockout sample, although this occurs a bit later than the control. Further, how was the infundibulum distinguished from the bulb in these control images?
      8. In Figure 3P, you carry out RT-qPCR on whole skin to detect HFSC markers. This should have been carried out on sorted epithelial cells as isolation of whole back skin introduces bias to the system in that the number of stem cells may artificially look different in skin that is in anagen vs skin that is in telogen as the anagen skin has a different proportion of SC to progenitor cells to dermal cells. This concern is also similar to point 9 - the control and FN knockout at D30 are not comparable given that they are in different phases of the hair cycle.
      9. Figure 4a these images need to be of the whole mouse - it is not possible to determine what we are looking at or where - there is not even a scale bar.
      10. After Figure 4, you argue that because fibronectin expression resolves from healing dermis is the reason that hair follicles do not form, and site Dekonick and Blanpain (PMID: 30602767) - however this review makes no mention of the dynamics of fibronectin in wound healing. Further, evidence from Driskell et al (2013, PMID: 30602767) would suggest that it is the fibroblast population that responds to the wound that determines whether HF regenerate. And further, very large wounds do regenerate HF (Ito et al PMID: 30602767). In addition, this would all be fibroblast-derived FN, as opposed to the current study which examines keratinocyte-derived FN. Please reconsider this argument.
      11. The authors knockout SLC3A2, an integrin coreceptor that is localized to the plasma membrane. They show a very similar, yet more severe phenotype to the Lrig1- and K19- mediated knockout of FN. Given the bidirectional communication that SLC3A2 is responsible for, can you reconcile whether the defects in the HF cycle and the HFSC are a result of outside-in or inside-out signaling? Further, is it possible that integrin function regulated by SLC3A2 is necessary for more than FN assembly? This could be especially relevant given that your targeted screen also identified Col17A1, which is well known to be required for HFSC function (Matsumura et al., PMID: 26912707)
      12. It is intriguing that in the absence of HFSC-derived SLC3A2 that no FN network forms. Is FN expressed or is the assembly perturbed in the absence of properly functioning integrins? The authors conclude that the signaling cascade flows from fibronectin to integrin to SLC3A2, but do not test where the FN phenotype arises in the SLC3A2 knockout - is it due to aberrant assembly of the FN meshwork or a change in transcriptional or translational levels?
      13. In the grafting assay in Supplemental Figure 3, keratinocytes undergo a de novo hair follicle morphogenesis - is Lrig1 expression maintained in order to carry out cre-mediated deletion? Further, the fibroblasts in this assay may adopt a wound-like phenotype, expressing FN, which you earlier claim to be required for hair follicle production in wounds. Yet in the absence of epithelial FN, no HF form. Can the authors reconcile this?

      Minor comments:

      1. In Figure 1a, the two populations are Lgr5+ and basal; please define what the basal population is in this experiment.
      2. Significative is not a word.
      3. In Figure 4 figure legend, there is reference to a grafting experiment but no experiment shown.
      4. The authors delete FN in Lrig1+ or K19+ cells starting D19 and harvest at D30, and conclude that the hair follicles do not enter anagen after the second telogen, can you please include the data supporting the statement that mutant HF did not reenter the hair cycle after D65.

      Significance

      The authors show for the first time that fibronectin is expressed during cutaneous homeostasis and that it is required for normal function of the hair follicle stem cells. This is significant conceptual advance for the field of skin biology because fibronectin is thought to only be present in wounds: derived first from infiltrating serum and second from fibroblasts to act as provisional dermal ECM to support epithelialization during wound-response, which is ultimately resolved upon the conclusion of wound healing (reviewed in: Singer and Clark, PMID: 10471461). Further, FN has also been characterized as an EMT marker during cancerous progression (Lamouille et al, PMID: 24556840). Estrach and colleagues show that fibronectin is actually expressed by hair follicle stem cell keratinocytes and then is assembled into a meshwork that envelops the hair follicle and is in fact necessary for hair follicle stem cell homeostasis. This work would be broadly interesting to the field of stem cell biology as well as those working on extra cellular matrix signaling. My field is epithelial stem cells and more specifically hair follicle development and cycling.

      Referee Cross-commenting

      I have no disagreement with any of the points raised by the other reviewers. In fact, we seem to agree on the majority of the concerns. This includes the use of the tail wholemount model, the use of Lrig1-cre, selection of timepoint vs phase of the hair cycle, the appropriateness of the link between Fibronectin and SLC3A2, and further significant issues related to display of data and their reproducibility. Further, all of the major comments raised need to be addressed in order to properly evaluate the conclusions that the authors make. In my opinion, none of the comments raised here are unreasonable.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this manuscript, Estrach et al., investigated whether extracellular matrix component fibronectin function in hair follicle regeneration, using a range of approaches including FACS, RT-qPCR, immunofluorescent staining, and mouse genetics. They proposed that fibronectin in Lrig1+ cells was necessary for hair follicle stem cell maintenance and activation, and the fibronectin expression and assembly relayed on the integrin co-receptor SLC3A2.

      Significance

      Major points:

      1. In Figure 1a, the author used Lrig1+GFP and a6 to isolate Lrig1+ cells in the infundibulum junction zone above the sebaceous gland at Day 28. However, in Figure 1f-h, the GFP expression was not only in infundibulum above SG, but also in some inner root sheath cells. Since the Lrig1+ cells do not include the hair follicle stem cells (CD34+ bulge cells), result in Figure 1a does not support fibronectin expression in HFSCs.
      2. In Figure 1b-e, the author detected fibronectin expression by IF staining with tail skin whole mount and back skin section. The fibronectin is mainly detected in differentiated cells in the inner root sheath (IRS) in anagen (Figure 1b and 1d), upper IRS in catagen (Figure 1c), hair germ in telogen (Figure 1e), but not in the bulge region (Figure 2i-l). Again, these results do not support fibronectin enrichment in HFSCs either.
      3. In Figure 2a-c, the author knocked out fibronectin in Lrig1+ cells with Lrig1-CreERT2, FN fl/fl mice, and then validated the knockout efficiency by IF staining. However, result shows the fibronectin expression was not only depleted in the GFP+ Lrig1+ cells, but also depleted in GFP- inner root sheath and matrix. Similarly in Figure 4n, fibronectin was only knocked out in Lrig1+ cells, however, result showed the fibronectin cannot be detected in any cell types in skin. The author should explain why fibronectin depletion in Lrig1+ cell lead to completely fibronectin depletion.
      4. In Figure 2r, by Flow cytometry, the author found a significative reduction of a6+CD34+ SC population when fibronectin is conditional knocked out in Lrig1+ cells. As the Lrig1+ cells and a6+CD34+ HFSCs are two distinct cell populations, the author needs to explain how fibronectin depletion in Lrig1+ cells affect the number and activation of HFSCs population.
      5. In Figure 3b-c and 2g-h, the author reported the HF thinning in Lrig1-CreERT cKO mice by back skin HE staining. However, by tail skin wholemount staining, the HF thinning was not observed in those cKO mice (Figure 3f-g; Figure 2k-l). The author needs to explain the discrepency. In addition to this, the low quality of HE staining and poor orientation of HF (Figure 2h and 3b), coupled with lack of quantification, made these results and conclusion unconvincing.

      Mini Points:

      1. Color information in each IF staining panel is only completely presented in Figure 4. It is incomplete or lost in other 4 Figures.
      2. In Figure 1a, FACS profile and representative figures are needed to demonstrate the author isolated the correct population at correct hair follicle stage.
    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary

      In the manuscript, Estrach et al addressed the role of skin epithelial stem cell-derived extracellular matrix in hair follicle regeneration. They found that Lrig1+ epithelial stem cells highly express fibronectin gene compared to other basal epithelial cells. Conditional deletion of fibronectin gene using Lrig1CreERt2-GFP or K19CreER (the latter is expressed in the bulge stem cells) resulted in hair follicle regeneration blockade, change in the expression pattern of Lrig1-GFP. Injection of fibronectin protein into the dermis of the conditional fibronectin mutants (Lrig1CreERt2-GFP) rescued the hair regeneration blockade phenotype. The authors also conditionally deleted SLC3A2, an integrin coreceptor, using the same CreER lines and found a decrease in fibronectin deposition and CD34+ bulge stem cell number. With the results from these mouse genetics and phenotype analysis, they conclude that fibronectin-SLC3A2 cascade finely tunes hair follicle stem cell fate and their tissue regenerative capacity.

      Major comments

      1 Immunostaining results of fibronectin in the tail epidermal wholemounts are not convincing enough and would require improvements. First, the tail epidermal wholemounts lack the mesenchymal matrix and the basement membrane (Fig. 1b, c, f-h; Fig. 2b, c; Fig. 5d-j; Fig. S1b, c) (Braun et al., 2003 Development (PMID: 12954714)). Fibronectin is localized mainly in the mesenchymal matrix and the basement membrane in the skin and other organs (Stenman and Vaheri, 1978 JEM (PMID: 650151); Couchman et al., 1979 Archives of Dermatological Research (PMID: 393184); Jahoda et al., 1992 J. Anat (PMID: 1294570)), thus this sample preparation method is not appropriate to assess fibronectin tissue distribution. The authors use thick back skin sections, which contain entire skin tissues, thus I would recommend this method. Furthermore, fibronectin antibody signals in the tail epidermal wholemounts are detected in the inner part of the hair follicle epithelium, where there is no expected ECM structure (see Couchman et al. and Jahoda et al. above). Consistently, fibronectin signals are localized inside the Lrig1-GFP+ epithelial basal cells (Fig. 1f-h). Thus the specificity of the fibronectin staining needs to be confirmed. The reviewer understands that the authors provide an image showing the great reduction of fibronectin staining in a D30 tail epidermal wholemount of 4-OHT-treated Lrig1CreERt2GFP,FNfl/fl mice (Fig. 2c). However, as the D65 tail epidermal wholemount from wildtype mice also show many hair follicles without fibronectin signals (Fig. 1c), rigorous assessments would be required.

      2 Lrig1+ stem cells have been reported to maintain the upper pilosebaceuos unit, containing the infundibulum and sebaceous gland, but contribute to neither the hair follicle nor the interfollicular epidermis under normal homeostatic condition (Page et al., 2013 Cell Stem Cell (PMID: 23954751)). However, only 11 days after the first 4-OHT treatment on Lrig1CreERt2GFP;FNfl/fl mice, Estrach et al found the defects in hair cycle blockade, reduced cell proliferation in the hair bulb, and significant reduction in fibronectin deposition in entire hair follicle structure. Please explain how the deletion of fibronectin gene in Lrig1+ stem cells, which do not contribute to hair follicle lineages, lead to significant hair regeneration defects in a short period of time. Current data do not well explain a causal relationship between the genetic perturbation and the observed phenotypes.

      3 In some experiments (listed below), description about the methods, replication and statistics is not adequate, raising concerns about reproducibility. 3.1 Fig. 1a: data variation for basal cells should be presented. Biological replicate number should also be indicated in the figure legend. 3.2 Fig. 2g, h: hair follicle thinning is described here, but only one HE staining image with only one hair follicle is not enough to support this important claim. 3.3 Fig. 2r, 3i: flow cytometric data should be presented. 3.4 Fig. 4: No biological replicate and reproducibility information are provided. 3.5 Fig. 5j: how many biological replicates and hair follicles were analysed? The authors should also perform statistical tests. 3.6 Fig. S3g, h: information for biological replicates should be described. Statistical tests should be applied to Fig. S3h. 3.7 Fig. 5k-n: only one HE staining panel from each mouse line cannot provide rigorous evidence of the defects, which are not obvious from the HE staining.

      4 In Fig. 3j, k, n, hair follicles in the control and 4-OHT treated skin are in different hair cycle phases. Therefore there is a possibility that the difference in their PCNA pattern simply reflects the difference in the cell proliferative activity between different hair cycle phases, but not indicates direct effects from the deletion of fibronectin gene in Lrig1+ cells.

      5 To assess the expression levels of signaling-related genes (Fig. 3p, S2), the authors used mRNA extracted from whole skin tissues, which contain all epithelial and mesenchymal cell populations in different hair cycle phases. Thus, the time and spatial resolution of the analysis is low and it also cannot eliminate confounding factors derived from the difference in hair cycle phases between control and cKO.

      6 In order to provide the characteristics and purity of the FACS isolated cell populations at D28 (Fig. 1a), their flow cytometry data and some marker gene expression data should be presented (see Page et al., 2013 Cell Stem Cell). This assessment is particularly important for the skin compared to other static organs, as it exhibits dynamic gene expression and tissue structural changes during the hair cycle. It is also important to check whether fibronectin protein accumulates around Lrig1+ stem cells in D28 dorsal skin, where upregulation of fibronectin gene expression was detected. The authors should not use tail epidermal wholemounts for the reason described above.

      Minor comments

      7 The increase in stem cell marker expressions shown in Fig. 3p contradicts to the reduction in the number of bulge stem cells shown in Fig. 2r and 3i. Please provide an explanation for this apparent discrepancy.

      8 Although Fig. 5s-v show reduction of a6+CD34+ bulge cell population, the bulge tissue structure can be observed in Fig. 5p. Please explain how to interpret this apparent discrepancy. They just lost the expression of CD34?

      9 Connectivity of the data in the fibronectin cKO with that of SLC3A2 cKO is weak. For example, it could be strengthened if the authors show colocalization of fibronectin and SLC3A2 in vivo.

      10 Although the format of the manuscript is free in Review Commons, the Introduction and Discussion of this manuscript are too brief for us to understand the background and significance of this study. So I would recommend the authors to provide more detailed background information and discussion.

      11 The authors use the term 'HFSC', but it is unclear which stem cell populations they mention; bulge, Lrig1+ or other stem cell populations?

      12 Please provide details of fibronectin protein and antibody used in this study.

      13 Due to the short for experimental information for Fig S3a, b, d, e, I cannot evaluate the data, thus several questions are raised. As the SLC3A2 level was significantly reduced in most cells in the plot, I assumed that Lrig1-GFP+ cells were gated before examining the expression level of SLC3A2. However, no information on the procedures for 4OHT treatment, isolation of cells and flow gating strategy is described. In the case of K19CreER mice (Fig. S3d, e), if the authors gated GFP+ cells before analysis, what GFP means in this case?

      14 The manuscript wants to be checked for copyediting.

      Significance

      These findings might provide a conceptual advance into the role of epithelial stem cell-derived extracellular matrix in regulating stem cell behaviour and tissue regeneration. As fibronectin is upregulated in development, wound healing and cancers in many other organs, their findings may point to the importance of fibronectin in activating tissue progenitors and stem cells in these processes. Thus, this manuscript is likely to be of interest to a wide range of readers, not only in skin biology, but also in stem cell, regenerative and matrix biology. The contributions of this paper could be enhanced if the documentation were to be made stronger and more rigorous in a revised manuscript.

      Referee Cross-commenting

      I totally agree with Reviewer #3's comments in this consultation session. I have no disagreement with any of the points raised by other reviewers.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements [optional]

      This section is optional. Insert here any general statements you wish to make about the goal of the study or about the reviews.

      We are grateful to the reviewers for their honest opinion regarding this work and plan to address the majority of the comments in a revised version either through new analysis or revision to the text, as we believe these will improve the manuscript by making some of the details clearer. There were few suggestions that will lead to substantiative changes to the findings. Here, we address the most salient critiques, the primary one being related to novelty.

      We respectfully disagree, as our detailed analysis of the DNA methylome in Octopus bimaculoides represents a significant advance to understanding how the epigenome is patterned in non-model invertebrates in general, and cephalopods in particular. We acknowledge that the previous report that the octopus methylome resembles the few other invertebrates where low DNA methylation has been found, the finding was part of a multi-organism study last year (de Mendoza et al., 2021), which lacked any detailed investigation. Our study provides the first in depth analysis on methylation patterning, the relationship with transposons and gene expression, and reports the finding of other key epigenetic marks in O. bimaculoides, and in other cephalopods.

      In short, we believe our study to be highly novel and that it represent the first analysis of this kind in cephalopods and one of the few existing in non-model invertebrate organisms. In addition, we identify the conservation of the histone code in cephalopods. While this may be expected, this is the first experimental evidence in this class and represents an important step forward to understand the epigenetic regulation of genes and transposons in invertebrates. Finally, we plan to provide an updated transcriptome annotation for O. bimaculoides that will be available for the scientific community as a new valuable resource. We believe these features will make this study highly cited.

      We believe that findings like ours will complement several recent studies that extend the epigenetics field out of the current narrow focus on model organisms to understand how epigenetic mechanisms function in diverse animals. This provides new insights regarding the epigenetic mechanism of gene regulation in an emerging invertebrate model.

      2. Description of the planned revisions

      Insert here a point-by-point reply that explains what revisions, additional experimentations and analyses are planned to address the points raised by the referees.

      Reviewer 1 raised the following points that we are planning to address:

      *- It is unclear why the authors did not use the original gene models of O. bimaculoides or tried to improve them. By only relying on adult tissue (but the relatively late hatchling stage), they would have omitted most developmentally expressed genes, that are incidentally also the ones that are subjected to extensive spatiotemporal gene regulation (which is also a problem to assess the role of methylation). I think more comparisons with existing gene models and how the newly generated stringtie models should be provided. *

      We agree that using as many tissues and developmental stages as possible will expand the octopus transcriptome.

      We plan to:

      • Add RNA-seq data from stage 15 embryos to improve this.
      • Compare the gene model used in the original version of the manuscript (Stringtie model to use in Trinotate for improving the annotation of the genes) to the existing annotation model and report on which has superior performance for annotating the * bimaculoides* transcriptome.
      • Extend the annotation of the transcriptome which we undertook in a focused fashion in the first iteration of this manuscript. Reviewer 2 raised the following points that we are planning to address:

      *- It is not exactly clear to me why the authors look for expression clusters in the first part of the manuscript? This information, while interesting, does not seem to be used in the methylation analysis. It is also somewhat contradictory because the authors first claim that, based on their GO-term enrichment analysis, that different expression clusters are associated with "complex regulatory mechanisms, potentially based in the epigenome". Yet at the end they conclude that, due to the global and tissue-overarching nature of methylation, this "argues against this epigenetic modification as a player in the dynamic regulation of gene expression". *

      We thank the reviewer for pointing out this issue and we plan to clarify the point through changing the text and additional analysis. Since we found that the methylation pattern was stable across tissues, and that it corresponded to gene expression levels regardless of tissues, we concluded that the methylation pattern is not likely relevant for the tissue-specific gene expression pattern reported in Figure 1.

      We plan to:

      • Ask whether there is a correlation between the gene clusters generated in Figure 1 and the DNA methylation patterns identified in Figure 4. *- At least for the trees that are shown in the main figures it would be great to show support values. *

      We thank the reviewer for this request.

      We plan to:

      • Add full Supplementary information regarding the support values in Supplemental Files for all the trees present in the main Figures. Reviewer 3 raised the following points that we are planning to address:

      *- It would be great to see more data on cephalopod TET and MBD structure. For example, it would be interesting to know whether octopus TETs have a CxxC domain or whether MBD proteins harbor functional 5mC - binding domains. *

      We agree that it would be of interest to examine the conservation of TET genes to expand upon the initial analysis by Planques et al 2021 showing that O. bimaculoides have one TET homolog, one MBD4 homolog and one MBD1/2/3 homolog. Detailed analysis of MBD4 protein has been already performed in de Mendoza et al. 2021 by using the protein sequence of O. vulgaris, as the MBD4 gene in the O. bimaculoides genome appears truncated.

      We plan to:

      • add the PFAM domain analysis for TET proteins This will be added as a new figure panel.
      • Update the text to include the reference to the identification of MBD4/MECP2 as the invertebrate homologs of vertebrate MBD4. *- Even though RRBS provides limited insight into DNA methylation patterns, the authors could have done more to explore read-level 5mC information. For example, by studying single reads, the authors could deduce the numbers of fully methylated, unmethylated or partially methylated reads. Such analyses might provide valuable insight into potentially different modes of epigenetic inheritance in different tissues i.e are there tissues that favor fully methylated or unmethylated stretches of DNA vs tissues that favor partial methylation? *

      We think this is a really interesting point. This has been partially addressed in a previous work (de Mendoza et al., 2021) which found limited to no partially methylated reads in whole-genome bisulfite sequencing from O. bimaculoides brain.

      We plan to:

      3. Description of the revisions that have already been incorporated in the transferred manuscript

      Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. If no revisions have been carried out yet, please leave this section empty.

      Reviewer 1 raised the following points that we have already addressed:

      We addressed all the comments raised by this Reviewer by revising the text, fixing references, typos and improving clarity.

      Reviewer 2 raised the following points that we have already addressed:

      We addressed all the minor Comments raised by this Reviewer regarding spelling errors and Supplementary Figures.

      - The finding that less than 10% of all possible sites are methylated is surprising. I could not (easily) find statistics of RRBS experiment read mapping to the genome.

      We have now provided this data and new Supplemental Table 1 (refereed in the text as Table S1).

      *- It is very exciting to see methylation of gene bodies and some correlation to their expression levels, but the authors may need to include a disclaimer that the methylation of TEs may go undetected due to the gapness of the genome. In fact, the authors may try to map their data onto a somewhat closely related Octopus sinensis genome sequenced with long reads available at NCBI to confirm overall pattern. It is likely though that due the evolutionary distance only gene bodies will have mapping. *

      The thank the reviewer for this suggestion and we included a sentence in the Result session indicating that methylation of TEs may go undetected due to the poor annotation of the octopus genome.

      *- The statistical reasoning (and methodology) behind how clusters in Figures 1 and 4 were defined is unclear. In particular, in Figure 4, it seems that the authors had asked the program to give four clusters in total - why was this number chosen? It seems that using the same generic clustering approach as in Figure 1 may benefit or confirm the results in Figure 4. *

      We clarified the rationale in the Material and Methods session to describe the bioinformatic analysis. We will put the full code used in the manuscript in our GitHub page (https://github.com/SadlerEdepli-NYUAD/) to have a more comprehensive understanding of the Method used.

      Reviewer 3 raised the following points that we have already addressed:

      We addressed all the minor comments in the text and figures raised by this reviewer regarding typos and clarity.

      *- There is little info on the generated 5mC data. To bolster its value as a resource, the manuscript should have a link to the table describing RRBS metrics. This should include: non-conversion rates, numbers of sequenced and mapped reads, read length and other info that the authors deem useful. *

      We have now provided this data in a new Supplemental Table 1 (refereed in the text as Table S1).

      4. Description of analyses that authors prefer not to carry out

      Please include a point-by-point response explaining why some of the requested data or additional analyses might not be necessary or cannot be provided within the scope of a revision. This can be due to time or resource limitations or in case of disagreement about the necessity of such additional data given the scope of the study. Please leave empty if not applicable.

      Reviewer 1 raised the following points that we are not planning to address:

      *- The newly sequence RNA-seq samples are using a ribodepletion protocol (RiboZero) while the other ones are using a polyA selection. This might be a slight problem to compare them quantitatively. Actually in the Figure 1, all 4 newly generated samples group together in the hierarchical clustering. *

      We acknowledge the reviewer’s point here and agree that heterogeneity in library prep and batch is a common issue when comparing public available with newly generated datasets. This could account for the clustering of the Ribosomal RNA depleted (i.e. RiboZero) from polyA selected RNA libraries. While this could potentially introduce bias, we do not believe that it substantially alters any of the main findings or the interpretations of this data. Our purpose for carrying out the cluster analysis of transcriptomic data from multiple tissues was to identify distinct gene patterns that defined different tissue types. This was accomplished regardless of the potential confounding variable introduced by different library preparations. In addition, we used TMP which seems to help in the comparison across different samples when used for qualitative analysis such as PCA and cluster analysis (Zhao et al. 2020; DOI: http://www.rnajournal.org/cgi/doi/10.1261/rna.074922.120). Therefore, even if not ideal we think that this approach is still valuable.

      *- I am not so sure about the way the authors used z-score normalized logTPMs and applied hierarchical clusters, this most likely would not fully alleviate the impact of expression level on the outcome compared to more advanced form of normalization and clustering. *

      We agree with the reviewer that applying z-score or a logTPMs normalization would not fully resolve the technical variance in the direct comparison of libraries generataed with different RNA selection methods. We did not apply z-score on logTPMs but these 2 methods were applied separately: z-score on TPMs in Figure 1B to define the gene clusters and log2(TPM+1) in Figure 4E. We have clarified the text to reflect this.

      *- I am not convinced that differences in western blot for histone modification could really provide a clear insight into their regulatory role. *

      We agree with the reviewer that Western blotting for histone modifications does not provide deep insight into their regulatory role. However, this is the first description of these marks in any cephalopod, and we believe that reporting a finding from experimental evidence is important, even if the result is aligned with the existing paradigm. Moreover, the marked difference in levels of distinct histone marks across tissues supports the hypothesis that they play a regulatory role. We observed this in mice where difference abundance in western blot correspond to different abundance and enrichment also by ChIP-seq (Zhang et al., 2021 DOI: https://doi.org/10.1038/s41467-021-24466-1). Considering the limited tools available in this species, we still consider this an important finding.

      Reviewer 2 raised the following points that we are not planning to address:

      *- The finding that less than 10% of all possible sites are methylated is surprising. I could not (easily) find statistics of RRBS experiment read mapping to the genome. I also wonder how much the gap-richness of the genome may affect the overall methylation estimate. If assembly permits, would it make sense to limit the sampled sites to areas where no flanking gaps are present (and sufficient scaffold length is available, maybe excluding very short scaffolds)? *

      We added all the statistical values regarding the RRBS in a NEW Supplemental Table 1. We used a single base pair analysis approach (not tiling windows), so the data we extracted is not biased by the length of the scaffolds. This is confirmed by the fact that the DNA methylation value obtained in our RRBS data matches the findings observed in Whole Genome Bisulfite Sequencing (WGBS). Moreover, global DNA methylation values assessed by Slot blot analysis as a technique independent from genome assembly confirmed what observed with RRBS.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The manuscript by Macchi et al describes the epigenome and the transcriptome of Octopus bimaculoides. While the manuscript itself is well written and the data are properly analyzed, it is fair to say that the work itself offers little biological novelty. Nevertheless, I still believe that the datasets and some of the analyses could be useful to researchers studying invertebrate epigenomes and gene regulation.

      1. It would be great to see more data on cephalopod TET and MBD structure. For example, it would be interesting to know whether octopus TETs have a CxxC domain or whether MBD proteins harbor functional 5mC - binding domains.
      2. There is little info on the generated 5mC data. To bolster its value as a resource, the manuscript should have a link to the table describing RRBS metrics. This should include: non-conversion rates, numbers of sequenced and mapped reads, read length and other info that the authors deem useful.
      3. Even though RRBS provides limited insight into DNA methylation patterns, the authors could have done more to explore read-level 5mC information. For example, by studying single reads, the authors could deduce the numbers of fully methylated, unmethylated or partially methylated reads. Such analyses might provide valuable insight into potentially different modes of epigenetic inheritance in different tissues i.e are there tissues that favor fully methylated or unmethylated stretches of DNA vs tissues that favor partial methylation?

      Minor comments:

      There are a few spelling errors throughout the manuscript. Please check for those: Figure 4F ("Trascrips" instead of transcripts), Schmedtea instead of Schmidtea. There are likely other errors as well.

      Page 3 - "intergenome"sounds a bit weird.

      The authors might consider citing Planques et al, 2021 (BMC Biol) alongside Mendoza et al when discussing unusually high 5mC levels in the sponge.

      Significance

      The main points of the paper are: i) a somewhat improved transcriptome, ii) DNA methylation data generated by RRBS that follows a canonical invertebrate pattern (low 5mCG levels present in GBs and absent from repeats), and iii) evolutionary analyses of epigenetic machinery components. While lacking biological novelty, the presented data have a resource value and could likely serve as a decent starting point for further exploration of cephalopod gene regulation. I therefore believe that with some revision the manuscript will merit publication in one of the Review Commons - associated journals.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The paper by Macchi et al studies DNA methylation patterns in Octopus bimaculoides, describing overall conservation of DNA methylation machinery and genome-wide methylation patterns and their effect on gene expression across broad tissue sampling. As such, the paper comrpises a key advancement in the emerging field of cephalopod (epi)genomics and gene regulation. Despite the difficulties relating to the genome assembly of O. bimaculoides, the authors have done a solid analysis of methylation patterns and the results look generally sound. I have a few points that may help the authors improve their manuscript:

      • The finding that less than 10% of all possible sites are methylated is surprising. I could not (easily) find statistics of RRBS experiment read mapping to the genome. I also wonder how much the gap-richness of the genome may affect the overall methylation estimate. If assembly permits, would it make sense to limit the sampled sites to areas where no flanking gaps are present (and sufficient scaffold length is available, maybe excluding very short scaffolds)?
      • It is not exactly clear to me why the authors look for expression clusters in the first part of the manuscript? This information, while interesting, does not seem to be used in the methylation analysis. It is also somewhat contradictory because the authors first claim that, based on their GO-term enrichment analysis, that different expression clusters are associated with "complex regulatory mechanisms, potentially based in the epigenome". Yet at the end they conclude that, due to the global and tissue-overarching nature of methylation, this "argues against this epigenetic modification as a player in the dynamic regulation of gene expression".
      • It is very exciting to see methylation of gene bodies and some correlation to their expression levels, but the authors may need to include a disclaimer that the methylation of TEs may go undetected due to the gapness of the genome. In fact, the authors may try to map their data onto a somewhat closely related Octopus sinensis genome sequenced with long reads available at NCBI to confirm overall pattern. It is likely though that due the evolutionary distance only gene bodies will have mapping.
      • At least for the trees that are shown in the main figures it would be great to show support values.
      • The statistical reasoning (and methodology) behind how clusters in Figures 1 and 4 were defined is unclear. In particular, in Figure 4, it seems that the authors had asked the program to give four clusters in total - why was this number chosen? It seems that using the same generic clustering approach as in Figure 1 may benefit or confirm the results in Figure 4.
      • In the discussion Scmidtea is misspelled.
      • Some supplementary figures have to be exported as spell checker highlights are still present (e.g., in Suppl Fig 4).

      Significance

      This manuscript is an important step towards understanding the workings of gene regulation at the epi-genomic level in octopus and cephalopods in general

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript focuses on the role of DNA methylation and histone modification in the gene regulation of cephalopods. It complements recently published RNA-seq and MethylSeq datasets with a few extra samples and generally confirms previous findings that DNA methylation does not play an active role in tissue or stage-specific regulation of gene expression in cephalopods (which is the general rule for most non-vertebrates). I don't see any methodological issue serious enough to preclude publication but some details should be strengthened.

      • the newly sequence RNA-seq samples are using a ribodepletion protocol (RiboZero) while the other ones are using a polyA selection. This might be a slight problem to compare them quantitatively. Actually in the Figure 1, all 4 newly generated samples group together in the hierarchical clustering.
      • It is unclear why the authors did not use the original gene models of O. bimaculoides or tried to improve them. By only relying on adult tissue (but the relatively late hatchling stage), they would have omitted most developmentally expressed genes, that are incidentally also the ones that are subjected to extensive spatiotemporal gene regulation (which is also a problem to assess the role of methylation). I think more comparisons with existing gene models and how the newly generated stringtie models should be provided.
      • I am not so sure about the way the authors used z-score normalised logTPMs and applied hierarchical clusters, this most likely would not fully alleviate the impact of expression level on the outcome compared to more advanced form of normalisation and clustering.
      • I am not convinced that differences in western blot for histone modification could really provide a clear insight into their regulatory role

      Significance

      This manuscript reports confirmatory results, partly reanalysing and confirming previous work. I would also like to stress that the methylation results have already been reported and discussed in a previous paper (de Mendoza et al. 2021). I don't have a fundamental problem with this but I also find the paper slightly overambitious and unspecific in its goals. I think it should benefit from being made slightly more concise. I find the part of histone marks is quite overstated. These marks are quite universal in eukaryotes and generally demonstrated to play a regulatory role, the fact that they can be detected in cephalopods by western blot is therefore not really a result.

      Comments on the text (difficult without line numbers):

      • Intro, first section: it would be good to have a few more references
      • "While this has been extremely fruitful in elucidating detailed mechanisms of epigenome patterning, regulation and function, they do not provide a comprehensive understanding of the multiple and varied ways that the epigenome functions." -> sentence is quite confusing and without very clear meaning
      • "In contrast, the most common invertebrate model organisms - Caenorhabditis elegans and Drosophila melanogaster - lack DNA methylation entirely. " -> could sound like this is the case for more invertebrates.
      • P4 "This is the case in many animals, " -> give examples, it is unclear which examples of TE control by methylation outside vertebrates have been corroborated by data. The paper cited do not deal with methylation in squid
      • Evolution has selected for variations in the canonical patterns of methylation -> such explanation could also be consistent with neutralistic explanations
      • p19: Schmidtea (type) "such as the planarian Schmedtea mediterranea"
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The manuscript entitled "Lhx2 is a progenitor-intrinsic modulator of Sonic Hedgehog signaling during early retinal neurogenesis" by Li et al is a very interesting study in which the importance of Lhx2 is studied in conditional knock-out background to decipher the importance during retinal neurogenesis of developing embryo. The study reveal importance of co-receptors essential to Shh signalling. Data presented are clean and would add depth of knowledge to the literature. The study/manuscript do have some lacunae which are listed below which would be good to address before it is published

      Major comments

      1. Authors do not seem to have performed a rescue experiment with Lhx2 CKO. If this is absolutely not possible, a conditional overexpression could have given confirmatory clues on the Lhx CKO phenotype discussed. In any case some sort of rescue experiments are essential with respect to Lhx2 as done with Cdon and Gas1
      2. Also, overexpression studies with the following genes, Cdon, and Gas1 is interesting in the CKO background. What about their over expression phenotype in Wild-type, Ptch1-CKO, and purmorphamine/Shh-N treated conditions. Alternatively, an ex-vivo or in vitro approach using cultured cells may also prove worthy to prove this point.
      3. A logical explanation of Tamoxifen administration at a window E11.5 - E15.5 is good to have in the results section.
      4. Authors say ".........that Lhx2-deficient RPCs can respond to recombinant Shh-N at more physiologically relevant concentrations, but their response is still attenuated compared to Lhx2-expressing RPCs." If the Shh-N dose is increased above physiological concentrations in Lhx-CKO conditions does the Gli1 read-out will restore to normalcy ? This would give insight on to the roles of co-receptors such as Cdon, Gas1.

      Minor comments

      1. In the 'Introduction' the authors write "Pathway activation is not achieved, however, by simple ligand binding to Patched, but also requires one of three co-receptors: Cell Adhesion, Oncogene Regulated (Cdon), Brother of Cdon (Boc), or Growth Arrest Specific 1 (Gas1)." Please give a citation of appropriate literature.
      2. In the introduction authors write "In this case, an interaction with Notch signaling is partly responsible, through Lhx2-dependent expression of ligand (Notch1), receptors (Dll1, Dll3), and downstream transcriptional effectors (Hes1, Hes5)" I feel Dll1 and Dll3 are ligand and Notch is receptor. Please check.
      3. Figure 1c: Western blots tubulin is less in CKO alongside Gli1. It would be better to do quantification for showing any significant changes.
      4. Figure 2A and Result 2: Why harvesting at E14.5 specifically for qPCR/ChIP sequencing, while for RNA sequencing and in situ , it was done E15.5.
      5. Figure 5D, E: It is not clear why there were more mCitrine+ cells in CKO explants at 96 hours.
      6. Figure 6 C,D,E,F: Does Lhx2 CKO, cause cell death as the levels of vsx2 are low in vehicle in CKO (D,F) as compared to ctrl (C,E).
      7. The levels of Gli1 are higher in CKO vehicle (D,F) than the control panel (C,E). This difference in Gli1 expression is more evident D versus C. Does it mean that CKO increases Gli1 expression in explants, which seems to be opposite of what shown in the first results (Figure 1D, E).
      8. It is not clear why the increase in Gli1 expression with Shh-N (E,F) is not that much evident than with purmorphamine (C,D) in both control and CKO explants.
      9. Figure 7 7-F: The changes in the levels of Cyclin D1 and Hes1 in Ctrl/Lhx2 CKO/ dCKO are not very clear from the data in present form. It would be better to show the changes in their mRNA levels by qPCR analysis. Further, it is not clear how the changes in CyclidD1 and Hes1 levels are proving that Lhx2 acts downstream of Shh signaling.
      10. Supplementary table 4: In the page 3 have typo error "centrations at 72 hr"
      11. Supplementary Figure 8 is labeled as Supplementary Figure 7, so there are two figures labeled as Supplementary Figure 7.

      Significance

      Study is significant, and adds more depth to existing knowledge in this science field. Developmental and cell biologists would benefit from this study.

      Retina regeneration, Cellular signalling, Epigenetics, One knock outs/knockdowns, transgenics, RNAseq, Microarray, ChIPseq, Cell sorting

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The manuscript by Li et al., entitled: "Lhx2 is a progenitor-intrinsic modulator of Sonic Hedgehog signaling during early retinal neurogenesis," focuses on an important topic in developmental biology-the regulatory interaction between transcription factors (TFs) and signaling pathways, namely how the TF confers cells' competence to respond to extrinsic cues. The study focuses specifically on Lhx2 regulation of Sonic Hedgehog (HH)-pathway genes in retinal development. The authors approach this complex topic through global transcriptomic analyses combined with elegant functional in-vivo studies, including a systematic examination of the pathway genes through the use of inhibitors and ligand on cKO retinal explants. The results reveal complex regulation by Lhx2 of several HH-pathway genes in the developing mouse retina, including Ptch, Gli1 and the co receptors Cdon and Gas1. The finding that a single TF controls several components of the signaling pathway is interesting. Nevertheless, probably due to the complex activity of Lhx2, which functions on additional targets, it remains unclear how regulation of HH-pathway genes by Lhx2 impacts the eventual phenotype of the Lhx2 cKO retinal progenitor cells (RPCs). In the following, I list the main findings and several comments that need to be addressed:

      Fig. 1 presents the experimental system using inducible Hes1-CreERT to mutate Lhx2 on E11.5 and examine the expression of Gli1 and Sonic Hedgehog in the control and mutant.

      • The authors should present the distribution of the Lhx2 protein in the control vs. mutant. Considering that the deletion is of only part of the gene (as shown in Fig. 2), it is important to present the loss of the protein as well as the efficiency of Cre activity. • On the figure, add a characterization of the cellular phenotype of the cKO retinas on E15.5 by presenting the expression of markers for ganglion cells and RPCs. Figs. 2 & 3 present the bulk RNA-seq analysis of the Lhx2 cKO retinas, including experimental design, validation and results. The integration of previously published ATAC-seq and ChIP-seq data for Lhx2 point to the direct targets and bound regulatory regions.<br /> • "4 biological repeats per genotype" - Specify if four eyes were sampled from two embryos or from four different embryos. Were the embryos from different litters? • Add GSEA analysis for the HH-pathway genes. Fig. 4 presents a published approach to quantifying the response to HH using a cellular reporter assay (Li et al., 2018), whereas in Fig. 5, availability of HH ligand is evaluated by elegantly implementing the cellular reporter assay. The results suggest that Lhx2 does not regulate ligand availability. • Fig. 4 presents a published approach and thus can be included in Fig. 5.<br /> Fig. 6 presents evidence that in the Lhx2 cKO, the Shh pathway is functional downstream of Smo, because the expression of Gli1 increases in cKO cells following Smo activation (with purmorphamine). Furthermore, the response to Shh-N is shown to be partly attenuated in the Lhx2 cKO retina. <br /> Figure 7 examines whether Ptch deletion can rescue aspects of the Lhx2 phenotype. This was done by comparing the phenotypes of cKOs of Lhx2, Ptch, or both Ptch and Lhx2. The results revealed partial rescue, in the Ptch and Lhx2 cKO, of the expression of Ptch1 and Gli1, but not of the proliferation and premature differentiation phenotypes based on expression of Cyclin D1, EDU, PCNA and Hes1. • Add images of the control to Fig. 7B,C. • Explain how the deletion of Ptch1 was examined. They next investigated regulation of the Ptch co - receptors Cdon and Gas1 by Lhx2 (Figs. 8, 9). Fig. 8 presents the developmental expression pattern of Cdon and Gas1 in the control, and their downregulation in the Lhx2 cKO (although Cdon is maintained in the dorsal optic cup). The results show that Cdon is the co-receptor that is normally expressed in RPCs. GAS1 seems to play a role in the peripheral progenitors destined to ciliary body and iris.<br /> Electroporation of both receptors into the Lhx2 cKO retinas resulted in increased pathway activity (based on Gli1 reporter). • Both Cdon and Gas1 were electroporated into the Lhx2 cKO retina, although Gas1 is not expressed in control RPCs (based on the analysis in previous panels). Explain why both were co-electroporated and the outcome of electroporating only Cdon. • The outcome of electroporation of the co-receptors into control retina should be presented. • It is important to include staining for Lhx2; it is possible that the cells that respond to the co-receptors are those that were not mutated (escapers). Presenting the loss of Lhx2 (or Cre activity through the use of a reporter) and comparing it to the outcome of electroporation into the control retina are therefore required.

      Finally, the authors present evidence that Lhx2 cKO, on E13.5 when Cdon is no longer expressed in the RPCs, continues to compromise the HH - pathway genes. This further supports continued regulation of several HH-pathway genes in early and late RPCs.

      • The finding that a Lhx2 controls several components of HH pathway could be relevant to Lhx2 activity in patterning of the cortex - I suggest to discuss the possible relevance of the findings to other organs.

      Additional comments:

      • Fig. 4E: Add explanation of the quantitative analysis. • Fig. 5: Explain how results were normalized based on retinal size (which is significantly smaller in cKO retinas). How many independent experiments were run here? How many different retinas were tested? Were retinas taken from the same mouse considered 'independent'?<br /> • Fig. 8B: Indicate the genotype of the presented tissue. • Fig. 8 A,B should be presented in one panel, in the same orientation. • Fig. 8D: Present the different channels, in addition to the merge image.

      Significance

      The study focuses on an important topic in developmental biology-the regulatory interaction between transcription factors (TFs) and signaling pathways, namely how the TF confers cells' competence to respond to extrinsic cues. The study focuses specifically on Lhx2 regulation of Sonic Hedgehog (HH)-pathway genes in retinal development. The results reveal complex regulation by Lhx2 of several HH-pathway genes in the developing mouse retina, including Ptch, Gli1 and the co receptors Cdon and Gas1. The finding that a single TF controls several components of the signaling pathway is interesting. Nevertheless, probably due to the complex activity of Lhx2, which functions on additional targets, it remains unclear how regulation of HH-pathway genes by Lhx2 impacts the eventual phenotype of the Lhx2 cKO retinal progenitor cells (RPCs).

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We plan to address the minor comments from both reviewers as here described:

      Reviewer 1 – comments____:

      1. Figure 1a + 1b: The pattern of H3K9me3 at SVA elements appears to be quite different between the two cell types tested - in iPSCs it appears to be more strongly centered on the SVA with some "spreading" occurring upstream of the element, however in NCCITs it appears to mark downstream of the element and is not clearly seen to occur on the SVA. Is this due to sequencing data from different labs or due to cell - type specific epigenetic marks? The authors should consider showing H3K9me3 marks at a comparable region for example repressed genes or another family of repressed TEs as an internal control as well as showing profile plots to more clearly show the spread of epigenetic marks over the SVAs. As recommended by the Reviewer, in Figure 1 we will show several examples of H3K9me3 marked SVAs in both cell types. We will propose explanations for discrepancies if needed, but acknowledge the possibility of a sequencing artifact, as suggested by the Reviewer.

      Figure 1: Given that the data in this paper is mostly from NCCIT cells it would be advisable for conclusions in hiPSCs to be tempered accordingly.

      We will temper the conclusions as recommended.

      Figure 1c: Which SVAs have neither H3K9me3 nor H3K27ac, what might these represent? Do they have any other notable features, or are they of a specific age?

      The (very few) SVAs that have neither of those markers are almost certainly an artifact of mappability issues. We will point this out in the manuscript, including both the text and the figure legend.

      Line 128-129: Is the assumption that SVAs are enriched in these region as opposed to dictating the epigenetic landscape as a consequence of their own sequence, is this correct?

      This is correct, and we will clarify this in the text.

      Figure 2c: Are there any enriched motifs in the repressed SVAs?

      We performed this analysis, and several motifs are enriched in the repressed SVAs. We will add this data in the revised manuscript and in Fig. 2C.

      Figure 2c: How many of the de-repressed SVAs contain this motif? It would be interesting to know if the YY1/OCT4 binding sites exist within any of the repressed SVAs and then, later, whether they accordingly lack actual binding of the TFs due to the presence of H3K9me3.

      We will perform this straightforward analysis using the MEME-suit or HOMER, and include it in the revised version.

      Figure 3d: It would be interesting to assess how close the differentially expressed genes are to an LTR5H element, or however many of the SVA-proximal and differentially expressed genes are also LTR5H-proximal?

      We will perform this straightforward analysis and include it in the revised version.

      Figure 3e: Are genes close only to de-repressed SVAs considered here? Worth specifying in the text. Also, what are the 20% of genes which are upregulated?

      Yes, the reviewer is correct, only genes close to de-repressed SVAs are considered. We will specify that in the manuscript, and also add a results paragraph on the 20% of genes that are upregulated.

      Figure 4a: Are the binding motifs for YY1 present at both binding sites? Are they the same, is the surrounding sequence the same? Are either of the binding sites present in repressed SVAs/is there any detectable binding of SVA/OCT4 in repressed SVAs? Are the YY1/OCT4 bound SVAs also those marked with H3K27ac in the Wysocka Lab dataset?

      The YY1-OCT4 motif is present only where both of the factors bind together. We will specify this in the manuscript (results section). As for the second question: yes these correspond to the regions marked by H3K27ac in the Wysocka dataset. We will clarify this in the manuscript.

      Figure 4c: Example used is a SVA-D element, are the YY1/OCT4-bound SVAs within the de-repressed group of a specific age?

      No, they are found in all SVA groups (SVA through F). We will specify this in the manuscript (results section).

      Figure 4d: are there GO enrichment terms for the genes bound by either YY1 and / or OCT4 different?

      We will perform this straightforward analysis using the Ingenuity Pathway Analysis toolkit, and include it in the revised version in the results section.

      Figure 5: Concluding figure should address how the SVAs subset in terms of binding, H3K9me3, gene expression changes and TFBS/TF binding - there are a lot of parameters which are assessed within the de-repressed subclass and it would be useful to show somewhere graphically where and when they co-occur or not.

      We will edit the model figure to show the mechanism highlighted by the reviewer, as requested.

      Finally, it would be helpful to see a discussion about what dictates the absence of H3K9me3 / presence of H3K27ac? Is this due to the TFBS sequence within the element? Further, a discussion on how the TFBS is gained in newer elements / lost in older elements is lacking. While the authors begin by stating that they are going to address what dictates whether an element is co-opted and conclude that it is due to sequence and location, I would suggest that as no conclusion is drawn on how the sequence changes to permit co-option and how the location dictates co-option, it may be worth tempering down the introduction on this point.

      We will edit the introduction and discussion, as requested.

      Reviewer 2 – comments____:

      1) Given the CRISPR sgRNAs also target LTR5Hs, which are also bound by OCT4 in NCCIT cells (PMID: 25896322), it would be helpful to rule out more specifically that the observed effects on gene regulation associated with SVAs are actually due to nearby LTR5Hs copies being similarly repressed by CRISPRi. Depending on those results, it may also be fair to further note in the Discussion that this aspect of sgRNA selection is a potential caveat.

      We will edit the discussion as recommended by the Reviewer.

      2) The approaches taken here provide surprisingly good locus-specific resolution of histone modifications and TF binding to SVAs using only uniquely mapping reads. An example of this, SVA_D_r153 (a heavily 5' truncated SVA) is provided. It could be really useful to convey the central theme of the study by providing in a main figure another SVA example. Except, show a longer SVA (to demonstrate mappability and perhaps enrichment of reads on specific SVA features) near a protein-coding gene, where the SVA becomes repressed in the CRISPRi approach and the gene is differentially expressed as a result. An IGV-style figure perhaps demonstrating each key component of the work in one figure.

      As recommended, we will update figure 4 by adding a full length SVA.

      3) Literature. Line 58 - would suggest adding PMID: 27197217 and PMID: 33186547. Line 60 - would add PMID: 33722937. Line 67 - would add PMID: 22053090.

      We will add these citations in the manuscript as recommended by the Reviewer.

      4) Clarifications: Line 108 - the same 751 SVAs came up in both iPSCs and NCCITs? Line 117 - how many (if any) of the SVAs called as repressed by H3K9me3 were also called as de-repressed by H3K27ac? i.e. are the two histone marks giving completely concordant results for calling SVAs as repressed / de-repressed. Line 215 - no evidence for OCT4 or YY1 binding to any SVAs after CRISPRi at all? We will provide the exact numbers in the manuscript to answer these questions.

      5) Finally, a point perhaps best left to the Discussion. Was there any cellular phenotype identified subsequent to the CRISPRi?

      We did not identify any obvious cellular phenotype and we will mention this in the discussion, as recommended.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Barnada et al. explore the regulatory impact of SVA retrotransposons on gene regulation, focusing on NCCIT cells as a workhorse model of pluripotency. They use published and new H3K9me3 and H3K27ac ChIP-seq datasets to identify repressed and de-repressed SVAs, noting that many are in close proximity to protein coding genes. De-repressed SVAs are enriched for YY1/OCT4 binding sites. The authors then use RNA-seq and ChIP-seq to demonstrate YY1 and OCT4 binding are abrogated by SVA CRISPRi, disrupting gene regulation enacted by the SVAs. These findings highlight an important mechanism by which SVA retrotransposons can regulate genes in pluripotent cells.

      This work is well executed. I appreciated the consideration paid to ChIP-seq read mappability and the implementation of CRISPRi followed by additional ChIP-seq. The following comments are intended to clarify the findings, which appear to have been obtained from robust experimental approaches.

      Minor issues:

      1. Given the CRISPR sgRNAs also target LTR5Hs, which are also bound by OCT4 in NCCIT cells (PMID: 25896322), it would be helpful to rule out more specifically that the observed effects on gene regulation associated with SVAs are actually due to nearby LTR5Hs copies being similarly repressed by CRISPRi. Depending on those results, it may also be fair to further note in the Discussion that this aspect of sgRNA selection is a potential caveat.
      2. The approaches taken here provide surprisingly good locus-specific resolution of histone modifications and TF binding to SVAs using only uniquely mapping reads. An example of this, SVA_D_r153 (a heavily 5' truncated SVA) is provided. It could be really useful to convey the central theme of the study by providing in a main figure another SVA example. Except, show a longer SVA (to demonstrate mappability and perhaps enrichment of reads on specific SVA features) near a protein-coding gene, where the SVA becomes repressed in the CRISPRi approach and the gene is differentially expressed as a result. An IGV-style figure perhaps demonstrating each key component of the work in one figure.
      3. Literature. Line 58 - would suggest adding PMID: 27197217 and PMID: 33186547. Line 60 - would add PMID: 33722937. Line 67 - would add PMID: 22053090.
      4. Clarifications: Line 108 - the same 751 SVAs came up in both iPSCs and NCCITs? Line 117 - how many (if any) of the SVAs called as repressed by H3K9me3 were also called as de-repressed by H3K27ac? i.e. are the two histone marks giving completely concordant results for calling SVAs as repressed / de-repressed. Line 215 - no evidence for OCT4 or YY1 binding to any SVAs after CRISPRi at all?
      5. Finally, a point perhaps best left to the Discussion. Was there any cellular phenotype identified subsequent to the CRISPRi?

      Significance

      This interesting study systematically demonstrates the importance of SVA-mediated gene regulation, mediated by YY1 and OCT4, in a cellular model of pluripotency. It would be of broad interest, particularly to those interested in gene regulation, pluripotency and retrotransposons. I would say most of the findings are new to the literature and that the closest publications I can think of in terms of scope either deal differently with SVA regulation (e.g. PMID: 33722937) or upon different retrotransposons (e.g. PMID: 30070637). The significant advance is therefore both technical and conceptual.

      Geoff Faulkner (University of Queensland)

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Genomic features underlie the co-option of SVA transposons as cis-regulatory elements in human pluripotent stem cells Barnada et al.

      Barnada et al., set out to address the question of which factors dictate when and how certain TEs become co-opted to regulate host cellular functions while others do not, citing a lack of global understanding of the mechanisms underpinning this widely occurring phenomenon. To do so they use human SVA elements as a study, interrogating the epigenetic regulation of these elements in NCCITs as a proxy for pluripotent cells to reveal that a subset of younger, human-specific SVAs lack canonical H3K9me3 repressive marks while being enriched for H3K27ac active enhancer marks. The authors show that these SVAs, termed de-repressed SVAs, contain adjacent YY1 and OCT4 binding motifs, are closer to genes and TFBSs than the repressed SVAs and as such propose that they may function as active enhancers. To demonstrate this they generate a CRISPRi NCCIT cell line to epigenetically silence de-repressed SVAs using two gRNAs targeting SVAs genome-wide. Upon activation of dCas9 in this system differential expression of >3000 genes occurs, notably the broad repression of de-repressed SVA-proximal genes which are enriched for GO terms and TFBSs related to gametogenesis. The authors then demonstrate that silencing of naturally de-repressed SVAs disrupts YY1/OCT4 binding which is seen in wildtype NCCITs to occur adjacently at one site in the SVA element with solo-YY1 binding also occurring in a subset of SVAs at a second site. Disruption of YY1/OCT4 binding by targeting H3K9me3 to derepressed SVAs leads to dysregulation of proximal genes providing further evidence that SVAs act as enhancers via YY1/OCT4 binding to regulate nearby cellular genes.

      Overall, the manuscript is clearly written and the computational and experimental approaches thorough; the findings are compelling and novel and make a helpful contribution to our current understanding of transposon co-option by host genomes. The demonstration of TFBS located within a subset of newer SVA elements is particularly interesting, with binding disrupted upon epigenetic silencing of these elements by CRISPRi. I have some minor comments and queries about further discussion points:

      Minor comments:

      1. Figure 1a + 1b: The pattern of H3K9me3 at SVA elements appears to be quite different between the two cell types tested - in iPSCs it appears to be more strongly centred on the SVA with some "spreading" occurring upstream of the element, however in NCCITs it appears to mark downstream of the element and is not clearly seen to occur on the SVA. Is this due to sequencing data from different labs or due to cell - type specific epigenetic marks? The authors should consider showing H3K9me3 marks at a comparable region for example repressed genes or another family of repressed TEs as an internal control as well as showing profile plots to more clearly show the spread of epigenetic marks over the SVAs.
      2. Figure 1: Given that the data in this paper is mostly from NCCIT cells it would be advisable for conclusions in hiPSCs to be tempered accordingly.
      3. Figure 1c: Which SVAs have neither H3K9me3 nor H3K27ac, what might these represent? Do they have any other notable features, or are they of a specific age?
      4. Line 128-129: Is the assumption that SVAs are enriched in these region as opposed to dictating the epigenetic landscape as a consequence of their own sequence, is this correct?
      5. Figure 2c: Are there any enriched motifs in the repressed SVAs?
      6. Figure 2c: How many of the de-repressed SVAs contain this motif? It would be interesting to know if the YY1/OCT4 binding sites exist within any of the repressed SVAs and then, later, whether they accordingly lack actual binding of the TFs due to the presence of H3K9me3.
      7. Figure 3d: It would be interesting to assess how close the differentially expressed genes are to an LTR5H element, or however many of the SVA-proximal and differentially expressed genes are also LTR5H-proximal?
      8. Figure 3e: Are genes close only to de-repressed SVAs considered here? Worth specifying in the text. Also, what are the 20% of genes which are upregulated?
      9. Figure 4a: Are the binding motifs for YY1 present at both binding sites? Are they the same, is the surrounding sequence the same? Are either of the binding sites present in repressed SVAs/is there any detectable binding of SVA/OCT4 in repressed SVAs? Are the YY1/OCT4 bound SVAs also those marked with H3K27ac in the Wysocka Lab dataset?
      10. Figure 4c: Example used is a SVA-D element, are the YY1/OCT4-bound SVAs within the de-repressed group of a specific age?
      11. Figure 4d: are the GO enrichment terms for the genes bound by either YY1 and / or OCT4 different?
      12. Figure 5: Concluding figure should address how the SVAs subset in terms of binding, H3K9me3, gene expression changes and TFBS/TF binding - there are a lot of parameters which are assessed within the de-repressed subclass and it would be useful to show somewhere graphically where and when they co-occur or not.
      13. Finally, it would be helpful to see a discussion about what dictates the absence of H3K9me3 / presence of H3K27ac? Is this due to the TFBS sequence within the element? Further, a discussion on how the TFBS is gained in newer elements / lost in older elements is lacking. While the authors begin by stating that they are going to address what dictates whether an element is co-opted and conclude that it is due to sequence and location, I would suggest that as no conclusion is drawn on how the sequence changes to permit co-option and how the location dictates co-option, it may be worth tempering down the introduction on this point.

      Significance

      This work represents a novel, comprehensive and significant contribution to the understanding of co-option of transposons into host gene networks and factors underlying these processes.

  3. Feb 2022
    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      The authors do not wish to provide a response at this time.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The article by Skokan and coworkers studies the regulation of macropinocytosis in the Hydra. They design a clever assay to image the formation of macropinosomes in the ectodermal cells of the Hydra body, by amputating the head and the foot of the animal and then helving it onto a thin glass rod, allowing them to study the dynamics of actin rings formation, associated with uptake of external fluid phase. They also observe the cyclic formation of macropinosomes during the oscillatory contractions of spheroids formed from amputated animals during regeneration. By using agonist and antagonist drugs targeting mechano-sensitive calcium channels, they show that the formation of macropinosomes correlates with the reduction of cell tension. Overall, the article is succint, but clear and convincing. However, in my opinion, two major points should be clarified, if not solved before considering publication.

      Major points:

      1. the function of macropinocytosis in the Hydra is not known. The author postulates that it could be linked to a regulation of membrane area during animal contractions. However, one may wonder if the membrane cell surface really changes during contractions. I wonder if another explanation is possible: most of the organisms leaving in fresh water require an efficient mechanism to remove excess water that comes in the cells through osmosis. The hydra regular contractile movement are part of this, and I am wondering the macropinocytosis could be linked to this mechanism. Would the author be able to apply osmotic shocks, in particular hypertonic shocks, and see how it changes the formation rate and the dynamics of macropinosomes? On the reverse, in paralyzed animal, I am wondering if macropinosomes are still formed? Results from these experiments may give a clue about the function of macropinocytosis in the Hydra.
      2. Because of the role of Piezo and other mechano-sensitive calcium channels, the author conclude that the factor that limits macropinocytosis is membrane tension. However, unless I am mistaken, actin cytoskeleton has also been involved in mechano-sensing channels, it could be that cortical tension, rather than membrane tension is playing a regulatory role. A direct proof of membrane tension (by measuring it) changes would be required to conclude as the authors do. The role of membrane tension versus macropinocytosis could be directly assessed using membrane tension probes such as FliptR or flipper probes. Otherwise, a less clearly defined term, that combines both cortical tension and membrane tension, such as cell surface tension or cell tension would be preferable.
      3. Number of macropinocytic cups(actin rings) per cell is used as a readout for rate of macropinocytosis. Yet in addition to the number of cups parameters the diameter increases in certain conditions such as GdCi3. It would ideally be interesting to show the changes in diameter of cups and how this varies per in different conditions. For example, in videos of Jedi1 treated body columns the cups seem bigger in size. Supporting experiments of monitoring macropinosomes via dextran uptake assays needs to be performed for quantifications a rate of change in macropinocytosis is proposed. Alternatively, dextran beads of different molecular sizes with different fluorophores could also be used to assess the differences in rate and volume of uptake via macropinocytosis in various conditions of this study.
      4. If membrane tension is altered upon dissecting Hydra fragments, would it make sense to study potential changes in macropinocytosis within the regenerating body column? Such as differences in actin ring formation in cells close to wound edges versus equatorial regions of regenerating body columns and spheroids?
      5. Reasoning for selection of Piezo as molecular target over other stretch activated channels has not been provided. Piezo activators have been used, on the contrary depletion of Piezo via RNAi could be performed in intact animals to assess increased macropinocytosis. Furthermore, rate of macropinocytosis could be assessed in body columns generated from Piezo depleted animals. This would further support the direct role of Piezo in the process.

      Minor points:

      1. The authors report differences in macropinocytosis based on different parts of the animal (Fig.S1), upon treating intact animals with GdCI3 (Fig.2C) how does this vary? Do the differences still persist in spite of increased macropinocytosis?
      2. Hydra are animals with an elongated body column. Dissecting body columns of different lengths could give rise to spheroids of different volume, these could then be inflated to establish a comparative volume study with different volumes and macropinocytosis.
      3. For all graphical representation it would also be ideal to state the p-values for each significant comparison to better appreciate differences instead of stars.
      4. For better understanding of figures, highlight in graph legends and figure panels which tissue sample has been used i.e intact animal, body column or spheroid.
      5. A graphical representation could be given for the comparison of macropinocytic cups between intact hydra versus body column samples with statistical analysis. To appreciate the claims made by the authors regarding the trend of more cups being observed in body columns versus intact animals, as only the mean values are stated in the text.
      6. In Fig.1F, there exist streaks of dextran distinctly outlining apical membranes of cell sets in the hydra epithelia, what are these suggestive of?
      7. Fig.S1 No p-values

      Significance

      Overall, the work is of interest for several research communities. The significance could be increase by providing a few more experiments about the physiological role of macropinocytosis in the Hydra.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      In this manuscript, Skokan et al. develop a platform of cnidarian Hydra vulgaris, a powerful model for cellular self-assembly and organismal regeneration, to enable visualization of macropinocytosis in living tissue. Utilizing this system and small molecule perturbation, authors discover that macropinocytosis occurs constitutively at the ectoderm across the entire body axis of Hydra, and is constrained by membrane tension through stretch-activated channels and the downstream calcium influx.

      Major Comments:

      The manuscript is clearly written and logically organized, and the imaging results are properly quantified. With the logical interpretation, adequate biological repeats and statistical analysis, the method and data in this manuscript are clear and compelling. The major concern is the missing physiological significance of macropinocytosis induced by membrane relaxation in Hydra, if any.

      Suggested experiments:

      1. In Fig 2, the importance of SAC and Ca2+ for macropinocytosis are addressed. However, only one SAC inhibitor was used, whereas Ca2+ concentration in Ionomycin treated Hydro remained high even after 60 min when macropinocytic cup density had recovered (Fig 2E and G). As the authors mentioned in the DISCUSSION, other SAC transported cations may be involved in and thus need to be tested. Simply, the medium depleted of specific cation or water containing specific cation could be used to monitor the requirement of each cation on Jedi2 treated Hydra.
      2. In Fig 3, the authors demonstrate that increased membrane tension leads to higher Ca2+ concentration and less macropinocytic cups in Hydras. The SAC inhibitors and EDTA (or calcium free buffer) used in Fig2 should be applied in the inflated regenerative spheroids to confirm that membrane tension inhibits macropinocytosis via SAC and Ca2+.
      3. The authors observed an increase of macropinocytic cups in both amputated Hydra and regenerative spheroids than intact animal (0.186 and ~0.3 compared to 0.015 cups per cell, Fig S1, 2E, 3C). Would the inflation or inhibition of macropinocytosis perturb spheroid regeneration or polarization/sorting? Authors have discussed several potential biological functions of macropinocytosis in Hydra, including tension homeostasis and surface remodeling that are important during spheroid regeneration. It will be worthy to examine if mild membrane tension increase or SAC activation would delay the sorting process of regenerating Hydra tissues.

      Minor points:

      1. Fig 2C is the quantification results of 2B but include three sets of data (labeled as 1, 2, and 3) without explanation.
      2. Would amputation of one tentacle lead to local or global Ca2+ reduction and macropinocytosis in a Hydra?

      Significance

      Macropinocytosis is an evolutionary conserved, from amoeba to human, and versatile endocytic route critical for mammalian immune and cancer cells for antigen surveillance and nutrient uptake. Despite ample understanding of macropinocytosis in cultured cells has been made, the function and mechanism of macropinocytosis at the organ or organismal level remains poorly studied. Therefore, this work is intriguing and timely to support the physiological occurence of macropinocytosis from the tissue and evolutionary aspects.

      Macropinocytosis is critical process for membrane trafficking, cell signaling, immune surveliance and cancer cell growth, and Hydra vulgaris is a powerful model organism for regeneration biology, neuro biology and marine biology. Therefore, audiences from these fields will be interested and influenced by this report studying developing a new method for visualizing macropinocytosis in living Hydra.

      I am a cell biologist studying the regulation of membrane remodeling and trafficking upon mechanical or biochemical stimuli. Due to my unfamiliar with Hydra as a model organism, the details of suggested experiments may need to be adjusted.

      Referees cross-commenting

      I agree with other reviewers and think their comments important and valid. This manuscript will be more clear and compelling after addressing these questions.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In the manuscript by Skokan et al, the authors demonstrate a constitutive and robust program of macropinocytosis in the outer epithelial layer of the cnidarian Hydra vulgaris. While the model system is less tractable than others including mammalian cell types from a genetic stand-point, the authors have devised a neat approach to visualizing the planar epithelium in live organisms and provide clear evidence for macropinocytosis by a tissue monolayer in vivo. This model also supports the ancient conservation of macropinocytosis, supporting the studies in Dictyostelium, and may represent early modes of nutrient acquisition in complex fluid environments. Using probes for the cytoskeleton, fluid phase indicators, and mechanical and pharmacological interventions, the authors describe how stretch-activated calcium channels inhibit micropinocytosis. In general, while the manuscript is concisely written, and the available data are compelling, much more rigorous experimentation is required to make such a conclusion. In addition, the physiological importance for mechanical stretch in orchestrating the arrest of macropinocytosis remains unclear. Conceivably, this may be involved in the regulation of membrane tension since macropinocytosis (high membrane turnover) would demand that cells have a high rate of membrane recycling to compensate. Below, I have outlined some approaches that the authors could take to improve the study without demanding them to utilize additional model systems, which I think would be outside the scope of the work.

      Major comments:

      Most importantly, the role of Ca2+ entry via stretch-activated channels and how this would inhibit macropinocytosis remains unclear. In fact, the findings are somewhat counterintuitive since stretch applied to the monolayer would increase membrane tension while Ca2+ influx would support membrane delivery and exocytosis, thereby restoring tensional homeostasis.

      In Fig 3, the authors demonstrate that applied stretch to the epithelium increases cytosolic Ca2+ and decreases membrane tension as expected. But whether the Ca2+ influx is required for the loss of macropinocytosis is not clear. This can be tested by either chelating Ca2+ transients in the cytosol or depleting the cells of Ca2+ by inhibiting ER-resident Ca2+ pumps and removing Ca2+ from the medium. In fact, if the authors think that extracellular Ca2+ is the only issue to arresting macropinocytosis, substituting Ca2+ for another divalent cation (or removing all divalent cations from the medium, should the epithelium be amenable to it for short periods of time) could be employed.

      The connection between [Ca2+]cyto and macropinocytosis is established by Jedi and ionomycin. In the case of ionomycin, the large and sustained increase [Ca2+]cyto, well beyond what could be expected in physiological conditions, leads to the loss of plasma membrane PIP2, PIP3, and membrane associated F-actin. Jedi1/2 are certainly more targeted, but it is difficult to attribute their effects to Piezo in this system. More worryingly, the Ca2+ influx in response to Jedi2 and especially Jedi1 occurs maximally after 10 min of exposure. Yet, the authors show the complete loss of macropinocytic cups after 10 min (Fig 2E). It's difficult to reconcile that the Ca2+ is the issue.

      The authors do not quantify macropinocytosis beyond Figure 1. Instead, they use "macropinocytic cups" as their surrogate for bona fide, sealed macropinosomes. Macropinocytosis can occur at different scales and different rates, so the authors should instead use the 70 kDa dextran as the gold standard in Figure 2. And as part of gold standard approaches, the authors would appease the macropinocytosis field if they tested the requirement for PI3K and Na H+ exchangers in Figure 1.

      The appearance of the GCAMP6s in Figure 2F before given Jedi2 is interesting. Aside from the Ca2+ signal that appears where the Hydra has been severed, the Ca2+ through the epithelium appears very heterogeneous. Does this Ca2+ signal oscillate in the cells and/or across the epithelium? Since the authors are able to image the cytoskeleton and Ca2+ in this system, it would be interesting to determine any correlations in their kinetics.

      Minor comments:

      At this point, minor comments may be less useful to the authors since some of the more major suggestions are likely to impact the overall breadth of the work.

      Significance

      The work represents a technical advance and new system to consider macropinocytosis, albeit with limited mechanistic insights owed to some intrinsic challenges and remaining experimentation.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We would like to thank all the reviewers for their time and for their positive and constructive review of our study. We are happy that they all regard this as a highly significant piece of work. We have addressed some of their suggestions in our updated preprint and indicate below where we are planning further revisions.

      Reply to Reviewer 1 Point 1

      The reviewer pointed out a possibility that the Golgi polarisation leads to local/centre-most regional E-cadherin junction “maturation”, then contribute to AMIS seeding. To address this suggestion, we did fluorescence recovery after photobleaching (FRAP) using a mESC line that expresses E-cadherin-GFP in the updated manuscript. We compared the recovery speed and rate in the centre-most region and side regions to discuss whether E-cadherin junctions have different stability at these regions. What we found is that though the E-cadherin and E-cadherin-GFP protein level is at the same level at the two regions in mESC doublets (Figure S3), the mobile fraction of E-cadherin-GFP is lower in the centre-most region than the side regions (Figure 3 I, J). This implies that E-cadherin junctions in the centre-most region are more stable. We have included corresponding description of this data in Results, Methods and Discussion. We will also include equivalent data from non-mitomycin c treated control cells in the final manuscript.

      Still, we do not know whether the more stable E-cadherin junctions were due to the Golgi polarization, but we have included the possibility of Golgi polarisation leads to local E-cadherin maturation in our Discussion in the transferred manuscript as follows:

      “In addition, a recent study of chick neural tube polarisation (where N-Cadherin is the dominant Cadherin) has demonstrated that the interaction of β-catenin with pro-N-cadherin in the Golgi apparatus is necessary for the maturation of N-Cadherin, which is in turn important for apicobasal polarity establishment (Herrera et al, 2021). This provides the possibility that the polarised Golgi apparatus that we observe in the mESC clusters might be directionally delivering mature E-cadherin to the central-most region of cell-cell contact.”

      Reply to Reviewer 1 Point 2

      The reviewer suggests it would be interesting to know whether there is a role for the proteins JAM-1 or Nectin in AMIS formation and in polarising the Golgi and centrosomes towards the cell-cell contact. Like E-Cadherin, these are transmembrane junctional proteins that are present at the initiation of spot adhesions in epithelial 2D monolayers and are known to be part of a complex network of interactions between PAR-complex, junctional molecules, MAGUK scaffolding proteins and the actin cytoskeleton. Whilst we don’t propose to untangle this network here, we agree that it would be interesting to know more about the potential role of JAM-1 and Nectin in initiating polarity in mESC 3D cultures. However, it is important to note that, regardless of whether JAM-1 and Nectin also play a role in polarisation and AMIS formation, our results already demonstrate that E-cadherin-based adhesions are sufficient to initiate AMIS localisation. For example, our results from figure 4C-E demonstrate that, in a reductionist system of a single cell plated on E-Cadherin covered glass, a centrally located AMIS still forms. Precisely unravelling the mechanisms by which this happens would be better for a future study (which we have now stated in the Discussion).

      Nevertheless, we now have new FRAP data (Figure 3I and J), which demonstrates that E-Cadherin is relatively more stable at the central-most point of contact between two adhering cells. This suggests that E-Cadherin is more stably bound via its downstream partners to the internal actin cytoskeleton at this point and may provide at least a partial explanation for why AMIS localisation occurs precisely at this region. We therefore suggest that the most relevant information to our study would be to determine whether either JAM or Nectin proteins are specifically localised at the AMIS, alongside PAR-3 and ZO-1, and might therefore be somehow enabling this stabilisation of E-Cadherin. We therefore plan to carry out IHC stains for JAM-A (new name for JAM-1), which has been found to be present in the mouse inner cell mass, to determine where it is localised within the mESC cell clusters with/without cell division and in WT/Cdh1 KO cells. We will update the supplementary results and discussion accordingly in the final manuscript.

      Depending on these results, we might also try to knock down the function of JAM-A, using siRNA. If successful knock down were achieved, we would carry out FRAP to determine whether E-cadherin junctional stability had been altered and would also stain for AMIS markers such as PAR-3 and determine whether Golgi and centrosomes were polarised. However, it is important to note that, although we were able to achieve E-cadherin RNAi to a certain degree, it is not always possible to achieve sufficient knock down of protein by the 24-hour AMIS timepoint. Since the results of these experiments would not alter the impact of our pre-existing data, we do not propose to create new knock out cell lines in the current study. Also, possible redundancy between different paralogs may affect the interpretation of this experiment so we would only include these results if they allowed for clear interpretation.

      A previous study (Gao L ,et al. Development. 2017) has already shown that knocking out Afadin (which would therefore disable Nectin junctions) in MDCK cell 3D cultures did not affect initial AMIS formation or localisation, although later cell division orientation and therefore lumen positioning was affected. Afadin was also not localised to the AMIS. Therefore, it is less likely that Nectin is involved in AMIS localisation and while we will stain for its localisation by IHC, we don’t propose to try to knock down its function.

      Reply to Reviewer 2 Point 1

      The reviewer pointed out using a different mitosis blocker beside Mitomycin C. a) In the updated manuscript, we included one additional drug treatment: Aphidicolin. The results showed the AMIS could form in the centre of cell-cell contacts in Aphidicolin treated, division-blocked cells. AMIS (PAR3, ZO1) and the Golgi network was also polarised towards this point (Figure S1 G-I). In the final manuscript, we will include a full data set with N=3 independent experiments. Though­ the same as Mitomycin C, Aphidicolin is a DNA replication blocker, it confirmed that the AMIS formation upon treatments is not a Mitomycin-only artefact. b) As the reviewer suggested to block mitosis at the M phase, we are testing using microtubule polymerization inhibitors, Nocodazole and Taxol and will include these results if appropriate. However, these treatments will also affect the cytoskeleton, significantly affecting the cell shape and potentially interrupting the cell-cell contact interface. Therefore, it may not be possible to include these experiments.

      Reply to Reviewer 2 Point 2

      The reviewer suggested to include more examples of movies showing 2 and 4 cell cluster formation in division blocked conditions. We will be happy to provide more examples of the movies included in Figure 2 and Movie 2 in the final submission. The puncta in submitted Movie 2 was not as clear as the in Figure 2D as the reviewer pointed out. This was largely due to the reduce-sized movie in the original submission. We will provide full-resolution movies in the final submission. We do often see the ‘perfect’ 4-cell shape in division-blocked cells (e.g. the last frame of movie 2, shown at timepoint 19:00 in figure 2D). The shape of the clusters appears largely dependent on how many cells fuse together.

      Reply to Reviewer 2 Points 3 & 5 and Reviewer 3 Point 2

      We appreciate the comments from the reviewers regarding qualifying some of the discussion of our results.

      Reviewer 2 points out that E-cadherin is not providing a ‘Symmetry breaking’ step, since cells are eventually able to polarise in the absence of E-cadherin (even though they can’t make an AMIS). We have therefore modified our discussion of this point to read: “Our results therefore suggest that Cadherin-mediated cell-cell adhesion may provide the spatial cue required for AMIS localisation during de novo polarisation.”. The last paragraph of the manuscript now reads: “In summary, our work suggests that Cadherin-mediated cell-cell adhesion is necessary for localising the AMIS during de novo polarisation of epithelial tubes and cavities.”

      Reviewer 3 points out that the E-Cadherin molecule by itself is not sufficient to recruit the AMIS proteins to the centre-most region of the cell-cell contacts since E-cadherin is localised all along the cell-cell contact. We have now included a FRAP analysis demonstrating that E-cadherin is more stable in the centre-most region of cell-cell contacts (Figure 3I,J), which supports the role of E-Cadherin in directing AMIS localisation to this centre-most region. Nevertheless, we accept the reviewer’s point that we still do not know the downstream mechanism by which the AMIS is precisely localised to the central region of cell-cell contacts, and we have extended our discussion of this point in the updated manuscript. To clarity the language, we have also altered our results heading and other references to this point to read: “E-Cadherin adhesions are sufficient to initiate AMIS localisation, independent of ECM signalling and cell division”. We believe our experiments with two methods support this claim that the formation of E-cadherin-based adhesions without cell divisions and ECM signals are sufficient to initiate AMIS localisation; in particular Figure 4C-E, in which a centrally located AMIS formed even in a reductionist system of only 1 cell plated on E-cadherin covered glass.

      Reply to Reviewer 2 Point 4

      The reviewer reasoned that the WT and Cdh1 KO mESC were from different genetic backgrounds. The WT (ES-E14) mESCs were generated from 129P2/Ola mice and the Cdh1 KO mESCs were generated from 129S6/SvEvTacArc mice. To confirm the results acquired based on the two cells lines, we are doing two approaches: 1) As the reviewer suggested, we are using siRNA knock-down of E-cadherin in the Wild-type mESCs (ES-E14) to confirm the results we had of the AMIS absence in the E-cadherin knock-out mESC cultures. As Figure S2C,D now shows, the concentrated PAR3 between two mESCs was largely reduced after E-cadherin knock-down. We will also include Mitomycin-treated conditions in this experiment for the final publication. 2) As an alternative approach, not dependent on RNAi functionality, we have acquired a 129S6/SvEvTacArc background mESC (the W4 line) as the wild-type mESC line that has the same background as the Cdh1 KO mESC line. We are using this line to perform the control experiments of Figure 3A-C to confirm the previous results, which so far are comparable in both the ES-14 and W4 mESC cell lines. Our preliminary data below show the same results as we had with the ES-E14 cells in the current Figure 3A. We will finish the full data set of N = 3 experiments and replace the current Figure 3A-C, S2A data with that from the W4 mESC cell line. In the meanwhile, we have labelled the type of wide type mESC used for each experiment in the manuscript.

      Reply to Reviewer 3 Point 1

      The reviewer pointed out we should include three independent experiments for our data in Figure 4E. We agree with the reviewer. We are very happy to do the suggested experiments and data analysis and will be able to provide the data of N=3 independent experiments in the final manuscript.

      Reply to Review 3 Point 3

      We agree with the reviewer. Our current data set of live imaging at day 3 are used to confirm the idea from the fixed images that a wrapping process does happen for lumenogenesis during the Cdh1 KO cyst formation. The current dataset could not exclude the possibility that the hollowing might co-exist. The reviewer therefore suggests including a live movie depicting early stages (before 78:00) of E-Cadherin knock-out cluster development. We did try to collect this data before we first submitted the manuscript but encountered significant technical problems due to the high sensitivity of early stage Cdh1 KO cells to phototoxicity. This meant that we could not image with less than one hour interval nor over longer than 24 hour and were therefore unable to analyse how the cell clusters behave before forming the cup-shaped cavity. We will attempt these experiments again (e.g. imaging from 12-24 hours and 24-36 hours). However, there is a high likelihood that the experiments will not be technically possible, which is why we list them in section 4 of our review plan. Instead, we include the following sentence in our discussion: “We were unable to live-image earlier stages of Cdh1 KO cluster development due to the sensitivity of these cells to phototoxicity so we can’t exclude the possibility that hollowing lumenogenesis occurs in parallel, although our IHC analysis does not indicate that this is the case.”

      Reply to reviewers’ minor points

      We have revised our texts, made the nomenclature of protein PAR3 consistent, and included the information of antibody suppliers, as the reviewers pointed out. Specific response to Reviewer 2; in p2 and p7, the texts were referring to zebrafish studies, where PAR3 is referred to as Pard3. We have marked it with “Pard3 (PAR-3)” now. We have increased the size of images in figure 5B and inverted the colour to make it more visible. Since this made the figure too big, we moved the ZO1 images to Figure S5A. We will provide a co-staining of mCherry (to label mCherry-PAR6B), Phalloidin and PAR-3 in a more updated manuscript to replace Figure 2A.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this manuscript, Xuan Liang and collaborators shed light on how the precise localisation of the apical membrane initiation site (AMIS), necessary for organised lumen formation, is directed at the single-cell level. By characterising de novo polarising mouse embryonic stem cells (mESCs) cultured in 3D, the authors have uncovered a division-independent mechanism of de novo polarisation and AMIS localisation based on adhesion molecules. More precisely, they suggest that E-CADHERIN-mediated cell-cell adhesion may provide the symmetry-breaking step required for AMIS localisation during de novo polarisation since this molecule alone is sufficient and necessary to drive correct AMIS localisation. Interestingly, a high proportion of E-Cadherin knock-out (Cdh1 KO) mESC cell clusters do not hollow but instead generate lumen-like cavities via a closure mechanism. Despite not knowing the mechanism involved in the closure of these lumen-like cavities, the role of E-CADHERIN in de novo polarisation would be associated with initial steps in lumen formation (AMIS formation and localisation) but not in later steps where E-Cadherin knock-out mESC cell clusters can still make an apical membrane but do so more slowly than in WT cells and without going through a centralised AMIS stage.

      Altogether, this study supports their previously published zebrafish neuroepithelial cell in vivo analysis, which demonstrated the division-independent localisation of Pard3 and ZO-1 at the neural rod primordial midline (Buckley et al., 2013). The authors have provided a novel mechanism of de novo polarisation and AMIS formation that occurs in vivo and in vitro. For this reason, this is a work with great significance that will undoubtedly be of general interest to the readers of Review commons. Nonetheless, several issues should be addressed before the publication of this manuscript.

      1. In figure 4, the authors tried to demonstrate that E-CADHERIN is sufficient for AMIS localisation, independent of ECM signalling and cell division. To this end, they cultured individual division-blocked mESCs onto either E-CADHERIN-FC recombinant protein or FIBRONECTIN pre-coated glass and carried out IHC for PAR-3 after 24 hours in culture. They then performed heatmaps and analysed PAR3 intensity (Fig. 4 D, E). Although the data presented are fascinating and show the effects the authors describe, the authors should improve their sample number and repeat this experimental procedure and analysis at least two more times for their results to be consistent (only 15 cells in one experiment have been used to carry out the statistical analysis described previously).
      2. The expression of E-cad is necessary for the proteins that define the apical membrane (Par3, Par6, aPKC) to be located in the AMIS. The results are clear and robust. Even so, I do not think this is sufficient, as the authors claim (headline of page 5, Figure 4). It seems clear that something more than Ecad is needed for the localisation of Par3 in the AMIS because, as the authors indicate in the discussion, Ecad is located along the entire cell-cell junction, while par3 is focused on AMIS. There must be something else that is necessary for the location of Par3. Therefore, the experiment in figure 4 does not prove that E-cad is sufficient but confirms that it is necessary for that location. Another series of experiments would have to be carried out to prove that it is sufficient. This must be clearly stated in the final version of the manuscript.
      3. It is clear that E-Cadherin knock-out mESC cell clusters open cup-shaped cavities before generating a lumen-like structure. Fig. 5 presents compelling data about this in vivo lumen formation mechanism without hollowing, though they briefly describe this process. Whilst I am conscious that they do not know the mechanism by which such 'closure' occurs and that this would be suitable for another manuscript, I would strongly suggest including a live movie depicting early stages (before 78:00) of E-Cadherin knock-out cluster development. Many queries arise with this piece of data, as it seems that a small lumen could be forming prior to the cup-shaped cavity.

      Minor points

      1. Fig. 2A: Actin staining could be included to better visualise the spheroids.
      2. Fig. 5B is very small, I would recommend them to present it bigger.
      3. I would encourage the authors to revise the figures as some have displaced text. (see Fig. 5, 72 hours, Cdh1 KO).

      Significance

      Altogether, this study supports their previously published zebrafish neuroepithelial cell in vivo analysis, which demonstrated the division-independent localisation of Pard3 and ZO-1 at the neural rod primordial midline (Buckley et al., 2013). The authors have provided a novel mechanism of de novo polarisation and AMIS formation that occurs in vivo and in vitro. For this reason, this is a work with great significance that will undoubtedly be of general interest to the readers of Review commons. Nonetheless, several issues should be addressed before the publication of this manuscript. My lab work in lumen formation in 3D organotypic cultures and organoids

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Summary:

      The importance of cell division and the post-mitotic midbody in the establishment of the apical membrane initiation site (AMIS) is quite well established. However, there are observations hinting to a cell division-independent mechanism of the AMIS formation. The authors hypothesized that cell adhesion involving E-cadherin could direct the site for AMIS localisation during de novo polarisation. As model system the authors used mouse embryo stem cell (mESC) culture in Matrigel, which has been used as an in vitro model for the de novo polarisation of the mouse epiblast. The slow lumen formation in culture allows for a relatively clear separation of the stages of de novo polarisation. This enables to study the initiation of apico-basal polarity of embryonic cells alongside the first cell-cell contacts between isolated cells and small cell clusters. Here, the goal was to determine the role of cell adhesion, and in particular E-cadherin, in mESC AMIS localisation.

      Major comments:

      1. Mitomycin C is commonly used to block cell division, however, what it does is it blocks DNA replication, and the blockage of cell division is a consequence. It could have other effects than only blocking cell division. What about using a mitosis blocker? It would be good to have a second way in addition to mitomycin C treatment of confirming that the results support the conclusion that cell division is dispensable for AMIS localization, as the full work builds up on that first observation and this experimental setup is carried on through the manuscript.
      2. Many clusters at higher cell stage that are shown (e.g. Fig 2C, 3A), look like they are in the perfect 4-cell stage after cell division. Movie 1 does not show that, only 2 cells are clustering. Movie 2 shows how two 2-cell clusters form a 4-cell cluster, however, that does not look as "perfect" as the 4-cell stages shown in the figures, which look as said more like 4-cell stages resulting from cell division. Maybe the authors could provide more movies that show the 2-cell cluster doublets (4-cell clusters)? Also, the puncta relocalization to the cell-cell contacts in the 2-cell cluster doublet is not so very clear in the movie 2. Maybe the authors have more movies for 2-cell cluster doublets?
      3. On page 4 the authors observe: "Whilst homogenous control doublets localised PAR-3 to the central region of the cell-cell interface, heterogeneous chimeric doublets did not localise PAR-3 centrally (Figure 3D,E). Golgi and centrosome localisation towards the cell-cell interface suggested that the overall axis of polarity was maintained, even in the absence of both cell division and E-CADHERIN (Figure 3F-H & S2C)." This means that the axis of polarity is established without E-cadherin and without the AMIS. So, the cell-cell contact site itself is able to establish polarity, but not localize the AMIS. In line with this, the authors state: "The results also demonstrate that ECM in the absence of E-CADHERIN is insufficient for AMIS localisation." So, without E-cadherin, there is no AMIS, the ECM cannot establish it, but polarity is still established. On p. 6, it is then concluded that E-cadherin and centralised AMIS localisation are not required for apical membrane formation, but that they promote its formation earlier and more efficiently in development. In the discussion, however, the authors then state in a rather contrary manner "Our results therefore suggest that CADHERIN-mediated cell-cell adhesion may provide the symmetry-breaking step required for AMIS localisation during de novo polarisation." However, cells are polarized and have an apical membrane (Figure 3F-H & S2C) without cadherin and the AMIS, so how can this be the symmetry-breaking step for de novo polarization? Later on, in line with the earlier statement that E-cadherin and the AMIS location are not required for apical membrane formation, the authors then refine the previous statement: "the role of E-CADHERIN in de novo polarisation is specifically to localise the AMIS, which enables the integration of individual cell apical domains to a centralised region preceding lumen hollowing." This seems to be more likely to me than the CADHERIN-mediated cell-cell adhesion as the symmetry-breaking step. I therefore disagree in this point with their overall summary on p. 8, and find this a bit confusing.
      4. As Wild-type mESCs (ES-E14) were purchased from Cambridge Stem Cell Institute and Cdh1 KO mESCs were gifted from a lab, it would be good to (genetically) characterize the cell lines because, apparently, they do not have the same origin and the KO cells were not derived from the parental mESCs. Alternatively, a control experiment with knockdown of Cdh1 in the purchased mESCs could be done, even if that would not lead to a complete knockdown, to make sure that the observed effects are the same as with the Cdh1 KO cell line.
      5. The existing data is carefully analyzed with appropriate statistics. Replicates are sufficient. The conclusions are not yet fully justified, as discussed.

      Minor comments:

      • Fig S1D: It is labeled "Pard3". While also correct, it should be consistent, i.e. PAR3.
      • P. 5 remove (slightly)
      • P. 2, p. 7 "Pard3", replace with PAR3
      • The antibody lists already contain catalog numbers in case of the primary antibodies, but no suppliers. They should be added, and also specified for the secondary antibodies for better reproducibility. In general, the text and figures are well prepared and of high quality. The citations appear appropriate.

      Significance

      The work describes the conceptual novelty of cell adhesion as alternative mechanism to cell division for AMIS localization, and in particular E-Cadherin as being required for AMIS positioning. It is still unclear why the AMIS is centered and the localization of cadherin is equal along cell-cell contacts (Fig 2C, S1E). How do Cadherin localization dynamics look like during the clustering of two cells? During cell division in a MDCK cyst (which is where my expertise lies), cell adhesion has to be partially removed during cytokinesis and abscission, and then be installed again, basically like a new cell-cell contact. Thus, could E-cadherin focus ("trap") the "AMIS initiation seed", rather than direct binding of PAR3 /PAR6 to cadherin as discussed by the authors, since E-cadherin is localized along the whole contact site and not centred? Could the unknown "apical seed" (which in cell division is the midbody) be trapped by cell adhesion? Could this be a common mechanism between cell division- and cell adhesion-driven AMIS localization? This finding could therefore have an even broader impact. What are the author's thoughts? While my speculation might be wrong, it might be worth hypothesizing on the connection between the role of E-cadherin in the two ways of AMIS localization.

      Another novelty is the observation that polarity and cavities form later on in development independently of E-cadherin and an AMIS. This type of mechanism should be discussed further and put more into perspective with the literature.

      The work describes a new mechanism which could be of broad importance in developmental biology. I therefore think that this work is highly significant.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Formation of tubes in a developing organism may arise from the closure of a pre-existing polarized epithelium or from de novo polarization and cavity formation in group of dividing cells. The concept of apical membrane initiation site (AMIS) refers to the fact that polarity proteins as PAR3 accumulate at a point where the apical membrane will be created. This accumulation occurs as early as the two cell stage. Previous reports have demonstrated the importance of the division process in defining this AMIS, however, in the present work the authors in vitro 3D cultures of mESC to report a mitosis independent mechanism that creates an AMIS, induces the polarization of groups of two or more cells, and permits the formation of a central cavity. The report shows that the mechanism is fully dependent on the polarized accumulation of E-cadherin at the cell membrane in contact with the other cells. Moreover, the mechanism does not require mitosis or interaction with the extracellular matrix.

      Major comments:

      The main objective of the work is to demonstrate that AMIS creation and cavity formation can be mitosis independent and that it is dependent on the accumulation of E-cadherin at the midline between two cells in contact. To demonstrate these objectives, the authors perform 3D cultures of mESC. To rule out the requirement of mitosis the authors perform cultures that are treated with mitomycin C and the purify single cells that are cultured again. The authors show time-laps experiments demonstrating that individual cells that do not dived create an AMIS when they contact one to each other. With this cultures they demonstrate that the process does not require an interaction with ECM (provided by the matrigel) but requires E-cadherin, to demonstrate, that they use E-cadherin KO cells (the same line where E-cadherin has been deleted). The work is well written and the objectives very clear. The technology used and the experiments done are adequate and sufficient to accomplish the proposed objectives and the results obtained clearly support the conclusions reached. The methods are well explained and transparent to be reproduced elsewhere and the number of replicas and the statistical methods applied seem corrects to me, although I am just a biologist, not a mathematician. Although the objectives of the work, that are: to demonstrate that AMIS formation can be independent of mitosis and that AMIS requires E-cadherin, there are parts of the results that could be farther studied or at least discussed more thoroughly. Firstly, the authors show that in non-dividing cells an AMIS is formed at the first contact site between the two cells, they also show that in the absence of E-cadherin the cell maintains the polarization of centrioles and Golgi apparatus, in spite that no AMIS is formed, this indicates that the deposition of E-cadherin at the midline membrane is part of a more global polarization event that most likely is initiated by the a directional activity of the Golgi apparatus that may direct the delivery of mature E-cadherin in that particular direction, initiating or maintaining the basis for an AMIS, since recent work (already cited in the manuscript) has demonstrated the importance of cadherin maturation for polarity establishment and maintenance (Herrera et al, 2021), the actual results should be farther discussed in this context. Secondly, it was previously shown that in different epithelia, upon cell-cell contact, the aPKC complex (that includes Par3 and Par6) is recruited early to the contact site where with the participation of Cdc42, aPKC is activated generating an initial spot-like adherent junction (AJs) (Suzuki et al., 2002). In that case it is thought to be mediated by a direct interaction between the first PDZ domain of PAR-3 and the C-terminal PDZdomain-binding sequences of immunoglobulin-like cell adhesion molecules: JAM-1 and nectin-1/3 (Fig. 3) (Ebnet et al., 2001; Itoh et al., 2001; Takekuni et al., 2003). Thus it wold be interesting to know if AMIS formation in absence of cell division depends on JAM-1 or nectin and whether JAM-/Nectin signalling is sufficient to initiate the Golgi and centriole polarization and which is the mechanism governing it.

      Minor comments:

      As I mentioned before, the paper is well presented and very clear, yes it is simple, but simple is always better, no complicated graphics or letterings, thank you. Although in my opinion the work is very well written, I have to admit that I am not qualified to evaluate the literary style of the work since English is not my mother tongue, also I have not reviewed typographical errors since I think that is the work of the editorial, not of scientific reviewers. Please include the full reference of all the antibodies used, including the company and not just the catalog number

      Quoted references:

      Ebnet, K., Suzuki, A., Horikoshi, Y., Hirose, T., Meyer Zu Brickwedde, M. K., Ohno, S. and Vestweber, D. (2001). The cell polarity protein ASIP/PAR-3 directly associates with junctional adhesion molecule (JAM). EMBO J. 20, 3738-3748.

      Itoh, M., Sasaki, H., Furuse, M., Ozaki, H., Kita, T. and Tsukita, S. (2001). Junctional adhesion molecule (JAM) binds to PAR-3: a possible mechanism for the recruitment of PAR-3 to tight junctions. J. Cell Biol. 154, 491-497.

      Takekuni, K., Ikeda, W., Fujito, T., Morimoto, K., Takeuchi, M., Monden, M. and Takai, Y. (2003). Direct binding of cell polarity protein PAR-3 to cell-cell adhesion molecule nectin at neuroepithelial cells of developing mouse. J. Biol. Chem. 278, 5497-5500

      Suzuki, A., Ishiyama, C., Hashiba, K., Shimizu, M., Ebnet, K. and Ohno, S. (2002). aPKC kinase activity is required for the asymmetric differentiation of the premature junctional complex during epithelial cell polarization. J. Cell Sci. 115, 3565-3573.

      Significance

      The paper describes for the first time that contrary to what was previously believed an AMIS can be generated without a cell division. This is very important because it opens the possibility that the mechanisms that originate the biologic cavities are in fact not really how we believed. The work is of interest of all cell biology scientists, specially working in developmental biology, cancer research.

      My particular field of expertise is cell biology and signaling, always applied to particular events as nervous system development or cancer, in particular I am interested in Wnt/b-catenin and Sonic Hedgehog pathways.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Here, authors confirm that glycolysis is important macrophage defense against mycobacterial infection and describe a central role of pyruvate in linking glycolysis and antimycobacterial mtROS production to control the intracellular burden. Alike previous authors who have demonstrated that the non-pathogenic Bacillus Calmette-Guerin and heat-killed M. tb increase glycolysis, they show that human primary macrophages infected with M. avium increase glycolysis to facilitate mycobacterial control. Rost and coll. show evidence that the killing mechanism act through the production of mtROS by the complex I of the electron transport chain via the engagement of RET. This mechanism acts in parallel to other immunometabolic defense pathways activated in M. avium infected macrophages, such as the production/induction of itaconate via the IRF-IRG1 pathways (Alexandre Gidon 2021). * They give evidence that IL-6 and TNFa are not involved in regulating the pyruvate-mtROS and show chemical evidence that mitochondrial import of pyruvate through MPC activity is necessary to generate a high membrane potential and the subsequent production mtROS. However, the data presented here don’t explain how pyruvate is driving RET and mtROS; if pyruvate targets the electron transport chain directly or is converted (via TCA) to another metabolite that initiates RET and mtROS. Above all, this work brings attention to the possibility of using compounds that specifically engage mtROS production for therapeutic perspectives*

      Reviewer #1 (Significance (Required)):

      While the data presented here don t explain how pyruvate is driving RET and mtROS; if pyruvate targets the electron transport chain directly or is converted (via TCA) to another metabolite that initiates RET and mtROS, this work merits to be deeply evaluated for potential publication in a RC journal. However, the language must be improved and polished before submission.

      We thank the reviewer for appreciating the importance of our findings. We are sorry for any inconveniences the language may have caused and have carefully revised the manuscript with the intention of improving it.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Overall evaluation

      This study addresses an interesting aspect of host-pathogen relationship, namely how the metabolism of the host impacts directly or indirectly on the metabolism and/or fitness of the pathogen. For example, the generation of ROS in a way independent of NADPH-oxidases has been suggested to play a role in a number of infections. In particular, whether and how such ROS might be part of the cell-autonomous defence against an intracellular bacterial pathogen, in the present case M. avium, is of relevance. Despite these positive points, the study and manuscript suffer from a high number of serious problems both in form and content. The authors are strongly advised to revise the experimental evidence presented, including by performing additional experiments and re-interpreting some of the ones documented, as well as extensively rewrite/reformat the manuscript.

      Major comments:

      1- Normally, I would list the following criticisms in minor comments, but their accumulation makes them a major point:

        • The reference style is cumbersome, because they are listed in alphabetical order of the FIRST NAMES of the first authors, which renders it difficult to identify what is cited and when *
        • Similarly, the citations in the text include the author first names, whoch is unusual and heavy. * We thank the reviewer for bringing this to our attention. It was a mistake and we have now changed the reference style accordingly by replacing first names with initials and listing citations alphabetically from the first author’s last name.

      In addition, the authors almost systematically introduce each of the articles cited with a sentence such as "Mills and colleagues did this and that ...". This is sometimes used in articles, but should not be the norm. Usually, this is used to emphasise that a given group has contributed not only substantially, but also on a regular basis to a field for years. One would write Palade and colleagues ..., or Rothman and colleagues ... . But in the present manuscript, this is mentioning first authors and their colleagues. Mills et al is a contribution from the O'Neils laboratory, which speaks to me.

      We see the point and have changed some of these sentences accordingly to either write “O’Neill and colleagues …(ref)”, “First-author et al. showed that…. (ref)”, or “Others have shown …(ref)”. Editing was made page 3, lines 49, 51 and 54-56; page 4, lines 74; page 6, line 111; page 9, lines 178, 182, 193 and 197.

      2- Page 4, the authors report that the positive control used to shift macrophage metabolism towards glycolysis did not work. This places doubt on the other experiments and conditions.

      We thank the reviewer for bringing this point of confusion to our attention. In an Agilent Seahorse assay, which is commonly used to report glycolytic flux in scientific publications, the extracellular acidification rate (ECAR) is used as an indicator of glycolysis. Extracellular acidification occurs as protons are exported from the cell alongside lactate. In figure 1B we show quantitatively (ng/cell/24h) that lactate export is significantly increased in MDMs after LPS challenge, which translates to an increased ECAR on the traditional Seahorse assay. We also show further evidence that LPS treatment does switch the metabolism to glycolysis: the two first intermediates of glycolysis, G6P and F6P, are consumed (Fig. 1D), though between-donor variation leaves the LPS-induced increase in glucose consumption not significant. Overall, we are confident that our NMR and mass spectrometric metabolic profilings, which have been tested in several publications listed in the methods section are reliable and recapitulate previous knowledge. We have rephrased the paragraph on page 4 line 76 to clarify this point.

      *3- Page 5. I am not sure to really understand the reasoning behind : "Quantification by targeted mass spectrometry did not reveal a significant accumulation in the intracellular level of pyruvate in macrophages infected with M. avium or treated with LPS when compared to untreated controls (Fig. 2A), suggesting that pyruvate is rapidly metabolized." *

      The rationale for performing mass spectrometric quantification of pyruvate was to confirm experimentally that pyruvate is consumed - which we already know indirectly as its reduced product, lactate, is produced and excreted by the infected cells (Figure 1B). The hypothesis is that a proportion of the pyruvate could also enter the mitochondria and TCA cycle as shown by Mills et al.

      We have tried clarifying this in the revised manuscript by replacing the original text by “Quantification by targeted mass spectrometry did not reveal a significant accumulation in the intracellular level of pyruvate in macrophages infected with M. avium or treated with LPS when compared to untreated controls (Fig. 2A), confirming that pyruvate is metabolized. Mills et al have demonstrated that during LPS activation, mouse macrophages switch to aerobic glycolysis while repurposing the TCA cycle activity to generate specific immunomodulatory metabolites (Mills EL 2016), which implies that a fraction of the pyruvate formed by glycolysis enters mitochondria.”(page 5 line 90).

      4- The metabolomics experiments seem to be performed on a global population of infected and uninfected cells, without any clear mention of the fraction of infected cells, which is potentially low (Fig 1 appears to indicate less than 50%), and very likely variable between experiments. This is a serious confounding factor and likely precludes interpretation of the results?! The percentage of infected cells, at time "zero" and at each time point post-infection has to be quantified in each experiment.

      The reviewer is right that the analysis was carried out on a mixed population. However, even with an infection level of 50% this should be sufficient to pick up significant changes in metabolite levels resulting from infection, which is also not seen with LPS treatment that you would assume activates all cells. We have tested another protocol of infection (MOI 10 for 120 min) that yields almost 100% infection with similar results. These data are included as supplementary figure 1 in the revised manuscript (page 6, line 124).

      It would also be useful to analyse and graph the total fluorescence (coming from M. avium) per cell and the average fluorescence per cell.

      Intracellular growth was quantified by measuring M. avium fluorescence intensity per cell (n>500 cells per donor and per condition) as mentioned in the figure legends. The bar charts represent the average intensity obtained from at least 500 cells per donor and conditions, each point representing an individual donor. We have successfully used this method to analyze and quantify M. avium growth in human primary macrophages (Gidon et al, PLoS Pathogens, 2017; Gidon et al, mBio 2021).

      5- Page 6. How can the authors conclude "Overall, this set of data reveals that no major perturbations of the TCA cycle are induced by the infection, excluding a potential antimicrobial property of these TCA intermediates" from their data? Their experiment do not test the potential antimicrobial activity of the metabolites!

      We agree with the reviewer that our data cannot preclude any anti-microbial effects of TCA intermediates. We agree that the phrasing is confusing and not as intended and have replaced it with the following sentence “Since we and others have previously found that altered intracellular levels of the TCA cycle-derived metabolite itaconate following an infection was indicative of an anti-microbial function (Gidon et al, mBio 2021; Chen et al, Science, 2020), we conclude that none of the TCA cycle intermediates warranted further investigation to explain the anti-microbial effect of glycolysis.”. (page 6, line 115).

      *6- The effect of the chemical inhibitors used has to be evaluated on the growth of bacteria in broth to exclude the possibility that they directly impact them. *

      We agree with the reviewer that this is important to control for. We have performed the suggested experiments and the results, showing that none of the different drugs influence M. avium in vitro growth, are included in a supplementary figure 2 in the revised manuscript (page 8, line 158).

      *7- Figures. None of the graphs present error bars. In addition, for example for Fig 1A, the number of points correspond each to one donor. But there is mention neither of the number of biological replicates nor of technical replicates. This is absolutely required. *

      The number of donors used for each experiment are included in the figure legend. All the experiments were done independently and are therefore biological replicates. Each point represents the value obtained for one independent donor with no technical replicates. Since we show all individual measurements (donors) an error-bar, to our opinion, is not needed. We have now changed the text in the legend to better reflect this information (page 17, lines 353, 357; page 18, line 361; page 20, lines 371, 375, 378, 383, 383; page 23, lines 410, 413, 417, 422).

      8- It is unclear whether the effects documented have been measured in the whole population or only in the infected cells. And when they are measured in infected cells and uninfected cells, are these cells from a population in the same well, or from a well containing only uninfected cells?

      By nature, antimicrobial effects can only be detected in infected cells therefore all the experiments measuring the effect on intracellular growth, the mitochondrial potential and the production of mitochondrial ROS were measured on infected cells. Control refers to a well containing only uninfected cells.

      *9- In Figure 3A, the localisation of M. avium has to be shown. *

      We have edited the Figure 3 that now includes images from the M. avium-CFP channel to help identify the infected cells.

      *10- The mechanism proposed at the end of the abstract "...this work stresses out that compounds specifically inducing mitochondrial reactive oxygen species could present themself as valuable adjunct treatments." should be tested to close the loop and validate the data and hypothesis. *

      We agree with the reviewer, and we are currently finalizing another manuscript on metformin, which is known to induce mitoROS, as a possible Host Directed Therapeutic agent in a mouse model of M. avium infection.

      Minor comments:

      1- The manuscript does not show any numbering, neither of pages nor of lines, which renders the writing of the review difficult.

      We are sorry for the inconvenience. This is now included in the revised manuscript.

      *2- The authors write "undirect" instead of indirect. *

      We have corrected the mistake in the revised manuscript.

      *3- They also use "if" instead of whether quite frequently. *

      We thank the reviewer for bringing that detail to our attention. We have changed the manuscript according to the comment.

      *4- Page 5, second line "... a 40% increase in cells treated ..." An increase of what? *

      Treatment of infected cells with 2-DG increases the fluorescence coming from M. avium, reflecting the increase of the intracellular burden. We have changed the manuscript to make this point clearer (page 4, line 82).

      *5- Page 5. The second paragraph belongs to the introduction or the discussion. *

      We don’t agree on this point. We feel that it is important to inform the reader on how pyruvate can be used within the cell before showing the results, but we feel it does not fit with the broader introduction on glycolysis. However, if the editor/reviewers disagree with us, we will move this paragraph in introduction.

      *6- Page 6. The authors mention that AMP, ADP etc... are nucleosides. But they are nucleotides. *

      We changed the manuscript according to the suggestion (page 6, line 120; page 13, line 279; page 20 line 381).

      Reviewer #2 (Significance (Required)):

      The study explores an interesting question, but in its present state, the conclusions are not sustained by the evidence.

      We thank the reviewer for acknowledging the importance of our work. We believe that we have addressed the concerns expressed in the comments.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      *This manuscript reported that macrophages rely on glycolysis and RET to control M. avium infection and provide molecular evidence linking pyruvate, the end-product of glycolysis, to anti-mycobacterial mtROS production. The advantages of this paper are the clear thinking, from phenomenon to molecular mechanism, strong logic. However, there are also many shortcomings: *

      Major comments:

      *1.The main shortcoming of this paper is that authors only found macrophages control M. avium infection through glycolysis and RET in vitro. Although they use primary macrophages from healthy donners, not the cells lines, is it consistent in vivo? Authors should use mouse model that challenged with M. avium. Moreover, authors can isolate primary macrophages from patients that infected with M. avium, and compared it with primary macrophages from healthy donners. *

      We agree with the reviewer opinion, and we are finalizing another manuscript using Metformin, a drug known to induce mitochondrial ROS, as a Host Directed Therapy in a mouse model. However, dissection of mechanisms involved such as pyruvate import to mitochondria and RET is not possible in vivo. We are not sure of the meaning of the suggested experiment: comparing primary macrophages from mav-infected patients vs healthy donors.

      *2.This paper found that macrophages control M. avium infection by producing mitochondrial reactive oxygen species. This is a very interesting observation. How does mitochondrial reactive oxygen species resist mycobacterial infection? *

      We thank the reviewer for appreciating our work. Yet, we are not sure what does the reviewer mean by “How does mitochondrial reactive oxygen species resist mycobacterial infection?”. It has been shown in many studies that cellular ROS causes oxidative damage and can be toxic to pathogens (and cells), including mycobacteria (Fang FC, Nature Reviews in Microbiology, 2004; Dryden M, Int. J. Antimicrob. Agents, 2018; Kim et al, J. Microbiology, 2019; Herb and Schramm, Antioxidants, 2021). However, the role of RET-induced mitochondrial ROS is a relatively new concept, that, to the best of our knowledge, has never been demonstrated to be involved in the control of mycobacterial infection nor in human primary macrophages. Conversely, bacteria have evolved defense mechanisms to protect and counteract the production of antimicrobial ROS (Kim et al, J. Microbiology, 2019).

      *3.To make this data solid, whether giving pyruvate supplements to patients with mycobacterium infection can alleviate their disease? or it can be tested in mouse model. *

      Initiating a clinical study is beyond the scope of this study. Furthermore, even if we could supplement infected mice with pyruvate, there is no guarantee it will get into the cells and further imported into mitochondria to induce the anti-mycobacterial effects shown in the present study. We rather believe that the key for future treatment would be to induce mitochondrial ROS through the use of other, known agonists to strengthen this cell-intrinsic defense mechanism. As stated above, we are finalizing another manuscript using a compound known to induce mitochondrial ROS as Host Directed Therapy in a mouse model.

      *4.This work demonstrated that IL-6 and TNF-α could control the intracellular burden of M. avium. Many cytokines are produced by macrophage during infection. Are there other pro-inflammatory cytokines that play a role? *

      We agree with the reviewer view that many cytokines influence host defenses to mycobacterial infections in addition to TNF-a and IL-6, e.g., IL-1, IL-10 and interferons. However, some of these are not induced in Mav infected macrophages (IL-1, interferons), and our previous works have shown that TNF-a and IL-6 are consistently induced by the infection (Gidon et al, PLoS Pathogens, 2017) and that they are involved in the control of the intracellular burden (Gidon et al, mBio, 2021). We therefore chose to focus on these.

      *5.In Figure 1C, authors did not observe an increase of glutamine consumption in LPS-activated human macrophage which is in contrary to previous published study. How author explain this contrary result? *

      We thank the reviewer for bringing this point to our attention. We have previously published the glutamine consumption of multiple myeloma cell lines quantified by the NMR based method described herein, proving it is sensitive enough to detect differences between cell lines at cell densities comparable to those of the seeded MDMs (Abdollahi et al, The FASEB journal, 2021). Hence, we are confident that the applied methodology would detect significant differences in glutamine consumption, given that the cells in question rely on glutamine. Previous observations of glutamine uptake were made using mouse macrophages and it is referenced that human and mouse macrophages do not share the exact same metabolism (Thomas et al, Frontiers in Immunology, 2014; Vijayan et al, Redox Biology, 2019). It’s worth noting that the species-specificity also extend to how macrophages respond to TLRs ligands (Sun et al, Science Signaling, 2016). As this result does not contribute significantly to the mechanism described in our paper, we do not feel the need to discuss it extensively.

      Minor comment:

      The authors do not provide sufficient information in the Materials and Methods, and figure legends, such as how many times the experiments were repeated? How to measure the concentration of citrate, isocitrate, succinate......

      The number of donors used for each experiment are included in the figure legends. All the experiments were done independently and are therefore biological replicates. Each point represents the value obtained for one independent donor with no technical replicates. The concentrations of citrate, isocitrate, succinate and the other TCA cycle intermediates were measured by capillary ion chromatography tandem mass spectrometry, as described in the legend of Figure 2 and in detail in the Materials and Methods section on page 13-14. All metabolite measurements by targeted mass spectrometry are based on validated and published methods from our laboratory (Kvitvang et al, 2014; Stafnes et al, 2018; Røst et al, 2020). We have included more details to the methods section describing mass spectrometric metabolic profiling (page 13-14).

      Reviewer #3 (Significance (Required)):

      Mycobacteria avium infection is a common and serious kind of inflammation, in which macrophages has been reported to play an important role. Recently metabolic reprogramming of macrophages is proved in many diseases. By using LPS stimulation, the metabolic reprogramming of macrophages has been reported and have been confirmed to play a role during infection. Therefore, it is not so exciting to see this role of metabolic reprogramming in controlling M. avium infection.

      We are sorry that our findings did not excite the reviewer, but we strongly disagree that our study does not report any novel findings. Both the significance of mitochondrial ROS in mycobacterial defense and the discovery that pyruvate can induce mitochondrial ROS via RET, are novel findings not shown before to our knowledge. And – as a note – a phenomenon described for LPS and/or in mouse macrophages does not necessarily reflect what happens during any bacterial or viral infections, nor in humans.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This manuscript reported that macrophages rely on glycolysis and RET to control M. avium infection and provide molecular evidence linking pyruvate, the end-product of glycolysis, to anti-mycobacterial mtROS production. The advantages of this paper are the clear thinking, from phenomenon to molecular mechanism, strong logic. However, there are also many shortcomings:

      Major comments:

      1. The main shortcoming of this paper is that authors only found macrophages control M. avium infection through glycolysis and RET in vitro. Although they use primary macrophages from healthy donners, not the cells lines, is it consistent in vivo? Authors should use mouse model that challenged with M. avium. Moreover, authors can isolate primary macrophages from patients that infected with M. avium, and compared it with primary macrophages from healthy donners.
      2. This paper found that macrophages control M. avium infection by producing mitochondrial reactive oxygen species. This is a very interesting observation. How does mitochondrial reactive oxygen species resist mycobacterial infection?
      3. To make this data solid, whether giving pyruvate supplements to patients with mycobacterium infection can alleviate their disease? or it can be tested in mouse model.
      4. This work demonstrated that IL-6 and TNF-α could control the intracellular burden of M. avium. Many cytokines are produced by macrophage during infection. Are there other pro-inflammatory cytokines that play a role?
      5. In Figure 1C, authors did not observe an increase of glutamine consumption in LPS-activated human macrophage which is in contrary to previous published study. How author explain this contrary result?

      Minor comment:

      The authors do not provide sufficient information in the Materials and Methods, and figure legends, such as how many times the experiments were repeated? How to measure the concentration of citrate, isocitrate, succinate......

      Significance

      Mycobacteria avium infection is a common and serious kind of inflammation, in which macrophages has been reported to play an important role. Recently metabolic reprogramming of macrophages is proved in many diseases. By using LPS stimulation, the metabolic reprogramming of macrophages has been reported and have been confirmed to play a role during infection. Therefore, it is not so exciting to see this role of metabolic reprogramming in controlling M. avium infection.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Overall evaluation

      This study addresses an interesting aspect of host-pathogen relationship, namely how the metabolism of the host impacts directly or indirectly on the metabolism and/or fitness of the pathogen. For example, the generation of ROS in a way independent of NADPH-oxidases has been suggested to play a role in a number of infections. In particular, whether and how such ROS might be part of the cell-autonomous defence against an intracellular bacterial pathogen, in the present case M. avium, is of relevance. Despite these positive points, the study and manuscript suffer from a high number of serious problems both in form and content. The authors are strongly advised to revise the experimental evidence presented, including by performing additional experiments and re-interpreting some of the ones documented, as well as extensively rewrite/reformat the manuscript.

      Major comments:

      1. Normally, I would list the following criticisms in minor comments, but their accumulation makes them a major point:
        • a. The reference style is cumbersome, because they are listed in alphabetical order of the FIRST NAMES of the first authors, which renders it difficult to identify what is cited and when
        • b. Similarly, the citations in the text include the author first names, whoch is unusual and heavy.
        • c. In addition, the authors almost systematically introduce each of the articles cited with a sentence such as "Mills and colleagues did this and that ...". This is sometimes used in articles, but should not be the norm. Usually, this is used to emphasise that a given group has contributed not only substantially, but also on a regular basis to a field for years. One would write Palade and colleagues ..., or Rothman and colleagues ... . But in the present manuscript, this is mentioning first authors and their colleagues. Mills et al is a contribution from the O'Neils laboratory, which speaks to me.
      2. Page 4, the authors report that the positive control used to shift macrophage metabolism towards glycolysis did not work. This places doubt on the other experiments and conditions.
      3. Page 5. I am not sure to really understand the reasoning behind : "Quantification by targeted mass spectrometry did not reveal a significant accumulation in the intracellular level of pyruvate in macrophages infected with M. avium or treated with LPS when compared to untreated controls (Fig. 2A), suggesting that pyruvate is rapidly metabolized."
      4. The metabolomics experiments seem to be performed on a global population of infected and uninfected cells, without any clear mention of the fraction of infected cells, which is potentially low (Fig 1 appears to indicate less than 50%), and very likely variable between experiments. This is a serious confounding factor and likely precludes interpretation of the results?! The percentage of infected cells, at time "zero" and at each time point post-infection has to be quantified in each experiment. It would also be useful to analyse and graph the total fluorescence (coming from M. avium) per cell and the average fluorescence per cell.
      5. Page 6. How can the authors conclude "Overall, this set of data reveals that no major perturbations of the TCA cycle are induced by the infection, excluding a potential antimicrobial property of these TCA intermediates" from their data? Their experiment do not test the potential antimicrobial activity of the metabolites!
      6. The effect of the chemical inhibitors used has to be evaluated on the growth of bacteria in broth to exclude the possibility that they directly impact them.
      7. Figures. None of the graphs present error bars. In addition, for example for Fig 1A, the number of points correspond each to one donor. But there is mention neither of the number of biological replicates nor of technical replicates. This is absolutely required.
      8. It is unclear whether the effects documented have been measured in the whole population or only in the infected cells. And when they are measured in infected cells and uninfected cells, are these cells from a population in the same well, or from a well containing only uninfected cells?
      9. In Figure 3A, the localisation of M. avium has to be shown.
      10. The mechanism proposed at the end of the abstract "...this work stresses out that compounds specifically inducing mitochondrial reactive oxygen species could present themself as valuable adjunct treatments." should be tested to close the loop and validate the data and hypothesis.

      Minor comments:

      1. The manuscript does not show any numbering, neither of pages nor of lines, which renders the writing of the review difficult.
      2. The authors write "undirect" instead of indirect.
      3. They also use "if" instead of whether quite frequently.
      4. Page 5, second line "... a 40% increase in cells treated ..." An increase of what?
      5. Page 5. The second paragraph belongs to the introduction or the discussion.
      6. Page 6. The authors mention that AMP, ADP etc... are nucleosides. But they are nucleotides.

      Significance

      The study explores an interesting question, but in its present state, the conclusions are not sustained by the evidence.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Here, authors confirm that glycolysis is important macrophage defense against mycobacterial infection and describe a central role of pyruvate in linking glycolysis and antimycobacterial mtROS production to control the intracellular burden. Alike previous authors who have demonstrated that the non-pathogenic Bacillus Calmette-Guerin and heat-killed M. tb increase glycolysis, they show that human primary macrophages infected with M. avium increase glycolysis to facilitate mycobacterial control. Rost and coll. show evidence that the killing mechanism act through the production of mtROS by the complex I of the electron transport chain via the engagement of RET. This mechanism acts in parallel to other immunometabolic defense pathways activated in M. avium infected macrophages, such as the production/induction of itaconate via the IRF-IRG1 pathways (Alexandre Gidon 2021).

      They give evidence that IL-6 and TNFa are not involved in regulating the pyruvate-mtROS and show chemical evidence that mitochondrial import of pyruvate through MPC activity is necessary to generate a high membrane potential and the subsequent production mtROS.

      However, the data presented here don t explain how pyruvate is driving RET and mtROS; if pyruvate targets the electron transport chain directly or is converted (via TCA) to another metabolite that initiates RET and mtROS. Above all, this work brings attention to the possibility of using compounds that specifically engage mtROS production for therapeutic perspectives

      Significance

      While the data presented here don t explain how pyruvate is driving RET and mtROS; if pyruvate targets the electron transport chain directly or is converted (via TCA) to another metabolite that initiates RET and mtROS, this work merits to be deeply evaluated for potential publication in a RC journal. However, the language must be improved and polished before submission.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      The paper tackles an important problem regarding the effect of demographic dependent vaccination protocols on the reduction in the number of deaths with respect to the situation of no vaccination (say J). A compartmental SIRD model with reinfection Y is proposed, stratified in two (age dependent) groups, based on a binary reduction of a given contact map, and given infection fatality risk (IFR). Several countries are then analyzed.

      As far as I understand we have a control variable v, parameters of the stratified model (i=1,2) tuned to match IFRi, and a control objective, i.e. minimization of J over one year.

      The paper is well written. The final message and some theoretical passages are not completely clear, at least to me. I have the following observations that the authors may want to consider.

      We thank the referee for the revision and are very glad that the overall evaluation is positive. Comments and suggestions have been thoroughly addressed, as we discuss in the following.

      1) The study of stability of infection free and endemic equilibria should be better developed. The 5 equations can be reduced to 4 (neglecting D) and the characteristic of the reduced Jacobian used to characterize the local asymptotic stability of equilibria, instability, bifurcation points etc... Alternatively, one can use a co-positive Lyapunov function (LF). For instance, if we take the LF V=S+I+Y+R, we get $\dot V=-\mu_I I-\mu_Y Y \le 0$. If $\mu_I$ and $\mu_y$ are strictly positive all equilibria are characterized by (S*,0 0,R*) and D=1-S*-R*. So, I don't understand the phrase after (7,8), notice that Y cannot be zero in finite time. For $\mu_y=0$ then Y* can be nonzero. I guess that closed-form computation of S* and R* is possible as function of the parameters at least in the case v=0. The stability result should be cast in function of the current reproduction number (not explicitated) wrt to S and R.

      The authors are invited to have a look at

      1.1) Pagliara et al, "Bistability and Resurgent Epidemics in Reinfection Models", IEEE CSLetters, 2018,

      for a theoretical analysis of stability on a similar (just a little bit simpler) model.

      We appreciate the suggestions of the referee for improvement of this material. We have carried out an in-depth revision of the stability analysis and significantly extended it. The major addition has been, as suggested, a section relating the current reproductive number at equilibrium (we call it the asymptotic reproductive number in the text) to the fixed points of the dynamics for three different scenarios: general model, no vaccination, and zero mortality of reinfected individuals. As Pagliara et al. show in their paper, the connection between the fixed points and the reproductive number is not trivial, but it is possible to derive it through the next-generation matrix technique, as we now do. Additional references regarding this technique have been added. We have included a Table summarizing the stability analysis (page 2 in SI 3) at the end of this new section.

      Other modifications include the reduction of 5 equations to 4 for the stability analysis and a clarification of possible equilibria (page 1 of SI 3), rephrasing and correcting our sentence after eqs. (7) and (8). We also attempted to obtain a closed-form computation of S* and R* but, to the best of our knowledge, concluded that it is not possible. We would be happy to pursue any insight in this respect the referee may have.

      What said before should be also extended to the stratified model, where a "network" Rt could be defined, see for instance

      1.2) L. Stella et al, "The Role of Asymptomatic Infections in the COVID-19 Epidemic via Complex Networks and Stability Analysis", SIAM J Cont. Opt., 2021, (arxiv.org/pdf/2009.03649.pdf)

      We thank the referee for pointing out this reference. Following the analysis in Stella et al., we have carried out a stability analysis for the stratified model as well. The results are included in a new section (pages 7-10 in the SI 3).

      2) It is not clear whether the free contagion parameters of the model have been fitted on real data (identification from infection and reinfection data). Notice that the interplay between vaccination strategies and NPI is important, see e.g.

      *2.1) Giordano et al, Modeling vaccination rollouts, SARS-CoV-2 variants and the requirement for non-pharmaceutical interventions in Italy", Nature Medicine 2021, *

      where progressive vaccination in reverse age order is considered together with different enforced NPI countermeasures.

      In the first part of our study, parameters are intendedly left free because we aim at describing the generic behavior of the model. Still, we derive several inequalities and relationships between parameter ratios that seem to be sensible attending to what the different classes in the model stand for. This is as described in sections regarding model parameters when the two generic models (SIYRD and S2IYRD) are introduced. The aim is to represent both the generic dependence with some variables and a broad class of contagious diseases, so parameters are mostly free. In agreement with this approach, parameters can be also freely varied in the companion webpage.

      In the second part of our study, the model is applied to COVID-19. In that case, we have used parameter values in agreement with observations, as (admittedly poorly) explained in pages 9-10 of the main text. Indeed, not enough information on parameter estimation was provided in the main text, and the SI 2 also needed some additional information. This has been amended. Let us explicitly mention that we have not fitted the dynamics of the model to any actual data set to fix specific values, as Giordano et al. do. In our case, we have first used different demographic data sets to evaluate contact rates and IFRs of the two population groups (these are parameters Mij and Ni in eqs. (7-10)). Secondly, recovery and death rates are estimated through the IFRi values for each age group i and the infectious period of COVID-19, that we fix at dI=13 days. Third, infection rate βSI=R0/dI has been estimated fixing R0=1, since the reproductive number of COVID-19 all over the world fluctuates around this value (Arroyo-Marioli et al. (2020) Tracking R of COVID-19: A new real-time estimation using the Kalman filter, PLoS ONE 16(1):e0244474). The reinfection rate is defined through its relationship with the infection rate, βRI= α1 βSI, where α1 was in the range 0-0.011 at early COVID-19 stages (Murchu et al. (2022), Quantifying the risk of SARS‐CoV‐2 reinfection over time, Rev Med Virol 32:e2260) and seems to be about 3-4 fold larger for the omicron variant (Pulliam et al., Increased risk of SARS-CoV-2 reinfection associated with emergence of the Omicron variant in South Africa, www.medrxiv.org/content/10.1101/2021.11.11.21266068v2). Given the relationships derived among parameters, our only free parameter was α2RY= α2 βRI, and we fixed it to α2=0.5 (i.e., reinfected individuals recover twice as fast as individuals infected for the first time).

      Once more, it was not our goal to precisely recover specific trajectories of COVID-19 or to point at possible future scenarios, but to illustrate the dependence of major trends with model parameters. Also, the appearance of new variants requires the reevaluation of parameters. For example, omicron has different IFR (therefore different mortality and recovery rates), a different infectious period, and higher infection and reinfection rates. In this context, the interactive webpage (where we will update demographic profiles and IFR data as they become available) is a useful resource to simulate any situation different from current or past ones.

      3) In the model the immunity waning is not explicitly considered (flux from R to S or better from a vaccinated compartment to S). It is clear that this complicates the model. Please discuss why the indirect way the waning is considered here is justified.

      3.1) Batistela et al, "SIRSi compartmental model for COVID-19 pandemic with immunity loss", Chaos Soliton and fractals, 2021.

      3.2) McMahon et al, "Reinfection with SARS-CoV-2: Discrete SIR (Susceptible, Infected,Recovered) Modeling Using Empirical Infection Data", JMIR Public health and surveillance, 2020.

      Though the model does not consider an incoming flux of individuals to compartment S, the existence of a "backward" flux from R to Y yields a transient phenomenology analogous to models with increases in the S class. Indeed, it is these fluxes that cause persistent endemic states; otherwise, the S class is monotonously depleted until infection extinction.

      In Batistela's et al. work, the possibility that individuals become reinfected is effectively implemented through a flux between the R and S classes, since only one class of infected individuals is considered and recovered individuals cannot be infected again. In our case, feeding back to S would mean that previous immunity is completely lost or that vaccines are not effective at all for some individuals. This is neither what McMahon et al. conclude when evaluating real data nor what more recent surveys indicate (see for instance the Science Brief published in October 2021 by the CDC, SARS-CoV-2 Infection-induced and Vaccine-induced Immunity, https://www.cdc.gov/coronavirus/2019-ncov/science/science-briefs/vaccine-induced-immunity.html).

      This nonetheless, complete immunity waning (feedback to the S class) and reinfections (feedback to a partly immune class experiencing overall lower severity of the disease) are equivalent to a large extent: the trend of COVID-19 seems to indicate that our Y class will be the "new S", and that fully naive individuals would arrive mostly due to demographic dynamics (birth and death processes, as also implemented by Batistela et al.). Summarizing, complete immunity waning is rare in the time scales considered in our simulations, while partial immunity that decreases the severity of the disease (after infection or vaccination) is the rule, in agreement with our choices.

      4) Reduction of deaths wrt no vaccination is of course important, but also reduction of stress in hospitals. This is particularly important now with the advent in Europe of the omicron variant. Please discuss on the real message you want to convey to policy makers in the actual scenario of the pandemic.

      The model in this work is deliberately simple. Our main goal was to explore the qualitative effects of demographic structure and disease parameters in protocols for vaccine administration. This was the reason to consider a mean-field model in a population structured into two groups. The main conclusion is that optimal vaccination protocols are demography- and disease-dependent. If this is so in our streamlined model, the more it will be in more realistic models, where one should include a finer stratification and, in all likelihood, heterogeneity in contagions. Our main message, therefore, is that there is no unique protocol for vaccine roll-out, valid for all populations and diseases. The abstract has been modified to highlight this conclusion.

      Some qualitative considerations also allow us to draw preliminary conclusions on the reduction of stress in hospitals. Since the number of hospital admissions is proportional to the incidence of the disease, the number H of hospitalized individuals can be represented as H=a I + b Y, with a>>b due to the partial immunity of vaccinated or recovered individuals (which belong to class Y upon (secondary) contagion). Therefore, minimizing the burden on the healthcare system amounts to minimizing the number of individuals in the I class. Beyond non-pharmaceutical measures, I is minimized when individuals are transferred as fast as possible to the Y class, that is, maximizing vaccine supply and acceptance. In terms of our model parameters, this entails maximizing v and also θ (the maximum fraction of individuals eventually vaccinated), for instace through devoted awareness campaigns. These ideas have been included in the Discussion section.

      Reviewer #1 (Significance (Required)):

      The final message and some theoretical passages are not completely clear, at least to me.

      Please discuss on the real message you want to convey to policy makers in the actual scenario of the pandemic.

      As discussed above, we have modified the manuscript following the advice given by the Reviewer. We think that both the presentation and the theory are clearer now.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this paper, a compartmental model of the propagation of an infection with vaccination and reinfection is studied. The impact that changes in the rates of these two processes have on disease progression and on the number of deaths is analyzed. In order to highlight the overall effect of the demographic structure of populations and the propagation of a given disease among different groups, the population is divided into two subpopulations and the model is extended to the two-dimensional case. In addition to the study of equilibria and their relative stability, the model is then applied in the case of COVID-19. Different vaccination strategies are studied using real demographic data and with a population split between under 80 and over 80 individuals. It is observed that for low vaccination rates, the advisable strategy is to vaccinate the most vulnerable group first, in contrast to the case of sufficiently high rates, where it is appropriate to vaccinate the most connected group first. The simulations show also that with a low fatality ratio, the strategy that yields the greatest reduction in deaths is vaccination of the group with the most contacts, while the situation is reversed for higher fatality ratio.

      The model and simulations presented are interesting and valuable. The comparison of the behavior of the model in the 4 different countries is very interesting, as well as the webpage created by the authors.

      We thank the referee for the very positive evaluation and are very glad that the study is found interesting and valuable.

      As minor comment, I think that the introduction of the model needs a more extensive literature review. For example, there is no mention of the classic SIR model of Kermack and McKendrick (1927) and other works on the introduction to epidemic models, which form the basis of the model presented by the authors.

      The referee is right. There is a long history of extensions and applications since Kermack & McKendrick introduced the SIR model that we obviated. This has been amended by adding an introductory paragraph with several new references at the beginning of the Models section, page 3 in the main text.

      Reviewer #2 (Significance (Required)):

      The model presented by the authors is quite original and simple enough to be suitable to different contexts and scenarios.

      Compared to previous work, this paper makes a twofold contribution, as explained by the authors. First, the introduction of reinfections shows the existence of long transients (or quasi-endemic states) that may precede the transition to a truly endemic state predicted for COVID-19. Second, the simplicity of model allows the characterization of systematic effects due to, at least, group size, demographic composition, and IFRs.

      I am involved in the study and analysis of epidemic models accompanied by network effects. I think this paper is a good contribution, although preliminary, in the analysis of the vaccination process and in the search for the optimal strategy.

      We thank the Reviewer and are glad that our goal, offering a model as simple as possible to obtain meaningful conclusions, is appreciated.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this paper, a compartmental model of the propagation of an infection with vaccination and reinfection is studied. The impact that changes in the rates of these two processes have on disease progression and on the number of deaths is analyzed. In order to highlight the overall effect of the demographic structure of populations and the propagation of a given disease among different groups, the population is divided into two subpopulations and the model is extended to the two-dimensional case. In addition to the study of equilibria and their relative stability, the model is then applied in the case of COVID-19. Different vaccination strategies are studied using real demographic data and with a population split between under 80 and over 80 individuals. It is observed that for low vaccination rates, the advisable strategy is to vaccinate the most vulnerable group first, in contrast to the case of sufficiently high rates, where it is appropriate to vaccinate the most connected group first. The simulations show also that with a low fatality ratio, the strategy that yields the greatest reduction in deaths is vaccination of the group with the most contacts, while the situation is reversed for higher fatality ratio.

      The model and simulations presented are interesting and valuable. The comparison of the behavior of the model in the 4 different countries is very interesting, as well as the webpage created by the authors.

      As minor comment, I think that the introduction of the model needs a more extensive literature review. For example, there is no mention of the classic SIR model of Kermack and McKendrick (1927) and other works on the introduction to epidemic models, which form the basis of the model presented by the authors.

      Significance

      The model presented by the authors is quite original and simple enough to be suitable to different contexts and scenarios.

      Compared to previous work, this paper makes a twofold contribution, as explained by the authors. First, the introduction of reinfections shows the existence of long transients (or quasi-endemic states) that may precede the transition to a truly endemic state predicted for COVID-19. Second, the simplicity of model allows the characterization of systematic effects due to, at least, group size, demographic composition, and IFRs.

      I am involved in the study and analysis of epidemic models accompanied by network effects. I think this paper is a good contribution, although preliminary, in the analysis of the vaccination process and in the search for the optimal strategy.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      The paper tackles an important problem regarding the effect of demographic dependent vaccination protocols on the reduction in the number of deaths with respect to the situation of no vaccination (say J). A compartmental SIRD model with reinfection Y is proposed, stratified in two (age dependent) groups, based on a binary reduction of a given contact map, and given infection fatality risk (IFR). Several countries are then analized.

      As far as I understand we have a control variable v, parameters of the stratified model (i=1,2) tuned to match IFRi, and a control objective, i.e. minimization of J over one year.

      The paper is well written. The final message and some theoretical passages are not completely clear, at least to me. I have the following observations that the authors may want to consider.

      1)The study of stability of infection free and endemic equlibria should be better developed. The 5 equations can be reduced to 4 (neglecting D) and the characteristic of the reduced Jacobian used to characterize the local asymptotic stability of equlibria, instability, biforcation points etc... Alternatively, one can use a co-positive Lyapunov function (LF). For instance, if we take the LF V=S+I+Y+R, we get \dot V=-\mu_I I-\mu_Y Y \le 0. If \mu_I and \mu_y are strictly positive all equilibria are characterized by (S,0 0,R) and D=1-S-R. So, I don't understand the phrase after (7,8), notice that Y cannot be zero in finite time. For \mu_y=0 then Y can be nonzero. I guess that closed-form computation of S and R* is possible as function of the parameters at least in the case v=0. The stability result should be cast in function of the current reproduction number (not explicitated) wrt to S and R. The authors are invited to have a look at

      1.1)Pagliara et al, "Bistability and Resurgent Epidemics in Reinfection Models", IEEE CSLetters, 2018,

      for a theoretical analysis of stability on a similar (just a little bit simpler) model. What said before should be also extended to the stratified model, where a "network" Rt could be defined, see for instance

      1.2)L. Stella et al, "The Role of Asymptomatic Infections in the COVID-19 Epidemic via Complex Networks and Stability Analysis", SIAM J Cont. Opt., 2021, (arxiv.org/pdf/2009.03649.pdf)

      2)It is not clear whether the free contagion parameters of the model have been fitted on real data (identification from infection and reinfection data). Notice that the interplay between vaccination strategies and NPI is important, see e.g. 2.1) Giordano et al, Modeling vaccination rollouts, SARS-CoV-2 variants and the requirement for non-pharmaceutical interventions in Italy", Nature Medicine 2021, where progressive vaccination in reverse age order is considered together with different enforced NPI countermeasures.

      3)In the model the immunity waning is not explicitly considered (flux from R to S or better from a vaccinated compartment to S). It is clear that this complicates the model. Please discuss why the indirect way the waning is considered here is justified.

      3.1)Batistela et al, "SIRSi compartmental model for COVID-19 pandemic with immunity loss", Chaos Soliton and fractals, 2021.

      3.2)McMahon et al, "Reinfection with SARS-CoV-2: Discrete SIR (Susceptible, Infected,Recovered) Modeling Using Empirical Infection Data", JMIR Public health and surveillance, 2020.

      4)Reduction of deaths wrt no vaccination is of course important, but also reduction of stress in hospitals. This is particularly important now with the advent in Europe of the omicron variant. Please discuss on the real message you want to convey to policy makers in the actual scenario of the pandemic.

      Significance

      The final message and some theoretical passages are not completely clear, at least to me. Please discuss on the real message you want to convey to policy makers in the actual scenario of the pandemic.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers


      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      This article focuses on one possible outcome of protein sequence evolution after duplication, in which the residue distribution at specific positions of a multiple sequence alignment becomes uncoupled from the distribution expected from the phylogeny of the protein family. The authors call these events "residue inversions" and interpret them as the result of functional pressures on family members with diverging cellular roles. Based on a theoretical model of residue evolution after duplication of the coding gene, the authors describe the criteria for categorizing a particular position in a protein as a "residue inversion" and develop an algorithm to identify such events in a multiple alignment. They then apply their approach to the family of Epidermal Growth Factor Receptors in Teleost fishes and identify 19 EGFR positions in a dataset of 88 fish genomes, which satisfy the criteria of "residues inversions". They provide support to the scoring scheme used in their approach through a simulated evolution run and conclude from a comparison of their positions to the ones predicted by SPEER to represent Specificity Determining Sites that the two are largely orthogonal and may therefore complement each other in sequence-based function prediction.

      Major comments: 1. Throughout the paper, the functional involvement of positions subject to "residue inversions" is indirect, inferred from the literature, and in parts sparse and tenuous. It therefore remains unclear to what extent the interpretation that "residue inversions" represent functional adaptations is correct. The authors acknowledge this uncertainty in several places, including the Conclusions.

      We agree with the reviewer that without experimental validation an uncertainty about the data interpretation remains, however testing protein function on a large scale and in non-model organisms is extremely challenging. Since we were aware of this obstacle, we validate our conclusions in different ways: 1. the theoretical model and the simulated MSA both show a lower chance of observing residue inversions than what we detected in the teleost fish EGFR example. 2. previous literature highlighted an identified inverted residue as the possible cause of sub-functionalization of teleost fish EGFR. 3 We generated the alpha fold models of teleost fish EGFR and performed molecular dynamic simulation of the two copies, in complex with the ligand. In our simulations, we see the same trend that we observe with the inter-paralog inversions at the functional level. The new results have been integrated in line 692-706.

      "Residue inversion" is a very unintuitive term, which took me several readings to penetrate and made reading the article difficult. The authors may wish to reconsider this term. Naively, a residue inversion would be the swapping of residues between two positions, such that a residue expected in position A is found in position B, while the residue expected in B is found in A. That is what I suspect most readers will think.

      We acknowledged that the terminology might be confusing. We therefore decided to define it as inter-paralog inversion of amino acids throughout all the text.

      Is the phenomenon described here just a curiosity, or an important aspect of divergent evolution after duplication? The authors seem to be of two minds about it, calling the phenomenon "rare" in the Abstract, but an "important and understudied outcome of gene duplication" in the Introduction, then hedging again that it "might be rare" in the Conclusions. The benefits of recognizing such positions are also formulated with great caution, for example in lines 309-311: "In summary, the identification of residue inversion event has the potential to improve functional residue predictions".

      We agree with the reviewer that we did not yet test the recurrence of this event on a large scale, however this does not exclude that this event is frequent. This work is focused on the observation, characterization, and implications of this event. Considering this comment and the one below we decided to perform a further analysis (see below for more details).

      Additionally, the analysis of the frequency of this event at the whole-organism scale on multiple organisms, while interesting, would be out of the scope of this paper, if not just because it requires a totally different (large-scale) approach compared to the one used in here. This type of analysis is also limited by the absence of a database collecting intermediate knowledge that would speed up the initial part of ortholog classification at a broad range.

      Finally, by rarity we mean the statistical chance of the event, not considering the effective chance of observing it from the real data. In fact, we rectified in the text using the reviewer’s observation.

      OLD VERSION (ppXX):

      Our work uncovers a rare event of protein divergence that has direct implications in protein functional annotation and sequence evolution as a whole.

      NEW VERSION:

      Our analysis shows a new way to investigate an important and understudied outcome of gene duplication.

      It would probably strengthen the article substantially if the authors would (I) use their program to scan a large number of multiple alignments in order to establish more reliably how frequent this phenomenon actually is, and whether it is universal or a specifc aspect of eukaryotic, maybe even only vertebrate evolution; and then (II) mapped the positions identified on structural models for the proteins, obtained by homology modeling or AlfaFold prediction, in order to substantiate their potential origin as functional adaptations.

      We thank the reviewer for the thoughtful suggestions. (I) we tested the inter-paralog inversion score at the proteome level using a reduced dataset (70) of reference teleost fish proteomes from Uniprot. We obtained 54 proteins that duplicated in the teleost specific whole genome duplication, then we run our pipeline on it. We found that the overall distribution of scores is more similar to the simulated evolution experiment rather than to the EGFR test case. We integrated the new results and discussion in a new paragraph and new figure in line 708-716.

      (II) We considered also the analysis requested in the second point. Unfortunately, we could not extract any meaningful data from the AlphaFold models.

      Reviewer #1 (Significance (Required)):

      A method to improve the functional annotation of proteins in a paralogous family would be very useful, given the abundance of sequence data.

      We thank the reviewer for acknowledging the importance of the question that we have addressed.

      I am knowledgeable in varios aspects of molecular evolution and functional annotation. I am neither a mathematician, nor a developer of phylogenetic methods, so I cannot judge these aspects of the paper.


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Review of Pascarelli and Laurino titled “Identification of residue inversions in large phylogenies of duplicated proteins”

      I find the topic of the paper very exciting and long overdue. Indeed, I was under the impression that the question of parallel evolution in paralogous copies must have been addressed long ago: to my surprise, having looked in depth at the literature, that is only partially so. The manuscript, therefore, addresses a relatively novel and fundamental question of broad interest.

      We thank the reviewer for his positive comment.

      Having said this, I also found the manuscript to suffer from an identity problem, which in many places encroaches on the underlying quality of the science. I will structure my review into three concerns: the identity issues, the novelty issue and the emergent quality issues from the two.

      Identity issues:

      The manuscript is primarily dealing with an evolutionary issue – or I am biased to see it this way as an evolutionary researcher myself. Nevertheless, much of the language and terminology of the paper either misuses evolutionary terms or invents new ones in its place with a bias towards a protein chemistry perspective. Specifically, what the authors call “residue inversions” is called “parallel evolution” or “convergent evolution” in the literature. Also, "residues" are typically used for physical amino acids in a structure. If we are talking about sequence level “amon acid” would be a better term. The issue is further confounded by the meaning of “inversion” in genetics as a single mutation that inverts the position of nucleotides (i.e. an “AT” becomes “TA”).

      I strongly recommend for the authors to become familiarized with the common usage of existing and widely used terms in evolutionary biology that describe the phylogenetic patterns they see: parallel evolution, convergent evolution, homoplasy, etc, and to use them consistently throughout the manuscript.

      The same goes for "mutation", which the authors confuse on two levels: evolutionary and biochemical. Sometimes the authors refer to “mutation” of amino acids (which can be entertained at some level, but from a genetic perspective only nucleotides mutate – in the protein biochemistry field this term is frequently applied to amino acid residues, which is the basis of the identity issue). However, since the authors also use “mutation” to refer to a “substitution” (which is what we call a mutation that has become fixed in evolution) this creates another level of confusion. I urge the authors to change this aspect of the language of the manuscript to better reflect evolutionary concepts.

      As part of the language issues I am not sure how meta-functionalization in the author’s view differs either from neofunctionalization or specialization of duplicated genes.

      We thank the reviewer to point out the terminology issue, this will also help reaching a broader audience. We clarify the confusion surrounding the terms “mutation” and “residue inversion” by changing the former to “substitution”, while the latter to “inter-paralog inversions” (see also other reviewer comments).

      We understand the importance of the usage of the correct term to talk about this event of protein sequences evolution. Therefore, we used convergent and parallel evolution accordingly when we discussed the nuances between Metafunctionalization and parallel evolution in the text, in lines 188 and 399.

      Novelty issues:

      As I mentioned, the issue of parallel evolution of gene duplications is an extremely interesting topic. I was sure that the people who studied parallel evolution, or those interested in gene duplications, must have published extensively on this. However, my search of the literature revealed only a modest pre-existing effort. Nevertheless, previous efforts are not entirely non-existent and should be cited and discussed in this paper too. The most pertinent example is

      https://bmcecolevol.biomedcentral.com/articles/10.1186/s12862-020-01660-1

      which has an identical setup from what I can tell (compare Figure 1 in each paper).

      This paper was not hard to find using "parallel evolution", thus my focus on the language issues in the previous section.

      We thank the reviewer for his suggestion, we included the relevant papers in the text in lines 520-523. Interestingly, the cited paper shows that a comprehensive analysis of the fate of duplicated genes at the sequence level was done. However, in this paper, the ‘fate’ of a paralog is determined by counting the number of sites that support one or the other fate, independently of the orthologous relationship. In our study, we start from the orthologous relationship to pre-determine the fate of the paralogous protein, then we identify the sites that break this assumption. Our type of analysis is deemed to work only where the orthologous relationship is unequivocal. That is the reason why we chose an example with relatively short branch lengths after duplication (the teleost specific duplication). Our rationale is that with a higher genome coverage across organisms, resolving the orthologous relationship will get easier in time. However, our study focuses on a distinct case (asymmetric divergence) where the diverging paralogs converge to the same phenotype. In such a case, neutral substitutions related to the ancestral relationship of a protein can be filtered out to better search for functional adaptations.

      Content issues:

      The lack of attention to evolutionary concepts, in my opinion, provided some missed opportunities for the authors to attack the problem in a more convincing fashion. Specifically, in the setup to distinguish between parallel evolution of paralogues versus orthologues ("inversion" versus "species-specific adaptation" in the author's text) one must be able to distinguish between the two copies and assign true evolutionary relationship. In practice, that is not always possible based on tree lengths or topologies alone because of confounding factors such as independent duplications or gene conversion events.

      I would feel better about the results of this study if the following two things were integrated.

      The use of synteny to better determine homologous relationships (declare copies to be true paralogues if they occupy the same syntenic region). To compare the frequency or parallel evolution of paralogues versus orthologues as a null model of the expected number of parallel events in paralogous copies.

      We agree that a synteny analysis has to be included. We tested it for the EGFR proteins in fish and the results support the orthologous relationship of EGFRa and EGFRb in the two groups compared (Cypriniformes versus other teleosts). The results were included in the text and in the Supplementary figure in lines 303-305.

      The second point targets the way the model derives the expectations: at the author's own admission the model makes a number of unrealistic assumptions, ") equal branch length between the two paralogs; 2) only zero to one mutation can occur in each of the six branches; 3) after a mutation, each residue is equiprobable; 4) no selective pressure; 5) the probability of a mutation on a branch solely depends on the branch length (mutation rate). The authors do not really test the resulting tree on deviation from these assumptions (I am sure that it does not conform) but essentially comparing the occurrence of parallel events in paralogues versus orthologues may solve the problem with a less restrictive set of assumptions (that one expects an equal number of parallel events in paralogues and orthologues unless there is some paralogue-specific selection pressure, which is what the authors are looking for.

      We compared the occurrence of the two outcomes in both the simulation and in the real data. In all cases, the two score distributions have a very similar shape, with a 99th percentile score of respectively 0.062 and 0.113. Most sites in an alignment (>99%) are not expected to be inverted and will have scores very close to 0, making the identification of inversions a quest for outliers. Furthermore, in case of the real data, each distribution can be independently affected by different selective pressures that might bias the background distribution. While the inversion in paralogs is expectedly involving few, functional, residues, the inversion in orthologs is expected to have a broad effect. For example, a temperature adaptation might shift the number of polar residues on the protein surface (see for example: https://academic.oup.com/peds/article/13/3/179/1466666). Also, a different protein chosen for analysis might generate a different background distribution of the two events. In the larger dataset, the similarity of the two distributions is even more (99th percentile of 0.07 and 0.08). Because of the shown similarity of the two event distributions, and the possible issues with different selective pressures, we leave the analysis suggested by the reviewer as a post-processing possibly performed by the user. We report a summary of this result born from the reviewer’s observation in line 478.

      In summary, I believe that the topic is very interesting, the authors potentially found a new aspect of evolution of a specific gene family. However, in my opinion a major revision is needed to unite this text with the terms in the field, the previous publication and to integrate the two additional analyses I suggested.

      Minor Comments:

      I started adding these specific comments before generalizing the broader deviation from the common evolutionary language. There are more further along in the manuscript, but in the interest of time I will not articulate them here hoping that the authors will first try a major revision targeting these issues.

      Line 64: While neutral mutations help to determine the phylogenetic position of a protein, mutations of functional residues are a signal of functional shifts that might occur independently of the phylogeny. - this is quite misleading. All substitutions (neutral or beneficial) have a phylogenetic signal. In any case, this is discussed here in phylogenetic terms: https://pubmed.ncbi.nlm.nih.gov/10742039/

      We corrected the sentence to refer to divergence time instead of phylogenetic signal.

      OLD VERSION:

      While neutral mutations help to determine the phylogenetic position of a protein, mutations of functional residues are a signal of functional shifts that might occur independently of the phylogeny.

      NEW VERSION:

      While neutral substitutions are directly proportional to the time of divergence, a change in functional residues could be a signal of a functional shift that might occur independently of the divergence time.

      Line 107: "under high evolutionary pressure" - I do not know what evolutionary pressure is nor why it can be high or low.

      We corrected the term to “selective pressure”.

      OLD VERSION:

      Lorin et al. showed that both copies of EGFR might have been retained because they are involved in the complex process of skin pigmentation (40), which is under high evolutionary pressure in most fish.

      NEW VERSION:

      Lorin et al. showed that both copies of EGFR might have been retained because they are involved in the complex process of skin pigmentation (40), a trait that is under selective pressure in most fish

      Line 112 "linearly inherited across orthologs" - linear is a poor choice of a word here. The first thing that comes to my mind is quadratic inheritance as an alternative. Perhaps the authors are looking for "vertical" versus "horizontal" - these are established terms in phylogenetics (think "horizontal gene transfer").

      We corrected the term to “vertically inherited”.

      OLD VERSION

      Therefore, the power to predict functional residues is limited by our ability to track protein function on the phylogenetic tree when it is not linearly inherited by orthologs.

      NEW VERSION

      Therefore, the power to predict functional residues is limited by our ability to track protein function on the phylogenetic tree when it is not vertically inherited by orthologs.

      It is my invariant practice to reveal my identity to the authors,

      Fyodor Kondrashov

      Reviewer #2 (Significance (Required)):

      Addressed in the above

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of Pascarelli and Laurino titled "Identification of residue inversions in large phylogenies of duplicated proteins"

      I find the topic of the paper very exciting and long overdue. Indeed, I was under the impression that the question of parallel evolution in paralogous copies must have been addressed long ago: to my surprise, having looked in depth at the literature, that is only partially so. The manuscript, therefore, addresses a relatively novel and fundamental question of broad interest.

      Having said this, I also found the manuscript to suffer from an identity problem, which in many places encroaches on the underlying quality of the science. I will structure my review into three concerns: the identity issues, the novelty issue and the emergent quality issues from the two.

      Identity issues:

      The manuscript is primarily dealing with an evolutionary issue - or I am biased to see it this way as an evolutionary researcher myself. Nevertheless, much of the language and terminology of the paper either misuses evolutionary terms or invents new ones in its place with a bias towards a protein chemistry perspective. Specifically, what the authors call "residue inversions" is called "parallel evolution" or "convergent evolution" in the literature. Also, "residues" are typically used for physical amino acids in a structure. If we are talking about sequence level "amon acid" would be a better term. The issue is further confounded by the meaning of "inversion" in genetics as a single mutation that inverts the position of nucleotides (i.e. an "AT" becomes "TA").

      I strongly recommend for the authors to become familiarized with the common usage of existing and widely used terms in evolutionary biology that describe the phylogenetic patterns they see: parallel evolution, convergent evolution, homoplasy, etc, and to use them consistently throughout the manuscript.

      The same goes for "mutation", which the authors confuse on two levels: evolutionary and biochemical. Sometimes the authors refer to "mutation" of amino acids (which can be entertained at some level, but from a genetic perspective only nucleotides mutate - in the protein biochemistry field this term is frequently applied to amino acid residues, which is the basis of the identity issue). However, since the authors also use "mutation" to refer to a "substitution" (which is what we call a mutation that has become fixed in evolution) this creates another level of confusion. I urge the authors to change this aspect of the language of the manuscript to better reflect evolutionary concepts.

      As part of the language issues I am not sure how meta-functionalization in the author's view differs either from neofunctionalization or specialization of duplicated genes.

      Novelty issues:

      As I mentioned, the issue of parallel evolution of gene duplications is an extremely interesting topic. I was sure that the people who studied parallel evolution, or those interested in gene duplications, must have published extensively on this. However, my search of the literature revealed only a modest pre-existing effort. Nevertheless, previous efforts are not entirely non-existent and should be cited and discussed in this paper too. The most pertinent example is

      https://bmcecolevol.biomedcentral.com/articles/10.1186/s12862-020-01660-1

      which has an identical setup from what I can tell (compare Figure 1 in each paper).

      This paper was not hard to find using "parallel evolution", thus my focus on the language issues in the previous section.

      Content issues:

      The lack of attention to evolutionary concepts, in my opinion, provided some missed opportunities for the authors to attack the problem in a more convincing fashion. Specifically, in the setup to distinguish between parallel evolution of paralogues versus orthologues ("inversion" versus "species-specific adaptation" in the author's text) one must be able to distinguish between the two copies and assign true evolutionary relationship. In practice, that is not always possible based on tree lengths or topologies alone because of confounding factors such as independent duplications or gene conversion events.

      I would feel better about the results of this study if the following two things were integrated.

      The use of synteny to better determine homologous relationships (declare copies to be true paralogues if they occupy the same syntenic region). To compare the frequency or parallel evolution of paralogues versus orthologues as a null model of the expected number of parallel events in paralogous copies.

      The second point targets the way the model derives the expectations: at the author's own admission the model makes a number of unrealistic assumptions, ") equal branch length between the two paralogs; 2) only zero to one mutation can occur in each of the six branches; 3) after a mutation, each residue is equiprobable; 4) no selective pressure; 5) the probability of a mutation on a branch solely depends on the branch length (mutation rate). The authors do not really test the resulting tree on deviation from these assumptions (I am sure that it does not conform) but essentially comparing the occurrence of parallel events in paralogues versus orthologues may solve the problem with a less restrictive set of assumptions (that one expects an equal number of parallel events in paralogues and orthologues unless there is some paralogue-specific selection pressure, which is what the authors are looking for.

      In summary, I believe that the topic is very interesting, the authors potentially found a new aspect of evolution of a specific gene family. However, in my opinion a major revision is needed to unite this text with the terms in the field, the previous publication and to integrate the two additional analyses I suggested.

      Minor Comments:

      I started adding these specific comments before generalizing the broader deviation from the common evolutionary language. There are more further along in the manuscript, but in the interest of time I will not articulate them here hoping that the authors will first try a major revision targeting these issues.

      Line 64: While neutral mutations help to determine the phylogenetic position of a protein, mutations of functional residues are a signal of functional shifts that might occur independently of the phylogeny. - this is quite misleading. All substitutions (neutral or beneficial) have a phylogenetic signal. In any case, this is discussed here in phylogenetic terms: https://pubmed.ncbi.nlm.nih.gov/10742039/

      Line 107: "under high evolutionary pressure" - I do not know what evolutionary pressure is nor why it can be high or low.

      Line 112 "linearly inherited across orthologs" - linear is a poor choice of a word here. The first thing that comes to my mind is quadratic inheritance as an alternative. Perhaps the authors are looking for "vertical" versus "horizontal" - these are established terms in phylogenetics (think "horizontal gene transfer").

      It is my invariant practice to reveal my identity to the authors,

      Fyodor Kondrashov

      Significance

      Addressed in the above

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This article focuses on one possible outcome of protein sequence evolution after duplication, in which the residue distribution at specific positions of a multiple sequence alignment becomes uncoupled from the distribution expected from the phylogeny of the protein family. The authors call these events "residue inversions" and interpret them as the result of functional pressures on family members with diverging cellular roles. Based on a theoretical model of residue evolution after duplication of the coding gene, the authors describe the criteria for categorizing a particular position in a protein as a "residue inversion" and develop an algorithm to identify such events in a multiple alignment. They then apply their approach to the family of Epidermal Growth Factor Receptors in Teleost fishes and identify 19 EGFR positions in a dataset of 88 fish genomes, which satisfy the criteria of "residues inversions". They provide support to the scoring scheme used in their approach through a simulated evolution run and conclude from a comparison of their positions to the ones predicted by SPEER to represent Specificity Determining Sites that the two are largely orthogonal and may therefore complement each other in sequence-based function prediction.

      Major comments:

      1. Throughout the paper, the functional involvement of positions subject to "residue inversions" is indirect, inferred from the literature, and in parts sparse and tenuous. It therefore remains unclear to what extent the interpretation that "residue inversions" represent functional adaptations is correct. The authors acknowledge this uncertainty in several places, including the Conclusions.
      2. "Residue inversion" is a very unintuitive term, which took me several readings to penetrate and made reading the article difficult. The authors may wish to reconsider this term. Naively, a residue inversion would be the swapping of residues between two positions, such that a residue expected in position A is found in position B, while the residue expected in B is found in A. That is what I suspect most readers will think.
      3. Is the phenomenon described here just a curiosity, or an important aspect of divergent evolution after duplication? The authors seem to be of two minds about it, calling the phenomenon "rare" in the Abstract, but an "important and understudied outcome of gene duplication" in the Introduction, then hedging again that it "might be rare" in the Conclusions. The benefits of recognizing such positions are also formulated with great caution, for example in lines 309-311: "In summary, the identification of residue inversion event has the potential to improve functional residue predictions".

      It would probably strengthen the article substantially if the authors would (I) use their program to scan a large number of multiple alignments in order to establish more reliably how frequent this phenomenon actually is, and whether it is universal or a specifc aspect of eukaryotic, maybe even only vertebrate evolution; and then (II) mapped the positions identified on structural models for the proteins, obtained by homology modeling or AlfaFold prediction, in order to substantiate their potential origin as functional adaptations.

      Significance

      A method to improve the functional annotation of proteins in a paralogous family would be very useful, given the abundance of sequence data.

      I am knowledgeable in varios aspects of molecular evolution and functional annotation. I am neither a mathematician, nor a developer of phylogenetic methods, so I cannot judge these aspects of the paper.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: Figures we made to respond to the referee comments appear to be not supported by the ReviewCommons system.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      **Summary:**

      The authors characterized a new lncRNA locus named FLAIL that controls flowering time in Arabidopsis thaliana. The functional validation of this locus is strongly supported by the use of several different tools (CRISPR-Cas9 deletions, T-DNA insertion, amiRNA gene silencing, and transgene complementation of KO lines). It is also suggested that FLAIL lncRNA works in trans but not in cis. There are strong observations supporting that FLAIL works in trans.

      Moreover, it is suggested that FLAIL regulates gene expression by interacting with distant chromatin loci. This was assessed using RNA-Seq and ChIRP-Seq. Yet, the overlap between DEGs in the flail mutant and FLAIL binding sites at the chromatin is very small, with only 12 genes. From those, only 2 flowering genes' expression was rescued by FLAIL transgene complementation. The final conclusion that FLAIL lncRNA represses flowering by direct inhibition of the 2 flowering genes expression is correlative, and lacks genetic validation.

      #1.1 We plan to support the conclusions in the manuscript genetically as the reviewer suggests. We started these experiments yet they will require the timeframe of the full revision.

      In addition inspection of the supplementary file shows that the ChIRP analysis was done without filtering for the FDR so that some of the positive hits have an FDR of 0,232.

      #1.2 We strengthened the manuscript by implementing and FDR filter of ChIRP-seq results. The distribution of FLAIL binding sites in Fig. S7B and Table S4, and overlapping numbers between DEGs and FLAIL-ChIRP in Fig. S8A were correspondingly updated.

      In addition, many of the peaks land in intergenic regions with is not mentioned in the text a graph with the position of the peaks in respect to nearby genes would help.

      #1.3 Thank you for the suggestion, we strengthened the manuscript with the requested analysis. We implemented the FDR filter, then we used "tssRegion" in ChIPseeker to set distance to the nearest TSS as (-1000, 1000), then most peaks were located in promoter regions (67.24%) and in intergenic regions with 16.38%. Since many papers present the position of the peaks by ChIPseeker (PMID: 32338596, PMID: 28221134, PMID: 31081251, PMID: 32012197, PMID: 31649032, PMID: 32633672) we also applied a similar method to display a distribution of FLAIL binding loci relative to distance from the nearest TSS in Fig. S7C.

      In one sentence, the authors used the right model system and methodology, including advanced techniques, to characterize a new trans-acting lncRNA important for controlling the flowering time in Arabidopsis but lack evidence supporting a mechanism of action that goes beyond the interaction with several chromatin loci.

      **minor points:**

      line#63-64 the authors say the COLDAIR and ASL work on FLC in cis in my view the original papers suggested/showed they work in trans.

      #1.4 We increased precision by changing this sentence to ‘Vernalization-induced flowering associates with several lncRNAs such as ____COOLAIR____, COLDAIR____, ANTISENSE LONG (ASL), and COLDWRAP____ that in cis or in trans locally repress gene expression of FLOWERING LOCUS C (FLC), a key flowering repressor at different stages of vernalization’____.

      Fig 1B please add some more protein-coding RNAs for the bio-info analysis for comparison

      #1.5 ____done.

      Order of Supplementary Fig citation is mixed with S2 coming before S1B

      #1.6 Thank you, we ordered all figures by appearance in the text. __

      __

      It would help the reader to have a schematic of the crisper deletions, T-DNA insertion, and position of primers used for the RT-qPCR.

      #1.7 We enhanced our presentation of Fig. 1A. It shows a schematic of them as well as positions of primers.

      In the supplementary PDF file, some of the text is missing on page 3 beginning and end of lines.

      #1.8 we ensured all text in new submission.

      Reviewer #1 (Significance (Required)):

      The use of several different tools to validate the biological function of FLAIL locus is a major strength of this work.

      The authors propose that flowering time and its gene regulation are controlled by sense FLAIL lncRNAs. However, the sense transcription of FLAIL locus is not detected in wild-type plants by TSS-Seq, TIF-Seq, or plaNET-Seq.

      #1.9.1 There appears to be some confusions. Transcription of sense FLAIL can be observed in chr-DRS, TSS-seq, TIF-seq in wild type and even in plaNET-seq in NRPB2-FLAG nrpb2-1 plant. We enhanced presentation of Fig. 1 and provided a more clear description in Line 81-99.

      If the authors would have explored further the expression of FLAIL transcripts in different stages of development (vegetative and non-vegetative) and in response to different conditions, it would make their claims on the function of FLAIL lncRNAs more convincing. Additionally, flail mutants could have been obtained in the hen-2 background, since it's there where we can observe FLAIL transcription.

      #1.9.2 Thank you for the suggestion. We included additional analyses in ____Fig. S2 for FLAIL transcription level in different tissues and different abiotic stress conditions base on 20,000 publicly available RNA-seq libraries (PMID: 32768600). Although many libraries are non-stranded, this analysis determined that sense FLAIL or total FLAIL (including sense and antisense) is broadly expressed over many tissues and induced in response to many abiotic stresses (Fig. S2A-B), therefore suggesting that FLAIL may be needed broadly in Arabidopsis.

      FLAIL locus lays on the proximal promoter region of PORCUPINE (PCP), an important regulator of plant development. As flail mutants, pcp mutants display an early flowering phenotype. The authors show no link between FLAIL and PCP from the overlap between re-analysis of published RNA-Seq data for pcp and RNA-Seq and ChIRP-Seq from the authors. This analysis is not enough to exclude the involvement of PCP from the FLAIL function. PCP expression using RT-qPCR should be performed in flail mutants to further support that FLAIL works independently from PCP.

      #1.10 We strengthened this conclusion by adding the requested experiment. PCP transcription level in flail3 mutant was provided by RT-qPCR and RNA-seq in Fig. S11A-B.

      This work does not hypothesize any molecular mechanism besides the interaction of FLAIL lncRNAs with several chromatin loci. It was recently proposed in Arabidopsis that a trans-acting lncRNA interacts with distant loci via the formation of R-loops. The authors do not comment on that. This work would benefit in correlating FLAIL binding sites with R-loop-forming regions mapped in Arabidopsis, regardless of the results from this analysis. Additionally, the authors could attempt to look for a motif responsible for FLAIL binding.

      Check R-loop forming data R-loops (Santos-Pereira and Aguilera, 2015) in Arabidopsis, determined by DRIP-seq (Xu et al., 2017).

      #1.11 Thanks very much for this excellent suggestions.

      First, we searched for a consensus DNA motif on FLAIL binding regions by Homer. We determined four commonly enriched DNA sequence motifs among FLAIL target genes (Fig. 4G). Notably, the target genes CIR1 and LAC8 contained consensus sequences that matched to all FLAIL binding motifs (Fig. 4G). These data are consistent with a model where FLAIL binds DNA targets through a sequence complementary mechanism. Functionally important sequences are frequently conserved among evolutionarily distant species, we observed three motifs that appeared to cross-species conserved (Fig. S9), suggesting a potential evolutionarily constrained role.

      Second, we indeed identified R-loops peaks on several of FLAIL binding sites by DRIP-seq (Xu et al., 2017). For example, we observed R-loop formation over three FLAIL binding motifs at CIR1 locus and one at LAC8 (Fig. R1), indicating that R-loop formation may also be a factor determining FLAIL binding. Even though R-loop peaks are present at several FLAIL targets, full elucidation if R-loop formation determines FLAIL targeting requires further experimental evidence is beyond the scope of the current manuscript.

      Fig. R1 Representative tracks at LAC8 and CIR1 showing R-loop formation by DRIP-seq on Watson strand (w-R loops), Crick strand (c-R loops). Undetectable R-loops after RNAse-H treatment was shown as negative control. Four conserved sequence regions of FLAIL binding motifs were indicated by red arrows at LAC8 and CIR1 loci. Gene annotation was shown at the bottom.

      Most of the key conclusions are convincing, except for the flowering time control directly through CIR1 and LAC8, which should be mentioned as speculative

      ____#1.12____ Thank you for finding most key conclusions convincing. We plan strengthen the manuscript with additional genetic evidence to as part of the full revision.

      The words locus and loci are latin and they should be written in italic. The word Brassicaceae, referring to the family should be in italic, and should not be "Brassicaceaes". The word analysis has the wrong spelling.

      #1.13 We follow conventions given in Scientific Style and Format: The CBE Manual for Authors, Editors and Publishers (1994) Cambridge University Press, Cambridge, UK, 6th edn. The words locus and loci are common Latin terms and should not be italicized. However, should the format of the final prefer these words in italics we will change it later. We improved consistency of using italics. “Brassicaceaes” was changed to “Brassicaceae”.

      "How much time do you estimate the authors will need to complete the suggested revisions: this is difficult to answer as it depends to which level the author would like to take their work. In my view, if all new experiments would have to be started from scratch it is too far away to be estimated.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      In this ms, the authors identified the FLAIL lncRNA that represses flowering in Arabidopsis from a locus producing sense and antisense transcripts. They use an allelic series involving T-DNA insertions, CRISPR/Cas9 and artificial miRNAs to study the role of FLAIL in flowering. A complementation series of constructs of the flail3 allele allowed them to show that the sense FLAIL lncRNA can act in trans. RNAseq revealed a small group of genes linked to the regulation of flowering whose expression is affected in the mutant and restored in the complementation line. To gain further insight into FLAIL function, the authors used a ChIRPeq approach to test whether the lncRNA can recognize potential target genes along the genome and they could show that FLAIL binds specific genomic regions. Clearly, this paper shows very nice evidence that the FLAIL lncRNA can act in trans to regulate gene expression. Nevertheless, there are certain points that need to be clarified to further support the action of the sense FLAIL transcript.

      1.According to Fig. 1 A, the antisense FLAIL is "internal" to the DNA genomic area spanning the sense FLAIL. Hence, with direct RT-qPCR is very difficult to distinguish between these molecules as a minor "RT" activity of the Taq polymerase may lead to detection of low levels of antisense, idem if RDRs may generate low antisense levels. Although I think that the plaNET seq brings strong evidence about the start and ends of these molecules, to measure them by RT-qPCR is not trivial and requires the use of strand-specific RT-PCR using a 5' extension of the oligo and amplification with one oligo of the FLAIL sequence (sense or antisense) and the added oligo.

      #2.1.1 Thanks for this good suggestion. We tested both sense and antisense FLAIL transcription using oligo linked gene specific reverse primers for RT and a pair of the linked oligo and gene specific forward primer for qPCR. Primer locations were shown in Fig. 1A and new data were in Fig. 1C-D, Fig. 2B-C, and Fig. S4B-C.

      It is not clear how they could distinguish precisely sense and antisense particularly when both RNAs correlate as it is the case here in all alleles (Fig. 1 C and 1D). This should be more explicitly mentioned in the materials and methods section.

      #2.1.2 We gave a description of strand specific RT-qPCR method in detail in Line 397-402.

      2.In Fig. 2, what are the levels of antisense in the complementing lines with the sense transcript? And reciprocally sense levels in antisense constructs?

      #2.2 We added this data in Fig. 2B-C and described in Line 136-143. We indeed observed that sense FLAIL transcripts in the transformed asFLAIL construct or asFLAIL transcripts in the transformed sense FLAIL construct was similar to the control 35S:GUS (Fig. 2B-C), validating that NOS terminator inhibits antisense transcripts. We also noted that the transformed 35S:GUS and sense FLAIL construct expressed higher asFLAIL compared to the flail3 mutant (Fig. 2C). This may be caused by a T-DNA insertion of the resulting transgenic plants.

      This will definitively demonstrate the assumption that the T-NOS termination will not allow any expression on the other strand. At present, only one of the lncRNAs is measured in each experiment?

      #2.3 We appreciate the next-level reflection of this reviewer, with so many regions initiating cryptic antisense transcription it is an interesting challenge to identify a 3´- terminator that initiates no or poor antisense transcription.

      First, previous published data argue that the NOS terminator is largely abolishing initiation of antisense transcription (PMID: 33985972, PMID: 30385760, PMID: 27856735). All these studies address roles of antisense transcription by generating mutations abolishing antisense lncRNA transcription using the NOS terminator sequences.

      Second, to satisfy the curiosity of this reviewer, we provide data below that from another manuscript of the lab in preparation. It’s a screenshot of plaNET-seq in fas2-4 NRPB2-FLAG nrpb2-1 mutant carrying a pROK2 construct. The pROK2 T-DNA coincidentally carries a NOS terminator. We mapped plaNET-seq reads to the pROK2 scaffold to display the reads. In pROK2, a NOS promoter activates NPTII expression (red) with NOS terminator as a terminator sequence. No antisense transcription (blue) is detectable by this sensitive method to detect nascent transcripts. Taken together, the selection of the NOS terminator as a region suppressing initiation of antisense transcription represents a valid choice.

      Fig. R2 Genome browser screenshot of plaNET-seq at NPTII locus of pROK2 T-DNA vector in fas2-4 NRBP2-FLAG nrpb2-1 mutant. This mutant carries a pROK2 construct, in which a NOS promoter activates NPTII expression with NOS terminator a terminator sequence. Sense strand was shown in red and antisense strand in blue. pROK2 annotation was shown at the bottom.

      3.In Fig. 3, it will be important to also show the FLAIL locus in the flail3 mutants (in comparison to the wt) as well as the transgene locus. Here the reads will be strand specific and furthermore this will allow to show that the transgene is not generating antisense transcripts (through RDRs for gene silencing?) and confirm that the sense FLAIL is required for the complementation.

      #2.4 Thank you very much for this suggestion. NGS reads for endogenous FLAIL and transgenic FLAIL both map to the FLAIL locus, so we show the FLAIL locus in Fig 3B. This representation shows that sense FLAIL transcripts were significantly reduced in flail3 and rescued in complementation line comparing to wild type. These data argue against the idea of gene silencing and linked antisense production from the transgene. However, RNA-seq suggests that an isoform of asFLAIL appears to accumulate in flail3. Since we fail to identify this accumulation by strand specific RT-qPCR result in flail3 and in CRISPR-deletion lines, this may be an asFLAIL isoform resulting from the T-DNA insertion.

      4.In Fig. S5, the expression of FLAIL is shown in the artificial miRNA lines. Is the antisense FLAIL affected "indirectly" by the cleavage of the amiRNA or remains constant? This is likely the case but should be shown.

      #2.5 We added this result in Fig. S4C and expression level of asFLAIL remains constant compared to the transformed empty vector control.

      5.The ChIRPseq data adds major novelty to the ms and brings new ideas about the way of action of FLAIL. However, are there any common epigenetic states between ChIRP targets (e.g. histone modifications, antisense RNA production, homologies "detected" in the conserved regions between Camelina and Arabidopsis and the target loci? Or others) that may highlight potential mechanisms leading to repression mediated by FLAIL of these loci? There are many databases that could be explored (even during flowering) to search for potential relationships. Although precise description of the mechanism is out of the scope of this ms, this can be discussed in more detail to further expand on the nice data obtained.

      #2.6 We searched for a consensus DNA motif on FLAIL binding regions by Homer. We determined four commonly enriched DNA sequence motifs in target genes. Notably, the target genes CIR1 and LAC8 contained consensus sequences that matched to all FLAIL binding motifs (Fig. 4G). These data are consistent with a model where FLAIL binds DNA targets through a sequence complementarity mechanism. Functionally important sequences are frequently conserved among evolutionarily distant species, we observed three motifs that appeared to cross-species conserved (Fig. S9), suggesting a potential evolutionarily constrained role.

      **Minor comments:**

      6.In Fig. S3, a global alignment between FLAIL and two loci in Arabidopsis and Camelina is sown. What is the extent of homology? How conserved is this sequence at nucleotide level (small or very long?) to support the conservation of this lncRNA. Are there potential structures conserved among these lncRNAs?

      #2.7 T____wo consensus regions of ____FLAIL____ sequences among eleven disparate Brassicaceae genomes were shown in Fig. S9. ____Camelina sativa_ shared 98-nucleotide_ conserved sequences with Arabidopsis thaliana. In the future, it will be interesting to explore evolutional conserved structures among Brassicaceae genomes. However, these analyses are beyond the scope of the current manuscript.

      7.In Fig. S4B, arrows may help to understand which seeds were selected.

      __#2.8 Thanks. Arrows were included.____

      __

      Reviewer #2 (Significance (Required)):

      This paper is a very nice piece of work and demonstrate the action of a long non-coding RNA (lncRNA) in trans on specific targets involved in the regulation of a developmental process, flowering. There is growing evidences that the non-coding genome hides large number of lncRNAs and there is little detailed genetic support for the action of lncRNAs globally. In contrast to many descriptive papers in the field, this ms demonstrates genetically, through an allelic series and complementation experiments, that this lncRNA locus is involved in flowering regulation and that its sense lncRNA recognizes target loci genome-wide, bringing interesting perspectives on potential new mechanisms of transcriptional regulation mediated by non-coding RNAs.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In the manuscript by Jin et al authors characterize the FLAIL DNA locus in Arabidopsis (using a wide array of publicly available datasets), which produces a set of sense and anti-sense lncRNAs.

      While our work on the FLAIL manuscript was ongoing we published the manuscripts where we presented these novel genomics methods and related data to capture nascent transcription and cryptic isoforms. We shared most data with TAIR, so we are happy to hear that these data are considered publically available.

      Authors determined that the sense FLAIL lncRNA (or a set of sense lncRNAs, which isn't fully clear from the way the data are presented) is involved in flowering time in Arabidopsis based on the fact that the several flail mutants lead to the early flowering phenotype and this flowering defect is complemented by transgenic FLAIL DNA, meaning that FLAIL lncRNA acts in trans.

      A series of experiments lead us to conclude that the sense isoform of FLAIL is responsible for the effect. We improved the data representation and writing of the manuscript to enhance accessibility.

      The T DNA flail3 - mutant results in expression changes (up or down) of 1221 genes, including twenty genes linked to flowering in various ways. Expression of a group of these flowering-related genes could be either fully (for eight genes) or partially (for five genes) rescued by transgenic FLAIL. Authors also conducted the ChIRP-seq to determine which genes are physically bound by FLAIL lncRNA genome-wide. It was found that 210 genes in the genome are bound by FLAIL lncRNA. Comparison of the dataset of differentially expressed genes in the T-DNA flail3 mutant with the ChIRP-seq dataset of genes that are bound by FLAIL lncRNA revealed the 12 overlapping genes.

      Among these twelve overlapping genes, four were found to be functionally connected to flowering with expression of these four genes being down in flail3 T-DNA mutant. Two out of these four genes were ruled out from being involved in the regulation of flowering by FLAIL. Authors conclude that the two other genes (Cir1 and Lac8) are responsible for the late flowering phenotype of flail mutants based on the three lines of evidence: (i) these genes expression is reduced in the flail mutant, (ii) FLAIL lncRNA directly interacts with these genes chromatin, (iii) the mutants of these genes were previously reported by others to display early flowering phenotypes too. While I find many of the findings reported in the manuscript very interesting, building a good foundation on which to expand the study and providing a very good leads for follow up experiments, I also have serious concerns about the manuscript in its current form.

      Most importantly, this reviewer doesn't think that the mechanism of FLAIL lncRNA action was convincingly demonstrated. The main question would be how FLAIL lncRNA works and this question wasn't fully answered. It is great that FLAIL lncRNA binds directly to the two flowering-related genes, but what does it mean? Does it change any chromatin context of these genes quantitatively or qualitatively to affect the transcription? Or does it bind any components of transcriptional machinery and thus controls the transcriptional output?

      #3.1 This manuscript addresses an important question in the field question: what is the evidence for functional elements in non-coding regions of genomes? Despite many efforts, convincing genetic support for these functions often remained limited. In addition to our strong genetic data, we provided new evidence that FLAIL recognizes targets with evolutionally conserved sequence motifs as part of the revision in Fig 4F and Fig. S9. Additionally, we plan to do ChIP-qPCR to identify histone modifications on FLAIL targets.

      Additionally, flail3 T-DNA mutant affects the expression of 1221 genes and FLAIL lncRNA physically interact with 210 genes, so how can authors be fully sure that FLAIL lncRNA has only direct effect on these two genes and doesn't also contribute to the regulation of the upstream to Cir1 and Lac8 genes or even components of the transcriptional machinery that regulate these genes?

      #3.2 We agree with this opinion. It is the reason why we felt stating this exact conclusion in our previous manuscript was justified. We improved accessibility of our manuscript in the revision, these clarify our model, that the trans-acting lncRNA sense FLAIL can interact with the chromatin regions of its target genes to directly or indirectly regulate gene expression changes involving flowering (Line 274).

      Theoretically, doing RNA-seq in the amiR-FLAIL sense lncRNA mutant might have a chance of reducing the number of affected DEGs, making it easier to analyze the FLAIL targets, even if the allele can't be used for complementation experiments.

      #3.3 Thanks for this suggestion. We plan to confirm key gene expression changes using amiRNA-FLAIL in full revision.

      Also, auhors totally neglect putting the Cir1 and Lac8 genes into the context of flowering regulation, but it is something that needs to be done.

      #3.4 ____We discussed roles of CIR1 and LAC8 in flowering regulation in Line 260-272. Flowering is fine-tuned to maximize reproductive success and seed production and by endogenous genetic cues and external environmental stimuli such as photoperiod. Nevertheless, many details of the flowering pathways and their integration remain to be investigated____. CIR1 is a circadian clock gene, induced by light and involved in a regulatory feedback loop that controls a subset of the circadian outputs and thus determines flowering time. Our GO analysis supports that a subset of DEGs are connected to the response to red or far red light that contains among other key flowering genes such as ____phytochrome interacting factor____ 4____ (PIF4) and CONSTANS (CO)____. FLAIL also binds the chromatin region of LAC8. LAC8 is a laccase family member that mainly modulates phenylpropanoid pathway for lignin biosynthesis____. Similar to flail, lac8 mutants flower early. While intermediates in this pathway or dysregulation of lignin-related genes could promote flowering in plants, the molecular connections of reduced LAC8 expression to effects on flowering time will require further investigation.

      Lastly, the paper needs to be totally rewritten to be even properly evaluated. In its current state it reads like a very short draft.

      #3.5 We reorganized the structure of manuscript, improved clarity and provided new mechanistic evidence in Fig. 4G and Fig. S9 to present a more complete manuscript.

      The Abstract is weak, the Introduction is written in a such telegraphic style that it is barely readable, in many places there is no connections between sentences leading to the information appear to be presented as random, even if it isn't.

      #3.6- We strengthened the Abstract by providing new evidence and improved for the Introduction.

      The Results section is written rather rudimentary with information not being sufficiently provided to describe the results but rather scattered between the Results and Figure legends.

      #3.7 Thanks for your suggestions, we described each FLAIL length and all constructs in detail in Results, put a schematic of T-DNA and CRISPR mutants in Fig. 1A, moved comparative genomics data to the end of Results and ensured all figures in order.

      The Discussion is the best written part of the manuscript.

      Thanks for your appreciation of the Discussion.

      The Conclusion section carries no specific information and reads more like a little summary suitable for a review article rather than experimental paper.

      #3.8 We agree this opinion, this paragraph fits Discussion better and Conclusion was removed.

      Therefore, this reviewer thinks that regardless of how authors will choose to proceed with the current experimental version of the manuscript, it'd be in the authors' best interests to at least fully revise the paper before resubmitting anywhere. I'd also advise authors to seek professional editorial help specifically using an editor with the background in the plant sciences.

      Authors might also want to consider moving Fig.3 into the Suppl. as it doesn't carry much weight or significance and perhaps make existing figures more meaningful and comprehensive and by including a better diagram of the locus (e.g., Fig. S1), etc.

      #3.9 We thank this helpful suggestion. Fig.3 represents the RNA-seq data. In combination with supporting data in the supplementary material, it gives an easy visual readout of the reproducibility of the findings in replicates of stranded RNA-seq. In a new submission, we moved it to Fig. S5B and highlighted 13 differentially expressed flowering genes as well as sense FLAIL in flail3 that were rescued in complementation line in Fig. 3A. Moreover, we gave screenshots of FLAIL itself and four flowering related FLAIL targets in RNA-seq with a clear schematic representation of each locus. We believe these revisions improve Fig. 3.

      It's not practical to list all issues with the writing as the paper requires total re-writing, so I can just make a few suggestions without any specific order to help authors improve the paper:

      We are happy to improve our manuscript with the help of the reviewers. We addressed all comments including from reviewer #3 with a constructive spirit. However, since colleagues and reviewers #1 and #2 found the manuscript comprehensible to the point where they could make expert-level comments that illustrate understanding of the manuscript, a total re-writing did not feel like the most constructive suggestion to improve the manuscript.

      --There is no statement anywhere that states the goal of the study.

      #3.10____ We stated the goal of the study in line 50-69 and we think this is a misunderstanding. We summarized three issues currently exist in characterization of functional lncRNA in the last sentence of the first three paragraphs in Background: 1 in Line 50, the broad range of candidate hypotheses by which lncRNA loci may play functional roles call for multiple approaches to distinguish alternative molecular mechanisms. 2 in Line 59, functional characterization of trans-acting lncRNAs remains a key knowledge gap to understand the regulatory contributions of the non-coding genome. 3 in Line 69, the contribution of trans-acting lncRNAs to the regulation of distant flowering genes is currently unclear. So in the last paragraph of the background, we claimed that our goals are to address these questions through characterization of functional FLAIL lncRNA in flowering repression using multiple genetic approaches and various genomic data.

      --No rational is provided on why authors decided to examine this specific genomic locus.

      #3.11 For several years, our lab studies the rules and roles of non-coding transcription. We characterized and are characterizing several loci with evidence of non-coding transcription in a range of species. Early experiments suggested that FLAIL functioned in flowering, this manuscript clarifies that the function is executed as trans-acting lncRNA of the sense FLAIL isoform.

      --Typically, the significance is in studying the function of lncRNA or a group of lncRNAs produced from a genomic locus, I don't think I ever encountered the instances when it was exciting to study just a specific genomic locus. If the locus does indeed have any significance for initiating the study, it needs to be explained.

      #3.12 This study is remarkable in many aspects. We fully discuss key strengths in the discussion. First, we ____exhibit a trans-acting lncRNA FLAIL that represses flowering by promoting the expression of floral repressor genes as discussed in Line 247-281_; Second, in Line 284-306, we informed that this study provide a compelling model about how to apply _series of convincing genetic data____ to functionally characterize lncRNA loci. Third, in Line 307-312, evolutionary conserved FLAIL sequences across species is key to characterize the functional _microhomology in other _Brassicaceae.

      --The locus can produce lncRNAs, but it can't harbor them.

      #3.13 We clarified this confusion by enhancing ____presentation of Fig. 1 and providing a more clear description of each sequencing method and results in Line 81-99. Although we provided evidence that transcription of both sense and antisense FLAIL are more stable in hen2-2, they were clearly observed in chr-DRS in wild type and plaNET-seq in NRBP2-FLAG nrpb2-1 and sense FLAIL was even detected in TSS-seq and TIF-seq in wild type.

      --No length of FLAIL lncRNAs or their range is provided in the first section of Results.

      #3.14 We gave the length of sense FLAIL in Line 82 and antisense FLAIL in Line 86.

      --On many occasions authors don't state rational for doing experiments, which leads to information often flowing as random.

      #3.15 we enhanced clarity of the rational for each experiment and made some connections between sentences to make more fluent. For example, in sentences in Line 99, Line 113, Line 126, Line 159, Line 183, Line 214, and Line 219.

      --What do authors mean by the subtitle "FLAIL characterizes a trans-acting lncRNA repressing flowering"? How can lncRNA FLAIL or FLAIL locus characterize lncRNA?

      #3.16 We changed it to “FLAIL represses flowering as trans-acting lncRNA” in Line 112.

      --Check all figures. E.g., Fig. 3B-E mentions only accession numbers for the genes.

      #3.17 The systematic gene IDs are a valid way to represent data, in particular for genomics data since it facilitates cross-comparisons. To make it more accessible we also show systematic names of each gene in Fig. 3A-F, Fig. S6 and Table S3.

      --It is not clear where exactly the T-DNA insertion is located relative to sense FLAIL in flail3 mutant (Fig. S4).

      #3.18 We moved the schematic to clarify this to revised Fig. 1A and the exact T-DNA insertion site is mentioned in the legend.

      --- What is the length of the complementing sense FLAIL lncRNA?

      #3.19 We now include the length of the complementing sense and antisense FLAILs in Line 351-352.

      --Check the description of each and every construct used and provide explanation for each in the Results. E.g., the pFLAIL:gFLAIL18/88 and pasFLAIL:gasFLAIL18/39 constructs aren't explained in Results, and can only be found in Fig. 2 legends.

      #3.20 We described each construct including pFLAIL:gFLAIL18/88 and pasFLAIL:gasFLAIL18/39 constructs in Line 133, amiR-FLAIL-11 and amiR-FLAIL-11 in Line 149.

      Reviewer #3 (Significance (Required)):

      Tens of thousands of lncRNAs have been identified in various eukaryotes, but their biological roles have been shown only for a small fraction of them, and the mechanisms of their action are delineated for only a very few of them. Most of the advances on the field of lncRNAs are reported in metazoan, while the field of lncRNAs in plants is lagging far behind in terms of knowledge about lncRNAs with assigned biological functions or lncRNAs with delineated mechanisms of action. From this point of view, this reviewer is always excited to see any new functional plant lncRNAs for which either biological or mechanistic functions have been determined, and deems the information on this subject significant. The manuscript's findings are potentially very interesting and present a decent body of work that lays a very solid groundwork for future experiments. My main concern about the manuscript's significance in its current form is the fact that no real solid mechanism of action for the described lncRNA or a set of lncRNAs (?) has been demonstrated. The best mechanistically studied lncRNAs in Arabidopsis are involved in the regulation of flowering time, particularly those that function in the vernalization flowering pathway and to lesser extent in autonomous pathway. The new FLAIL lncRNA or lncRNAs (?) described in this manuscript also appear to regulate the flowering time in Arabidopsis, however more experiments would be needed to provide a definite conclusion about how direct FLAIL's effect is and how exactly it functions. That unfortunately obviously diminishes the significance of the manuscript and makes it potentially interesting only to researches studying flowering in Arabidopsis and even then the manuscript results would be incomplete to make solid conclusion.

      Lots of functional phenotype have

      Additionally, the manuscript requires complete re-writing.

      We thank this reviewer for the appreciation of __a decent body of work and a very solid groundwork for future experiments. We are confident that our revisions make the manuscript more comprehensible to highlight the qualities of our manuscript more accessibly.____

      __

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In the manuscript by Jin et al authors characterize the FLAIL DNA locus in Arabidopsis (using a wide array of publicly available datasets), which produces a set of sense and anti-sense lncRNAs. Authors determined that the sense FLAIL lncRNA (or a set of sense lncRNAs, which isn't fully clear from the way the data are presented) is involved in flowering time in Arabidopsis based on the fact that the several flail mutants lead to the early flowering phenotype and this flowering defect is complemented by transgenic FLAIL DNA, meaning that FLAIL lncRNA acts in trans. The T-DNA flail3 mutant results in expression changes (up or down) of 1221 genes, including twenty genes linked to flowering in various ways. Expression of a group of these flowering-related genes could be either fully (for eight genes) or partially (for five genes) rescued by transgenic FLAIL. Authors also conducted the ChIRP-seq to determine which genes are physically bound by FLAIL lncRNA genome-wide. It was found that 210 genes in the genome are bound by FLAIL lncRNA. Comparison of the dataset of differentially expressed genes in the T-DNA flail3 mutant with the ChIRP-seq dataset of genes that are bound by FLAIL lncRNA revealed the 12 overlapping genes.

      Among these twelve overlapping genes, four were found to be functionally connected to flowering with expression of these four genes being down in flail3 T-DNA mutant. Two out of these four genes were ruled out from being involved in the regulation of flowering by FLAIL. Authors conclude that the two other genes (Cir1 and Lac8) are responsible for the late flowering phenotype of flail mutants based on the three lines of evidence: (i) these genes expression is reduced in the flail mutant, (ii) FLAIL lncRNA directly interacts with these genes chromatin, (iii) the mutants of these genes were previously reported by others to display early flowering phenotypes too. <br /> While I find many of the findings reported in the manuscript very interesting, building a good foundation on which to expand the study and providing a very good leads for follow up experiments, I also have serious concerns about the manuscript in its current form. Most importantly, this reviewer doesn't think that the mechanism of FLAIL lncRNA action was convincingly demonstrated. The main question would be how FLAIL lncRNA works and this question wasn't fully answered. It is great that FLAIL lncRNA binds directly to the two flowering-related genes, but what does it mean? Does it change any chromatin context of these genes quantitatively or qualitatively to affect the transcription? Or does it bind any components of transcriptional machinery and thus controls the transcriptional output? Additionally, flail3 T-DNA mutant affects the expression of 1221 genes and FLAIL lncRNA physically interact with 210 genes, so how can authors be fully sure that FLAIL lncRNA has only direct effect on these two genes and doesn't also contribute to the regulation of the upstream to Cir1 and Lac8 genes or even components of the transcriptional machinery that regulate these genes? Theoretically, doing RNA-seq in the amiR-FLAIL sense lncRNA mutant might have a chance of reducing the number of affected DEGs, making it easier to analyze the FLAIL targets, even if the allele can't be used for complementation experiments. Also, authors totally neglect putting the Cir1 and Lac8 genes into the context of flowering regulation, but it is something that needs to be done. Lastly, the paper needs to be totally rewritten to be even properly evaluated. In its current state it reads like a very short draft. The Abstract is weak, the Introduction is written in a such telegraphic style that it is barely readable, in many places there is no connections between sentences leading to the information appear to be presented as random, even if it isn't. The Results section is written rather rudimentary with information not being sufficiently provided to describe the results but rather scattered between the Results and Figure legends. The Discussion is the best written part of the manuscript. The Conclusion section carries no specific information and reads more like a little summary suitable for a review article rather than experimental paper.

      Therefore, this reviewer thinks that regardless of how authors will choose to proceed with the current experimental version of the manuscript, it'd be in the authors' best interests to at least fully revise the paper before resubmitting anywhere. I'd also advise authors to seek professional editorial help specifically using an editor with the background in the plant sciences. Authors might also want to consider moving Fig.3 into the Suppl. as it doesn't carry much weight or significance and perhaps make existing figures more meaningful and comprehensive and by including a better diagram of the locus (e.g., Fig. S1), etc.

      It's not practical to list all issues with the writing as the paper requires total re-writing, so I can just make a few suggestions without any specific order to help authors improve the paper:

      --There is no statement anywhere that states the goal of the study.

      --No rational is provided on why authors decided to examine this specific genomic locus.

      --Typically, the significance is in studying the function of lncRNA or a group of lncRNAs produced from a genomic locus, I don't think I ever encountered the instances when it was exciting to study just a specific genomic locus. If the locus does indeed have any significance for initiating the study, it needs to be explained.

      --The locus can produce lncRNAs, but it can't harbor them.

      --No length of FLAIL lncRNAs or their range is provided in the first section of Results.

      --On many occasions authors don't state rational for doing experiments, which leads to information often flowing as random.

      --What do authors mean by the subtitle "FLAIL characterizes a trans-acting lncRNA repressing flowering"? How can lncRNA FLAIL or FLAIL locus characterize lncRNA?

      --Check all figures. E.g., Fig. 3B-E mentions only accession numbers for the genes.

      --It is not clear where exactly the T-DNA insertion is located relative to sense FLAIL in flail3 mutant (Fig. S4). --- What is the length of the complementing sense FLAIL lncRNA?

      --Check the description of each and every construct used and provide explanation for each in the Results. E.g., the pFLAIL:gFLAIL18/88 and pasFLAIL:gasFLAIL18/39 constructs aren't explained in Results, and can only be found in Fig. 2 legends.

      Significance

      Tens of thousands of lncRNAs have been identified in various eukaryotes, but their biological roles have been shown only for a small fraction of them, and the mechanisms of their action are delineated for only a very few of them. Most of the advances on the field of lncRNAs are reported in metazoan, while the field of lncRNAs in plants is lagging far behind in terms of knowledge about lncRNAs with assigned biological functions or lncRNAs with delineated mechanisms of action. From this point of view, this reviewer is always excited to see any new functional plant lncRNAs for which either biological or mechanistic functions have been determined, and deems the information on this subject significant. The manuscript's findings are potentially very interesting and present a decent body of work that lays a very solid groundwork for future experiments. My main concern about the manuscript's significance in its current form is the fact that no real solid mechanism of action for the described lncRNA or a set of lncRNAs (?) has been demonstrated. The best mechanistically studied lncRNAs in Arabidopsis are involved in the regulation of flowering time, particularly those that function in the vernalization flowering pathway and to lesser extent in autonomous pathway. The new FLAIL lncRNA or lncRNAs (?) described in this manuscript also appear to regulate the flowering time in Arabidopsis, however more experiments would be needed to provide a definite conclusion about how direct FLAIL's effect is and how exactly it functions. That unfortunately obviously diminishes the significance of the manuscript and makes it potentially interesting only to researches studying flowering in Arabidopsis and even then the manuscript results would be incomplete to make solid conclusion. Additionally, the manuscript requires complete re-writing.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In this ms, the authors identified the FLAIL lncRNA that represses flowering in Arabidopsis from a locus producing sense and antisense transcripts. They use an allelic series involving T-DNA insertions, CRISPR/Cas9 and artificial miRNAs to study the role of FLAIL in flowering. A complementation series of constructs of the flail3 allele allowed them to show that the sense FLAIL lncRNA can act in trans. RNAseq revealed a small group of genes linked to the regulation of flowering whose expression is affected in the mutant and restored in the complementation line. To gain further insight into FLAIL function, the authors used a ChIRPeq approach to test whether the lncRNA can recognize potential target genes along the genome and they could show that FLAIL binds specific genomic regions. Clearly, this paper shows very nice evidence that the FLAIL lncRNA can act in trans to regulate gene expression. Nevertheless, there are certain points that need to be clarified to further support the action of the sense FLAIL transcript.

      1.According to Fig. 1 A, the antisense FLAIL is "internal" to the DNA genomic area spanning the sense FLAIL. Hence, with direct RT-qPCR is very difficult to distinguish between these molecules as a minor "RT" activity of the Taq polymerase may lead to detection of low levels of antisense, idem if RDRs may generate low antisense levels. Although I think that the plaNET seq brings strong evidence about the start and ends of these molecules, to measure them by RT-qPCR is not trivial and requires the use of strand-specific RT-PCR using a 5' extension of the oligo and amplification with one oligo of the FLAIL sequence (sense or antisense) and the added oligo. It is not clear how they could distinguish precisely sense and antisense particularly when both RNAs correlate as it is the case here in all alleles (Fig. 1 C and 1D). This should be more explicitly mentioned in the materials and methods section.

      2.In Fig. 2, what are the levels of antisense in the complementing lines with the sense transcript? And reciprocally sense levels in antisense constructs? This will definitively demonstrate the assumption that the T-NOS termination will not allow any expression on the other strand. At present, only one of the lncRNAs is measured in each experiment?

      3.In Fig. 3, it will be important to also show the FLAIL locus in the flail3 mutants (in comparison to the wt) as well as the transgene locus. Here the reads will be strand specific and furthermore this will allow to show that the transgene is not generating antisense transcripts (through RDRs for gene silencing?) and confirm that the sense FLAIL is required for the complementation.

      4.In Fig. S5, the expression of FLAIL is shown in the artificial miRNA lines. Is the antisense FLAIL affected "indirectly" by the cleavage of the amiRNA or remains constant? This is likely the case but should be shown.

      5.The ChIRPseq data adds major novelty to the ms and brings new ideas about the way of action of FLAIL. However, are there any common epigenetic states between ChIRP targets (e.g. histone modifications, antisense RNA production, homologies "detected" in the conserved regions between Camelina and Arabidopsis and the target loci? Or others) that may highlight potential mechanisms leading to repression mediated by FLAIL of these loci? There are many databases that could be explored (even during flowering) to search for potential relationships. Although precise description of the mechanism is out of the scope of this ms, this can be discussed in more detail to further expand on the nice data obtained.

      Minor comments:

      6.In Fig. S3, a global alignment between FLAIL and two loci in Arabidopsis and Camelina is sown. What is the extent of homology? How conserved is this sequence at nucleotide level (small or very long?) to support the conservation of this lncRNA. Are there potential structures conserved among these lncRNAs?

      7.In Fig. S4B, arrows may help to understand which seeds were selected.

      Significance

      This paper is a very nice piece of work and demonstrate the action of a long non-coding RNA (lncRNA) in trans on specific targets involved in the regulation of a developmental process, flowering. There is growing evidences that the non-coding genome hides large number of lncRNAs and there is little detailed genetic support for the action of lncRNAs globally. In contrast to many descriptive papers in the field, this ms demonstrates genetically, through an allelic series and complementation experiments, that this lncRNA locus is involved in flowering regulation and that its sense lncRNA recognizes target loci genome-wide, bringing interesting perspectives on potential new mechanisms of transcriptional regulation mediated by non-coding RNAs.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The authors characterized a new lncRNA locus named FLAIL that controls flowering time in Arabidopsis thaliana. The functional validation of this locus is strongly supported by the use of several different tools (CRISPR-Cas9 deletions, T-DNA insertion, amiRNA gene silencing, and transgene complementation of KO lines). It is also suggested that FLAIL lncRNA works in trans but not in cis. There are strong observations supporting that FLAIL works in trans.

      Moreover, it is suggested that FLAIL regulates gene expression by interacting with distant chromatin loci. This was assessed using RNA-Seq and ChIRP-Seq. Yet, the overlap between DEGs in the flail mutant and FLAIL binding sites at the chromatin is very small, with only 12 genes. From those, only 2 flowering genes' expression was rescued by FLAIL transgene complementation. The final conclusion that FLAIL lncRNA represses flowering by direct inhibition of the 2 flowering genes expression is correlative, and lacks genetic validation. In addition inspection of the supplementary file shows that the ChIRP analysis was done without filtering for the FDR so that some of the positive hits have an FDR of 0,232. In addition, many of the peaks land in intergenic regions with is not mentioned in the text a graph with the position of the peaks in respect to nearby genes would help.

      In one sentence, the authors used the right model system and methodology, including advanced techniques, to characterize a new trans-acting lncRNA important for controlling the flowering time in Arabidopsis but lack evidence supporting a mechanism of action that goes beyond the interaction with several chromatin loci.

      minor points:

      line#63-64 the authors say the COLDAIR and ASL work on FLC in cis in my view the original papers suggested/showed they work in trans.

      Fig 1B please add some more protein-coding RNAs for the bio-info analysis for comparison

      Order of Supplementary Fig citation is mixed with S2 coming before S1B

      It would help the reader to have a schematic of the crisper deletions, T-DNA insertion, and position of primers used for the RT-qPCR.

      In the supplementary PDF file, some of the text is missing on page 3 beginning and end of lines.

      Significance

      The use of several different tools to validate the biological function of FLAIL locus is a major strength of this work.

      The authors propose that flowering time and its gene regulation are controlled by sense FLAIL lncRNAs. However, the sense transcription of FLAIL locus is not detected in wild-type plants by TSS-Seq, TIF-Seq, or plaNET-Seq. If the authors would have explored further the expression of FLAIL transcripts in different stages of development (vegetative and non-vegetative) and in response to different conditions, it would make their claims on the function of FLAIL lncRNAs more convincing. Additionally, flail mutants could have been obtained in the hen-2 background, since it's there where we can observe FLAIL transcription.

      FLAIL locus lays on the proximal promoter region of PORCUPINE (PCP), an important regulator of plant development. As flail mutants, pcp mutants display an early flowering phenotype. The authors show no link between FLAIL and PCP from the overlap between re-analysis of published RNA-Seq data for pcp and RNA-Seq and ChIRP-Seq from the authors. This analysis is not enough to exclude the involvement of PCP from the FLAIL function. PCP expression using RT-qPCR should be performed in flail mutants to further support that FLAIL works independently from PCP.

      This work does not hypothesize any molecular mechanism besides the interaction of FLAIL lncRNAs with several chromatin loci. It was recently proposed in Arabidopsis that a trans-acting lncRNA interacts with distant loci via the formation of R-loops. The authors do not comment on that. This work would benefit in correlating FLAIL binding sites with R-loop-forming regions mapped in Arabidopsis, regardless of the results from this analysis. Additionally, the authors could attempt to look for a motif responsible for FLAIL binding.

      Most of the key conclusions are convincing, except for the flowering time control directly through CIR1 and LAC8, which should be mentioned as speculative

      The words locus and loci are latin and they should be written in italic. The word Brassicaceae, referring to the family should be in italic, and should not be "Brassicaceaes". The word analysis has the wrong spelling.

      I was asked "How much time do you estimate the authors will need to complete the suggested revisions: this is difficult to answer as it depends to which level the author would like to take their work. In my view, if all new experiments would have to be started from scratch it is too far away to be estimated.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2021-01118

      Corresponding author(s): Jun, Nakayama and Kentaro, Semba

      1. General Statements

      We are grateful to all of the reviewers for their critical comments and insightful suggestions that have helped us considerably improve our paper. As indicated in the responses that follow, we have taken all of these comments and suggestions into account in the revised version of our paper, including the supplementary information.

      In the revised manuscript, we focus on the existence of two cancer stem cell-like populations in TNBC xenograft model and patients. The response to each reviewer is described below.

      Sincerely,

      Jun Nakayama

      Kentaro Semba

      Department of Life Science and Medical Bioscience

      School of Advanced Science and Engineering

      Waseda University

      E-mail: junakaya@ncc.go.jp or jnakayama.re@gmail.com to JN

      ksemba@waseda.jp to KS

      2. Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity (Required)): * **Summary:** Nakayama and colleagues use their previously developed automated tissue microdissection punching platform to perform spatial transcriptomics on a breast cancer xenograft model. Using transcriptomics on multiple clumps of 10-30 cells from different regions in a tumor and a lymph node metastasis they identified different cell-type clusters. Two of these clusters expressed different cancer stem cell markers. This led the authors to suggest that two distinct cancer stem cell(-like) populations may exist within one (breast) tumor, which could potentially make tumors more drug-resilient.

      **Major comments:** While the quality of the presented sequencing data is good and the manuscript is mostly written in a clear and accessible style, there are some concerns that limit the impact of this story. Most importantly, the manuscript in its present form does not convince me that the MDA-MB-231 xenografts indeed contain two distinct populations of cancer stem(-like) cells.

      1.The data obtained are not single cell data, which makes it difficult -if not impossible- to draw conclusions about presence of cancer stem cells. Each data point is the average of 10-30 cells, and the interpretation of the data is severely limited by this. How can the quantification of expression of CD44/MYC/HMGA1 in clumps of 10-30 cells teach us something about the stemness of tumor cells? *

      Answer: We would thank the comment. The reviewer’s suggestion is an important point; however, this is technical limitation of spatial transcriptomics technology. Most advanced spatial transcriptomics technologies, e.g. Visium (10x Genomics), also have the same problem. It means that our technology and the advanced technologies are technics to analyze gene expression and characteristics of tissues from 10-30 cells in each spot. Although high resolution spatial transcriptomics has been developed in 2021 [1], it is not generally used yet as described in the comment (Significance) from reviewer1.

      From our spatial analysis, we identified that CD44, MYC, and HMGA1 were expressed from human cancer cell. Their expression profiles were distinct among specific parts of the tumor section. To validate the existence of two types of cancer stem-like cells in TNBC tumors, we performed the additional analysis with the public scRNA-seq datasets of high-metastatic MDA-MB-23-LM2 xenograft model (GSE163210) [2]. This study performed scRNA-seq analysis of primary tumor and circulating tumor cells in MDA-MB-231-LM2 xenograft model. We analyzed it with Seurat/R (Figure A-1). As a result of reanalysis, HMGA1 and CD44 expression were confirmed at single-cell resolution (Figure A-2,3). These results verified the existence of two cancer stem cell-like populations (HMGA1-high, CD44-high) in MDA-MB-231 xenograft. Hence, the study of MDA-MB-231 xenograft supported our findings from spatial transcriptomics.

      Additionally, we performed the immuno-staining of sections using anti-CD44 antibody and anti-HMGA1 antibody as described in reviewer’s comment 5. As a result, CD44 and HMGA1 were detected in primary tumor sections. There were cells that express either CD44 or HMGA1 and cells that co-express both CD44 and HMGA1 (Figure B). We believe that our findings are solid results because the findings were also validated by other methods.

      In the revised manuscript, Figure A are incorporated as Figure 3B-E. Figure B is incorporated as Figure 3A. Hope our new results will be now accepted by the learned Reviewer and Editor.

      Figure A-1. Reanalysis of scRNA-seq of metastatic MDA-MB-231 xenograft

      Flowchart of the public single-cell RNA-seq (scRNA-seq) reanalysis using GSE163210 datasets.

      Figure A-2. UMAP plots of xenograft and CD44/HMGA1 expression

      UMAP plot of MDA-MB-231-LM2 xenograft tumors and circulating tumor cells (Left). Expression of CD44 and HMGA1 in the UMAP plot (Right).

      Figure A-3. Pie chart of CD44/HMGA1 positive cancer cells in MDA-MB-231 xenograft

      Pie chart of cancer stem cell-like population ratio in MDA-MB-231-LM2 xenografts.

      Figure B. Fluorescent immuno-staining of MDA-MB-231 primary tumor

      Representative images immunostained with CD44 and HMGA1 in primary tumor sections of the MDA-MB-231 xenograft model. Red: HMGA1, Green: CD44, and Blue: Nucleus. Scale bars, 20 μm (left), 10 μm (right). White arrows represent cancer cells that independently expressed or co-expressed.

      * 2.Furthermore, the authors should better explain their data analysis strategy with identification of gene expression profiles. It is unclear how they found CD44, MYC, and HMGA1 other than by cherry-picking from the list of cluster markers. *Answer: In this research, to identify the characteristics of clusters, we analyzed differentially expressed genes (DEGs) by ‘FindAllMarkers’ function of Seurat. As a result, ‘Cluster 0’ significantly expressed HMGA1 gene, and ‘cluster 1’ significantly expressed CD44. HMGA1 and CD44 are popular cancer stem cell markers in triple-negative breast cancer [3, 4]. In this study, we focus on metastasis-related genes and cancer stem cell markers (described in introduction section). Therefore, we focus on cancer-stem cell markers in the presented study. Cancer stemness is an important concept in cancer metastasis [5-7]. These results suggested that the existence of two cancer stem cell-like populations could potentially make tumors more drug-resilient in xenograft models and clinical patients.

      To improve the manuscript, we revised the description in the revised manuscript (Pages 5-6, Lines 97-105).

      * 3.Following up on the above point: I looked in the supplementary tables, but couldn't find MYC. How did the authors conclude that MYC is involved in cluster 1? In fact, when I ran a quick analysis in EnrichR, I saw that putative MYC target genes were strongly enriched among the markers in the HMGA1 cluster, but not the CD44/MYC. That's opposite to what I would expect. *__Answer: __We apologize for our confusing data and description. First, we found the expression of CD44 and HMGA1 in each cluster. Therefore, we performed the up-stream enrichment analysis using gene signatures of FindAllMakers by Metascape. From the result of enrichment analysis, we found the MYC activation in CD44 high-cluster; therefore, we named the cluster “CD44/MYC-high” cluster.

      To improve the manuscript, we revised the Figure2, Supplementary Table S3, and manuscript (Pages 5-6, Lines 103-106).

      * 4.All data were produced from 1 primary tumor and 1 metastasis. Thus, reproducibility and robustness of the methodology cannot be evaluated. The interpretation of the data could be strengthened when xenografts from at least 3 different mice are shown. *__Answer: __We would thank the suggestion. As the reviewer’s comment, we performed 1 primary tumor and 1 metastasis lesion from a transplanted mouse. Since this experiment take a long time, we tried to validate the findings by other methods (Figure A: scRNA-seq analysis of MDA-MB-231 xenografts, Figure B: Immuno-staining of MDA-MB-231 primary tumor, Figure C: scRNA-seq analysis of TNBC patients).

      First, we reanalyzed the public dataset which performed single-cell RNA-seq analysis of MDA-MB-231 xenografted tumor and circulating tumor cells in immunodeficient mice as shown in the answer to comment 1 (Figure A). Next, we performed the immuno-staining of sections using anti-CD44 antibody and anti-HMGA1 antibody as described in reviewer’s comment 5. As results, CD44 and HMGA1 were detected in primary tumor sections. There were cells that express either CD44 or HMGA1 and cells that co-express both CD44 and HMGA1 (Figure B). Next, we performed the reanalysis of 19 scRNA-seq samples from integrated 3 TNBC cohorts (Figure C-1). In a UMAP plot, differences between CD44-positive cancer cell and HMGA1-positive cancer cell were observed; however, these cells did not visually form the specific clusters (Figure C-2). CD44 and HMGA1 expressed globally in the UMAP plot, but CD44 makes some specific clusters (cluster at right side). Additionally, following the comment, we performed the population analysis in each patient (Figure C-3 and C-4). Detection of double-positive population in TNBC patients suggested that the population may be more undifferentiated cancer stem cells diving into both CD44-positive cells and HMGA1-positive cells.

      In addition, we reanalyzed primary tumors and metastasis lesions from other mice as a test trial sample (Figure D-1). The microspots including test trial samples showed 3 human clusters which were classified into CD44/MYC, HMGA1, and Marker-low clusters. We believe that our findings are solid results because the findings were also validated by other methods.

      In the revised manuscript, Figure A are incorporated as Figure 3B-E. Figure B is incorporated as Figure 3A. Figure C is incorporated as Figure 5. We only showed Figure D in the response to the reviewer’s comment. Hope our new results will be now accepted by the learned Reviewer and Editor.

      Figure C-1. Reanalysis of integrated TNBC patients scRNA-seq

      A flowchart of the reanalysis of a public scRNA-seq dataset. We downloaded GSE161529, GSE176078, and GSE180286 (scRNA-seq data of 19 TNBC patients). Integrated datasets were analyzed with Seurat. Log normalization, scaling, PCA and UMAP visualization were performed following the basic protocol in Seurat. To extract the cancer cells, cells expressing EPCAM/KRT8 (epithelial marker) were filtered. A UMAP plot of cancer cell from 19 TNBC patients (right).

      Figure C-2. CD44/HMGA1 expression in TNBC patients

      Expression analysis of CD44 (Expression level > 2) and HMGA1 (Expression level > 2) with UMAP plots.

      Figure C-3. CD44/HMGA1-positive cancer cell with UMAP plot

      UMAP plots of CD44-high, HMGA1-high, HMGA1/CD44-high, and Negative cancer cells.

      Figure C-4. Ratio of CD44/HMGA1-positive cancer cell in each patient

      The bar plot showed the ratio of cancer cells that expressed CD44 and HMGA1.

      Figure D-1. Analysis of microspots of MDA-MB-231 xenografts including test trial samples

      UMAP plots of CD44-high, HMGA1-high, and Marker-low clusters with test trial samples (2 primary tumors and 1 lung metastasis). ‘Primary tumor 1’ has 20 microspots, ‘Primary tumor 2’ has 24 microspots, and ‘lung metastasis’ has 7 microspots. Most microspots of lung metastasis failed extraction of RNA; therefore, these spots classified into Marker-low cluster.

      Figure D-2. Expression analysis of CD44, HMGA1, and MYC

      Feature plot of CD44-high, HMGA1-high, and Marker-low clusters with test trial samples.

      * 5.The only methodology is single cell RNA-sequencing. Immuno-staining on relevant markers such as CD44, MYC, HMGA1 plus human epithelium and cell cycle markers would provide strong additional support for the claims made by the authors, because it's a complementary technique and it allows quantification at single cell resolution. *__Answer: __We would thank the comment. As described in the responses to the reviewer’s comment 1 and 4, we performed the immuno-staining of sections using anti-CD44 antibody and anti-HMGA1 antibody as described in reviewer’s comment 5. As a result, CD44 and HMGA1 were detected in primary tumor sections. There were cells that express either CD44 or HMGA1 and cells that co-express both CD44 and HMGA1 (Figure B).

      In the revised manuscript, Figure B is incorporated as Figure 3A.

      * 6.Line 173-175. The marker-low cluster look to me simply like spots containing a relatively high amount of dead/dying (tumor) cells. The identity/state of cells in the marker-low cluster should be characterized and discussed more extensively. *__Answer: __We would thank the comment. This suggestion is important. In fact, total count of RNA in the Marker-low cluster decreased as compared to HMGA1-high and CD44/MYC-high (Supplementary Figure S1B). Additionally, Ttr-high mouse cluster also has low total count of RNA (Supplementary Figure S1C).

      Following the comment, we described that the Marker-low cluster and Ttr-high cluster have the possibility to include dead/dying cells (Page 13, Lines 268-279).

      * 7.Figure 5 and accompanying text in line 182-194; the authors try to infer cell-to-cell interactions using a previously published tool. However, any biological interpretation is lacking. What can be concluded from this analysis? *__Answer: __Initially, algorithms of cell-to-cell interaction were reported with previously published tool [8, 9]; however, in this manuscript, we originally conducted the code for cell-to-cell interaction with the interaction database of the Bader laboratory from Toronto University (https://baderlab.org/CellCellInteractions#Download_Data) as previously described [10, 11]. We aimed to estimate the cell-to-cell interaction in each spot (including 10-30 cells). We think that this analysis will be helpful for discovering the cancer stem cell niche and metastatic niche [6].

      However, in the revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients. Therefore, CCI analysis in previous Figure 5 moved to Supplementary Figure S7. Previous Figure 6 is removed from revised manuscript.

      * 8.Figure 6. Can the authors please explain more clearly what they mean by "PT" and "Mix" groups? I had a very hard time to understand what the data in figure mean. Again, an overall interpretation at the end (line 211) is lacking. *__Answer: __We apologize for the confusing result. We examined the combinations of human cancer cell cluster and mouse stromal cell cluster. To summarize, there are 10 combinations in the MDA-MB-231 xenograft. The combination groups in only primary tumor were named “PT”; on the other hand, the combination groups in both primary tumor and lymph-node metastasis were named “Mix”. These CCI analysis focused on cluster types of cancer cell and stromal cell. However, according to this revision, our presented study mainly focuses on the existence of two types of cancer stem cell-like population in TNBC xenograft and patients. Therefore, CCI analysis with cluster types was deleted from revised manuscript.

      In the revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients. Previous Figure 6 was removed from the revised manuscript.

      * 9.Figure 7. I like the effort to align the results with public scRNA-seq data. But although the expression of the cluster-signatures is heterogeneous, there is no evidence for distinct (CSC-like) cell populations. Why don't these HMGA1 vs CD44 signature cells cluster away from each other in the UMAPs? Perhaps the patient-to-patient heterogeneity overwhelms differences within tumors, but in that case the authors could re-run their analysis for each patient separately, to make 6 patient-specific UMAPs. In its present form, this analysis does not convince me that two distinct CSC(-like) populations within one TNBC exist. *Answer: We would thank the comment. To improve the quality of reanalysis of clinical cohorts, we performed the reanalysis of 19 scRNA-seq samples from integrated 3 TNBC cohorts (Figure C-1). In a UMAP plot, there are differences between CD44-positive cancer cells and HMGA1-positive cancer cells; however, these cells did not visually form the specific clusters (Figure C-2). CD44 and HMGA1 were expressed globally in the UMAP plot, but CD44 made some specific clusters (cluster at right side). Additionally, following the comment, we performed the population analysis in each patient (Figure C-3 and C-4). There is double-positive population in TNBC patients suggesting that this population may be more undifferentiated cancer stem cells, dividing into both CD44-positive cells and HMGA1-positive cells.

      In the revised manuscript, Figure C is incorporated as Figure 5.

      * **Minor comments:** 10.In the Supplemental table 2 noticed that many of the marker genes have adjusted P values well above 0.05 (and even above 0.1). That makes the statistical analysis rather weak. This could especially be problematic since the authors entirely base their main claims on this marker analysis, and I recommend that the authors use more stringent P-value cut-offs in the cluster analysis. *Answer: We would thank the comment. We reshaped the list of differentially expressed genes (DEGs). Significantly expressed genes (adjusted p-value In mouse clusters, the enrichment analysis using significantly DEGs showed that only Tcell-like clusters had a lot of enriched terms. Citric acid (TCA) cycle, chemical stress response, and fatty acid oxidation were enriched in Tcell-like populations (Page 7, Lines 141-144).

      In the revised manuscript, enrichment analyses are showed as Supplementary Figure S2 and S3B. We revised the sentence of enrichment analyses (Page 6, Lines 114-121), (Page 7, Lines 141-144). The network visualization of enrichment analysis was removed from the revised manuscript because this result did not support conclusions of the presented study.

      * 11.Line 129/130. If I look at figure 3A, I don't see this tendency that the authors describe. Can the authors provide statistical support or visual aid to make their claim more apparent to the reader? *__Answer: __We would thank the suggestion. Following the comment, we performed the statistical analysis of spot position. The spots were categorized outer side (tumor edge) and Inner site (Center of tumor) in the primary tumor section (Figure E-1 upside). We counted the spot numbers of the clusters (Figure E-1 table) and performed statistical test by chi-test. As a result, CD44/MYC clusters significantly resided at outer side of primary tumor (Figure E-1 barplot). On the other hand, the spots in lymph-node metastasis are not readily defined the outer or inner. In addition, cell cycle analysis in the primary tumor and lymph node metastasis was performed with statistical test. As a result, HMGA1-high cluster and CD44/MYC-high cluster significantly proliferated in the lymph node metastasis section (Figure E-2).

      Therefore, in the revised manuscript, we revised the sentence of spot position in lymph-node metastasis (Pages 8-9, Lines 159-172). Figure E-1 is incorporated as Figure 4D. Figure E-2 is incorporated as Figure 4F. Hope our new results will be now accepted by the Reviewer and Editor.

      Figure E-1. Statistical analysis of spot position

      Chi-test was performed by R. *p Figure E-2. Statistical analysis of cell cycle index

      Fisher’s exact test was performed by R. *p * 12.Line 217; shouldn't this be 6 patients? I see six clusters and in the original paper six patients are mentioned. *Answer: We would thank the comment. ‘6 patients’ is correct, we revised it. However, in the revised manuscript, we added integrated analysis of TNBC as shown in the answer to comment 9.

      Previous reanalysis of clinical scRNA-seq (previous Figure 7) was removed from the revised manuscript. The reanalysis using 3 integrated TNBC cohorts (Figure C) is incorporated as Figure 5.

      Reviewer #1 (Significance (Required)): * Conceptual/biological impact: Showing the existence of distinct populations of CSCs within one (breast-)tumor potentially has a high impact on the field of fundamental and translational cancer research. As the authors state, it could be one key reason underlying drug resistance. However, the technology used by the authors does in my view not allow to make such a claim. First and foremost because the technology does not allow analysis at single cell resolution.

      Technical impact: The platform used by the authors can be of interest for some applications, but they already published this in Scientic Reports a few years ago. I'm afraid that with the rapid recent developments in the field of spatial single cell transcriptomics (See for example Srivatsan et al Science 2021; 373: 111-117), the technical impact on the field is relatively low.

      Audience: Researchers in the field of cancer biology with an interest to perform low-cost molecular analysis at low-resolution spatial-resolved tissue specimens (transcriptomics, but perhaps expanded with bisulfite sequencing, or ATAC sequencing) could be interested in the technology presented in this manuscript.

      My expertise: single cell transcriptomics, (cancer) cell cycle, cancer drug resistance, cell plasticity, mouse models. *

      **Referee Cross-commenting** I have read the comments and align mostly with reviewer #2. The authors need to improve this manuscript a lot before it's suitable for publication in any of the Review Commons journals. Answer: We are grateful to the reviewers. As indicated in the responses that follow, we have taken all of these comments and suggestions into account in the revised version of our paper, including the supplementary information.

      *

      *

      Reviewer #2 (Evidence, reproducibility and clarity (Required)): * This manuscript uses spatial transcriptomics to perform single cell-like expression analysis between a breast cancer cell line and tumor microenvironment in mice xenografted with these cells. Unfortunately, from the title, abstract, and introduction, it is difficult to understand exactly what the authors are focusing and discussing. It is also unclear the advantage of their technique for evaluating the populations observed within this manuscript. Furthermore, there is very little explanation of the results, and it does not appear to be a scientific logical structure. Hence, this manuscript is not suitable for acceptance in the journal. In order to improve the scientific quality of this study, the following concerns are presented.

      **Major concerns:** 1.Is cell-cell interaction (CCI) analysis novel method? If so, please specify detail in the manuscript. If the basic concept and the principle of CCI analysis have not been published, please mention in the discussion section as a limitation that a manuscript on CCI analysis is under submission to the preprint. In addition, please revise the abstract and related text. *__Answer: __Initially, algorithms of cell-to-cell interaction were reported with previously published tool [8, 9]; however, in this manuscript, we originally conducted the code for cell-to-cell interaction with the interaction database of the Bader laboratory from Toronto University (https://baderlab.org/CellCellInteractions#Download_Data) as previously described [10, 11]. We aimed to estimate the cell-to-cell interaction in each spot (including 10-30 cells). We think that this analysis will be helpful for discovering the cancer stem cell niche and metastatic niche [6].

      However, in the revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients. Therefore, CCI analysis in previous Figure 5 is moved to Supplementary Figure S7. Previous Figure 6 are removed from the revised manuscript. We revised the description in the manuscript (Page 18, Lines 385-387).

      * 2.The reviewer thinks that spatial transcriptomics plays an important role in your manuscript. Please describe the technique in the introduction. *__Answer: __We would thank the comments. Following the comments, we described the spatial technics in Introduction section. We revised the manuscript (Page 4, Lines 63-65) (Page 12, Lines 250-253).

      * 3.The classification by expression profile (HMGA1, CD44/MYC and marker-low) lacks an explanation. Authors should mention in detail how these populations were extracted from breast cancer cell lines. *Answer: In this research, to identify the characteristics of clusters, we analyzed differentially expressed genes (DEGs) by FindAllmarkers function of Seurat. As a result, ‘Cluster 0’ significantly expressed HMGA1 gene, and ‘cluster 1’ significantly expressed CD44. Next, we performed the up-stream enrichment analysis using gene signatures of FindAllMakers by Metascape. From result of enrichment analysis, we found the MYC activation in CD44 high-cluster; therefore, we named the cluster “CD44/MYC-high” cluster.

      HMGA1 and CD44 are popular cancer stem cell markers in triple-negative breast cancer [3, 4]; therefore, we focus on cancer-stem cell marker in presented study. Cancer stemness is an important concept in cancer metastasis [5-7].These results suggested that the existence of two cancer stem cell-like populations could potentially make tumors more drug-resilient in xenograft model and clinical patient.

      To improve the manuscript, we revised the Figure2, Supplementary Table S2 and S4, and manuscript (Pages 5-6, Lines 97-106).

      * 4.The description of the results is back and forth and confusing. Please reconsider the flow of the analysis. *__Answer: __We would thank the comment. We reconsidered the description and structure of manuscript. In revised manuscript, we focused on the existence of two cancer stem cell-like populations in TNBC xenograft and patients.

      To improve the manuscript, we revised the Figure2 for examination of cluster characteristics by clustering and gene expression profiling. Figure 3 was revised for the validation of two cancer stem cell-like populations in TNBC xenograft model. Figure 4 was revised for the elucidation of spatial characteristics of each cluster. Figure 5 was revised for the validation of two cancer stem cell-like populations in TNBC patients.

      * 5.How did you evaluate the outsides of the samples with very different spot positions in Figure 3A? Please mention your evaluation method in a scientific manner. In particular, authors should clearly indicate the outer evaluation for the metastatic case. *

      Answer: We would thank the suggestion. Following the comment, we performed the statistical analysis of spot position. The spots were categorized outer side (tumor edge) and Inner site (Center of tumor) in primary tumor section (Figure E-1 upside). We counted the spot numbers of the clusters (Figure E-1 table) and performed statistical test by chi-test. As a result, CD44/MYC clusters significantly resided at outer side of primary tumor (Figure E-1 bar plot). On the other hand, the spots in lymph-node metastasis are not readily defined the outer or inner. In addition, cell cycle analysis in the primary tumor and lymph node metastasis was performed with statistical test. As a result, HMGA1-high cluster and CD44/MYC-high cluster significantly proliferated in the lymph node metastasis section (Figure E-2).

      Therefore, in the revised manuscript, we revised the sentence of spot position in lymph-node metastasis (Pages 8-9, Lines 153-172). Figure E-1 are incorporated as Figure 4D. Figure E-2 are incorporated as Figure 4F. Hope our new results will be now accepted by the Reviewer and Editor.

      Figure E-1. Statistical analysis of spot position

      Chi-test was performed by R. *p Figure E-2. Statistical analysis of cell cycle index

      Fisher’s exact test was performed by R. *p * 6.The spots in primary tumor have few counts derived from mouse stromal/immune cells, as shown in Figure S1A. Nevertheless, Figure 3C shows that mouse stromal/immune cells are evaluated in the same way in primary and metastatic sites. The reviewer thinks that the regions identified as Tcell-like in the metastatic site, where there are many mouse-derived counts, and in the primary, where there are few mouse-derived counts, do not have the same characteristics. If many mouse-derived counts were detected in a spot using the spatial transcriptomics, then there must be many mouse-derived cells in the spot. Please discuss how this expression is evaluated on this technique, which is not a single cell analysis. *__Answer: __We would thank the comment. The reviewer’s suggestion is an important point; however, this suggestion is technical limitation of spatial transcriptomics technology. Most advanced spatial transcriptomics technologies, e.g. Visium (10x Genomics), also have the same problem. It means that our technology and the advanced technologies are technics to analyze gene expression and characteristics of tissues from 10-30 cells in each spot.

      In this spatial transcriptome analysis of mouse genes, we first performed the log normalization and scaling. Since Seurat used variable features among the samples for single-cell or spot clustering, we extracted the variable features for detection of clusters using the ‘FindVariableFeatures’ function. PCA and clustering using only mouse genes was performed for detecting the neighboring samples. After the clustering of mouse spots, we identified the character of clusters by finding the gene signatures. As the indication by the reviewer, the detected RNA counts and features are different, so it is difficult to define the exact character and cell type of stromal cells. Theoretically, spatial transcriptomics could only detect some kinds of stromal cells expressing the T-cell marker gene in the spot. Therefore, we named the cluster as “Tcell-like”. Not all of the Tcell-like cluster have the same characteristics or cell types, but they certainly express T-cell marker genes. This is also a technical limitation of spatial transcriptomics. Spatial transcriptomics with higher resolution probably is able to detect the stromal cells as a single-cell resolution, such as the one developed in previous research [1].

      In the revised manuscript, we focused on the two types of cancer stem cell-like populations that were validated by other methods (scRNA-seq and Immuno-staining). As the method is not able to define the exact cluster characters, we moved CCI analyses to supplementary figures or removed partly.

      We also revised the discussion in the revised manuscript (Pages 13-14, Lines 279-283).

      * 7.Please explain how the gene symbols listed in Figure 4A were selected. Also, please indicate the characteristics of the gene groups that are not listed. *__Answer: __We selected the gene signature list from results of ‘FindAllMarker’ function in Seurat. ‘FindAllMarker’ function enables to extract the significantly expressed genes in each cluster. Heatmap in previous Figure 4A was drawn using these marker genes (Adjusted p-value 0.1). Highlighted genes in the heatmap have been reported as cancer-related genes or cell cycle-related genes.

      The genes used for drawing heatmap are shown in Supplementary Table S2 and S4.

      * 8.Please describe the details of the division and cycle index in lines 141-142. *__Answer: __Cell cycle index is a basic function of Seurat [12] (https://satijalab.org/seurat/archive/v3.1/cell_cycle_vignette.html). A list of cell cycle markers is loaded with Seurat. We can segregate this list into markers of G2/M phase and markers of S phase. We subjected this function into our spatial transcriptomics to estimate the cell cycle in each spot.

      We revised the description manuscript (Page 16, Lines 331-332).

      * 9.In Line 148-151, the expression and prognosis of TMSB10, CTSD, and LGALS1 is mentioned based on the previous reports. Aren't these findings the result of bulk? Is the HMGA1 cluster that the authors found involved in the prognosis of mice? Please clarify, as it is unclear what you want to discuss. *

      Answer: We apologize for our confusing data and description. These highlighted genes (TMSB10, CTSD, LGALS1, CENPK, and CENPN) were extracted as DEGs of human cancer clusters (Supplementary Table S2). Previously, these genes have been reported as cancer-related genes or cell cycle-related genes, described in the manuscript (Page 6, Lines 107-110). To show the other expressed genes in each human cluster, we focused on these genes in the manuscript.

      We extracted the gene signatures from DEGs and showed the gene signatures from HMGA1-high cluster correlated to poor prognosis in TNBC patients. Our data suggested that the HMGA1 signatures from the microspot resolution has the potential to be a novel biomarker for diagnosis, and HMGA1-high cancer stem cells may contribute to poor prognosis.

      In this revision, since we reperformed DEGs analysis with significant threshold; therefore, survival analysis was reperformed with novel gene signatures with METABRIC TNBC cohorts (Figure F).

      To improve the manuscript, we revised the description of DEGs extraction and heatmap (Page 6, Lines 106-112). Hope our Reviewer will approve this revised sentence.

      Figure F. Survival analysis with gene signatures of HMGA1-high and CD44/MYC-high

      Survival analysis of TNBC patients (claudin-low subtype and basal-like subtype) in METABRIC cohorts by the Kaplan-Meier method. (Left) Survival analysis with the expression of the HMGA1 signatures (High = 151, Low = 247). Shading along the curve indicates 95% confidential interval. Log-rank test, p = 0.012. (Right) Survival analysis with the expression of the CD44/MYC signatures (High = 333, Low = 65). Log-rank test, p = 0.079.

      * 10.Please provide details of all statistical tests used in this manuscript and describe significance levels used in the p-values and FDR. *__Answer: __We performed the extraction of differentially expressed genes (DEGs) by ‘FindAllMarkers’ function with MAST method. MAST method identifies differentially expressed genes between two groups of cells using a hurdle model tailored to scRNA-seq data [13]. Adjusted p-value is calculated based on Bonferroni correction using all features in the dataset. In spatial spot analysis, statistical analyses were performed by Chi-test and Fisher’s exact test.

      We revised materials and methods section in the manuscript (Page 19, Lines 391-394).

      * 11.Please mention CCI score (line 198). *Answer: As described in answer to comment 1, the algorithms of CCI score calculation were performed using previously published tool [8, 9]; however, we originally conducted the code for cell-to-cell interaction with the interaction database of the Bader laboratory from Toronto University (https://baderlab.org/CellCellInteractions#Download_Data). We extracted the genes whose expression value was greater than 2. We selected the combinations representing ligand__-__receptor interactions, in which both ligand genes and receptor genes were expressed in the same spot.

      We revised materials and methods section in the manuscript and Supplementary Legends (Page 18, Lines 385-387).

      * 12.Lines 204-206 and Figure 6G show specific interaction of ITGB1 and CST3, but it is unclear why only these molecules were extracted. What about the other molecules? At least ITGB1 is not scored in mix5. *Answer: We selected genes that have been reported as cancer-related ones in breast cancer to discuss the interactions in primary tumor and lymph-node metastasis. However, according to this revision, our presented study mainly focused on the existence of two types of cancer stem cell-like population in TNBC xenografts and patients. Therefore, CCI analysis with cluster types moved to supplementary Figure or some were not shown now.

      In the revised manuscript, previous Figure 6 is removed.

      * 13.HMGA1 signature appears in Line 214, please explain in detail. *__Answer: __As described in answer to comment 7, we selected the gene signature list from results of ‘FindAllMarker’ function. ‘FindAllMarker’ function enables to extract the significantly expressed genes in each cluster. HMGA1 signature genes were selected from significantly differentially expressed genes of HMGA1-high clusters.

      We revised the description in the revised manuscript (Pages 9-10, Lines 190-193).

      * 14.Authors should discuss how the previously reported bulk expression data used in Figure 7E can be linked to the single-cell-like analysis in this study. *__Answer: __Previous research reported that gene signatures extracted from specific clusters in scRNA-seq study have the potential to be a prognosis marker [14]. We showed the gene signatures from HMGA1-high cluster correlated to poor prognosis in TNBC patients. Our results suggested that the gene signatures from the resolution of microspot (10-30 cells) could have the potential to be prognosis markers. This punching microdissection system enables to extract only the parts of a section that are necessary for diagnosis of cancer and to analyze at low-cost. It could be applied to diagnostics instead of the laser-capture microdissection methods.

      We performed additional survival analysis with METABRIC cohorts. As described in this revision, since we reperformed DEGs analysis with significant threshold, survival analysis was reperformed with novel gene signatures with METABRIC TNBC cohorts (Figure F).

      In revised manuscript, Figure F were incorporated as Figure 6. The usefulness of gene signatures from microspot resolution was additionally discussed (Page 12, Lines 242-245, 250-253).

      * **Minor concerns:** 15.Please describe how the normalized centrality was calculated in UMAP algorithm and explain what this means in the results. __Answer: __The data showed that the expressional diversity in each cluster based on the network centrality of a correlational network with graph theory. The differences in the centrality among the clusters suggested expressional diversity in each (Supplementary Figure 4). Higher centrality represented lower expressional diversity and vice versa*. The detailed method for the calculation of centrality was previously shown to reveal the difference between smokers and never-smokers [10, 11].

      We added the description in the Legend (Pages 7-8, Lines 145-150).

      * 16.Please mention an explanation for the red X in Figure 1B to the legend. *__Answer: __The red X means failure spot for RNA extraction. We added the description in Figure 1B.

      * 17.Please spell out the abbreviations in all figure legends. *__Answer: __We added the abbreviations in the legends of all figures.

      * 18.Please explain what is meant by the color of the lines and the size of the circles in Figure 4D. *__Answer: __The network analysis was performed by Metascape (https://metascape.org/gp/index.html#/main/step1) [15]. The node size is proportional to the number of genes belonging to the term, and the node color represents the identity of the cluster. However, as described in the answer to reviewer’s comment 9, we reperformed enrichment analysis with significant DEGs. As a result, only CD44/MYC cluster had a lot of enrichment terms.

      Therefore, network visualizations were removed from the revised manuscript.

      * 19.Please mention an explanation for the color of the spots in Figure 5D and 5F to the legend. *__Answer: __The color showed the spots categorized into the selected group.

      In the revised manuscript, previous Figure 5 was incorporated as Supplementary Figure S7. We added the description in Supplementary Figure S7 and S8 with the legends.

      * 20.Is "S51" in Line 148 a typo for "S5A"? *Answer: Thank you. We revised “S5A”.

      * 21.Please mention an explanation for the bars in Figure 6D and 6F to the legend. *__Answer: __The bars showed relative CCI scores. As described below, we removed the results of CCI analysis with cluster group (previous Figure 6) in the revised manuscript.

      * 22.Please mention an explanation for the colors in Figure 7E to the legend. *__Answer: __The color showed patients’ group based on expression levels of gene signatures. We added the description in the Legend of Figure 6.

      *

      *

      Reviewer #2 (Significance (Required)): * The approach in Figure 5 is interesting, but the rest of the results do not take full advantage of the technology developed by the authors. The structure of the manuscript should be re-examined and new perspectives added. I look forward to the future of the authors' research.

      *

      Reviewer #3 (Evidence, reproducibility and clarity (Required)): Microtissue transcriptome analysis of triple-negative breast cancer cell line MDA-MB-231 xenograft model using automated tissue microdissection punching techonology revealed that the existence of three cell-type clusters in the primary tumor and axillary lymph node metastasis. The CD44/MYC-high cluster showed aggressive proliferation with MYC expression, the HMGA1-high cluster exhibited HIF1A activation and upregulation of ribosomal processes. The cell-cell-interaction analysis revealed the interaction dynamics generated by the combination of cancer cells and stromal cells in primary tumors and metastases. The gene signature of the HMGA1-high cancer stem cell-like cluster has the potential to serve as a novel biomarker for diagnosis. The key conclusions are convincing. The data and methods are presented in a reproducible way. The experiments are adequately replicated and statistical analysis is adequate. Prior studies are appropriately referenced. The text and figures are clear and accurate. __Answer: __We would thank the valuable comments. As the reviewer mentioned, our findings showed that the existence of two cancer stem cell-like populations has the potential to make tumors more drug-resilient. Our results suggested that the gene signatures from the resolution of microspot (10-30 cells) could have the potential to be prognosis markers. This punching microdissection system enables to extract only the parts of a section that are necessary for diagnosis of cancer and to analyze at low-cost. It could be applied to diagnostics instead of the laser-capture microdissection methods.

      In this revision, we focused on the existence of two cancer stem cell-like populations in TNBC xenografts and patients. Following the other reviewer’s comments, we performed the extraction of DEGs with significant threshold; therefore, we revised the results of enrichment analysis but it did not influence our main findings.

      To validate the existence of two types of cancer stem-like cells in TNBC tumors, we performed the additional analyses (reanalysis of public scRNA-seq datasets and immuno-staining of MDA-MB-231 primary tumor). These results verified two cancer stem cell-like populations (HMGA1-high, CD44-high) in MDA-MB-231 xenograft and TNBC patients. We believe that our findings are solid results because the findings were also validated by other methods.

      Again, we would thank kind reviewing our manuscript.

      Reviewer #3 (Significance (Required)): * In the past several studies showed the heterogeneity of cell-cell interactions between cancer cells and stromal cells in situ (Andersson et al, 2021; Wu et al, 2021) and tumor microheterogeneity (Jiang et al, 2016; Liu et al, 2016; Zhang et al, 2020). Spatial transcriptomics methods are important to reveal microheterogeneity of cancer. As a physician working in gynecology and obstetrics in my opinion the results of the study and spatial transcriptomic methods could be relevant to detect new biomarkers for diagnosis and prognosis of breast cancer in future and to find novel therapeutic targets to overcome drug resistance and facilitate curative treatment of breast cancer.

      *

      References in response letter

      1. Srivatsan SR, Regier MC, Barkan E, Franks JM, Packer JS, Grosjean P, et al. Embryo-scale, single-cell spatial transcriptomics. Science. 2021;373(6550):111-7. Epub 2021/07/03. doi: 10.1126/science.abb9536. PubMed PMID: 34210887.
      2. Moravec JC, Lanfear R, Spector DL, Diermeier SD, Gavryushkin A. Cancer phylogenetics using single-cell RNA-seq data. bioRxiv. 2021:2021.01.07.425804. doi: 10.1101/2021.01.07.425804.
      3. Liu H, Patel MR, Prescher JA, Patsialou A, Qian D, Lin J, et al. Cancer stem cells from human breast tumors are involved in spontaneous metastases in orthotopic mouse models. Proc Natl Acad Sci U S A. 2010;107(42):18115-20. Epub 2010/10/06. doi: 10.1073/pnas.1006732107. PubMed PMID: 20921380; PubMed Central PMCID: PMC2964232.
      4. Pegoraro S, Ros G, Piazza S, Sommaggio R, Ciani Y, Rosato A, et al. HMGA1 promotes metastatic processes in basal-like breast cancer regulating EMT and stemness. Oncotarget. 2013;4(8):1293-308. Epub 2013/08/16. doi: 10.18632/oncotarget.1136. PubMed PMID: 23945276; PubMed Central PMCID: PMC3787158.
      5. Weiss F, Lauffenburger D, Friedl P. Towards targeting of shared mechanisms of cancer metastasis and therapy resistance. Nat Rev Cancer. 2022. Epub 2022/01/12. doi: 10.1038/s41568-021-00427-0. PubMed PMID: 35013601.
      6. Oskarsson T, Batlle E, Massagué J. Metastatic Stem Cells: Sources, Niches, and Vital Pathways. Cell Stem Cell. 2014;14(3):306-21. doi: https://doi.org/10.1016/j.stem.2014.02.002.
      7. Turdo A, Veschi V, Gaggianesi M, Chinnici A, Bianca P, Todaro M, et al. Meeting the Challenge of Targeting Cancer Stem Cells. Front Cell Dev Biol. 2019;7:16. Epub 2019/03/06. doi: 10.3389/fcell.2019.00016. PubMed PMID: 30834247; PubMed Central PMCID: PMC6387961.
      8. Armingol E, Officer A, Harismendy O, Lewis NE. Deciphering cell-cell interactions and communication from gene expression. Nat Rev Genet. 2021;22(2):71-88. Epub 2020/11/11. doi: 10.1038/s41576-020-00292-x. PubMed PMID: 33168968; PubMed Central PMCID: PMC7649713.
      9. Kumar MP, Du J, Lagoudas G, Jiao Y, Sawyer A, Drummond DC, et al. Analysis of Single-Cell RNA-Seq Identifies Cell-Cell Communication Associated with Tumor Characteristics. Cell Rep. 2018;25(6):1458-68.e4. Epub 2018/11/08. doi: 10.1016/j.celrep.2018.10.047. PubMed PMID: 30404002; PubMed Central PMCID: PMCPMC7009724.
      10. Watanabe N, Nakayama J, Fujita Y, Mori Y, Kadota T, Shimomura I, et al. Single-cell Transcriptome Analysis Reveals an Anomalous Epithelial Variation and Ectopic Inflammatory Response in Chronic Obstructive Pulmonary Disease. medRxiv. 2020:2020.12.03.20242412. doi: 10.1101/2020.12.03.20242412.
      11. Nakayama J, Yamamoto Y. Single-cell meta-analysis of cigarette smoking lung atlas. bioRxiv. 2021:2021.12.09.472029. doi: 10.1101/2021.12.09.472029.
      12. Stuart T, Butler A, Hoffman P, Hafemeister C, Papalexi E, Mauck WM, 3rd, et al. Comprehensive Integration of Single-Cell Data. Cell. 2019;177(7):1888-902.e21. Epub 2019/06/11. doi: 10.1016/j.cell.2019.05.031. PubMed PMID: 31178118; PubMed Central PMCID: PMC6687398.
      13. Finak G, McDavid A, Yajima M, Deng J, Gersuk V, Shalek AK, et al. MAST: a flexible statistical framework for assessing transcriptional changes and characterizing heterogeneity in single-cell RNA sequencing data. Genome Biol. 2015;16:278. Epub 2015/12/15. doi: 10.1186/s13059-015-0844-5. PubMed PMID: 26653891; PubMed Central PMCID: PMCPMC4676162.
      14. Cheng S, Li Z, Gao R, Xing B, Gao Y, Yang Y, et al. A pan-cancer single-cell transcriptional atlas of tumor infiltrating myeloid cells. Cell. 2021;184(3):792-809.e23. Epub 2021/02/06. doi: 10.1016/j.cell.2021.01.010. PubMed PMID: 33545035.
      15. Zhou Y, Zhou B, Pache L, Chang M, Khodabakhshi AH, Tanaseichuk O, et al. Metascape provides a biologist-oriented resource for the analysis of systems-level datasets. Nat Commun. 2019;10(1):1523. Epub 2019/04/05. doi: 10.1038/s41467-019-09234-6. PubMed PMID: 30944313; PubMed Central PMCID: PMC6447622.
    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Microtissue transcriptome analysis of triple-negative breast cancer cell line MDA-MB-231 xenograft model using automated tissue microdissection punching techonology revealed that the existence of three cell-type clusters in the primary tumor and axillary lymph node metastasis. The CD44/MYC-high cluster showed aggressive proliferation with MYC expression, the HMGA1-high cluster exhibited HIF1A activation and upregulation of ribosomal processes. The cell-cell-interaction analysis revealed the interaction dynamics generated by the combination of cancer cells and stromal cells in primary tumors and metastases. The gene signature of the HMGA1-high cancer stem cell-like cluster has the potential to serve as a novel biomarker for diagnosis.

      The key conclusions are convincing. The data and methods are presented in a reproducible way. The experiments are adequately replicated and statistical analysis is adequate.

      Prior studies are appropriately referenced. The text and figures are clear and accurate.

      Significance

      In the past several studies showed the heterogeneity of cell-cell interactions between cancer cells and stromal cells in situ (Andersson et al, 2021; Wu et al, 2021) and tumor microheterogeneity (Jiang et al, 2016; Liu et al, 2016; Zhang et al, 2020). Spatial transcriptomics methods are important to reveal microheterogeneity of cancer. As a physician working in gynecology and obstetrics in my opinion the results of the study and spatial transcriptomic methods could be relevant to detect new biomarkers for diagnosis and prognosis of breast cancer in future and to find novel therapeutic targets to overcome drug resistance and facilitate curative treatment of breast cancer.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      This manuscript uses spatial transcriptomics to perform single cell-like expression analysis between a breast cancer cell line and tumor microenvironment in mice xenografted with these cells. Unfortunately, from the title, abstract, and introduction, it is difficult to understand exactly what the authors are focusing and discussing. It is also unclear the advantage of their technique for evaluating the populations observed within this manuscript. Furthermore, there is very little explanation of the results, and it does not appear to be a scientific logical structure. Hence, this manuscript is not suitable for acceptance in the journal. In order to improve the scientific quality of this study, the following concerns are presented.

      Major concerns:

      1.Is cell-cell interaction (CCI) analysis novel method? If so, please specify detail in the manuscript. If the basic concept and the principle of CCI analysis have not been published, please mention in the discussion section as a limitation that a manuscript on CCI analysis is under submission to the preprint. In addition, please revise the abstract and related text.

      2.The reviewer thinks that spatial transcriptomics plays an important role in your manuscript. Please describe the technique in the introduction.

      3.The classification by expression profile (HMGA1, CD44/MYC and marker-low) lacks an explanation. Authors should mention in detail how these populations were extracted from breast cancer cell lines.

      4.The description of the results is back and forth and confusing. Please reconsider the flow of the analysis.

      5.How did you evaluate the outsides of the samples with very different spot positions in Figure 3A? Please mention your evaluation method in a scientific manner. In particular, authors should clearly indicate the outer evaluation for the metastatic case.

      6.The spots in primary tumor have few counts derived from mouse stromal/immune cells, as shown in Figure S1A. Nevertheless, Figure 3C shows that mouse stromal/immune cells are evaluated in the same way in primary and metastatic sites. The reviewer thinks that the regions identified as Tcell-like in the metastatic site, where there are many mouse-derived counts, and in the primary, where there are few mouse-derived counts, do not have the same characteristics. If many mouse-derived counts were detected in a spot using the spatial transcriptomics, then there must be many mouse-derived cells in the spot. Please discuss how this expression is evaluated on this technique, which is not a single cell analysis.

      7.Please explain how the gene symbols listed in Figure 4A were selected. Also, please indicate the characteristics of the gene groups that are not listed.

      8.Please describe the details of the division and cycle index in lines 141-142.

      9.In Line 148-151, the expression and prognosis of TMSB10, CTSD, and LGALS1 is mentioned based on the previous reports. Aren't these findings the result of bulk? Is the HMGA1 cluster that the authors found involved in the prognosis of mice? Please clarify, as it is unclear what you want to discuss.

      10.Please provide details of all statistical tests used in this manuscript and describe significance levels used in the p-values and FDR.

      11.Please mention CCI score (line 198).

      12.Lines 204-206 and Figure 6G show specific interaction of ITGB1 and CST3, but it is unclear why only these molecules were extracted. What about the other molecules? At least ITGB1 is not scored in mix5.

      13.HMGA1 signature appears in Line 214, please explain in detail.

      14.Authors should discuss how the previously reported bulk expression data used in Figure 7E can be linked to the single-cell-like analysis in this study.

      Minor concerns:

      15.Please describe how the normalized centrality was calculated in UMAP algorithm and explain what this means in the results.

      16.Please mention an explanation for the red X in Figure 1B to the legend.

      17.Please spell out the abbreviations in all figure legends.

      18.Please explain what is meant by the color of the lines and the size of the circles in Figure 4D.

      19.Please mention an explanation for the color of the spots in Figure 5D and 5F to the legend.

      20.Is "S51" in Line 148 a typo for "S5A"?

      21.Please mention an explanation for the bars in Figure 6D and 6F to the legend.

      22.Please mention an explanation for the colors in Figure 7E to the legend.

      Significance

      The approach in Figure 5 is interesting, but the rest of the results do not take full advantage of the technology developed by the authors. The structure of the manuscript should be re-examined and new perspectives added. I look forward to the future of the authors' research.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Nakayama and colleagues use their previously developed automated tissue microdissection punching platform to perform spatial transcriptomics on a breast cancer xenograft model. Using transcriptomics on multiple clumps of 10-30 cells from different regions in a tumor and a lymph node metastasis they identified different cell-type clusters. Two of these clusters expressed different cancer stem cell markers. This led the authors to suggest that two distinct cancer stem cell(-like) populations may exist within one (breast) tumor, which could potentially make tumors more drug-resilient.

      Major comments:

      While the quality of the presented sequencing data is good and the manuscript is mostly written in a clear and accessible style, there are some concerns that limit the impact of this story. Most importantly, the manuscript in its present form does not convince me that the MDA-MB-231 xenografts indeed contain two distinct populations of cancer stem(-like) cells.

      1.The data obtained are not single cell data, which makes it difficult -if not impossible- to draw conclusions about presence of cancer stem cells. Each data point is the average of 10-30 cells, and the interpretation of the data is severely limited by this. How can the quantification of expression of CD44/MYC/HMGA1 in clumps of 10-30 cells teach us something about the stemness of tumor cells?

      2.Furthermore, the authors should better explain their data analysis strategy with identification of gene expression profiles. It is unclear how they found CD44, MYC, and HMGA1 other than by cherry-picking from the list of cluster markers.

      3.Following up on the above point: I looked in the supplementary tables, but couldn't find MYC. How did the authors conclude that MYC is involved in cluster 1? In fact, when I ran a quick analysis in EnrichR, I saw that putative MYC target genes were strongly enriched among the markers in the HMGA1 cluster, but not the CD44/MYC. That's opposite to what I would expect.

      4.All data were produced from 1 primary tumor and 1 metastasis. Thus, reproducibility and robustness of the methodology cannot be evaluated. The interpretation of the data could be strengthened when xenografts from at least 3 different mice are shown.

      5.The only methodology is single cell RNA-sequencing. Immuno-staining on relevant markers such as CD44, MYC, HMGA1 plus human epithelium and cell cycle markers would provide strong additional support for the claims made by the authors, because it's a complementary technique and it allows quantification at single cell resolution.

      6.Line 173-175. The marker-low cluster look to me simply like spots containing a relatively high amount of dead/dying (tumor) cells. The identity/state of cells in the marker-low cluster should be characterized and discussed more extensively.

      7.Figure 5 and accompanying text in line 182-194; the authors try to infer cell-to-cell interactions using a previously published tool. However, any biological interpretation is lacking. What can be concluded from this analysis?

      8.Figure 6. Can the authors please explain more clearly what they mean by "PT" and "Mix" groups? I had a very hard time to understand what the data in figure mean. Again, an overall interpretation at the end (line 211) is lacking.

      9.Figure 7. I like the effort to align the results with public scRNA-seq data. But although the expression of the cluster-signatures is heterogeneous, there is no evidence for distinct (CSC-like) cell populations. Why don't these HMGA1 vs CD44 signature cells cluster away from each other in the UMAPs? Perhaps the patient-to-patient heterogeneity overwhelms differences within tumors, but in that case the authors could re-run their analysis for each patient separately, to make 6 patient-specific UMAPs. In its present form, this analysis does not convince me that two distinct CSC(-like) populations within one TNBC exist.

      Minor comments:

      10.In the Supplemental table 2 noticed that many of the marker genes have adjusted P values well above 0.05 (and even above 0.1). That makes the statistical analysis rather weak. This could especially be problematic since the authors entirely base their main claims on this marker analysis, and I recommend that the authors use more stringent P-value cut-offs in the cluster analysis.

      11.Line 129/130. If I look at figure 3A, I don't see this tendency that the authors describe. Can the authors provide statistical support or visual aid to make their claim more apparent to the reader?

      12.Line 217; shouldn't this be 6 patients? I see six clusters and in the original paper six patients are mentioned.

      Significance

      Conceptual/biological impact: Showing the existence of distinct populations of CSCs within one (breast-)tumor potentially has a high impact on the field of fundamental and translational cancer research. As the authors state, it could be one key reason underlying drug resistance. However, the technology used by the authors does in my view not allow to make such a claim. First and foremost because the technology does not allow analysis at single cell resolution.

      Technical impact: The platform used by the authors can be of interest for some applications, but they already published this in Scientic Reports a few years ago. I'm afraid that with the rapid recent developments in the field of spatial single cell transcriptomics (See for example Srivatsan et al Science 2021; 373: 111-117), the technical impact on the field is relatively low.

      Audience: Researchers in the field of cancer biology with an interest to perform low-cost molecular analysis at low-resolution spatial-resolved tissue specimens (transcriptomics, but perhaps expanded with bisulfite sequencing, or ATAC sequencing) could be interested in the technology presented in this manuscript.

      My expertise: single cell transcriptomics, (cancer) cell cycle, cancer drug resistance, cell plasticity, mouse models.

      Referee Cross-commenting

      I have read the comments and align mostly with reviewer #2. The authors need to improve this manuscript a lot before it's suitable for publication in any of the Review Commons journals.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Reply to the Reviewers

      I thank the Referees for their...

      Referee #1

      1. The authors should provide more information when...

      Responses + The typical domed appearance of a hydrocephalus-harboring skull is apparent as early as P4, as shown in a new side-by-side comparison of pups at that age (Fig. 1A). + Though this is not stated in the MS 2. Figure 6: Why has only...

      Response: We expanded the comparison

      Minor comments:

      1. The text contains several...

      Response: We added...

      Referee #2

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      all OK

      Significance

      This is a valuable paper that make use of the rapid mitotic cycles of the Drosophila syncytial embryo to study the recruitment of proteins in mitotit centrosome maturation. The synchrony of these cycles make this an excellent experimental system in which to follow the relative timing of recruitment of individual molecules to the centrosome and, while the system may have idiosyncrasies that facilitate rapid cycling, it provides valuable information. This is a significant data set that shows the pulsatile recruitment of Spd2 and Polo kinase peaking in mid S-phase in contrast to the continuous recruitment of Cnn.

      The authors carry out some interesting modelling to account for the pulsatile activity of Polo through recruitment to the centriole. As they have previously shown Polo recruitment to be dependent upon S-S/t motifs in An1 and Spd, the authors examine the effects of multiple mutations at these potential recruitment sites. Interestingly they show that mutation of 34 such sites in Ana1 has little effect on recruitment of Polo to old-mother centrioles but perturbs recruitment onto ne mothers. Expression of the multiply mutated Spd-2, on the other hand, perturbs the Polo pulse on both old- and new- mothers. Together this would be in line with their previous suggested role for Ana1 in initially recruiting Polo to centrioles and Spd2 having a role in expanding the PCM.

      The modelling carried out by the authors is simple but effective. As with almost any cell cycle model, the models have their short-comings and the authors are largely aware of these. I thought it would be worthwhile to have some more discussion of what activates Polo kinase. It could be partially activated by the Polo-box binding to its receptor site but do other kinases carry out its T-loop phosphorylation? There are plenty of mitotic kinases around and so this could be discussed in greater detail. Moreover, although the pulsatile association of Polo with the centrosome does not have to correspond to pulsatile activity, this is likely. In which case, further discussion of the roles of opposing phosphatases would be in order.

      All in all, however, this is a useful paper that comes up with a thorough description of the timing of events of centrosome maturation in Drosophila embryos.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Review of 'Mother centrioles generate a local pulse of Polo/PLK1 activity to initiate mitotic centrosome assembly' from Wong et al.

      In this paper, Wong et al address the mechanisms of centrosome assembly in flies. They start with the interesting observation that Polo localized at centrosomes oscillates before cells enter mitosis, while Cnn (and with it centrosome maturation) either increases or reaches a plateau. The phenomenon is local, since Polo levels at in the cell are high during mitosis. They propose that the oscillation is driven by a negative feedback loop whereby Polo inhibits its own binding to the centrosome, Ana1 being the most likely relevant receptor. Finally, they discuss the possible meaning of this oscillatory behavior, in the light of the rapidity of the early embryonic cell cycles.

      Major comments

      1- One can imagine different reasons for the fact that the model displays different dynamics for Cnn and Spd-2/Polo. For example, a major difference may be due to the different dissociation rates of the clusters Cstar and Shat. These are governed by different laws and different parameters (kdis vs kidsCstar1/n). If I understand, both parameters and dependency on Cstar^2 are assumptions. Hence, it would be important to pinpoint which component of the model is more directly responsible for the observed behavior. The analysis should not be limited to the dissociation, but should be extended to the whole model. To this aim, one could test the robustness of the model's parameters. The results of this analysis will also be a prediction of the model.

      2- The presence of a positive-feedback loop involving Cnn could offer an alternative and more robust explanation for the slower dynamics of Cnn. Such a loop between Cnn and Spd-2 was proposed by the authors (Conduit, eLife, 2014). I think some comment on this point would be interesting (eg, could the Cnn/Spd-2 loop proposed earlier work in this context? If not, why? If yes, should not this option be explored?).

      3- The prediction presented in Figure 6 is very relevant. I wonder how robust this behavior is to changes in parameters values.

      4- Additional testing of the model would be important to confirm that the negative feedback loop is actually in place, although I understand experiments may be difficult to be performed. Possible examples: constantly high levels of Polo are expected to decrease its centrosomal localization, is that correct and, if so, testable? Is it possible to delay one cycle, and then observe the decay in Cnn values? This latter experiment, for example, could help to distinguish positive feedback vs slow decay rates. If the experiments are not possible, it may be worth anyway to present some predictions worth testing.

      5- The difference between Models 2 and 3 is not clear to me. In mathematical terms, they seem to be basically the same thing: reaction (50)=(33), (51)~(34) given (40) and (52)~(35) again given (40). This is precisely since the model comes with the assumption of a well-stirred system, and thus adding P in solution is not so different from assuming P=Rphat (40). I would have imagined that also Model 2 accounts for the fact that in Spd-2-S16T and Ana1-S347T Polo is recruited slower and for a longer period. Is it not true? If so, is model 3 really needed? More in general, assuming a role for an increase of local concentration of P* is quite a jump, especially given the small distances involved, and the fast diffusion occurring within cells.

      Minor points

      1-Could the authors use the FRAP data to estimate the different kdis? If so, a comparison with the 20-fold difference used in the model would be useful.

      2- p. 6, The authors should state clearly for the worm-uneducated like me whether the fusions were done with the endogenous proteins or not.

      3- p.7 Figure 1B, in the text it is referred to display 'levels of peaks' and in the figure and legend we find 'growth period'. Not clear how the two refer to the same quantity.

      4- Spd2-mCherry is present in both Figure 1C and D, but with very different amplitudes. Why is that the case?

      5- The fact that Polo peaks in mitosis is a key observation. Unfortunately, this is often reported as a personal communication. The authors never tried to produce this piece of data?

      6- p.11 It is explained that NM and OM differ for their initial values because the OM starts with some PCM from the previous cycle. However in Figure 3A, for example, the values of Polo at the end of the cycle are identical in the two. Is not this in contrast with the explenation?

      Still p11, there is reference to Figure 3C,D, but Figure 3D does not exist, I guess it should be 3A,C.

      7- In the formulation of the model (page numbers in Suppl Mat are unfortunately missing..), one citation for the total amount of Polo being large is needed.

      8- I do not understand this point: scaled c output is 1, and the initial condition for c=1 also?

      9- It has been shown in different systems (from yeast -- haase winey reed, NCB, 2001-- to worms -- McCLeland O-Farrell CB 2008) that centrosome duplication can occur independently from the cell cycle oscillator. I was wondering whether the proposed negative feedback loop may play a role in this phenomenon. This is only a curiosity, which does not need to be addressed.

      Significance

      The new observation and hypotheses presented in the paper provide a sizeable advance. The presence of an oscillation in Polo, uncoupled from cellular levels, is new, and the model proposes a testable hypothesis to explain it. Some additional experiments to verify the model would strengthen the manuscript.

      The work is probably more appropriate for experts in the centrosome field. My primary expertise for this review was in mathematical models.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      Embryogenesis is characterized by rapid cell divisions without gap phases. How these cells achieve successive rounds of chromosome segregation in dozens of minutes without failure is of great interest to cell and developmental biologists. A key aspect of rapid divisions is the oscillatory nature of centrosome assembly, which aids in building the mitotic spindle during mitosis, and centrosome disassembly during mitotic exit. Polo kinase activation and localization to the centriole is essential for centrosome dynamics, but its molecular targets, timescales of activation and deactivation, and overall mechanism of action is still not fully determined.

      This paper aims to build a mathematical model to tease out the mechanism of Polo recruitment to centrioles and transformation of centrosome scaffold proteins (Spd2/Cep192 and Cnn/CDK5RAP2) from inactive forms to functional, multimeric platforms. The authors posit that several features are critical to describe the dynamics of Polo, Spd-2, and Cnn: 1) a negative feedback loop that releases Polo from a centriole receptor, 2) a kinetic relay that allows Spd-2 to assemble, followed by Cnn, and 3) a disassembly mechanism driven by de-phosphorylation. They validate their model in several ways, most notably by introducing an Ana1 mutant that inhibits Polo binding to centrioles: their model predicts a delay in Polo accumulation which bears out in vivo.

      The cell biology experiments of this paper are of high quality and well quantified, and I have no concerns there. However, the mathematical model elevates this study to the next level, and thus deserves greater scrutiny. I'm not concerned that the model doesn't get everything right, or that all of the parameters are correct. This is new territory. I think the value of models is their power to predict, rather than their power to explain existing data. The authors are giving the field a great hypothesis generator which we can use to plan experiments for the next 5 years. Then the model will be updated to be more accurate. Thus, this work represents a significant achievement.

      Still, some key validations regarding phosphorylation rates are missing that could be easily tested. Furthermore, the study would be strengthened by greater understanding of the PCM disassembly. mechanism. Addressing these two points will improve my confidence in the mathematical model.

      Major Comments.

      1. This study builds a model that relies heavily on rates of phosphorylation and de-phosphorylation. Further, de-phosphorylation is assumed to be the key disassembly mechanism, but this has not been rigorously studied in fly embryos. Thus, two critical aspects of the model remain unverified.

      Surely, the authors could test how changing phosphorylation rate (kcat S and kcat C) and de-phosphorylation rate (kdis) affects the recruitment and departure of Spd-2, Polo, and Cnn in vivo. This could be achieved by 1) titrating an inhibitor of Polo (e.g., BI-2536) or introducing a mutation in the T-loop of Polo (the equivalent of T210D or T210V in flies; T210D should raise kcat, while T210V should lower kcat; https://doi.org/10.1021/bi602474j), and 2) inhibiting a phosphatase such as PP2A, which is the presumed antagonist of Polo according to several C. elegans studies.

      If their model adequately predicts the outcome of these two experiments (changing phosphorylation and dephosphorylation rates), I will be more convinced.

      1. The models focus on Polo and Spd-2 pulses during mitosis, but ignore the disassembly phase of Cnn. Do Cnn levels drop during mitotic exit? Can this drop in Cnn be described by any of the authors' proposed models?
      2. These models are described as the "simplest possible" yet have many unknown parameters. For example, Model 1 has 12 parameters, none of which have been determined experimentally. How did the authors land on these values? Is it possible that one could alter any combination of these parameters and achieve a similar outcome? Or, if the Kcat of Polo is changed two-fold, does the whole model fall apart (see above)?

      Experimentally determining these parameters would greatly strengthen this paper, but I think that would require gargantuan effort that is beyond the scope of the current work. Instead, it is therefore critical to test how robust the model is by probing the parameter space. For example, could the authors show us what the model predicts (e.g., as in Figure 2C) when each parameter is changed by 2-fold? Presumably the authors have already done this, but I would like to see the outcomes.

      Minor Comments.

      -Figure 1. The authors should include representative images of centrosomes for the plots in panels A,C and D. The x-axes could have more informative labels (e.g., time relative to NEB). - Figure 1A. Much of Figure 1 has already been performed in C. Elegans, yet this fact is not mentioned until the discussion. For example, the pulsed nature of Polo and SPD-2 appearance and disappearance has been reported in C. elegans in Mittasch et al. 2020 and Magescas et al., 2019. These findings, and their implications for evolutionary conservation, should be mentioned in the main text (e.g. page 6 or 7). - Figure 2. It's hard to envision how a scaffold can both flux outward and be structurally strong. The mere fact that there is outward movement of scaffold chunks implies breaking of bonds, which indicates overall structural weakness. Are the authors talking about strength of the entire PCM, or just strength of the chunks? It would be great if the authors could clarify this. -Figure 2. One would think that scaffold flux and strength are anti-correlated. Perhaps this is the case? As far as I'm aware, previous studies of Cnn flux were performed primarily in S-phase, when there is presumably less need for PCM strength. What about during mitosis during chromosome segregation? Does the PCM become stronger during mitosis? Does Cnn flux decrease during mitosis? - Figure 2B. I would prefer a legend in the actual figure indicating what the different symbols mean. I found it difficult glancing back and forth between the text and the figure. - "We also allow the rate of 𝐶∗ disassembly to increase as the size of the 𝐶∗ scaffold increases, which appears to be the case in these embryos (Conduit et al, 2010)."

      I can't find any analysis of PCM disassembly in this study. What are the authors referring to as "disassembly"? Do they mean departure of Cnn from the PCM in S-phase? Or, disassembly of the whole PCM during mitotic exit?

      -"If the centriole and PCM receptors (Ana1 and Spd-2, respectively) recruit less Polo, the centriole receptor (Ana1) will be inactivated more slowly."

      Is Ana1 a known substrate of Polo? This seems highly speculative. The authors should note that deactivation of Ana1 could be through various other mechanisms. Furthermore, Polo could be locally degraded as shown in human cells doi: 10.1083/jcb.200309035.

      -"We note that our mathematical models are purposefully minimal to reduce the number of parameters and test possible mechanisms rather than to mimic experimental data." I appreciate this statement.

      Significance

      This paper aims to build a mathematical model to understand the cyclic nature of centrosome assembly and disassembly in fruit flies. Due to the conserved nature of the components (proteins in the system, such as Polo Kinase, Spd-2, and Cnn/CDK5RAP2), this model could likely be extended to a broad swath of eukaryotes. This approach is quite unique in the centrosome field, as only one other study (Zwicker et al., PNAS 2014) has tried seriously to model the growth kinetics of PCM, the outermost part of a centrosome. The field has been dominated by genetics and cell biology approaches, so implementing a mathematical model will advance the field and generate hypotheses, even if the model is not yet fully fleshed out. This paper represents a significant advance.

      This study will be of broad interest to the centrosome field.

      Expertise: centrosome biogenesis, mitosis, biophysics

      Note: I am not sufficiently qualified to evaluate the mathematics underlying the model.

    1. Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Summary: The Non-Photochemical Quenching (NPQ) protects photosystems from energy overloading by excess light exposure. The NPQ consists of multiple factors which function in different time scales and energy levels. One of the factors, qH, has been proposed based on chlorophyll fluorescence lifetime observation and the plastid lipocalin has been identified as the important player to regulate qH. It remains to link the qH phenotype and molecular mechanisms. The authors purify photosynthetic protein complexes from the qH mutants and tried to build a biophysical model to link qH phenomena and protein science based on chlorophyll fluorescence lifetime observation.

      Response: Thank you for your constructive comments which we have addressed and complete the manuscript nicely.

      Major comments: There are two major issues. One issue is, even many kinetics are presented, but the relationship between these values and qH phenotypes is not clearly stated or connected. One idea is to build the mathematical model(s) to explain these kinetics. The other issue is, lipid composition is not considered. Indeed, this phenomenon is observed or emphasized in low temperatures. Generally thinking, lipid composition bound to photosynthetic complexes would be disturbed or modified its conformation.

      Response: ____We have not attempted to build a descriptive model of how the different molecular players in qH operate in the membrane to yield the observed fluorescence kinetics as this is beyond the scope of our study. However, we agree that lipid composition should be considered and we have now added two additional authors, text in the method and result sections and new Fig. 6 and Fig. S9 examining lipid composition of thylakoid extract, LHCII trimer and LHCII/Lhcb monomer fractions. No significant differences can be observed between the qH ON and OFF states in the main chloroplastic lipids.

      Minor comments: Some datasets are less biological replicates or not clearly stated about the biological replicate number (Figure 2, Figure 4, Figure 5, Figure S2, Figure S5, and Figure S8). Normally, at least three independent biological replicates are required. Technical replicates are not acceptable.

      Response: ____If by biological replicates, you mean three independent plant individuals, we agree that this would be the bare minimum required, and we apologize for the confusion. The definition of biological replicate (also referred to as biological experiment) in our study is each one represents a separate batch of several plant individuals pooled (n = 2 to 8) grown at independent times. Then within each biological experiment, we perform technical replicates (i.e. independent measurements of different aliquots from the same sample) which we believe are acceptable and necessary but we agree not sufficient. For most data, we have at least 2 biological experiments, and up to 3, for assessing the quenched nature of LHCII trimer and not LHCII/Lhcb monomer (Fig. 3). We have rephrased the text so this aspect is clearer and also provide more detail below about the aforementioned figures.

      Fig. 2: TCSPC on thylakoids, n=3 technical replicates from 2 independent biological experiments; Two separate thylakoid preparations were made from independently grown plants (leaves from n > or = 5 plants were pooled each time). Fig. 4: CN-PAGE, n=3 technical replicates from 2 independent biological experiments (leaves from n > or = 3 plants were pooled each time). Fig. 5: TCSPC on isolated complexes, n=3 technical replicates from 2 independent biological experiments; Two separate thylakoid preparations were made from independently grown plants (leaves from n = 8 plants were pooled each time). Fig. S2: step solubilization, n=2 technical replicates; here 1 biological experiment was used from n = 8 plants. Fig. S5 contains the biological replicates 1 (n=2 plants) and 2 (n=8 plants) of the representative experiment shown in Fig3, biological replicate 3 (n=8 plants). Fig. S8: HPLC on isolated complexes from 2 independent biological experiments (leaves from n = 8 plants were pooled each time).

      In Figure 3 and Figure S3, extend the length of the major tick for each axis. It is hard to distinguish between major tick and minor tick.

      Response: Ok, done.

      In Figure 3, mark the measured peak wavelength value on the top for readers.

      Response: ____Ok, done, added in the legend “with maxima at 679 nm for all samples”.

      In Figure 4, Why do not you present chlorophyll kinetics? I suspect it is possible to acquire if you used SpeedZen.

      Response: In Figure 4, we present a measurement of fluorescence emission from separated pigment-protein complexes by CN-PAGE, there are no light-induced changes to be measured here hence we do not present chlorophyll fluorescence kinetics.

      In Figure 6, decrease the thickness of the border for the bar graph or marker. Markers on the top of the bar graph are not visible.

      Response: Ok, done.

      Figures S3 and S6, provide the elution volume of protein standard in the chromatograph.

      Response: We don’t make any statement regarding the molecular weights of the protein complexes from the chromatograms, so elution volumes of protein standards are not required. Composition of the different peaks were validated by Iwai et al. 2015 (Nat Plants 1: 14008) and further verified here (Fig. S6, S10).

      Figures S11 and S12, describe the number of biological replicates.

      Response: ____Ok, done (now Fig. S12 and S13).

      Reviewer #1 (Significance (Required)): The topic is important for plant physiology especially photosynthesis regulation and biophysical characterization is straightforward to interpret molecular machinery. Other studies are only for chlorophyll observation for the whole plant body, but most importantly, this study is the challenging work on qH characterization with a biochemical approach.

      Response: Thank you for your appreciation of our work!


      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Authors observed qH in isolated LHCII trimers with Chl fluorescence changes (shorter), and concluded that no single major Lhcb isomers is necessary for qH.

      Response: Thank you for your constructive comments which we have now addressed and make the manuscript clearer.

      Major concern is: LHCII trimers are divided into S, M, L trimers with different compositions. Authors are requested to interpret their results in terms of L-, M-, S-trimers.

      Response: Our solubilization conditions and isolation method don’t allow to distinguish between loosely (L), moderately (M) or strongly (S)-bound trimers, the LHCII trimer fraction is a pool of these trimers. We have now specified this aspect in the discussion and cannot interpret our results further than narrowing down the LHCII trimer as a quenching site. In future work, we will attempt their separation although getting entirely pure fractions of each is technically challenging.

      Minor comments are: Authors describes qI as reversible NPQ, but qI with D1 damage is not reversible.

      Response: ____D1 can be repaired thereby relaxing qI, see recent article from Nawrocki et al. Sci Adv 2021. We have clarified this point in the introduction.

      In page 3 - 2nd paragraph, Authors define components of NPQ one by one, but the definition or recovery kinetics for qH is skipped, And authors suddenly start explaining molecular players of qH without changing paragraph.

      Response: We have now clarified that the relaxation kinetics for sustained NPQ including qH are slow (hours to days) and changed paragraph to introduce the molecular players known to be regulating qH.

      In Fig. S6, authors tried to confirm the trimer and monomer fractions they used by using Lhcb2 and Lhcb4 antibodies, respectively. But, the distribution of Lhcb2 only in Trimer fraction in WT, which is different from the distribution in other mutants. Contamination of Lhcb4 in Trimer fraction is also of concern. Authors may use BN-PAGE or Ultracentrifugal separation, rather than gel filtration.

      Response: Regarding the different distribution of Lhcb2 between WT and mutants, we have now better labeled Fig. S6B so it is clear that WT is non-treated (non-stress condition) and the mutants underwent a cold and high light-treatment (stress condition). This difference may thus be explained by the trimers stability/propensity to be solubilized by the detergent varying between non-stress and stress conditions. It is not a concern as we’re not comparing mutants to WT. Contamination by Lhcb4 in the trimer fraction is neither a concern as its amount is similarly low between the compared samples: soq1 mutant cold HL (qH ON) and soq1 lcnp mutant cold HL (qH OFF). So presence of Lhcb4 cannot account for the observed difference in fluorescence quenching as its quantity does not differ between the ON and OFF states. Importantly, the monomeric fraction, enriched in Lhcb4, does not show fluorescence quenching. We have used CN-PAGE as a complementary approach that showed that LHCII trimers are quenched after a cold and high light-treatment in both WT and soq1 mutants (Fig. 4). These aspects are described page 7 in the results section “qH is observed in isolated major LHCII”. Here we chose not to use BN/CN-PAGE or sucrose gradient ultracentrifugation for the isolation of the trimeric and monomeric fractions for two reasons: they would not be as suitable for TCSPC experiments due to their acrylamide or sucrose content and they would take more time; gel filtration was preferred to limit buffer exchange and time required from plant protein extraction to measurement.

      Reviewer #2 (Significance (Required)):

      Localization of qH in LHCII trimers is interesting, but not surprizing.

      So, authors are recommended to rewrite the significance of their findings.

      Response:____ In the last paragraph of the introduction, we have now clarified that this study identifies qH quenching in the LHCII trimers but not in the minor monomeric Lhcbs. Prior to this work, the peripheral antenna as a whole was known to be required for qH, now this study identified the major trimeric LHCII as a quenching site. The novelty and significance of this work is further substantified by the isolation of quenched antenna directly from plants in physiological conditions, as opposed to artificial induction in vitro. Regarding the “surprising” nature of findings in general, please see answer below to reviewer #3.

      My expertise: I am working on the movement of L and M trimers in plants under photoinhibitory illumination.


      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      The studies reported in this manuscript were designed to test the hypothesis that LCNP binds (or modifies) a molecule in the vicinity of (or within) the antenna proteins, under stress conditions. This in turn triggers a conformational change that converts antenna proteins from a light-harvesting to a dissipative state. Experiments were performed to locate the qH quenching site within the peripheral antenna of PSII and determine its sensitivity to Lhcb subunit composition. The authors were able to isolate antenna complexes with active qH that remained quenched after purification. Analysis of these complexes revealed that qH can occur in the major trimeric LHCII complexes. The elegant studies reported in this manuscript have made good use of appropriate molecular techniques and genetic resources. Genome editing and genetic crosses were used to demonstrate that qH is not restricted to inherent regulation of a specific major Lhcb subunit. The data are clearly presented and the data are convincing.

      Response: Thank you very much for your appreciation of our work!

      Reviewer #3 (Significance (Required)):

      The studies reported in this manuscript build on a firm foundation of previous work by these authors and others. The conclusions are based on the analysis of Chl fluorescence lifetimes in intact leaves, thylakoids, and isolated antenna complexes in which qH was "ON" or "OFF". The findings are interesting and incremental in terms of increasing current understanding. However, the data extend our knowledge of the location of qH within the peripheral antenna of PSII. Rather unsurprisingly, the authors highlight the need to preserve thylakoid membrane macroorganisation for a full qH response.

      Response: The philosophical concept of findings not being surprising could be discussed at length. To quote a commenter from this blog: ____https://blogs.uw.edu/ajko/2009/09/17/whats-surprising/____, just because one could have guessed the outcome of an experiment is not the same as empirically validating it. We hope you agree. Plus, as Fabrice Rappaport used to say, “we’re never sheltered from a discovery” and it could have been that isolated LHCII with qH ON showed short Chl fluorescence lifetimes similar to observed in leaves. We couldn’t know until we tried!

      Data are presented showing that while qH occurs in the trimeric LHCII complexes, it does not require a specific Lhcb subunit and is insensitive to Lhcb composition. However, the discussion is rather speculative because data interpretation is limited by an absence of knowledge regarding what happens to the LHC trimers and qH during isolation of thylakoids and photosynthetic complexes. This point is considered appropriately in the discussion. The authors also acknowledge the existence of additional quenching sites beyond the LHCII trimers that are required for qH.

      Response: Indeed, thank you we have addressed these points in the discussion, and have now added new data on the lack of changes in lipid composition in the LHCII trimer with qH ON or OFF. We view this study as an important milestone to obtain knowledge on the molecular origin of qH.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      The studies reported in this manuscript were designed to test the hypothesis that LCNP binds (or modifies) a molecule in the vicinity of (or within) the antenna proteins, under stress conditions. This in turn triggers a conformational change that converts antenna proteins from a light-harvesting to a dissipative state. Experiments were performed to locate the qH quenching site within the peripheral antenna of PSII and determine its sensitivity to Lhcb subunit composition. The authors were able to isolate antenna complexes with active qH that remained quenched after purification. Analysis of these complexes revealed that qH can occur in the major trimeric LHCII complexes. The elegant studies reported in this manuscript have made good use of appropriate molecular techniques and genetic resources. Genome editing and genetic crosses were used to demonstrate that qH is not restricted to inherent regulation of a specific major Lhcb subunit. The data are clearly presented and the data are convincing.

      Significance

      The studies reported in this manuscript build on a firm foundation of previous work by these authors and others. The conclusions are based on the analysis of Chl fluorescence lifetimes in intact leaves, thylakoids, and isolated antenna complexes in which qH was "ON" or "OFF". The findings are interesting and incremental in terms of increasing current understanding. However, the data extend our knowledge of the location of qH within the peripheral antenna of PSII. Rather unsurprisingly, the authors highlight the need to preserve thylakoid membrane macroorganisation for a full qH response.

      Data are presented showing that while qH occurs in the trimeric LHCII complexes, it does not require a specific Lhcb subunit and is insensitive to Lhcb composition. However, the discussion is rather speculative because data interpretation is limited by an absence of knowledge regarding what happens to the LHC trimers and qH during isolation of thylakoids and photosynthetic complexes. This point is considered appropriately in the discussion. The authors also acknowledge the existence of additional quenching sites beyond the LHCII trimers that are required for qH.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Authors observed qH in isolated LHCII trimers with Chl fluorescence changes (shorter), and concluded that no single major Lhcb isomers is necessary for qH.

      Major concern is: LHCII trimers are divided into S, M, L trimers with different compositions. Authors are requested to interpret their results in terms of L-, M-, S-trimers.

      Minor comments are: Authors describes qI as reversible NPQ, but qI with D1 damage is not reversible.

      In page 3 - 2nd paragraph, Authors define components of NPQ one by one, but the definition or revoery kinetics for qH is skipped, And authors suddenly start explaining molecular players of qH without changing paragraph.

      In Fig. S6, authors tried to confirm the trimer and monomer fractions they used by using Lhcb2 and Lhcb4 antibodies, respectively. But, the distribution of Lhcb2 only in Trimer fraction in WT, which is diferetnf from the distribution in other mutants. Contamination of Lhcb4 in Trimer fraction is also of concern. Authors may use Bn-PAGE or Ultracentrigugal separation, rather than gel filtration.

      provide evidences for

      Significance

      Localization of qH in LHCII trimers is interesting, but not surprizing.

      So, authors are recommended to rewrite the significance of their findings.

      My expertise: I am working on the movement of L and M trimers in plants under photoinhibitory illumination.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Summary:

      The Non-Photochemical Quenching (NPQ) protects photosystems from energy overloading by excess light exposure. The NPQ consists of multiple factors which function in different time scales and energy levels. One of the factors, qH, has been proposed based on chlorophyll fluorescence lifetime observation and the plastid lipocalin has been identified as the important player to regulate qH. It remains to link the qH phenotype and molecular mechanisms. The authors purify photosynthetic protein complexes from the qH mutants and tried to build a biophysical model to link qH phenomena and protein science based on chlorophyll fluorescence lifetime observation.

      Major comments:

      There are two major issues. One issue is, even many kinetics are presented, but the relationship between these values and qH phenotypes is not clearly stated or connected. One idea is to build the mathematical model(s) to explain these kinetics. The other issue is, lipid composition is not considered. Indeed, this phenomenon is observed or emphasized in low temperatures. Generally thinking, lipid composition bound to photosynthetic complexes would be disturbed or modified its conformation.

      Minor comments:

      Some datasets are less biological replicates or not clearly stated about the biological replicate number (Figure 2, Figure 4, Figure 5, Figure S2, Figure S5, and Figure S8). Normally, at least three independent biological replicates are required. Technical replicates are not acceptable.

      In Figure 3 and Figure S3, extend the length of the major tick for each axis. It is hard to distinguish between major tick and minor tick.

      In Figure 3, mark the measured peak wavelength value on the top for readers.

      In Figure 4, Why do not you present chlorophyll kinetics? I suspect it is possible to acquire if you used SpeedZen. In Figure 6, decrease the thickness of the border for the bar graph or marker. Markers on the top of the bar graph are not visible.

      Figures S3 and S6, provide the elution volume of protein standard in the chromatograph.

      Figures S11 and S12, describe the number of biological replicates.

      Significance

      The topic is important for plant physiology especially photosynthesis regulation and biophysical characterization is straightforward to interpret molecular machinery. Other studies are only for chlorophyll observation for the whole plant body, but most importantly, this study is the challenging work on qH characterization with a biochemical approach.